unicodedata name for \u000a

"Martin v. Löwis" martin at v.loewis.de
Sun Aug 22 05:52:26 EDT 2004


Ken Beesley wrote:
> OK. I see that for 000A there is not now an official Unicode name in 
> 4.0, and that "LINE FEED (LF)" is an alias. Such an alias, shown in 
> uppercase letters, indicates that it _was_ the name of the character in 
> The Unicode Standard, Version 1.0. See The Unicode Standard 4.0, p. 415 
> ("Aliases"). This seems odd. One intuitively assumes that any defined 
> Unicode character has a Unicode name.

Indeed, this intuition is wrong. Other Unicode characters that don't
have names are:
- surrogates (U+D800..U+DFFF); it is debatable whether these are
   characters, though
- private use characters (U+E000..U+F8FF, U+F0000..U+FFFFD,
   U+10000..U+10FFFD).

Regards,
Martin



More information about the Python-list mailing list