unicode filenames

Neil Hodgson nhodgson at bigpond.net.au
Mon Feb 3 04:32:58 EST 2003


Alex Martelli:

> Similar considerations apply for any other multibyte encoding
> (such as, UTF-8) that is NOT specifically and carefully
> designed to avoid ever needing a byte of value 47 (0x2F) in
> order to represent ANY character except a slash.  I am not
> aware of any such multi-byte encoding -- there may be some,
> but, even if one can be found, using it would still fall WELL
> short of "any other encoding whatsoever" as you claimed.

   UTF-8 is a superset of ASCII. A slash has the same representation in
UTF-8 as ASCII. No multi-byte UTF-8 character may contain a byte < 128.

   Neil






More information about the Python-list mailing list