[Python-Dev] unicode/string asymmetries

Martin v. Loewis martin@v.loewis.de
Thu, 10 Jan 2002 20:44:26 +0100


> >    unicode("some string", "unicode-escape")
[...]
> For example the copyright symbol "©" (repr("©") gives "\xa9").
> Now I want to convert this string to unicode.
> u"©" works fine, unicode(variable) gives an ASCII decoding error.

As I said: unicode-escape is exactly the encoding that is used to
parse Unicode string literals from source files. It interprets all
bytes above 127 (that is, 0x80 through 0xFF) as Latin-1.
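
To illustrate, here is a minimal sketch of that behaviour. It assumes a
current Python 3 interpreter, where the 8-bit string from the quoted
message is a bytes object and decoding is spelled bytes.decode() rather
than unicode(s, "unicode-escape"); the variable names are only for
illustration.

```python
# Sketch for Python 3: the 8-bit string of the thread becomes a bytes object.
raw = b"\xa9"  # the single byte 0xA9, i.e. the copyright sign in Latin-1

# Bytes outside the ASCII range are mapped one-to-one to code points,
# which is the same result a Latin-1 decode would give.
as_text = raw.decode("unicode-escape")

# Backslash escapes are interpreted the way a Unicode literal would be.
from_escape = b"\\u00a9".decode("unicode-escape")

# The plain ASCII codec rejects the same byte.
try:
    raw.decode("ascii")
    ascii_failed = False
except UnicodeDecodeError:
    ascii_failed = True

print(as_text == from_escape == "\u00a9", ascii_failed)
```

Both decodes should yield the single character U+00A9, while the ASCII
decode fails, which matches the error reported in the quoted message.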

Regards,
Martin