Unicode program representation

Neil Hodgson neilh at hare.net.au
Mon Apr 3 07:46:24 EDT 2000


> >    I think this should be changed to interpreting the literal as a UTF-8
> > literal. The advantage here is that non-roman string literals become a
> > natural part of the language.
>
> however, perfectly valid ISO Latin 1 strings (or any other standard
> 8-bit encoding) are invalid UTF-8 strings.   changing the default to
> UTF-8 is likely to break many existing scripts.

   But relatively few scripts currently use the u"..." form, since it
has yet to appear in a release version of Python.
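
   To make the quoted objection concrete, here is a rough sketch of why
an 8-bit Latin 1 literal is not generally valid UTF-8. It is written in
present-day Python 3 syntax, which is an assumption on my part and not
the interpreter under discussion; the helper name is_valid_utf8 is
hypothetical and used only for illustration.

```python
# "café" encoded as ISO Latin 1: the é becomes the single byte 0xE9.
latin1_bytes = "café".encode("latin-1")


def is_valid_utf8(data: bytes) -> bool:
    """Hypothetical helper: True if data decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


# 0xE9 announces a three-byte UTF-8 sequence, but no continuation
# bytes follow it, so the Latin 1 bytes are rejected.
latin1_is_utf8 = is_valid_utf8(latin1_bytes)

# The same text encoded as UTF-8 is accepted.
utf8_is_utf8 = is_valid_utf8("café".encode("utf-8"))

# Pure ASCII is valid under both encodings, so ASCII-only literals
# would not be affected by a change of default.
ascii_is_utf8 = is_valid_utf8(b"cafe")
```

   Only literals containing bytes above 0x7F are at risk, which is why
the number of existing scripts that use them matters.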

   Neil





