editing in Unicode

Bertilo Wennergren bertilow at hem.passagen.se
Thu Sep 7 11:19:02 EDT 2000


"Roland Mas":

> Bertilo Wennergren (2000-09-07 12:49:10 +0200) :

> > What about using this:

> > msg = u'@#&'

>   Hmm.  Not sure, since you would have to explicitly state somewhere
> that the "argument" to the u'' construct has to be considered as
> encoded in UTF-8.

Yes, that would be nice. Is there a way to state that somewhere?

>  Why UTF-8 and not -16, or Latin-1 or something else?

Well, Latin-1 wouldn't do if I want to enter lots of characters that 
are not present in Latin 1. UTF-16 would do great, but would make the
code harder to deal with in non-Unicode editors. UTF-8 is better
since it is backwards compatible with ASCII.

-- 
#####################################################################
                         Bertilo Wennergren
                 <http://purl.oclc.org/net/bertilo>
                     <bertilow at hem.passagen.se>
#####################################################################





More information about the Python-list mailing list