Encoding of surrogate code points to UTF-8

wxjmfauth at gmail.com wxjmfauth at gmail.com
Wed Oct 9 04:22:54 EDT 2013


Le mercredi 9 octobre 2013 08:20:05 UTC+2, Steven D'Aprano a écrit :
> 
> 
> > http://www.unicode.org/versions/Unicode6.2.0/ch02.pdf#G13708 "All three
> 
> > encoding forms can be used to represent the full range of encoded
> 
> > characters in the Unicode Standard; ... Each of the three Unicode
> 
> > encoding forms can be efficiently transformed into eith er of the other
> 
> > two without any loss of data."
> 
> 

Yes, 

and what Unicode.org does not say is that these coding
schemes (like any coding scheme) should be used in an
exclusive way.

Probably, because it is too obvious to understand.

jmf





More information about the Python-list mailing list