Encoding of surrogate code points to UTF-8

Ned Batchelder ned at nedbatchelder.com
Wed Oct 9 06:22:20 EDT 2013


On 10/9/13 4:22 AM, wxjmfauth at gmail.com wrote:
> Le mercredi 9 octobre 2013 08:20:05 UTC+2, Steven D'Aprano a écrit :
>>
>>> http://www.unicode.org/versions/Unicode6.2.0/ch02.pdf#G13708 "All three
>>> encoding forms can be used to represent the full range of encoded
>>> characters in the Unicode Standard; ... Each of the three Unicode
>>> encoding forms can be efficiently transformed into eith er of the other
>>> two without any loss of data."
>>
> Yes,
>
> and what Unicode.org does not say is that these coding
> schemes (like any coding scheme) should be used in an
> exclusive way.

Can you clarify what you mean by "in an exclusive way"?

--Ned.

> Probably, because it is too obvious to understand.
>
> jmf
>
>




More information about the Python-list mailing list