Unicode surrogate pairs (Python 3.4)

Jon Ribbens jon+usenet at unequivocal.co.uk
Sun May 3 15:20:04 EDT 2015


On 2015-05-03, MRAB <python at mrabarnett.plus.com> wrote:
> There's also a mistake in this bit:
>
> """
> # Note that according to the \uXXXX escaping convention, a supplemental 
> character (> 0x10FFFF) is represented
> # by a sequence of two surrogate characters: the first between D800 and 
> DBFF, and the second between DC00 and DFFF.
> """

Do you mean that it should say "(> 0xFFFF)" ? Far be it from me to
correct the Unicode Consortium on the subject of Unicode ;-)



More information about the Python-list mailing list