Encoding of surrogate code points to UTF-8

Neil Cerutti neilc at norwich.edu
Tue Oct 8 11:54:30 EDT 2013

Previous message (by thread): Encoding of surrogate code points to UTF-8
Next message (by thread): Encoding of surrogate code points to UTF-8
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On 2013-10-08, Neil Cerutti <neilc at norwich.edu> wrote:
> In any case, "\ud800\udc01" isn't a valid unicode string. In a
> perfect world it would automatically get converted to
> '\u00010001' without intervention.

This last paragraph is erroneous. I must have had a typo in my
testing.

-- 
Neil Cerutti

Previous message (by thread): Encoding of surrogate code points to UTF-8
Next message (by thread): Encoding of surrogate code points to UTF-8
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

More information about the Python-list mailing list