[I18n-sig] How does Python Unicode treat surrogates?

Tim Peters tim@digicool.com
Mon, 25 Jun 2001 17:12:42 -0400


[Tom Emerson]
> I must admit that I wasn't aware of the "\U00020000" notation. I still
> think it should limit itself to 6 digits, not 8.

[Guido]
> Too late -- It's some kind of standard already (maybe borrowed
> from Java?).

We borrowed \U12345678 notation from the current ISO/ANSI C standard
("C99").  A space with 2**20 characters isn't going to last either -- and
unlike the Unicode folks, X3J11 didn't have any reason to indulge wishful
thinking on this point <wink>.