[issue3297] Python interpreter uses Unicode surrogate pairs only before the pyc is created

Adam Olsen report at bugs.python.org
Sat Jul 12 02:56:43 CEST 2008


Adam Olsen <rhamph at gmail.com> added the comment:

No, the configure options are wrong - we do use UTF-16 and UTF-32. 
Although modern UCS-4 has been restricted down to the range of UTF-32
(it used to be larger!), UCS-2 still doesn't support the supplementary
planes (ie no surrogates.)

If it really was UCS-2, the repr wouldn't be u'\U00010123' on windows. 
It'd be a pair of ill-formed code units instead.

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue3297>
_______________________________________


More information about the Python-bugs-list mailing list