[issue3811] Update Unicode database to 5.1.0

Martin v. Löwis report at bugs.python.org
Thu Sep 11 08:05:23 CEST 2008


Martin v. Löwis <martin at v.loewis.de> added the comment:

> The control characters?

Indeed, also the private-use characters. test_unicode explicitly
comments that the test is about unassigned characters, although
I don't understand the purpose of that test (it then also tests
a surrogate character, which is also guaranteed to remain
unprintable).
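
As a rough illustration of the categories mentioned above, the following
sketch looks up one sample code point per class. It assumes a current
Python 3 interpreter (str.isprintable and lone surrogates in str literals
are Python 3 behaviour); the sample code points and the dictionary names
are chosen here for illustration and are not taken from test_unicode.

```python
import unicodedata

# One representative code point per class; these picks are illustrative.
samples = {
    "control": "\x00",        # U+0000
    "private-use": "\ue000",  # U+E000, first BMP private-use code point
    "surrogate": "\ud800",    # U+D800, first high surrogate
}

# General category as reported by the bundled Unicode database.
categories = {name: unicodedata.category(ch) for name, ch in samples.items()}

# str.isprintable() is False for every character in the "Other" categories.
printable = {name: ch.isprintable() for name, ch in samples.items()}

for name in samples:
    print(name, categories[name], printable[name])
```

None of these categories can change in a later Unicode version, which is
why such code points are safer choices for a "stays unprintable" test than
an arbitrary unassigned one.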

One of the characters that is guaranteed to remain unassigned is
U+FFFE (and its mirrors in other planes, e.g. U+1FFFE, ...).
This guarantee is made to support the BOM: U+FFFE is what U+FEFF
looks like when read with the wrong byte order, so it must never
be a valid character. Along with U+FFFF, these are non-characters.
#765036 once suggested that Python should refuse to represent them
at all, but that proposal was rejected.
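
A minimal sketch of both points, assuming a current Python 3 interpreter:
the helper is_noncharacter is a hypothetical name introduced here (it is
not part of unicodedata), and the U+FDD0..U+FDEF range it also accepts is
the remaining block of non-characters, which the comment above does not
mention.

```python
import unicodedata

def is_noncharacter(cp):
    """Hypothetical helper: True for the 66 Unicode non-characters."""
    # The last two code points of every plane (U+xFFFE, U+xFFFF),
    # plus the contiguous block U+FDD0..U+FDEF.
    return (cp & 0xFFFE) == 0xFFFE or 0xFDD0 <= cp <= 0xFDEF

# U+FFFE, U+FFFF and a "mirror" in plane 1 all report as unassigned (Cn).
noncharacter_categories = [
    unicodedata.category(chr(cp)) for cp in (0xFFFE, 0xFFFF, 0x1FFFE)
]

# The BOM U+FEFF serialised big-endian, then misread as little-endian,
# comes out as U+FFFE; that is how a reader detects swapped byte order.
bom_bytes = "\ufeff".encode("utf-16-be")
misread = bom_bytes.decode("utf-16-le")
```

Note that Python still represents these code points in a str, consistent
with the proposal to refuse them having been rejected.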

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue3811>
_______________________________________

