[issue26917] unicodedata.normalize(): bug in Hangul Composition
Armin Rigo
report at bugs.python.org
Tue May 3 06:29:22 EDT 2016
Armin Rigo added the comment:
See also https://bitbucket.org/pypy/pypy/issues/2289/incorrect-unicode-normalization. It seems that you reached the same conclusion as the OP in that issue: the real problem is that normalizing "\uafb8\u11a7" should not drop the second character. Both Python and PyPy drop it, but Python adds the "is_normalized()" check, so in some cases it returns the correct, unmodified result.
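
To make the expected behaviour concrete, here is a minimal sketch of the Hangul composition step, written from the constants in the Unicode Standard rather than taken from the CPython or PyPy sources. The function name compose_hangul is hypothetical, and the sketch handles only Hangul composition, not the rest of normalization. U+AFB8 is an LV syllable and U+11A7 is the trailing-consonant base itself (index 0, meaning "no trailing consonant"), so the pair must be left alone. A bounds check that accepts index 0 would compute LV + 0 and silently drop the second character, which is one plausible way to end up with the result described above.

```python
# Constants from the Unicode Standard's Hangul syllable composition algorithm.
S_BASE, L_BASE, V_BASE, T_BASE = 0xAC00, 0x1100, 0x1161, 0x11A7
L_COUNT, V_COUNT, T_COUNT = 19, 21, 28
S_COUNT = L_COUNT * V_COUNT * T_COUNT  # 11172 precomposed syllables


def compose_hangul(text):
    """Compose L+V and LV+T jamo sequences (hypothetical helper, sketch only)."""
    if not text:
        return text
    out = [text[0]]
    for ch in text[1:]:
        last = ord(out[-1])
        cp = ord(ch)

        # Case 1: leading consonant followed by a vowel gives an LV syllable.
        l_index = last - L_BASE
        v_index = cp - V_BASE
        if 0 <= l_index < L_COUNT and 0 <= v_index < V_COUNT:
            out[-1] = chr(S_BASE + (l_index * V_COUNT + v_index) * T_COUNT)
            continue

        # Case 2: LV syllable followed by a trailing consonant gives LVT.
        # The lower bound is strict: index 0 (U+11A7) is not a real trailing
        # consonant, so "\uafb8\u11a7" must stay as two characters.
        s_index = last - S_BASE
        t_index = cp - T_BASE
        if (0 <= s_index < S_COUNT and s_index % T_COUNT == 0
                and 0 < t_index < T_COUNT):
            out[-1] = chr(last + t_index)
            continue

        out.append(ch)
    return "".join(out)


# The sequence from this issue is left unchanged; a genuine trailing
# consonant (U+11A8) does compose with the same LV syllable.
unchanged = compose_hangul("\uafb8\u11a7")
composed = compose_hangul("\uafb8\u11a8")
```

With the strict lower bound, unchanged is still two characters long, while composed is the single syllable U+AFB9.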
----------
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue26917>
_______________________________________