[issue5200] unicode.normalize gives wrong result for some characters

Martin v. Löwis report at bugs.python.org
Tue Feb 10 22:32:18 CET 2009


Martin v. Löwis <martin at v.loewis.de> added the comment:

> The same applies  "Å" and "A", "Ä" and "A" and "Ö" and "O"
> which also are also different letters as "Ø" and "O" are. 

Sure. And rightfully, they "Å" is *not* (I repeat: not)
normalized as "A", under NFD:

py> unicodedata.normalize("NFD", u"Å")
u'A\u030a'

> Maybe not in the unicode world but in treal life.

They are different letters also in the Unicode world.

> That's why I'm a little confused.

I think the confusion comes from your assumption that
normalizing "Å" produces "A". It does not. Really not.

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue5200>
_______________________________________


More information about the Python-bugs-list mailing list