Incorrect title case?

MRAB google at mrabarnett.plus.com
Fri Jan 16 17:07:03 EST 2009


Python 2.6.1

I've just found that the following 4 Unicode characters/codepoints don't
behave as I'd expect: Dž (U+01C5), Lj (U+01C8), Nj (U+01CB), Dz (U+01F2).

For example, u"\u01C5".istitle() returns True and
unicodedata.category(u"\u01C5") returns "Lt", but u"\u01C5".title()
returns u'\u01C4', which is the uppercase equivalent. Are these mistakes
in the Unicode database?




More information about the Python-list mailing list