[issue17252] Latin Capital Letter I with Dot Above
Firat Ozgul
report at bugs.python.org
Wed Feb 20 12:31:59 CET 2013
Firat Ozgul added the comment:
In Python, operations such as case conversion and sorting have always been problematic for the Turkish language. For instance, whatever the locale is, you cannot correctly lowercase the Turkish word 'KADIN' (woman)::
>>> "KADIN".lower()
'kadin'
... which is wrong. That should be 'kadın' ('kad\u0131n'). Likewise 'kitap' (book)::
>>> "kitap".upper()
'KITAP'
... which is wrong. That should be 'KİTAP' ('K\u0130TAP').
As for the subject of this issue, Python 3.3 does something different again::
>>> "KİTAP".lower()
'ki\u0307tap' #wrong
In Python 3.2, this was::
>>> "KİTAP".lower()
'kitap' #correct
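'i' and 'i\u0307' are not the same: the second is 'i' followed by U+0307 (COMBINING DOT ABOVE), so it is two code points rather than one.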
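To make the difference visible, here is a small check, assuming Python 3.3 or later, where lowercasing U+0130 follows the full Unicode case mapping. The variable names are only for illustration.

```python
import unicodedata

# U+0130 is LATIN CAPITAL LETTER I WITH DOT ABOVE.
lowered = "\u0130".lower()

# On Python 3.3+ the result is two code points, not a plain 'i'.
for ch in lowered:
    print("U+%04X %s" % (ord(ch), unicodedata.name(ch)))

# A direct comparison with the single letter 'i' therefore fails.
print(lowered == "i")
```

This is why code that compares or looks up the lowercased string can break after an upgrade from 3.2.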
As a result, Turkish Python programmers define their own upper(), lower(), title(), swapcase() and casefold() functions and use their own sorting techniques.
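A minimal sketch of what such helpers tend to look like follows. The names tr_lower() and tr_upper() are hypothetical, not part of Python or any library, and the approach (remapping the four Turkish i letters before calling the built-in methods) is just one common workaround. It assumes the input does not already contain decomposed sequences such as 'i\u0307', and it does not cover sorting.

```python
def tr_lower(text):
    """Lowercase text using Turkish rules (hypothetical helper)."""
    # Map dotted capital I (U+0130) to 'i' and plain 'I' to dotless
    # i (U+0131) first, then let str.lower() handle everything else.
    return text.replace("\u0130", "i").replace("I", "\u0131").lower()


def tr_upper(text):
    """Uppercase text using Turkish rules (hypothetical helper)."""
    # Map 'i' to dotted capital I (U+0130) and dotless i (U+0131) to
    # plain 'I' first, then let str.upper() handle everything else.
    return text.replace("i", "\u0130").replace("\u0131", "I").upper()


print(tr_lower("KADIN"))        # expected: kadın
print(tr_lower("K\u0130TAP"))   # expected: kitap
print(tr_upper("kitap"))        # expected: KİTAP
```

The replacements run before the built-in call so that str.lower() never sees U+0130 and cannot produce the 'i\u0307' sequence.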
----------
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue17252>
_______________________________________