[issue17252] Latin Capital Letter I with Dot Above

Firat Ozgul report at bugs.python.org
Wed Feb 20 12:31:59 CET 2013


Firat Ozgul added the comment:

In Python, things like lowercasing-uppercasing and sorting were always problematic with regard to Turkish language. For instance, whatever the locale is, you cannot lowercase the word 'KADIN' (woman) in Turkish correctly::

    >>> "KADIN".lower()

    'kadin'

... which is wrong. That should be 'kadın' ('kad\u0131n'). Likewise 'kitap' (book)::

    >>> "kitap".upper()

    'KITAP'

... which is wrong. That should be 'KİTAP' ('K\u0130TAP').

As for this thread, in 3.3, Python does a completely different thing::

    >>> "KİTAP".lower()

    'ki\u0307tap' #wrong

In Python 3.2, this was::

    >>> "KİTAP".lower()

    'kitap' #correct

'i' and 'i\u0307' are not the same. 

Turkish Python programmers define their own upper(), lower(), title(), swapcase() and casefold() methods and use their own sorting techniques.

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue17252>
_______________________________________


More information about the Python-bugs-list mailing list