[issue35639] Lowecasing Unicode Characters

Erdem Uney report at bugs.python.org
Wed Jan 2 07:03:45 EST 2019


New submission from Erdem Uney <erdemuney at gmail.com>:

assert 'ŞİŞLİ'.lower() == 'şişli'

Lowercasing the capital İ (with a dot on - \u0130) adds a unicode character \u0307 after i and if there is a following character it adds that dot (\u0307) over that character. The behavior is different in Python 2.7.10 where it adds the dot on top of 'i'.

Accord to Unicode Specifications character \u0130 should be converted to character \u0069.

----------
messages: 332865
nosy: kingofsevens
priority: normal
severity: normal
status: open
title: Lowecasing Unicode Characters
type: behavior
versions: Python 3.6

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue35639>
_______________________________________


More information about the Python-bugs-list mailing list