[issue12736] Request for python casemapping functions to use full not simple casemaps per Unicode's recommendation

Guido van Rossum report at bugs.python.org
Fri Aug 26 23:11:24 CEST 2011


Guido van Rossum <guido at python.org> added the comment:

I presume this applies to built-in str methods like .lower(), right?  I think it would be a good thing to do for Python 3.3.
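
To make the simple-versus-full distinction concrete, here is a small sketch of what full case mappings look like through the str methods. It assumes an interpreter where full case mapping is in effect (CPython 3.3 or later); the helper name full_case_examples is made up for this illustration. A simple casemap is always one code point to one code point, so none of these results could be produced by it.

```python
def full_case_examples():
    """Return a few case conversions whose result is longer than the input.

    Assumes full (not simple) case mappings, as in CPython 3.3 and later.
    """
    return {
        # U+00DF LATIN SMALL LETTER SHARP S uppercases to two letters.
        "sharp_s_upper": "\u00df".upper(),
        # U+FB01 LATIN SMALL LIGATURE FI uppercases to two letters.
        "fi_ligature_upper": "\ufb01".upper(),
        # U+0130 LATIN CAPITAL LETTER I WITH DOT ABOVE lowercases to
        # "i" followed by U+0307 COMBINING DOT ABOVE.
        "dotted_capital_i_lower": "\u0130".lower(),
    }


if __name__ == "__main__":
    for name, value in full_case_examples().items():
        print(name, ascii(value), len(value))
```

One consequence worth keeping in mind: len(s.upper()) is no longer guaranteed to equal len(s).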

We'd need to define what should happen in edge cases, e.g. when (against all odds) a string happens to contain a lone surrogate, or some other code point or sequence of code points that the Unicode standard considers illegal.  I think the methods should not fail but should just leave those code points alone.
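
As a rough illustration of the "leave those code points alone" policy, the following sketch checks that a lone surrogate and a noncharacter pass through case conversion unchanged while the surrounding letters are still converted. It assumes CPython 3.3 or later; the helper name convert_around_odd_code_points and the sample string are made up for this example.

```python
def convert_around_odd_code_points():
    """Case-convert a string holding a lone surrogate and a noncharacter.

    U+D800 is a lone high surrogate and U+FFFF is a noncharacter; neither
    has a case mapping, so both are expected to be copied through as-is.
    """
    sample = "a\ud800B\uffffc"
    return sample.upper(), sample.lower()


if __name__ == "__main__":
    upper, lower = convert_around_odd_code_points()
    # ascii() is used because a lone surrogate cannot be encoded for output.
    print(ascii(upper))
    print(ascii(lower))
```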

Does this require us to import more data files from the Unicode standard?  By itself that doesn't scare me.

Would this also affect .islower() and friends?
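
One way to explore that question is to check how the predicates relate to the conversions once a conversion can change the length of a string. This is only a sketch, assuming CPython 3.3 or later; the helper name predicate_checks is made up for this example.

```python
def predicate_checks():
    """Compare .islower()/.isupper() with the results of full case mapping."""
    sharp_s = "\u00df"          # LATIN SMALL LETTER SHARP S
    upper = sharp_s.upper()     # two code points under full case mapping
    return {
        "sharp_s_islower": sharp_s.islower(),
        "upper_isupper": upper.isupper(),
        # The round trip is lossy: lowering the uppercase form does not
        # give back the original single code point.
        "round_trip": upper.lower(),
        "round_trip_matches": upper.lower() == sharp_s,
    }


if __name__ == "__main__":
    for name, value in predicate_checks().items():
        print(name, ascii(value))
```

The predicates themselves still answer a per-character question here; what changes is that s.upper().lower() can no longer be assumed to return s even when s.islower() is true.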

----------
nosy: +gvanrossum

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue12736>
_______________________________________

