[issue12736] Request for python casemapping functions to use full not simple casemaps per Unicode's recommendation

Fri Aug 26 23:11:24 CEST 2011

Guido van Rossum <guido at python.org> added the comment:

I presume this applies to builtin str methods like .lower(), right?  I think it is a good thing to do for Python 3.3.

We'd need to define what should happen in edge cases, e.g. when (against all odds) a string happens to contain a lone surrogate or some other code point or sequence of code points that the Unicode standard considers illegal.  I think it should not fail but just leave those code points alone.

Does this require us to import more data files from the Unicode standard?  By itself that doesn't scare me.

Would this also affect .islower() and friends?

----------
nosy: +gvanrossum

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue12736>
_______________________________________