Python Unicode handling wins again -- mostly

Fri Nov 29 21:12:34 EST 2013

On Sat, Nov 30, 2013 at 1:08 PM, Roy Smith <roy at panix.com> wrote:
> I would certainly expect, x.lower() == x.upper().lower(), to be True for
> all values of x over the set of valid unicode codepoints.  Having
> u"\uFB04".upper() ==> "FFL" breaks that.  I would also expect len(x) ==
> len(x.upper()) to be True.

That's a nice theory, but the Unicode consortium disagrees with you on
both points.

ChrisA