incorrect upper()/lower() of UTF-8

Fredrik Lundh fredrik at pythonware.com
Sat Jun 29 08:06:23 EDT 2002


Jim Henry wrote:

> It appears that python does not uppercase/lowercase
> utf-8 strings properly:

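It's not supposed to: the 8-bit string type can hold either
text strings or binary buffers, and it has no way of knowing
which one it holds.  The encode method takes a text string and
returns a binary buffer, so upper() and lower() on the result
work on individual bytes, not on the characters they encode.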

For reliable processing of Unicode data, use Unicode strings.
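
A minimal sketch of the difference, written for present-day Python 3
(an assumption; this thread predates it), where str is the Unicode
type and bytes is the 8-bit buffer type.  The sample word and the
variable names are illustrative only:

```python
# Python 3 sketch (assumed): str is Unicode text, bytes is a binary buffer.
text = "caf\u00e9"              # Unicode string, ends in U+00E9 (e-acute)
data = text.encode("utf-8")     # binary buffer: b'caf\xc3\xa9'

# Case-mapping the buffer only touches ASCII letters; the two bytes
# that encode U+00E9 pass through unchanged.
bytes_upper = data.upper()

# Case-mapping the Unicode string first, then encoding, handles the
# non-ASCII character as well.
text_upper = text.upper().encode("utf-8")

print(bytes_upper)   # b'CAF\xc3\xa9'
print(text_upper)    # b'CAF\xc3\x89'
```

The rule of thumb is to decode to Unicode on input, do the case
mapping there, and encode to UTF-8 only on output.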

</F>




