Micro Python -- a lean and efficient implementation of Python 3

Marko Rauhamaa marko at pacujo.net
Wed Jun 4 08:17:31 EDT 2014


Robin Becker <robin at reportlab.com>:

>>>> u'\xc5ngstr\xf6m'==u'\xc5ngstro\u0308m'
> False

Now *that* would be a valid reason for our resident Unicode expert to
complain! Py3 in no way solves text representation issues definitively.

> I know this is artificial

Not at all. It probably is out of scope for Python, but it is a real
cause for human suffering. What's Unicode for "résumé"?

Note, for example, that Google manages to sort out issues like these. It
sees past diacritics and even case ending.


Marko



More information about the Python-list mailing list