[Python-Dev] Python and the Unicode Character Database

Antoine Pitrou solipsis at pitrou.net
Thu Dec 2 17:56:00 CET 2010


Le jeudi 02 décembre 2010 à 11:41 -0500, Alexander Belopolsky a écrit :
> 
> Note that my point is not to find the correct answer here, but to
> demonstrate that we as a group don't have the expertise to get parsing
> of Arabic text right.

I don't understand why you think Arabic or Hebrew text is any different
from Western text. Surely right-to-left isn't more conceptually
complicated than left-to-right, is it?

The fact that mixed rtl + ltr can render bizarrely or is awkward to cut
and paste is quite off-topic for our discussion.

> If we've got it right for Arabic, it is by
> chance and not by design.  This still leaves us with 41 other types of
> digits for at least 30 different languages.

So why do you trust the Unicode standard on other things and not on this
one?

Regards

Antoine.




More information about the Python-Dev mailing list