Oh look, another language (ceylon)
wxjmfauth at gmail.com
wxjmfauth at gmail.com
Tue Nov 19 04:10:55 EST 2013
Le lundi 18 novembre 2013 14:31:33 UTC+1, Steven D'Aprano a écrit :
>
>
> ... choose one of the three bad choices: ...
>
>
>
> * choose UTF-16 or UTF-8, and have O(n) primitive string operations (like
>
> Haskell and, apparently, Ceylon);
>
>
>
> * or UTF-16 without support for the supplementary planes (which makes it
>
> virtually UCS-2), like Javascript;
>
>
>
> * choose UTF-32, and use two or four times as much memory as needed.
>
>
>
Nothing can beat the coding schemes endorsed by Unicode.
They are all working on the smallest possible entity
level (Unicode Transformation *Units*) with a unique
set of these entities.
To not forget. A set of characters is an artificial
construction and by nature it can not follow the
logic of a more "natural" set, eg. integers.
jmf
More information about the Python-list
mailing list