Oh look, another language (ceylon)

wxjmfauth at gmail.com wxjmfauth at gmail.com
Tue Nov 19 04:10:55 EST 2013


On Monday, 18 November 2013 14:31:33 UTC+1, Steven D'Aprano wrote:
> ...   choose one of the three bad choices:  ...
>
> * choose UTF-16 or UTF-8, and have O(n) primitive string operations (like
>   Haskell and, apparently, Ceylon);
>
> * or UTF-16 without support for the supplementary planes (which makes it
>   virtually UCS-2), like Javascript;
>
> * choose UTF-32, and use two or four times as much memory as needed.

Nothing can beat the coding schemes endorsed by Unicode.

They all work at the level of the smallest possible
entity (the code *units* of a Unicode Transformation
Format), each with a unique set of these entities.
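
To make the notion of code units concrete, here is a rough sketch in
Python 3 that counts how many code units the same text needs in each
encoding form. The helper name code_units and the sample strings are
made up for illustration; only the standard str.encode codecs are used.

```python
def code_units(text):
    """Count the code units needed for text in each encoding form.

    Hypothetical helper for illustration only. The "-le" codec variants
    are used so that no byte order mark is added to the output.
    """
    return {
        "utf-8": len(text.encode("utf-8")),            # 8-bit code units
        "utf-16": len(text.encode("utf-16-le")) // 2,  # 16-bit code units
        "utf-32": len(text.encode("utf-32-le")) // 4,  # 32-bit code units
    }


# Samples: ASCII, Latin-1 range, rest of the BMP, supplementary plane.
for sample in ("abc", "\u00e9t\u00e9", "\u20ac", "\U0001F600"):
    # ascii() keeps the output printable on any terminal encoding.
    print(ascii(sample), len(sample), code_units(sample))
```

In UTF-8 and UTF-16 the number of code units per character varies, which
is why locating the n-th character means scanning from the start. In
UTF-32 it is always one unit of four bytes, which is where the extra
memory goes.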

Keep in mind: a set of characters is an artificial
construction and by nature it cannot follow the
logic of a more "natural" set, e.g. the integers.

jmf




More information about the Python-list mailing list