Pure python implementation of string-like class

Xavier Morel xavier.morel at masklinn.net
Sat Feb 25 15:06:10 EST 2006


Ross Ridge wrote:
> Steve Holden wrote:
>> "Wider than UTF-16" doesn't make sense.
> 
> It makes perfect sense.
> 
>                                                               Ross Ridge
> 

Not if you're still within the Unicode / Universal Character Set code 
space. While UCS-4 technically goes beyond any Unicode Transformation 
Format (UTF-7, 8, 16 and 32 all stop at 10FFFF), it also goes beyond the 
range of the UCS itself (0-10FFFF). UTF-32 is simply UCS-4 restricted to 
the range of the Unicode standard.
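
To make the range concrete, here is a rough sketch (assuming a current 
Python 3 interpreter, which is not what this thread was written against). 
The constant and helper names are mine, purely for illustration, and the 
31-bit figure for the original UCS-4 space is from memory of ISO 10646:

```python
import struct

UNICODE_MAX = 0x10FFFF          # last code point of the Unicode / UCS code space
UCS4_ORIGINAL_MAX = 0x7FFFFFFF  # assumed: 31-bit space of the original UCS-4

def in_unicode_range(cp):
    """Hypothetical helper: True if cp is inside 0-10FFFF."""
    return 0 <= cp <= UNICODE_MAX

# The top code point can be encoded by each of the UTFs.
top = chr(UNICODE_MAX)
encoded = {name: top.encode(name)
           for name in ("utf-8", "utf-16-be", "utf-32-be")}

# A 4-byte unit can physically hold 0x110000, but that is no longer UTF-32.
beyond = struct.pack(">I", UNICODE_MAX + 1)
try:
    beyond.decode("utf-32-be")
    beyond_is_valid_utf32 = True
except UnicodeDecodeError:
    beyond_is_valid_utf32 = False
```

So a 32-bit unit can be "wider" than 10FFFF as storage, but anything above 
that is outside the code space and the codec refuses it.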

While it could be argued that the Unicode/UCS limit of 10FFFF was chosen 
_because_ of the limitations of UTF-16, that's probably irrelevant to the 
discussion.
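
For what it's worth, the arithmetic behind that argument is short. This is 
only a sketch in Python 3; the names are made up for illustration, and the 
surrogate ranges are the standard D800-DBFF and DC00-DFFF:

```python
HIGH_SURROGATES = range(0xD800, 0xDC00)  # 1024 lead values
LOW_SURROGATES = range(0xDC00, 0xE000)   # 1024 trail values
BMP_SIZE = 0x10000                       # code points reachable with one 16-bit unit

# Every (high, low) pair addresses one code point above the BMP.
supplementary = len(HIGH_SURROGATES) * len(LOW_SURROGATES)
utf16_max = BMP_SIZE + supplementary - 1

def to_surrogates(cp):
    """Hypothetical helper: split a code point above FFFF into a surrogate pair."""
    offset = cp - BMP_SIZE
    return 0xD800 + (offset >> 10), 0xDC00 + (offset & 0x3FF)
```

Here utf16_max comes out as 10FFFF, and to_surrogates(0x10FFFF) gives the 
last possible pair, (DBFF, DFFF).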


