Unicode Newbie

Martin v. Löwis martin at v.loewis.de
Tue Sep 16 15:40:28 EDT 2003


Manuel Huesser <sylphaleya at hta.fhz.ch> writes:

> Yep Unicode supports less characters than there are possible with
> utf-8 (ucs range = 2 ** 31).
> 
> so there is no possibilty to support the full range of the ucs
> character set with python?

The ucs range (for UCS-4) is *not* 2**31; it is 17*2**16. It was 2**32
in ISO/IEC 10646:1993 (I believe), but it got constrained in 10646:2000.

It is certainly possible to represent 2**32 different values in a
Python Unicode character - but you will have to change the Python
interpreter source code for that.

Regards,
Martin




More information about the Python-list mailing list