Unicode Newbie

Manuel Huesser sylphaleya at hta.fhz.ch
Tue Sep 16 06:46:56 EDT 2003


> > "\xfc\x12\x12\x12\x12\x12\x12" is an example for a 7
> > byte utf-8 string. But on encoding i get the following
> > error:
> >
> > UTF-8 decoding error: unsupported Unicode code range
> 
> Unicode supports ~2**20 code points (17*64k), not 2**31 characters.
> Your example is not a valid UTF-8 string.

Yep, Unicode supports fewer characters than UTF-8 can encode
(UCS range = 2 ** 31).

So is there no way to support the full range of the UCS
character set with Python?
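
For anyone who wants to check the limits discussed above, here is a
minimal sketch. It assumes a modern Python 3 interpreter, which is newer
than the Python this thread was written against, and the helper name
try_decode is made up for illustration. It derives the highest code
point from the "17*64k" figure quoted above, encodes it as UTF-8, and
then feeds the byte string from my example to the decoder.

```python
# Sketch only: assumes Python 3, where str holds Unicode code points.

# 17 planes of 64k code points each, as stated in the quoted reply.
MAX_CODE_POINT = 17 * 0x10000 - 1


def try_decode(data: bytes) -> str:
    """Return the decoded text, or the decoder's complaint prefixed with 'error: '."""
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError as exc:
        return "error: " + exc.reason


# The highest code point still fits in a 4-byte UTF-8 sequence.
highest = chr(MAX_CODE_POINT)
print(hex(MAX_CODE_POINT), highest.encode("utf-8"))

# The byte string from the quoted message is rejected by the decoder.
print(try_decode(b"\xfc\x12\x12\x12\x12\x12\x12"))

# One past the highest code point is refused outright.
try:
    chr(MAX_CODE_POINT + 1)
except ValueError as exc:
    print("error:", exc)
```

Under those assumptions, everything above the last of the 17 planes is
refused both by chr() and by the UTF-8 decoder.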

Manuel



