Encoding sniffer?

Neil Hodgson nyamatongwe+thunder at gmail.com
Fri Jan 6 18:25:41 EST 2006


    Ralf Muschall:

> Diez B. Roggisch wrote:
>>AFAIK iso-8859-1 has all codepoints taken - so you won't go beyond that
>>in your example.
> IIRC the range 128-159 (i.e. control codes with the high bit set)
> are unused.

    ISO 8859-1 and ISO-8859-1 (extra hyphen) differ: the hyphenated
ISO-8859-1 includes the control codes in 128-159 (as well as the low
control codes) as defined by ISO 6429, while ISO 8859-1 leaves those
positions unassigned. ISO 6429 is not freely available online, but the
equivalent ECMA standard, ECMA-48, is:
http://www.ecma-international.org/publications/files/ECMA-ST/Ecma-048.pdf
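
    A quick way to see this from Python is sketched below. It assumes
Python 3 and its built-in "iso-8859-1" codec; the variable names are only
for illustration. It decodes bytes 128-159 and checks that each one maps
to the control character with the same code point, rather than failing as
it would if the range were unassigned.

```python
import unicodedata

# Bytes 0x80-0x9F (128-159): the range discussed above.
high_controls = bytes(range(0x80, 0xA0))

# The "iso-8859-1" codec maps each byte to the code point of the same value,
# so decoding this range succeeds instead of raising UnicodeDecodeError.
text = high_controls.decode("iso-8859-1")
code_points = [ord(ch) for ch in text]

# Collect the Unicode general category of each decoded character.
# "Cc" is the category for control characters.
categories = {unicodedata.category(ch) for ch in text}
print(categories)

# Every one of the 256 byte values round-trips through the codec.
all_bytes = bytes(range(256))
round_trip = all_bytes.decode("iso-8859-1").encode("iso-8859-1")
```

    Because every byte value decodes successfully, a sniffer cannot rule
out iso-8859-1 on decode errors alone; the presence of characters in
128-159 is at best a hint that some other encoding was intended.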

    Neil



More information about the Python-list mailing list