Encoding sniffer?
Neil Hodgson
nyamatongwe+thunder at gmail.com
Fri Jan 6 18:25:41 EST 2006
Ralf Muschall:
> Diez B. Roggisch wrote:
>>AFAIK iso-8859-1 has all codepoints taken - so you won't go beyond that
>>in your example.
> IIRC the range 128-159 (i.e. control codes with the high bit set)
> are unused.
ISO 8859-1 and ISO-8859-1 (extra hyphen) differ in that ISO-8859-1
includes the control codes in 128-159 (as well as the low control codes)
as defined by ISO 6429. ISO 6429 is not freely available online but the
equivalent ECMA standard ECMA 48 is:
http://www.ecma-international.org/publications/files/ECMA-ST/Ecma-048.pdf
Neil
More information about the Python-list
mailing list