[Python-Dev] Can't decode \x876F character encoded in Shift JIS charset ?

Amaury Forgeot d'Arc amauryfa at gmail.com
Thu Feb 7 18:12:35 CET 2008


Hello,

Nicolas Dumazet :
> >>> unicode('\x87\x6F', "shift jis")
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
> UnicodeDecodeError: 'shift_jis' codec can't decode bytes in position 0-1:
> illegal multibyte sequence
>
> Still, \x87\x6F is a valid Shift-JIS character :
> http://demo.icu-project.org/icu-bin/convexp?conv=ibm-943_P15A-2003&b=87&s=MIME#layout,
> it is "©L"...

It is possible that the encoding is actually "shift jis 2004" or
"cp932", which are both extensions to the original shift jis.
Please continue this discussion on comp.lang.python; or fill a bug request.

Cheers quand m¨ºme,

-- 
Amaury Forgeot d'Arc


More information about the Python-Dev mailing list