Is there any way to decode String using unknown codec?

MRAB python at mrabarnett.plus.com
Thu Jun 28 07:28:06 EDT 2012


On 28/06/2012 02:14, howmuchistoday at gmail.com wrote:
> Hi
> I'm a Korean and when I use modules like sys, os, &c,
> sometimes the interpreter show me broken strings like
> '\x13\xb3\x12\xc8'.
> It mustbe the Korean "alphabet" but I can't decode it to the rightway.
> I tried to decode it using codecs like cp949,mbcs,utf-8
> but It failed.
> The only way I found is eval('\x13\xb3\x12\xc8').
> It raises an Error with showing right Korean.
> Is there any way to deal it being not broken?
>
It might be UTF-16:

 >>> b'\x13\xb3\x12\xc8'.decode("utf16")
'댓젒'

I don't know Korean, but that looks reasonable!



More information about the Python-list mailing list