problem with unicode

Fri Apr 25 16:42:03 EDT 2008

John Machin wrote:
> On Apr 25, 10:01 pm, Bjoern Schliessmann <usenet-

>> >>> media="x???[?"
>> >>> print repr(media.decode("utf-8"))
>>
>> u'x\u30ef\u30e6\u30ed[\u30e8'

(dang, KNode doesn't autodetect encodings ...)

> But that_unicode_string.encode("utf-8") produces
> 'x\xe3\x83\xaf\xe3\x83\xa6\xe3\x83\xad[\xe3\x83\xa8'
> which does not contain the complained-about byte 0x9c in position
> 1 (or any other position) -- how can that be?

Probably the OP used a different encoding. That seems even more
likely given that his postings use a Japanese encoding (although
that encoding doesn't produce any 0x9c for this string, either).
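
For anyone who wants to check this, here is a minimal sketch. It is
written in Python 3 syntax, whereas the session quoted above is
Python 2. The list of candidate encodings and the helper name
encodings_with_byte are assumptions made for illustration; the thread
does not say which encoding the OP actually used.

```python
# Python 3 sketch; the session quoted in the thread is Python 2.
# The string is the one shown in the quoted repr() output.
text = "x\u30ef\u30e6\u30ed[\u30e8"

# Assumption: a few common Japanese encodings plus UTF-8.
# The thread does not name the OP's encoding.
CANDIDATES = ["utf-8", "shift_jis", "euc_jp", "iso2022_jp"]


def encodings_with_byte(s, byte, encodings):
    """Return the encodings whose encoded form of s contains the byte value."""
    hits = []
    for name in encodings:
        try:
            data = s.encode(name)
        except UnicodeEncodeError:
            # This encoding cannot represent the string at all; skip it.
            continue
        if byte in data:  # membership test on bytes takes an int in Python 3
            hits.append(name)
    return hits


utf8_bytes = text.encode("utf-8")
print(utf8_bytes)
print(encodings_with_byte(text, 0x9C, CANDIDATES))
```

If the second line printed is an empty list, none of the candidates
explains a 0x9c byte, and the OP's data must have come from somewhere
else.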

Regards,

Björn

-- 
BOFH excuse #346:

Your/our computer(s) had suffered a memory leak, and we are waiting
for them to be topped up.