[Python-Dev] PEP 383: Non-decodable Bytes in System Character Interfaces

"Martin v. Löwis" martin at v.loewis.de
Thu Apr 30 21:10:37 CEST 2009


MRAB wrote:
> One further question: should the encoder accept a string like
> u'\xDCC2\xDC80'? That would encode to b'\xC2\x80'

Indeed so.

> which, when decoded, would give u'\x80'.

Assuming the encoding is UTF-8, yes.

> Does the PEP only guarantee that strings decoded
> from the filesystem are reversible, but not check what might be de novo
> strings?

Exactly so.

Regards,
Martin


More information about the Python-Dev mailing list