[Python-Dev] PEP 383 update: utf8b is now the error handler

Lino Mastrodomenico l.mastrodomenico at gmail.com
Sun May 3 15:29:27 CEST 2009


2009/5/3 "Martin v. Löwis" <martin at v.loewis.de>:
> With issue 3672 resolved, it is now unnecessary to introduce
> an utf-8b codec, since the utf-8 codec will properly report errors
> for all byte sequences invalid in UTF-8, including lone surrogates.
> Therefore, utf-8b can be implemented solely through the error handler.

That's even nicer. One minor detail though, in the sentence:

    "non-decodable bytes >128 will be represented as lone half surrogate"

">" should be ">=".

-- 
Lino Mastrodomenico


More information about the Python-Dev mailing list