[Python-Dev] PEP 383 update: utf8b is now the error handler
Antoine Pitrou
solipsis at pitrou.net
Thu May 7 01:27:00 CEST 2009
Martin v. Löwis <martin <at> v.loewis.de> writes:
> py> b'\xed\xa0\x80'.decode("utf-8","surrogates")
> '\ud800'
The point is, "surrogates" does not mean anything intuitive for an /error
handler/. You seem to be the only one who finds this name explicit enough,
perhaps because you chose it.
Most other handlers' names have verbs in them ("ignore", "replace",
"xmlcharrefreplace", etc.).
Regards
Antoine.
More information about the Python-Dev
mailing list