[issue12892] UTF-16 and UTF-32 codecs should reject (lone) surrogates

Antoine Pitrou report at bugs.python.org
Tue Oct 8 12:30:56 CEST 2013


Antoine Pitrou added the comment:

> UTF-8, UTF-16 and UTF-32 codecs need to be as fast as possible
> in Python to not create performance problems when converting
> between platform Unicode data and the internal formats
> used in Python.

"As fast as possible" is a platonic dream.
They only need to be fast enough not to be bottlenecks.
If you know of a *Python* workload where UTF-16 decoding is the
bottleneck, I'd like to know about it :-)
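
One rough way to check whether decoding is anywhere near a bottleneck on a given machine is to time it directly. The following is only a minimal sketch: the helper name `time_decode`, the sample text, and the repeat count are illustrative assumptions, not anything taken from the tracker discussion.

```python
import timeit


def time_decode(encoding, text, number=200):
    """Return the average seconds per call for decoding `text` from `encoding`.

    Hypothetical helper for illustration; it encodes once up front so that
    only the decode step is measured.
    """
    data = text.encode(encoding)
    total = timeit.timeit(lambda: data.decode(encoding), number=number)
    return total / number


if __name__ == "__main__":
    # Assumed sample: mostly-ASCII text with a few non-ASCII characters.
    sample = "h\u00e9llo w\u00f6rld \u20ac " * 10_000
    for enc in ("utf-8", "utf-16", "utf-32"):
        per_call = time_decode(enc, sample)
        print(f"{enc}: {per_call * 1e6:.1f} us per decode")
```

The absolute numbers depend on the interpreter version, the build, and the input, so they are only meaningful when compared against the time the rest of the workload takes.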

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue12892>
_______________________________________

