[issue12892] UTF-16 and UTF-32 codecs should reject (lone) surrogates

Antoine Pitrou report at bugs.python.org
Tue Oct 8 11:03:35 CEST 2013


Antoine Pitrou added the comment:

> On 08.10.2013 10:46, Antoine Pitrou wrote:
> > 
> > utf-16 isn't that widely used, so it's probably fine if it becomes
> > a bit slower.
> 
> It's the default encoding for Unicode text files and APIs on Windows,
> so I'd say it *is* widely used :-)

I've never seen any UTF-16 text files. Do you have other data?

APIs are irrelevant. You only pass very small strings to then (e.g.
file paths).

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue12892>
_______________________________________


More information about the Python-bugs-list mailing list