[Python-ideas] Processing surrogates in

Stephen J. Turnbull stephen at xemacs.org
Fri May 8 00:30:11 CEST 2015


Chris Barker writes:

 > I've read many of the rants about UTF-16, but in fact, it's really
 > not any worse than UTF-8

Yes, it is.  It's not ASCII compatible.  You can safely use the usual
libc string APIs on UTF-8 (except for any that might return only part
of a string), but not on UTF-16 (nulls).  This is a pretty big
advantage for UTF-8 in practice.






More information about the Python-ideas mailing list