Python 3 is killing Python

Michael Torrie torriem at gmail.com
Mon Jul 14 23:47:32 EDT 2014


On 05/31/2014 09:48 AM, jmf wrote:
> <falsehoods about python and unicode>

Absolutely FALSE.  Python 3.3 and up can handle any and all Unicode
characters you want to throw at it, without surprises such as what you
get in JavaScript.  Python 3 uses UTF-32 encoding under the hood, with a
compression optimization that removes leading zeros from the binary
representation of each character, so a string is stored with 1, 2, or 4
bytes per character depending on its widest character.
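
The storage scheme described above can be observed from Python itself.
This is only a rough sketch: it assumes CPython 3.3 or later (sys.getsizeof
reports an implementation detail, not a language guarantee), and
bytes_per_char is a helper name made up for this illustration.

```python
import sys

def bytes_per_char(ch, n=1000):
    """Estimate storage per character by comparing two string lengths.

    Subtracting the sizes cancels the fixed per-object header, leaving
    only the cost of the n extra characters. CPython-specific.
    """
    return (sys.getsizeof(ch * (2 * n)) - sys.getsizeof(ch * n)) // n

ascii_width = bytes_per_char("a")            # ASCII
latin1_width = bytes_per_char("\u00e9")      # e with acute accent, Latin-1 range
bmp_width = bytes_per_char("\u20ac")         # euro sign, needs 2 bytes
astral_width = bytes_per_char("\U0001F600")  # emoji outside the BMP, needs 4 bytes

# Unlike JavaScript, a character outside the BMP still counts as one.
astral_length = len("\U0001F600")

print(ascii_width, latin1_width, bmp_width, astral_width, astral_length)
```

The width is chosen per string, so a single wide character makes the
whole string use the wider storage.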

Windows command consoles are not Unicode compliant, and so running
Python programs in a command prompt console will often lead to exceptions.
Python must convert Unicode to the character set that the console is
using, and when it hits a character that cannot be encoded, Python
defaults to being correct and raises an exception (UnicodeEncodeError)
instead of failing silently.
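
The console behaviour can be reproduced without a Windows console by
encoding by hand.  A minimal sketch, assuming the console code page is
cp437 (a common default for US English command prompts; yours may
differ); encode_for_console is a hypothetical helper, not a Python API.

```python
def encode_for_console(text, encoding="cp437", errors="strict"):
    """Encode text the way it must be encoded before reaching a console.

    The cp437 default is an assumption made for this illustration.
    """
    return text.encode(encoding, errors)

sample = "snowman: \u2603"  # U+2603 has no representation in cp437

# Default (strict) handling: Python refuses rather than corrupt the output.
try:
    encode_for_console(sample)
    strict_failed = False
except UnicodeEncodeError as exc:
    strict_failed = True
    print("strict encoding failed:", exc.reason)

# Opting in to lossy output: the unencodable character becomes "?".
lossy = encode_for_console(sample, errors="replace")
print(lossy)
```

The point is that silent data loss is something you have to ask for
explicitly; the default is to tell you that the console cannot show
what you asked it to show.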



