Python 3 is killing Python

Chris Angelico rosuav at gmail.com
Tue Jul 15 00:20:36 EDT 2014


On Tue, Jul 15, 2014 at 1:47 PM, Michael Torrie <torriem at gmail.com> wrote:
> Python 3 uses UTF-4 encoding under the hood, with a
> compression optimization that removes leading zeros from binary
> representation of each character.

Sorry to nitpick, but in the interests of terminological accuracy I
have to point out that it's UTF-32 or UCS-4, not UTF-4 :)

But otherwise, yes, quite correct. And a system that few, but not no,
other languages use; I do wonder if other languages have considered
switching to this kind of system, but avoided it lest jmf start
haunting them too...

ChrisA



More information about the Python-list mailing list