Flexible string representation, unicode, typography, ...

Roy Smith roy at panix.com
Thu Aug 30 07:02:24 EDT 2012


In article <503f0e45$0$9416$c3e8da3$76491128 at news.astraweb.com>,
 Steven D'Aprano <steve+comp.lang.python at pearwood.info> wrote:

> The only thing which is innovative here is that instead of the Python 
> compiler declaring that "all strings will be stored in UCS-2", the 
> compiler chooses an implementation for each string as needed. So some 
> strings will be stored internally as UCS-4, some as UCS-2, and some as 
> ASCII (which is a standard, but not the Unicode consortium's standard).

Is the implementation smart enough to know that x == y is always False 
if x and y are using different internal representations?



More information about the Python-list mailing list