Compile Python 3 interpreter to force 2-byte unicode

Terry Reedy tjreedy at udel.edu
Sat Nov 25 18:59:44 EST 2017


On 11/25/2017 5:12 PM, Chris Angelico wrote:
> On Sun, Nov 26, 2017 at 9:05 AM,  <wojtek.mula at gmail.com> wrote:
>> Hi, my goal is to obtain an interpreter that internally
>> uses UCS-2. Such a simple code should print 65535:
>>
>>    import sys
>>    print sys.maxunicode
>>
>> This is enabled in Windows, but I want the same in Linux.
>> What options have I pass to the configure script?

You must be trying to compile 2.7.  There may be Linux distributions 
that compile this way.  If you want to use, or ever encounter, non-BMP 
chars, using surrogate pairs is problematical.  By my reading of the 
official UCS-2 docs, Python's old 16-bit unicode implementation is not 
fully compliant.  Others have claimed that is it not a UCS-2 implementation.

> Why do you want to? What useful value do you have in creating this
> buggy interpreter?

> Ever since Python 3.3, that has simply not been an
> option. The bug has been solved.

If you want to seriously work with unicode, many recommend using modern 
Python.

-- 
Terry Jan Reedy




More information about the Python-list mailing list