Multibyte Character Surport for Python

Stephen J. Turnbull stephen at xemacs.org
Sat May 11 08:31:19 EDT 2002


>>>>> "Martin" == Martin v Löwis <loewis at informatik.hu-berlin.de> writes:

    >> 1. In Python 3.0, the input character set is unicode - either
    >> UTF-16 or UTF-8 (I'm not prepared to make a solid arguement one
    >> way or the other at this time.)

    Martin> Actually, PEP 263 gives a much wider choice; consider this
    Martin> aspect solved.

Some of us consider the wider choice to be a severe defect of PEP 263.

That doesn't mean we think that Python should prohibit writing
programs in arbitrary user-specified encodings.  Only that the
facility for transforming a non-Unicode program into Unicode should be
provided as a standard library facility, rather than part of the
language.  The lexical properties of the language would be specified
in terms of Unicode.


-- 
Institute of Policy and Planning Sciences     http://turnbull.sk.tsukuba.ac.jp
University of Tsukuba                    Tennodai 1-1-1 Tsukuba 305-8573 JAPAN
 My nostalgia for Icon makes me forget about any of the bad things.  I don't
have much nostalgia for Perl, so its faults I remember.  Scott Gilbert c.l.py



More information about the Python-list mailing list