[Python-3000] How will unicode get used?

Marcin 'Qrczak' Kowalczyk qrczak at knm.org.pl
Tue Sep 26 21:20:24 CEST 2006


"Martin v. Löwis" <martin at v.loewis.de> writes:

> It is somewhat tricky to implement a normalization algorithm in
> UTF-16, since you must combine surrogate pairs first in order to
> find out what the canonical decomposition of the code point is;
> but it's just more code, and no problem in principle.

The same issue arises with virtually any algorithm: UTF-16 needs
more code, and more complex code, than UTF-32.
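
A minimal sketch of the extra step, assuming the input is a plain
list of 16-bit code units (the name combine_surrogates is made up
for this illustration and is not taken from any existing
implementation). With UTF-32 this whole function is unnecessary,
because each unit already is a code point:

```python
def combine_surrogates(units):
    """Turn a sequence of UTF-16 code units into a list of code points.

    Hypothetical helper for illustration only. Unpaired surrogates
    are passed through unchanged rather than rejected.
    """
    code_points = []
    i = 0
    while i < len(units):
        unit = units[i]
        is_high = 0xD800 <= unit <= 0xDBFF
        has_low = i + 1 < len(units) and 0xDC00 <= units[i + 1] <= 0xDFFF
        if is_high and has_low:
            # A high surrogate followed by a low surrogate encodes one
            # code point above U+FFFF.
            low = units[i + 1]
            code_points.append(
                0x10000 + ((unit - 0xD800) << 10) + (low - 0xDC00)
            )
            i += 2
        else:
            # A BMP code unit (or a lone surrogate) stands for itself.
            code_points.append(unit)
            i += 1
    return code_points
```

Only after this step can a normalization routine look up the
decomposition of each code point, which is the "more code" part.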

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

