Python's handling of unicode surrogates
Ross Ridge
rridge at caffeine.csclub.uwaterloo.ca
Fri Apr 20 19:49:09 EDT 2007
Rhamphoryncus <rhamph at gmail.com> wrote:
>The only code that will be changed is that which doesn't handle
>surrogates properly. Some will start working properly. Some (ie
>random.choice(u'\U00100000\uFFFF')) will fail explicitly (rather than
>silently).
You're falsely assuming that any code that doesn't support surrogates
is broken. Supporting surrogates is no more required than supporting
combining characters, right-to-left languages or lower case letters.
Ross Ridge
--
l/ // Ross Ridge -- The Great HTMU
[oo][oo] rridge at csclub.uwaterloo.ca
-()-/()/ http://www.csclub.uwaterloo.ca/~rridge/
db //
More information about the Python-list
mailing list