[Python-3000] How will unicode get used?

Michael Chermside mcherm at mcherm.com
Thu Sep 21 00:46:55 CEST 2006


I wrote:
>>> msg = u'The ancient greeks used the letter "\U00010143" for the number 5.'
>>> msg[35:-18]
u'"\U00010143"'
>>> greek_five = msg[36:-19]
>>> len(greek_five)
2


After posting, I realized that it's worse than that. I suspect that if
I tried this on a CPython compiled with wide characters, then
len(greek_five) would be 1.

What should it be? 2? 1? Implementation-dependent?

-- Michael Chermside



More information about the Python-3000 mailing list