How to find number of characters in a unicode string?

Theerasak Photha hanumizzle at gmail.com
Wed Oct 11 02:06:07 EDT 2006


On 10 Oct 2006 22:50:21 -0700, Leo Kislov <Leo.Kislov at gmail.com> wrote:

> If python ever provide this functionality it would be I guess
> u"C\u0327".width() == 1. But it's not clear when unicode.org will
> provide recommended fixed font character width information for *all*
> characters. I recently stumbled upon Tamil language, where for example
> u'\u0b95\u0bcd', u'\u0b95\u0bbe', u'\u0b95\u0bca', u'\u0b95\u0bcc'
> looks like they have width 1,2,3 and 4 columns. To add insult to injury
> these 4 symbols are all considered *single* letter symbols :) If your
> email reader is able to show them, here they are in all their glory:
> க், கா, கொ, கௌ.

Letters? Not as such. They are, however, single syllabic units; Tamil,
like other Indic scripts, is an alphasyllabary.

I believe the syllables or sounds thus encoded are k (with nothing
after), kaa, ko, and kau.

Seamonkey is being a jerk and not rendering the glyphs properly... :?

-- Theerasak


More information about the Python-list mailing list