unicode "table of character" implementation in python
"Martin v. Löwis"
martin at v.loewis.de
Sat Sep 9 12:19:53 EDT 2006
Tim Roberts wrote:
>> 0530..058F; Armenian
>> 0590..05FF; Hebrew
>> ...
>
> This is a fabulously useful list, Martin. Did you get this from a web
> page? Can you tell me where?
It's part of the Unicode Consortium's database (UCD, Unicode Character
Database). This specific table is called "code blocks":
http://www.unicode.org/Public/UNIDATA/Blocks.txt
Python currently does not have this table compiled in, but it should be
trivial to compile this into a pure-Python table (either as a
dictionary, or a list of triples).
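
As a rough illustration of the "list of triples" idea, here is a minimal
sketch. It assumes the "start..end; name" line format used by Blocks.txt,
with "#" starting a comment. The names parse_blocks and block_of are
hypothetical helpers invented for this example, not part of Python, and
the inline SAMPLE text stands in for the downloaded file.

```python
import bisect

# Small excerpt in the Blocks.txt format ("start..end; name").
# In practice the text would be read from a downloaded copy of the file.
SAMPLE = """\
# Blocks.txt (excerpt)
0000..007F; Basic Latin
0530..058F; Armenian
0590..05FF; Hebrew
"""


def parse_blocks(text):
    """Return a sorted list of (start, end, name) triples (hypothetical helper)."""
    blocks = []
    for line in text.splitlines():
        # Drop comments and surrounding whitespace; skip empty lines.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        range_part, name = line.split(";", 1)
        start, end = range_part.split("..")
        blocks.append((int(start, 16), int(end, 16), name.strip()))
    blocks.sort()
    return blocks


def block_of(char, blocks):
    """Return the block name for a single character, or None if not covered."""
    cp = ord(char)
    starts = [b[0] for b in blocks]
    # Find the last block whose start is <= cp, then check its end.
    i = bisect.bisect_right(starts, cp) - 1
    if i >= 0 and blocks[i][0] <= cp <= blocks[i][1]:
        return blocks[i][2]
    return None


blocks = parse_blocks(SAMPLE)
print(block_of("\u05d0", blocks))  # HEBREW LETTER ALEF
```

A sorted list with bisect keeps lookups cheap without expanding every
code point into a dictionary key; a dictionary keyed by block name
would be the natural choice for the reverse lookup (name to range).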
Regards,
Martin