unicode "table of character" implementation in python

"Martin v. Löwis" martin at v.loewis.de
Sat Sep 9 12:19:53 EDT 2006


Tim Roberts schrieb:
>> 0530..058F; Armenian
>> 0590..05FF; Hebrew
>> ...
> 
> This is a fabulously useful list, Martin.  Did you get this from a web
> page?  Can you tell me where?

It's part of the Unicode Consortium's database (UCD, Unicode Character
Database). This specific table is called "code blocks":

http://www.unicode.org/Public/UNIDATA/Blocks.txt

Python currently has this table not compiled in, but it should be
trivial to compile this into a pure-Python table (either as a
dictionary, or a list of triples).

Regards,
Martin



More information about the Python-list mailing list