Extend unicodedata with a name/pattern/regex search for character entity references?

Veek. M vek.m1234 at gmail.com
Sat Sep 3 07:55:23 EDT 2016


https://mail.python.org/pipermail//python-ideas/2014-October/029630.htm

Wanted to know if the above link idea, had been implemented and if 
there's a module that accepts a pattern like 'cap' and give you all the 
instances of unicode 'CAP' characters.
 ⋂ \bigcap
 ⊓ \sqcap
 ∩ \cap
 ♑ \capricornus
 ⪸ \succapprox
 ⪷ \precapprox

(above's from tex)

I found two useful modules in this regard: unicode_tex, unicodedata
but unicodedata is a builtin which does not do globs, regexs - so it's 
kind of limiting in nature.

Would be nice if you could search html/xml character entity references 
as well.



More information about the Python-list mailing list