Micro Python -- a lean and efficient implementation of Python 3

Robin Becker robin at reportlab.com
Wed Jun 4 08:31:06 EDT 2014


On 04/06/2014 13:17, Marko Rauhamaa wrote:
.........
>
> Note, for example, that Google manages to sort out issues like these. It
> sees past diacritics and even case ending.
.....
I guess they must normalize all inputs to some standard form and then search /
eigenvectorize on those. There are quite a few diacritics and a fair few glyphs
they could be applied to, so I don't think it likely they could map all possible
combinations to a private-use range.
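
As a rough illustration of the kind of normalization I mean (only a sketch; I
have no idea what Google actually does, and search_key is a name I made up),
the stdlib unicodedata module can decompose each character, drop the combining
marks and fold case:

```python
import unicodedata

def search_key(text):
    """Fold text to a diacritic-free, caseless key (hypothetical helper)."""
    # NFKD splits precomposed characters into base character + combining marks.
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop the combining marks, keeping only the base characters.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # casefold() is a more aggressive lower() intended for caseless matching.
    return stripped.casefold()

# Precomposed and decomposed spellings end up with the same key.
print(search_key("café") == search_key("CAFE\u0301"))
```

This only handles marks that Unicode treats as combining characters; letters
such as "ø" have no decomposition and would need their own mapping.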
-- 
Robin Becker



