Looking for UNICODE to ASCII Conversioni Example Code

Roy Smith roy at panix.com
Sat Oct 19 11:10:37 EDT 2013


In article <mailman.1260.1382192092.18130.python-list at python.org>,
 Zero Piraeus <z at etiol.net> wrote:

> For example, a miscreant may create the username 'míguel' in order to
> pose as another user 'miguel', relying on other users inattentiveness.
> Asciifying is one way of reducing the risk of that.

Determining if two strings are "almost the same" is not easy.  If míguel 
and miguel are to be considered the same, then why not also consider 
michael to be the same?  Or, for that matter, mike, mikey, or mick?  
There's no easy answer, and what's the right answer for some 
applications will be the wrong answer for others.

A reasonable place to start exploring this topic is 
https://en.wikipedia.org/wiki/String_metric.



More information about the Python-list mailing list