Comparing 2 similar strings?

Ed Morton morton at lsupcaemnt.com
Wed May 18 16:06:53 EDT 2005



William Park wrote:

> How do you compare 2 strings, and determine how much they are "close" to
> each other?  Eg.
>     aqwerty
>     qwertyb
> are similar to each other, except for first/last char.  But, how do I
> quantify that?
> 
> I guess you can say for the above 2 strings that
>     - at max, 6 chars out of 7 are same sequence --> 85% max
> 
> But, for
>     qawerty
>     qwerbty
> max correlation is
>     - 3 chars out of 7 are the same sequence --> 42% max
> 
> (Crossposted to 3 of my favourite newsgroup.)
>

"However you like" is probably the right answer, but one way might be to 
compare their soundex encoding 
(http://foldoc.doc.ic.ac.uk/foldoc/foldoc.cgi?soundex) and figure out 
percentage difference based on comparing the numeric part.

	Ed.



More information about the Python-list mailing list