Aproximative string matching

Fredrik Lundh fredrik at pythonware.com
Mon Nov 21 05:41:11 EST 2005


Tim Roberts wrote:

> >I'm searching for a library which makes aproximative string matching,
> >for example, searching in a dictionary the word "motorcycle", but
> >returns similar strings like "motorcicle".
> >
> >Is there such a library?
>
> There is an algorithm called Soundex that replaces each word by a
> 4-character string, such that all words that are pronounced similarly
> encode to the same string.
>
> The algorithm is easy to implement; you can probably find one by Googling.

Python used to ship with a soundex module, but it was removed
in 1.6, for various reasons.  here's a replacement:

    http://orca.mojam.com/~skip/python/soundex.py

</F>






More information about the Python-list mailing list