Fuzzy string comparison

Gabriel Genellina gagsl-py at yahoo.com.ar
Tue Dec 26 16:43:37 EST 2006


At Tuesday 26/12/2006 18:08, John Machin wrote:

>Wojciech Mula wrote:
> > Steve Bergman wrote:
> > > I'm looking for a module to do fuzzy comparison of strings. [...]
> >
> > Check module difflib, it returns difference between two sequences.
>
>and it's intended for comparing text files, and is relatively slow.
>
>Google "python levenshtein". You'll probably find this a better fit for
>typoed keys in a database.

Other alternatives: trigram, n-gram, Jaro's distance. There are some 
Python implem. available.


-- 
Gabriel Genellina
Softlab SRL 


	

	
		
__________________________________________________ 
Preguntá. Respondé. Descubrí. 
Todo lo que querías saber, y lo que ni imaginabas, 
está en Yahoo! Respuestas (Beta). 
¡Probalo ya! 
http://www.yahoo.com.ar/respuestas 




More information about the Python-list mailing list