Percentage matching of text
Eddie Corns
eddie at holyrood.ed.ac.uk
Fri Jul 30 12:38:30 EDT 2004
Bruce Eckel <BruceEckel at MailBlocks.com> writes:
>What I'd like to do is find an algorithm that produces the results of
>a text comparison as a percentage-match. Thus I would be able to
>assert that my test samples must match the control sample by at least
>(for example) 83% for the test to pass. Clearly, this wouldn't be a
>perfect test but it would help flag problems, which is primarily what
>I need.
How about using the edit distance? That might give you finer control,
e.g. the maximum allowed edit distance for a date could be X characters if
the times are expected to be close, or XX characters if they are completely
random.
Googling for "python string edit distance" came up with a few matches.
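
As a rough sketch of the idea (not taken from any of those matches), the
Levenshtein edit distance can be turned into a percentage by dividing by the
length of the longer string. The names edit_distance and percent_match are
made up for illustration, and normalising by the longer length is only one
possible choice, so treat this as a starting point rather than the answer:

```python
def edit_distance(a, b):
    """Levenshtein distance: the minimum number of single-character
    insertions, deletions and substitutions needed to turn a into b."""
    # previous[j] is the distance between the prefix of a processed so far
    # (initially empty) and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # delete ca
                               current[j - 1] + 1,       # insert cb
                               previous[j - 1] + cost))  # substitute or keep
        previous = current
    return previous[-1]


def percent_match(a, b):
    """Hypothetical helper: 100 for identical strings, 0 when every
    position differs. Assumes normalising by the longer string."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 100.0  # two empty strings match trivially
    return 100.0 * (1 - edit_distance(a, b) / longest)
```

A test could then assert percent_match(sample, control) >= 83, or compare
edit_distance directly against a per-field limit. Two timestamps that differ
only in the last digit of the seconds have an edit distance of 1.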
Eddie