[Spambayes] Upgrading from 1.0a2

Richie Hindle richie at entrian.com
Fri Dec 5 08:39:16 EST 2003


Gary,

> I note that many of the changelog entries are for tokeniser
> improvements.  Would I have to do a retrain to get these improvements
> into my database?

In one way yes, because your current database is the result of running the
emails through the 1.0a2 tokeniser.  So say you had trained on an email
containing "via<hide>gra" (which the tokeniser now understands, but didn't
before): your database will have a "via" and a "gra" token instead of one
"viagra" token.  But in another way no, because new emails will go through
the new tokeniser.  Since you probably have a decent spam score for "viagra"
already, any new "via<hide>gra" email will get a hit for "viagra".
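
Here's a rough sketch of the difference.  This is not the real SpamBayes
tokeniser: the two functions, the regex and the numbers in the database are
all made up for illustration, and I'm assuming the old behaviour amounts to
treating a tag as a word break.

```python
import re

# Hypothetical stand-ins for the old and new tokenisers; the real
# SpamBayes tokeniser does far more than this.
TAG = re.compile(r"<[^>]*>")

def old_tokenise(text):
    # Treats an HTML tag as a word break, so a tag in mid-word splits it.
    return TAG.sub(" ", text).lower().split()

def new_tokenise(text):
    # Deletes tags outright, so the two halves of the word are rejoined.
    return TAG.sub("", text).lower().split()

# Toy database of token -> spam probability (invented numbers), as if it
# had been trained with the old tokeniser.
database = {"viagra": 0.99, "via": 0.60, "gra": 0.70}

message = "Buy via<hide>gra today"
print(old_tokenise(message))   # what training under 1.0a2 would have stored
print(new_tokenise(message))   # what scoring does after the upgrade

# Tokens from the new tokeniser that already have a score in the old database.
hits = [(tok, database[tok]) for tok in new_tokenise(message) if tok in database]
print(hits)
```

The new tokeniser produces "viagra", and the old database already has a
score for it from plain "viagra" spam, so the obfuscated message still gets
caught without a retrain.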

If you're getting good results, I wouldn't worry about retraining.

-- 
Richie Hindle
richie at entrian.com



