[Spambayes] Upgrading from 1.0a2
Richie Hindle
richie at entrian.com
Fri Dec 5 08:39:16 EST 2003
Gary,
> I note that many of the changelog entries are for tokeniser
> improvements. Would I have to do a retrain to get these improvements
> into my database?
In one way yes, because your current database is the result of running the
emails through the 1.0a2 tokeniser. So say you had an email containing
"via<hide>gra" (which the token now understands, but didn't used to) then
you'll have a "via" and a "gra" token instead of one "viagra" token. But
in another way no, because new emails will go through the new tokeniser.
Since you probably have a decent spam score for "viagra" already, any new
"via<hide>gra" email will get a hit for "viagra".
If you're getting good results, I wouldn't worry about retraining.
--
Richie Hindle
richie at entrian.com
More information about the Spambayes
mailing list