[spambayes-dev] Another incremental training idea...
Skip Montanaro
skip at pobox.com
Tue Jan 13 18:21:32 EST 2004
>> For some reason, my ham/spam ratio is getting out-of-whack faster
>> that it seemed to in the past.
Kenny> This is just an unsubstantiated guess based on my experience with
Kenny> my own e-mail mix. I get ham scores near 0.00 a lot more than I
Kenny> get spam scores near 1.00. Maybe the non-edge training is
Kenny> discarding a higher percentage of hams than it is spams. I
Kenny> suppose you could correct for that by setting different edge
Kenny> thresholds, but maybe you've already done that?
No doubt. I made a change to my procmailrc file to not save spams scoring >
0.97 for training. We'll see how it goes.
This of course jives pretty well with many peoples' observation (and my
experience) that most unsures are actually spam. I think I need to adjust
some thresholds to try and reduce the number of spams which get trained on.
Skip
More information about the spambayes-dev
mailing list