[spambayes-dev] Another piece of anecdotal evidence

Skip Montanaro skip at pobox.com
Wed Jan 14 14:13:21 EST 2004


    >> How do you plan to find those mistrained messages?

    Alex> As part of my nightly retrain, I'm going to make it score each
    Alex> message (with the fully trained DB) and sort them into 6
    Alex> directories for each month: {ham,spam}{positive,unsure,negative}.
    Alex> Flipping through the hampositive directory for each month should
    Alex> make it fairly easy to spot the problems...

I'm still confused.  You've got a spam mistrained as ham.  Are you
suggesting that you expect that scoring that message against your training
database (which includes features gleaned from that message) will reveal
that it is something other than ham?  I have a very small training database
(microscopic compared to yours) and I generally find it easier to just start
from scratch when I reach the conclusion that I have some errors in my
database.

Skip




More information about the spambayes-dev mailing list