[Spambayes] FYI: Java implementation

Richard Jowsey richard at jowsey.com
Mon Jan 20 06:38:26 EST 2003


> Upgrade to Python and you would have finished a couple months ago
> <wink>.

Yeah, that thought had occurred to me too... <grin>
 
> [chi-combining] This gives it some nice
> properties for automated decision making (the cutoff points for
> gary-combining were too touchy, across test sets, and across
> time).  But if you like a mode where you simply sort msgs by
> score, you can stop with gary-combining and be happy.

I have a very large training corpus, so I'm seeing well-
separated distributions of good versus spam probs, with a 
sprinkling of "unsures" scattered through the middle. An 
uncertain cutoff at 3 sigma from the means should work, but this 
notion needs some testing. That chi2 test is definitely on the 
drawing boards, even if only for comparison purposes...

Death To Spam!

Cheers,
Richard




More information about the Spambayes mailing list