[Spambayes] Any ideas about this one?

Tim Peters tim_one at email.msn.com
Thu Mar 27 23:27:34 EST 2003


[T. Alexander Popiel]
> It probably was in the midrange zone to be ignored (.4 to .6 by
> default).

[Skip Montanaro]
> The default is 0.5 (meaning show everything):
> ...
>     clue_mailheader_cutoff: 0.5

I expect Alex had this mind:

# When scoring a message, ignore all words with
# abs(word.spamprob - 0.5) < minimum_prob_strength.
# This may be a hack, but it has proved to reduce error rates in many
# tests.  0.1 appeared to work well across all corpora.
minimum_prob_strength: 0.1

abs(p-0.5) < 0.1  is-same-as  0.4 < p < 0.6; Classifier._getclues() doesn't
return any word with a spamprob in that range.




More information about the Spambayes mailing list