[spambayes-dev] Another incremental training idea...

Toby Dickenson tdickenson at geminidataloggers.com
Thu Jan 15 09:52:00 EST 2004


On Thursday 15 January 2004 13:50, Skip Montanaro wrote:
>     Toby> If Im reading this right, my 7:1 imbalance doesnt hurt me.
>
>     Toby> filename:    unbal    bal1    bal2    bal3
>     Toby> ham:spam:  14560:1992      1992:1992
>     Toby>                    1992:1992       1992:1992
>     Toby> fp total:        0       0       1       0
>     Toby> fp %:         0.00    0.00    0.05    0.00
>     Toby> fn total:       12       6       8       6
>     Toby> fn %:         0.60    0.30    0.40    0.30
>     Toby> unsure t:      102      21      23      29
>     Toby> unsure %:     0.62    0.53    0.58    0.73

> It doesn't seem to have a negative effect on false positives, but it looks
> like you will get roughly double the number of false negatives and 4-5x as
> many unsures.

4x as many unsures, out of a total population that is 4x larger. so no overall 
percentage change. Am I reading that right?

-- 
Toby Dickenson




More information about the spambayes-dev mailing list