[Spambayes] spamprob combining

Gary Robinson grobinson@transpose.com
Thu, 10 Oct 2002 08:46:01 -0400



>> If you want to try something like that, I would suggest using the
>> ARITHMETIC means in computing S and H and again using S(S+H).  That
>> would remove that bias.
> 
> That doesn't appear promising:
> 
> If
>  S = Smean = (sum p_i)/n
> 
> and
>  H = Hmean = (sum 1-p_i)/n
> 
> then Hmean = n/n - Smean = 1 - Smean, and Smean + Hmean = 1.  So whether you
> meant S*(S+H) or S/(S+H), the result is S.  To within roundoff error, that's
> what happens, too.


Ha ha ha! I should have thought of that! :)




Gary