[Spambayes] spamprob combining
Gary Robinson
grobinson@transpose.com
Thu, 10 Oct 2002 08:46:01 -0400
>> If you want to try something like that, I would suggest using the
>> ARITHMETIC means in computing S and H and again using S(S+H). That
>> would remove that bias.
>
> That doesn't appear promising:
>
> If
> S = Smean = (sum p_i)/n
>
> and
> H = Hmean = (sum 1-p_i)/n
>
> then Hmean = n/n - Smean = 1 - Smean, and Smean + Hmean = 1. So whether you
> meant S*(S+H) or S/(S+H), the result is S. To within roundoff error, that's
> what happens, too.
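
The identity quoted above is easy to check numerically. This is only a minimal sketch: the function name and the probability values are made up for illustration and are not taken from the Spambayes code.

```python
def combine_arithmetic(probs):
    """Arithmetic-mean S and H, plus both readings of the combining rule."""
    n = len(probs)
    s = sum(probs) / n                      # Smean = (sum p_i)/n
    h = sum(1.0 - p for p in probs) / n     # Hmean = (sum 1-p_i)/n
    ratio = s / (s + h)                     # the S/(S+H) reading
    product = s * (s + h)                   # the S*(S+H) reading
    return s, h, ratio, product


# Hypothetical per-word spam probabilities, chosen arbitrarily.
probs = [0.99, 0.2, 0.85, 0.5, 0.01]
s, h, ratio, product = combine_arithmetic(probs)
print("S =", s, "H =", h, "S+H =", s + h)
print("S/(S+H) =", ratio, "S*(S+H) =", product)
```

Since S + H comes out as 1 up to roundoff, both readings collapse to S itself, so the combined score is just the plain average of the p_i.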
Ha ha ha! I should have thought of that! :)

Gary