[Spambayes] Re: For the bold

Rob Hooft rob@hooft.net
Sun, 06 Oct 2002 07:54:05 +0200


Tim Peters wrote:
[clt2]

> Nham= 7500
> RmsZham= 2.27249107964
> Nspam= 7500
> RmsZspam= 2.354280998

[clt3]
> Nham= 7500
> RmsZham= 9.77605846416
> Nspam= 7500
> RmsZspam= 10.1887670936

OOF! Under clt3 your rms values are 4x bigger! I have to look at the 
details of that: the assumption under which the rmspik.py code works is 
that the distributions of zham and zspam values are normally distributed
if all values are "mirrored" around 0. I'll have to test that assumption 
for clt1 and clt3!

Rob

-- 
Rob W.W. Hooft  ||  rob@hooft.net  ||  http://www.hooft.net/people/rob/