[Spambayes] Re: For the bold
Rob Hooft
rob@hooft.net
Sun, 06 Oct 2002 07:54:05 +0200
Tim Peters wrote:
[clt2]
> Nham= 7500
> RmsZham= 2.27249107964
> Nspam= 7500
> RmsZspam= 2.354280998
[clt3]
> Nham= 7500
> RmsZham= 9.77605846416
> Nspam= 7500
> RmsZspam= 10.1887670936
OOF! Under clt3 your rms values are about 4.3x bigger! I have to look at
the details of that: the rmspik.py code works under the assumption that
the zham and zspam values are normally distributed if all values are
"mirrored" around 0. I'll have to test that assumption for clt1 and clt3!
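
A rough sketch of how that assumption might be checked. This is not the rmspik.py code: the helper names (rms, mirrored, fraction_within, normal_fraction_within) are made up for illustration, and the zham values are synthetic. The idea is to mirror the values around 0 and compare the fraction lying within k times the rms against what a zero-mean normal distribution would give.

```python
import math
import random


def rms(values):
    """Root mean square of a non-empty sequence of numbers."""
    values = list(values)
    return math.sqrt(sum(v * v for v in values) / len(values))


def mirrored(values):
    """Return the values plus their negations: a sample symmetric around 0."""
    values = list(values)
    return values + [-v for v in values]


def fraction_within(values, k):
    """Fraction of values whose magnitude is at most k times the sample rms."""
    values = list(values)
    limit = k * rms(values)
    return sum(1 for v in values if abs(v) <= limit) / len(values)


def normal_fraction_within(k):
    """Fraction of a zero-mean normal distribution within k standard deviations."""
    return math.erf(k / math.sqrt(2.0))


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed; the data below is synthetic
    # Hypothetical stand-in for the zham values: magnitudes of normal
    # deviates with sigma 2.27. Real values would come from a test run.
    zham = [abs(rng.gauss(0.0, 2.27)) for _ in range(7500)]
    sample = mirrored(zham)
    print("RmsZham=", rms(zham))
    for k in (1, 2, 3):
        # Observed fraction next to the fraction expected for a normal.
        print(k, fraction_within(sample, k), normal_fraction_within(k))
```

Mirroring leaves the rms unchanged, since each value and its negation have the same square. If the observed fractions differ clearly from the normal ones (too much weight in the tails, say), the normality assumption probably does not hold for that scheme.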
Rob
--
Rob W.W. Hooft || rob@hooft.net || http://www.hooft.net/people/rob/