[Spambayes] Maybe it's too late...
Tim Peters
tim.one at comcast.net
Sun Aug 24 21:49:21 EDT 2003
[Bill Yerazunis]
> You aren't keeping up with the CRM114 code base. :)
Indeed, I stopped paying attention to that at about the same time I had to
stop paying attention to the spambayes code base <wink/sigh>.
> In particular, the new format (hash, key, value) doesn't use a
> predefined number of buckets; when autogrooming or microgrooming is
> enabled, even CRM114 doesn't quite know how many buckets are active.
Well, a quick scan of the current CRM114 home page, FAQ and README didn't
turn up anything about those, so I guess understanding requires
reverse-engineering the source code? That's fair if so, I just don't have
time for it. You have a knack for unique terminology <wink -- but google
never heard of microgrooming, and associates autogrooming with monkeys and
bees>.
...
>> I'd rather defend SB on that count, as SB's scores are associated
>> with non-hashed tokens, and there's indeed no limit on how big a SB
>> database can get.
> Same in CRM114; until you hit something big enough that you need
> lseek, it's Not An Issue. In fact (again, if autogrooming or
> microgrooming is on) it's _better_ to be bigger.
There's a long thread waiting there; maybe later!
More information about the Spambayes
mailing list