[Spambayes] Mixed case words in heading

Tim Peters tim.one at comcast.net
Sun Apr 13 02:39:17 EDT 2003


[Tim Peters]
>> It's *mostly* case-insensitive, and indeed to minimize database
>> size, and because tests both ways had overall indistinguishable error
rates.

[Anthony Baxter]
> With smaller training databases, case-sensitivity actually made for
> noticeably worse results.

Good memory, Anthony!  That's right.  The thing that scares me is that this
result made intuitive sense <wink>.




More information about the Spambayes mailing list