A Moronicity of Guido van Rossum

Tony Meyer t-meyer at ihug.co.nz
Thu Sep 29 23:51:01 EDT 2005


> I know nobody wants to do add "white/black-listing", so we can do it
> probabilistically. In case it is not obvious, mailings with the words
> "jargon" or "moron" and their derrivatives should be flagged as 99.9%
> probability for Moronicity Xha Lee, Jargonizer, spam. If spam bayes  
> can't
> figure this out, then either it is not properly implemented or  
> Bayes himself
> was out to lunch.

I knew I'd regret my response <wink>.

The problem here isn't getting an appropriately spammy score for  
particular tokens, like Xah's name.  The problem is that the  
classifier has to taken into account the entire message, and the  
hammy clues outweigh the spammy ones (not unexpected, really,  
considering that other than all the trolling, the messages are  
reasonably on-topic).

This is a feature, not a bug.  It's the same feature that means that  
messages talking about spam on the spambayes mailing list, or the  
legitimate mail I get about viagra <wink>, get through to me.

=Tony.Meyer



More information about the Python-list mailing list