OT: spam filtering idea

Hans Nowak wurmy at earthlink.net
Mon Jan 13 12:41:56 EST 2003


Paul Rubin wrote:

> The idea is to run the probability coefficients through a digital
> filter, so the probabilities decay over time.  That is, you give
> special emphasis to words found in RECENTLY RECEIVED spam.  If you get
> message M with the words "banana", "elephant", and "doorknob", that
> doesn't make M especially likely to be spam.

Maybe in the future it will be, when all conventional spam is caught by spam 
filters, and certain words cannot be used anymore...  "Is your banana the size 
of a doorknob? Use <brand X> to make it the size of an elephant!" :-)
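
For what it's worth, the decaying-probabilities idea quoted above could be
sketched roughly like this. It is only an illustration, not Paul's actual
design: the class name DecayingSpamStats, the decay value, and the smoothing
constants are all made up here, and a plain exponential decay stands in for
whatever digital filter he has in mind.

```python
from collections import defaultdict


class DecayingSpamStats:
    """Hypothetical per-word spam/ham counts that fade with every new message."""

    def __init__(self, decay=0.9):
        # decay is an assumed tuning knob: 1.0 never forgets, smaller forgets faster
        self.decay = decay
        self.spam = defaultdict(float)
        self.ham = defaultdict(float)

    def train(self, words, is_spam):
        # Age every existing count first, so older mail matters less
        for table in (self.spam, self.ham):
            for word in table:
                table[word] *= self.decay
        # Then count each distinct word of the new message once
        target = self.spam if is_spam else self.ham
        for word in set(words):
            target[word] += 1.0

    def spam_probability(self, word):
        # Smoothed ratio (assumed constants); a never-seen word scores 0.5
        s = self.spam.get(word, 0.0)
        h = self.ham.get(word, 0.0)
        return (s + 0.5) / (s + h + 1.0)


stats = DecayingSpamStats(decay=0.5)
stats.train(["banana", "elephant", "doorknob"], is_spam=True)
fresh = stats.spam_probability("banana")
for _ in range(3):
    stats.train(["meeting", "agenda"], is_spam=False)
faded = stats.spam_probability("banana")
print(fresh, faded)
```

Right after the spam arrives "banana" scores high, and as unrelated mail
comes in the score drifts back toward neutral, which is the "recently
received" emphasis in a nutshell.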

-- 
Hans (base64.decodestring('d3VybXlAZWFydGhsaW5rLm5ldA=='))
# decode for email address ;-)
The Pythonic Quarter:: http://www.awaretek.com/nowak/
Kaa:: http://www.awaretek.com/nowak/kaa.html




