OT: spam filtering idea
Hans Nowak
wurmy at earthlink.net
Mon Jan 13 12:41:56 EST 2003
Paul Rubin wrote:
> The idea is to run the probability coefficients through a digital
> filter, so the probabilities decay over time. That is, you give
> special emphasis to words found in RECENTLY RECEIVED spam. If you get
> message M with the words "banana", "elephant", and "doorknob", that
> doesn't make M is especially likely to be spam.
Maybe in the future it will be, when all conventional spam is caught by spam
filters, and certain words cannot be used anymore... "Is your banana the size
of a doorknob? Use <brand X> to make it the size of an elephant!" :-)
--
Hans (base64.decodestring('d3VybXlAZWFydGhsaW5rLm5ldA=='))
# decode for email address ;-)
The Pythonic Quarter:: http://www.awaretek.com/nowak/
Kaa:: http://www.awaretek.com/nowak/kaa.html
More information about the Python-list
mailing list