spam classification breaker

Robin Becker robin at jessikat.fsnet.co.uk
Thu Feb 5 04:09:55 EST 2004


This article at the BBC reports on what appears to be a genetic
algorithm or random search method for finding words that apparently fool
bayesian classifiers every time.

http://news.bbc.co.uk/1/hi/technology/3458457.stm

The author apparently had to include html reporting into the emails to
allow his mail client to report back automatically.

Of course if he'd used python the whole process of email generation and
classification could have been done in a single process and would
probably allow easier generation of the magic words.

Why Berkshire, Marriot etc should be allowed through is pretty strange
:)
-- 
Robin Becker



More information about the Python-list mailing list