[Spambayes] Re: [spambayes-dev] a useful pre-filter for auto-training bayesian systems?

Justin Mason jm at jmason.org
Thu Feb 19 20:21:44 EST 2004


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


Ryan Malayter writes:
> I found this interesting. Using the social network as a first step, a
> bunch of "definite ham" and "definite spam" messages are listed. These
> can be used to train a Bayesian filter which then filters the rest of
> the unsures automatically. Very little user intervention would therefore
> be required for training, and it cuts in half the number of messages
> that must be filtered by the much-more-expensive statistical filter.
>  
> http://www.arxiv.org/abs/cond-mat/0402143 
>  
> One could even imagine users securely publishing their "white lists" of
> correspondents by posting SHA-1 hashes, instead of the actual email
> addresses, to some public forum. (The hashes would have to be salted, of
> course.) This could create a meta-social-network.
>  
> They don't seem to address the issue of a spam that has a forged address
> from your own social network, though, which might trip up this whole
> social network process.
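
For concreteness, the hashed white-list idea quoted above might look
something like the following. This is only a rough sketch under some
assumptions: the function names (hash_address, publish_whitelist,
is_listed) and the example salt are made up for illustration, and the
salt is assumed to be shared among the participants so that they can check
membership. Note that anyone who knows the salt can still test guessed
addresses against the published hashes.

```python
import hashlib


def hash_address(address: str, salt: bytes) -> str:
    """Return the hex SHA-1 digest of salt + normalized address."""
    # Normalize so "Alice@Example.com" and "alice@example.com" match.
    normalized = address.strip().lower().encode("utf-8")
    return hashlib.sha1(salt + normalized).hexdigest()


def publish_whitelist(addresses, salt: bytes) -> set:
    """Build the set of salted hashes that would be posted publicly."""
    return {hash_address(a, salt) for a in addresses}


def is_listed(address: str, published: set, salt: bytes) -> bool:
    """Check a sender address against a published set of hashes."""
    return hash_address(address, salt) in published


# Hypothetical usage: the salt and addresses are illustrative only.
salt = b"demo-salt"
published = publish_whitelist(["Alice@Example.com", "bob@example.org"], salt)
print(is_listed("alice@example.com", published, salt))
print(is_listed("mallory@example.net", published, salt))
```

The published set contains only 40-character hex digests, never the
addresses themselves.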

Yeah -- spam with a forged From: header carrying your own address.  That
has historically been how spammers get around address-book-based
whitelisting, because most people have one or more of their own addresses
in the address book.
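
One possible mitigation, sketched here purely as an illustration, is to
refuse to treat your own addresses as whitelist hits, so a forged From:
with your address falls through to the statistical filter. The function
name whitelist_verdict and the "ham"/"unsure" return values are
assumptions for this sketch, not anything from SpamBayes itself. It does
nothing about forged addresses of other people in the address book.

```python
from email.utils import parseaddr


def whitelist_verdict(from_header, address_book, own_addresses):
    """Return "ham" for a whitelisted sender, otherwise "unsure".

    Mail claiming to come from one of the user's own addresses is never
    whitelisted, since that address is trivially forged by spammers.
    """
    # parseaddr splits "Name <addr>" into (name, addr); addr is "" on failure.
    _, addr = parseaddr(from_header)
    addr = addr.lower()
    book = {a.lower() for a in address_book}
    own = {a.lower() for a in own_addresses}
    if not addr or addr in own:
        return "unsure"  # leave it to the statistical filter
    if addr in book:
        return "ham"
    return "unsure"


# Hypothetical usage: all addresses are illustrative only.
book = {"me@example.com", "friend@example.org"}
own = {"me@example.com"}
print(whitelist_verdict("Friend <friend@example.org>", book, own))
print(whitelist_verdict("Me <me@example.com>", book, own))
```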

- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.3 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFANWEoQTcbUG5Y7woRAr2uAKDC4sevuJ87uYk6zPlb6aWOik7xXgCfdn7n
/TFi3tpsMnGxI38K4cpTmUA=
=v88V
-----END PGP SIGNATURE-----

