[Spambayes] deleting "duplicate" spam before training? good idea or bad?

Skip Montanaro skip@pobox.com
Mon, 9 Sep 2002 14:35:04 -0500


    Greg> OTOH, look into DCC (Distributed Checksum Clearinghouse,
    Greg> http://www.rhyolite.com/anti-spam/dcc/), which uses fuzzy
    Greg> checksums.  It's quite likely that DCC's checksumming scheme is
    Greg> better than something any of us would throw together for personal
    Greg> use (no offense, Skip!).

None taken.  I wrote my little script before I was aware DCC existed.  Even
now, DCC seems like overkill for my use.

Skip