[spambayes-dev] NEWTRICKS

David Abrahams dave at boost-consulting.com
Fri Dec 26 13:16:55 EST 2003


I keep getting quite a few spams which fit the descriptions below
(from NEWTRICKS.txt):

  - Punctuation sometimes gets inserted in otherwise spammy words or phrases,
    e.g.: "Ch-eck ou=t ou-r sel)ection _of grea)t R_X -emgffj".  It might be
    helpful to try stripping punctuation.  (Idea from Paul Sorenson)

  - Similarly, some letters get replaced by numbers, e.g.: "V1agra" instead of
    "Viagra".  Mapping numbers to suitable letters might help in some
    situations.

Since "this file is for ideas that have or have not yet been tried",
I'd love to know what constitutes "trying".  Is there some official
testing procedure or corpus we can test against?  I'd like to know
whether any change I make is worth proposing.  Of course I can try it
on my own databases of Ham and Spam first...

-- 
Dave Abrahams
Boost Consulting
www.boost-consulting.com




More information about the spambayes-dev mailing list