[Spambayes] Whitelist for SpamBayes for Outlook

David Matos david.matos at comcast.net
Fri Sep 12 13:53:39 EDT 2003


Thanks for the reply, Skip! Here are the spam clues for an e-mail that is
representative of the kind I keep seeing filtered as "Unsure," even though
I have trained on lots of messages from this sender:

Spam score: 12.55%

Word                                 Probability  # Good  # Spam
'*H*'                                     99.97%       -       -
'*S*'                                     25.07%       -       -
'from:addr:adjoined.com'                   0.42%      53       0
'from:addr:operez'                         0.42%      53       0
'from:name:oscar perez'                    0.42%      53       0
'message-id:@smmia001.adjoined.net'        0.43%      52       0
'url:story2'                               5.06%       4       0
'subject:Goes'                             6.52%       3       0
'subject:Yahoo'                            6.79%      16       1
'url:story'                               11.58%      38       5
'subject:News'                            13.23%     133      21
'url:ncid'                                15.52%       1       0
'url:tmpl'                                18.96%      13       3
'url:news'                                20.12%     183      48
'header:Received:1'                       29.01%     560     239
'proto:http'                              61.93%    9835   16725
'x-mailer:none'                           62.77%    5776   10180
'subject:; '                              65.23%      31      61
'url:com'                                 65.87%    6350   12812
'url:u'                                   72.92%     164     462
'url:yahoo'                               73.20%     382    1091
'header:Message-ID:1'                     73.53%      98     285
'url:cid'                                 74.11%      55     165
'url:'                                    74.96%    1368    4282
'subject:! '                              89.32%      87     762
'url:eo'                                  99.78%       0     102
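
As a rough check on the numbers in the clue list, here is a minimal sketch of how the per-word probabilities and the final score appear to be derived. It assumes SpamBayes' default unknown-word settings (strength 0.45, prior 0.5) and the usual (S - H + 1) / 2 combination of the two indicators. Neither is stated in this message, and the function names are made up for illustration. Only the rows with a zero count can be checked this way, because the total numbers of trained ham and spam messages are not shown.

```python
# Assumed defaults, not taken from this message.
UNKNOWN_WORD_STRENGTH = 0.45
UNKNOWN_WORD_PROB = 0.5


def adjusted_prob(raw_prob, times_seen):
    """Pull a raw spam probability toward the prior when evidence is thin."""
    s, x = UNKNOWN_WORD_STRENGTH, UNKNOWN_WORD_PROB
    return (s * x + times_seen * raw_prob) / (s + times_seen)


def score_from_indicators(h, s):
    """Fold the ham indicator *H* and spam indicator *S* into one score."""
    return (s - h + 1.0) / 2.0


# A word seen 53 times, always in ham: raw probability 0.0
print(f"{adjusted_prob(0.0, 53):.2%}")    # 0.42%, as for 'from:addr:adjoined.com'
# A word seen 102 times, always in spam: raw probability 1.0
print(f"{adjusted_prob(1.0, 102):.2%}")   # 99.78%, as for 'url:eo'
# The reported *H* and *S* values
print(f"{score_from_indicators(0.9997, 0.2507):.2%}")  # 12.55%
```

If these assumptions hold, the sender's address can never reach 0% however often it is trained as ham. Each extra ham message only moves it a little closer.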

Message text 
Received: from mail.adjoined.com ([63.116.137.3])
          by rwcrmxc11.comcast.net (rwcrmxc11) with ESMTP
          id <20030912123209r1100769pqe>; Fri, 12 Sep 2003 12:32:09 +0000
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
MIME-Version: 1.0
Subject: Yahoo! News - Batman Goes Psycho; Bale Cast
Date: Fri, 12 Sep 2003 08:32:07 -0400
Message-ID: <7F1B2E8605ED6B4EA45F13F991588A3D033B998F at smmia001.adjoined.net>
X-MS-Has-Attach: yes
X-MS-TNEF-Correlator: 
Thread-Topic: Yahoo! News - Batman Goes Psycho; Bale Cast
Thread-Index: AcN5KeDy9XDeto2eSlaMzzKMVyFNPA==
From: "Oscar Perez" <operez at adjoined.com>
To: <matos at attbi.com>


 <<Yahoo! News - Batman Goes Psycho; Bale Cast.url>>  
 
 http://story.news.yahoo.com/news?tmpl=story2&cid=794&u=/eo/12487&ncid=

 

39 unique tokens

Unique words in message 
content-type:text/plain from:addr:adjoined.com from:addr:operez
from:name:oscar perez header:Date:1 header:From:1 header:MIME-Version:1
header:Message-ID:1 header:Received:1 header:Subject:1 header:To:1
message-id:@smmia001.adjoined.net proto:http subject:  subject: - 
subject:!  subject:;  subject:Bale subject:Batman subject:Cast
subject:Goes subject:News subject:Psycho subject:Yahoo to:2**0
url: url:12487 url:794 url:cid url:com url:eo url:ncid url:news
url:story url:story2 url:tmpl url:u url:yahoo x-mailer:none


-----Original Message-----
From: Skip Montanaro [mailto:skip at pobox.com] 
Sent: Friday, September 12, 2003 10:02 AM
To: David Matos
Cc: spambayes at python.org
Subject: RE: [Spambayes] Whitelist for SpamBayes for Outlook



    David> In practice, though, I find that mail from at least one of my
    David> acquaintances keeps getting filtered as "Unsure." This is a
    David> friend who sends me mail as often as 20 times a day (in fact,
    David> the majority of my ham), always from the same address, but with
    David> wildly varying content, ranging from a couple words in plain text
    David> to a single picture without text to an entire page of HTML.

Any chance you can post the ham/spam clues for a couple of these messages?

    David> I recall someone saying earlier that SB (and SAB) takes the
    David> message header into account when classifying the message--if
    David> this is the case, why isn't the "From" address a dead-giveaway?

It's probably a very strong ham clue, but it is still just one clue.  To
SpamBayes, a clue is a clue is a clue.
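
To illustrate the "one clue among many" point, here is a minimal sketch of chi-squared combining, which is my understanding of how SpamBayes merges clue probabilities into a score. It is not copied from the SpamBayes source. The function names and sample probabilities are illustrative, and the real classifier also limits which clues it uses, so do not expect it to reproduce a reported score exactly.

```python
import math


def chi2q(x2, v):
    """Probability that a chi-squared variable with v (even) degrees of
    freedom is at least x2."""
    m = x2 / 2.0
    term = total = math.exp(-m)
    for i in range(1, v // 2):
        term *= m / i
        total += term
    return min(total, 1.0)


def combine(probs):
    """Combine per-clue spam probabilities (each strictly between 0 and 1)
    into a single score between 0 (ham) and 1 (spam)."""
    n = len(probs)
    ln_spam = sum(math.log(1.0 - p) for p in probs)
    ln_ham = sum(math.log(p) for p in probs)
    s = 1.0 - chi2q(-2.0 * ln_spam, 2 * n)  # spam indicator
    h = 1.0 - chi2q(-2.0 * ln_ham, 2 * n)   # ham indicator
    return (s - h + 1.0) / 2.0


# One very strong ham clue on its own...
print(combine([0.0042]))
# ...and the same clue alongside a few spammy-looking ones.
print(combine([0.0042, 0.9, 0.9, 0.9]))
```

In this sketch the strong ham clue decides the score when it is the only evidence, but a few spammy clues are enough to pull the result back toward the middle.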

    David> What can I do to help figure out what's going on with this
    David> particular filtering phenomenon?

Clues, we need clues.  ;-)

Skip



