[spambayes-dev] [ spambayes-Feature Requests-762783 ] TMDA capabilities

SourceForge.net noreply at sourceforge.net
Tue Jul 1 09:36:11 EDT 2003


Feature Requests item #762783, was opened at 2003-06-29 12:33
Message generated for change (Comment added) made by montanaro
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=498106&aid=762783&group_id=61702

Category: None
Group: None
Status: Open
Priority: 5
Submitted By: Why do you need this (beyond-thoughts)
Assigned to: Nobody/Anonymous (nobody)
Summary: TMDA capabilities

Initial Comment:
I just switched from TMDA (http://tmda.net/) to Spambayes.
Watching emails that were neither in black- or
whitelist was to annoying. Of course those people get a
reply (please see at TMDA-Homepage to see how it works)
but about 50% don't understand the reply-mail. (I don't
know why -- the text is very clear!).
Spambayes doesn't queue emails that can't be classified
-- you receive them as "unsure".

I think combining both techniques would cut down wrong
positive and wrong negative and the "unsure" numbers.
Additionaly it might be useful for training having a
definite black- and whitelist.

So what are good points (files, classes, methods) to
add such a feature?
Is it dependent whether pop3proxy, hammie, ... is used?

Thank You,
   Christoph Becker-Freyseng


----------------------------------------------------------------------

>Comment By: Skip Montanaro (montanaro)
Date: 2003-07-01 10:36

Message:
Logged In: YES 
user_id=44345

Whitelist functionality really isn't needed.  Spambayes already
tokenizes email addresses, so over time it effectively builds up
a whitelist for you.  Here are some examples from my current
training database (the tuple values are (nspam, nham)):

>>> db["email name:itineraries"]
(71, 10)
>>> db["email name:guido"]
(0, 8)
>>> db["email name:webmaster"]
(58, 45)
>>> db["email name:skip"]
(365, 314)
>>> db["email name:support"]
(136, 13)

Note that itineraries at mojam.com & support at microsoft.com
are frequently forged in mail I receive.  webmaster at mojam.com
and skip at pobox.com are forged a fair amount, but are also
frequently correct.  On the other hand, nobody has so far taken
Guido's name in vain in my incoming email.  (I rarely train on
Python-related email, so there are only a few messages from
Guido in my training database.)

Even if you implemented such a feature it would probably not
be as sensitive as the current tokenizing scheme.  In addition, you
would still have to scan your spam.  You will eventually get a
valid email message from someone not on your whitelist.

Regarding:

    Submitted By:
    Why do you need this (beyond-thoughts)

it's because (in general) too many people submit incomplete bug
reports anonymously and then can't be contacted to complete
their report.  This was a significant problem with the Python
project and sort of carried over to the Spambayes project.


----------------------------------------------------------------------

Comment By: Why do you need this (beyond-thoughts)
Date: 2003-07-01 09:29

Message:
Logged In: YES 
user_id=186848

I know that TMDA and Spambayes have different approaches,
but this is what makes it useful combining them.
With Spambayes till now I had no false classified emails
(just few "unsure") so I'm quite satisfied with it. However
I still have to check the emails in "Spam-Folder" because I
don't want to risk loosing some false-classified important
email.
When Spambayes had some TMDA capabilities it could just sent
a confirmation-email to the asumed spammer. If he doesn't
reply then it's really his fault. So I won't have to check
emails in the "Spam-Folder" at all.
On the other side people I send emails to could be
automatically added to a white-list so they surely won't
have trouble with Spamfilters.
I have thought of some more configurable rules that could be
added making Spambayes an interactive-AI-Spamfilter.

I'd like to make a demo-implementation but I need some
starting points. Especially I don't know how the Outlook
stuff works. But maybe I should just try implementing such a
thing for the pop3- and smtp-proxy.

Thanks,
   Christoph Becker-Freyseng


----------------------------------------------------------------------

Comment By: Anthony Baxter (anthonybaxter)
Date: 2003-06-30 20:38

Message:
Logged In: YES 
user_id=29957

I can't imagine that this feature would ever be added to
spambayes. It's a completely different approach to
spam-filtering, with almost nothing in common with the
existing approach. Spambayes will gradually improve as you
train it further - the initial flurry of unsures is probably
just insufficient training. Note also that you can adjust
the cutoffs to end up with more or less unsures.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=498106&aid=762783&group_id=61702



More information about the spambayes-dev mailing list