[spambayes-bugs] [ spambayes-Feature Requests-765924 ] Spam / ham statistics
SourceForge.net
noreply at sourceforge.net
Wed Dec 8 04:38:39 CET 2004
Feature Requests item #765924, was opened at 2003-07-05 00:51
Message generated for change (Comment added) made by anadelonbrin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498106&aid=765924&group_id=61702
Category: Outlook
Group: None
Status: Open
Priority: 5
Submitted By: Magnus Aycox (mbip)
Assigned to: Nobody/Anonymous (nobody)
Summary: Spam / ham statistics
Initial Comment:
Possibility to get statistics on how many mails were
received per hour / day and how many of these were
spam messages.
It would be great if it could be presented both as
numbers and graphically (impresses CEOs...). The
means to print it as a hard copy would be just
swell... ;o)
----------------------------------------------------------------------
>Comment By: Tony Meyer (anadelonbrin)
Date: 2004-12-08 16:38
Message:
Logged In: YES
user_id=552329
Another update: there is now a separate "Statistics" tab in
the SpamBayes Manager, which makes way for more stats. The
stats are now mostly the ones that the web interface offers,
plus ones that can be used for JGC's batting average
calculations. Thanks to Kenny Pitt!
To summarise the rest of this tracker: there is still no
graphical display, stats about the database itself, or stats
about mail received per hour/day.
----------------------------------------------------------------------
Comment By: Tony Meyer (anadelonbrin)
Date: 2004-10-15 12:47
Message:
Logged In: YES
user_id=552329
As an update: I've checked in a change to CVS that makes the
Outlook plug-in's statistics persistent across sessions, so
it's now basically the same as sb_server/sb_imapfilter's stats.
Graphics or more stats will have to wait for another day...
----------------------------------------------------------------------
Comment By: Josh Wand (joshwa)
Date: 2004-06-27 12:07
Message:
Logged In: YES
user_id=7937
Have a look at the popfile statistics page for good ideas on
how to present spam statistics. Totals and percentages for
spam, non-spam, false-pos, false-neg, and summary and detail
statistics on the corpus itself (which words are likeliest to
be in spam, ham, etc).
Lovely html graphics, too.
I'd love to see these features in spambayes.
----------------------------------------------------------------------
Comment By: Murilo Pinto (mulpinsf)
Date: 2004-05-28 01:50
Message:
Logged In: YES
user_id=378491
I'd like to have a tiny button in the Outlook toolbar showing
how many spam (red) / ham (blue) messages SB processed in that
session. Hovering the mouse over it would give the % in the
"pop-up" comment.
If clicked, it could open a "page" with more detailed stats, e.g.
per hour/day/month/hour of the day/day of the
week/account/year.
Outstanding antispam, btw.
----------------------------------------------------------------------
Comment By: Tony Meyer (anadelonbrin)
Date: 2004-03-13 20:00
Message:
Logged In: YES
user_id=552329
Noting dupe in [ 915260 ] Statistics
<http://sourceforge.net/tracker/index.php?func=detail&aid=915260&group_id=61702&atid=498106>
----------------------------------------------------------------------
Comment By: Kenny Pitt (kpitt)
Date: 2003-11-21 03:21
Message:
Logged In: YES
user_id=859086
The latest CVS plugin actually does include false positives and
negatives in the statistics. The definition is as follows. If a
message was classified as Ham and then reclassified by the
user as Spam, it is a false negative. If a message was
classified as Spam and then reclassified by the user as Ham,
it is a false positive. A message that was originally classified
as Unsure is never considered a false positive or negative.
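
To illustrate the definition above, here is a minimal Python sketch. It is
not the plug-in's actual code: the label strings and the function names
(classify_correction, tally) are hypothetical and only restate the rule.

```python
from collections import Counter

# Hypothetical labels; the real plug-in may represent these differently.
HAM, SPAM, UNSURE = "ham", "spam", "unsure"


def classify_correction(original, corrected):
    """Return 'fn', 'fp', or None for one user reclassification.

    original  -- what the classifier said when the message arrived
    corrected -- what the user later trained the message as
    """
    if original == HAM and corrected == SPAM:
        return "fn"  # spam that slipped through as ham
    if original == SPAM and corrected == HAM:
        return "fp"  # ham that was wrongly filed as spam
    # Messages first classified as Unsure never count either way.
    return None


def tally(corrections):
    """Count false positives/negatives over (original, corrected) pairs."""
    counts = Counter()
    for original, corrected in corrections:
        kind = classify_correction(original, corrected)
        if kind is not None:
            counts[kind] += 1
    return counts
```

With this rule, training an Unsure message as Spam or Ham changes neither
counter, which is the point of the last sentence of the definition.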
----------------------------------------------------------------------
Comment By: Erik Sargent (esargent)
Date: 2003-11-20 22:34
Message:
Logged In: YES
user_id=586922
Quick note on the "impossibility" of tracking false pos/neg.
Actually, since an incorrectly classified message already has
a header inserted, then you would only flag a "false" if that
header existed and was changed. This means you'd have to
check for the existence of the header before you processed
the Delete/Recover buttons, but it can be done.
----------------------------------------------------------------------
Comment By: Tony Meyer (anadelonbrin)
Date: 2003-09-30 16:19
Message:
Logged In: YES
user_id=552329
Note that the plug-in has basic stats information (cvs
version) now, although it's still only on a per session basis
(this will no doubt improve at some point).
The web interface (for non-plugin users) also now (cvs head)
has basic stats, which are persisted between sessions.
Any opinions on which statistics would be best to add?
----------------------------------------------------------------------
Comment By: Tony Meyer (anadelonbrin)
Date: 2003-07-11 16:25
Message:
Logged In: YES
user_id=552329
It's not exactly what you have asked for, but as a start, are
you aware that in the logs, each time you shut Outlook down
it prints a message like:
"SpamBayes processed 555 messages, finding 34 spam and 11
unsure"
(So you could shut Outlook down each hour/day, to generate
this message). It's unlikely that a graphical version would
ever be made, but it would be easy enough to throw numbers
like this into Excel and get pretty graphs.
The number of false positives/negatives is more difficult
because SpamBayes doesn't really have any way to know
that mail is a fp/fn. It could print the number of times
the "delete as spam" and "recover from spam" buttons are
used, I guess, but this would include all unsure mail, which
isn't exactly fp/fn.
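
As a rough sketch of the "throw numbers into Excel" idea, the Python below
pulls the summary lines out of a log and writes them as CSV. It assumes the
log line has exactly the wording quoted above; the function names and the
derived "ham" column (total minus spam minus unsure) are my own assumptions,
not something SpamBayes provides.

```python
import csv
import io
import re

# Assumes the shutdown summary is worded exactly as quoted in this comment.
LOG_PATTERN = re.compile(
    r"SpamBayes processed (\d+) messages,\s+finding (\d+) spam and\s+(\d+)\s+unsure"
)

FIELDS = ["total", "spam", "unsure", "ham"]


def parse_summaries(log_text):
    """Return one dict per summary line found in log_text."""
    rows = []
    for match in LOG_PATTERN.finditer(log_text):
        total, spam, unsure = (int(group) for group in match.groups())
        rows.append({
            "total": total,
            "spam": spam,
            "unsure": unsure,
            # Assumption: everything that is not spam or unsure was ham.
            "ham": total - spam - unsure,
        })
    return rows


def to_csv(rows):
    """Render the rows as CSV text that a spreadsheet can open."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Each Outlook session contributes one row, so shutting down once a day
would give one data point per day to graph.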
----------------------------------------------------------------------
Comment By: Mark Jeays (dze27)
Date: 2003-07-11 16:16
Message:
Logged In: YES
user_id=302748
I'm just another user but I think this would be a great
addition. I'd also be interested in: number of false
positives (along with percentage of total), number of false
negatives (along with percentage of total) and percentage of
mail received that is spam.
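
For concreteness, the percentages listed above could be computed along
these lines. This is only a sketch: the function name spam_stats is
hypothetical, and "percentage of total" is taken here to mean a percentage
of all messages processed, which is one possible reading.

```python
def spam_stats(total, spam, false_pos, false_neg):
    """Return the requested percentages, each relative to all mail processed."""
    if total <= 0:
        raise ValueError("total must be positive")

    def pct(count):
        # Percentage of all processed messages (assumed denominator).
        return 100.0 * count / total

    return {
        "spam_pct": pct(spam),
        "false_pos_pct": pct(false_pos),
        "false_neg_pct": pct(false_neg),
    }
```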
----------------------------------------------------------------------