[Spambayes-checkins] website faq.txt,1.3,1.4
Skip Montanaro
montanaro at users.sourceforge.net
Tue Jun 17 15:37:44 EDT 2003
Update of /cvsroot/spambayes/website
In directory sc8-pr-cvs1:/tmp/cvs-serv25778
Modified Files:
faq.txt
Log Message:
add a question and answer about things to check when the classifier isn't.
Index: faq.txt
===================================================================
RCS file: /cvsroot/spambayes/website/faq.txt,v
retrieving revision 1.3
retrieving revision 1.4
diff -C2 -d -r1.3 -r1.4
*** faq.txt 5 Jun 2003 14:19:01 -0000 1.3
--- faq.txt 17 Jun 2003 21:37:40 -0000 1.4
***************
*** 480,483 ****
--- 480,509 ----
+ Spambayes doesn't seem to catch much spam. What gives?
+ -------------------------------------------------------
+
+ Initially, Spambayes will not be able to distinguish spams from hams. With
+ no training inputs, the classifier will simply mark everything unsure. Once
+ you start training the classifier on a representative set of spams and hams
+ it should very quickly begin to improve, however. If that's not the case,
+ perhaps you have something misconfigured. Here are a couple things to
+ check:
+
+ 1. What do you have your ham and spam thresholds set to? The defaults are
+ 0.2 and 0.9, respectively, and should be reasonable starting points.
+ They should not be close together (say, 0.4 and 0.6).
+
+ 2. Have you trained on a reasonable number of hams and spams? You should
+ train on 10 to 20 of each to start with just to get a decent base. After
+ that, you should be able to train on just mistakes and messages
+ classified as unsure.
+
+ 3. Check to be sure you haven't made any classification mistakes (trained
+ spams as hams or vice versa). If so, you could really confuse things and
+ should move incorrectly classified messages to their correct locations
+ and retrain (which may happen automatically, depending on how you're
+ using Spambayes).
+
+
How do I start from scratch after messing up my training?
---------------------------------------------------------
More information about the Spambayes-checkins
mailing list