[Spambayes-checkins] spambayes README.txt,1.9,1.10
Tim Peters
tim_one@users.sourceforge.net
Sat, 07 Sep 2002 11:22:02 -0700
Update of /cvsroot/spambayes/spambayes
In directory usw-pr-cvs1:/tmp/cvs-serv16167
Modified Files:
README.txt
Log Message:
Some rearrangement.
Index: README.txt
===================================================================
RCS file: /cvsroot/spambayes/spambayes/README.txt,v
retrieving revision 1.9
retrieving revision 1.10
diff -C2 -d -r1.9 -r1.10
*** README.txt 7 Sep 2002 16:14:09 -0000 1.9
--- README.txt 7 Sep 2002 18:22:00 -0000 1.10
***************
*** 31,35 ****
hammie.py
! A spamassassin-like filter which uses timtoken (below) and
classifier (above). Needs to be made faster, especially for writes.
--- 31,35 ----
hammie.py
! A spamassassin-like filter which uses tokenizer (below) and
classifier (above). Needs to be made faster, especially for writes.
***************
*** 49,56 ****
tokenize() function of your choosing.
- unheader.py
- A script to remove unwanted headers from an mbox file. This is mostly
- useful to delete headers which incorrectly might bias the results.
-
GBayes.py
A number of tokenizers and a partial test driver. This assumes
--- 49,52 ----
***************
*** 73,84 ****
Test Data Utilities
===================
- rebal.py
- Evens out the number of messages in "standard" test data folders (see
- below).
-
cleanarch
A script to repair mbox archives by finding "From" lines that
should have been escaped, and escaping them.
mboxcount.py
Count the number of messages (both parseable and unparseable) in
--- 69,80 ----
Test Data Utilities
===================
cleanarch
A script to repair mbox archives by finding "From" lines that
should have been escaped, and escaping them.
+ unheader.py
+ A script to remove unwanted headers from an mbox file. This is mostly
+ useful to delete headers which incorrectly might bias the results.
+
mboxcount.py
Count the number of messages (both parseable and unparseable) in
***************
*** 89,92 ****
--- 85,92 ----
Split an mbox into random pieces in various ways. Tim recommends
using "the standard" test data set up instead (see below).
+
+ rebal.py
+ Evens out the number of messages in "standard" test data folders (see
+ below).