From davea at sportingbet.com.au Mon Sep 1 02:30:46 2003 From: davea at sportingbet.com.au (Dave Alford) Date: Sun Aug 31 12:48:35 2003 Subject: [Spambayes] automated response Message-ID: <10309010130.AA135273584@sportingbet.com.au> Hi, I will be on holidays from Monday the 1st of September until the following Monday (8th of September). I will, however, be checking my e-mail periodically. If you need to contact me urgently, I can be reached on 0408-089-379. For all non-urgent matters, please contact Tom Rundle on (08) 8923-8046. Thanks, Dave Alford SysAdmin Sportingbet Australia From rogerb at netspeed.com.au Mon Sep 1 09:47:10 2003 From: rogerb at netspeed.com.au (Roger Booth) Date: Sun Aug 31 18:48:20 2003 Subject: [Spambayes] Initial Start Up Problems Message-ID: <000801c37011$d0bb0be0$9a301fcb@rogerboo> I would be grateful if someone could give me some assistance to get me going with Spambayes. I've spent a day downloading and reading the documentation but haven't got anywhere. My operating system is Windows 2000 and I want to use pop3proxy with Outlook and Outlook express. I've downloaded spambayes 1.0a4 and Python 2.3. As I said in an earlier post, I can only get Python to compile the files by opening them with it. If I try to run pop3proxy.py -b it says it can't fine the file even though it is in the current directory. I've also tried the command: ">>>> from spambayes.Options import options" in python and again it can't find the file even though it is in the current directory. Is there some environmental variable that needs to be set for Python to work? I can't find a reference to it in the Python documentation. I want to configure Spambayes to talk to my ISP mail server. I read that I need to create a file "baysecustomize.ini" and enter the name of my ISP mailserver and the port it listens on. I've entered: [Server] server: mail.netspeed.com.au [Port] port: 110 Is this right? Next I want to set up Outlook to go to the pop3proxy instead of my ISP mail server. For another proxy server I've tried you enter "localhost" or "127.0.0.1" as the pop server in the account details and put the name of the ISP mail server and ":" before your username. Is the same correct for Spambayes? Thanks RB From T.A.Meyer at massey.ac.nz Mon Sep 1 12:09:49 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Aug 31 19:10:37 2003 Subject: [Spambayes] Install error with SpamBayes Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130308C66A@its-xchg4.massey.ac.nz> > > Python tells me that hammie.py has a syntax error > > This was fixed in CVS on Tuesday 26 Aug - line 257 of > Hammie.py is missing a trailing close-parenthesis: > You could update from CVS, or fix your local copy of 1.0a4. > Soon you'll be able to download 1.0a5, which is due out any day now. However, this problem was introduced *after* 1.0a4 (I introduced it ;), so it shouldn't be happening in the 1.0a4 release. Do you get exactly the same trace with 1.0a4? Anyway, as Richie said, 1.0a5 would be the best bet. =Tony Meyer From ryanbooker at kromestudios.com Mon Sep 1 10:51:00 2003 From: ryanbooker at kromestudios.com (Ryan Booker) Date: Sun Aug 31 19:51:37 2003 Subject: [Spambayes] Using SpamBayes with SSL mail server Message-ID: <009401c3701a$bb8593e0$0901a8c0@ryanbooker> I'm trying to set up spambayes pop3proxy with an SSL pop server. the server is set to mail.blah.com:995 and the port to 995. But it always times out. Any ideas? -- Cheers, Ryan From T.A.Meyer at massey.ac.nz Mon Sep 1 15:10:40 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Aug 31 22:11:22 2003 Subject: [Spambayes] Initial Start Up Problems Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130308C75A@its-xchg4.massey.ac.nz> > My operating system is Windows 2000 and I want to use > pop3proxy with Outlook and Outlook express. Is there any reason that you would rather use pop3proxy with Outlook than the Outlook plugin? The plug-in is almost certainly the better option. > I've downloaded spambayes 1.0a4 and Python 2.3. As I said in > an earlier post, I can only get Python to compile the files > by opening them with it. If I try to run pop3proxy.py -b it > says it can't fine the file even though it is in the current > directory. Have you run "setup.py install"? What if you run "c:\python23\python pop3proxy.py -b"? > I've also tried the command: ">>>> from spambayes.Options > import options" in python and again it can't find the file > even though it is in the current directory. Is the current directory on the path, though? "import sys;print sys.path". Again, this sounds like you haven't run "setup.py install". (Note that the 1.0a5 documentation has been improved). > Is there some environmental variable that needs to be set for > Python to work? I can't find a reference to it in the Python > documentation. No. You can set PYTHONPATH to a list of directories to search, but you shouldn't need to do this if you *install* spambayes. > I read that I need to create a file "baysecustomize.ini" and > enter the name of my ISP mailserver and the port it listens > on. I've entered: [...] > Is this right? No. Once you have pop3proxy running, go to http://localhost:8880 and do the setup there; you'll find it much easier. The options you are after are: """ [pop3proxy] remote_servers: mail.netspeed.com.au listen_ports: 110 """ > Next I want to set up Outlook to go to the pop3proxy instead > of my ISP mail server. For another proxy server I've tried > you enter "localhost" or "127.0.0.1" as the pop server in the > account details and put the name of the ISP mail server and > ":" before your username. Is the same correct for Spambayes? No. Set the server to "localhost" (or "127.0.0.1", it doesn't matter). You don't need to do anything else. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 1 15:12:33 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Aug 31 22:13:19 2003 Subject: [Spambayes] More filtering problems Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130308C75D@its-xchg4.massey.ac.nz> > I finally got spambayes to move messages from my inbox , > however I have 2 other inboxes that it scores but won't > filter. I have the filters set up to watch and filter these 2 > other folders but apparently the outlook rule to move > messages as they arrive to these folders is overruling the > spambayes filter. The interaction with Outlook's rule is difficult (Outlook's fault!). Your best option is probably to enable the "timer" option (look in the configuration.html file, which you can get to via the "About" link in the main dialog), and watch all the folders than mail is moved to. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 1 17:31:31 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 1 00:32:15 2003 Subject: [Spambayes] Initial Start Up Problems Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130308C844@its-xchg4.massey.ac.nz> > I want to use pop3proxy because its for an assignment and I > need to demonstrate some understanding of what is going on. ! Can I ask what the assignment is for? > I did try to install the plug in but it didn't work. [...] > AttributeError: 'module' object has no > attribute 'register' Ah, yes, I saw this bug (and left it for Mark ;) > "setup.py install" doesn't work either, even with > "c:\python23\python, it can't find the file again. What if you give full paths for everything? i.e.: C:\python23\python.exe path\to\spambayes\setup.py install"? > I tried > opening install.py with python but it didn't seem to do > anything. I guess you mean setup.py, and that would be the result - it'll only do it's thing if you pass "install" to it as an argument. > I ran "import sys;print sys.path" from python and got some > stuff about where python is which looked ok. Try this from inside Python: >>> import sys >>> sys.path.append(-1, "x:\path\to\spambayes") >>> import spambayes (Obviously, with both this and the above command, you'll have to replace the path with the path to whereever you have put spambayes). > I had tried to use http://localhost:8880 to configure but the > URL couldn't be retrieved It'll only work when pop3proxy is running...If it doesn't at that point, then the best bet is to try running it on a different port ("pop3proxy.py -u 9880 -b", for example). =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 1 18:04:39 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 1 01:05:40 2003 Subject: [Spambayes] Re: SMTP Errors using Eudora Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130308C86F@its-xchg4.massey.ac.nz> > Finally got 1.12 - same result (500 error from server.) I think sourceforge is even further than 24 hours behind at the moment. The fix for all of that is supposedly only days away, though, thank goodness! > > The person that submitted the bug says that removing the '\r\n' that > > is added to "cooked" (right above where I got you to add the > > debugging statements) works for him. [...] > I get the same (expected) results [...] Hmm. I misread the bug report. The removed '\r\n' wasn't there, it was a few lines higher (in the if self.inData == True block; it should be line 225 in v1.12). Cameron Stone (thanks!) pointed out that the '\r\n' is added twice there, which (if I am still seeing straight) it shouldn't be. I've corrected this (in v1.13), and it still works for me. It would be great if it did for you, too. However, by the QUIT command we shouldn't be in that block, as far as I can figure. So there's a deeper problem, which I'll keep working on. > I feel pretty strongly that the 500 response is coming from Norton. I was beginning to wonder, except for these two new people that have reported it - at least one of them isn't using Norton. Thanks, Tony From rmalayter at bai.org Mon Sep 1 03:20:42 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Sep 1 03:21:22 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D3D7@cliff.bai.org> Image-only spams seem to be the only things that really give my trained spambayes 007 plugin trouble. Many of these have white-on-white garbage text designed to fool simple "cumulative weighting" filters (which they do very well). Spambayes seems to have trouble with them because they have so little information, and with spoofed senders and ever-changing domains, there is not much in the headers or URLs to score with statistical significance. When looking at the score reports for various image-only spams, I see that tokes url:gif and url:jpeg get high scores. But I noticed that " tokens. Only segments of the URL itself show up. Spammers often seem to do quite a bit of work to separate the IMG and SRC tags, so it might take little extra smarts in the tokenizer to make sure it gets done right. It would seem to me that these IMG and COLOR tokens would be a fairly strong spam indicators, at least with my corpus. I think they might provide more information to incriminate the messages that have little statistically to score anyway. Has anyone tried these as special tokens? If so, what were the results? Thanks for your help, -ryan- From jason.ahrens at rogers.com Mon Sep 1 10:34:54 2003 From: jason.ahrens at rogers.com (Jason Ahrens) Date: Mon Sep 1 09:35:08 2003 Subject: [Spambayes] Procmail training methods (Was: Re: Initialization error) In-Reply-To: References: <20030831152939.NNVP367241.fep02-mail.bloor.is.net.cable.rogers.com@localhost> Message-ID: On Sun, 31 Aug 2003, Richie Hindle wrote: > http://mail.python.org/pipermail/spambayes-checkins/2003-August/001930.html > > for the patch. It works! At least, I got it initialized trained on my saved good mail and spam. I threw a couple of test messages at it and they came up as 'uncertain' but I'll let it go over some real mail now and see what happens. Thanks for the help! Some questions I do have involve the best way to keep it up to date when using procmail for filtering. Any hints from anyone who's already done this? Jason -- Bizarreness is the essence of the exotic. From edrubins at andisplace.com Mon Sep 1 11:17:05 2003 From: edrubins at andisplace.com (Ed Rubinsky) Date: Mon Sep 1 10:17:27 2003 Subject: [Spambayes] Re: SMTP Errors using Eudora In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130308C86F@its-xchg4.massey. ac.nz> Message-ID: <5.1.0.14.0.20030901100051.00a89590@localhost> > > Hmm. I misread the bug report. The removed '\r\n' wasn't there, it was > > a few lines higher (in the if self.inData == True block; it should be > > line 225 in v1.12). Cameron Stone (thanks!) pointed out that the '\r\n' > > is added twice there, which (if I am still seeing straight) it shouldn't > > be. I've corrected this (in v1.13), and it still works for me. It > > would be great if it did for you, too. Cameron Stone is absolutely right. The initial '\r\n' is added in line 219, before checking the request. If it's the end of the data ('.'). it's added again in line 225 - before the QUIT command is processed. Norton is apparently taking the empty '\r\n' as QUIT, and queueing the mail for delivery. It then sees the quit and returns the 500 error (or at least someone does.) The following snippet is from a trace I added to asynchat.init_send(). The numbers in parenthesis are the number of bytes sent. Notice the blank line after the '.' which ends the message, followed by the mail queued message followed by the quit. Without the extra '\r\n' that extra line is not there and neither is the 500 error. async_send: "data " (6) async_send: "354 Please start mail input. " (30) async_send: "From: someone@somewhere.com To: somebody@someplace.com Testing SMTP. . " (82) async_send: "250 Mail queued for delivery. " (31) async_send: "quit " (6) async_send: "500 Unrecognized command " (26) async_send: "221 Closing connection. Good bye. " (35) > > However, by the QUIT command we shouldn't be in that block, as far as I > > can figure. So there's a deeper problem, which I'll keep working on. It's not. > >> I feel pretty strongly that the 500 response is coming from Norton. > > > I was beginning to wonder, except for these two new people that have > > reported it - at least one of them isn't using Norton. Are they using another anti-virus program, or something else between the MUA and smtp server? Best, Ed From richie at entrian.com Mon Sep 1 18:50:48 2003 From: richie at entrian.com (Richie Hindle) Date: Mon Sep 1 12:50:53 2003 Subject: [Spambayes] Procmail training methods (Was: Re: Initialization error) In-Reply-To: References: <20030831152939.NNVP367241.fep02-mail.bloor.is.net.cable.rogers.com@localhost> Message-ID: [Jason] > Some questions I do have involve the best way to keep it up to date when > using procmail for filtering. proxytee.py acts as a procmail filter that doesn't change the message, but adds it to the web interface for later training. You need to be running pop3proxy.py to get the web interface, but you don't need to have any POP3 accounts configured. -- Richie Hindle richie@entrian.com From jswinden at csolve.net Mon Sep 1 16:07:30 2003 From: jswinden at csolve.net (John/Helen Swinden) Date: Mon Sep 1 15:08:58 2003 Subject: [Spambayes] A very basic, 101 question Message-ID: <007a01c370bc$4bb93f60$8d14d1d8@oemcomputer> I have Windows 98 and use Outlook Express 5. Is there any way I can use spamabyes without changing my email to Outlook 2000 (or newer version)? Best regards, John M. Swinden 2011 Concession 5, Adjala-Tosorontio R.R.#1, Loretto, Ontario, Canada, LOG 1L0 Telephone 905-936-4657 Fax 905-936-2843 From richie at entrian.com Mon Sep 1 23:05:17 2003 From: richie at entrian.com (Richie Hindle) Date: Mon Sep 1 17:05:24 2003 Subject: [Spambayes] A very basic, 101 question In-Reply-To: <007a01c370bc$4bb93f60$8d14d1d8@oemcomputer> References: <007a01c370bc$4bb93f60$8d14d1d8@oemcomputer> Message-ID: [John] > I have Windows 98 and use Outlook Express 5. > > Is there any way I can use spamabyes without changing my email > to Outlook 2000 (or newer version)? See: http://spambayes.sourceforge.net/faq.html#does-spambayes-work-with-outlook-express and the section "Minimal setup for using the POP3 proxy and web interface" in: http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambayes/INTEGRATION.txt?rev=HEAD&content-type=text/plain -- Richie Hindle richie@entrian.com From beckman at purplecow.com Mon Sep 1 19:43:36 2003 From: beckman at purplecow.com (Peter Beckman) Date: Mon Sep 1 18:45:02 2003 Subject: [Spambayes] X-Spambayes-Classification not appearing In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130308C311@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130308C311@its-xchg4.massey.ac.nz> Message-ID: <20030901184230.J21425@thermonuclear.org> On Fri, 29 Aug 2003, Meyer, Tony wrote: > > My .spambayesrc contains: > > [Headers] > > include_evidence: True > > include_score: True > > include_thermostat: True > > > > but I don't see anything in my headers. Using procmail; the > > X-SpamBayes-Classification is there, no other SB header. > > You're using hammiefilter, yes? At the moment only pop3proxy and > imapfilter can add those headers (or more accurately, any of the apps > that use the sbheader_message class in the message.py module). Ah. That's why. > OTOH, doesn't hammiefilter include the score in the classification > header? And there's a hammie option ([Hammie]debug_header) that adds > the evidence header (under the 'debug' name). I don't think there's any > way to get a thermostat header, though. OK, I'll try that out. > At some point hammiefilter probably will use the same class as the other > apps (it's one of the reasons it was created); it just hasn't happened > yet. Cool; I'll look for it. Thanks Tony. Beckman --------------------------------------------------------------------------- Peter Beckman Internet Guy beckman@purplecow.com http://www.purplecow.com/ --------------------------------------------------------------------------- From tim.one at comcast.net Mon Sep 1 20:48:22 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon Sep 1 19:48:54 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2AF3D3D7@cliff.bai.org> Message-ID: [Ryan Malayter] > Image-only spams seem to be the only things that really give my > trained spambayes 007 plugin trouble. "trouble" means what? That they're classified as ham, or that they're classified as unsure? > Many of these have white-on-white garbage text Then they're not image-only . > designed to fool simple "cumulative weighting" filters (which they do > very well). Spambayes seems to have trouble with them because they have > so little information, and with spoofed senders and ever-changing > domains, there is not much in the headers or URLs to score with > statistical significance. That would be consistent with trouble meaning "classified as unsure", but not with trouble meaning "classified as ham". > When looking at the score reports for various image-only spams, I see > that tokes url:gif and url:jpeg > get high scores. But I noticed that " indicate a hosted image link, is not represented as a special token. That's true. All HTML decorations are ignored unless/until someone adds code specifically looking for one. We've added several of those over the months, but still ignore "almost all" HTML decoration. > Nor any COLOR= > tokens. Also true. > Only segments of the URL itself show up. Spammers often seem to do > quite a bit of work to separate the IMG and SRC tags, so it might > take little extra smarts in the tokenizer to make sure it gets > done right. > > It would seem to me that these IMG and COLOR tokens would be a fairly > strong spam indicators, at least with my corpus. The other side to this is that *any* evidence of HTML is a strong spam indicator in most corpora. For example, you'll find that "

" is a strong spam indicator, if you make the tokenizer produce it. Ditto "
". That's because such a high percentage of spam uses HTML. Early testing showed that tokenizing all HTML decorations produced classifiers so overwhelmed by hundreds of correlated "it used HTML" clues that virtually nothing using HTML could avoid being classified as spam -- even msgs talking *about* HTML got classed as spam if they included an example. That's why spambayes backed off to ignoring (at first) all HTML decorations. I think it's fine to add back (and have added) specific HTML decorations that are usually unique to spam. > I think they might provide more information to incriminate the messages > that have little statistically to score anyway. Has anyone tried these as > special tokens? If so, what were the results? I haven't tried it, although I've often intended to add a COLOR token to see what happens. A problem is that I don't seem to get much spam of this sort, and the stuff of this kind I get is usually classified as spam anyway (my spam and ham cutoffs are 80 and 20, btw). Work up a patch and see what happens! From T.A.Meyer at massey.ac.nz Tue Sep 2 13:21:23 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 1 20:22:09 2003 Subject: [Spambayes] Brief talkthrough on running as a service on Win2k? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130308CA48@its-xchg4.massey.ac.nz> [Apologies for the delay!] > I'm running Win2k and use Outlook Express. So far I've been > happy with how it's working. Only problem I'm having is that > if I compose more than one message off line, then when I send > the messages, the SMTP only accepts one message and then > disconnects. I have to reconnect as many times as their are > messages. If there are too many, I get a warning from my > real SMTP server that there are too many open connections > from my address and that I should wait awhile. I'm not certain, but I think this may have now been fixed. I think this might have been happening because of the extra \r\n bug, which meant that the connection would disconnect. This might have lead to the too many open connections problem. Anyway, if you could try out 1.0a5 (due out Thursday) and let us know if it is fixed, that would be great. If it's not, then I'll try and figure out what is wrong for 1.0b1. =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 2 13:37:31 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 1 20:38:31 2003 Subject: [Spambayes] Re: SMTP Errors using Eudora Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130308CA65@its-xchg4.massey.ac.nz> [...] > Without the extra '\r\n' > that extra line is not there and neither is the 500 error. It's good to know that this does fix it (all that time, looking in the wrong place...oh well, at least I fixed some other things that needed fixing). Thanks for running the check and the other help trying to track it down. This will appear in 1.0a5, due out Thursday. =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 2 14:28:34 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 1 21:29:32 2003 Subject: [Spambayes] imapfilter.py -- 500 server error -- unexpectedstringor buffer Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130308CA8E@its-xchg4.massey.ac.nz> > > If you're willing to, you could insert the line: > > print fol, type(fol) > Thanks for the help! [...] ('(\\NoSelect) "/" {15}', '\\var\\messages\\tron') > Once I deleted \var\messages\tron, the complete folder > list was displayed with no errors (in the web interface), > although the first letters of all the items > were omitted (e.g., "news" became "ews"). It's been a while, but I've finally managed to get around to this. The problem was that your imap server was giving back the folder name as a literal (the {15} bit), and I hadn't accounted for this (stupid protocol). This should be fixed in 1.0a5 (due 04Sep03), which would mean that you could have the \var\messages\tron folder, and the first letters should be back where they belong. Thanks for the help figuring it out. =Tony Meyer From garry at zvolve.com Tue Sep 2 02:33:42 2003 From: garry at zvolve.com (Garry Williams) Date: Tue Sep 2 01:34:23 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: ; from tim.one@comcast.net on Mon, Sep 01, 2003 at 07:48:22PM -0400 References: <792DE28E91F6EA42B4663AE761C41C2AF3D3D7@cliff.bai.org> Message-ID: <20030902013342.A6310@zvolve.com> On Mon, Sep 01, 2003 at 19:48:22 -0400, Tim Peters wrote: > [Ryan Malayter] [snip] > > It would seem to me that these IMG and COLOR tokens would be a fairly > > strong spam indicators, at least with my corpus. > > The other side to this is that *any* evidence of HTML is a strong spam > indicator in most corpora. For example, you'll find that "

" is a strong > spam indicator, if you make the tokenizer produce it. Ditto "
". That's > because such a high percentage of spam uses HTML. Early testing showed that > tokenizing all HTML decorations produced classifiers so overwhelmed by > hundreds of correlated "it used HTML" clues that virtually nothing using > HTML could avoid being classified as spam -- even msgs talking *about* HTML > got classed as spam if they included an example. That's why spambayes > backed off to ignoring (at first) all HTML decorations. I think it's fine > to add back (and have added) specific HTML decorations that are usually > unique to spam. Yet, Paul Graham thinks[*] the URL in the IMG tag will correlate accurately because it points to the spammer's message. Why not tokenize the value of the link? [*] From http://www.paulgraham.com/sofar.html: Sending the spam as an image instead of text doesn't work either, because you need certain html tags to display an image, and these all end up having very high spam probabilities. Particularly the url. If you use a domain name and it's one that has shown up in spams before, you're dead. If you use an ip address instead, you're even deader. No tokens have higher spam probabilities than numbers in a url. -- Garry Williams, Zvolve Systems, Inc., +1 770 813-4934 From T.A.Meyer at massey.ac.nz Tue Sep 2 18:42:18 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 2 01:43:11 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E4EE0@its-xchg4.massey.ac.nz> > Yet, Paul Graham thinks[*] the URL in the IMG tag will > correlate accurately because it points to the spammer's > message. Why not tokenize the value of the link? Take a look at the clues for this message. You'll see: 'url:spambayes' 0.022956 27 5 And others like it. The same tokens get generated if it was a src element rather than a href one. =Tony Meyer From list2003 at fure.net Tue Sep 2 01:01:31 2003 From: list2003 at fure.net (Jan Fure) Date: Tue Sep 2 02:44:53 2003 Subject: [Spambayes] Spam which made it through the filter In-Reply-To: References: Message-ID: <3F54404B.3050909@fure.net> Hi; The message below made it through, and I probably will discard it for the sake of training the filter, as it contains so many "good" words. Have you seen more spam messages which contains "intellectual words" rather than childish disguises of whatever is being pushed? Jan Fure > accordingly bottled bengal mesenteric humidify brainwashes midshipman access > acrobacy bounteous ponders posterity terminates acclamation braggart tented > maturity tetrachloride cottony exasperation taking adoption poisoning sane hyena > math plots activity sated ichneumon $RANDO MIZE euphemist express 7th creditably > sans tameness tarantula theological breading teeter adonis bothersome eviction > excuse bartlett expunged boarder messenger hospitalized breakdowns creating > ichneumon scored countrymen ice creepy crossbars acclimatized teaming > positioning $RANDOM IZE blueprints port cremation terse admirals creak crossed > hydronium pokes expounder teleological midwife terminologies polled evidenced > messed saucer astoria expanded memo credence > > > > hysterectomy melon tangible bender identities testifies maze horselike > horsewoman hydrophobic evidences adduct explorer acquainted accomplishment > thatch pops bossed pogrom meekest benton posterior potentiating bergman > practicably hypocycloid body blurring annale break $RANDOM IZE melts illicitly > counterpointing micrography tempers scuttling materialized screeched telescoped > actors berra schemer sceptic accruing schoolmate tentative seasons hourglass > accuracies meddler pompano beltsville imbibe adsorb evidential australis > polluting tell brambly cowgirl $RANDOMI ZE assyria bluntly boeotian acquire > adrienne crest adumbrates adjutant baku exacerbate accountability tenet > RANDOMIZE sated hustling portions pledges adventuring crawled plush hydroelectric From harri.pesonen at wicom.com Tue Sep 2 11:02:16 2003 From: harri.pesonen at wicom.com (Harri Pesonen) Date: Tue Sep 2 03:02:20 2003 Subject: [Spambayes] Spam which made it through the filter Message-ID: One idea that came to my mind is that perhaps all url: token weights should be multiplied by 10 or something. Or by 100. So if you ever get spam from certain address, then that is going to get filtered the second time. Of course, spammers might get around this by adding good urls... :-) I believe that SpamBayes has to eventually dig into Received headers, and check the address where the spam originally came from (convert the ip address to host name perhaps). The message below has (in addition to $RANDOMIZE bug that spammers will fix) many words that I never get in any of my good e-mails. But the problem is that if the spammer has a huge dictionary, which is likely, then learning these does not do much good. Harri -----Original Message----- From: Jan Fure Subject: [Spambayes] Spam which made it through the filter Hi; The message below made it through, and I probably will discard it for the sake of training the filter, as it contains so many "good" words. Have you seen more spam messages which contains "intellectual words" rather than childish disguises of whatever is being pushed? Jan Fure > accordingly bottled bengal mesenteric humidify brainwashes midshipman access > acrobacy bounteous ponders posterity terminates acclamation braggart tented > maturity tetrachloride cottony exasperation taking adoption poisoning sane hyena > math plots activity sated ichneumon $RANDO MIZE euphemist express 7th creditably > sans tameness tarantula theological breading teeter adonis bothersome eviction > excuse bartlett expunged boarder messenger hospitalized breakdowns creating > ichneumon scored countrymen ice creepy crossbars acclimatized teaming > positioning $RANDOM IZE blueprints port cremation terse admirals creak crossed > hydronium pokes expounder teleological midwife terminologies polled evidenced > messed saucer astoria expanded memo credence > > > > hysterectomy melon tangible bender identities testifies maze horselike > horsewoman hydrophobic evidences adduct explorer acquainted accomplishment > thatch pops bossed pogrom meekest benton posterior potentiating bergman > practicably hypocycloid body blurring annale break $RANDOM IZE melts illicitly > counterpointing micrography tempers scuttling materialized screeched telescoped > actors berra schemer sceptic accruing schoolmate tentative seasons hourglass > accuracies meddler pompano beltsville imbibe adsorb evidential australis > polluting tell brambly cowgirl $RANDOMI ZE assyria bluntly boeotian acquire > adrienne crest adumbrates adjutant baku exacerbate accountability tenet > RANDOMIZE sated hustling portions pledges adventuring crawled plush hydroelectric From T.A.Meyer at massey.ac.nz Tue Sep 2 20:05:55 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 2 03:06:38 2003 Subject: [Spambayes] Spam which made it through the filter Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E4EF6@its-xchg4.massey.ac.nz> > I believe that SpamBayes has to eventually dig into Received > headers, and check the address where the spam originally came > from (convert the ip address to host name perhaps). The Received headers can be used now, although there isn't any lookup. (I'm still getting around to the results from the other lookup...) > The message below has (in addition to $RANDOMIZE bug that > spammers will fix) many words that I never get in any of my > good e-mails. But the problem is that if the spammer has a > huge dictionary, which is likely, then learning these does > not do much good. I think a message_length token would be more effective against this sort of mail. Or even something funky like the ratio of hapaxes to non-hapaxes (or to message length). I don't get enough of these (in fact, I haven't got any fn's yet) to be able to test this, though. =Tony Meyer From gary at inauspicious.org Tue Sep 2 12:17:07 2003 From: gary at inauspicious.org (Gary Benson) Date: Tue Sep 2 06:17:42 2003 Subject: [Spambayes] Load-limiting Message-ID: <20030902101707.GE2824@inauspicious.org> Hi all, After a rare evening with my firewall off, the amount of mail I'd built up was enough to bring my mailserver to its knees with hammiefilter processes. To stop it from happening again I've put a lock in my .procmailrc so that only one hammiefilter can run at one time [1], but I was wondering if SpamBayes has any more sophisticated load limiting built into it. Alternatively, has anyone considered writing a SpamAssassin-style client-server interface for SpamBayes? I was thinking, and rather than having a single daemon running as root, it would be nice to have something similar to what many GNOME applications do -- the first instance runs as normal, and subsequent instances hand off execution to the first. Saves all that messy suid stuff, but I have no idea how you'd do such a thing. A named pipe in your home directory? Cheers, Gary [ gary@inauspicious.org ][ GnuPG 85A8F78B ][ http://inauspicious.org/ ] [1] Oh, the recipe is this, if you're interested: :0 fw: $HOME/.hammiefilter.lock | hammiefilter From edrubins at andisplace.com Tue Sep 2 08:32:44 2003 From: edrubins at andisplace.com (Ed Rubinsky) Date: Tue Sep 2 07:32:53 2003 Subject: [Spambayes] Re: SMTP Errors using Eudora In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130308CA65@its-xchg4.massey. ac.nz> Message-ID: <5.1.0.14.0.20030902073109.00b25140@localhost> >[...] > > Thanks for running the check and the other help trying to > > track it down. Glad to be able to contribute in some small way. Besides, it was a great way to learn some real Python *8=>). Best, Ed From tim.one at comcast.net Tue Sep 2 11:27:01 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 2 10:27:38 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <20030902013342.A6310@zvolve.com> Message-ID: [Garry Williams] > Yet, Paul Graham thinks[*] the URL in the IMG tag will correlate > accurately because it points to the spammer's message. Why not > tokenize the value of the link? spambayes tokenizes all URLs already (whether in or out of IMG tags doesn't matter). From johnq at tigr.org Tue Sep 2 12:37:59 2003 From: johnq at tigr.org (Quackenbush, John) Date: Tue Sep 2 11:38:20 2003 Subject: [Spambayes] Reinstallation trouble Message-ID: To Whom It May Concern: I installed the SpamBayes Outlook AddIn a few weeks ago and was extremely happy with it. Then I ended up traveling and using a dial-in connection about the time our server got nailed by SoBig.F. The traffic was so heavy that the only way I could get Outlook to read anything was to disable SpamBayes. When I tried to reinitialize it, the menu was still visible, but it did not seem to do anything. I read the troubleshooting guide and tried everything listed there in an attempt to correct the problem. Nothing worked - the menu was still there, but it did not seem to function. When I returned to work, I hoped SpamBayes would work. It did not. So I tried to uninstall it, download a new copy, and reinstall it. When I did this, the SpamBayes menu bar failed to appear. I tried using all the troubleshooting tips you suggest, including deleting the outcmd.dat file, to no avail. I figured the problem might be that some of the old messages had spam scores set while others did not, tried cleaning up the mailbox, uninstalling and reinstalling SpamBayes. When the menu failed to appear, I went through all of the troubleshooting suggestions again, to no avail. Any other suggestions would be appreciated. JQ From rmalayter at bai.org Tue Sep 2 11:52:39 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Tue Sep 2 11:53:13 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D3D9@cliff.bai.org> From: Tim Peters [mailto:tim.one@comcast.net] > "trouble" means what? That they're classified > as ham, or that they're classified as unsure? Mostly they are classified as unsure, but I recall a couple classified as ham. I used to keep careful track of all this on a spreadsheet, but I seem to have gotten lazy with that, so I'm not sure I have all the data. > The other side to this is that *any* evidence of HTML > is a strong spam indicator in most corpora... virtually > nothing using HTML could avoid being classified as spam... This doesn't seem right to me, at least on an intuitive level. We're an Outlook 2003 shop, and we've used Windows Group Policies to force all internal users to create HTML messages instead of Microsoft RTF format. So a great big heaping pile of my non-spam corpus would be messages that contain


and other "innocent" HTML tags. Shouldn't the statistical nature of SpamBayes give these tokens something near 0.5 as a score, since they appear frequently in both corpora? > Work up a patch and see what happens! I just started playing with Python, so it might be a while before I can do that. I really like it at first glance, though. Are there any good IDEs that support Python, with code highlighting? Being a C++/VB guy as of late, is there a .NET-ified version of Python that I can plug into the Visual Studio 2003 IDE? I've seen the stuff on ActiveState's website, but it seems woefully out-of-date, since everything is dated 2001. From Mark.Gould at wellpoint.com Tue Sep 2 09:53:03 2003 From: Mark.Gould at wellpoint.com (Gould, Mark) Date: Tue Sep 2 11:53:39 2003 Subject: [Spambayes] spambayes Message-ID: I have Eudora 5.21 and would like to use spambayes to filter out my spam. Can this be done? If so, can you provide me with a detailed explanation as how to do this. Thanks, Mark From skip at pobox.com Tue Sep 2 12:11:24 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 2 12:11:39 2003 Subject: [Spambayes] Procmail training methods (Was: Re: Initialization error) In-Reply-To: References: <20030831152939.NNVP367241.fep02-mail.bloor.is.net.cable.rogers.com@localhost> Message-ID: <16212.49452.527268.929957@montanaro.dyndns.org> Jason> Some questions I do have involve the best way to keep it up to Jason> date when using procmail for filtering. Any hints from anyone Jason> who's already done this? In addition to proxytee.py, you might be able to teach your mail reader how to train incrementally, presuming it's sufficiently programmable. For example, if you happen to use VM within Emacs, take a look at question 2.2 of the FAQ. Skip From edrubins at mindspring.com Tue Sep 2 13:18:21 2003 From: edrubins at mindspring.com (Ed Rubinsky) Date: Tue Sep 2 12:18:28 2003 Subject: [Spambayes] Eudora 5.1 and spambayes In-Reply-To: Message-ID: <5.1.0.14.0.20030902121611.00ac24d8@localhost> > > I have Eudora 5.21 and would like to use spambayes to filter out my > > spam. Can this be done? If so, can you provide me with a detailed > > explanation as how to do this. See the FAQ at http://spambayes.sourceforge.net/faq.html#how-do-i-configure-eudora-for-use-with-spambayes Best, Ed From fuerte at sci.fi Tue Sep 2 20:24:08 2003 From: fuerte at sci.fi (Harri Pesonen) Date: Tue Sep 2 12:24:09 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: References: Message-ID: <3F54C428.70105@sci.fi> From: "Ryan Malayter" >From: Tim Peters [mailto:tim.one@comcast.net] > > >>Work up a patch and see what happens! >> >> > >I just started playing with Python, so it might be a while before I can >do that. I really like it at first glance, though. > >Are there any good IDEs that support Python, with code highlighting? >Being a C++/VB guy as of late, is there a .NET-ified version of Python >that I can plug into the Visual Studio 2003 IDE? I've seen the stuff on >ActiveState's website, but it seems woefully out-of-date, since >everything is dated 2001. > > I would like to know too. Crimson Editor has (limited) syntax highlighting, but that's it. Pythonwin is the best free IDE that I know. MultiEdit (http://www.multiedit.com/) has Python support, but it is $139. Harri From skip at pobox.com Tue Sep 2 12:25:58 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 2 12:26:10 2003 Subject: [Spambayes] spambayes In-Reply-To: References: Message-ID: <16212.50326.264994.601778@montanaro.dyndns.org> Mark> I have Eudora 5.21 and would like to use spambayes to filter out Mark> my spam. Can this be done? If so, can you provide me with a Mark> detailed explanation as how to do this. Sure. Use the pop3proxy.py program. As for details, start with the Spambayes website: http://spambayes.sf.net/ There are pages of documentation and application intros and a frequently asked questions list. If those don't get you going, holler back to the list. Skip From jroberg at astorchocolate.com Tue Sep 2 13:26:58 2003 From: jroberg at astorchocolate.com (Jay Roberg) Date: Tue Sep 2 12:29:38 2003 Subject: [Spambayes] Undo for spambayes Message-ID: I installed Spambayes on a Windows 2000 Pro SP3 running Office XP at work, and was very satisfied with the results of the filtering out of unwanted emails. Over the weekend, I installed Spambays on my home PC (Windows XP SP1 running Office XP SP1), which connects to my office LAN and mailbox on the Win 2000 Exchange server via a high speed VPN connection. After running the setup and setting up the folders, the filtering process moved all my mail from the Inbox and included subfolders to the SPAM MAYBE folder I created. Is their an undo process for this last action, as I will not be able to restore each email into the subfolder it came from. Thanks for your help, Jay. Jay Roberg Jay Roberg Help Desk & Tech Support Astor Chocolate Corp. (732) 901 1000 From rmalayter at bai.org Tue Sep 2 12:54:35 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Tue Sep 2 12:55:10 2003 Subject: [Spambayes] General configuration to change location of database Message-ID: <792DE28E91F6EA42B4663AE761C41C2A011150BF@cliff.bai.org> From: Harriger, Alka R. [mailto:arharriger@tech.purdue.edu] >...Regardless of what I place in the file or where >I install SpamBayes, it keeps going to C:\Documents and >Settings\myusername\Application Data\SpamBayes... I've just confirmed this problem with version 007 of the plug-in on my own system (Windows XP sp1, Outlook 2002 SP2), and have reported it as bug # 799245 on the Sourceforge site. Incidentally, Alka, what is your limit for roaming profiles at Purdue? We use 30 MB here, and have very little trouble. People still occasionally leave giant files on their desktops, but in general everyone knows not to do that sort of thing by now. Regards, Ryan Malayter Sr. Network & Database Administrator Bank Administration Institute Chicago, Illinois, USA PGP Key: http://www.malayter.com/pgp-public.txt ::::::::::::::::::::::::::::::: Twas a woman who drove me to drink. I never had the courtesy to thank her. -W.C. Fields From upsbaner at web.de Tue Sep 2 20:03:54 2003 From: upsbaner at web.de (Pawo Baner) Date: Tue Sep 2 13:11:50 2003 Subject: [Spambayes] eudora mail service Message-ID: <5.2.1.1.2.20030902190115.00b26958@pop3.web.de> Dear Spam Service, I am a Eudora and not a outlook user (with Window xp), can I apply your software in the same way or is there anything to be aware of ? Thanks for your answer. sincerely Pawo Pawo Baner Kronenmattenstra?e 4 D-79100 Freiburg Tel. +49 (0) 761 700 577 Mobile +49 (0) 179 299 3611 From skip at pobox.com Tue Sep 2 13:14:52 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 2 13:15:16 2003 Subject: [Spambayes] eudora mail service In-Reply-To: <5.2.1.1.2.20030902190115.00b26958@pop3.web.de> References: <5.2.1.1.2.20030902190115.00b26958@pop3.web.de> Message-ID: <16212.53260.918870.764923@montanaro.dyndns.org> Pawo> I am a Eudora and not a outlook user (with Window xp), can I apply Pawo> your software in the same way or is there anything to be aware of Pawo> ? Yes. Check out the pop3proxy.py program which is part of SpamBayes. There is a question in the faq about using SpamBayes with Eudora: http://spambayes.sf.net/faq.html Skip From mlmll at netcourrier.com Tue Sep 2 20:51:35 2003 From: mlmll at netcourrier.com (MLL) Date: Tue Sep 2 13:52:11 2003 Subject: [Spambayes] Spam not always moved (when filtering too ?) Message-ID: <003501c3717a$e5708050$fa00a8c0@MAISON01> Hi all, this is my first post to this list, First, thanks to the devs for the great app, very promising. Used to try spammunition, but spambayes is better at tracking spam. I've registered a bug on SF before discovering this mailing-list. I think it's more appropriate to post here actually, so here it goes : I've set a rule in Outlook to move automatically mail send to a given alias to, say "Inbox2". Inbox2 is a subfolder of "Inbox". I've told SpamBayes to filter "Inbox" and "Inbox2" and move the spam to the "Spam" folder. Unfortunately, it seems that, while the spam in "Inbox" is appropriately moved to "Spam", it doesn't always for "inbox2". Maybe spambayes kinda competes with the rules filtering system when the mail comes first to the inbox ? On a side note, SB conflicts with the use of netfolders (best info about the latter on http://www.slipstick.com/outlook/netfolders.htm), as seen in the attached debug log. Using SB with OL2000, on either Win2KSP4 or WinXPSP1. Thanks for any help. MLL -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 1307 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030902/f8ea0020/spambayes1.obj From jwillett at afgweb.com Tue Sep 2 12:52:51 2003 From: jwillett at afgweb.com (Jerry Willett) Date: Tue Sep 2 14:53:52 2003 Subject: [Spambayes] Spambayes Question Message-ID: Is there a way to view the database within spambayes to determine if an email accidentally got classified as spam but should not have been? Jerry Willett Chief Technology Officer Associated Financial Group jwillett@afgweb.com From skip at pobox.com Tue Sep 2 15:07:15 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 2 15:07:33 2003 Subject: [Spambayes] Spambayes Question In-Reply-To: References: Message-ID: <16212.60003.663719.765818@montanaro.dyndns.org> Jerry> Is there a way to view the database within spambayes to determine Jerry> if an email accidentally got classified as spam but should not Jerry> have been? Not just by looking at the database itself, since what the database contains is very far removed from the actual messages themselves. If you have mailboxes containing all the messages you used as training input, you can browse them. If you have the appropriate sort of environment, you may be able to speed up this process by scoring all your training messages and looking for scores which look like outliers (presumed ham messages which score very high, for example). Skip From georger at optonline.net Tue Sep 2 16:54:38 2003 From: georger at optonline.net (GMR) Date: Tue Sep 2 15:54:44 2003 Subject: [Spambayes] NEW ERRORS Message-ID: <000001c3718c$0b5277e0$640aa8c0@Gan.com> A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 510 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030902/9698cb91/spambayes1.obj From garth at deadlybloodyserious.com Wed Sep 3 08:44:23 2003 From: garth at deadlybloodyserious.com (Garth Kidd) Date: Tue Sep 2 17:45:41 2003 Subject: [Spambayes] Spam which made it through the filter In-Reply-To: <3F54404B.3050909@fure.net> Message-ID: <001901c3719b$78ef8ae0$8a01010a@gkiddxp2> > The message below made it through, and I probably will discard it for > the sake of training the filter, as it contains so many "good" words. Just recently, a lot of obviously spammy mail started sneaking through, particularly a batch of Nigeria variants. Have spammers started attacking Bayesian filters in earnest? Regards, Garth. From edrubins at mindspring.com Tue Sep 2 19:06:11 2003 From: edrubins at mindspring.com (Ed Rubinsky) Date: Tue Sep 2 18:06:19 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <5.1.0.14.0.20030902180604.00abda80@localhost> > >Are there any good IDEs that support Python, with code highlighting? > >Being a C++/VB guy as of late, is there a .NET-ified version of Python > >that I can plug into the Visual Studio 2003 IDE? I've seen the stuff on > >ActiveState's website, but it seems woefully out-of-date, since > >everything is dated 2001. > > > > >I would like to know too. Crimson Editor has (limited) syntax >highlighting, but that's it. Pythonwin is the best free IDE that I know. > >MultiEdit (http://www.multiedit.com/) has Python support, but it is $139. EMACS comes with a pyhon mode and costs $139 less than MultiEdit. It is available under Windows. Best, Ed From rmalayter at bai.org Tue Sep 2 18:17:08 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Tue Sep 2 18:17:42 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <792DE28E91F6EA42B4663AE761C41C2A011150DB@cliff.bai.org> From: Ed Rubinsky [mailto:edrubins@mindspring.com] > EMACS comes with a pyhon mode and costs $139 > less than MultiEdit. It is available under Windows. I'm one of the many who have used emacs while hating it (around 1992 on solaris), then grew to love it (circa 1996), then started to hate it again (circa 1999). I finally moved on because my company is a VB shop, and I love the IDE in Visual Studio. Maybe it's time to have another look at emacs, though I can't imagine giving up IntelliSense now. From beckman at purplecow.com Wed Sep 3 00:07:13 2003 From: beckman at purplecow.com (Peter Beckman) Date: Thu Sep 4 14:19:06 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A011150DB@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2A011150DB@cliff.bai.org> Message-ID: <20030902230516.T21425@thermonuclear.org> On Tue, 2 Sep 2003, Ryan Malayter wrote: > From: Ed Rubinsky [mailto:edrubins@mindspring.com] > > EMACS comes with a pyhon mode and costs $139 > > less than MultiEdit. It is available under Windows. > > I'm one of the many who have used emacs while hating it (around 1992 on > solaris), then grew to love it (circa 1996), then started to hate it > again (circa 1999). Vim rocks. I started using it in 1992 and haven't grown to hate it since. All sorts of contextual highlighting, available in windows, linux, freebsd, solaris, hell pretty much everything. Note I say VIM -- not vi. It's VI iMproved. Much better than normal vi. http://www.vim.org/ Beckman --------------------------------------------------------------------------- Peter Beckman Internet Guy beckman@purplecow.com http://www.purplecow.com/ --------------------------------------------------------------------------- From T.A.Meyer at massey.ac.nz Wed Sep 3 14:23:56 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 14:27:14 2003 Subject: [Spambayes] NEW ERRORS Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5113@its-xchg4.massey.ac.nz> [empty mail snipped ] I'm afraid we need a bit more information than that to try and help. (The log doesn't have any errors in it). Could you describe what is going wrong? =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 3 14:29:02 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 14:27:18 2003 Subject: [Spambayes] Load-limiting Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E511E@its-xchg4.massey.ac.nz> > Alternatively, has anyone considered writing a > SpamAssassin-style client-server interface for SpamBayes? I think (someone chime in if I'm wrong) that this is what hammiesrv.py and hammiecli.py do. Take a look at them and see if they do what you want. (Or can be adapted to do what you want). =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 3 14:37:34 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 14:27:24 2003 Subject: [Spambayes] Undo for spambayes Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E512F@its-xchg4.massey.ac.nz> > After running the setup and setting up the folders, the > filtering process moved all my mail from the Inbox and > included subfolders to the SPAM MAYBE folder I created. > > Is their an undo process for this last action, as I will not > be able to restore each email into the subfolder it came from. If you use the "recover from spam" button, it should move the mail from the 'maybe' folder back into the folder it originally came from, and also do the required training. =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 3 14:26:45 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 14:27:32 2003 Subject: [Spambayes] Spam not always moved (when filtering too ?) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5118@its-xchg4.massey.ac.nz> > Unfortunately, it seems that, while the spam in "Inbox" is > appropriately moved to "Spam", it doesn't always for > "inbox2". Maybe spambayes kinda competes with the rules > filtering system when the mail comes first to the inbox ? This is more-or-less what happens. It's up to Outlook when it tells spambayes that there is new mail, and when Outlook decides to do this depends on a whole host of things. Your best solution is to enable the 'timer'. This is an experimental feature in 007 (look at the configuration guide, which you get to via the 'about' button), but in the soon-to-be-release 008, it is just a regular feature (in other words, you don't have to be wary about the 'experimental nature' of it). =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 3 14:36:28 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 14:27:39 2003 Subject: [Spambayes] RE: [spambayes-bugs] Spambayes repeatedly classifies messages frommailing list as SPAM despite multiple (20+) recoveries fromspam folder Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E512C@its-xchg4.massey.ac.nz> > Sorry, I should have thought of that. I'm not really that > surprised that the message gets flagged as spam initially - > I'm just surprised that after a couple of weeks of "teaching" > that this mailing list message hasn't been "whitelisted." > > Spam Score: 0.833733 > 'url:mydomain' 0.00493094 17079 0 [...] > 'skip:g 10' 0.639491 176 1 > 'cannot' 0.643938 171 1 [...] Do you have really unbalanced numbers of ham & spam? For example, "cannot" is in 171 ham messages, but only 1 spam message - it really shouldn't get a score of 0.64. Spambayes works best trained with roughly equal numbers of ham & spam; we're still trying to come up with a good method of working with unbalanced training data. At the moment there is an option (defaults to 'on' in the Outlook plug-in) that adjusts the scores for unbalanced mail. It looks like this is what is happening here - because of the imbalance, a perfectly hammy word like "cannot" is getting a 0.64 score. Two suggestions: o Try disabling the experimental_ham_spam_imbalance_adjustment option. You can do this by changing the default_bayes_customize.ini file in your data folder (see the FAQ for the location). You don't need to retrain to see the effects of this, so have a look at that same message with the option off and see what the score is. Note that you might get more false-negatives, though. o Try retraining with roughly equal numbers of ham and spam (just take a random selection of the bigger collection) and see how that works. =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 3 14:54:29 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 14:28:34 2003 Subject: [Spambayes] General configuration to change location of database Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E514C@its-xchg4.massey.ac.nz> > From: Harriger, Alka R. [mailto:arharriger@tech.purdue.edu] > >...Regardless of what I place in the file or where > >I install SpamBayes, it keeps going to C:\Documents and > >Settings\myusername\Application Data\SpamBayes... > > I've just confirmed this problem with version 007 of the plug-in on my > own system (Windows XP sp1, Outlook 2002 SP2), and have reported it as > bug # 799245 on the Sourceforge site. In which location are you putting your default_configuration.ini file? This works perfectly for me using Windows XP sp1, Outlook 2002 SP2 and either current cvs or 007, if I put the default_configuration.ini file in the application data directory (and point the data directory at a network drive). I haven't tried putting it in the spambayes directory. =Tony Meyer From mlmll at netcourrier.com Wed Sep 3 09:45:42 2003 From: mlmll at netcourrier.com (MLL) Date: Thu Sep 4 14:29:34 2003 Subject: [Spambayes] Spam not always moved (when filtering too ?) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E5118@its-xchg4.massey.ac.nz> Message-ID: <000501c371e6$ff1ca530$fa00a8c0@MAISON01> > > Unfortunately, it seems that, while the spam in "Inbox" is > > appropriately moved to "Spam", it doesn't always for > > "inbox2". Maybe spambayes kinda competes with the rules > > filtering system when the mail comes first to the inbox ? > > This is more-or-less what happens. It's up to Outlook when it tells > spambayes that there is new mail, and when Outlook decides to do this > depends on a whole host of things. > > Your best solution is to enable the 'timer'. This is an experimental > feature in 007 (look at the configuration guide, which you get to via > the 'about' button), but in the soon-to-be-release 008, it is just a > regular feature (in other words, you don't have to be wary about the > 'experimental nature' of it). Thanks ! Just inserted : [Experimental] timer_start_delay:3000 timer_interval:2000 timer_only_receive_folders:False in the ini file. Let's see what happens... MLL -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2112 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030903/8011ea35/winmail.bin From wsy at merl.com Thu Sep 4 08:12:24 2003 From: wsy at merl.com (Bill Yerazunis) Date: Thu Sep 4 14:30:34 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2AF3D3D9@cliff.bai.org> (rmalayter@bai.org) References: <792DE28E91F6EA42B4663AE761C41C2AF3D3D9@cliff.bai.org> Message-ID: <200309041112.h84BCOH10377@localhost.localdomain> From: "Ryan Malayter" From: Tim Peters [mailto:tim.one@comcast.net] [...] > The other side to this is that *any* evidence of HTML > is a strong spam indicator in most corpora... virtually > nothing using HTML could avoid being classified as spam... This doesn't seem right to me, at least on an intuitive level. We're an Outlook 2003 shop, and we've used Windows Group Policies to force all internal users to create HTML messages instead of Microsoft RTF format. So a great big heaping pile of my non-spam corpus would be messages that contain


and other "innocent" HTML tags. Shouldn't the statistical nature of SpamBayes give these tokens something near 0.5 as a score, since they appear frequently in both corpora? No, my corpora agree with Tim Peters - spammers use HTML far more often than "normal" users. Statistically speaking, HTML mail is either from a spammer or from a clueless git, and in either case can usually be delayed without penalty or discarded outright. Similarly, base-64 encodes are almost _always_ trash. -Bill Yerazunis From fuerte at sci.fi Wed Sep 3 22:09:22 2003 From: fuerte at sci.fi (Harri Pesonen) Date: Thu Sep 4 14:32:01 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: References: Message-ID: <3F562E52.8070202@sci.fi> From: "Ryan Malayter" >From: Ed Rubinsky [mailto:edrubins@mindspring.com] > > >>EMACS comes with a pyhon mode and costs $139 >>less than MultiEdit. It is available under Windows. >> >> > >I'm one of the many who have used emacs while hating it (around 1992 on >solaris), then grew to love it (circa 1996), then started to hate it >again (circa 1999). > Hehe, that's a good one. :-) >I finally moved on because my company is a VB shop, and I love the IDE >in Visual Studio. Maybe it's time to have another look at emacs, though >I can't imagine giving up IntelliSense now. > > If emacs has a mode where all keys work like in Visual Studio, then it is an option. Pythonwin has limited intellisense, btw. Harri From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 12:07:01 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 14:57:48 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309041607.h84G70n09591@hoemail1.firewall.lucent.com> The e-mail message (file: details.pif) you sent to contains a virus (WORM_SOBIG.F). From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 11:59:20 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 14:59:12 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309041559.h84FxKh04458@hoemail1.firewall.lucent.com> The e-mail message (file: movie0045.pif) you sent to contains a virus (WORM_SOBIG.F). From thetasig at comcast.net Thu Sep 4 13:15:23 2003 From: thetasig at comcast.net (theta sigma) Date: Thu Sep 4 15:15:32 2003 Subject: [Spambayes] Eudora-Using Newbie Message-ID: <5.2.1.1.2.20030904120931.00b6e718@mail.comcast.net> Well, now I've read what seems to be all the FAQs and many of the archives in the forum list. The problem is that I do not understand how to do something that is commanded in the FAQ - and there doesn't seem to be an explanation of how to carry it out. Within the Eudora GUI I am supposed to: "Configure pop3proxy for each of Eudora's personalities' POP3 servers, specifying a separate port for each. For example 1110, 1120, 1130 and 1140 for four personalities. Do the same for smtpproxy - for example 1115, 1125, 1135 and 1145 corresponding to the four POP3 servers." But within the "properties" dialogue for personalities in Eudora there is no "slot" or "text box" or specific place to specify a port. It seems like such a simple instruction but I have no idea where to specify the port inside of Eudora 5.2. This, I think, is the main problem with my configuration errors. If I could know how to do this then I think I could make it all work. However... I tried to use some of my (very meager) background and within Eudora I specified the POP3 server as "localhost:1110" and the SMTP server as "localhost:1115" making the apparently incorrect assumption that all I needed to do was add the colon and a port number in a format that many servers seem to use. I ran the "setup.py install" command and that seemed to work as it ran a series of builds and other stuff in the command line window. Then it ran install scripts creating many .py files including the pop3proxy.py. Then I ran the "pop3proxy.py -b" and the web interface came up stating that I have POP3 proxy running on 1110, 1120, 1130 proxying mail to mail.comcast.net:110, etc., for the other 2 accounts. and the SMTP proxy running on 1115, 1125, 1135, etc. I have modified the eudora.ini file as instructed and the accounts are showing up with properties looking like this: POP3 server=localhost:1110, SMTP server=localhost:1115, and 2 other accounts likewise. However, Eudora complains: "Error getting network address for localhost:1110 - cause: host not found (11001)", etc. for all 3 accounts. Being somewhat of a neophyte I am not able to further debug this problem. If you can help me directly that's great, or if you can point me to an answer or additional information on "what to do next" after one has read [presumably] all the FAQs, etc. There may be other things I need to read but I'll be darned if I can find which ones and where. I'm stuck for now. Many Thanks! -=mark=- From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 15:24:31 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 15:24:40 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309041924.h84JOUe20220@hoemail1.firewall.lucent.com> The e-mail message (file: your_details.pif) you sent to contains a virus (WORM_SOBIG.F). From norton_antivirus_gateways at lcsnet.com Thu Sep 4 15:14:04 2003 From: norton_antivirus_gateways at lcsnet.com (norton_antivirus_gateways@lcsnet.com) Date: Thu Sep 4 15:31:33 2003 Subject: [Spambayes] Content violation Message-ID: Content violation found in email message. From: spambayes@python.org To: leskykim@lcsnet.com Subject: Re: Thank you! Matching Subject: re: thank you! From doug.cogswell at advizorsolutions.com Thu Sep 4 15:32:59 2003 From: doug.cogswell at advizorsolutions.com (doug.cogswell@advizorsolutions.com) Date: Thu Sep 4 15:33:11 2003 Subject: [Spambayes] RE: Questions Message-ID: <7D0EE54926BAD21195610090273C206F0330DEE9@VISMAIL1> One more . . . I can't seem to get the filtering to work. I've had SpamBayes up and trained for a day, and ran the "start filtering" yesterday. This morning I shut off my Outlook filter. Since then not one email had been filtered. So, just now I ran "start filtering" to hopefully clear out my inbox spam, and nothing cleared. How do I turn on the filtering? doug -----Original Message----- From: Cogswell, Doug Sent: Thursday, September 04, 2003 8:41 AM To: 'spambayes@python.org' Subject: Questions Just downloaded and installed the SpamBayes Outlook 2000 plug-in. Several questions: 1. Do I periodically need to "retrain" the system, or does it learn automatically in real time as I move messages in and out of my spam folder? 2. Can I periodically clean out the spam folder, or does the system need the history to maintain its scoring and learning? 3. When I disconnect from the network (I use a laptop), will the system catch the spam when I log back on? E.g., does it take a pass through the outlook inbox when i come back on the network? If so, what trigger this -- re-opening outlook, or just the fact that i come back on a network? Thanks, doug. From arharriger at tech.purdue.edu Tue Sep 2 22:03:04 2003 From: arharriger at tech.purdue.edu (Harriger, Alka R.) Date: Thu Sep 4 15:50:44 2003 Subject: [Spambayes] General configuration to change location of datab ase Message-ID: <843ADAD78A65D411BB5C00D0B71DEA411A0EAB75@sotcomm.tech.purdue.edu> I tried several variations, including placing a (new) default_configuration.ini file with a [General] section to define the data_directory location, but couldn't get it to work. I'm guessing that the syntax I used to specify the location of the data directory on a network drive was invalid, but with the available documentation, I couldn't figure it out. My current office system is Win 2000 & I use Outlook 2000. Alka Harriger -----Original Message----- From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] Sent: Tuesday, September 02, 2003 8:54 PM To: Ryan Malayter; spambayes@python.org Cc: Harriger, Alka R. Subject: RE: [Spambayes] General configuration to change location of database > From: Harriger, Alka R. [mailto:arharriger@tech.purdue.edu] > >...Regardless of what I place in the file or where > >I install SpamBayes, it keeps going to C:\Documents and > >Settings\myusername\Application Data\SpamBayes... > > I've just confirmed this problem with version 007 of the plug-in on my > own system (Windows XP sp1, Outlook 2002 SP2), and have reported it as > bug # 799245 on the Sourceforge site. In which location are you putting your default_configuration.ini file? This works perfectly for me using Windows XP sp1, Outlook 2002 SP2 and either current cvs or 007, if I put the default_configuration.ini file in the application data directory (and point the data directory at a network drive). I haven't tried putting it in the spambayes directory. =Tony Meyer From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 15:52:02 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 15:52:19 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309041952.h84JpvT08000@hoemail1.firewall.lucent.com> The e-mail message (file: movie0045.pif) you sent to contains a virus (WORM_SOBIG.F). From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 11:38:31 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 15:59:14 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309041538.h84FcUV22679@hoemail1.firewall.lucent.com> The e-mail message (file: your_document.pif) you sent to contains a virus (WORM_SOBIG.F). From mhammond at keypoint.com.au Thu Sep 4 09:48:23 2003 From: mhammond at keypoint.com.au (Mark Hammond) Date: Thu Sep 4 15:59:18 2003 Subject: [Spambayes] General configuration to change location of database In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A011150BF@cliff.bai.org> Message-ID: <332a01c3726d$7c1357b0$f502a8c0@eden> > I've just confirmed this problem with version 007 of the plug-in on my > own system (Windows XP sp1, Outlook 2002 SP2), and have reported it as > bug # 799245 on the Sourceforge site. > > Incidentally, Alka, what is your limit for roaming profiles at Purdue? > We use 30 MB here, and have very little trouble. People still > occasionally leave giant files on their desktops, but in general > everyone knows not to do that sort of thing by now. I just added the following comment to that bug: Note that the data should go in a file called "default_configuration.ini". I am fairly sure this works, as when it was first released a number of people had problems with the feature - eg, some people found we prevented spaces or $ as appearing in the filename. Also, SpamBayes had a bug in that if the default directory had an extended character, SpamBayes would also fail to work. A number of people reported a reasonable work around was to specify the data directory to be a name with only ascii characters. If you still believe this a bug, please give exact details of what you tried - the fully qualified name of the ini file you edited, the complete contents of that file, and the complete contents of the log. If you do this, please also reopen the bug. Mark -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2340 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030904/ac483a56/winmail.bin From wiseguyclothing at cox.net Thu Sep 4 00:07:06 2003 From: wiseguyclothing at cox.net (John Canepa) Date: Thu Sep 4 16:00:39 2003 Subject: [Spambayes] Cannot 'Delete As Spam'....Help Plz! Message-ID: I am using SpamBayes Version 0.7 with Windows XP. Everytime I hit 'Delete As Spam', it tells me that I must enable SpamBayes first. I thought it was automatically enabled. How do I get this to work and enable SpamBayes?? Thanx a million! -John From rmalayter at bai.org Wed Sep 3 12:16:23 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Thu Sep 4 16:13:47 2003 Subject: [Spambayes] General configuration to change location of database Message-ID: <792DE28E91F6EA42B4663AE761C41C2A011150E5@cliff.bai.org> From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] > In which location are you putting your default_configuration.ini file? C:\documents and settings\username\Application Data\Default_configuration.ini I get initialization errors. I also tried moving the [profilename].ini file to the data directory, which seems to be indicated by the documentation, but this didn't work either. I think the documentation is either unclear or incorrect. What exacty were the changes you made to from the default installation to get this to work? Nothing I've tried works at all, they all resulting in a "spambayes plug-in failed to initialize" error. From skip at pobox.com Thu Sep 4 16:17:56 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 4 16:18:06 2003 Subject: [Spambayes] Eudora-Using Newbie In-Reply-To: <5.2.1.1.2.20030904120931.00b6e718@mail.comcast.net> References: <5.2.1.1.2.20030904120931.00b6e718@mail.comcast.net> Message-ID: <16215.40436.45888.408621@montanaro.dyndns.org> theta> Within the Eudora GUI I am supposed to: theta> "Configure pop3proxy for each of Eudora's personalities' POP3 theta> servers, specifying a separate port for each. For example 1110, theta> 1120, 1130 and 1140 for four personalities. Do the same for theta> smtpproxy - for example 1115, 1125, 1135 and 1145 corresponding theta> to the four POP3 servers." This is done using a web browser; it's not embedded in Eudora. Pop3proxy should listen to port 8880 by default. Try visiting http://localhost:8880/ from a web browser. Skip From davea at sportingbet.com.au Fri Sep 5 06:51:45 2003 From: davea at sportingbet.com.au (Dave Alford) Date: Thu Sep 4 16:21:58 2003 Subject: [Spambayes] automated response Message-ID: <10309050551.AA135528536@sportingbet.com.au> Hi, I will be on holidays from Monday the 1st of September until the following Monday (8th of September). I will, however, be checking my e-mail periodically. If you need to contact me urgently, I can be reached on 0408-089-379. For all non-urgent matters, please contact Tom Rundle on (08) 8923-8046. Thanks, Dave Alford SysAdmin Sportingbet Australia From brian at brightrock.com Thu Sep 4 12:13:27 2003 From: brian at brightrock.com (Brian Schwarz) Date: Thu Sep 4 16:29:52 2003 Subject: [Spambayes] Re: [spambayes-bugs] Spambayes repeatedly classifies messages frommailing list as SPAM despite multiple (20+) recoveries fromspam folder References: <1ED4ECF91CDED24C8D012BCF2B034F13031E512C@its-xchg4.massey.ac.nz> Message-ID: <005701c37323$46a6ec20$1fd2f10a@amr.corp.intel.com> Meyer, Tony wrote: > Do you have really unbalanced numbers of ham & spam? For example, > "cannot" is in 171 ham messages, but only 1 spam message - it really > shouldn't get a score of 0.64. > > Spambayes works best trained with roughly equal numbers of ham & spam; > we're still trying to come up with a good method of working with > unbalanced training data. At the moment there is an option (defaults > to 'on' in the Outlook plug-in) that adjusts the scores for unbalanced > mail. It looks like this is what is happening here - because of the > imbalance, a perfectly hammy word like "cannot" is getting a 0.64 > score. OK, that makes sense. I have ~1000 ham and only ~100 spam messages. When I was doing the training, I assumed that more data was preferable, and I had a lot more stored examples of the good stuff. I'll try your suggestions. Even with that hiccup, the program has done a pretty good job out of the box. Thanks, Brian From rmalayter at bai.org Thu Sep 4 16:32:06 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Thu Sep 4 16:32:10 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D3E9@cliff.bai.org> From: Bill Yerazunis [mailto:wsy@merl.com] > No, my corpora agree with Tim Peters > - spammers use HTML far more often > than "normal" users. In my opinion, this is simply untrue, at least for the American corporate user. At most of the U.S.-based companies I've dealt with, HTML mail is widely used. I remember a Gartner survey that said something like 90% of corporate desktops have either Outlook/Exchange or Notes/Domino for messaging. Recent versions of both of these mail clients create HTML messages by default rather than plain text or some proprietary rich text format. I'd say 80% or more of business email I get - most of it from technical people - is HTML mail, simply because most people leave their Outlook or Notes mail client in its default configuration. > Statistically speaking, HTML mail is > either from a spammer or from a clueless > git, and in either case can usually be > delayed without penalty or discarded outright. As indicated above, I do not think this analysis is true anymore. And characterizing someone as a clueless git because they don't change their mail client's default message format or "love" plain text... Well, let us know when you get back to the real world. > Similarly, base-64 encodes are almost _always_ trash. I agree, except for in-line images sent with email newsletters and the like. Regards, Ryan Malayter Sr. Network & Database Administrator Bank Administration Institute Chicago, Illinois, USA PGP Key: http://www.malayter.com/pgp-public.txt From skip at pobox.com Thu Sep 4 16:53:32 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 4 16:53:43 2003 Subject: [Spambayes] Load-limiting In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E511E@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13031E511E@its-xchg4.massey.ac.nz> Message-ID: <16215.42572.721248.669848@montanaro.dyndns.org> >> Alternatively, has anyone considered writing a SpamAssassin-style >> client-server interface for SpamBayes? Tony> I think (someone chime in if I'm wrong) that this is what Tony> hammiesrv.py and hammiecli.py do. Take a look at them and see if Tony> they do what you want. (Or can be adapted to do what you want). Yeah, that's it, however, it's not nearly as big a win as spamd/spamc for a couple reasons. Python starts up faster once pyc's have been generated, and SpamBayes is significantly faster than SpamAssassin to start with, at least the last time I used SA (about a year ago) it was. Skip From edrubins at mindspring.com Thu Sep 4 17:56:45 2003 From: edrubins at mindspring.com (Ed Rubinsky) Date: Thu Sep 4 16:56:58 2003 Subject: [Spambayes] Re: Eudora-Using Newbie In-Reply-To: Message-ID: <5.1.0.14.0.20030904164225.00aa5a80@localhost> > >Within the Eudora GUI I am supposed to: > > > "Configure pop3proxy for each of Eudora's personalities' POP3 servers, > > specifying a separate port for each. For example 1110, 1120, 1130 and 1140 > > for four personalities. Do the same for smtpproxy - for example 1115, > 1125, > > 1135 and 1145 corresponding to the four POP3 servers." The spambayes part of the configuration is done via a web browser. As you found out, starting pop3proxy with the -b option starts the browser. Click on the configuration page link and fill in the appropriate information, then save it (sounds like you did.) > > But within the "properties" dialogue for personalities in Eudora there is > > no "slot" or "text box" or specific place to specify a port. It seems > like > >such a simple instruction but I have no idea where to specify the port >inside of Eudora 5.2. This, I think, is the main problem with my >configuration errors. If I could know how to do this then I think I could >make it all work. Right, You need to make the changes in eudora.ini. > > However... I tried to use some of my (very meager) background and within > > Eudora I specified the POP3 server as "localhost:1110" and the SMTP server > > as "localhost:1115" making the apparently incorrect assumption that all I > > needed to do was add the colon and a port number in a format that many > > servers seem to use. Right again. > > I ran the "setup.py install" command and that seemed to work as it ran a > > series of builds and other stuff in the command line window. Then it ran > > install scripts creating many .py files including the > pop3proxy.py. Then I > > ran the "pop3proxy.py -b" and the web interface came up stating that I > have > > POP3 proxy running on 1110, 1120, 1130 proxying mail to > > mail.comcast.net:110, etc., for the other 2 accounts. and the SMTP proxy > > running on 1115, 1125, 1135, etc. This is the page with the link to the configuration page, but it sounds as if you have pop3proxy and smtpproxy properly configured. > > I have modified the eudora.ini file as instructed and the accounts are > > showing up with properties looking like this: POP3 server=localhost:1110, > > SMTP server=localhost:1115, and 2 other accounts likewise. > > > > However, Eudora complains: > > "Error getting network address for localhost:1110 - cause: host not found > > (11001)", etc. for all 3 accounts. Try changing localhost to 127.0.0.1 > > Being somewhat of a neophyte I am not able to further debug this problem. > > > If you can help me directly that's great, or if you can point me to an > > answer or additional information on "what to do next" after one has read > > [presumably] all the FAQs, etc. There may be other things I need to read > > but I'll be darned if I can find which ones and where. I'm stuck for now. Yell if you have problems with these suggestions, if they don't fix your problems or if they cause new ones ("every solution breeds new problems - Murphy. :) ) > > Many Thanks! Glad to help - we are helping, aren't we *8=>). Best, Ed From mlmll at netcourrier.com Thu Sep 4 08:41:50 2003 From: mlmll at netcourrier.com (MLL) Date: Thu Sep 4 16:59:23 2003 Subject: [Spambayes] Spam not always moved (when filtering too ?) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E5118@its-xchg4.massey.ac.nz> Message-ID: <000201c372a7$3d799930$fa00a8c0@MAISON01> Resend - looks like 1st post didn't make it through to the list -------------------------------------- > > Unfortunately, it seems that, while the spam in "Inbox" is > > appropriately moved to "Spam", it doesn't always for > > "inbox2". Maybe spambayes kinda competes with the rules > > filtering system when the mail comes first to the inbox ? > > This is more-or-less what happens. It's up to Outlook when it tells > spambayes that there is new mail, and when Outlook decides to do this > depends on a whole host of things. > > Your best solution is to enable the 'timer'. This is an experimental > feature in 007 (look at the configuration guide, which you get to via > the 'about' button), but in the soon-to-be-release 008, it is just a > regular feature (in other words, you don't have to be wary about the > 'experimental nature' of it). Thanks ! Just inserted : [Experimental] timer_start_delay:3000 timer_interval:2000 timer_only_receive_folders:False in the ini file. Let's see what happens... MLL -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2172 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030904/c11d01d9/winmail.bin From doug.cogswell at advizorsolutions.com Thu Sep 4 09:40:51 2003 From: doug.cogswell at advizorsolutions.com (doug.cogswell@advizorsolutions.com) Date: Thu Sep 4 17:01:16 2003 Subject: [Spambayes] Questions Message-ID: <7D0EE54926BAD21195610090273C206F0330DEDC@VISMAIL1> Just downloaded and installed the SpamBayes Outlook 2000 plug-in. Several questions: 1. Do I periodically need to "retrain" the system, or does it learn automatically in real time as I move messages in and out of my spam folder? 2. Can I periodically clean out the spam folder, or does the system need the history to maintain its scoring and learning? 3. When I disconnect from the network (I use a laptop), will the system catch the spam when I log back on? E.g., does it take a pass through the outlook inbox when i come back on the network? If so, what trigger this -- re-opening outlook, or just the fact that i come back on a network? Thanks, doug. From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 17:03:29 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 17:04:00 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309042103.h84L3Sh22537@hoemail1.firewall.lucent.com> The e-mail message (file: document_9446.pif) you sent to contains a virus (WORM_SOBIG.F). From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 17:12:02 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 17:12:08 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309042112.h84LC2I28579@hoemail1.firewall.lucent.com> The e-mail message (file: movie0045.pif) you sent to contains a virus (WORM_SOBIG.F). From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 17:25:24 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 17:25:30 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309042125.h84LPKl07168@hoemail1.firewall.lucent.com> The e-mail message (file: document_all.pif) you sent to contains a virus (WORM_SOBIG.F). From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 15:01:50 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 17:29:28 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309041901.h84J1n405210@hoemail1.firewall.lucent.com> The e-mail message (file: thank_you.pif) you sent to contains a virus (WORM_SOBIG.F). From richie at entrian.com Thu Sep 4 23:50:16 2003 From: richie at entrian.com (Richie Hindle) Date: Thu Sep 4 17:50:53 2003 Subject: [Spambayes] Spambayes 1.0a5 released Message-ID: [Resending - apologies if this arrives twice, but I've been having email troubles] Spambayes 1.0a5 has been released. Here's WHAT_IS_NEW.txt: This file covers the major changes between each release. For more details, the reader is referred to the changelog (changelog.txt in the main directory of the archive), or for extreme details, to the check-ins archive (please see ) Changes are broken into sections for each application, plus one that will probably only interest developers, and one for everything else. Any actions necessary to move to this release from the previous release are noted in the "Transition" section. New in Alpha Release 5 ====================== -------------------------- ** Incompatible changes ** -------------------------- The values taken by some options have changed, so if you're upgrading from a previous version, you may need to update your configuration file (.spambayesrc or bayescustomize.ini) o allow_remote_connections now takes a list of allowed IP addresses, or the word 'localhost', or an asterisk (meaning all connections are accepted). o notate_to and notate_subject now take a comma-separated list of one or more of 'spam', 'ham' and 'unsure', allowing you to control which classes of message are notated. Outlook Plugin -------------- o Added a diagnostics dialog with functions to make it easier for users to help developers track down and fix bugs. o Added a 'timer' method of determining when to filter mail that should work better with Outlook's rule system. o Added a button on the Advanced tab of the dialog to display the SpamBayes data folder. o Moved "Filter Now" to an item on the drop down menu on the toolbar. o Items that can be filtered and trained include "IPM.Note" (normal messages) and "IPM.Anti-Virus*" (virus alerts by some software). o Changed the default filter action to "move" (instead of "untouched"). o Added a Wizard to assist with initial configuration (this will present itself when necessary). o Changed to allow filtering to be enabled, even if no training has been done. o Added a "New Folder" button to the folder selector dialog. o Massive changes to the dialog system (which should fix some problems), including changing the configuration dialog to a tabbed interface. o "Show Clues" now shows the percentage, as well as the raw score. o Added a "Help" menu to the drop down menu, with various information. o Added the ability to check for the latest version via an item on the drop down menu. o Hopefully, the "unread flag" issue is now fixed. o Fixed many problems with working on systems where English is not the default language, or where profile names have non-English characters. POP3 Proxy / SMTP Proxy / POP3 Proxy Service -------------------------------------------- o Fixed "assert hamcount <= nham" problem. o Starting and stopping the POP3 Proxy service (for Windows NT, Windows 2000 and Windows XP users) has been improved. Most noticeably, this means that the SMTP Proxy will start (if it is needed) as well. o Improve the "notate to" and "notate subject" options, so that ham and unsure messages can also be (optionally) notated in these fields. o Add the ability to skip caching messages that are over a (user configurable) size, so that you can keep the size of the cache directories smaller, once these messages are correctly classified. o Added the ability to skip caching messages that have a precedence of "bulk" (most mailing list messages), so that you can keep the size of the cache directories (and review list) smaller, once these messages are correctly classified. o Fixed the "ASCII decoding error" problem. o The SMTP proxy tries harder to pass on the command formatted exactly as it was given. This should make it more reliable. o Add the ability to have the SMTP proxy train on the message sent to it, rather than looking up the id in the cache (which is still possible, and generally the better option). o Removed the ability to add the SpamBayes identification number to the body of messages (it can still be added as a header). o The review messages page now puts unsure messages at the top. o The POP3 proxy should now work with fetchmail. o You can once again specify local addresses as well as ports for the POP3 proxy to listen on (was broken in 1.0a3 and 1.0a4). o A bug with the SMTP proxy that would show up in some cases as an "unrecognised command" error the mail client (particularly Eudora) was fixed. IMAP Filter ----------- o If you didn't use the -p switch to enter your password interactively, imapfilter would try and get it from the options, but if it wasn't there yet (because you hadn't done the setup yet), it would crash. This is now fixed. General ------- o Added the ability to store the SpamBayes database in a mySQL or postreSGL database table (currently supported by hammiefilter and the POP3 proxy). o Removed the ability to use the 'dumbdbm' as the storage method. (See the FAQ for reasons why). o We now allow the '@' and '=' characters in paths. o Added a simple n-way classifier using a cascade of binary SpamBayes classifiers. o Added version information to the web interface. o Fixed the yellow colour of the header boxes in the web interface. o Fixed restoring defaults from the web interface. o Added a missing line break in the status pane on the web interface when there are no proxies configured. o Prevent the "Show clues" links on the web interface's training page from word-wrapping and making all the table rows two lines high. o You can now put "*" at the end of a word in the "Word Query" box on the web interface, and have it show you the first ten words, and how many words there are in total, in the database that start with that word. o The web interface now supports HTTP-Auth. o Added a new script (code-named 'overkill.py') which enables 'drag and drop' training for POP3 users. This is currently still in the experimental stage, and anyone interested in trying it out should enquire on the SpamBayes mailing list (). Developer --------- o Created a directory for test suites, including a storage.py test. o An empty 'allowed values' now allows an empty string. o Add a get_option method, so an option instance itself can be fetched. o Support fetching the "latest" set of version data from the spambayes web site. Transition ========== If you are transitioning from a version older than 1.0a4, please also read the notes in the previous release notes (accessible from ). o If you were previously using the 'dumbdbm' storage method (you will have files called "hammie.db.dat", "hamie.db.dir" and "hammie.db.bak", rather than one file called "hammie.db"), then you will need to change to using either a pickle (please see the FAQ: ), bsddb, gdbm, or one of the new SQL based storage methods. The 'dumbdbm' storage method resulting in many databases being corrupted, and was never the best choice for storage, in any case. Although you can use the dbExpImp.py script to convert your database to your new storage system, we recommend that you retrain from scratch, as it is most likely that your database has been corrupted. o If you were using the options to notate the "To" or "Subject" headers with the message's classification, you will need to update your configuration file, as the format for these options have changed. o The ability to add the SpamBayes id to the message body has been removed, which means that Outlook Express users can no longer use the SMTP proxy and have it retrieve messages from the cache. These users can use the SMTP proxy by training on the forwarded message itself, but this is not recommended, as clues in the message will have changed (the "From" address will be yours, for example). At this time, you will have to use the web interface for training, although there is the possibility of 'drag and drop' training being added in a release in the near future. Reported Bugs Fixed =================== The following bugs tracked via the Sourceforge system were fixed: 776808, 795749, 787251, 790051, 743131, 779319, 785389, 786952, 788495, 790406, 788008, 787296, 788002, 780612, 784323, 784296, 780819, 780801, 779049, 765912, 777026, 777165, 693387, 690418, 719586, 769346, 761499, 769346, 773452, 765042, 760062, 768162, 768221, 797776, 797316, 796996 A url containing the details of these bugs can be made by appending the bug number to this url: http://sourceforge.net/tracker/index.php?func=detail&group_id=61702&atid=498103&aid= Feature Requests Added ====================== The following feature requests tracked via the Sourceforge system were added: 789916, 698036, 796832, 791319 A url containing the details of these feature requests can be made by appending the request number to this url: http://sourceforge.net/tracker/index.php?func=detail&group_id=61702&atid=498106&aid= Patches integrated =================== The following patches tracked via the Sourceforge system were integrated: 791254, 790615, 788001, 769981, 791393 A url containing the details of these feature requests can be made by appending the request number to this url: http://sourceforge.net/tracker/index.php?func=detail&group_id=61702&atid=498105&aid= -- Richie Hindle richie@entrian.com From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 12:06:03 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 17:54:52 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309041606.h84G62X09020@hoemail1.firewall.lucent.com> The e-mail message (file: document_9446.pif) you sent to contains a virus (WORM_SOBIG.F). From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 12:08:48 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 18:07:12 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309041608.h84G8ld10749@hoemail1.firewall.lucent.com> The e-mail message (file: document_all.pif) you sent to contains a virus (WORM_SOBIG.F). From rob at ce1.com Thu Sep 4 16:32:02 2003 From: rob at ce1.com (Rob Anderson) Date: Thu Sep 4 18:31:57 2003 Subject: [Spambayes] TechTV Recommends SpamBayes over SpamAssassin, etc. Message-ID: http://www.techtv.com/screensavers/products/story/0,24330,3515016,00.html This article is how I found your software. Currently running on 3 winboxes in our small office. Great program, thank you! From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 18:50:51 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 18:50:58 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309042250.h84Mops20915@hoemail1.firewall.lucent.com> The e-mail message (file: movie0045.pif) you sent to contains a virus (WORM_SOBIG.F). From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 18:52:30 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 18:52:41 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309042252.h84MqUt21719@hoemail1.firewall.lucent.com> The e-mail message (file: document_all.pif) you sent to contains a virus (WORM_SOBIG.F). From T.A.Meyer at massey.ac.nz Fri Sep 5 12:04:34 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 19:04:57 2003 Subject: [Spambayes] Questions Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E573A@its-xchg4.massey.ac.nz> > 1. Do I periodically need to "retrain" the system, or does > it learn automatically in real time as I move messages in and > out of my spam folder? If you check the "Train a message as ... when it is moved ..." boxes in the main dialog, then it will train messages for you as you move them into folders that you have specified are ham/spam. If you haven't checked this, then you'll need to retrain (although you don't have to retrain from scratch). > 2. Can I periodically clean out the > spam folder, or does the system need the history to maintain > its scoring and learning? See FAQ 3.6 and 4.6: > 3. When I disconnect from the > network (I use a laptop), will the system catch the spam when > I log back on? E.g., does it take a pass through the outlook > inbox when i come back on the network? If so, what trigger > this -- re-opening outlook, or just the fact that i come back > on a network? When you re-open Outlook it checks to see if there are any messages that it hasn't scored yet (in the folders that it is watching) and scores them. So yes it will, and when you re-open Outlook. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 5 12:05:50 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 19:05:58 2003 Subject: [Spambayes] RE: Questions Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E573C@its-xchg4.massey.ac.nz> > I've had SpamBayes up and trained for a day, and ran the > "start filtering" yesterday. This morning I shut off my > Outlook filter. Since then not one email had been filtered. > So, just now I ran "start filtering" to hopefully clear out > my inbox spam, and nothing cleared. > > How do I turn on the filtering? If you add the "spam" column to your view, are the messages being scored? If so, are they scored above the threshold for movement? Have you set up the rule (in the spambayes dialog) to move messages, and told them where to move to?). =Tony Meyer From Ben_Dulaney at RGCResources.com Thu Sep 4 14:30:51 2003 From: Ben_Dulaney at RGCResources.com (Dulaney, Ben) Date: Thu Sep 4 19:09:52 2003 Subject: [Spambayes] Error encountered Message-ID: I am using Outlook plug-in release 007 on Windows XP Professional in Outlook XP. I have tested this on two different computers and have experienced the exact same problem both times. Initially I have the plug-in installed and happily filtering away. The problem arises when I install the Exchange 2000 Management tools on my PC so I can script some functions in ADSI for email accounts. These tools require the Windows 2000 Management tools to be loaded first (Win2003 tools on WinXP). After successful installation of the Exchange tools, I open Outlook and it crashes. The next time I open Outlook I get a message that the add-in 'SpamBayes' will be disabled due to the fact that it caused Outlook to crash. At this point if I try to re-enable the plug-in (through Help-About-Disabled Items) I get another message that an add-on is causing problems and Outlook can try to correct the problem without interfering with the application that caused it. If I ask it to correct it then it comes back and says that it couldn't. If I respond to not correct it, then there are no more messages. Either way, SpamBayes is completely disabled in Outlook now, and I cannot even get it to work again. I've tried removing and reinstalling SpamBayes and that does not help. I do not have the exact error messages as I cannot get SpamBayes to work again so that I can break it. Any help would be appreciated as this is a wonderful tool and I would love to continue my use of it. Sincerely, Ben Dulaney Systems Administrator RGC Resources From skip at pobox.com Thu Sep 4 17:11:16 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 4 19:34:18 2003 Subject: [Spambayes] Interesting note... Message-ID: <16215.43636.127904.357469@montanaro.dyndns.org> I found this virus response note sent to webmaster@python.org interesting: Thank you for emailing Abtivan Concepts. Currently this mail box is been spammed by other advertisers. We cannot recieved any email right now. Sorry for the inconvinence. Please send your emails to questions@abtivan.com. Again, we are sorry for this inconvinence and thanks for your understanding. One spammer spamming another? That's what I call poetic justice. ;-) Skip From joshgass at bellsouth.net Wed Sep 3 17:12:46 2003 From: joshgass at bellsouth.net (Josh Gass) Date: Thu Sep 4 19:39:54 2003 Subject: [Spambayes] Contribute to the SpamBates Team Message-ID: <000001c37257$c0a4c190$0a00a8c0@joshbedroom> Hey, I am an experienced webmaster, I just installed the filter and saw that any help would be appreciated. I didn't know if this qualifies so I am sending you an email. Some of my work is in the links below. XhardWare.Net http://www.xhardware.net/vv/index.html Thanks, Josh Gass From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 19:41:31 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 19:41:37 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309042341.h84NfVx14027@hoemail1.firewall.lucent.com> The e-mail message (file: thank_you.pif) you sent to contains a virus (WORM_SOBIG.F). From rglasx at hotmail.com Wed Sep 3 11:17:11 2003 From: rglasx at hotmail.com (Ra glas) Date: Thu Sep 4 20:02:14 2003 Subject: [Spambayes] IMAP filter failes (readonly issue) - HELP!!! :( Message-ID: Hi, I'm using IMAP as my main mail service, and was pointed to by a friend to SpamBayes as a good solution for killing all the annoying spam flying around lately. However, I've not been able to get the thing going since I get "imaplib.readonly: HamBox is not writable" error (complete trace below). I've used the very helpful info from "Setting up Server-Side Spam filtering for IMAP" (http://www.boost-consulting.com/writing/server-side.html), and it was a real treat - you should at least add a link to it in the FAQ. Still no-go :( I'm using a windows system and my mail server is Mdaemon. I've tried to close outlook while running the script, but it didn't change a thing. No one was using the server at the time of the test. I've installed the following packages: Python-2.3.exe win32all-155.exe (Python w32 add-ons) spambayes-1.0a4.zip I've setup bayescustomize.ini as follows: [imap] server: 127.0.0.1 username: rani password: filter_folders: Inbox unsure_folder: incoming-unsure spam_folder: incoming-spam ham_train_folders: HamBox spam_train_folders: SpamBox and here is what I got: C:\Program Files\python\spambayes-1.0a4>..\python.exe imapfilter.py -t -v SpamBayes IMAP Filter Alpha1, version 0.01 (May 2003), using SpamBayes IMAP Filter Web Interface Alpha1, version 0.01 and engine SpamBayes Beta2, version 0.2 (July 2003). Loading database hammie.db... Loading state from hammie.db database hammie.db is a new database Done. Training Traceback (most recent call last): File "imapfilter.py", line 776, in ? run() File "imapfilter.py", line 762, in run imap_filter.Train() File "imapfilter.py", line 590, in Train imap.SelectFolder(fol) File "imapfilter.py", line 231, in SelectFolder response = self.select(folder, False) File "C:\Program Files\python\lib\imaplib.py", line 616, in select raise self.readonly('%s is not writable' % mailbox) imaplib.readonly: HamBox is not writable Many thanks to all of you and keep up the *G R E A T* work!!! :) C Ya, Rani _________________________________________________________________ The new MSN 8: advanced junk mail protection and 2 months FREE* http://join.msn.com/?page=features/junkmail From mhammond at keypoint.com.au Fri Sep 5 11:19:02 2003 From: mhammond at keypoint.com.au (Mark Hammond) Date: Thu Sep 4 20:19:06 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2AF3D3E9@cliff.bai.org> Message-ID: <2a1601c37343$4f362f20$f502a8c0@eden> > From: Bill Yerazunis [mailto:wsy@merl.com] > > > No, my corpora agree with Tim Peters > > - spammers use HTML far more often > > than "normal" users. > > In my opinion, this is simply untrue, The problem is that there is no such thing as a "normal" user when considering SpamBayes, and particularly the Outlook addin. Some "normal" users get the vast majority of their ham as html. I know lots personally. Some "normal" users get more plain text - but I don't hang out with many of them in real life! > > Statistically speaking, HTML mail is > > either from a spammer or from a clueless > > git, and in either case can usually be > > delayed without penalty or discarded outright. > > As indicated above, I do not think this analysis is true anymore. And > characterizing someone as a clueless git because they don't > change their > mail client's default message format or "love" plain text... Well, let > us know when you get back to the real world. I agree 100%. I fear we are starting to sound condescending - such labels and telling users "trust us, you really don't want that feature" doesn't help anyone. > > Similarly, base-64 encodes are almost _always_ trash. > > I agree, except for in-line images sent with email newsletters and the > like. The problem is that we simply don't know. One man's trash is another's treasure. We clearly need more research, but we have to be careful not to base too many assumptions on testing the mail of us geeks Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2496 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030905/9ca2c19c/winmail-0001.bin From edrubins at mindspring.com Thu Sep 4 17:02:09 2003 From: edrubins at mindspring.com (Ed Rubinsky) Date: Thu Sep 4 20:25:21 2003 Subject: [Spambayes] Re: Editor ad naseum (was something to do with image only spams.) In-Reply-To: Message-ID: <5.1.0.14.0.20030904155258.00ac2b40@localhost> > >>I finally moved on because my company is a VB shop, and I love the IDE > >>in Visual Studio. Maybe it's time to have another look at emacs, though > >>I can't imagine giving up IntelliSense now. > >> > >> > >If emacs has a mode where all keys work like in Visual Studio, then it > >is an option. Pythonwin has limited intellisense, btw. I think we've gotten a little off-topic *8=>). emacs does emulate some other editors out of the box - Brief is the emulation I use. There is a PC selection mode for a CUA interface, which might be a start toward Visual Studio. Rebinding keys is not that difficult. And now back to our regularly scheduled discussion - any one remember what it was? Best, Ed From T.A.Meyer at massey.ac.nz Fri Sep 5 13:27:39 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 20:28:03 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E57A5@its-xchg4.massey.ac.nz> > The problem is that we simply don't know. One man's trash is > another's treasure. We clearly need more research, but we > have to be careful not to base too many assumptions on > testing the mail of us geeks This is where it would be great if some of the people that have volunteered to help (or some that haven't yet) would be willing to run tests on various options. They don't have to do any coding, just get the testing setup ready and run a couple of scripts. The process is reasonably straightforward already, but if people have trouble, it wouldn't be that hard to make a GUI for it. =Tony Meyer From rmalayter at bai.org Thu Sep 4 20:29:05 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Thu Sep 4 20:29:13 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D3EC@cliff.bai.org> From: Tim Peters [mailto:tim.one@comcast.net] >> At most of the U.S.-based companies I've dealt with, HTML mail is >> widely used. > What percentage of them have you dealt with? I haven't bumped into my first > such yet. I've dealt with a very small percentage, of course. But enough for my opinion to be statistically valid, I think. Our company exchanges mail with the largest US banks and technology vendors for the financial services industry (HP, IBM, NCR, etc.). A very large percentage of their mail seems to originate with Outlook/Exchange, and the rest seems to come from Notes/Domino shops. The only "pure SMTP" mail I get seems to come from academic institutions or staff at our ISP. >> I'd say 80% or more of business email I get - most of it from >> technical people - is HTML mail, simply because most people leave >> their Outlook or Notes mail client in its default configuration. > I guess our technical contacts don't overlap, > then. The only HTML email I ever get is from > spammers... The vast majority of "technical" people I correspond with are implementation consultants and programmers. These people don't often subscribe to mailing lists, don't think much about spam or system administration, nor do they seem to care much about security. They just want to be able to use italics and bold text in their email. > Anyway, spambayes does a ton of work already to > avoid penalizing HTML email just for using HTML. > If you've got reason to suspect that isn't working > as hoped for in your data, that's something we should > dig into. No, I'm seeing a 96% capture rate, with no false positives, which isn't too shabby. But I've heard of better. I was just guessing if all HTML tags were tokenized, I would see better performance against image-only spams (which seem to get though my filter more than anything else). But as you mentioned, the "everything is a token" approach hasn't worked all that well with SpamBayes and HTML. Perhaps it would fare much better with a "sliding window" tokenizer such as that used by CRM-114. I'm betting CRM-114 would hammer "img src http" and other image-related tag sequences with my corpus, while leaving

,
and the like alone. Has anyone looked at implementing a sparse windowing system similar to CRM-114's in SpamBayes? Intuitively, it seems like this would do much better with HTML tags as well as mail header information, at the expense of CPU and DB storage. Regards, -Ryan- From tim.one at comcast.net Thu Sep 4 19:43:32 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Sep 4 20:40:19 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2AF3D3E9@cliff.bai.org> Message-ID: The damage to email services done by the Sobig worm is making discussion just about impossible; I'll try to paste together several msgs I've seen in this thread, although I may well be missing some. [Tim Peters] >>>> The other side to this is that *any* evidence of HTML >>>> is a strong spam indicator in most corpora... virtually ?>>> nothing using HTML could avoid being classified as spam... [Ryan Malayter] >>> This doesn't seem right to me, at least on an intuitive level. >>> We're an Outlook 2003 shop, and we've used Windows Group Policies >>> to force all internal users to create HTML messages instead of >>> Microsoft RTF format. That's why I said "most corpora" -- you're definitely an exception on this mailing list. In fact, you're the only person to date who has owned up to having a high number of HTML ham! That's OK, though -- spambayes doesn't penalize HTML just for being HTML. Many other systems do. >>> So a great big heaping pile of my non-spam corpus would be messages >>> that contain


and other "innocent" HTML tags. Shouldn't the >>> statistical nature of SpamBayes give these tokens something near >>> 0.5 as a score, since they appear frequently in both corpora? Yes, *if* spambayes produced such tokens. But it doesn't -- spambayes strips almost all evidence of HTML. Else for the (so far) vast majority of people who get very little HTML ham, their HTML ham would have virtually no chance of getting correctly classified as ham. [Bill Yerazunis] >> No, my corpora agree with Tim Peters - spammers use HTML far more >> often than "normal" users. [Ryan] > In my opinion, this is simply untrue, at least for the American > corporate user. Your opinion about Bill's corpora is irrelevant -- I have no doubt that Bill characterizes the data he's seen correctly. Or that you characterize yours correctly. You two simply have very different kinds of data (and I suspect, but don't know, that the kinds of data Bill sees are much closer to the kinds most spambayes users to date see). >> Statistically speaking, HTML mail is either from a spammer or from >> a clueless git, and in either case can usually be delayed without >> penalty or discarded outright. The spambayes project has never agreed with that, largely because both my sisters delight in creating elaborate HTML email, and I really enjoy it . I'm also active (more off than on lately, alas) trying to help newbies with various tech issues, and-- like my sisters --they naturally gravitate toward fancy email features. I've come to believe that we 7-bit-ASCII-slinging, plain-text, Courier-loving, "email is for the exchange of information" types are destined to become an insignificant minority of email users. The only thing holding back our extinction is that, so far, we still run everything . >> Similarly, base-64 encodes are almost _always_ trash. In my data too, although I'd switch the emphasis to _almost_. > At most of the U.S.-based companies I've dealt with, HTML mail is > widely used. What percentage of them have you dealt with? I haven't bumped into my first such yet. > I remember a Gartner survey that said something like 90% of corporate > desktops have either Outlook/Exchange or Notes/Domino for messaging. > Recent versions of both of these mail clients create HTML messages by > default rather than plain text or some proprietary rich text format. Cool! So things are changing. > I'd say 80% or more of business email I get - most of it from > technical people - is HTML mail, simply because most people leave > their Outlook or Notes mail client in its default configuration. I guess our technical contacts don't overlap, then. The only HTML email I ever get is from spammers, non-personal marketing collateral from companies I do business with, corporate newsletters, relatives, and newbies. I get about 600 emails a day, more of it from non-newbies asking tech questions than I could possibly reply to even if that were my full-time job, and they're never in HTML format. Anyway, spambayes does a ton of work already to avoid penalizing HTML email just for using HTML. If you've got reason to suspect that isn't working as hoped for in your data, that's something we should dig into. The kinds of HTML ham I do get are classified as Unsure more often than plain-text read-alikes, but not really because they use HTML -- it's almost always because they embed URLs pointing to .jpg or .gif (etc) files out on the net, and those file extensions are tokenized, and turn out to be spam clues in my data. It takes "more hammy content than usual" to overcome that. From mhammond at keypoint.com.au Thu Sep 4 09:48:55 2003 From: mhammond at keypoint.com.au (Mark Hammond) Date: Thu Sep 4 20:49:19 2003 Subject: [Spambayes] Undo for spambayes In-Reply-To: Message-ID: <332e01c3726d$8e5458c0$f502a8c0@eden> > Is their an undo process for this last action, as I will not > be able to restore each email into the subfolder it came from. No, there isn't. It would be a nice feature though :) Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1540 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030904/42c67842/winmail.bin From ta-meyer at ihug.co.nz Fri Sep 5 14:05:58 2003 From: ta-meyer at ihug.co.nz (Tony Meyer) Date: Thu Sep 4 21:06:20 2003 Subject: [Spambayes] Big changes to cvs Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212AE1B@its-xchg4.massey.ac.nz> If you don't use spambayes from cvs, you can ignore this. The major changes discussed on spambayes-dev for the 1.0a6 have been started. For those who haven't read that thread, this includes renaming and moving the majority of the scripts (just about everything in the root directory), and removing backwards compatibility code from the configuration file reading. This means that your system *will* break when you upgrade from cvs. At the very least, you'll need to update your system to use the renamed scripts. At the moment, if you use "setup.py install", all the old scripts will still be there. We'll get something together soonish to remove them. Hopefully, removing the backwards compatibility code from the options won't break of the scripts. I can't test everything, though, so I'll only be 'certain' about sb-imapfilter, sb-server (ex pop3proxy), sb-smtpproxy, timtest and timcv working. If people could check the scripts they use, that would be great. If you are using old names for options in your configuration script, they won't work any more (they have been deprecated for some time now). There is a script in 1.0a5 that will convert them for you, or you can do it by hand. If things break too bad for you, please use the 1.0a5 release (or cvs with the 1.0a5 release tag). That's why the releases are so close together. Thanks for the help! =Tony Meyer From B-Morgan at concentric.net Thu Sep 4 20:43:07 2003 From: B-Morgan at concentric.net (Brad Morgan) Date: Thu Sep 4 21:42:57 2003 Subject: [Spambayes] Need suggestions an email package (slightly OT) Message-ID: <000501c3734f$11a06200$0400a8c0@bradmorgan> I've been using Spambayes and Outlook to download my email from my ISP POP3 mailbox. I just got back from a 3 week business trip overseas and discovered that my ISP had deleted my email because I exceeded their quota. If I leave Outlook running when I'm not around, the first virus that Norton finds puts a dialog box on the screen and that's it for downloading email until I close the dialog box. To solve this problem, I'd like to install an IMAP server on my Linux firewall and download my POP3 email to this server. I can then use Outlook's (and Spambayes') IMAP support to read my email both from home and when I travel. An added bonus would be a web interface. Can anyone here suggest a package (or two) that would solve my problem? Thanks, Brad Morgan From alaina at raqia.com Thu Sep 4 22:50:44 2003 From: alaina at raqia.com (Alain Arnaud) Date: Thu Sep 4 21:44:13 2003 Subject: [Spambayes] Eudora Message-ID: <5.1.0.14.0.20030904214919.00a9f3b8@pop3.theworld.com> I use Eudora on a Win2k. Can i use the executable or the pop3proxy and edit the eudora ini file? Alain From no-reply at hoemail1.firewall.lucent.com Thu Sep 4 14:13:57 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Thu Sep 4 22:17:16 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309041813.h84IDvT00065@hoemail1.firewall.lucent.com> The e-mail message (file: your_details.pif) you sent to contains a virus (WORM_SOBIG.F). From dat at neoucom.edu Wed Sep 3 09:21:43 2003 From: dat at neoucom.edu (Debbie Tummel) Date: Thu Sep 4 23:18:36 2003 Subject: [Spambayes] Enable Filtering Message-ID: I have just installed Spambayes. I have it trained and pointed to the folders to filter, but when i select the "Delete as Spam" button, it tells me it has to be enabled. I can't find where to enable it at. If it's the filtering button on the manager, it's dimmed out and I don't know how to fix that. Thank you debbie ******************* Deborah A. Tummel Program Secretary Office of Continuing Medical Education NE Ohio Universities College of Medicine P.O. Box 95 Rootstown, OH 44272 Phone: 330-325-6575 Fax: 330-325-5929 E-mail: dat@neoucom.edu From ta-meyer at ihug.co.nz Fri Sep 5 16:21:07 2003 From: ta-meyer at ihug.co.nz (Tony Meyer) Date: Thu Sep 4 23:21:15 2003 Subject: [Spambayes] Feature Freeze Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212AE1F@its-xchg4.massey.ac.nz> I figure we should probably announce this ;) Apologies for cross-posting this and the previous message. As of now, we're not adding any new features to spambayes [1]. The reason for this is to iron out any remaining bugs, without possibly adding more, so that we can release 1.0b1. If you come up with feature you would like added before the freeze is over (a couple of weeks, probably), then the best idea is to open a patch or feature request on sourceforge (http://sourceforge.net/projects/spambayes) so that it doesn't get lost. It will be dealt with at some point, but please be patient. (Developers: I guess if we want we could open a 1.1a1 branch if anyone wants to add new features? You lot are more clued than I am about cvs) In addition, it would be great if everyone can test out 1.0a5 and 1.0a6 (due very soon) and let us know about any bugs that you find. It would be even better if you could check the existing bug trackers to see if it has already been reported! Thanks! =Tony Meyer [1] This excludes the Outlook plug-in (Mark is his own law ) and building the new Windows binary. Both of these are self-contained and don't affect anything else. From T.A.Meyer at massey.ac.nz Fri Sep 5 16:24:41 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 23:24:48 2003 Subject: [Spambayes] Feature Freeze Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E589E@its-xchg4.massey.ac.nz> > As of now, we're not adding any new features to spambayes This sounds a bit unclear. In case I confused anyone. We're not adding any new features to spambayes *for the next few weeks*. :) =Tony Meyer From fuerte at sci.fi Fri Sep 5 07:07:56 2003 From: fuerte at sci.fi (Harri Pesonen) Date: Thu Sep 4 23:25:34 2003 Subject: [Spambayes] Spambayes 1.0a5 comments In-Reply-To: References: Message-ID: <3F57FE0C.6020305@sci.fi> First, thanks! :-) I tested the new word query wild card option, works great. Except that it would be better to have a tabular view, it would take much less space. Also we could have an option for the maximum number of hits on the query page (drop down of 10, 100, 1000 etc). The Tokenize button that I wished for didn't make it (it would be right to Classify, and output all tokens in a message). I wonder if there is a command line tool that shows all tokens for a message? I would like to know how the advanced option "*Mine the received headers" *works. I don't have it enabled but perhaps should. Harri From mhammond at keypoint.com.au Thu Sep 4 09:42:23 2003 From: mhammond at keypoint.com.au (Mark Hammond) Date: Thu Sep 4 23:27:15 2003 Subject: [Spambayes] Spam not always moved (when filtering too ?) In-Reply-To: <003501c3717a$e5708050$fa00a8c0@MAISON01> Message-ID: <32df01c3726c$a4915c10$f502a8c0@eden> > Unfortunately, it seems that, while the spam in "Inbox" is > appropriately moved to "Spam", it doesn't always for > "inbox2". Maybe spambayes kinda competes with the rules > filtering system when the mail comes first to the inbox ? Please see the "configuration guide", and find info on how to enable the timer. This will be exposed in the user interface for version 008 and later. > On a side note, SB conflicts with the use of netfolders (best > info about the latter on > http://www.slipstick.com/outlook/netfolders.htm), as seen in > the attached debug log. The log you attached shows nothing out of the ordinary - what specific problems did you have? Mark. From tim.one at comcast.net Thu Sep 4 21:44:53 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Sep 4 23:27:30 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <2a1601c37343$4f362f20$f502a8c0@eden> Message-ID: [Mark Hammond] > Some "normal" users get the vast majority of their ham as html. I > know lots personally. Cool! I don't know any. How's the addin working for them? Everyone who has endured my comments in tokenizer.py knows I was obsessively determined not to penalize HTML msgs, but since I don't know anyone who gets a ton of HTML ham, the effectiveness of all that hasn't been tested by me. >>> Statistically speaking, HTML mail is >>> either from a spammer or from a clueless >>> git, and in either case can usually be >>> delayed without penalty or discarded outright. >> As indicated above, I do not think this analysis is true anymore. And >> characterizing someone as a clueless git because they don't change >> their mail client's default message format or "love" plain text... >> Well, let us know when you get back to the real world. > I agree 100%. I fear we are starting to sound condescending - such > labels and telling users "trust us, you really don't want that > feature" doesn't help anyone. Note that the "clueless git" comment was from Bill (CRM114's developer), not one of the loving spambayes folks . >>> Similarly, base-64 encodes are almost _always_ trash. >> I agree, except for in-line images sent with email newsletters and >> the like. > The problem is that we simply don't know. One man's trash is > another's treasure. We clearly need more research, but we have to > be careful not to base too many assumptions on testing the mail of > us geeks spambayes decodes base64 sections-- so long as they have a text/* type --and judges based on the decoded contents. Merely using base64 doesn't count for or against a msg in our code. If it's ill-formed base64, we synthesize a control: couldn't decode token and try harder to decode it anyway. Apart from that, the classifier has no way to know whether base64 was involved. The principle here (which may never have been formulated clearly) is that spambayes wants to score what the end user *sees*, not necessarily how it was coded. We don't stray from the principle often; the chief exceptions are the presence of HTML obfuscation tricks that have no conceivable justification except an intent to deceive spam filters. Even then, we *usually* just un-obfuscate and score what the end user sees anyway. The biggest trick we're missing here is not accounting for that "foreground color approximately equal to background color" hides text the end user *doesn't* see -- it would be spambayesian not to produce any tokens for that kind of invisible text. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1036 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030904/65444103/winmail.bin From mhammond at skippinet.com.au Fri Sep 5 14:29:26 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu Sep 4 23:29:35 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: Message-ID: <0e3b01c3735d$e8d19d30$f502a8c0@eden> > [Mark Hammond] > > Some "normal" users get the vast majority of their ham as html. I > > know lots personally. > > Cool! I don't know any. How's the addin working for them? > Everyone who > has endured my comments in tokenizer.py knows I was > obsessively determined > not to penalize HTML msgs, but since I don't know anyone who > gets a ton of > HTML ham, the effectiveness of all that hasn't been tested by me. Sorry, I should have been clearer - I know these people, but they don't use the addin - they either just use web mail, or outlook express, or don't have a spam problem - but they all use HTML, mainly as that is the default. Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1944 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030905/cf7d8084/winmail.bin From T.A.Meyer at massey.ac.nz Fri Sep 5 16:31:04 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 23:31:16 2003 Subject: [Spambayes] Enable Filtering Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E58A5@its-xchg4.massey.ac.nz> > I have just installed Spambayes. I have it trained and > pointed to the folders to filter, but when i select the > "Delete as Spam" button, it tells me it has to be enabled. I > can't find where to enable it at. If it's the filtering > button on the manager, it's dimmed out and I don't know how > to fix that. Please see FAQ 3.8: =Tony Meyer From tim.one at comcast.net Fri Sep 5 00:39:37 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Sep 4 23:39:44 2003 Subject: [Spambayes] Feature Freeze In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E589E@its-xchg4.massey.ac.nz> Message-ID: [Tony Meyer's evil twin] >> As of now, we're not adding any new features to spambayes [The real Tony Meyer] > This sounds a bit unclear. No, it doesn't -- it was perfectly clear, and I got a greatly appreciated laugh from it -- thanks! > In case I confused anyone. We're not adding any new features to > spambayes *for the next few weeks*. Spoilsport . From T.A.Meyer at massey.ac.nz Fri Sep 5 16:40:15 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 23:40:38 2003 Subject: [Spambayes] Spambayes 1.0a5 comments Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E58AF@its-xchg4.massey.ac.nz> > First, thanks! :-) >From us all, you're welcome :) > I tested the new word query wild card option, works great. > Except that it would be better to have a tabular view, it would take > much less space. Also we could have an option for the maximum number > of hits on the query page (drop down of 10, 100, 1000 etc). I agree with all of this. It was just the quickest way to do it, and I didn't have much time for spambayes before the 1.0a5 deadline. I think what I'll probably do (if no-one objects) is create an 'advanced find' query next to the existing one, and change the existing one back to how it was. The advanced one can have an option for the number of maximum hits, plus lots of other options (doing a regex search instead of a plain wildcard, for example). The results from this can be put into a separately designed page. All of this is a couple of weeks away, though :) (If you like, you can reopen that feature request so that I remember, but I probably will). > The Tokenize button that I wished for didn't make it (it > would be right to Classify, and output all tokens in a message). Sorry, didn't get a chance to do it; the main difficulty was that I'm not sure how to present all of this without having a huge page. This should be in the 1.1a1 release in some form, though. > I wonder if there is a command line tool that shows all tokens for > a message? You can probably use hammiefilter to do this somehow. Otherwise, try something like this (untested): >>> from spambayes import tokenizer >>> f = file("path\to\my\message", "r") >>> msg = f.read() >>> f.close() >>> g = tokenizer.Tokenizer().tokenize(msg) >>> for t in g: >>> print t from in Python (just "python" from a command prompt should get you into Python). =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 5 16:42:23 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 4 23:42:33 2003 Subject: [Spambayes] Feature Freeze Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E58B5@its-xchg4.massey.ac.nz> > [Tony Meyer's evil twin] > >> As of now, we're not adding any new features to spambayes > > [The real Tony Meyer] > > This sounds a bit unclear. > > No, it doesn't -- it was perfectly clear, and I got a greatly > appreciated laugh from it -- thanks! :) It was that ta-meyer@ihug.co.nz fellow that got it wrong. I (t.a.meyer@massey.ac.nz) am far too clever ;) =Tony Meyer From rglasx at hotmail.com Thu Sep 4 20:57:17 2003 From: rglasx at hotmail.com (Ra glas) Date: Thu Sep 4 23:50:02 2003 Subject: [Spambayes] IMAP filter failes (readonly issue) - HELP!!! :( Message-ID: Hi, I'm using IMAP as my main mail service, and was pointed to by a friend to SpamBayes as a good solution for killing all the annoying spam flying around lately. However, I've not been able to get the thing going since I get "imaplib.readonly: HamBox is not writable" error (complete trace below). I've used the very helpful info from "Setting up Server-Side Spam filtering for IMAP" (http://www.boost-consulting.com/writing/server-side.html), and it was a real treat - you should at least add a link to it in the FAQ. Still no-go :( I'm using a windows system and my mail server is Mdaemon. I've tried to close outlook while running the script, but it didn't change a thing. No one was using the server at the time of the test. I've installed the following packages: Python-2.3.exe win32all-155.exe (Python w32 add-ons) spambayes-1.0a4.zip I've setup bayescustomize.ini as follows: [imap] server: 127.0.0.1 username: rani password: filter_folders: Inbox unsure_folder: incoming-unsure spam_folder: incoming-spam ham_train_folders: HamBox spam_train_folders: SpamBox and here is what I got: C:\Program Files\python\spambayes-1.0a4>..\python.exe imapfilter.py -t -v SpamBayes IMAP Filter Alpha1, version 0.01 (May 2003), using SpamBayes IMAP Filter Web Interface Alpha1, version 0.01 and engine SpamBayes Beta2, version 0.2 (July 2003). Loading database hammie.db... Loading state from hammie.db database hammie.db is a new database Done. Training Traceback (most recent call last): File "imapfilter.py", line 776, in ? run() File "imapfilter.py", line 762, in run imap_filter.Train() File "imapfilter.py", line 590, in Train imap.SelectFolder(fol) File "imapfilter.py", line 231, in SelectFolder response = self.select(folder, False) File "C:\Program Files\python\lib\imaplib.py", line 616, in select raise self.readonly('%s is not writable' % mailbox) imaplib.readonly: HamBox is not writable Many thanks to all of you and keep up the *G R E A T* work!!! :) C Ya, Rani _________________________________________________________________ MSN 8 with e-mail virus protection service: 2 months FREE* http://join.msn.com/?page=features/virus From T.A.Meyer at massey.ac.nz Fri Sep 5 17:08:53 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 5 00:09:24 2003 Subject: [Spambayes] IMAP filter failes (readonly issue) - HELP!!! :( Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E58CE@its-xchg4.massey.ac.nz> > I've used the very helpful info from "Setting up Server-Side > Spam filtering for IMAP" > (http://www.boost-consulting.com/writing/server-side.html), and it > was a real treat - you should at least add a link to it in the FAQ. I could have sworn we added this somewhere. I'll have a look and add it if it hasn't been. > and here is what I got: [...] > File "C:\Program Files\python\lib\imaplib.py", line 616, in select > raise self.readonly('%s is not writable' % mailbox) > imaplib.readonly: HamBox is not writable This sounds very much like that imap folder is read-only (at least for the user that is logging in). If you log into the imap server (manually, or with a mail client), can you make any changes to this folder? =Tony Meyer From rmalayter at bai.org Thu Sep 4 12:55:06 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Fri Sep 5 00:21:12 2003 Subject: [Spambayes] General configuration to change location of database Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01115103@cliff.bai.org> From: Mark Hammond [mailto:mhammond@keypoint.com.au] > Note that the data should go in a file called > "default_configuration.ini". I am fairly sure > this works I still think the "move database" feature is NOT working as documented. I created a file: C:\Documents and Settings\rmalayter\Application Data\SpamBayes\default_configuration.ini This file has only the following two lines: [General] data_directory: "C:\temp" I then copy my default_bayes_database.db and default_message_database.db, as well as .ini, to the c:\temp directory. When I start outlook, I get a "spambayes plug-in failed to initialize" error. Please note I have full permissions to every directory on the machine, so that is not the issue. I laso tried these varaiations on the data_directory line, and none of them worked: data_directory:"C\temp" data_directory:C:\temp data_directory: C:\temp\ data_directory:C:\temp\ data_directory: "C:\temp\" > If you still believe this a bug I do, at the very least in the documentation, but possibly a regression error in the plug-in. What is the "default_bayes_customize.ini" file's purpose? I don't see it mentioned in the documentation. > please give exact details of what you tried - > the fully qualified name of the ini file you edited, > the complete contents of that file, and the complete > contents of the log. > If you do this, please also reopen the bug. The logfile results: Error connecting to Outlook! Traceback (most recent call last): File "out1.pyz/addin", line 1053, in OnConnection File "out1.pyz/manager", line 724, in GetManager File "out1.pyz/manager", line 221, in __init__ LookupError: no codec search functions registered: can't find encoding ERROR: 'There was an error initializing the SpamBayes addin\r\n\r\nPlease re-start Outlook and try again.' Traceback (most recent call last): File "out1.pyz/addin", line 1053, in OnConnection File "out1.pyz/manager", line 724, in GetManager File "out1.pyz/manager", line 221, in __init__ LookupError: no codec search functions registered: can't find encoding SpamBayes - Disconnecting from Outlook Addin terminating: 0 COM client and 1 COM servers exist. I will re-open the bug. -Ryan- From ltrag at optonline.net Fri Sep 5 01:24:35 2003 From: ltrag at optonline.net (Lanny Trager) Date: Fri Sep 5 00:24:42 2003 Subject: [Spambayes] Can I upgrade without losing my training? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E58CE@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13031E58CE@its-xchg4.massey.ac.nz> Message-ID: <200309050024.35779.ltrag@optonline.net> And if I can, how can this be accomplished? Thanks in advance. Lanny /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From T.A.Meyer at massey.ac.nz Fri Sep 5 17:32:42 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 5 00:32:58 2003 Subject: [Spambayes] General configuration to change location of database Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E58ED@its-xchg4.massey.ac.nz> > What is the "default_bayes_customize.ini" file's purpose? I > don't see it mentioned in the documentation. This holds any customisation options for the underlying classification engine. This file is the equivalent of the bayescustomize.ini or .spambayesrc files that are mentioned elsewhere, but for the Outlook plug-in. The FAQ has lots of information about the options that can go in these files. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 5 21:06:43 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 5 04:07:17 2003 Subject: [Spambayes] Can I upgrade without losing my training? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5920@its-xchg4.massey.ac.nz> Yes. > And if I can, how can this be accomplished? AFAIK, for all the applications (it helps to say which one you're using) you can just do the install over the top of the old one and everything will work fine. This is definitely the case with the Outlook plug-in (unless, possibly, you are upgrading from a really old version like 003), pop3proxy, and imapfilter. =Tony Meyer From tmachielsen at team.inter.net Fri Sep 5 11:41:14 2003 From: tmachielsen at team.inter.net (Ton Machielsen) Date: Fri Sep 5 04:41:32 2003 Subject: [Spambayes] New to this list and trying to get ths working on Outlook Express In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E5920@its-xchg4.massey.ac.nz> Message-ID: <008b01c37389$7b0bcab0$0601a8c0@LaptopTon> Hi all, I found spambayes through a link in a tagline, downloaded it, installed Python and spambayes, tried to get it working on Outlook Express, got a million errors and subscribed to this list. ;-) My question (probably asked a zillion times, but couldn't find anything usefull in the archives) is: Is there a step-by-step newbie-with-no-knowledge-at-all guide on how to use spambayes with Outlook Express? When i run pop3proxy.py -b i get: C:\SpamBayes>pop3proxy.py -b SpamBayes POP3 Proxy Beta2, version 0.2 (September 2003), using SpamBayes POP3 Proxy Web Interface Alpha3, version 0.03 and engine SpamBayes Beta2, version 0.2 (July 2003). Loading database... Traceback (most recent call last): File "C:\SpamBayes\pop3proxy.py", line 819, in ? run() File "C:\SpamBayes\pop3proxy.py", line 800, in run prepare(state=state) File "C:\SpamBayes\pop3proxy.py", line 744, in prepare state)) File "C:\SpamBayes\smtpproxy.py", line 521, in CreateProxies trainer) File "C:\SpamBayes\smtpproxy.py", line 253, in __init__ proxyArgs) File "C:\SpamBayes\spambayes\Dibbler.py", line 267, in __init__ self.bind(port) File "C:\Python23\lib\asyncore.py", line 300, in bind return self.socket.bind(addr) File "", line 1, in bind socket.error: (10013, 'Permission denied') Permission denied on a Windows system? ;-) Well, Windows XP Pro at this end. Latest stable Python. What else do you kings need to help me out? Regards, Ton. From T.A.Meyer at massey.ac.nz Fri Sep 5 21:45:45 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 5 04:46:13 2003 Subject: [Spambayes] New to this list and trying to get ths working onOutlook Express Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5922@its-xchg4.massey.ac.nz> > Is there a > step-by-step newbie-with-no-knowledge-at-all guide on how to > use spambayes with Outlook Express? The readme.txt in the 1.0a5 release is as close as you get. It's not going to help with this problem ;) > When i run pop3proxy.py -b i get: [...] > File "C:\Python23\lib\asyncore.py", line 300, in bind > return self.socket.bind(addr) > File "", line 1, in bind > socket.error: (10013, 'Permission denied') This probably means that the port that it's trying to bind is already taken. If you haven't set anything up, then this almost certainly is port 8880, which it's trying to bind to put the interface on. If you have set it up, then it might be ports 25 or 110 (or whatever you've set it to proxy on). Try this and see if it works: pop3proxy.py -u 8881 -b (or some other port number that you know is free). =Tony Meyer From tmachielsen at team.inter.net Fri Sep 5 12:03:08 2003 From: tmachielsen at team.inter.net (Ton Machielsen) Date: Fri Sep 5 05:03:24 2003 Subject: [Spambayes] New to this list and trying to get ths working on Outlook Express (part 2) In-Reply-To: <008b01c37389$7b0bcab0$0601a8c0@LaptopTon> Message-ID: <009201c3738c$8a5444e0$0601a8c0@LaptopTon> Downloaded a05, installed it, ran pop3proxy.py -b and got the web interface. Configured the pop and smtp servers, try to save my changes and get: 500 Server error Traceback (most recent call last): File "C:\SpamBayes\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "C:\SpamBayes\spambayes\UserInterface.py", line 688, in onChangeopts self.reReadOptions() File "C:\SpamBayes\spambayes\ProxyUI.py", line 517, in reReadOptions state = self.state_recreator() File "C:\SpamBayes\pop3proxy.py", line 720, in _recreateState prepare(state) File "C:\SpamBayes\pop3proxy.py", line 744, in prepare state)) File "C:\SpamBayes\smtpproxy.py", line 521, in CreateProxies trainer) File "C:\SpamBayes\smtpproxy.py", line 253, in __init__ proxyArgs) File "C:\SpamBayes\spambayes\Dibbler.py", line 267, in __init__ self.bind(port) File "C:\Python23\lib\asyncore.py", line 300, in bind return self.socket.bind(addr) File "", line 1, in bind error: (10013, 'Permission denied') I guess i'm not lucky here. ;-) Ton. From tmachielsen at team.inter.net Fri Sep 5 12:08:43 2003 From: tmachielsen at team.inter.net (Ton Machielsen) Date: Fri Sep 5 05:08:57 2003 Subject: [Spambayes] New to this list and trying to get ths working onOutlook Express In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E5922@its-xchg4.massey.ac.nz> Message-ID: <009501c3738d$524ff2a0$0601a8c0@LaptopTon> Yo, that did it!! Thanks! Ton. -----Original Message----- From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] Sent: viernes, 05 de septiembre de 2003 10:46 To: Ton Machielsen; spambayes@python.org Subject: RE: [Spambayes] New to this list and trying to get ths working onOutlook Express > Is there a > step-by-step newbie-with-no-knowledge-at-all guide on how to > use spambayes with Outlook Express? The readme.txt in the 1.0a5 release is as close as you get. It's not going to help with this problem ;) > When i run pop3proxy.py -b i get: [...] > File "C:\Python23\lib\asyncore.py", line 300, in bind > return self.socket.bind(addr) > File "", line 1, in bind > socket.error: (10013, 'Permission denied') This probably means that the port that it's trying to bind is already taken. If you haven't set anything up, then this almost certainly is port 8880, which it's trying to bind to put the interface on. If you have set it up, then it might be ports 25 or 110 (or whatever you've set it to proxy on). Try this and see if it works: pop3proxy.py -u 8881 -b (or some other port number that you know is free). =Tony Meyer From jason.ahrens at rogers.com Thu Sep 4 13:25:44 2003 From: jason.ahrens at rogers.com (jason.ahrens@rogers.com) Date: Fri Sep 5 06:02:45 2003 Subject: [Spambayes] Spambayes processing Message-ID: <20030904162544.OOCG83282.fep01-mail.bloor.is.net.cable.rogers.com@localhost> (I seem to be having problems getting this sent. Appologies if this ends up being duplicated) Alright, So after much touble and patching, I finally got Spambayes installed. I trained spambayes with the saved good mail and spam I had. I ended up with a .hammie.db of approx 1.2M. I set spambayes in Procmail, and set spambays a working. The problem is, everything comes up as "uncertain" with a score or 0.50. This sounds like Spambayes is working as if it didn't have a database to work with. My ~/.spambayesrc contains the following. I have checked a few times to ensure the filepath is correct: [Storage] persistent_use_database = True persistent_storage_file = /home/kovu/.hammiedb Any hints as to what may be up is greatly appreciated. Thanks Jason 1 From T.A.Meyer at massey.ac.nz Fri Sep 5 23:07:40 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 5 06:07:53 2003 Subject: [Spambayes] Spambayes processing Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5923@its-xchg4.massey.ac.nz> > The problem is, everything comes up as "uncertain" with a > score or 0.50. This sounds like Spambayes is working as if it > didn't have a database to work with. Indeed it does. > My ~/.spambayesrc contains the following. I have checked a > few times to ensure the filepath is correct: > [Storage] > persistent_use_database = True > persistent_storage_file = /home/kovu/.hammiedb > > Any hints as to what may be up is greatly appreciated. Maybe spambayes isn't finding the .spambayesrc file? If you put some sort of garbage in there and then run hammiefilter (I presume that's what you're using), you should get an error printed out if it finds the file. If it doesn't, then you can get the envar BAYESCUSTOMIZE to point to it (but let us know, because it should find it in ~/). If it is, then the problem must lie with the persistent_storage_file option, somehow... =Tony Meyer From richie at entrian.com Fri Sep 5 12:50:35 2003 From: richie at entrian.com (Richie Hindle) Date: Fri Sep 5 06:50:56 2003 Subject: [Spambayes] Spambayes processing Message-ID: [Jason] > I ended up with a .hammie.db of approx 1.2M > [...] > persistent_storage_file = /home/kovu/.hammiedb Could be just a typo in your message, but those filenames don't match. -- Richie Hindle richie@entrian.com From david.handy at automsoft.com Fri Sep 5 13:05:10 2003 From: david.handy at automsoft.com (David Handy) Date: Fri Sep 5 07:02:18 2003 Subject: [Spambayes] Spambayes no longer recognizes "Spam" folder Message-ID: <98F7681DB5B4D311AC9D0090279C217385C549@SOL> Hi -- I'm running the Spambayes Outlook plug-in [Beta 1, binary version 3, July 03] on Outlook 2000 SR-1 [version 9..0.0.3821] and Windows 2000 [SP 3]. It's been running fine for a month or two. But yesterday, Spambayes just stopped "seeing" my spam folder (which is called "Spam"). It shows in the Spambayes Manager fields, and won't respond to any changes I make. I've tried renaming and re-assigning the spam folder but it doesn't work. The possible spam folder appears to work fine. Any ideas why this is so? Wd appreciate replies offlist. tia, David Handy From gbsk at gci.net Fri Sep 5 04:24:43 2003 From: gbsk at gci.net (Guru Bandhu Khalsa) Date: Fri Sep 5 07:25:06 2003 Subject: [Spambayes] spambayes Message-ID: <017901c373a0$57e409f0$2d9aed18@D6TBFF31> Hello, I just got Window s XP. Does spambayes work with Outlook Express? I saw that it is compatable with Outlook, which I don't have. I need a good anti-spam software. Thanks, GB From seant at iname.com Fri Sep 5 08:33:59 2003 From: seant at iname.com (Sean True) Date: Fri Sep 5 07:33:52 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <0e3b01c3735d$e8d19d30$f502a8c0@eden> Message-ID: <004101c373a1$9e455a60$0201a8c0@swapwizard.com> > > Cool! I don't know any. How's the addin working for them? > > Everyone who > > has endured my comments in tokenizer.py knows I was > > obsessively determined > > not to penalize HTML msgs, but since I don't know anyone who > > gets a ton of > > HTML ham, the effectiveness of all that hasn't been tested by me. > > Sorry, I should have been clearer - I know these people, but > they don't use the addin - they either just use web mail, or > outlook express, or don't have a spam problem - but they all > use HTML, mainly as that is the default. > > Mark. HTML mail works just fine. I get TONS of html ham, and I know our customers do. Since they pay us to make sure that InBoxer/SpamAtBay work for them, and yell when it doesn't, I'm pretty sure the tokenizer work here was _well_ worth while. That said, it might be good to have a button for: "I hate html mail, tokenize the html mark up and let the chips fall where they may" -- Sean From edrubins at mindspring.com Fri Sep 5 09:01:55 2003 From: edrubins at mindspring.com (Ed Rubinsky) Date: Fri Sep 5 08:02:00 2003 Subject: [Spambayes] Re: Eudora In-Reply-To: Message-ID: <5.1.0.14.0.20030905075802.00a961e8@localhost> >I use Eudora on a Win2k. Can i use the executable or the pop3proxy and edit >the eudora ini file? > >Alain See FAQ 2.3 at http://spambayes.sourceforge.net/faq.html#how-do-i-configure-eudora-for-use-with-spambayes. Yell if you have problems. I'd recommend using release 1.05a as this fixes at least one problem with Eudora and smtpproxy. Good luck. Best, Ed From richie at entrian.com Thu Sep 4 10:07:21 2003 From: richie at entrian.com (Richie Hindle) Date: Fri Sep 5 08:11:20 2003 Subject: [Spambayes] Spambayes 1.0a5 released Message-ID: Spambayes 1.0a5 has been released. Here's WHAT_IS_NEW.txt: This file covers the major changes between each release. For more details, the reader is referred to the changelog (changelog.txt in the main directory of the archive), or for extreme details, to the check-ins archive (please see ) Changes are broken into sections for each application, plus one that will probably only interest developers, and one for everything else. Any actions necessary to move to this release from the previous release are noted in the "Transition" section. New in Alpha Release 5 ====================== -------------------------- ** Incompatible changes ** -------------------------- The values taken by some options have changed, so if you're upgrading from a previous version, you may need to update your configuration file (.spambayesrc or bayescustomize.ini) o allow_remote_connections now takes a list of allowed IP addresses, or the word 'localhost', or an asterisk (meaning all connections are accepted). o notate_to and notate_subject now take a comma-separated list of one or more of 'spam', 'ham' and 'unsure', allowing you to control which classes of message are notated. Outlook Plugin -------------- o Added a diagnostics dialog with functions to make it easier for users to help developers track down and fix bugs. o Added a 'timer' method of determining when to filter mail that should work better with Outlook's rule system. o Added a button on the Advanced tab of the dialog to display the SpamBayes data folder. o Moved "Filter Now" to an item on the drop down menu on the toolbar. o Items that can be filtered and trained include "IPM.Note" (normal messages) and "IPM.Anti-Virus*" (virus alerts by some software). o Changed the default filter action to "move" (instead of "untouched"). o Added a Wizard to assist with initial configuration (this will present itself when necessary). o Changed to allow filtering to be enabled, even if no training has been done. o Added a "New Folder" button to the folder selector dialog. o Massive changes to the dialog system (which should fix some problems), including changing the configuration dialog to a tabbed interface. o "Show Clues" now shows the percentage, as well as the raw score. o Added a "Help" menu to the drop down menu, with various information. o Added the ability to check for the latest version via an item on the drop down menu. o Hopefully, the "unread flag" issue is now fixed. o Fixed many problems with working on systems where English is not the default language, or where profile names have non-English characters. POP3 Proxy / SMTP Proxy / POP3 Proxy Service -------------------------------------------- o Fixed "assert hamcount <= nham" problem. o Starting and stopping the POP3 Proxy service (for Windows NT, Windows 2000 and Windows XP users) has been improved. Most noticeably, this means that the SMTP Proxy will start (if it is needed) as well. o Improve the "notate to" and "notate subject" options, so that ham and unsure messages can also be (optionally) notated in these fields. o Add the ability to skip caching messages that are over a (user configurable) size, so that you can keep the size of the cache directories smaller, once these messages are correctly classified. o Added the ability to skip caching messages that have a precedence of "bulk" (most mailing list messages), so that you can keep the size of the cache directories (and review list) smaller, once these messages are correctly classified. o Fixed the "ASCII decoding error" problem. o The SMTP proxy tries harder to pass on the command formatted exactly as it was given. This should make it more reliable. o Add the ability to have the SMTP proxy train on the message sent to it, rather than looking up the id in the cache (which is still possible, and generally the better option). o Removed the ability to add the SpamBayes identification number to the body of messages (it can still be added as a header). o The review messages page now puts unsure messages at the top. o The POP3 proxy should now work with fetchmail. o You can once again specify local addresses as well as ports for the POP3 proxy to listen on (was broken in 1.0a3 and 1.0a4). o A bug with the SMTP proxy that would show up in some cases as an "unrecognised command" error the mail client (particularly Eudora) was fixed. IMAP Filter ----------- o If you didn't use the -p switch to enter your password interactively, imapfilter would try and get it from the options, but if it wasn't there yet (because you hadn't done the setup yet), it would crash. This is now fixed. General ------- o Added the ability to store the SpamBayes database in a mySQL or postreSGL database table (currently supported by hammiefilter and the POP3 proxy). o Removed the ability to use the 'dumbdbm' as the storage method. (See the FAQ for reasons why). o We now allow the '@' and '=' characters in paths. o Added a simple n-way classifier using a cascade of binary SpamBayes classifiers. o Added version information to the web interface. o Fixed the yellow colour of the header boxes in the web interface. o Fixed restoring defaults from the web interface. o Added a missing line break in the status pane on the web interface when there are no proxies configured. o Prevent the "Show clues" links on the web interface's training page from word-wrapping and making all the table rows two lines high. o You can now put "*" at the end of a word in the "Word Query" box on the web interface, and have it show you the first ten words, and how many words there are in total, in the database that start with that word. o The web interface now supports HTTP-Auth. o Added a new script (code-named 'overkill.py') which enables 'drag and drop' training for POP3 users. This is currently still in the experimental stage, and anyone interested in trying it out should enquire on the SpamBayes mailing list (). Developer --------- o Created a directory for test suites, including a storage.py test. o An empty 'allowed values' now allows an empty string. o Add a get_option method, so an option instance itself can be fetched. o Support fetching the "latest" set of version data from the spambayes web site. Transition ========== If you are transitioning from a version older than 1.0a4, please also read the notes in the previous release notes (accessible from ). o If you were previously using the 'dumbdbm' storage method (you will have files called "hammie.db.dat", "hamie.db.dir" and "hammie.db.bak", rather than one file called "hammie.db"), then you will need to change to using either a pickle (please see the FAQ: ), bsddb, gdbm, or one of the new SQL based storage methods. The 'dumbdbm' storage method resulting in many databases being corrupted, and was never the best choice for storage, in any case. Although you can use the dbExpImp.py script to convert your database to your new storage system, we recommend that you retrain from scratch, as it is most likely that your database has been corrupted. o If you were using the options to notate the "To" or "Subject" headers with the message's classification, you will need to update your configuration file, as the format for these options have changed. o The ability to add the SpamBayes id to the message body has been removed, which means that Outlook Express users can no longer use the SMTP proxy and have it retrieve messages from the cache. These users can use the SMTP proxy by training on the forwarded message itself, but this is not recommended, as clues in the message will have changed (the "From" address will be yours, for example). At this time, you will have to use the web interface for training, although there is the possibility of 'drag and drop' training being added in a release in the near future. Reported Bugs Fixed =================== The following bugs tracked via the Sourceforge system were fixed: 776808, 795749, 787251, 790051, 743131, 779319, 785389, 786952, 788495, 790406, 788008, 787296, 788002, 780612, 784323, 784296, 780819, 780801, 779049, 765912, 777026, 777165, 693387, 690418, 719586, 769346, 761499, 769346, 773452, 765042, 760062, 768162, 768221, 797776, 797316, 796996 A url containing the details of these bugs can be made by appending the bug number to this url: http://sourceforge.net/tracker/index.php?func=detail&group_id=61702&atid=498103&aid= Feature Requests Added ====================== The following feature requests tracked via the Sourceforge system were added: 789916, 698036, 796832, 791319 A url containing the details of these feature requests can be made by appending the request number to this url: http://sourceforge.net/tracker/index.php?func=detail&group_id=61702&atid=498106&aid= Patches integrated =================== The following patches tracked via the Sourceforge system were integrated: 791254, 790615, 788001, 769981, 791393 A url containing the details of these feature requests can be made by appending the request number to this url: http://sourceforge.net/tracker/index.php?func=detail&group_id=61702&atid=498105&aid= -- Richie Hindle richie@entrian.com From rcoe at CambridgeMA.GOV Fri Sep 5 09:52:41 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Fri Sep 5 08:55:29 2003 Subject: [Spambayes] RE: Feature Freeze Message-ID: To those of us who exist mostly in the Microsoft world, the idea of actually fixing bugs before continuing to cram in more "features" is unnatural, bordering on the bizarre. Are you sure you're willing to go through with this?? Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: Tony Meyer [mailto:ta-meyer@ihug.co.nz] > Sent: Thursday, September 04, 2003 11:21 PM > To: spambayes@python.org; spambayes-dev@python.org > Subject: [Spambayes] Feature Freeze > > > I figure we should probably announce this ;) Apologies for > cross-posting this and the previous message. > > As of now, we're not adding any new features to spambayes [1]. > The reason for this is to iron out any remaining bugs, without > possibly adding more, so that we can release 1.0b1. > > ... From dreed at srdcorp.com Fri Sep 5 10:01:41 2003 From: dreed at srdcorp.com (Reed, David) Date: Fri Sep 5 09:03:36 2003 Subject: [Spambayes] RE: Feature Freeze Message-ID: <4C542552EF434E43B2F2572854A73CBD6F6A46@corp-mail1.srdcorp.com> Yeah, really, how FORWARD is that??? ***spaced out in the microsoft zone...*** David -----Original Message----- From: Coe, Bob [mailto:rcoe@CambridgeMA.GOV] Sent: Friday, September 05, 2003 8:53 AM To: spambayes@Python.ORG Subject: [Spambayes] RE: Feature Freeze To those of us who exist mostly in the Microsoft world, the idea of actually fixing bugs before continuing to cram in more "features" is unnatural, bordering on the bizarre. Are you sure you're willing to go through with this?? Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: Tony Meyer [mailto:ta-meyer@ihug.co.nz] > Sent: Thursday, September 04, 2003 11:21 PM > To: spambayes@python.org; spambayes-dev@python.org > Subject: [Spambayes] Feature Freeze > > > I figure we should probably announce this ;) Apologies for > cross-posting this and the previous message. > > As of now, we're not adding any new features to spambayes [1]. > The reason for this is to iron out any remaining bugs, without > possibly adding more, so that we can release 1.0b1. > > ... _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From rcoe at CambridgeMA.GOV Fri Sep 5 10:04:42 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Fri Sep 5 09:06:12 2003 Subject: [Spambayes] RE: Spambayes 1.0a5 released Message-ID: I'm a bit confused. Your announcement suggests that the new release includes a new version (008?) of the outlook plugin, but everything on the Web site says that 007 is still current. Please clarify. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: Richie Hindle [mailto:richie@entrian.com] > Sent: Thursday, September 04, 2003 5:50 PM > To: spambayes@python.org; spambayes-dev@python.org; > spambayes-announce@python.org > Subject: [Spambayes] Spambayes 1.0a5 released > > > > [Resending - apologies if this arrives twice, but I've been > having email troubles] > > Spambayes 1.0a5 has been released. Here's WHAT_IS_NEW.txt: > ... > > Outlook Plugin > -------------- > o Added a diagnostics dialog with functions to make it easier for users to > help developers track down and fix bugs. > o Added a 'timer' method of determining when to filter mail that should work > > etc., etc. From rcoe at CambridgeMA.GOV Fri Sep 5 10:51:24 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Fri Sep 5 09:52:54 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: Actually, I think the Outlook default is RTF, an idiosyncratic Microsoftian compromise format that combines most of the drawbacks of plain text with the disadvantages of HTML. Bob > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Mark Hammond > Sent: Thursday, September 04, 2003 11:29 PM > To: 'Tim Peters' > Cc: spambayes@python.org > Subject: RE: [Spambayes] how spambayes handles image-only spams > > > Sorry, I should have been clearer - I know these people, but > they don't use the addin - they either just use web mail, or > outlook express, or don't have a spam problem - but they all > use HTML, mainly as that is the default. > > Mark. From skip at pobox.com Fri Sep 5 09:54:54 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Sep 5 09:55:02 2003 Subject: [Spambayes] Big changes to cvs In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130212AE1B@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130212AE1B@its-xchg4.massey.ac.nz> Message-ID: <16216.38318.586531.912642@montanaro.dyndns.org> Tony> If you don't use spambayes from cvs, you can ignore this. The Tony> major changes discussed on spambayes-dev for the 1.0a6 have been Tony> started. For those who haven't read that thread, this includes Tony> renaming and moving the majority of the scripts (just about Tony> everything in the root directory), and removing backwards Tony> compatibility code from the configuration file reading. One nit. The indication in the checkin message and I thought in the discussion was that the script prefix was going to be "sb-". Instead the prefix is "sb_". Skip From rmalayter at bai.org Fri Sep 5 11:23:42 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Fri Sep 5 11:23:46 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D3ED@cliff.bai.org> From: Coe, Bob [mailto:rcoe@CambridgeMA.GOV] > Actually, I think the Outlook default is > RTF, an idiosyncratic Microsoftian compromise > format that combines most of the drawbacks of > plain text with the disadvantages of HTML. I think RTF was the default for Outlook 97, 98, and perhaps 2000. But I think Outlook 2002 (i.e. Outlook XP) defaults to HTML. But anyway, Exchange server converts RTF messages bound for the internet to HTML by default, so the end result of the default settings is still a lot of HTML ham coming from Outlook/Exchange shops. Regards, Ryan From Markus.Truzzi at dcsg.com Fri Sep 5 12:25:08 2003 From: Markus.Truzzi at dcsg.com (Truzzi, Markus) Date: Fri Sep 5 11:27:38 2003 Subject: [Spambayes] SPAM Message-ID: <3F3D0548F4B9D61199D00000E87F1B6B06FB77D1@mail.dickssportinggoods.com> To Whom it may concern, I am reviewing SPAM products and a friend sent a link to your site. Reviewing the site I am unclear if your product can do what I need and that is to be mail SPAM blocker for a business of about 1200 mailboxes. We have Exchange 5.5 and working to go to Exchange2003. We would like the products that are loaded to a separate server that all mail routes through for filtering. Does this product do that? Is it compatible with our environment and number of people? If so, can you send a link to the proper product? Thanks Markus Truzzi Sr. Network Analyst 412.809.0100 x3204 Cell 412.841.5126 markus.truzzi@dcsg.com From tim.one at comcast.net Fri Sep 5 12:55:47 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Sep 5 11:55:51 2003 Subject: [spambayes-dev] Re: [Spambayes] Big changes to cvs In-Reply-To: <16216.38318.586531.912642@montanaro.dyndns.org> Message-ID: [Skip] > One nit. The indication in the checkin message and I thought in the > discussion was that the script prefix was going to be "sb-". Instead > the prefix is "sb_". Tony quickly discovered that a Python module with name starting with "sb-" can't be used in an import statement from any other module. I'm unclear on why it was thought desirable to uglify all the names, but making modules unimportable should have an obvious downside to everyone . From thetasig at comcast.net Fri Sep 5 10:00:15 2003 From: thetasig at comcast.net (theta sigma) Date: Fri Sep 5 12:07:47 2003 Subject: [Spambayes] Re: Eudora SSL Send Error In-Reply-To: <5.1.0.14.0.20030905064946.04c8e848@localhost> References: <5.2.1.1.2.20030904222049.01d873c0@localhost> <5.1.0.14.0.20030904210053.00ab3978@localhost> <5.2.1.1.2.20030904160751.01d7a818@localhost> <5.1.0.14.0.20030904164225.00aa5a80@localhost> Message-ID: <5.2.1.1.2.20030905084727.01cfef90@localhost> At 06:52 AM 9/5/2003 -0400, Ed R. wrote: >Just had a thought - there is an smtpproxy Eudora - Norton Anti-Virus >problem that is fixed in the new release (1.05a.) It just might fix your >SSL problem as well. Then again, it may not. If you install 1.05a could >you try sending email through smtpproxy again just to see? Got the new version running. Well that still didn't work. However, I did experiment and changed the setting in the Eudora Personalities / Properties dialogue to "Never" use the STARTTLS (secure sockets layer?). When I don't use SSL the message will send without problems. However, I think I'd prefer to use SSL for messages I send. So if it is something that can be supported in future versions of SpamBayes I think that would be helpful to Eudora users. A temporary work around is to use the standard SMTP paths (not the localhost) bypassing SpamBayes when sending. Here is the text of the error message ("Dominant" is the primary account): , Connecting to the Mail Server..., EHLO [08:52:15 AM] SSL Negotiation Failed: You have configured this personality/protocol to reject any exchange key lengths below 0. ,But the negotiated exchange key length is -1 Hence this established secure channel is unacceptable. Connection will be dropped. Cause: Eudora got tired of waiting for the server. (10100) Regards, -=mark=- From vanhorn at whidbey.com Fri Sep 5 10:28:05 2003 From: vanhorn at whidbey.com (G. Armour Van Horn) Date: Fri Sep 5 12:28:12 2003 Subject: [Spambayes] SPAM References: <3F3D0548F4B9D61199D00000E87F1B6B06FB77D1@mail.dickssportinggoods.com> Message-ID: <3F58B995.EB12D360@whidbey.com> Although it is possible that some of the pieces that makeup the SpamBayes project will be integrated at the server level, this will, of necessity, dramatically dilute it's effectiveness. SpamBayes is a client-side solution that gets its strength from training on exactly what a specific user does and doesn't want to see in his mailbox. Of course, if you've got some Python talent in your organization and want to help create tools for the server side, let's talk! I, for one, would love to see some progress in that direction. Van "Truzzi, Markus" wrote: > To Whom it may concern, > > I am reviewing SPAM products and a friend sent a link to your site. > Reviewing the site I am unclear if your product can do what I need and that > is to be mail SPAM blocker for a business of about 1200 mailboxes. We have > Exchange 5.5 and working to go to Exchange2003. We would like the products > that are loaded to a separate server that all mail routes through for > filtering. > > Does this product do that? > Is it compatible with our environment and number of people? > If so, can you send a link to the proper product? > > Thanks > > Markus Truzzi > Sr. Network Analyst > 412.809.0100 x3204 > Cell 412.841.5126 > markus.truzzi@dcsg.com > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -- ---------------------------------------------------------- Sign up now for Quotes of the Day, a handful of quotations on a theme delivered every morning. Enlightenment! Daily, for free! mailto:twisted@whidbey.com?subject=Subscribe_QOTD For web hosting and maintenance, visit Van's home page: http://www.domainvanhorn.com/van/ ---------------------------------------------------------- From rmalayter at bai.org Fri Sep 5 12:55:04 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Fri Sep 5 12:55:13 2003 Subject: [Spambayes] SPAM Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01115127@cliff.bai.org> Check out http://assp.sourceforge.net/, it works at the SMTP level and is what you're really looking for. That said, we use iHateSpam server Edition in our Exchange 2000 site, and we love it. There are a bunch of other commercial products out there as well. -----Original Message----- From: Truzzi, Markus [mailto:Markus.Truzzi@dcsg.com] Sent: Friday, September 05, 2003 10:25 AM To: 'spambayes@python.org' Subject: [Spambayes] SPAM To Whom it may concern, I am reviewing SPAM products and a friend sent a link to your site. Reviewing the site I am unclear if your product can do what I need and that is to be mail SPAM blocker for a business of about 1200 mailboxes. We have Exchange 5.5 and working to go to Exchange2003. We would like the products that are loaded to a separate server that all mail routes through for filtering. Does this product do that? Is it compatible with our environment and number of people? If so, can you send a link to the proper product? Thanks Markus Truzzi Sr. Network Analyst 412.809.0100 x3204 Cell 412.841.5126 markus.truzzi@dcsg.com _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From rmalayter at bai.org Fri Sep 5 12:50:35 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Fri Sep 5 13:13:45 2003 Subject: [Spambayes] Spambayes repeatedly classifies essages frommailing list as SPAM despite multiple (20+)recoveries fromspam folder Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01115126@cliff.bai.org> Meyer, Tony wrote: > Spambayes works best trained with roughly equal numbers of ham & spam; > we're still trying to come up with a good method of working with > unbalanced training data. At the moment there is an option (defaults I have several dozen folders in my mailbox that contain different types of ham. All told, this is about 7000 messages, and I have about 1500 spam messages. I used these as my training corpus with plug-in version 007. Should I instead create a "sample" folder of ham that contains about 1500 messages and train with that? What about adding a feature to the plug-in that would could the number of messages in each training folder, then use a random subsample of each folder (spam or ham) as necessary to create a balanced training corpus? From ltrag at optonline.net Fri Sep 5 14:15:51 2003 From: ltrag at optonline.net (Lanny Trager) Date: Fri Sep 5 13:16:24 2003 Subject: [Spambayes] Can I upgrade without losing my training? In-Reply-To: <5.2.1.1.2.20030905091620.00b6b000@localhost> References: <1ED4ECF91CDED24C8D012BCF2B034F13031E58CE@its-xchg4.massey.ac.nz> <5.2.1.1.2.20030905091620.00b6b000@localhost> Message-ID: <200309051315.51730.ltrag@optonline.net> First, I must apologize to my fellow list members for the paucity of details regarding my specific setup. I humbly and profusely apologize. I am running SpamBayes on RedHat 9 with the latest RedHat kernel (2.4.20-20.9), using Kmail and of course pop3proxy. I was until this morning running "spambayes-1.0a4", I have upgraded to spambayes-1.0a5 and so far all of the problems that I was having seem to have been resolved by the upgrade. In spambayes-1.0a4 I first noticed that not all of my incoming e-mail was showing up on the review page. Some would show up after I went to the config page and hit the save button. I had trained 42 ham and 42 spam by pasting the source including full headers into the Train on a message box. SpamBayes was not doing very well in catching spam though. This just didn't jive with what I had read about SapmBayes so I started searching the mailing list archives for a clue. It was begining to look like my database was corrupt. Having found some messages that suggested that my system might be using Dumbdbm and that that was not the preferred database, I downloaded which_database.py and ran it. It told me that I had Pickle, Dumbdbm, Dbhash, and Bsddb available. But claimed that my storage hammie.db was none. After a bit of fiddling, including going back a version to 1.0a3 I managed to corrupt the database enough so that I no longer got the X-spambayes: ham - spam or unsure. I was now getting X-Spambayes-Exception: exceptions.AssertionError() in probability() at spambayes/classifier.py line 307: assert hamcount <=nham This was actually helpfull to me, the mystery was over, the database was definitively currupted. By then I had subscribed to the list and recieved notification that 1.0a5 was available. This morning, I started from scratch with 1.0a5 and all seems well so far. Lanny -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From no-reply at hoemail1.firewall.lucent.com Fri Sep 5 14:52:48 2003 From: no-reply at hoemail1.firewall.lucent.com (no-reply@hoemail1.firewall.lucent.com) Date: Fri Sep 5 14:52:54 2003 Subject: [Spambayes] Virus Alert Message-ID: <200309051852.h85Iql308139@hoemail1.firewall.lucent.com> The e-mail message (file: document_all.pif) you sent to contains a virus (WORM_SOBIG.F). From richie at entrian.com Fri Sep 5 22:19:11 2003 From: richie at entrian.com (Richie Hindle) Date: Fri Sep 5 16:19:19 2003 Subject: [Spambayes] RE: Spambayes 1.0a5 released In-Reply-To: References: Message-ID: [Bob] > I'm a bit confused. Your announcement suggests that the new release > includes a new version (008?) of the outlook plugin, but everything > on the Web site says that 007 is still current. Please clarify. 1.0a5 is a source release - it includes the source of the Outlook plugin, but you should wait for the next plugin binary release to get the new plugin features. The source releases cover the POP3 proxy, IMAP proxy, procmail filters, web interface... everything but the Outlook plugin, really. The plugin code is included more for interested Python developers than for end users (possibly it shouldn't be there at all - Mark?) -- Richie Hindle richie@entrian.com From stephen at theboulets.net Fri Sep 5 16:53:40 2003 From: stephen at theboulets.net (stephen) Date: Fri Sep 5 16:53:43 2003 Subject: [Spambayes] Setting up SpamBayes for a multiuser environment Message-ID: <20030905205340.M37056@theboulets.net> I'd like to set up SpamBayes for two users on my linux box. (I use the pop proxy.) It's been working great for me, and seems to be much faster than SpamAssassin (which admittedly uses Vilpul's Razor). I'd like to have a separate database for each user. Is the best way to do this to have the source distribution in each user's home directory, and to start the web interface on different ports? BTW, anyone working on kmail integration? _________ Stephen From richie at entrian.com Fri Sep 5 23:12:51 2003 From: richie at entrian.com (Richie Hindle) Date: Fri Sep 5 17:12:58 2003 Subject: [Spambayes] Setting up SpamBayes for a multiuser environment In-Reply-To: <20030905205340.M37056@theboulets.net> References: <20030905205340.M37056@theboulets.net> Message-ID: Hi Stephen, > I'd like to have a separate database for each user. Is the best way to do this > to have the source distribution in each user's home directory, and to start > the web interface on different ports? No need to duplicate the code. Install it using "setup.py install", which will put the executable scripts in /usr/bin (or similar), and let each user run it in a different working directory with different ports configured. > BTW, anyone working on kmail integration? Not that I know of. I do know that people are successfully using both the POP3 proxy and the procmail scripts with KMail. -- Richie Hindle richie@entrian.com From fuerte at sci.fi Sat Sep 6 01:34:57 2003 From: fuerte at sci.fi (Harri Pesonen) Date: Fri Sep 5 17:35:01 2003 Subject: [Spambayes] Spambayes 1.0a5 comments In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E58AF@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13031E58AF@its-xchg4.massey.ac.nz> Message-ID: <3F590181.7000404@sci.fi> Meyer, Tony wrote: >You can probably use hammiefilter to do this somehow. Otherwise, try >something like this (untested): > > > >>>>from spambayes import tokenizer >>>>f = file("path\to\my\message", "r") >>>>msg = f.read() >>>>f.close() >>>>g = tokenizer.Tokenizer().tokenize(msg) >>>>for t in g: >>>> print t >>>> >>>> > >from in Python (just "python" from a command prompt should get you into >Python). > > I'll try it, thanks (I'm getting used to Python now). Some more comments on the Review page: It is good to have Unsure messages in top. But the button to Train should be there too. Also I would like it to train only Unsure messages, and then have Train All button to train also ham and spam messages. Another option would be to have all other messages selected as Defer by default. Harri From stephen at theboulets.net Fri Sep 5 17:48:47 2003 From: stephen at theboulets.net (stephen) Date: Fri Sep 5 17:48:52 2003 Subject: [Spambayes] Setting up SpamBayes for a multiuser environment In-Reply-To: References: <20030905205340.M37056@theboulets.net> Message-ID: <20030905214847.M20009@theboulets.net> Thanks. Would putting the commands in .xinitrc make the most sense? -- Stephen On Fri, 05 Sep 2003 22:12:51 +0100, Richie Hindle wrote > Hi Stephen, > > > I'd like to have a separate database for each user. Is the best way to do this > > to have the source distribution in each user's home directory, and to start > > the web interface on different ports? > > No need to duplicate the code. Install it using "setup.py install", > which will put the executable scripts in /usr/bin (or similar), and > let each user run it in a different working directory with different > ports configured. > > > BTW, anyone working on kmail integration? > > Not that I know of. I do know that people are successfully using > both the POP3 proxy and the procmail scripts with KMail. > > -- > Richie Hindle > richie@entrian.com _________ Stephen From richie at entrian.com Sat Sep 6 00:08:25 2003 From: richie at entrian.com (Richie Hindle) Date: Fri Sep 5 18:08:34 2003 Subject: [Spambayes] Setting up SpamBayes for a multiuser environment In-Reply-To: <20030905214847.M20009@theboulets.net> References: <20030905205340.M37056@theboulets.net> <20030905214847.M20009@theboulets.net> Message-ID: Hi Stephen, > Thanks. Would putting the commands in .xinitrc make the most sense? I guess so, though I'm no Linux expert. As long as your users always log in and start X (like if you were using xdm), it sounds like it should work. Hopefully someone more clueful will be along with a more definitive answer. -- Richie Hindle richie@entrian.com From T.A.Meyer at massey.ac.nz Sat Sep 6 14:47:36 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 5 21:48:00 2003 Subject: [Spambayes] RE: Spambayes 1.0a5 released Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E593D@its-xchg4.massey.ac.nz> > The plugin code is included [in the source releases] more for > interested Python developers than for end users (possibly it > shouldn't be there at all - Mark?) I'm not Mark, but I vote it stays. It means that people can run from the source if they need/want to without having to bother with CVS. It also makes it easier if you're running a slightly modified version, although I guess this comes under the "interested Python developers" tag. I don't think it's true at the moment, but at some points in the past, the source version has worked when the installer hasn't, so it's useful then, too. =Tony Meyer From T.A.Meyer at massey.ac.nz Sat Sep 6 14:48:54 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 5 21:49:02 2003 Subject: [Spambayes] spambayes Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E593E@its-xchg4.massey.ac.nz> > I just got Window s XP. Does spambayes work with Outlook > Express? I saw that it is compatable with Outlook, which I > don't have. I need a good anti-spam software. FAQ 2.1: =Tony Meyer From T.A.Meyer at massey.ac.nz Sat Sep 6 15:43:05 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 5 22:43:28 2003 Subject: [Spambayes] NEW ERRORS Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5949@its-xchg4.massey.ac.nz> > It hangs or takes forever during training I dont even see a > progress bar(Im running outlook 2000 sp-1)...Im running win > xp P4 1.5ghz Does it work correctly apart from training? Was that your only log file? If it does get stuck during training, there should be an error in the log. Are you using the latest version? (007). =Tony Meyer From T.A.Meyer at massey.ac.nz Sat Sep 6 15:45:33 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 5 22:45:49 2003 Subject: [Spambayes] Undo for spambayes Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E594A@its-xchg4.massey.ac.nz> > My understanding of the program is that the database is > stored in the application data folder. By default, yes. > If we use profiles and redirected folders, and the application > data is stored on the file server, is there a problem installing > Spambayes on different PC's that are used by the same user with > the same login, and have access to the same mailbox and user settings > from different locations within the internal LAN? There shouldn't be, no, as long as SpamBayes can access the files in the data directory, then it should work fine. > I realize that when I installed Spambays at home, the > database directory remained local to the PC, as redirected > files and user profiles do not initiate when connecting > through VPN over the Internet. Is there a problem with > leaving this setup in place, and in effect having two > different databases (one on the local home PC and one on the > file server), or should I uninstall the one I have at home? You should be fine having two separate databases; it's just two different users, really. =Tony Meyer From ta-meyer at ihug.co.nz Sat Sep 6 17:09:30 2003 From: ta-meyer at ihug.co.nz (Tony Meyer) Date: Sat Sep 6 00:09:36 2003 Subject: [Spambayes] RE: imapfilter.py In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031B74D5@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212AE37@its-xchg4.massey.ac.nz> > I hope you don't mind my writing to you. I'm trying to use > the SpamBayes imapfilter.py on my system (WinXP). I'm having some > difficulties which may or may not be related to the alpha status > of the software. I found you named as author. No worries, although for future reference, you should send these sorts of things to the spambayes mailing list. See . If it's an imapfilter question I'll probably still be the one to answer it, but you might get a faster answer from someone else. > I can start the web interface and set the configuration > options without running into any problems (although the names > of my IMAP folders are garbled). You're probably using 1.0a4, right? 1.0a5 was just released and one of the bugs it fixes is to do with displaying the folder names - hopefully this would fix that. > When I try to run a training session (with "Deleted Items" set as > the folder that contains spam for training) I get this message: [...] > File "C:\PROGRA~1\Python23\lib\imaplib.py", line 616, in select > raise self.readonly('%s is not writable' % mailbox) > imaplib.readonly: Deleted Items is not writable > > I get the same error regardless of which folder I try to use > for training. There was another message from someone with this same error just recently, although I hadn't heard of it until then. To me, it looks like you (with the username/password in the config file) don't have permission to write to that folder. imapfilter needs to be able to write to all the folders that it uses. If you login (using the same username/password) to the account in your normal mail client, are you able to make changes (add a message, for example) to these folders? =Tony Meyer From rafael.scholl at bbox.ch Sat Sep 6 13:30:14 2003 From: rafael.scholl at bbox.ch (Rafael Scholl) Date: Sat Sep 6 06:32:30 2003 Subject: [Spambayes] RE: imapfilter.py In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130212AE37@its-xchg4.massey. ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13031B74D5@its-xchg4.massey.ac.nz> Message-ID: <6.0.0.9.0.20030906120637.01b9d310@mail.bbox.ch> > > I can start the web interface and set the configuration > > options without running into any problems (although the names > > of my IMAP folders are garbled). > >You're probably using 1.0a4, right? 1.0a5 was just released and one of the >bugs it fixes is to do with displaying the folder names - hopefully this >would fix that. Where do I find that? I tried sourceforge.net and found a new code, as far as I can tell. When I try to run it, however, it stills calls itself "Alpha 1, version 0.01 (May 2003)" (see below), and I get this error: C:\Program Files\Python23\Scripts>imapfilter.py -b SpamBayes IMAP Filter Alpha1, version 0.01 (May 2003), using SpamBayes IMAP Filter Web Interface Alpha1, version 0.01 and engine SpamBayes Beta2, version 0.2 (July 2003). Traceback (most recent call last): File "C:\Program Files\Python23\Scripts\imapfilter.py", line 803, in ? run() File "C:\Program Files\Python23\Scripts\imapfilter.py", line 735, in run classifier = storage.open_storage(bdbname, useDBM) AttributeError: 'module' object has no attribute 'open_storage' > > When I try to run a training session (with "Deleted Items" set as > > the folder that contains spam for training) I get this message: >[...] > > File "C:\PROGRA~1\Python23\lib\imaplib.py", line 616, in select > > raise self.readonly('%s is not writable' % mailbox) > > imaplib.readonly: Deleted Items is not writable > > > > I get the same error regardless of which folder I try to use > > for training. > >There was another message from someone with this same error just recently, >although I hadn't heard of it until then. To me, it looks like you (with >the username/password in the config file) don't have permission to write to >that folder. imapfilter needs to be able to write to all the folders that >it uses. If you login (using the same username/password) to the account in >your normal mail client, are you able to make changes (add a message, for >example) to these folders? I'm using Eudora as a mail client. I can compose a message and store it in my local Out Folder, then transfer it to the Deleted Items (there's no option to save a message directly to an imap folder in Eudora). When it's there, I can open the message. I can delete it, too. I can't tell if this means I have writing permission. I've approached my sysop with this problem. He says that only the server has the permission to write to the imap folders. This sounds to me as though my Eudora were talking to some sort of interface which relays commands to the server without having any writing permissions itself. Unfortunately, I'm completely ignorant of the inner workings of the imap system. Thanks for your help! Rafael Scholl * Checked for virus by F-Secure Content Scanner 6.10b36 * From wsy at merl.com Sat Sep 6 11:29:07 2003 From: wsy at merl.com (Bill Yerazunis) Date: Sat Sep 6 10:29:12 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2AF3D3E9@cliff.bai.org> (rmalayter@bai.org) References: <792DE28E91F6EA42B4663AE761C41C2AF3D3E9@cliff.bai.org> Message-ID: <200309061429.h86ET7r16035@localhost.localdomain> From: "Ryan Malayter" > Statistically speaking, HTML mail is > either from a spammer or from a clueless > git, and in either case can usually be > delayed without penalty or discarded outright. As indicated above, I do not think this analysis is true anymore. And characterizing someone as a clueless git because they don't change their mail client's default message format or "love" plain text... Well, let us know when you get back to the real world. Um.... you're arguing politics of desire against actual measured statistics. In my current CRM114 corpus (which is running realtime and delivering better accuracy than I myself can deliver- well over 99.9%): SingleToken Spam Nonspam

49 0
207 32 48 0 I'm getting this error running pop3proxy. Anyone know what could be causing it? $ pop3proxy.py -b Traceback (most recent call last): File "/usr/bin/pop3proxy.py", line 97, in ? import spambayes.message File "/usr/lib/python2.2/site-packages/spambayes/message.py", line 137, in ? msginfoDB = MessageInfoDB(message_info_db_name) File "/usr/lib/python2.2/site-packages/spambayes/message.py", line 113, in __init__ self.dbm = dbmstorage.open(self.db_name, self.mode) File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", line 59, in open return f(*args) File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", line 41, in open_best return f(*args) File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", line 23, in open_gdbm return gdbm.open(*args) gdbm.error: Bad magic number _________ Stephen From tdickenson at devmail.geminidataloggers.co.uk Sat Sep 6 16:58:59 2003 From: tdickenson at devmail.geminidataloggers.co.uk (Toby Dickenson) Date: Sat Sep 6 10:59:03 2003 Subject: [Spambayes] kmail In-Reply-To: <20030905205340.M37056@theboulets.net> References: <20030905205340.M37056@theboulets.net> Message-ID: <200309061558.59035.tdickenson@devmail.geminidataloggers.co.uk> On Friday 05 September 2003 21:53, stephen wrote: > BTW, anyone working on kmail integration? Im using spambayes with kmail, using kmail filters for classification. For training, I have a script that performs a 'full' retrain assuming everything in my 'spam' folder is spam, and everything in every other folder is ham. Ive occasionally looked at deeper integration, but the kde cvs trunk is too bleeding-edge for me at the moment. -- Toby Dickenson From thuan_q_tran at yahoo.com Sat Sep 6 15:03:04 2003 From: thuan_q_tran at yahoo.com (thuan tran) Date: Sat Sep 6 17:03:09 2003 Subject: [Spambayes] Plug in installation question Message-ID: <20030906210304.16281.qmail@web40406.mail.yahoo.com> Hi, Spambayes .7, outlook 2k, win2k patch 4 The installation went without error, and Outlook displays the SpamBaye button. I can set the Spam information in the manager. However, the tools->options->others->advanced->Com Addin is empty, and "Delete as Spam" complains that SpamBayes is not enabled. The FAQ mentioned that SpamBayes needs to be re-installed but the installation went without an error. Any pointers are appreciated, Thanks __________________________________ Do you Yahoo!? Yahoo! SiteBuilder - Free, easy-to-use web site design software http://sitebuilder.yahoo.com From postmaster at ducks.org Sun Sep 7 05:30:18 2003 From: postmaster at ducks.org (MTASieve) Date: Sun Sep 7 00:30:20 2003 Subject: [Spambayes] [MTASieve:d0b1gzh6] Attachment block message notification Message-ID: <1kbmsjvk60c511> E-mail from: Postmaster@ducks.org A message containing an attachment type which is not permitted. You may want to contact your system administrator for more information. Information about the problem message: FROM: spambayes@python.org TO: ajones@ducks.org Subject: Re: Approved Attachment Name: details.pif From mhammond at skippinet.com.au Sun Sep 7 16:57:35 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun Sep 7 01:57:35 2003 Subject: [Spambayes] Plug in installation question In-Reply-To: <20030906210304.16281.qmail@web40406.mail.yahoo.com> Message-ID: <079c01c37504$ef76dfd0$f502a8c0@eden> > Spambayes .7, outlook 2k, win2k patch 4 > > The installation went without error, and Outlook > displays > the SpamBaye button. I can set the Spam information > in the manager. However, the > > tools->options->others->advanced->Com Addin > > is empty, and "Delete as Spam" complains that > SpamBayes > is not enabled. The FAQ mentioned that SpamBayes needs > to be re-installed but the installation went without > an error. Go to the "SpamBayes Manager", and click the "About" button - this has information about the configuration process. Then you must ensure you have clicked the "Enable Spambayes" button in the manager (it will be disabled until you have configured it correctly) Mark. From rogerb at netspeed.com.au Sun Sep 7 18:55:30 2003 From: rogerb at netspeed.com.au (Roger Booth) Date: Sun Sep 7 03:56:49 2003 Subject: [Spambayes] Spam for Training Message-ID: <003701c37515$695c1030$04ab21cb@rogerboo> Where can I get some spam to train Spambayes. It needs to be in Windows format. Thanks RB From m0davis at pacbell.net Sun Sep 7 04:38:39 2003 From: m0davis at pacbell.net (Martin Davis) Date: Sun Sep 7 06:38:44 2003 Subject: [Spambayes] Help with pop3 mail proxy Message-ID: <20030907103839.10422.qmail@web80410.mail.yahoo.com> The user name for my real email account is "m0davis@pacbell.net", not "m0davis". When I go to the Spambayes configuration page and try to configure the user name to be m0davis@pacbell.net, it tells me: 'm0davis@pacbell.net' is not a value valid for [POP3 Proxy Options] User name Am I doing something wrong? Or does Spambayes need to be changed to support user names like mine? Thanks, -Martin From richie at entrian.com Sun Sep 7 14:47:35 2003 From: richie at entrian.com (Richie Hindle) Date: Sun Sep 7 08:47:44 2003 Subject: [Spambayes] Help with pop3 mail proxy In-Reply-To: <20030907103839.10422.qmail@web80410.mail.yahoo.com> References: <20030907103839.10422.qmail@web80410.mail.yahoo.com> Message-ID: Hi Martin, > The user name for my real email account is > "m0davis@pacbell.net", not "m0davis". When I go to > the Spambayes configuration page and try to configure > the user name to be m0davis@pacbell.net [...] I think you've misunderstood an option somewhere. The configuration page doesn't need to know the username for your email account. Are you perhaps looking at "User name: If you activated the HTTP authentication option, you can modify the authorized user name here."? That's for configuring a username and password for the Spambayes web interface - it has nothing to do with your email account. Or have I misunderstood what you're looking at? -- Richie Hindle richie@entrian.com From schlosser.voytek at sympatico.ca Sun Sep 7 11:13:34 2003 From: schlosser.voytek at sympatico.ca (Rudy Voytek) Date: Sun Sep 7 10:13:47 2003 Subject: [Spambayes] Outlook Express Message-ID: <000801c3754a$3b233a60$6933fea9@user6ksvftyr1s> spambayes.sourceforge.net/windows.html Does not let me use it for Outlook Express on XP op system. Do you have a version that will work on Express? Best regards....Rudy Voytek From ltrag at optonline.net Sun Sep 7 11:25:23 2003 From: ltrag at optonline.net (Lanny Trager) Date: Sun Sep 7 10:25:30 2003 Subject: [Spambayes] Outlook Express In-Reply-To: <000801c3754a$3b233a60$6933fea9@user6ksvftyr1s> References: <000801c3754a$3b233a60$6933fea9@user6ksvftyr1s> Message-ID: <200309071025.23416.ltrag@optonline.net> On Sunday 07 September 2003 10:13, Rudy Voytek wrote: > spambayes.sourceforge.net/windows.html > > Does not let me use it for Outlook Express on XP op system. > Do you have a version that will work on Express? > > Best regards....Rudy Voytek > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html Rudy, This is covered in the FAQ: http://spambayes.sourceforge.net/faq.html#does-spambayes-work-with-outlook-express Lanny From ltrag at optonline.net Sun Sep 7 11:26:07 2003 From: ltrag at optonline.net (Lanny Trager) Date: Sun Sep 7 10:26:14 2003 Subject: [Spambayes] Outlook Express In-Reply-To: <000801c3754a$3b233a60$6933fea9@user6ksvftyr1s> References: <000801c3754a$3b233a60$6933fea9@user6ksvftyr1s> Message-ID: <200309071026.07155.ltrag@optonline.net> On Sunday 07 September 2003 10:13, Rudy Voytek wrote: > spambayes.sourceforge.net/windows.html > > Does not let me use it for Outlook Express on XP op system. > Do you have a version that will work on Express? > > Best regards....Rudy Voytek > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html Rudy, This is covered in the FAQ: http://spambayes.sourceforge.net/faq.html#does-spambayes-work-with-outlook-express Lanny From stephen at theboulets.net Sun Sep 7 10:54:15 2003 From: stephen at theboulets.net (Stephen Boulet) Date: Sun Sep 7 10:54:28 2003 Subject: [Spambayes] Error running pop3proxy.py In-Reply-To: <20030906144244.M59552@theboulets.net> References: <20030906144244.M59552@theboulets.net> Message-ID: <200309070954.15653.stephen@theboulets.net> After further experimenting, I found it works if I deleted all of the spambayes related files from the directory from which I'm running the program: _pop3proxyham.mbox bayescustomize.ini pop3proxy-ham-cache/ pop3proxy-unknown-cache/ _pop3proxyspam.mbox hammie.db pop3proxy-spam-cache/ spambayes.messageinfo.db I still have the problem of not being able to run as a user. Anyone know how to give myself permission? Here's the message (using v. 1.0a5): Loading database... Traceback (most recent call last): File "/usr/bin/pop3proxy.py", line 820, in ? run() File "/usr/bin/pop3proxy.py", line 814, in run start(state=state) File "/usr/bin/pop3proxy.py", line 752, in start main(state.servers, state.proxyPorts, state.uiPort, state.launchUI) File "/usr/bin/pop3proxy.py", line 729, in main _createProxies(servers, proxyPorts) File "/usr/bin/pop3proxy.py", line 707, in _createProxies listener = BayesProxyListener(server, serverPort, proxyPort) File "/usr/bin/pop3proxy.py", line 299, in __init__ Dibbler.Listener.__init__(self, proxyPort, BayesProxy, proxyArgs) File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 267, in __init__ self.bind(port) File "/usr/lib/python2.2/asyncore.py", line 312, in bind return self.socket.bind (addr) socket.error: (13, 'Permission denied') On Saturday 06 September 2003 09:42 am, stephen wrote: > I'm getting this error running pop3proxy. Anyone know what could be causing > it? > > $ pop3proxy.py -b > Traceback (most recent call last): > File "/usr/bin/pop3proxy.py", line 97, in ? > import spambayes.message > File "/usr/lib/python2.2/site-packages/spambayes/message.py", line 137, > in ? > msginfoDB = MessageInfoDB(message_info_db_name) > File "/usr/lib/python2.2/site-packages/spambayes/message.py", line 113, > in __init__ > self.dbm = dbmstorage.open(self.db_name, self.mode) > File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", line 59, > in open > return f(*args) > File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", line 41, > in open_best > return f(*args) > File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", line 23, > in open_gdbm > return gdbm.open(*args) > gdbm.error: Bad magic number > > _________ > Stephen > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -- Stephen From here to there and there to here, funny things are everywhere. -- Dr Seuss From francesca.tagliente at tin.it Sun Sep 7 20:04:02 2003 From: francesca.tagliente at tin.it (Cantini Tagliente) Date: Sun Sep 7 13:04:15 2003 Subject: [Spambayes] (no subject) Message-ID: <000901c37562$0a676960$b76b7550@APTIVA> From m0davis at pacbell.net Sun Sep 7 11:22:06 2003 From: m0davis at pacbell.net (Martin Stone Davis) Date: Sun Sep 7 13:22:16 2003 Subject: [Spambayes] Re: Help with pop3 mail proxy Message-ID: <3F5B693E.8000802@pacbell.net> Forget it. That field has absolutely nothing to do with my POP3 user account. I figured out my real problem was my mail server was acting strange. Seems to work now! Thanks and sorry for the stupid question! From shalehperry at comcast.net Sun Sep 7 12:54:35 2003 From: shalehperry at comcast.net (Sean 'Shaleh' Perry) Date: Sun Sep 7 14:54:40 2003 Subject: [Spambayes] DB_RUNRECOVERY error Message-ID: <200309071154.35194.shalehperry@comcast.net> I have been using Spambayes on Linux happily for about two months now. I stopped the process for a machine reboot about 10 days ago. Started it up again, all was well. About 2 days later I started getting this: Exception bsddb._db.DBRunRecoveryError: (-30982, 'DB_RUNRECOVERY: Fatal error, run database recovery -- fatal region error detected; run recovery') in > ignored or a message like it in every email and my filtering stopped. Right as the spam deluge from the viruses really got going. Needless to say, you don't know how good you have it until it stops working (-: I have two questions: 1) what could have caused it? 2) how can I fix it? Sure, I can start all over but I am concerned about this happening again. I just had the system trained well enough. From skip at pobox.com Sat Sep 6 17:50:02 2003 From: skip at pobox.com (Skip Montanaro) Date: Sun Sep 7 16:37:01 2003 Subject: [Spambayes] RE: Spambayes 1.0a5 released In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E593D@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13031E593D@its-xchg4.massey.ac.nz> Message-ID: <16218.22154.867238.101615@montanaro.dyndns.org> Tony> I'm not Mark, but I vote it stays. Ditto. In addition to the reasons Tony mentioned, it is handy for debugging to have the source. It allows users to perhaps provide more input for Mark and the other Windows wizards when trying to solve problems which can't (easily) be duplicated by the developers. Skip From skip at pobox.com Sun Sep 7 16:51:58 2003 From: skip at pobox.com (Skip Montanaro) Date: Sun Sep 7 16:51:58 2003 Subject: [Spambayes] DB_RUNRECOVERY error In-Reply-To: <200309071154.35194.shalehperry@comcast.net> References: <200309071154.35194.shalehperry@comcast.net> Message-ID: <16219.39534.403441.147629@montanaro.dyndns.org> Sean> I have been using Spambayes on Linux happily for about two months Sean> now. I stopped the process for a machine reboot about 10 days Sean> ago. Started it up again, all was well. About 2 days later I Sean> started getting this: Sean> Exception bsddb._db.DBRunRecoveryError: (-30982, 'DB_RUNRECOVERY: Sean> Fatal error, run database recovery -- fatal region error detected; Sean> run recovery') in ignored ... Sean> 1) what could have caused it? Sean> 2) how can I fix it? I've been in touch with the Sleepycat folks about this. I believe the problem is that the way Python's bsddb module opens a database when you call bsddb.hashopen() is not thread-safe. You *might* be able to recover the file using the db_recover command (if you installed Sleepycat's library from source or have the right packages installed on your Linux system you should have it), but The Sleepycat folks tell me that's not the right way to do things. You didn't mention which SpamBayes app you were using. I've got some mail from them. I'll try and dredge it up and post it when I have a moment. Skip From skip at pobox.com Sun Sep 7 16:56:21 2003 From: skip at pobox.com (Skip Montanaro) Date: Sun Sep 7 16:57:44 2003 Subject: [Spambayes] Invitation to join the Spambayes group (fwd) Message-ID: <16219.39797.838435.528127@montanaro.dyndns.org> japeters@pacbell.net has invited to join the Spambayes group hosted by Yahoo! Groups, a free, easy-to-use community service. ... By joining Spambayes, you will be able to exchange messages with other group members, store photos and files, coordinate events and more. ... I see you are active on the SpamBayes project. I found your email by Googling for Spambayes. I hope you will join this group so we can help others get started (Like me) Is there some way the existing spambayes@python.org mailing list is inadequate to the task? All the people who are best equipped to help already read that list. -- Skip Montanaro Got gigs? http://www.musi-cal.com/ http://www.mojam.com/ Got spam? http://spambayes.sf.net/ From richie at entrian.com Sun Sep 7 23:04:44 2003 From: richie at entrian.com (Richie Hindle) Date: Sun Sep 7 17:04:49 2003 Subject: [Spambayes] Error running pop3proxy.py In-Reply-To: <200309070954.15653.stephen@theboulets.net> References: <20030906144244.M59552@theboulets.net> <200309070954.15653.stephen@theboulets.net> Message-ID: <277nlvkc2lquqsn0tlp01euil44qb64rma@4ax.com> [Stephen] > I still have the problem of not being able to run as a user. > [...] > socket.error: (13, 'Permission denied') On unix, normal users can't bind to privileged ports, which means you can't run the pop3proxy on port 110. You need to use a high port (over 1024) instead. You can configure the port through the web interface - see the "SpamBayes Ports" option. 1110 is a popular choice. -- Richie Hindle richie@entrian.com From stephen at theboulets.net Sun Sep 7 18:51:47 2003 From: stephen at theboulets.net (Stephen Boulet) Date: Sun Sep 7 18:51:59 2003 Subject: [Spambayes] Error running pop3proxy.py In-Reply-To: <277nlvkc2lquqsn0tlp01euil44qb64rma@4ax.com> References: <20030906144244.M59552@theboulets.net> <200309070954.15653.stephen@theboulets.net> <277nlvkc2lquqsn0tlp01euil44qb64rma@4ax.com> Message-ID: <200309071751.47299.stephen@theboulets.net> I still get the error when I launch it like: pop3proxy.py -u 1110 Doesn't seem to be a firewall issue, since stopping the firewall doesn't change things. -- Stephen On Sunday 07 September 2003 04:04 pm, Richie Hindle wrote: > [Stephen] > > > I still have the problem of not being able to run as a user. > > [...] > > socket.error: (13, 'Permission denied') > > On unix, normal users can't bind to privileged ports, which means you > can't run the pop3proxy on port 110. You need to use a high port (over > 1024) instead. You can configure the port through the web interface - see > the "SpamBayes Ports" option. 1110 is a popular choice. From T.A.Meyer at massey.ac.nz Mon Sep 8 12:12:50 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 7 19:13:24 2003 Subject: [Spambayes] Error running pop3proxy.py Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5AC6@its-xchg4.massey.ac.nz> > I still get the error when I launch it like: > > pop3proxy.py -u 1110 That's not changing the address that the proxy is served on, it's changing the address that the user interface is served on. If you run "pop3proxy.py -h" it will give you some information about what the various options do. You really want to get the interface going and use the configuration page to set it up, rather than use the command line switches. =Tony Meyer From ltrag at optonline.net Sun Sep 7 20:23:41 2003 From: ltrag at optonline.net (Lanny Trager) Date: Sun Sep 7 19:23:48 2003 Subject: [Spambayes] Error running pop3proxy.py In-Reply-To: <200309071751.47299.stephen@theboulets.net> References: <20030906144244.M59552@theboulets.net> <277nlvkc2lquqsn0tlp01euil44qb64rma@4ax.com> <200309071751.47299.stephen@theboulets.net> Message-ID: <200309071923.41391.ltrag@optonline.net> On Sunday 07 September 2003 18:51, Stephen Boulet wrote: > I still get the error when I launch it like: > > pop3proxy.py -u 1110 > > Doesn't seem to be a firewall issue, since stopping the firewall doesn't > change things. > > -- Stephen > > On Sunday 07 September 2003 04:04 pm, Richie Hindle wrote: > > [Stephen] > > > > > I still have the problem of not being able to run as a user. > > > [...] > > > socket.error: (13, 'Permission denied') > > > > On unix, normal users can't bind to privileged ports, which means you > > can't run the pop3proxy on port 110. You need to use a high port (over > > 1024) instead. You can configure the port through the web interface - > > see the "SpamBayes Ports" option. 1110 is a popular choice. Stephen, The -u flag sets the user interface port which by default is 8880. Try "python pop3proxy.py -b" (without the quotes) that will bring up your default browser. And start the pop3proxy. When your browser opens, it should be at the SpamBayes Web Interface: Home. (If not, point your browser to localhost:8880) On the upper right hand side of the page there is a link "Configuration page" follow that link to the configuration page. The second box down is where you can set the ports that pop3proxy will listen to. For instance, I have pop3proxy pointed to two different pop servers, therefore I have the ports set to 1115,1110 (notice these ports are above 1024). These are the ports that you will point your mail client to, and you will point it to pop3proxy by replacing (in your client) the addresses of the pop servers at your isp with "localhost", and put that/those servers in the first box on the config (in SpamBayes) page seperated by a comma. I hope I am being clear. If not let me know and I'll stick with you till you get it working. Lanny -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From arohter at nolar.com Sun Sep 7 19:27:02 2003 From: arohter at nolar.com (Alon) Date: Sun Sep 7 19:27:09 2003 Subject: [Spambayes] Imapfilter never-ending appending headers Message-ID: <184540453.1062959222@[192.168.0.20]> Just upgraded to spambayes 1.0a5 from a4, and now whenever I run imapfilter -c (to classify), spambayes just keeps appending X-Spambayes headers to each email: .... Content-Transfer-Encoding: 7bit Content-Disposition: inline X-Spambayes-MailId: 1062930343 X-Spambayes-Classification: ham X-Spambayes-Spam-Probability: 1.53260861224e-07 X-Spambayes-MailId: 1062930343 X-Spambayes-Classification: ham X-Spambayes-Spam-Probability: 1.53260861224e-07 X-Spambayes-MailId: 1062930343 X-Spambayes-Classification: ham X-Spambayes-Spam-Probability: 1.53260861224e-07 X-Spambayes-MailId: 1062930343 X-Spambayes-Classification: ham X-Spambayes-Spam-Probability: 1.53260861224e-07 X-Spambayes-MailId: 1062930343 X-Spambayes-Classification: ham X-Spambayes-Spam-Probability: 1.53260861224e-07 X-Spambayes-MailId: 1062930343 .... This never used to happen before. Also, when I run the imapfilter training mode (imapfilter -t) on an existing database, it does a complete retraining instead of just training the new mail as it did in a4. I haven't changed anything from the a4->a5 transition, except change the spambayes symbolic link to --> spambayes-1.0a5/. Here's my .ini file: [Headers] include_evidence:False include_score:True [Storage] messageinfo_storage_file:/var/spambayes/main.messagei nfo.db persistent_storage_file:/var/spambayes/main.db [imap] password:xxxxx server:xxxxx.com username:nolar ham_train_folders:INBOX.Accounts,INBOX.Sent-Mail spam_folder:INBOX.SPAM-NEWSPAM spam_train_folders:INBOX.SPAM unsure_folder:INBOX Is this a bug, or am I missing something? From T.A.Meyer at massey.ac.nz Mon Sep 8 12:31:13 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 7 19:31:26 2003 Subject: [Spambayes] Imapfilter never-ending appending headers Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5B08@its-xchg4.massey.ac.nz> > Just upgraded to spambayes 1.0a5 from a4, and now > whenever I run imapfilter -c (to classify), spambayes > just keeps appending X-Spambayes headers to each > email: [...] > Is this a bug, or am I missing something? It sounds like either spambayes isn't finding the message id in the headers, or it's not finding the id in the messageinfo db. If you run imapfilter with -v, do you see a message (at the top) that looks like: "Warning: no dbm modules available for MessageInfoDB"? If not, could you run imapfilter with the switch "-i4" and attach the stuff that's printed out to the bug tracker that you opened? Thanks. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 8 12:34:31 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 7 19:34:41 2003 Subject: [Spambayes] Spam for Training Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5B0F@its-xchg4.massey.ac.nz> > Where can I get some spam to train Spambayes. It needs to be > in Windows format. You're best off using your own spam, since this is what you want spambayes to classify. (If you don't get enough spam to train spambayes, then you probably don't need to use it). If you do want some for another reason (a testing corpus, for example), then you could try: =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 8 12:41:37 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 7 19:41:57 2003 Subject: [Spambayes] RE: imapfilter.py Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5B25@its-xchg4.massey.ac.nz> > > You're probably using 1.0a4, right? 1.0a5 was just released > > and one of the bugs it fixes is to do with displaying the folder > > names - hopefully this would fix that. > > Where do I find that? , which takes you to . > I tried sourceforge.net and found a new code, as far as I can > tell. When I try to run it, however, it stills calls itself "Alpha 1, > version 0.01 (May 2003)" (see below), The 1.0a5 release should say 0.02 (September 2003). It's in the spambayes-1.0a5 archive, in any case :) [...] > AttributeError: 'module' object has no attribute 'open_storage' This looks like you have tried to use the 1.0a5 imapfilter.py file with the rest of the 1.0a4 release. That's not going to work, because it relies on changes that have been made to the rest of the files. You need to use the complete 1.0a5 release. > I'm using Eudora as a mail client. I can compose a message > and store it in my local Out Folder, then transfer it to the > Deleted Items (there's no option to save a message directly to an > imap folder in Eudora). When it's there, I can open the message. > I can delete it, too. I can't tell if this means I have writing permission. It sounds like it to me. If you can store a message in the folder, then you are effectively writing it in there. > I've approached my sysop with this problem. He says that only > the server has the permission to write to the imap folders. I think what he means by this is actually writing to the location on disk. He presumably doesn't mean that the folders are all read-only (otherwise the Eudora operations above wouldn't work). If you can get 1.0a5 working, it would be good to see the output from running "imapfilter.py -t -c -i4". This will print out a whole heap of information that contains the actual IMAP responses, and might help figure out where the problem is occurring. =Tony Meyer From bob at 1776.com Sun Sep 7 20:04:31 2003 From: bob at 1776.com (Robert K. Coe) Date: Sun Sep 7 19:59:25 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <200309061429.h86ET7r16035@localhost.localdomain> Message-ID: <000001c37594$66c78ff0$6501a8c0@CambridgeMA.GOV> I wonder if you may be overlooking something that could skew your statistics. My experience has been that when I create an HTML message, Outlook actually sends it as a multi-part MIME construct incorporating both HTML and plain-text forms of the message. If the recipient reads the message with an HTML-capable email reader, he'll see the HTML form of the message; otherwise he'll see the plain-text form. If you're collecting your statistics with a plain-text mail reader, or if you're looking only at the plain-text version in a multi-part message, you may be understating the actual use of HTML in messages sent to you. In fact, if someone knows how to get Outlook to stop sending a plain-text version of HTML messages, I'd like to hear about it. Now that almost everybody can read HTML messages, I think the plain-text version is superfluous. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: Bill Yerazunis [mailto:wsy@merl.com] > Sent: Saturday, September 06, 2003 10:29 AM > To: rmalayter@bai.org > Cc: spambayes@python.org > Subject: Re: [Spambayes] how spambayes handles image-only spams > > > > From: "Ryan Malayter" > > > Statistically speaking, HTML mail is > > either from a spammer or from a clueless > > git, and in either case can usually be > > delayed without penalty or discarded outright. > > As indicated above, I do not think this analysis is true anymore. And > characterizing someone as a clueless git because they don't change their > mail client's default message format or "love" plain text... Well, let > us know when you get back to the real world. > > Um.... you're arguing politics of desire against actual measured > statistics. > > In my current CRM114 corpus (which is running realtime and delivering > better accuracy than I myself can deliver- well over 99.9%): > > SingleToken Spam Nonspam > >

49 0 >
207 32 > 48 0 > > Other HTML tokens have similar statistics. The margin of error on > each of these (aliasing probability) is 1 - 1/2^64, in other words, a > few billionths of a percent of a chance that this is due to aliasing > in the database. > > E pur si moivre, dude. E pur si moivre. > > -Bill Yerazunis From tim.one at comcast.net Sun Sep 7 21:04:43 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 7 20:04:47 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2AF3D3EC@cliff.bai.org> Message-ID: [Ryan Malayter] > ... > But as you mentioned, the "everything is a token" approach hasn't > worked all that well with SpamBayes and HTML. Perhaps it would fare > much better with a "sliding window" tokenizer such as that used by > CRM-114. I'm betting CRM-114 would hammer "img src http" and other > image-related tag sequences with my corpus, I bet it would too, although you seem to be wishing away the "=value" parts of tags here; I don't think CRM114 ignores them. > while leaving

,
and the like alone. So give it a try and report back <0.1 wink>. > Has anyone looked at implementing a sparse windowing system similar to > CRM-114's in SpamBayes? Yes, and there's a lot about it in the archives, probably hard to find now but too detailed to be worth the effort of summarizing now. > Intuitively, it seems like this would do much better with HTML tags For the kinds of people (unlike you) who get just a little bit of HTML ham, I expect it would be even worse than the first spambayes was for them (last year, at the start, spambayes didn't throw out HTML decorations; and that was a disaster, as we've covered, for most spambayes users who have little HTML ham; I expect CRM114 would penalize the mere presence of HTML even more heavily than that spambayes did). > as well as mail header information, at the expense of CPU and DB > storage. Both of those, yes. The use of hashing (which current CRM114 may or may not do anymore) also caused baffling mistakes, where "baffling" == "makes no sense to humans". From tim.one at comcast.net Sun Sep 7 21:08:04 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 7 20:08:08 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <004101c373a1$9e455a60$0201a8c0@swapwizard.com> Message-ID: [Sean True] > HTML mail works just fine. I get TONS of html ham, and I know our > customers do. Since they pay us to make sure that InBoxer/SpamAtBay I bet Laura would tell you to stick to one name, for marketing clarity. > work for them, and yell when it doesn't, I'm pretty sure the tokenizer > work here was _well_ worth while. That said, it might be good > to have a button for: "I hate html mail, tokenize the html mark up > and let the chips fall where they may" We used to have an option for that, very early on; I expect the code implementing it went away before you saw this project; it would be easy to add back, although split-on-whitespace seems a poor tokenization strategy for encoded stuff (e.g., bgcolor="#33cccc"> 12 characters), spambayes goes to compress it into a synthesized "skip" token). IOW, a crap job would be easy to slam back in, but a good job harder -- I lack interest for either and time for the latter. From wsy at merl.com Sun Sep 7 21:12:55 2003 From: wsy at merl.com (Bill Yerazunis) Date: Sun Sep 7 20:12:59 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <000001c37594$66c78ff0$6501a8c0@CambridgeMA.GOV> (bob@1776.com) References: <000001c37594$66c78ff0$6501a8c0@CambridgeMA.GOV> Message-ID: <200309080012.h880Ctv00743@localhost.localdomain> From: "Robert K. Coe" I wonder if you may be overlooking something that could skew your statistics. My experience has been that when I create an HTML message, Outlook actually sends it as a multi-part MIME construct incorporating both HTML and plain-text forms of the message. If the recipient reads the message with an HTML-capable email reader, he'll see the HTML form of the message; otherwise he'll see the plain-text form. If you're collecting your statistics with a plain-text mail reader, or if you're looking only at the plain-text version in a multi-part message, you may be understating the actual use of HTML in messages sent to you. Actually, no. I read email with Emacs, and I get the _WHOLE_ text (headers and everything, multipart mimes, all that) just as it was recieved on the SMTP port. Additionally, I get a postprocessed section (not an attachment) with all of the base64's expanded, <--interupptus--> comments removed, etc. That's what I feed back into the learning cycle; so I even get things you probably don't get, like KOI-8 russian text and text that had been so sliced up by spammus interruptus that you can't read it. Do you do any reassembly, or is there any chance that you are not getting the ASCII text if there is any? In fact, if someone knows how to get Outlook to stop sending a plain-text version of HTML messages, I'd like to hear about it. Now that almost everybody can read HTML messages, I think the plain-text version is superfluous. No, you have it the other way 'round. The HTML version is superfluous, the plain text is all you need. :-) -Bill Yerazunis From rafael.scholl at bbox.ch Mon Sep 8 03:23:31 2003 From: rafael.scholl at bbox.ch (Rafael Scholl) Date: Sun Sep 7 20:23:41 2003 Subject: [Spambayes] RE: imapfilter.py In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E5B25@its-xchg4.massey. ac.nz> Message-ID: <6.0.0.9.0.20030908014451.01b76fa8@mail.bbox.ch> >This looks like you have tried to use the 1.0a5 imapfilter.py file with >the rest of the 1.0a4 release. Yes, that's precisely what I did. >If you can get 1.0a5 working, it would be good to see the output from >running "imapfilter.py -t -c -i4". This will print out a whole heap of >information that contains the actual IMAP responses, and might help >figure out where the problem is occurring. Ok, I got the 1.0a5 files and ran "setup.py install". (Should I have uninstalled the previous package first?) In the web interface, the folder names still appear garbled with the first letter and the last missing for all folders except for Deleted Items and Sent Items. Additionally, when I tried to save the training folder this error appeared: 500 Server error Traceback (most recent call last): File "C:\PROGRA~1\Python23\lib\site-packages\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "C:\PROGRA~1\Python23\lib\site-packages\spambayes\ImapUI.py", line 250, in onChangeopts UserInterface.UserInterface.onChangeopts(self, **parms) File "C:\PROGRA~1\Python23\lib\site-packages\spambayes\UserInterface.py", line 665, in onChangeopts errmsg = self.verifyInput(parms, pmap) UnboundLocalError: local variable 'pmap' referenced before assignment However, it was possible to set the configure the mail server and the user name and password. I then edited bayescustomize.ini and included "spam_train_folders:Deleted Items" manually. With this done I ran "imapfilter.py -t -c -i4" and this was the result: C:\Program Files\Python23\Scripts>imapfilter.py -t -c -i4 SpamBayes IMAP Filter Alpha2, version 0.02 (September 2003), using SpamBayes IMAP Filter Web Interface Alpha2, version 0.02 and engine SpamBayes Beta2, version 0.2 (July 2003). 15:53.90 > BOBC1 LOGIN "name@mailserver.ch" "password" 15:53.92 < BOBC1 OK LOGIN completed 15:53.92 > BOBC2 SELECT "Deleted Items" 15:53.93 < * FLAGS (\Answered \Flagged \Deleted \Seen \Draft) 15:53.93 < * 0 EXISTS 15:53.93 < * 0 RECENT 15:53.93 < * OK [PERMANENTFLAGS (\Answered \Flagged \Deleted \Seen \Draft \*)] 15:53.95 < * OK [UNSEEN 0] 15:53.95 < * OK [UIDVALIDITY 433788907] UIDs are valid 15:53.95 < BOBC2 OK [READ-WRITE] opened Deleted Items 15:53.95 > BOBC3 UID SEARCH UNDELETED 15:53.96 < BOBC3 OK UID SEARCH completed Traceback (most recent call last): File "C:\Program Files\Python23\Scripts\imapfilter.py", line 803, in ? run() File "C:\Program Files\Python23\Scripts\imapfilter.py", line 789, in run imap_filter.Train() File "C:\Program Files\Python23\Scripts\imapfilter.py", line 630, in Train num_spam_trained = folder.Train(self.classifier, True) File "C:\Program Files\Python23\Scripts\imapfilter.py", line 542, in Train for msg in self: File "C:\Program Files\Python23\Scripts\imapfilter.py", line 468, in __iter__ for key in self.keys(): File "C:\Program Files\Python23\Scripts\imapfilter.py", line 490, in keys return response[1][0].split(' ') AttributeError: 'NoneType' object has no attribute 'split' So it seems to me that there's some sort of new problem, but I can't make heads or tails of it. Rafael Scholl * Checked for virus by F-Secure Content Scanner 6.10b36 * From wsy at merl.com Sun Sep 7 21:25:00 2003 From: wsy at merl.com (Bill Yerazunis) Date: Sun Sep 7 20:25:04 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: References: Message-ID: <200309080025.h880P0800760@localhost.localdomain> From: "Tim Peters" [Ryan Malayter] > ... > But as you mentioned, the "everything is a token" approach hasn't > worked all that well with SpamBayes and HTML. Perhaps it would fare > much better with a "sliding window" tokenizer such as that used by > CRM-114. I'm betting CRM-114 would hammer "img src http" and other > image-related tag sequences with my corpus, I bet it would too, although you seem to be wishing away the "=value" parts of tags here; I don't think CRM114 ignores them. It both ignores them and does NOT ignore them. That's the beauty of the sparse binary polynomial hash - if you have the phrase: beware the jabberwock my son you get all these token features: beware beware the beware jabberwock beware the jabberwock beware my beware the my beware jabberwock my beware the jabberwock my beware son beware the son beware jabberwock son beware the jabberwock son beware my son beware the my son beware jabberwock my son beware the jabberwock my son so you both would, and would not get features corresponding to =value stuff. Now, both the woulds and would-nots get trained in, but since the would-nots would statistically dominate the woulds, you have no need to worry about the woulds; they'd be (eventually) groomed out of the database as room was needed for other tokens. For the kinds of people (unlike you) who get just a little bit of HTML ham, I expect it would be even worse than the first spambayes was for them (last year, at the start, spambayes didn't throw out HTML decorations; and that was a disaster, as we've covered, for most spambayes users who have little HTML ham; I expect CRM114 would penalize the mere presence of HTML even more heavily than that spambayes did). Only as heavily as the statistics indicate. CRM114 has no ingrained notion of "good" or "bad", only a statistical aggregation of those ideals. If YOUR mail has a lot of HTML spam, then yeah, it will weight against. If you have a lot of good HTML mail, then there will not be any significant prejudice. You will also get weights according to the kind of HTML tags you get. If your spam uses a lot of tables and font colors, but your regular email does not, then ,
, etc will have "spam" statistics but

,
, and

won't. > as well as mail header information, at the expense of CPU and DB > storage. Both of those, yes. The use of hashing (which current CRM114 may or may not do anymore) also caused baffling mistakes, where "baffling" == "makes no sense to humans". It still uses hashing, but now with effectively a 64-bit key that _is_ checked. The chance of a hash clash is something like 10^-20th, a number so small that I don't know the pseudogreek prefix for it but it's smaller than a nano-nano-chance. -Bill Yerazunis From shalehperry at comcast.net Sun Sep 7 18:25:53 2003 From: shalehperry at comcast.net (Sean 'Shaleh' Perry) Date: Sun Sep 7 20:26:04 2003 Subject: [Spambayes] DB_RUNRECOVERY error In-Reply-To: <16219.39534.403441.147629@montanaro.dyndns.org> References: <200309071154.35194.shalehperry@comcast.net> <16219.39534.403441.147629@montanaro.dyndns.org> Message-ID: <200309071725.53385.shalehperry@comcast.net> On Sunday 07 September 2003 13:51, Skip Montanaro wrote: > > I've been in touch with the Sleepycat folks about this. I believe the > problem is that the way Python's bsddb module opens a database when you > call bsddb.hashopen() is not thread-safe. You *might* be able to recover > the file using the db_recover command (if you installed Sleepycat's library > from source or have the right packages installed on your Linux system you > should have it), but The Sleepycat folks tell me that's not the right way > to do things. You didn't mention which SpamBayes app you were using. > > I've got some mail from them. I'll try and dredge it up and post it when I > have a moment. > sorry about that. spambayes 1.0a4 using pop3proxy. Sounds like I get to retrain. Joy. Is the berkeley db the best choice for me next time? From tim.one at comcast.net Sun Sep 7 21:26:53 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 7 20:26:56 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <200309061429.h86ET7r16035@localhost.localdomain> Message-ID: [Bill Yerazunis] >>> Statistically speaking, HTML mail is either from a spammer or from a >>> clueless git, and in either case can usually be delayed without >>> penalty or discarded outright. [Ryan Malayter] >> As indicated above, I do not think this analysis is true anymore. >> And characterizing someone as a clueless git because they don't >> change their mail client's default message format or "love" plain >> text... Well, let us know when you get back to the real world. [Bill] > Um.... you're arguing politics of desire against actual measured > statistics. > > In my current CRM114 corpus (which is running realtime and delivering > better accuracy than I myself can deliver- well over 99.9%): > > SingleToken Spam Nonspam > >

49 0 >
207 32 >

48 0 > > Other HTML tokens have similar statistics. The margin of error on > each of these (aliasing probability) is 1 - 1/2^64, in other words, a > few billionths of a percent of a chance that this is due to aliasing > in the database. You two get very different kinds of email mixes, and that's all there is to this. And someone who uses Emacs to read email is by definition too geeky to be representative of anyone in the real world . > E pur si moivre, dude. E pur si moivre. There more kinds of email users in heaven and earth than are dreamt of in your classifier, Bill. I don't get an email mix anything like my sisters get either, except for the spam. I know they got a lot more HTML ham than I get, and sounds like Ryan gets even more than they do. So it goes -- people are different. From arohter at nolar.com Sun Sep 7 20:42:16 2003 From: arohter at nolar.com (Alon) Date: Sun Sep 7 20:42:24 2003 Subject: [Spambayes] Imapfilter never-ending appending headers Message-ID: <189054078.1062963736@[192.168.0.20]> Here's a -v -c output: Loading state from /var/spambayes/arohter.db database /var/spambayes/arohter.db is an existing database, with 442 spam and 1953 ham Loading database /var/spambayes/arohter.db... Done. Classifying ***************** Classified 17 ham, 0 spam, and 0 unsure. Classifying took 27.2827889919 seconds. I also attached the verbose -i4 output to my original bug report ( ) Hmmm...checking my /var/spambayes/arohter.messageinfo.db stats, it doesn't look like it's being updated...the timestamp never changes (and file size stays at only 12,288 bytes), even after several classification and training runs (with new mail). aLoN >> Just upgraded to spambayes 1.0a5 from a4, and now >> whenever I run imapfilter -c (to classify), spambayes >> just keeps appending X-Spambayes headers to each >> email: > [...] >> Is this a bug, or am I missing something? > > It sounds like either spambayes isn't finding the message id in the > headers, or it's not finding the id in the messageinfo db. If you run > imapfilter with -v, do you see a message (at the top) that looks like: > "Warning: no dbm modules available for MessageInfoDB"? > > If not, could you run imapfilter with the switch "-i4" and attach the > stuff that's printed out to the bug tracker that you opened? Thanks. > > =Tony Meyer > From tim.one at comcast.net Sun Sep 7 21:42:54 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 7 20:42:58 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <200309080025.h880P0800760@localhost.localdomain> Message-ID: [Bill Yerazunis] > It both ignores them and does NOT ignore them. That's the beauty > of the sparse binary polynomial hash - if you have the phrase: > > beware the jabberwock my son > > you get all these token features: > > beware > beware the > beware jabberwock > beware the jabberwock > beware my > beware the my > beware jabberwock my > beware the jabberwock my > beware son > beware the son > beware jabberwock son > beware the jabberwock son > beware my son > beware the my son > beware jabberwock my son > beware the jabberwock my son > > so you both would, and would not get features corresponding to =value > stuff. I don't know -- it depends on how you split the bytestream into units. When Ryan hypothesized he'd see "img src http", it's assuming all sorts of stuff, such as that you drop the "<" presumably preceding "img", drop the '="' presumably following "src", fold case, and so on. > Now, both the woulds and would-nots get trained in, but since the > would-nots would statistically dominate the woulds, you have no need > to worry about the woulds; they'd be (eventually) groomed out of the > database as room was needed for other tokens. >> For the kinds of people (unlike you) who get just a little bit >> of HTML ham, I expect it would be even worse than the first spambayes >> was for them (last year, at the start, spambayes didn't throw out HTML >> decorations; and that was a disaster, as we've covered, for most >> spambayes users who have little HTML ham; I expect CRM114 would penalize >> the mere presence of HTML even more heavily than that spambayes did). > Only as heavily as the statistics indicate. Of course, but I said "for those who get just a little HTML ham" twice in the quoted paragraph. I don't think the context was unclear. > CRM114 has no ingrained notion of "good" or "bad", only a statistical > aggregation of those ideals. Same here. > If YOUR mail has a lot of HTML spam, then yeah, it will weight against. > If you have a lot of good HTML mail, then there will not be any > significant prejudice. The quoted paragraph was talking about people with little HTML ham. > You will also get weights according to the kind of HTML tags you get. > If your spam uses a lot of tables and font colors, but your regular > email does not, then ,
, etc will have "spam" > statistics but

,
, and

won't. Try it and see what happens. spambayes only uses unigrams, and we had plenty of experience with this. CRM114 will have all those unigrams too, plus a giant (compared to just the unigrams) collection of "hey, there was HTML!" pairs, triples, etc. The unigrams alone are enough to kill the classifier's usefulness for people with little (but not no) HTML ham. > ... > It still uses hashing, but now with effectively a 64-bit key that _is_ > checked. Don't know what it means to check a 64-bit key. Maybe it means you're using 64-bit hash codes now, and that a hash code leads to a chain of original tokens hashing to that code, and "check" means the incoming token is compared to the entries in the hash chain. Or maybe it doesn't . > The chance of a hash clash is something like 10^-20th, The chance of a hash clash on a pair of 64-bit keys is 2**-64, of course, but you can expect to see the first collision after hashing O(2**32) tokens. > a number so small that I don't know the pseudogreek prefix for it but > it's smaller than a nano-nano-chance. The Birthday Paradox lowers it back into the comfortably small billions . From wsy at merl.com Sun Sep 7 21:56:59 2003 From: wsy at merl.com (Bill Yerazunis) Date: Sun Sep 7 20:57:03 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: References: Message-ID: <200309080056.h880ux400814@localhost.localdomain> From: "Tim Peters" You two get very different kinds of email mixes, and that's all there is to this. And someone who uses Emacs to read email is by definition too geeky to be representative of anyone in the real world . No, I use Emacs to read the mail so that I can see what the spammers are up to. For normal use (i.e. when I'm flirting with someone), I use Yahoo Mail. :) > E pur si moivre, dude. E pur si moivre. There more kinds of email users in heaven and earth than are dreamt of in your classifier, Bill. That's one thing I _like_ about this list. At least y'all are moderately literate. :-) I don't get an email mix anything like my sisters get either, except for the spam. I know they got a lot more HTML ham than I get, and sounds like Ryan gets even more than they do. So it goes -- people are different. So noted. :-) Well, on the grounds that the SpamAssassin corpus is a little less biased, I re-ran the tests against the .css files that the SA test corpus generates (using the TOE learning strategy). Accuracy on this corpus is just over 98% for crm114, and barely 70% for me-the-human. The results for SA test corpus: Token Spam Nonspam

143 144
380 289

67 119 is totally equivocal. Does this help? :) -Bill Yerazunis From wsy at merl.com Sun Sep 7 22:10:13 2003 From: wsy at merl.com (Bill Yerazunis) Date: Sun Sep 7 21:10:19 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: References: Message-ID: <200309080110.h881ADp00889@localhost.localdomain> From: "Tim Peters" > so you both would, and would not get features corresponding to =value > stuff. I don't know -- it depends on how you split the bytestream into units. When Ryan hypothesized he'd see "img src http", it's assuming all sorts of stuff, such as that you drop the "<" presumably preceding "img", drop the '="' presumably following "src", fold case, and so on. The current tokenizing regex in the CRM114 distribution is: [[:graph:]][-.,:[:alnum:]]*[[:graph:]]? which was handcrafted to make good tokens out of a lot of stuff, from HTML to IP addresses. The < doesn't get dropped, nor would the closing > on a tag, but it _would_ break img src=foobar into img src= foobar with the usual sparse window mumbo-jumbo after that. > If your spam uses a lot of tables and font colors, but your regular > email does not, then ,
, etc will have "spam" > statistics but

,
, and

won't. Try it and see what happens. spambayes only uses unigrams, and we had plenty of experience with this. CRM114 will have all those unigrams too, plus a giant (compared to just the unigrams) collection of "hey, there was HTML!" pairs, triples, etc. The unigrams alone are enough to kill the classifier's usefulness for people with little (but not no) HTML ham. See previous message - in the SA test corpus (nasty one, too) CRM114 found that there wasn't a lot of info in the HTML... but still delivered better than 98 per cent accuracy. This is on the corpus that I myself can rarely do even 70% on manually. > ... > It still uses hashing, but now with effectively a 64-bit key that _is_ > checked. Don't know what it means to check a 64-bit key. Maybe it means you're using 64-bit hash codes now, and that a hash code leads to a chain of original tokens hashing to that code, and "check" means the incoming token is compared to the entries in the hash chain. Or maybe it doesn't . Yep. The hash is cut down and used to choose a starting place in the .css file, then the file is searched for an _exact_ match to that 64-bit hash. > The chance of a hash clash is something like 10^-20th, The chance of a hash clash on a pair of 64-bit keys is 2**-64, of course, but you can expect to see the first collision after hashing O(2**32) tokens. Um, yeah. After a couple of billion independent tokens, you might expect a collision. But since the size of the .css file is far smaller, you'd have to take into account that grooming will expire old tokens out and so you never really get that many "live" token entries at any one time. Right now, people are using 1 to 4 million tokens in their .css files. The Birthday Paradox lowers it back into the comfortably small billions . Well, what's the Birthday Paradox limit for for when you only need a birthday collision with someone in the previous 10 million or so entries? :-) -Bill Yerazunis From stephen at theboulets.net Sun Sep 7 21:47:39 2003 From: stephen at theboulets.net (Stephen Boulet) Date: Sun Sep 7 21:47:53 2003 Subject: [Spambayes] Error running pop3proxy.py In-Reply-To: <200309071923.41391.ltrag@optonline.net> References: <20030906144244.M59552@theboulets.net> <200309071751.47299.stephen@theboulets.net> <200309071923.41391.ltrag@optonline.net> Message-ID: <200309072047.39875.stephen@theboulets.net> That's what I've been doing. I do have my BROWSER variable set. $ python /usr/bin/pop3proxy.py -b SpamBayes POP3 Proxy Beta2, version 0.2 (September 2003), using SpamBayes POP3 Proxy Web Interface Alpha3, version 0.03 and engine SpamBayes Beta2, version 0.2 (July 2003). Loading database... Traceback (most recent call last): File "/usr/bin/pop3proxy.py", line 820, in ? run() File "/usr/bin/pop3proxy.py", line 814, in run start(state=state) File "/usr/bin/pop3proxy.py", line 752, in start main(state.servers, state.proxyPorts, state.uiPort, state.launchUI) File "/usr/bin/pop3proxy.py", line 729, in main _createProxies(servers, proxyPorts) File "/usr/bin/pop3proxy.py", line 707, in _createProxies listener = BayesProxyListener(server, serverPort, proxyPort) File "/usr/bin/pop3proxy.py", line 299, in __init__ Dibbler.Listener.__init__(self, proxyPort, BayesProxy, proxyArgs) File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 267, in __init__ self.bind(port) File "/usr/lib/python2.2/asyncore.py", line 312, in bind return self.socket.bind (addr) socket.error: (13, 'Permission denied') On Sunday 07 September 2003 06:23 pm, Lanny Trager wrote: > On Sunday 07 September 2003 18:51, Stephen Boulet wrote: > > I still get the error when I launch it like: > > > > pop3proxy.py -u 1110 > > > > Doesn't seem to be a firewall issue, since stopping the firewall doesn't > > change things. > > > > -- Stephen > > > > On Sunday 07 September 2003 04:04 pm, Richie Hindle wrote: > > > [Stephen] > > > > > > > I still have the problem of not being able to run as a user. > > > > [...] > > > > socket.error: (13, 'Permission denied') > > > > > > On unix, normal users can't bind to privileged ports, which means you > > > can't run the pop3proxy on port 110. You need to use a high port (over > > > 1024) instead. You can configure the port through the web interface - > > > see the "SpamBayes Ports" option. 1110 is a popular choice. > > Stephen, > > The -u flag sets the user interface port which by default is 8880. Try > "python pop3proxy.py -b" (without the quotes) that will bring up your > default browser. And start the pop3proxy. > > When your browser opens, it should be at the SpamBayes Web Interface: Home. > (If not, point your browser to localhost:8880) > > On the upper right hand side of the page there is a link "Configuration > page" follow that link to the configuration page. > > The second box down is where you can set the ports that pop3proxy will > listen to. > > For instance, I have pop3proxy pointed to two different pop servers, > therefore I have the ports set to 1115,1110 (notice these ports are above > 1024). These are the ports that you will point your mail client to, and > you will point it to pop3proxy by replacing (in your client) the addresses > of the pop servers at your isp with "localhost", and put that/those servers > in the first box on the config (in SpamBayes) page seperated by a comma. > > I hope I am being clear. If not let me know and I'll stick with you till > you get it working. > > Lanny -- Stephen From here to there and there to here, funny things are everywhere. -- Dr Seuss From mjveteto at sprynet.com Sun Sep 7 22:49:17 2003 From: mjveteto at sprynet.com (Mike Veteto) Date: Sun Sep 7 22:49:22 2003 Subject: [Spambayes] Can I configure SpamBayes to work with Netscape Message-ID: <3F5BEE2C.50863181@sprynet.com> I just read with GREAT interest about your work. While I am not current on programming skill, I still have some technical skills. Has anyone worked with the Netscape or Communicator mail programs and SpamBayes ? I would be interested in helping this effort in any way I could. I have chosen to stay with Netscape, both because old habits die hard and it seems to be less of a target than Outlook. Thanks, Mike Veteto mjveteto@sprynet.com From stephen at theboulets.net Sun Sep 7 23:17:01 2003 From: stephen at theboulets.net (Stephen Boulet) Date: Sun Sep 7 23:17:13 2003 Subject: [Spambayes] Error running pop3proxy.py In-Reply-To: <200309072047.39875.stephen@theboulets.net> References: <20030906144244.M59552@theboulets.net> <200309071923.41391.ltrag@optonline.net> <200309072047.39875.stephen@theboulets.net> Message-ID: <200309072217.01135.stephen@theboulets.net> Here's another bit of data - netcat seems to let me bind the port ("$ nc -l -p 1025" seems to work, though I don't think it's supposed to give any output), but the python code doesn't. What's up with that? -- Stephen On Sunday 07 September 2003 08:47 pm, Stephen Boulet wrote: > That's what I've been doing. I do have my BROWSER variable set. > > $ python /usr/bin/pop3proxy.py -b > SpamBayes POP3 Proxy Beta2, version 0.2 (September 2003), > using SpamBayes POP3 Proxy Web Interface Alpha3, version 0.03 > and engine SpamBayes Beta2, version 0.2 (July 2003). > > Loading database... > Traceback (most recent call last): > File "/usr/bin/pop3proxy.py", line 820, in ? > run() > File "/usr/bin/pop3proxy.py", line 814, in run > start(state=state) > File "/usr/bin/pop3proxy.py", line 752, in start > main(state.servers, state.proxyPorts, state.uiPort, state.launchUI) > File "/usr/bin/pop3proxy.py", line 729, in main > _createProxies(servers, proxyPorts) > File "/usr/bin/pop3proxy.py", line 707, in _createProxies > listener = BayesProxyListener(server, serverPort, proxyPort) > File "/usr/bin/pop3proxy.py", line 299, in __init__ > Dibbler.Listener.__init__(self, proxyPort, BayesProxy, proxyArgs) > File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 267, > in __init__ > self.bind(port) > File "/usr/lib/python2.2/asyncore.py", line 312, in bind > return self.socket.bind (addr) > socket.error: (13, 'Permission denied') > > On Sunday 07 September 2003 06:23 pm, Lanny Trager wrote: > > On Sunday 07 September 2003 18:51, Stephen Boulet wrote: > > > I still get the error when I launch it like: > > > > > > pop3proxy.py -u 1110 > > > > > > Doesn't seem to be a firewall issue, since stopping the firewall > > > doesn't change things. > > > > > > -- Stephen > > > > > > On Sunday 07 September 2003 04:04 pm, Richie Hindle wrote: > > > > [Stephen] > > > > > > > > > I still have the problem of not being able to run as a user. > > > > > [...] > > > > > socket.error: (13, 'Permission denied') > > > > > > > > On unix, normal users can't bind to privileged ports, which means you > > > > can't run the pop3proxy on port 110. You need to use a high port > > > > (over 1024) instead. You can configure the port through the web > > > > interface - see the "SpamBayes Ports" option. 1110 is a popular > > > > choice. > > > > Stephen, > > > > The -u flag sets the user interface port which by default is 8880. Try > > "python pop3proxy.py -b" (without the quotes) that will bring up your > > default browser. And start the pop3proxy. > > > > When your browser opens, it should be at the SpamBayes Web Interface: > > Home. (If not, point your browser to localhost:8880) > > > > On the upper right hand side of the page there is a link "Configuration > > page" follow that link to the configuration page. > > > > The second box down is where you can set the ports that pop3proxy will > > listen to. > > > > For instance, I have pop3proxy pointed to two different pop servers, > > therefore I have the ports set to 1115,1110 (notice these ports are above > > 1024). These are the ports that you will point your mail client to, and > > you will point it to pop3proxy by replacing (in your client) the > > addresses of the pop servers at your isp with "localhost", and put > > that/those servers in the first box on the config (in SpamBayes) page > > seperated by a comma. > > > > I hope I am being clear. If not let me know and I'll stick with you till > > you get it working. > > > > Lanny From T.A.Meyer at massey.ac.nz Mon Sep 8 16:22:18 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 7 23:23:17 2003 Subject: [Spambayes] Error running pop3proxy.py Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5CD8@its-xchg4.massey.ac.nz> > Here's another bit of data - netcat seems to let me bind the > port ("$ nc -l -p 1025" seems to work, though I don't think > it's supposed to give any output), It'll output whatever a connection to that port gives. If you telnet to localhost:1025 while netcat is running, whatever you type should be echoed on the screen. > but the python code doesn't. What's up with that? If you run "pop3proxy.py -b", then these ports may be bound: * port 8880, or whatever you have set in the "html_ui":"port" option. * whatever ports you are proxying POP3 servers on. * whatever ports you are proxying SMTP servers on. With an empty bayescustomize.ini/.spambayesrc, only the first applies. Can you definitely bind port 8880? =Tony Meyer From john at ncad.net Mon Sep 8 00:50:23 2003 From: john at ncad.net (NCADjohn) Date: Sun Sep 7 23:50:04 2003 Subject: [Spambayes] SpamBayes file list Message-ID: Mary Fran, Thank you for your article. http://www.computerworld.com/softwaretopics/software/groupware/story/0,1 0801,83684,00.html I will no longer refuse offers to subscribe to ComputerWorld! I used to subscribe while in grad school 1981-85 M.S. Computer Engineering Then i pursued the PC rev while y'all were a little slow to the track. But now, getting to the bottom of SPAM is #1. and many thanks N for getting me to you. Here is the question: At http://spambayes.sourceforge.net/background.html there is offered a glimpse at core code implementing the "filter" Appears elegant and simple. I wrote a lexical scanner (token parser) in school. SpamBayes appears to be exactly as i would have done it! Nice Woik! I would like to assess the basic file sizes. Before i explode the exe, would someone send a simple text copy (no attachment) of directory of files, so i may consider the file sizes of active code? Wunnerful! Bonus-- Still enjoy occasionally referencing EF Codd's classic RDBM articles published in CW back in the 80s. Is there electronic form of that? Thank you, sincerely, john Earth Model Do From ltrag at optonline.net Mon Sep 8 01:07:00 2003 From: ltrag at optonline.net (Lanny Trager) Date: Mon Sep 8 00:07:10 2003 Subject: [Spambayes] Error running pop3proxy.py In-Reply-To: <200309072217.01135.stephen@theboulets.net> References: <20030906144244.M59552@theboulets.net> <200309072047.39875.stephen@theboulets.net> <200309072217.01135.stephen@theboulets.net> Message-ID: <200309080007.00600.ltrag@optonline.net> Stephen, In your earlier e-mail you said that you were running the command: pop3proxy.py -u 1110 If that was indeed the command that you were running, it was changing the port that you needed to point your browser to, nothing more, nothing less. Try this, change the name of your "bayescustomize.ini" to "bayescustomize.bak" or anything but its real name so that SpamBayes doesn't see it. Then run "pop3proxy.py -b" -(or "python pop3proxy.py -b" if your setup is happier with that) just that with no other flags. Then use the web interface to set the configuration. After you have set the configuration and saved it, your new "bayescustomize.ini" should look something like this: [html_ui] display_to:True [pop3proxy] listen_ports:1110 remote_servers:pop.your_pop_server.net add_mailid_to:header [smtpproxy] listen_ports:1120 remote_servers:mail.your_smtp_server.net Let me know if that helps. Lanny From T.A.Meyer at massey.ac.nz Mon Sep 8 17:13:03 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 8 00:13:16 2003 Subject: [Spambayes] SpamBayes file list Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5D31@its-xchg4.massey.ac.nz> > I would like to assess the basic file sizes. > Before i explode the exe, would someone send a simple text copy (no > attachment) of directory of files, > so i may consider the file sizes of active code? This is what's in the 1.0a5 source release: drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/ -rwx------ rjh/None 38782 2003-09-02 09:36:42 spambayes-1.0a5/CHANGELOG.txt drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/contrib/ -rwx------ rjh/None 4661 2003-01-23 16:46:27 spambayes-1.0a5/contrib/BULK.txt -rwx------ rjh/None 8326 2003-01-29 15:23:34 spambayes-1.0a5/contrib/bulkgraph.py -rwx------ rjh/None 238 2003-01-23 16:46:27 spambayes-1.0a5/contrib/bulktrain.sh -rwx------ rjh/None 937 2003-05-04 15:10:36 spambayes-1.0a5/contrib/mod_spambayes.py -rwx------ rjh/None 1019 2003-06-05 16:28:37 spambayes-1.0a5/contrib/muttrc -rwx------ rjh/None 2547 2003-08-14 02:39:35 spambayes-1.0a5/contrib/nway.py -rwx------ rjh/None 746 2003-01-23 16:46:27 spambayes-1.0a5/contrib/procmailrc -rwx------ rjh/None 5653 2003-01-29 15:23:34 spambayes-1.0a5/contrib/SmarterHTTPServer.py -rwx------ rjh/None 5386 2003-06-11 02:03:26 spambayes-1.0a5/contrib/spambayes.el -rwx------ rjh/None 7865 2003-04-28 12:36:03 spambayes-1.0a5/dbExpImp.py -rwx------ rjh/None 114 2003-01-14 17:38:19 spambayes-1.0a5/hammie.py -rwx------ rjh/None 3190 2003-01-10 04:39:29 spambayes-1.0a5/HAMMIE.txt -rwx------ rjh/None 582 2003-06-04 03:45:48 spambayes-1.0a5/hammiecli.py -rwx------ rjh/None 5337 2003-08-14 10:49:41 spambayes-1.0a5/hammiefilter.py -rwx------ rjh/None 4024 2003-01-14 17:38:19 spambayes-1.0a5/hammiesrv.py -rwx------ rjh/None 31375 2003-09-03 12:36:59 spambayes-1.0a5/imapfilter.py -rwx------ rjh/None 18105 2003-06-22 05:06:03 spambayes-1.0a5/INTEGRATION.txt -rwx------ rjh/None 2649 2003-05-26 13:53:00 spambayes-1.0a5/LICENSE.txt -rwx------ rjh/None 5141 2003-04-11 01:28:27 spambayes-1.0a5/mailsort.py -rwx------ rjh/None 8683 2003-08-14 10:59:52 spambayes-1.0a5/mboxtrain.py -rwx------ rjh/None 13506 2003-04-28 12:36:03 spambayes-1.0a5/notesfilter.py drwx------ rjh/None 0 2003-09-04 19:37:57 spambayes-1.0a5/Outlook2000/ -rwx------ rjh/None 6664 2003-09-01 19:27:31 spambayes-1.0a5/Outlook2000/about.html -rwx------ rjh/None 61704 2003-09-01 15:54:12 spambayes-1.0a5/Outlook2000/addin.py -rwx------ rjh/None 18425 2003-08-26 14:35:45 spambayes-1.0a5/Outlook2000/config.py -rwx------ rjh/None 5949 2003-08-29 12:08:43 spambayes-1.0a5/Outlook2000/config_wizard.py -rwx------ rjh/None 1285 2002-11-18 13:40:04 spambayes-1.0a5/Outlook2000/default_bayes_customize.ini drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/Outlook2000/dialogs/ -rwx------ rjh/None 10267 2003-09-01 17:34:19 spambayes-1.0a5/Outlook2000/dialogs/async_processor.py -rwx------ rjh/None 23690 2003-08-29 18:23:50 spambayes-1.0a5/Outlook2000/dialogs/dialog_map.py -rwx------ rjh/None 12996 2003-08-27 14:11:38 spambayes-1.0a5/Outlook2000/dialogs/dlgcore.py -rwx------ rjh/None 511 2003-08-10 19:26:50 spambayes-1.0a5/Outlook2000/dialogs/dlgutils.py -rwx------ rjh/None 28343 2003-08-27 18:53:27 spambayes-1.0a5/Outlook2000/dialogs/FolderSelector.py -rwx------ rjh/None 13520 2003-08-26 18:25:33 spambayes-1.0a5/Outlook2000/dialogs/opt_processors.py -rwx------ rjh/None 3791 2003-08-27 16:09:28 spambayes-1.0a5/Outlook2000/dialogs/processors.py drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/Outlook2000/dialogs/resources/ -rwx------ rjh/None 2878 2003-08-10 19:26:50 spambayes-1.0a5/Outlook2000/dialogs/resources/folders.bmp -rwx------ rjh/None 1414 2003-08-26 22:57:44 spambayes-1.0a5/Outlook2000/dialogs/resources/rc2py.py -rwx------ rjh/None 1547 2003-08-20 14:12:48 spambayes-1.0a5/Outlook2000/dialogs/resources/rclabels2text.py -rwx------ rjh/None 12654 2003-08-26 19:23:04 spambayes-1.0a5/Outlook2000/dialogs/resources/rcparser.py -rwx------ rjh/None 30338 2003-08-30 02:48:31 spambayes-1.0a5/Outlook2000/dialogs/resources/sblogo.bmp -rwx------ rjh/None 33214 2003-08-30 02:48:31 spambayes-1.0a5/Outlook2000/dialogs/resources/sbwizlogo.bmp -rwx------ rjh/None 54 2003-08-10 19:26:50 spambayes-1.0a5/Outlook2000/dialogs/resources/__init__.py -rwx------ rjh/None 914 2003-08-26 14:27:26 spambayes-1.0a5/Outlook2000/dialogs/test_dialogs.py -rwx------ rjh/None 12683 2003-08-26 16:52:06 spambayes-1.0a5/Outlook2000/dialogs/wizard_processors.py -rwx------ rjh/None 3198 2003-08-27 09:29:08 spambayes-1.0a5/Outlook2000/dialogs/__init__.py drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/Outlook2000/docs/ -rwx------ rjh/None 11594 2003-08-26 14:35:45 spambayes-1.0a5/Outlook2000/docs/configuration.html -rwx------ rjh/None 14470 2003-08-29 18:24:48 spambayes-1.0a5/Outlook2000/docs/troubleshooting.html -rwx------ rjh/None 7250 2003-08-30 03:18:22 spambayes-1.0a5/Outlook2000/docs/welcome.html -rwx------ rjh/None 4602 2003-05-04 00:21:59 spambayes-1.0a5/Outlook2000/export.py -rwx------ rjh/None 6023 2003-08-25 13:33:11 spambayes-1.0a5/Outlook2000/filter.py drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/Outlook2000/images/ -rwx------ rjh/None 3126 2003-06-04 14:49:10 spambayes-1.0a5/Outlook2000/images/delete_as_spam.bmp -rwx------ rjh/None 3126 2003-06-04 14:49:11 spambayes-1.0a5/Outlook2000/images/recover_ham.bmp drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/Outlook2000/installer/ -rwx------ rjh/None 1700 2003-08-10 19:06:12 spambayes-1.0a5/Outlook2000/installer/crank.py -rwx------ rjh/None 253 2003-02-20 20:16:31 spambayes-1.0a5/Outlook2000/installer/README.txt -rwx------ rjh/None 1404 2003-02-20 20:16:31 spambayes-1.0a5/Outlook2000/installer/spambayes_addin.py -rwx------ rjh/None 34948 2003-08-29 12:10:33 spambayes-1.0a5/Outlook2000/manager.py -rwx------ rjh/None 50537 2003-09-01 19:29:48 spambayes-1.0a5/Outlook2000/msgstore.py -rwx------ rjh/None 3721 2003-08-27 21:06:16 spambayes-1.0a5/Outlook2000/README.txt drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/Outlook2000/sandbox/ -rwx------ rjh/None 6514 2002-11-24 00:07:15 spambayes-1.0a5/Outlook2000/sandbox/delete_outlook_field.py -rwx------ rjh/None 818 2003-06-16 16:51:29 spambayes-1.0a5/Outlook2000/sandbox/dump_profiles.py -rwx------ rjh/None 4963 2003-09-01 18:04:08 spambayes-1.0a5/Outlook2000/sandbox/dump_props.py -rwx------ rjh/None 756 2003-08-20 01:32:29 spambayes-1.0a5/Outlook2000/sandbox/extract_bad_msg_from_log.py -rwx------ rjh/None 4351 2003-09-01 18:04:35 spambayes-1.0a5/Outlook2000/sandbox/extract_prop.py -rwx------ rjh/None 3305 2003-07-28 15:24:16 spambayes-1.0a5/Outlook2000/sandbox/find_dupe_props.py -rwx------ rjh/None 7636 2003-09-01 17:36:34 spambayes-1.0a5/Outlook2000/sandbox/mapi_driver.py -rwx------ rjh/None 2516 2003-06-17 14:04:43 spambayes-1.0a5/Outlook2000/sandbox/set_read_flag.py -rwx------ rjh/None 20368 2003-09-01 17:35:58 spambayes-1.0a5/Outlook2000/tester.py -rwx------ rjh/None 7345 2003-08-27 23:28:47 spambayes-1.0a5/Outlook2000/train.py -rwx------ rjh/None 879 2003-09-04 19:37:58 spambayes-1.0a5/PKG-INFO -rwx------ rjh/None 32525 2003-09-01 22:33:48 spambayes-1.0a5/pop3proxy.py -rwx------ rjh/None 5566 2003-07-29 20:12:10 spambayes-1.0a5/POP3PROXY.txt -rwx------ rjh/None 4252 2003-04-28 12:36:03 spambayes-1.0a5/proxytee.py drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/pspam/ -rwx------ rjh/None 10618 2003-01-21 08:24:54 spambayes-1.0a5/pspam/pop.py drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/pspam/pspam/ -rwx------ rjh/None 403 2002-11-04 16:44:20 spambayes-1.0a5/pspam/pspam/database.py -rwx------ rjh/None 2082 2002-11-12 19:03:20 spambayes-1.0a5/pspam/pspam/folder.py -rwx------ rjh/None 168 2002-11-04 16:44:20 spambayes-1.0a5/pspam/pspam/message.py -rwx------ rjh/None 1203 2003-01-29 15:23:34 spambayes-1.0a5/pspam/pspam/options.py -rwx------ rjh/None 3284 2003-01-14 17:38:20 spambayes-1.0a5/pspam/pspam/profile.py -rwx------ rjh/None 522 2002-11-04 16:44:20 spambayes-1.0a5/pspam/pspam/__init__.py -rwx------ rjh/None 825 2002-11-04 16:44:19 spambayes-1.0a5/pspam/README.txt -rwx------ rjh/None 885 2002-11-12 19:03:20 spambayes-1.0a5/pspam/scoremsg.py -rwx------ rjh/None 1470 2002-11-12 19:03:20 spambayes-1.0a5/pspam/update.py -rwx------ rjh/None 361 2002-11-04 16:44:19 spambayes-1.0a5/pspam/vmspam.ini -rwx------ rjh/None 182 2002-11-04 16:44:19 spambayes-1.0a5/pspam/zeo.sh -rwx------ rjh/None 12786 2003-08-25 13:36:41 spambayes-1.0a5/README-DEVEL.txt -rwx------ rjh/None 19300 2003-08-25 20:55:19 spambayes-1.0a5/README.txt drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/scripts/ -rwx------ rjh/None 341 2003-07-27 19:41:09 spambayes-1.0a5/scripts/README.txt -rwx------ rjh/None 35309 2003-07-27 19:48:14 spambayes-1.0a5/scripts/sb-overkill.py -rwx------ rjh/None 2058 2003-07-07 12:09:27 spambayes-1.0a5/setup.py -rwx------ rjh/None 22165 2003-09-02 13:03:17 spambayes-1.0a5/smtpproxy.py drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/spambayes/ -rwx------ rjh/None 5621 2003-01-14 17:38:20 spambayes-1.0a5/spambayes/cdb.py -rwx------ rjh/None 898 2003-04-11 01:28:28 spambayes-1.0a5/spambayes/cdb_classifier.py -rwx------ rjh/None 5403 2003-01-14 17:38:20 spambayes-1.0a5/spambayes/chi2.py -rwx------ rjh/None 18020 2003-09-02 09:07:32 spambayes-1.0a5/spambayes/classifier.py -rwx------ rjh/None 11181 2003-01-14 17:38:20 spambayes-1.0a5/spambayes/compatheapq.py -rwx------ rjh/None 16267 2003-01-14 17:38:20 spambayes-1.0a5/spambayes/compatsets.py -rwx------ rjh/None 13570 2003-05-04 15:05:33 spambayes-1.0a5/spambayes/Corpus.py -rwx------ rjh/None 6093 2003-05-26 12:00:52 spambayes-1.0a5/spambayes/CostCounter.py -rwx------ rjh/None 1415 2003-08-07 12:05:14 spambayes-1.0a5/spambayes/dbmstorage.py -rwx------ rjh/None 30485 2003-09-03 10:47:48 spambayes-1.0a5/spambayes/Dibbler.py -rwx------ rjh/None 22249 2003-05-05 13:22:11 spambayes-1.0a5/spambayes/FileCorpus.py -rwx------ rjh/None 8192 2003-08-27 00:57:31 spambayes-1.0a5/spambayes/hammie.py -rwx------ rjh/None 6492 2003-08-14 10:48:38 spambayes-1.0a5/spambayes/hammiebulk.py -rwx------ rjh/None 6350 2003-01-14 17:38:20 spambayes-1.0a5/spambayes/Histogram.py -rwx------ rjh/None 11123 2003-08-26 16:30:41 spambayes-1.0a5/spambayes/ImapUI.py -rwx------ rjh/None 5262 2003-01-14 17:38:20 spambayes-1.0a5/spambayes/mboxutils.py -rwx------ rjh/None 12946 2003-08-26 18:49:56 spambayes-1.0a5/spambayes/message.py -rwx------ rjh/None 2994 2003-01-14 17:38:20 spambayes-1.0a5/spambayes/msgs.py -rwx------ rjh/None 18194 2003-08-20 20:33:26 spambayes-1.0a5/spambayes/oe_mailbox.py -rwx------ rjh/None 2299 2003-01-14 17:38:20 spambayes-1.0a5/spambayes/optimize.py -rwx------ rjh/None 47996 2003-09-01 18:07:27 spambayes-1.0a5/spambayes/Options.py -rwx------ rjh/None 28649 2003-08-25 21:00:36 spambayes-1.0a5/spambayes/OptionsClass.py -rwx------ rjh/None 23866 2003-09-01 18:07:27 spambayes-1.0a5/spambayes/ProxyUI.py -rwx------ rjh/None 41631 2003-02-01 06:32:27 spambayes-1.0a5/spambayes/PyMeldLite.py drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/spambayes/resources/ -rwx------ rjh/None 1162 2003-01-18 08:21:14 spambayes-1.0a5/spambayes/resources/classify.gif -rwx------ rjh/None 3501 2003-07-14 04:12:28 spambayes-1.0a5/spambayes/resources/classify_gif.py -rwx------ rjh/None 950 2003-01-18 08:21:15 spambayes-1.0a5/spambayes/resources/config.gif -rwx------ rjh/None 3042 2003-07-14 04:12:28 spambayes-1.0a5/spambayes/resources/config_gif.py -rwx------ rjh/None 1258 2003-01-18 08:21:17 spambayes-1.0a5/spambayes/resources/helmet.gif -rwx------ rjh/None 3692 2003-07-14 04:12:28 spambayes-1.0a5/spambayes/resources/helmet_gif.py -rwx------ rjh/None 946 2003-01-18 08:21:20 spambayes-1.0a5/spambayes/resources/message.gif -rwx------ rjh/None 3022 2003-07-14 04:12:28 spambayes-1.0a5/spambayes/resources/message_gif.py -rwx------ rjh/None 1146 2003-01-18 08:21:21 spambayes-1.0a5/spambayes/resources/query.gif -rwx------ rjh/None 3432 2003-07-14 04:12:28 spambayes-1.0a5/spambayes/resources/query_gif.py -rwx------ rjh/None 3825 2003-01-29 15:23:35 spambayes-1.0a5/spambayes/resources/scanning__init__.py -rwx------ rjh/None 1165 2003-01-18 08:21:22 spambayes-1.0a5/spambayes/resources/status.gif -rwx------ rjh/None 3469 2003-07-14 04:12:28 spambayes-1.0a5/spambayes/resources/status_gif.py -rwx------ rjh/None 1337 2003-01-18 08:21:24 spambayes-1.0a5/spambayes/resources/train.gif -rwx------ rjh/None 3856 2003-07-14 04:12:28 spambayes-1.0a5/spambayes/resources/train_gif.py -rwx------ rjh/None 15285 2003-08-31 09:37:10 spambayes-1.0a5/spambayes/resources/ui.html -rwx------ rjh/None 8687 2003-01-18 08:21:26 spambayes-1.0a5/spambayes/resources/ui.psp -rwx------ rjh/None 6962 2003-08-31 09:37:10 spambayes-1.0a5/spambayes/resources/ui_html.py -rwx------ rjh/None 17723 2003-07-14 04:12:28 spambayes-1.0a5/spambayes/resources/ui_psp.py -rwx------ rjh/None 62 2003-01-18 08:21:13 spambayes-1.0a5/spambayes/resources/__init__.py -rwx------ rjh/None 22432 2003-09-02 09:07:32 spambayes-1.0a5/spambayes/storage.py drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/spambayes/test/ -rwx------ rjh/None 172 2003-07-25 15:22:04 spambayes-1.0a5/spambayes/test/README.txt -rwx------ rjh/None 5860 2003-07-25 17:17:22 spambayes-1.0a5/spambayes/test/test_storage.py -rwx------ rjh/None 11448 2003-01-29 15:23:34 spambayes-1.0a5/spambayes/TestDriver.py -rwx------ rjh/None 7077 2003-01-14 17:38:20 spambayes-1.0a5/spambayes/Tester.py -rwx------ rjh/None 53887 2003-06-28 13:51:57 spambayes-1.0a5/spambayes/tokenizer.py -rwx------ rjh/None 32551 2003-09-01 22:33:48 spambayes-1.0a5/spambayes/UserInterface.py -rwx------ rjh/None 8245 2003-09-02 13:31:04 spambayes-1.0a5/spambayes/Version.py -rwx------ rjh/None 41 2003-09-04 19:35:07 spambayes-1.0a5/spambayes/__init__.py -rwx------ rjh/None 10278 2002-09-06 08:55:02 spambayes-1.0a5/TESTING.txt drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/testtools/ -rwx------ rjh/None 4872 2003-01-17 18:42:53 spambayes-1.0a5/testtools/cmp.py -rwx------ rjh/None 2829 2003-02-28 12:02:45 spambayes-1.0a5/testtools/es2hs.py -rwx------ rjh/None 1074 2003-01-17 18:42:54 spambayes-1.0a5/testtools/fpfn.py -rwx------ rjh/None 2883 2003-05-26 12:00:54 spambayes-1.0a5/testtools/incremental.HOWTO.txt -rwx------ rjh/None 13190 2003-03-01 05:57:53 spambayes-1.0a5/testtools/incremental.py -rwx------ rjh/None 552 2003-03-01 17:13:07 spambayes-1.0a5/testtools/incremental.TODO.txt -rwx------ rjh/None 4231 2003-01-29 15:23:35 spambayes-1.0a5/testtools/mboxtest.py -rwx------ rjh/None 5094 2003-03-01 17:13:08 spambayes-1.0a5/testtools/mkgraph.py -rwx------ rjh/None 3194 2003-03-01 17:13:08 spambayes-1.0a5/testtools/mksets.py -rwx------ rjh/None 11875 2003-04-20 17:00:29 spambayes-1.0a5/testtools/pop3proxytest.py -rwx------ rjh/None 2769 2003-01-17 18:42:54 spambayes-1.0a5/testtools/rates.py -rwx------ rjh/None 3014 2003-03-08 12:33:22 spambayes-1.0a5/testtools/regimes.py -rwx------ rjh/None 2332 2003-01-29 15:23:35 spambayes-1.0a5/testtools/simplexloop.py -rwx------ rjh/None 2763 2003-02-28 12:02:45 spambayes-1.0a5/testtools/sort+group.py -rwx------ rjh/None 6831 2003-01-17 18:42:54 spambayes-1.0a5/testtools/table.py -rwx------ rjh/None 5159 2003-03-08 12:33:23 spambayes-1.0a5/testtools/timcv.py -rwx------ rjh/None 3081 2003-04-30 18:43:28 spambayes-1.0a5/testtools/timtest.py -rwx------ rjh/None 12170 2003-05-15 12:35:38 spambayes-1.0a5/testtools/urlslurper.py -rwx------ rjh/None 6281 2003-01-29 15:23:35 spambayes-1.0a5/testtools/weaktest.py -rwx------ rjh/None 4347 2003-01-29 15:23:34 spambayes-1.0a5/unheader.py drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/utilities/ -rwx------ rjh/None 2664 2003-04-25 14:29:16 spambayes-1.0a5/utilities/convert_config_file.py -rwx------ rjh/None 520 2003-02-17 05:06:07 spambayes-1.0a5/utilities/dump_cdb.py -rwx------ rjh/None 2043 2003-05-04 15:16:45 spambayes-1.0a5/utilities/HistToGNU.py -rwx------ rjh/None 3268 2003-01-22 09:19:09 spambayes-1.0a5/utilities/loosecksum.py -rwx------ rjh/None 2163 2003-01-17 18:42:53 spambayes-1.0a5/utilities/mboxcount.py -rwx------ rjh/None 3046 2003-05-04 15:16:45 spambayes-1.0a5/utilities/pop3graph.py -rwx------ rjh/None 5680 2003-01-17 18:42:53 spambayes-1.0a5/utilities/rebal.py -rwx------ rjh/None 2256 2003-01-17 18:42:53 spambayes-1.0a5/utilities/split.py -rwx------ rjh/None 2822 2003-01-17 18:42:53 spambayes-1.0a5/utilities/splitn.py -rwx------ rjh/None 3592 2003-01-17 18:42:53 spambayes-1.0a5/utilities/splitndirs.py -rwx------ rjh/None 2215 2003-08-16 19:46:41 spambayes-1.0a5/utilities/which_database.py -rwx------ rjh/None 9532 2003-09-04 19:14:57 spambayes-1.0a5/WHAT_IS_NEW.txt drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/windows/ -rwx------ rjh/None 19608 2003-09-02 20:29:01 spambayes-1.0a5/windows/autoconfigure.py -rwx------ rjh/None 5698 2003-08-31 14:17:46 spambayes-1.0a5/windows/pop3proxy_service.py -rwx------ rjh/None 8047 2003-09-03 08:12:13 spambayes-1.0a5/windows/pop3proxy_tray.py drwx------ rjh/None 0 2003-09-04 19:37:58 spambayes-1.0a5/windows/py2exe/ -rwx------ rjh/None 350 2003-08-30 02:46:18 spambayes-1.0a5/windows/py2exe/README.txt -rwx------ rjh/None 1817 2003-08-30 02:44:08 spambayes-1.0a5/windows/py2exe/setup_all.py -rwx------ rjh/None 232 2003-03-13 01:22:44 spambayes-1.0a5/windows/README.txt "Active code", however, is a trickier question. Obviously this is only the *.py files, but many of those are only used in certain applications. You can certainly discount any in the testtools, utilities, spam and contrib directories. You can discount the Outlook2000 directory unless you are using it, and the windows directory unless you are using pop3proxy as a service. For a more accurate count, you'll have to run it and look at which files get a .pyc version created. These (and the one you launch) are the ones used. HTH. =Tony Meyer From HLyner at seznam.cz Mon Sep 8 07:57:16 2003 From: HLyner at seznam.cz (Heinz Lyner) Date: Mon Sep 8 00:57:27 2003 Subject: [Spambayes] Outlook 2000 SP 3 / XP Prof SP-1 Message-ID: I have lost the "Spam" field in the field chooser of Outlook. For some folders it worked, but now I have no longer such a field availble in the user defined fields? Heinz Lyner From T.A.Meyer at massey.ac.nz Mon Sep 8 18:51:56 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 8 01:52:11 2003 Subject: [Spambayes] Imapfilter never-ending appending headers Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5D8B@its-xchg4.massey.ac.nz> [original post] > messageinfo_storage_file:/var/spambayes/main.messageinfo.db [later] > Hmmm...checking my /var/spambayes/arohter.messageinfo.db > stats, it doesn't look like it's being updated...the timestamp > never changes (and file size stays at only 12,288 bytes), even > after several classification and training runs (with new mail). The -i4 information looks like it's finding the messages ok, which leads me to suspect the messageinfo db. Your two messages have different names for this file - does the file&path in your .ini match the file that you're looking at? If this isn't being updated, then that is the problem - spambayes thinks that it hasn't seen the message before and so adds the information to it. There are only three reasons I can think of offhand why the file wouldn't be updated: 1. you don't have a suitable dbm module available - 1.0a4 let people use dumbdbm, but 1.0a5 doesn't, so you need bsddb, gdbm or something like that. 2. you have a suitable dbm module, but the existing dbm module is dumbdbm 3. spambayes can't find the file However, there are flaws with all three of these. #1 should print out an error explaining what the problem is. #2 should crash and die with some sort of traceback. With #3, spambayes should just create the new file and then keep using it, unless it gets deleted between sessions. Anyway, knowing that the messageinfo db referenced in the ini definitely isn't being touched would be a good start. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 8 19:00:36 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 8 02:01:09 2003 Subject: [Spambayes] Spambayes 1.0a5 comments Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5D92@its-xchg4.massey.ac.nz> > It is good to have Unsure messages in top. But the button > to Train should be there too. Wouldn't this encourage people to click 'Train' without actually checking that the classification is correct? (Having the "save" button at the top (and bottom) of the config, page, however, would be good). > Also I would like it to train only Unsure messages, and then > have Train All button to train also ham and spam messages. > Another option would be to have all other messages selected > as Defer by default. The default is to have messages selected as their existing classification. Most of the time this should be right, so makes sense as the default. If you want to change them all to 'defer', this only takes a single click (on the 'defer' heading). An option to only review unsures is an interesting idea. If we had more data about which training regeme produces the best results, and training on unsures only did well, then this would definitely be a good idea. As it is, I wonder if it would just make things more complicated, unnecessarily. You can open a feature request if you really want it, and people can debate the idea. (Note that it won't get added until the end of the feature freeze, in any case). =Tony Meyer From stephen at theboulets.net Mon Sep 8 02:04:16 2003 From: stephen at theboulets.net (Stephen Boulet) Date: Mon Sep 8 02:04:30 2003 Subject: [Spambayes] Error running pop3proxy.py In-Reply-To: <200309080007.00600.ltrag@optonline.net> References: <20030906144244.M59552@theboulets.net> <200309072217.01135.stephen@theboulets.net> <200309080007.00600.ltrag@optonline.net> Message-ID: <200309080104.16996.stephen@theboulets.net> Thanks. That did it. There is another error. When I go to the configuration page and try to set, say, the spamminess cutoff to 0.8 (or change anything else there), I get the following error. After that, I need to restart the proxy to get it to work again, although the changes will have been saved. Error: 500 Server error Traceback (most recent call last): File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "/usr/lib/python2.2/site-packages/spambayes/UserInterface.py", line 688, in onChangeopts self.reReadOptions() File "/usr/lib/python2.2/site-packages/spambayes/ProxyUI.py", line 517, in reReadOptions state = self.state_recreator() File "/usr/bin/pop3proxy.py", line 721, in _recreateState prepare(state) File "/usr/bin/pop3proxy.py", line 737, in prepare state.createWorkers() File "/usr/bin/pop3proxy.py", line 618, in createWorkers self.bayes = storage.open_storage(filename, self.useDB) File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 647, in open_storage return klass(data_source_name) File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 157, in __init__ self.load() File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 165, in load self.dbm = dbmstorage.open(self.db_name, self.mode) File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", line 53, in open return f(*args) File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", line 36, in open_best return f(*args) File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", line 23, in open_gdbm return gdbm.open(*args) error: (11, 'Resource temporarily unavailable') On Sunday 07 September 2003 11:07 pm, Lanny Trager wrote: > Stephen, > > In your earlier e-mail you said that you were running the command: > pop3proxy.py -u 1110 > > If that was indeed the command that you were running, it was changing the > port that you needed to point your browser to, nothing more, nothing less. > > Try this, change the name of your "bayescustomize.ini" to > "bayescustomize.bak" or anything but its real name so that SpamBayes > doesn't see it. > > Then run "pop3proxy.py -b" -(or "python pop3proxy.py -b" if your setup is > happier with that) just that with no other flags. Then use the web > interface to set the configuration. > > After you have set the configuration and saved it, your new > "bayescustomize.ini" should look something like this: > > [html_ui] > display_to:True > [pop3proxy] > listen_ports:1110 > remote_servers:pop.your_pop_server.net > add_mailid_to:header > [smtpproxy] > listen_ports:1120 > remote_servers:mail.your_smtp_server.net > > Let me know if that helps. > > Lanny From arohter at nolar.com Mon Sep 8 02:09:33 2003 From: arohter at nolar.com (Alon) Date: Mon Sep 8 02:09:42 2003 Subject: [Spambayes] Imapfilter never-ending appending headers In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E5D8B@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13031E5D8B@its-xchg4.massey.ac.nz > Message-ID: <10352203.1062983373@[192.168.0.20]> Ya, I should have mentioned I changed my .ini file a bit: ---------------- [Headers] include_evidence:False include_score:True [Storage] messageinfo_storage_file:/var/spambayes/arohter.messageinfo.db persistent_storage_file:/var/spambayes/arohter.db [imap] password:password server:server.com username:arohter ham_train_folders:INBOX.Accounts,INBOX.Keep,INBOX.Orders,INBOX.Sent-Mail,IN BOX.Soccer,INBOX.TO-BE-DELETED spam_folder:INBOX.SPAM-NEWSPAM spam_train_folders:INBOX.SPAM unsure_folder:INBOX ----------------- The messageinfo db referenced in the ini definitely is not being touched for some reason. It never changes from it's default first creation. I can switch back to using 1.0a4 just fine, but as soon as I start using a5, it no longer touches messageinfo.db. I'm going to test using Python 2.3, on the off chance it fixes things. aLoN > [original post] >> messageinfo_storage_file:/var/spambayes/main.messageinfo.db > [later] >> Hmmm...checking my /var/spambayes/arohter.messageinfo.db >> stats, it doesn't look like it's being updated...the timestamp >> never changes (and file size stays at only 12,288 bytes), even >> after several classification and training runs (with new mail). > > The -i4 information looks like it's finding the messages ok, which leads > me to suspect the messageinfo db. Your two messages have different > names for this file - does the file&path in your .ini match the file > that you're looking at? If this isn't being updated, then that is the > problem - spambayes thinks that it hasn't seen the message before and so > adds the information to it. > > There are only three reasons I can think of offhand why the file > wouldn't be updated: > 1. you don't have a suitable dbm module available - 1.0a4 let people > use dumbdbm, but 1.0a5 doesn't, so you need bsddb, gdbm or something > like that. > 2. you have a suitable dbm module, but the existing dbm module is > dumbdbm > 3. spambayes can't find the file > > However, there are flaws with all three of these. #1 should print out > an error explaining what the problem is. #2 should crash and die with > some sort of traceback. With #3, spambayes should just create the new > file and then keep using it, unless it gets deleted between sessions. > > Anyway, knowing that the messageinfo db referenced in the ini definitely > isn't being touched would be a good start. > > =Tony Meyer > From T.A.Meyer at massey.ac.nz Mon Sep 8 19:12:10 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 8 02:12:36 2003 Subject: [Spambayes] gdbm error (was Error running pop3proxy.py) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5D98@its-xchg4.massey.ac.nz> [spambayes-dev people: could someone who knows more about the dbm modules confirm that I'm right here?] > There is another error. When I go to the configuration page > and try to set, say, the spamminess cutoff to 0.8 (or change > anything else there), I get the following error. After that, > I need to restart the proxy to get it to work again, although > the changes will have been saved. > > Error: [...] > File > "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", > line 23, in > open_gdbm > return gdbm.open(*args) > > error: (11, 'Resource temporarily unavailable') I think this is an actual bug. When you change the options, everything is reloaded with the new settings, including the database. This looks to me like gdbm won't let anyone else (including spambayes) open the db while spambayes has it open, so we need to close it first. We should probably be doing that anyway. I suspect that this doesn't show up using bsddb (and wouldn't using a pickle). =Tony Meyer From arohter at nolar.com Mon Sep 8 02:32:30 2003 From: arohter at nolar.com (Alon) Date: Mon Sep 8 02:32:41 2003 Subject: [Spambayes] Imapfilter never-ending appending headers In-Reply-To: <10352203.1062983373@[192.168.0.20]> References: <1ED4ECF91CDED24C8D012BCF2B034F13031E5D8B@its-xchg4.massey.ac.nz > <10352203.1062983373@[192.168.0.20]> Message-ID: <11728468.1062984750@[192.168.0.20]> OK, tested with Python 2.3: Now timestamp on arohter.messageinfo.db is updated every time I run imapfilter, but the file size never changes beyond the first 12,288 bytes......and the X-Spambayes headers continue to pile up :( aLoN > The messageinfo db referenced in the ini definitely is not being touched > for some reason. It never changes from it's default first creation. I > can switch back to using 1.0a4 just fine, but as soon as I start using > a5, it no longer touches messageinfo.db. > > I'm going to test using Python 2.3, on the off chance it fixes things. > > aLoN > > From T.A.Meyer at massey.ac.nz Mon Sep 8 20:02:50 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 8 03:03:10 2003 Subject: [Spambayes] RE: imapfilter.py Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5DA1@its-xchg4.massey.ac.nz> > Ok, I got the 1.0a5 files and ran "setup.py install". (Should I have > uninstalled the previous package first?) No, it doesn't matter. > In the web interface, the folder names still appear garbled > with the first letter and the last missing for all folders except > for Deleted Items and Sent Items. Drat. Could you run "imapfilter.py -i4 -b" and send me the output that you get if you display the folder selection page? > Additionally, when I tried to save the training folder this error > appeared: [...] > errmsg = self.verifyInput(parms, pmap) > UnboundLocalError: local variable 'pmap' referenced before assignment Drat again. This is a bug introduced when the 'advanced configuration' page was added. I've fixed it in cvs, so it'll be fixed in the next release. > With this done I ran > "imapfilter.py -t -c -i4" and this was the result: [...] > 15:53.95 > BOBC3 UID SEARCH UNDELETED > 15:53.96 < BOBC3 OK UID SEARCH completed [...] > AttributeError: 'NoneType' object has no attribute 'split' Drat once more. All the imap servers I've come across (looking at the RFC, it's unclear which is correct) return something like this: """ 51:27.68 > DOFO3 UID SEARCH UNDELETED 51:27.85 < * SEARCH 51:27.87 < DOFO3 OK SEARCH done. """ imapfilter dies because it's expecting that "* SEARCH" line. If you change line 488 of imapfilter.py from: if response[1][0] == "": to if not response[1][0]: That should fix it. I've checked in basically the same fix for the next release. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 8 22:16:22 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 8 05:16:35 2003 Subject: [Spambayes] Can I configure SpamBayes to work with Netscape Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5DAA@its-xchg4.massey.ac.nz> > I just read with GREAT interest about your work. While I am > not current on programming skill, I still have some technical > skills. Has anyone worked with the Netscape or Communicator > mail programs and SpamBayes? The POP3 proxy (or IMAP filter, depending on which you use) should work fine with Netscape. Download the 1.0a5 release and give it a go. > I would be interested in > helping this effort in any way I could. Great. The first place to start is to setup Netscape to use spambayes, and let us know how the documentation could be improved so that it's easier to do. (Well, unless something goes wrong, in which case the place to start is to let us know about the problem). The FAQ (link at the bottom) has information about how people can help apart from that. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 8 22:20:34 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 8 05:20:41 2003 Subject: [Spambayes] Spambayes repeatedly classifies essages frommailinglist as SPAM despite multiple (20+)recoveries fromspam folder Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13031E5DAB@its-xchg4.massey.ac.nz> > I have several dozen folders in my mailbox that contain > different types of ham. All told, this is about 7000 > messages, and I have about 1500 spam messages. I used these > as my training corpus with plug-in version 007. > > Should I instead create a "sample" folder of ham that > contains about 1500 messages and train with that? That's probably a good idea. A little imbalance doesn't hurt (you could have 2000, for example), but equal numbers are best. > What about adding a feature to the plug-in that would could > the number of messages in each training folder, then use a > random subsample of each folder (spam or ham) as necessary to > create a balanced training corpus? An interesting idea. I've opened a feature request here: We'll see what Mark has to say ;) =Tony Meyer From mhammond at skippinet.com.au Mon Sep 8 22:33:18 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Mon Sep 8 07:33:24 2003 Subject: [Spambayes] Outlook Addin version 008 released Message-ID: <07ff01c375fd$00b89ba0$f502a8c0@eden> I have just released version 008 of the SpamBayes Outlook Addin. This version has a number of improvements - most obvious is the new SpamBayes Manager dialog, and first-time users will see a new configuration wizard. The major changes in this release: * Configuration Wizard for first-time users. SpamBayes will now allow itself to be configured with no training information - it will "learn as it goes". * New SpamBayes Manager dialogs with many more options exposed, including background processing (the timer.) * A number of bugs fixed when watching multiple folders and moving messages between the two. * SpamBayes dropdown on toolbar includes more items, including links to documentation. * Improvements to the toolbar code, making them far more reliable. * Improvements to IMAP and POP filtering. * Better processing of HTML clues for non Exchange Server users. * SpamBayes no longer breaks auto-complete in Outlook 2002. NOTE: This and all future releases are being made via the SourceForge file release system. You can find all the information at: http://www.spambayes.org/windows.html For all problems, please see the above page - it has links to the "troubleshooting guide", and will be updated with the most recent available information about the release. Regards, Mark. From rcoe at CambridgeMA.GOV Mon Sep 8 08:36:54 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Mon Sep 8 07:36:58 2003 Subject: [Spambayes] RE: Outlook 2000 SP 3 / XP Prof SP-1 Message-ID: I've found that I can't maintain a view that includes the "spam" field = in any folder that I haven't filtered. Outlook will slap my wrist and = delete the rule from the view. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 =B7 617-349-4217 =B7 fax = 617-349-6165 > -----Original Message----- > From: Heinz Lyner [mailto:HLyner@seznam.cz] > Sent: Monday, September 08, 2003 12:57 AM > To: spambayes@python.org > Subject: [Spambayes] Outlook 2000 SP 3 / XP Prof SP-1 >=20 >=20 > I have lost the "Spam" field in the field chooser of Outlook. For some > folders it worked, but now I have no longer such a field availble in = the > user defined fields? >=20 > Heinz Lyner From skip at pobox.com Mon Sep 8 09:29:03 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 8 09:29:12 2003 Subject: [Spambayes] Re: [spambayes-dev] gdbm error (was Error running pop3proxy.py) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E5D98@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13031E5D98@its-xchg4.massey.ac.nz> Message-ID: <16220.33823.743751.29403@montanaro.dyndns.org> >> return gdbm.open(*args) >> >> error: (11, 'Resource temporarily unavailable') Tony> I think this is an actual bug. When you change the options, Tony> everything is reloaded with the new settings, including the Tony> database. This looks to me like gdbm won't let anyone else Tony> (including spambayes) open the db while spambayes has it open, ... I'm not a gdbm user, but that appears to be the case in the two tests I tried from the Python interpreter prompt. Multiple readers are allowed, but if anybody has the file open for write access, nobody else can open it. Skip From skip at pobox.com Mon Sep 8 09:42:19 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 8 09:42:25 2003 Subject: [Spambayes] Re: ham,Invitation to join the Spambayes group (fwd) In-Reply-To: <002701c37601$70f364f0$6502a8c0@Centrino> References: <16219.39797.838435.528127@montanaro.dyndns.org> <002701c37601$70f364f0$6502a8c0@Centrino> Message-ID: <16220.34619.345607.207375@montanaro.dyndns.org> John> Is it not true that you can more easily find answers on a John> YahooGroup. On the spambayes@python.org mailing list you have to John> search each month one at a time or is there an easier way? Just google for spambayes site:mail.python.org to find references about spambayes on mail.python.org. You can of course include other terms to narrow the search. Still, it doesn't make any difference how easy Yahoo! Groups is to search, I don't think the SpamBayes community is large enough to support more than one user support mailing list. That list is spambayes@python.org, and as far as I know all the SpamBayes developers subscribe to it. Skip From dbettinger at cxtec.com Mon Sep 8 10:42:33 2003 From: dbettinger at cxtec.com (Dean Bettinger) Date: Mon Sep 8 09:43:02 2003 Subject: [Spambayes] Outlook Addin 008 binary - problem with new dialog manager Message-ID: I just installed the 008 version as an upgrade to 007. Thanks a bunch for resolving the Outlook 2002 auto-complete issue and all the other terrific things you've done in this version and the rest! However, when I open the SpamBayes Manager and click on the tabs once I click on the Training tab the dialog box is stuck on the Training dialog box even though the tab title changes. Dean W. Bettinger Director of Information Technology CXtec (CABLExpress Technologies) 5404 South Bay Road PO Box 4799 Syracuse, NY 13221-4799 Phone: 315.476.3000 x2595 Cell: 315.430.5213 Fax: 315.455.1800 www.CXtec.com CXtec helps customers reduce the cost of their networking infrastructure. From skip at pobox.com Mon Sep 8 10:47:18 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 8 10:47:25 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <000001c37594$66c78ff0$6501a8c0@CambridgeMA.GOV> References: <200309061429.h86ET7r16035@localhost.localdomain> <000001c37594$66c78ff0$6501a8c0@CambridgeMA.GOV> Message-ID: <16220.38518.986791.497416@montanaro.dyndns.org> Bob> Now that almost everybody can read HTML messages, I think the Bob> plain-text version is superfluous. There are at least a few people (myself included) who find it a waste of bandwidth to send an HTML file or a Word document (yes, I get some of those as well) when the content can be represented just as well with 7-bit ASCII. Skip From thetasig at comcast.net Mon Sep 8 08:48:09 2003 From: thetasig at comcast.net (theta sigma) Date: Mon Sep 8 10:48:14 2003 Subject: [Spambayes] Mailing Lists Message-ID: <5.2.1.1.2.20030908074452.00b709d0@localhost> I subscribe to a number of mailing lists. All of these are ham. They always come from / to a consistent e-mail address. However, I can't seem to find a way to inform SpamBayes that all of these are ham. I have trained on them and on spam (balanced training). It continues to misclassify some of the mailing list e-mail due to certain tokens (legitimate uses of them). Any suggestions on how to handle this? Thx. -=mark=- From skip at pobox.com Mon Sep 8 10:53:17 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 8 10:53:19 2003 Subject: [Spambayes] DB_RUNRECOVERY error In-Reply-To: <200309071725.53385.shalehperry@comcast.net> References: <200309071154.35194.shalehperry@comcast.net> <16219.39534.403441.147629@montanaro.dyndns.org> <200309071725.53385.shalehperry@comcast.net> Message-ID: <16220.38877.284595.554796@montanaro.dyndns.org> Sean> sorry about that. spambayes 1.0a4 using pop3proxy. Sean> Sounds like I get to retrain. Joy. Is the berkeley db the best Sean> choice for me next time? If you're using pop3proxy and don't share your database with other SB apps (hammiefilter, etc), then a pickle is probably your best choice. Skip From newsguy at earthlink.net Mon Sep 8 10:53:38 2003 From: newsguy at earthlink.net (newsguy) Date: Mon Sep 8 10:53:48 2003 Subject: [Spambayes] Outlook Plug-in does not filter mail moved by rules from inbox to other folders Message-ID: <000e01c37618$fcf08660$04c56b18@BillSusanDesk> Outlook 10 moves some email from inbox to other folders based on email address of incoming mail. Spambayes outlook plug-in filters mail remaining in inbox appropriately, but doesn't filter this mail that the rules move. (either in the inbox prior to the move, or in the destination folder after the move). All folders. inbox, and destination folders are specified for filtering. When rules to move mail are disabled, (leaving all mail in inbox), All mail is filtered appropriately. Bill From rcoe at CambridgeMA.GOV Mon Sep 8 12:05:11 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Mon Sep 8 11:08:09 2003 Subject: [Spambayes] RE: how spambayes handles image-only spams Message-ID: You are, of course, entitled to your opinion. Bob > -----Original Message----- > From: Skip Montanaro [mailto:skip@pobox.com] > Sent: Monday, September 08, 2003 10:47 AM > To: Coe, Bob (home) > Cc: spambayes@python.org > Subject: RE: [Spambayes] how spambayes handles image-only spams > > > > Bob> Now that almost everybody can read HTML messages, I think > Bob> the plain-text version is superfluous. > > There are at least a few people (myself included) who find it > a waste of bandwidth to send an HTML file or a Word document > (yes, I get some of those as well) when the content can be > represented just as well with 7-bit ASCII. > > Skip From rcoe at CambridgeMA.GOV Mon Sep 8 12:36:46 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Mon Sep 8 11:39:35 2003 Subject: [Spambayes] RE: Outlook Addin 008 binary - problem with new dialogmanager Message-ID: I confirm that, and I think we need an "008a" to fix it. I don't want to have to try to explain that one to users. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: Dean Bettinger [mailto:dbettinger@cxtec.com] > Sent: Monday, September 08, 2003 9:43 AM > To: spambayes@python.org > Subject: [Spambayes] Outlook Addin 008 binary - problem with new > dialogmanager > > > I just installed the 008 version as an upgrade to 007. Thanks a bunch for > resolving the Outlook 2002 auto-complete issue and all the > other terrific things you've done in this version and the rest! > > However, when I open the SpamBayes Manager and click on the tabs once I > click on the Training tab the dialog box is stuck on the Training dialog box > even though the tab title changes. > > Dean W. Bettinger > Director of Information Technology > CXtec (CABLExpress Technologies) > 5404 South Bay Road > PO Box 4799 > Syracuse, NY 13221-4799 > Phone: 315.476.3000 x2595 > Cell: 315.430.5213 > Fax: 315.455.1800 > www.CXtec.com From ltrag at optonline.net Mon Sep 8 13:40:09 2003 From: ltrag at optonline.net (Lanny Trager) Date: Mon Sep 8 12:41:06 2003 Subject: [Spambayes] Classification not always added to the header Message-ID: <200309081240.09173.ltrag@optonline.net> spambayes-1.0a5 (getting mail from two different pop servers) RedHat 9 Kmail Yesterday was my first day using spambayes-1.0a5, because the previous version's db got corrupted I began training from scratch. As the day wore on and through about 50 e-mails (about 25% spam) SpamBayes accuracy increased very nicely. All of the e-mail showed up on the review page. Last night, I shut my system down for the night. When I brought it up this morning (on at least the first two runs) SpamBayes did not classify about half of the incoming mail. They do not show up in the review page and there is no classification in the headers. I ran which_database.py and got the following results: Invalid option add_mailid_to in section pop3proxy in file bayescustomize.ini Pickle is available. Dumbdbm is available. Dbhash is available. Bsddb[3] is available. Your storage hammie.db is a: dbhash I hadn't seen that first line before so -- I went to the config page and saved it, then re-ran which_database.py with the following results: Pickle is available. Dumbdbm is available. Dbhash is available. Bsddb[3] is available. Your storage hammie.db is a: dbhash As of last night my bayescustomize.ini looked like this: (If this was actually the bayescustomize.ini from spambayes-1.0a4, I don't remember using the old one. In fact I had tried to use the 1.0a4 bayescustomize.ini but had a problem so I renamed it and created a new one using the configuration page.) [html_ui] display_to:True [pop3proxy] listen_ports:1115,1110 remote_servers:pop.pop_server1.net,pop.pop_server2.net add_mailid_to:header [smtpproxy] listen_ports:1120 remote_servers:mail.smtp_server.net After going back to the configuration page and changing one option and saving, it looks like this: [html_ui] display_to:True [pop3proxy] listen_ports:1115,1110 remote_servers:pop.pop_server1.net,pop.pop_server2.net [smtpproxy] listen_ports:1120 remote_servers:mail.smtp_server.net [Tokenizer] -- (the changed option) mine_received_headers:True -- (the changed option) After the most recent save of the configuration page I have received three e-mails all of which had the classification added to the header. Possibly saving the configuration page thus re-creating the bayescustomize.ini fixed the problem. I'm just mystified as to why it happened in the first place. Lanny -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From rmalayter at bai.org Mon Sep 8 12:52:25 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Sep 8 12:52:29 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D3F1@cliff.bai.org> From: Bill Yerazunis [mailto:wsy@merl.com] > Um.... you're arguing politics of desire > against actual measured statistics. Not really, I have no stake in the prevalence of HTML mail. I'm just think that corpora with small amounts of HTML ham are not representative of the general, Windows-using email population. And I also think the trend of "more HTML ham" will continue, because of the default configurations of popular mail clients. Given the fact wonderful folks like you actually write these filters for the Internet community, I am simply concerned that some harmful design decisions were made because your ham corpora are so devoid of HTML. > on the grounds that the SpamAssassin corpus > is a little less biased, I re-ran the tests ... > So, it seems that "font" is somewhat spammy, > and so is "br", but

is totally equivocal. This is what I was getting at, here are results from the most recent 1549 messages of each of my own corpora, which are probably biased towards HTML ham: ham ham % spam spam %

953 61.5% 1022 66.0%
1223 79.0% 1009 65.1%

> > In the web interface, the folder names still appear garbled > > with the first letter and the last missing for all folders except > > for Deleted Items and Sent Items. > >Drat. Could you run "imapfilter.py -i4 -b" and send me the output that >you get if you display the folder selection page? User interface url is http://localhost:8880/ 11:10.71 > BKKJ1 LOGIN "username" "password" 11:10.73 < BKKJ1 OK LOGIN completed 11:10.73 > BKKJ2 LIST "" * 11:10.75 < * LIST (\Select \Noinferiors) "/" INBOX 11:10.75 < * LIST (\Select) "/" "Deleted Items" 11:10.75 < * LIST (\Select) "/" Drafts 11:10.75 < * LIST (\Select) "/" Science 11:10.75 < * LIST (\Select) "/" "Sent Items" 11:10.75 < * LIST (\Select) "/" SPAM 11:10.75 < * LIST (\Select) "/" Unsure 11:10.75 < BKKJ2 OK LIST completed Interestingly, the names are displayed correctly here, and those that are correct in the HTML interface as well are in quotation marks! >imapfilter dies because it's expecting that "* SEARCH" line. If you >change line 488 of imapfilter.py from: > if response[1][0] == "": >to > if not response[1][0]: > >That should fix it. I've checked in basically the same fix for the next >release. The fix works, thanks a lot! Can't tell you how unreasonably giggly I am about this. Unfortunately, I'm encountering the same problem as somebody else on the list: Every time I classify my Inbox, a new set of SpamBayes headers is attached to every message. In verbose mode, there's no warning "no dbm modules available for MessageInfoDB?". I can send the -i4 readout if you think it'll help. Rafael Scholl * Checked for virus by F-Secure Content Scanner 6.10b36 * From ltrag at optonline.net Mon Sep 8 15:33:06 2003 From: ltrag at optonline.net (Lanny Trager) Date: Mon Sep 8 14:33:21 2003 Subject: [Spambayes] Classification not always added to the header: Follow-up In-Reply-To: <200309081240.09173.ltrag@optonline.net> References: <200309081240.09173.ltrag@optonline.net> Message-ID: <200309081433.06642.ltrag@optonline.net> Two more spams just arrived and neither were classified. What is going on here? On Monday 08 September 2003 12:40, Lanny Trager wrote: > spambayes-1.0a5 (getting mail from two different pop servers) > RedHat 9 > Kmail > > Yesterday was my first day using spambayes-1.0a5, because the previous > version's db got corrupted I began training from scratch. As the day wore > on and through about 50 e-mails (about 25% spam) SpamBayes accuracy > increased very nicely. All of the e-mail showed up on the review page. > > Last night, I shut my system down for the night. When I brought it up this > morning (on at least the first two runs) SpamBayes did not classify about > half of the incoming mail. They do not show up in the review page and > there is no classification in the headers. > > I ran which_database.py and got the following results: > > Invalid option add_mailid_to in section pop3proxy in file > bayescustomize.ini Pickle is available. > Dumbdbm is available. > Dbhash is available. > Bsddb[3] is available. > > Your storage hammie.db is a: dbhash > > I hadn't seen that first line before so -- > > I went to the config page and saved it, then re-ran which_database.py with > the following results: > > Pickle is available. > Dumbdbm is available. > Dbhash is available. > Bsddb[3] is available. > > Your storage hammie.db is a: dbhash > > As of last night my bayescustomize.ini looked like this: (If this was > actually the bayescustomize.ini from spambayes-1.0a4, I don't remember > using the old one. In fact I had tried to use the 1.0a4 bayescustomize.ini > but had a problem so I renamed it and created a new one using the > configuration page.) > > [html_ui] > display_to:True > [pop3proxy] > listen_ports:1115,1110 > remote_servers:pop.pop_server1.net,pop.pop_server2.net > add_mailid_to:header > [smtpproxy] > listen_ports:1120 > remote_servers:mail.smtp_server.net > > After going back to the configuration page and changing one option and > saving, it looks like this: > > [html_ui] > display_to:True > [pop3proxy] > listen_ports:1115,1110 > remote_servers:pop.pop_server1.net,pop.pop_server2.net > [smtpproxy] > listen_ports:1120 > remote_servers:mail.smtp_server.net > [Tokenizer] -- (the changed option) > mine_received_headers:True -- (the changed option) > > After the most recent save of the configuration page I have received three > e-mails all of which had the classification added to the header. Possibly > saving the configuration page thus re-creating the bayescustomize.ini fixed > the problem. > > I'm just mystified as to why it happened in the first place. > > Lanny -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From vanhorn at whidbey.com Mon Sep 8 12:42:52 2003 From: vanhorn at whidbey.com (G. Armour Van Horn) Date: Mon Sep 8 14:42:59 2003 Subject: [Spambayes] how spambayes handles image-only spams References: <792DE28E91F6EA42B4663AE761C41C2AF3D3F1@cliff.bai.org> Message-ID: <3F5CCDAC.8AE5B379@whidbey.com> Another datum: When I took over the Quotes of the Day project in 2000, the daily mailings had always been text only. Being a traditional guy, I continued that, but wanting to sell some advertising, I added an HTML version. In mid-2000, I think that subscribers were choosing HTML and text in roughly equal numbers, but this was at a time when AOL users couldn't read HTML mail reliably, and my signup page clearly warned AOL users not to choose HTML. In mid-2003, the proportion is at least 90% choosing HTML, I can go days at a stretch without seeing a single new text subscriber. Every issue includes a mailto: link to switch from one format to the other, I rarely see this option used anymore. In the most recent mailing, 60.4% of the subscribers had elected to receive HTML. I certainly don't consider this proof, and I don't dispute that there are traditionalists that still live in a text-dominated environment, but it's pretty clear to me that the world as a whole has overwhelmingly chosen to go with HTML in e-mail. Being at least a little traditional myself, I intend to keep the text version of my mailing available as long as any subscriber wants it, even if I think that my HTML version is very tasteful and actually more readable than the text. Van Ryan Malayter wrote: > From: Bill Yerazunis [mailto:wsy@merl.com] > > > Um.... you're arguing politics of desire > > against actual measured statistics. > > Not really, I have no stake in the prevalence of HTML mail. I'm just > think that corpora with small amounts of HTML ham are not representative > of the general, Windows-using email population. And I also think the > trend of "more HTML ham" will continue, because of the default > configurations of popular mail clients. > > Given the fact wonderful folks like you actually write these filters for > the Internet community, I am simply concerned that some harmful design > decisions were made because your ham corpora are so devoid of HTML. > > > on the grounds that the SpamAssassin corpus > > is a little less biased, I re-ran the tests > ... > > So, it seems that "font" is somewhat spammy, > > and so is "br", but >

is totally equivocal. > > This is what I was getting at, here are results from the most recent > 1549 messages of each of my own corpora, which are probably biased > towards HTML ham: > > ham ham % spam spam % > >

953 61.5% 1022 66.0% >
1223 79.0% 1009 65.1% >

> Total 1549 1549 > > As you can see, because so many people who use Outlook, Outlook Express, > and Notes to send me ham, HTML tags are present in a great amount of > what I receive. (Except of course for when someone is sending excerpts from a spreadsheet to me, and which is only used when people send me photos or joke images.) > > My basic argument is that arbitrarily throwing out some HTML tokens in > the parser, while leaving others, might make the filter more effective > for only certain corpora. What test corpora was this decision based on? > > I think keeping some form of detection of image-only spam, which seems to slip through SpamBayes more > often than other types of spam. I also think it would be even better to > have a multi-word token something like that produced by the CRM-114 > token generator, which could find multi-tag strings like These suggestions are just based on my knowledge of the algorithms > involved and the contents of my corpora, I don't know enough python to > really give them a try in SpamBayes (although I'm working on that ;-). > > Regards, > -Ryan- > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -- ---------------------------------------------------------- Sign up now for Quotes of the Day, a handful of quotations on a theme delivered every morning. Enlightenment! Daily, for free! mailto:twisted@whidbey.com?subject=Subscribe_QOTD For web hosting and maintenance, visit Van's home page: http://www.domainvanhorn.com/van/ ---------------------------------------------------------- From skip at pobox.com Mon Sep 8 15:03:48 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 8 15:04:00 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2AF3D3F1@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2AF3D3F1@cliff.bai.org> Message-ID: <16220.53908.962367.331165@montanaro.dyndns.org> Ryan> This is what I was getting at, here are results from the most Ryan> recent 1549 messages of each of my own corpora, which are probably Ryan> biased towards HTML ham: Ryan> ham ham % spam spam % Ryan>

953 61.5% 1022 66.0% Ryan>
1223 79.0% 1009 65.1% Ryan>

Total 1549 1549 Ryan> As you can see, because so many people who use Outlook, Outlook Ryan> Express, and Notes to send me ham, HTML tags are present in a Ryan> great amount of what I receive. (Except of course for only seems to be ham when someone is sending excerpts from a Ryan> spreadsheet to me, and me photos or joke images.) Do you have any evidence which suggests that SpamBayes is not properly scoring your mail? Ryan> My basic argument is that arbitrarily throwing out some HTML Ryan> tokens in the parser, while leaving others, might make the filter Ryan> more effective for only certain corpora. What test corpora was Ryan> this decision based on? What HTML tokens are kept? Which are thrown out? As far as I know, all are discarded, though URLs are checked. I've appended a sample message pulled from mail I received within the last hour or so. Note that the score is bogus. I added these options to my ini file: [Classifier] max_discriminators: 1000 minimum_prob_strength: 0.0 to make sure all tokens were included in the debug header. The message actually scored 0.99 (rounded) using my current training database without resorting to tokenizing HTML. Note, in particular, that url:gif is a fairly spammy token for me. If most of the mail you get containing tags is spam, I suspect url:gif and url:jpg are spammy for you as well. Note all the url:* synthetic tokens that were generated. Also, note that the url:imgemail_r?_c1 tokens are generated even though they only appear in the tags. I think SpamBayes is extracting just about all the useful content it can from the message already, even from the tags. Adding an html:img token probably wouldn't change the way any given message scores (it wouldn't be much spammier than url:gif or url:jpg). It appears that SpamBayes is already generating several URL-related tokens per tag. It would simply add one more synthetic token to all those which are currently generated. Return-Path: Received: from localhost [127.0.0.1] by localhost with POP3 (fetchmail-6.1.0) for skip@localhost (single-drop); Mon, 08 Sep 2003 13:04:51 -0500 (CDT) Received: (from cmaster@localhost) by manatee.mojam.com (8.11.6/8.11.6) id h88I1QD16395 for skip@manatee.mojam.com; Mon, 8 Sep 2003 13:01:26 -0500 Received: from thanatos.imocos.com (thanatos.imocos.com [195.126.165.234] (may be forged)) by manatee.mojam.com (8.11.6/8.11.6) with SMTP id h88I1MG16381 for ; Mon, 8 Sep 2003 13:01:22 -0500 Received: from [134.178.134.172] by thanatos.imocos.com with ESMTP id E576BA30510; Tue, 09 Sep 2003 02:56:11 +0300 Message-ID: <2$08fr$w2u616$$-p2581-d15u46mzc@12iumxr> X-Mailer: MIME-tools 5.503 (Entity 5.501) MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="93F30FF9E3.0F__DFDC0_43" X-Priority: 3 X-MSMail-Priority: Normal X-UIDL: 1&9"!2Na"!p2I"!5EQ"! From: "Jarrod Hendrickson" To: webmaster@webfast.com Subject: RE: Join Global Remove NO Spam List Date: Tue, 09 Sep 03 02:56:11 GMT Reply-To: "Jarrod Hendrickson" X-Spambayes-Classification: spam; 0.88 X-Spambayes-Debug: '*H*': 0.01; '*S*': 0.76; 'subject:List': 0.07; 'subject:Spam': 0.26; 'subject:: ': 0.27; 'header:Message-ID:1': 0.39; 'subject:Global': 0.41; 'header:Received:4': 0.44; 'proto:http': 0.47; 'subject: ': 0.48; 'url:net': 0.49; 'header:To:1': 0.50; 'header:Subject:1': 0.50; 'header:From:1': 0.50; 'header:Date:1': 0.50; 'from:addr:d72lpkjdbc': 0.50; 'from:name:jarrod hendrickson': 0.50; 'message-id:@12iumxr': 0.50; 'url:gr1': 0.50; 'url:imgemail_r1_c1': 0.50; 'url:imgemail_r2_c1': 0.50; 'url:it5150': 0.50; 'url:sr1': 0.50; 'header:Return-Path:1': 0.50; 'to:2**0': 0.51; 'to:no real name:2**0': 0.53; 'header:MIME-Version:1': 0.53; 'header:Reply-To:1': 0.57; 'url:www': 0.63; 'subject:Remove': 0.66; 'subject:Join': 0.72; 'content-type:multipart/alternative': 0.75; 'url:htm': 0.82; 'content-type:text/html': 0.83; 'earthenware': 0.84; 'from:addr:laoficina.com': 0.84; 'to:addr:webmaster': 0.88; 'url:gif': 0.89; 'x-mailer:mime-tools 5.503 (entity 5.501)': 0.99; 'to:addr:webfast.com': 0.99 --93F30FF9E3.0F__DFDC0_43 Content-Type: text/html; Content-Transfer-Encoding: quoted-printable earthenware
--93F30FF9E3.0F__DFDC0_43-- Ryan> I think keeping some form of my detection of image-only spam, which seems to slip through Ryan> SpamBayes more often than other types of spam. I also think it Ryan> would be even better to have a multi-word token something like Ryan> that produced by the CRM-114 token generator, which could find Ryan> multi-tag strings like based on my knowledge of the algorithms involved and the contents Ryan> of my corpora, I don't know enough python to really give them a Ryan> try in SpamBayes (although I'm working on that ;-). Speculating doesn't make it so. You have to back your "I think"s up with some tests. Skip From skip at pobox.com Mon Sep 8 15:09:16 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 8 15:09:26 2003 Subject: [Spambayes] Classification not always added to the header: Follow-up In-Reply-To: <200309081433.06642.ltrag@optonline.net> References: <200309081240.09173.ltrag@optonline.net> <200309081433.06642.ltrag@optonline.net> Message-ID: <16220.54236.479183.165197@montanaro.dyndns.org> >> Last night, I shut my system down for the night. Did you cleanly exit pop3proxy before the shutdown? If not, the database file may have gotten hosed. Skip From fuerte at sci.fi Mon Sep 8 23:11:14 2003 From: fuerte at sci.fi (Harri Pesonen) Date: Mon Sep 8 15:11:21 2003 Subject: [Spambayes] Spambayes 1.0a5 comments In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E5D92@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13031E5D92@its-xchg4.massey.ac.nz> Message-ID: <3F5CD452.60004@sci.fi> Meyer, Tony wrote: >>It is good to have Unsure messages in top. But the button >>to Train should be there too. >> >> > >Wouldn't this encourage people to click 'Train' without actually >checking that the classification is correct? > >(Having the "save" button at the top (and bottom) of the config, page, >however, would be good). > But if the Train would train only Unsure messages... And why the Unsure messages were moved to the top then? >>Also I would like it to train only Unsure messages, and then >>have Train All button to train also ham and spam messages. >>Another option would be to have all other messages selected >>as Defer by default. >> >> > >The default is to have messages selected as their existing >classification. Most of the time this should be right, so makes sense >as the default. If you want to change them all to 'defer', this only >takes a single click (on the 'defer' heading). > Thanks, I didn't know that. >An option to only review unsures is an interesting idea. If we had more >data about which training regeme produces the best results, and training >on unsures only did well, then this would definitely be a good idea. As >it is, I wonder if it would just make things more complicated, >unnecessarily. You can open a feature request if you really want it, >and people can debate the idea. (Note that it won't get added until the >end of the feature freeze, in any case). > > It would be better to have a separate Train All button to train also Ham and Spam messages, because I have already: Spam: *1251* Ham: *535 *Or, wouldn't it be great if SpamBayes detected the amount of learnt spam and ham, and in my case, only learnt all ham by default? All spam messages would have Defer. That way the amount of spam and ham would get even eventually. Harri From rmalayter at bai.org Mon Sep 8 15:56:12 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Sep 8 15:56:16 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D3F4@cliff.bai.org> From: Skip Montanaro [mailto:skip@pobox.com] > Do you have any evidence which suggests > that SpamBayes is not properly scoring > your mail? Nothing but a few spams of the "image-only" type spams that slipped through with scores in the 60% range. > What HTML tokens are kept? Which are > thrown out? As far as I know, all are > discarded, though URLs are checked. I consider URLs to basically be HTML tags, since they often come from the inside of an HREF or IMG tag. Even if it's a plain-text URL, that serves the same function as an HREF tag, so it should be handled the same, right? > If most of the mail you get containing > tags is spam, I suspect url:gif and > url:jpg are spammy for you as well. As you suspected they are fairly, but not overwhelmingly spammy: 'url:gif' 0.767005 775 526 'url:jpg' 0.798242 398 325 > I think SpamBayes is extracting just about > all the useful content it can from the message > already, even from the tags. Adding an > html:img token probably wouldn't change the way > any given message scores (it wouldn't be much > spammier than url:gif or url:jpg). Most, but not all, of the useful content. For instance, in my corpora, HTML comments are only present in spam. Wouldn't an html:comment token provide more discriminating information than skipping the token altogether? Also, looking for a FONT tag with a COLOR= would help discriminate the "image-only" spams I see. The only reason I raised this issue is a few "image-only" spams with white-on-white random text have gotten by my SpamBayes (007 Plug-in) filter. That said, I'm getting a 96.3% capture rate now, with zero false positives to date. I'm very happy with SpamBayes, I just want to help make it a little bit better if possible. My intuition tells me ignoring HTML tags is ignoring useful content, but I could be totally wrong. I'm going to figure out how to add these tokens to a customized parser on my own, and report on the results. I'll see if they help at all. Regards, -Ryan- From arohter at nolar.com Mon Sep 8 16:05:53 2003 From: arohter at nolar.com (Alon) Date: Mon Sep 8 16:06:10 2003 Subject: [Spambayes] RE: imapfilter.py In-Reply-To: <6.0.0.9.0.20030908124435.01b9a4a8@mail.bbox.ch> References: <6.0.0.9.0.20030908124435.01b9a4a8@mail.bbox.ch> Message-ID: <60537687.1063033553@[192.168.0.20]> Sjoerd thinks this particular problem ( ) might be fixed with the latest spambayes/message.py (v 1.3.6). However, I cannot test it since anon cvs access is waaay behind. I'll let you know as soon as someone sends me the updated file, or anon cvs updates. aLoN > Unfortunately, I'm encountering the same problem as somebody else on the > list: Every time I classify my Inbox, a new set of SpamBayes headers is > attached to every message. In verbose mode, there's no warning "no dbm > modules available for MessageInfoDB?". I can send the -i4 readout if you > think it'll help. > > Rafael Scholl From skip at pobox.com Mon Sep 8 16:09:55 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 8 16:10:00 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2AF3D3F4@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2AF3D3F4@cliff.bai.org> Message-ID: <16220.57875.543102.433193@montanaro.dyndns.org> Ryan> Most, but not all, of the useful content. For instance, in my Ryan> corpora, HTML comments are only present in spam. Wouldn't an Ryan> html:comment token provide more discriminating information than Ryan> skipping the token altogether? Maybe. Try it and see. I know you begged off because of your Python beginner status in an earlier message, however the tokenizer is pretty straightforward stuff. If you check out the code from cvs, the worst that can happen is you screw it up so badly you have to delete the broken file(s) and execute "cvs up" to get back to a stable baseline. Ryan> Also, looking for a FONT tag with a COLOR= would help Ryan> discriminate the "image-only" spams I see. The only reason I Ryan> raised this issue is a few "image-only" spams with white-on-white Ryan> random text have gotten by my SpamBayes (007 Plug-in) filter. Again, you have to try it and see. I think we've established fairly well that most of the SpamBayes developers get very little valid HTML email. If we were to try any of your suggestions using our existing training databases the results would be inconclusive, at best. Ryan> That said, I'm getting a 96.3% capture rate now, with zero false Ryan> positives to date. I'm very happy with SpamBayes, I just want to Ryan> help make it a little bit better if possible. My intuition tells Ryan> me ignoring HTML tags is ignoring useful content, but I could be Ryan> totally wrong. Intuition is a trap which was laid (apparently by Bayes himself) for everyone whose SourceForge userids are associated with the tokenizer module. ;-) Ryan> I'm going to figure out how to add these tokens to a customized Ryan> parser on my own, and report on the results. I'll see if they help Ryan> at all. Why do you need a customized parser? You'd probably reach your end goal faster by reading and modifying tokenizer.py. If you have questions about it, post to spambayes-dev@python.org. I'm sure a few rudimentary Python questions not directly related to SpamBayes would probably be tolerated, at least in the context of a SpamBayes-related post, but if you have a lot of them, you'd be better off posting such missives to help@python.org or by joining the tutor@python.org mailing list. Skip From skip at pobox.com Mon Sep 8 16:11:31 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 8 16:11:32 2003 Subject: [Spambayes] RE: imapfilter.py In-Reply-To: <60537687.1063033553@[192.168.0.20]> References: <6.0.0.9.0.20030908124435.01b9a4a8@mail.bbox.ch> <60537687.1063033553@[192.168.0.20]> Message-ID: <16220.57971.503365.184141@montanaro.dyndns.org> alon> Sjoerd thinks this particular problem ( alon> 1702&atid=498103> ) might be fixed with the latest spambayes/message.py (v alon> 1.3.6). However, I cannot test it since anon cvs access is waaay alon> behind. I'll let you know as soon as someone sends me the updated alon> file, or anon cvs updates. If you're keen to get ahold of the latest source, register at SourceForge and then do a non-anonymous checkout. Skip From arohter at nolar.com Mon Sep 8 16:25:17 2003 From: arohter at nolar.com (Alon) Date: Mon Sep 8 16:25:28 2003 Subject: [Spambayes] RE: imapfilter.py In-Reply-To: <16220.57971.503365.184141@montanaro.dyndns.org> References: <6.0.0.9.0.20030908124435.01b9a4a8@mail.bbox.ch> <60537687.1063033553@[192.168.0.20]> <16220.57971.503365.184141@montanaro.dyndns.org> Message-ID: <61703109.1063034717@[192.168.0.20]> I am registered on sf, as a developer for another project. However, I don't think sf allows non-anon access, unless you're a developer on that particular project: cvs -z3 -d:ext:nolar@cvs.sourceforge.net:/cvsroot/spambayes co spambayes cvs server: Updating spambayes cvs server: failed to create lock directory for `/cvsroot/spambayes/spambayes' (/cvsroot/spambayes/spambayes/#cvs.lock): Permission denied cvs server: failed to obtain dir lock in repository `/cvsroot/spambayes/spambayes' cvs [server aborted]: read lock failed - giving up aLoN > If you're keen to get ahold of the latest source, register at SourceForge > and then do a non-anonymous checkout. > > Skip > From skip at pobox.com Mon Sep 8 16:44:34 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 8 16:44:36 2003 Subject: [Spambayes] RE: imapfilter.py In-Reply-To: <61703109.1063034717@[192.168.0.20]> References: <6.0.0.9.0.20030908124435.01b9a4a8@mail.bbox.ch> <60537687.1063033553@[192.168.0.20]> <16220.57971.503365.184141@montanaro.dyndns.org> <61703109.1063034717@[192.168.0.20]> Message-ID: <16220.59954.708818.605534@montanaro.dyndns.org> alon> I am registered on sf, as a developer for another project. alon> However, I don't think sf allows non-anon access, unless you're a alon> developer on that particular project: Oh, damn, you're right. If you'd like a tarball, let me know. Skip From ltrag at optonline.net Mon Sep 8 17:48:30 2003 From: ltrag at optonline.net (Lanny Trager) Date: Mon Sep 8 16:48:44 2003 Subject: [Spambayes] Classification not always added to the header: Follow-up In-Reply-To: <16220.54236.479183.165197@montanaro.dyndns.org> References: <200309081240.09173.ltrag@optonline.net> <200309081433.06642.ltrag@optonline.net> <16220.54236.479183.165197@montanaro.dyndns.org> Message-ID: <200309081647.31636.ltrag@optonline.net> On Monday 08 September 2003 15:09, Skip Montanaro wrote: > >> Last night, I shut my system down for the night. > > Did you cleanly exit pop3proxy before the shutdown? If not, the database > file may have gotten hosed. > > Skip Skip, Yes, last night I did exit pop3proxy by clicking on the save and shutdown button. I am still mystified as to how the offending line entered my ini file, though for now I am willing to accept that I somehow inadvertently did it myself. This morning though, after regenerating the ini file by saving the config page, I did not shutdown and restart. Thus although the ini file was fixed my problem initially was not. Since then, I have shutdown, again using the save and shutdown button, and restarted. it seems to be working again. Thanks for your help Lanny From skip at pobox.com Mon Sep 8 17:03:25 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 8 17:04:36 2003 Subject: [Spambayes] Classification not always added to the header: Follow-up In-Reply-To: <200309081647.31636.ltrag@optonline.net> References: <200309081240.09173.ltrag@optonline.net> <200309081433.06642.ltrag@optonline.net> <16220.54236.479183.165197@montanaro.dyndns.org> <200309081647.31636.ltrag@optonline.net> Message-ID: <16220.61085.285487.192769@montanaro.dyndns.org> Lanny> This morning though, after regenerating the ini file by saving Lanny> the config page, I did not shutdown and restart. That would do it. Most apps only read their config files at startup. Many Unix daemons will reload their configs or restart when sent a HUP signal. I suppose pop3proxy's web interface could grow a "reload configuration file" button if it doesn't already have one. Skip From ltrag at optonline.net Mon Sep 8 18:24:48 2003 From: ltrag at optonline.net (Lanny Trager) Date: Mon Sep 8 17:25:28 2003 Subject: [Spambayes] Classification not always added to the header: The Plot Thickens In-Reply-To: <16220.61085.285487.192769@montanaro.dyndns.org> References: <200309081240.09173.ltrag@optonline.net> <200309081647.31636.ltrag@optonline.net> <16220.61085.285487.192769@montanaro.dyndns.org> Message-ID: <200309081724.48485.ltrag@optonline.net> On Monday 08 September 2003 17:03, Skip Montanaro wrote: > Lanny> This morning though, after regenerating the ini file by saving > Lanny> the config page, I did not shutdown and restart. > > That would do it. Most apps only read their config files at startup. Many > Unix daemons will reload their configs or restart when sent a HUP signal. > I suppose pop3proxy's web interface could grow a "reload configuration > file" button if it doesn't already have one. > > Skip Skip, Shortly after responding to you I got another spam and it was not classified and did not show up on the Review page. What differentiated this one from others is that it was addressed to: me@machine.myispPOP.net (an old alias at my ISP that for me at least guarantees that it is spam) as opposed to others which go to me@myispPOP.net Now, where the plot thickens is that other spams to this alias address have been classified by SpamBayes and have showed up on the Review page. This was after I had restarted SpamBayes and it had been working fine. Lanny -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From arohter at nolar.com Mon Sep 8 17:42:33 2003 From: arohter at nolar.com (Alon) Date: Mon Sep 8 17:42:44 2003 Subject: [Spambayes] RE: imapfilter.py In-Reply-To: <60537687.1063033553@[192.168.0.20]> References: <6.0.0.9.0.20030908124435.01b9a4a8@mail.bbox.ch> <60537687.1063033553@[192.168.0.20]> Message-ID: <66332937.1063039353@[192.168.0.20]> Yup, new message.py fixes the problem. You can grab it here ( ), or just wait til the next release. aLoN > Sjoerd thinks this particular problem ( > =6 1702&atid=498103> ) might be fixed with the latest > spambayes/message.py (v 1.3.6). However, I cannot test it since anon cvs > access is waaay behind. I'll let you know as soon as someone sends me the > updated file, or anon cvs updates. > > aLoN > > > >> Unfortunately, I'm encountering the same problem as somebody else on the >> list: Every time I classify my Inbox, a new set of SpamBayes headers is >> attached to every message. In verbose mode, there's no warning "no dbm >> modules available for MessageInfoDB?". I can send the -i4 readout if you >> think it'll help. >> >> Rafael Scholl > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From wsy at merl.com Mon Sep 8 18:43:08 2003 From: wsy at merl.com (Bill Yerazunis) Date: Mon Sep 8 17:43:17 2003 Subject: [Spambayes] Mailing Lists In-Reply-To: <5.2.1.1.2.20030908074452.00b709d0@localhost> (message from theta sigma on Mon, 08 Sep 2003 07:48:09 -0700) References: <5.2.1.1.2.20030908074452.00b709d0@localhost> Message-ID: <200309082143.h88Lh8f03321@localhost.localdomain> From: theta sigma I subscribe to a number of mailing lists. All of these are ham. They always come from / to a consistent e-mail address. However, I can't seem to find a way to inform SpamBayes that all of these are ham. I have trained on them and on spam (balanced training). It continues to misclassify some of the mailing list e-mail due to certain tokens (legitimate uses of them). Any suggestions on how to handle this? This is one of the cases where a whitelist is actually valuable. The others are (in order of importance) Your paramour Your lawyer Your boss ;-) -Bill Yerazunis From mhammond at skippinet.com.au Tue Sep 9 09:26:32 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Mon Sep 8 18:26:34 2003 Subject: [Spambayes] Outlook Addin 008 binary - problem with new dialogmanager In-Reply-To: Message-ID: <1fb301c37658$41883130$f502a8c0@eden> Can you please send me a log? I can't get this behaviour. Thanks, Mark. > -----Original Message----- > From: spambayes-bounces+mhammond=keypoint.com.au@python.org > [mailto:spambayes-bounces+mhammond=keypoint.com.au@python.org] > On Behalf > Of Dean Bettinger > Sent: Monday, 8 September 2003 11:43 PM > To: spambayes@python.org > Subject: [Spambayes] Outlook Addin 008 binary - problem with new > dialogmanager > > > I just installed the 008 version as an upgrade to 007. > Thanks a bunch for > resolving the Outlook 2002 auto-complete issue and all the > other terrific > things you've done in this version and the rest! > > However, when I open the SpamBayes Manager and click on the > tabs once I > click on the Training tab the dialog box is stuck on the > Training dialog box > even though the tab title changes. > > Dean W. Bettinger > Director of Information Technology > CXtec (CABLExpress Technologies) > 5404 South Bay Road > PO Box 4799 > Syracuse, NY 13221-4799 > Phone: 315.476.3000 x2595 > Cell: 315.430.5213 > Fax: 315.455.1800 > www.CXtec.com > > CXtec helps customers reduce the cost of their networking > infrastructure. > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From dbettinger at cxtec.com Mon Sep 8 19:53:32 2003 From: dbettinger at cxtec.com (Dean Bettinger) Date: Mon Sep 8 18:53:49 2003 Subject: [Spambayes] Outlook Addin 008 binary - problem with new dialo gmanager Message-ID: Invalid option timer_start_delay in section Experimental in file C:\Documents and Settings\DBetting\Application Data\SpamBayes\default_bayes_customize.ini Invalid option timer_interval in section Experimental in file C:\Documents and Settings\DBetting\Application Data\SpamBayes\default_bayes_customize.ini Loaded bayes database from 'C:\Documents and Settings\DBetting\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\DBetting\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 1173 spam and 6378 good messages SpamBayes Outlook Addin, Binary version 0.8 (September 8, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.1.2600 (Service Pack 1) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] SpamBayes: Watching for new messages in folder Inbox SpamBayes: Watching for new messages in folder Inbox - Spam Processing missed spam in folder 'Inbox' by starting a timer Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 221, in OnInitDialog File "out1.pyz/dialogs.dlgcore", line 258, in LoadAllControls File "out1.pyz/dialogs.opt_processors", line 36, in Init File "out1.pyz/dialogs.opt_processors", line 320, in UpdateControl_FromValue File "out1.pyz/msgstore", line 337, in GetFolder msgstore.MsgStoreException: MsgStoreException: Exception 0x80004005 (Unspecified error): Unspecified error Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 270, in SaveAllControls File "out1.pyz/dialogs.dlgcore", line 262, in ApplyHandlingOptionValueError File "out1.pyz/dialogs.opt_processors", line 39, in Done File "out1.pyz/dialogs.opt_processors", line 167, in UpdateValue_FromControl KeyError: '' Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 270, in SaveAllControls File "out1.pyz/dialogs.dlgcore", line 262, in ApplyHandlingOptionValueError File "out1.pyz/dialogs.opt_processors", line 39, in Done File "out1.pyz/dialogs.opt_processors", line 167, in UpdateValue_FromControl KeyError: '' Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 270, in SaveAllControls File "out1.pyz/dialogs.dlgcore", line 262, in ApplyHandlingOptionValueError File "out1.pyz/dialogs.opt_processors", line 39, in Done File "out1.pyz/dialogs.opt_processors", line 167, in UpdateValue_FromControl KeyError: '' Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 270, in SaveAllControls File "out1.pyz/dialogs.dlgcore", line 262, in ApplyHandlingOptionValueError File "out1.pyz/dialogs.opt_processors", line 39, in Done File "out1.pyz/dialogs.opt_processors", line 167, in UpdateValue_FromControl KeyError: '' Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 270, in SaveAllControls File "out1.pyz/dialogs.dlgcore", line 262, in ApplyHandlingOptionValueError File "out1.pyz/dialogs.opt_processors", line 39, in Done File "out1.pyz/dialogs.opt_processors", line 167, in UpdateValue_FromControl KeyError: '' Saving configuration -> C:\Documents and Settings\DBetting\Application Data\SpamBayes\MS Exchange Settings.ini -----Original Message----- From: Mark Hammond [mailto:mhammond@skippinet.com.au] Sent: Monday, September 08, 2003 6:27 PM To: 'Dean Bettinger'; spambayes@python.org Subject: RE: [Spambayes] Outlook Addin 008 binary - problem with new dialogmanager Can you please send me a log? I can't get this behaviour. Thanks, Mark. > -----Original Message----- > From: spambayes-bounces+mhammond=keypoint.com.au@python.org > [mailto:spambayes-bounces+mhammond=keypoint.com.au@python.org] > On Behalf > Of Dean Bettinger > Sent: Monday, 8 September 2003 11:43 PM > To: spambayes@python.org > Subject: [Spambayes] Outlook Addin 008 binary - problem with new > dialogmanager > > > I just installed the 008 version as an upgrade to 007. > Thanks a bunch for > resolving the Outlook 2002 auto-complete issue and all the > other terrific > things you've done in this version and the rest! > > However, when I open the SpamBayes Manager and click on the > tabs once I > click on the Training tab the dialog box is stuck on the > Training dialog box > even though the tab title changes. > > Dean W. Bettinger > Director of Information Technology > CXtec (CABLExpress Technologies) > 5404 South Bay Road > PO Box 4799 > Syracuse, NY 13221-4799 > Phone: 315.476.3000 x2595 > Cell: 315.430.5213 > Fax: 315.455.1800 > www.CXtec.com > > CXtec helps customers reduce the cost of their networking > infrastructure. > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From rafael.scholl at bbox.ch Tue Sep 9 02:12:06 2003 From: rafael.scholl at bbox.ch (Rafael Scholl) Date: Mon Sep 8 19:12:17 2003 Subject: [Spambayes] RE: imapfilter.py In-Reply-To: <66332937.1063039353@[192.168.0.20]> References: <60537687.1063033553@[192.168.0.20]> <6.0.0.9.0.20030908124435.01b9a4a8@mail.bbox.ch> <60537687.1063033553@[192.168.0.20]> Message-ID: <6.0.0.9.0.20030909004342.01b8ed18@mail.bbox.ch> >Yup, new message.py fixes the problem. You can grab it here ( >le_id=61001&aid=801952> ), or just wait til the next release. It works, thanks! Rafael Scholl * Checked for virus by F-Secure Content Scanner 6.10b36 * From dbettinger at cxtec.com Mon Sep 8 21:23:28 2003 From: dbettinger at cxtec.com (Dean Bettinger) Date: Mon Sep 8 20:23:45 2003 Subject: [Spambayes] Outlook Addin 008 binary - problem with new dialo g manager Message-ID: Hmmm... After playing around a while, I realized that I couldn't make any changes to the Training tab dialog items. I then used the configuration wizard to set training "I haven't prepared" and went to then end of that wizard. I'm now able to use the training tab as designed and the problem disappeared. Must be something in my configuration weirded out the program. Dean -----Original Message----- From: Dean Bettinger [mailto:dbettinger@cxtec.com] Sent: Monday, September 08, 2003 6:54 PM To: 'Mark Hammond'; spambayes@python.org Subject: RE: [Spambayes] Outlook Addin 008 binary - problem with new dialo gmanager Invalid option timer_start_delay in section Experimental in file C:\Documents and Settings\DBetting\Application Data\SpamBayes\default_bayes_customize.ini Invalid option timer_interval in section Experimental in file C:\Documents and Settings\DBetting\Application Data\SpamBayes\default_bayes_customize.ini Loaded bayes database from 'C:\Documents and Settings\DBetting\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\DBetting\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 1173 spam and 6378 good messages SpamBayes Outlook Addin, Binary version 0.8 (September 8, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.1.2600 (Service Pack 1) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] SpamBayes: Watching for new messages in folder Inbox SpamBayes: Watching for new messages in folder Inbox - Spam Processing missed spam in folder 'Inbox' by starting a timer Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 221, in OnInitDialog File "out1.pyz/dialogs.dlgcore", line 258, in LoadAllControls File "out1.pyz/dialogs.opt_processors", line 36, in Init File "out1.pyz/dialogs.opt_processors", line 320, in UpdateControl_FromValue File "out1.pyz/msgstore", line 337, in GetFolder msgstore.MsgStoreException: MsgStoreException: Exception 0x80004005 (Unspecified error): Unspecified error Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 270, in SaveAllControls File "out1.pyz/dialogs.dlgcore", line 262, in ApplyHandlingOptionValueError File "out1.pyz/dialogs.opt_processors", line 39, in Done File "out1.pyz/dialogs.opt_processors", line 167, in UpdateValue_FromControl KeyError: '' Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 270, in SaveAllControls File "out1.pyz/dialogs.dlgcore", line 262, in ApplyHandlingOptionValueError File "out1.pyz/dialogs.opt_processors", line 39, in Done File "out1.pyz/dialogs.opt_processors", line 167, in UpdateValue_FromControl KeyError: '' Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 270, in SaveAllControls File "out1.pyz/dialogs.dlgcore", line 262, in ApplyHandlingOptionValueError File "out1.pyz/dialogs.opt_processors", line 39, in Done File "out1.pyz/dialogs.opt_processors", line 167, in UpdateValue_FromControl KeyError: '' Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 270, in SaveAllControls File "out1.pyz/dialogs.dlgcore", line 262, in ApplyHandlingOptionValueError File "out1.pyz/dialogs.opt_processors", line 39, in Done File "out1.pyz/dialogs.opt_processors", line 167, in UpdateValue_FromControl KeyError: '' Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 270, in SaveAllControls File "out1.pyz/dialogs.dlgcore", line 262, in ApplyHandlingOptionValueError File "out1.pyz/dialogs.opt_processors", line 39, in Done File "out1.pyz/dialogs.opt_processors", line 167, in UpdateValue_FromControl KeyError: '' Saving configuration -> C:\Documents and Settings\DBetting\Application Data\SpamBayes\MS Exchange Settings.ini -----Original Message----- From: Mark Hammond [mailto:mhammond@skippinet.com.au] Sent: Monday, September 08, 2003 6:27 PM To: 'Dean Bettinger'; spambayes@python.org Subject: RE: [Spambayes] Outlook Addin 008 binary - problem with new dialogmanager Can you please send me a log? I can't get this behaviour. Thanks, Mark. > -----Original Message----- > From: spambayes-bounces+mhammond=keypoint.com.au@python.org > [mailto:spambayes-bounces+mhammond=keypoint.com.au@python.org] > On Behalf > Of Dean Bettinger > Sent: Monday, 8 September 2003 11:43 PM > To: spambayes@python.org > Subject: [Spambayes] Outlook Addin 008 binary - problem with new > dialogmanager > > > I just installed the 008 version as an upgrade to 007. > Thanks a bunch for > resolving the Outlook 2002 auto-complete issue and all the > other terrific > things you've done in this version and the rest! > > However, when I open the SpamBayes Manager and click on the > tabs once I > click on the Training tab the dialog box is stuck on the > Training dialog box > even though the tab title changes. > > Dean W. Bettinger > Director of Information Technology > CXtec (CABLExpress Technologies) > 5404 South Bay Road > PO Box 4799 > Syracuse, NY 13221-4799 > Phone: 315.476.3000 x2595 > Cell: 315.430.5213 > Fax: 315.455.1800 > www.CXtec.com > > CXtec helps customers reduce the cost of their networking > infrastructure. > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From tim.one at comcast.net Tue Sep 9 00:11:02 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 9 00:57:53 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <200309080056.h880ux400814@localhost.localdomain> Message-ID: >>> E pur si moivre, dude. E pur si moivre. >> There more kinds of email users in heaven and earth than are >> dreamt of in your classifier, Bill. [Bill Yerazunis] > That's one thing I _like_ about this list. At least y'all are > moderately literate. :-) Perhaps, but I actually had no idea what epursimoivre meant . > ... > Well, on the grounds that the SpamAssassin corpus is a little less > biased, I re-ran the tests against the .css files that the SA test > corpus generates (using the TOE learning strategy). Accuracy on this > corpus is just over 98% for crm114, and barely 70% for me-the-human. > > The results for SA test corpus: > > Token Spam Nonspam >

143 144 >
380 289 >

67 119 > > So, it seems that "font" is somewhat spammy, and so is "br", > but is totally equivocal. > > Does this help? :) It helps Ryan's thesis that HTML isn't uncommon in ham. If the stats had been more like Token Spam Nonspam

143 10
380 5

67 0 Message-ID: [Ryan Malayter, to Bill Yerazunis] > ... > My basic argument is that arbitrarily throwing out some HTML tokens in > the parser, while leaving others, might make the filter more effective > for only certain corpora. What test corpora was this decision based > on? Note that Bill doesn't work on spambayes; he heads CRM114: http://crm114.sourceforge.net/ IIUC, CRM114 doesn't throw away any HTML decorations; spambayes throws away all HTML decorations (although it sucks out and specially tokenizes everything that "looks like" a URL, and regardless of whether it's hiding in HTML or sitting in plain view). spambayes was developed against many peoples' test corpora, although, as I said before, I don't think any of them had a significant quantity of HTML ham (and I don't think Bill's did either). Some did have *some* HTML ham, though, and that's what drove the spambayes decision to throw away HTML decorations (else-- and we're just going in circles here --there was no chance that the little bit of HTML ham wouldn't get misclassified as spam every time; my saying that isn't a matter of argument, it's reporting what actually happened). > I think keeping some form of detection of image-only spam, which seems to slip through SpamBayes > more often than other types of spam. I believe that, not really because it's image-only, but because image-only messages often have very few features to judge. It's hard for a brief message to get a strong score in either direction in spambayes. It's easier under CRM114 because that generates many more features from a given message than spambayes generates. > I also think it would be even better to have a multi-word token something > like that produced by the CRM-114 token generator, which could find > multi-tag strings like my knowledge of the algorithms involved and the contents of my corpora, > I don't know enough python to really give them a try in SpamBayes > (although I'm working on that ;-). In spambayes I'd be more inclined to write special code to identify the img-src-http dance, and synthesize a token for that. It's only one token, though, and all tokens carry the same weight here -- it may still not be enough to give "a typical" short message of this ilk a strong enough score to nail it. The only way to know is to try it. From chris at lockergnome.com Mon Sep 8 20:20:19 2003 From: chris at lockergnome.com (Chris Pirillo) Date: Tue Sep 9 01:07:08 2003 Subject: [Spambayes] New Icons for Outlook Plugin Message-ID: <008701c37678$ea1b0be0$641ba8c0@stormshadow> e-Greetings! I finally installed SpamBayes for Outlook, and... I wish I had done so yesterday. ;) Anyway, I decided that the face icons (while cute), did not quite fit my style of thinking: http://chris.pirillo.com/images/outlookplugin.gif The 8-ball comes with Outlook, but I created the Cue ball in two minutes in the editor. You might think about using this in future versions. Each option is much easier to detect, and when one is behind the 8-ball, well... they're dead meat. Thanks for doing this for Outlook users. It's wonderful! Chris Pirillo http://www.lockergnome.com/ From T.A.Meyer at massey.ac.nz Tue Sep 9 13:58:06 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 01:09:15 2003 Subject: [Spambayes] RE: Outlook Addin 008 binary - problem with newdialogmanager Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A5F8@its-xchg4.massey.ac.nz> [Dean] > However, when I open the SpamBayes Manager and click on the tabs once > I click on the Training tab the dialog box is stuck on the Training > dialog box even though the tab title changes. [Bob] > I confirm that, and I think we need an "008a" to fix it. I > don't want to have to try to explain that one to users. What version of Outlook are you using? Neither Mark nor I see this with either the source or the binary. I have Outlook 2002 SP2, and I think he has both 2002 SP2 and 2000. =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 9 14:21:24 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 01:23:54 2003 Subject: [Spambayes] Initial Start Up Problems Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A61E@its-xchg4.massey.ac.nz> > Sorry to keep pestering you but I think I'm almost there. I > have the pop3proxy working. The only problem is that I can't > get the test and review pages from the user interface, the > configure page has gone missing too, even when pop3proxy is > running. If all those pages are missing, then I suspect they all are (if it looks like the front page is there, it might just be the browser cache). > You said something in your last email about trying > the proxy on a different port. It only works for incoming > mail when it is set to 110 but I've tried it with other > numbers (you suggeste 9980) to access the above pages. I t > didn't work. Do you have any other suggestions? You're changing the port that the interface is on, not the port that the proxy is served on, right? If you use the '-u' switch, this changes the *interface* port. For example, if you run "pop3proxy.py -u 8881" you will find the interface at . To change the port that the proxy is served on, you use the configuration page (preferably), and then change your mail client to use the port that you have chosen. If pop3proxy is working, then you should leave the proxy port alone. You might need to change the interface port, though, depending on your system. > At the moment I am not using the stmpproxy for outgoing mail, > I'm connecting straight to my mail server. Idon't suppose > could be causing any problems? No, using smtpproxy is completely optional. =Tony Meyer From japeters at pacbell.net Tue Sep 9 14:36:42 2003 From: japeters at pacbell.net (John A. Peters) Date: Tue Sep 9 01:34:12 2003 Subject: [Spambayes] Newbie questions on XP Message-ID: <02dd01c37712$141da670$6502a8c0@Centrino> I like SpamBayes, it works well and has great potiental. To train it on my Windows XP machine with POP3 the only way I have had success is the Train on Proxied messages - Review Messages What does this error message mean? Training... Trained on 1 message. Saving... Done. 500 Server error Traceback (most recent call last): File "C:\spambayes\spambayes-1.0a5\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "C:\spambayes\spambayes-1.0a5\spambayes\ProxyUI.py", line 402, in onReview messageInfo = self._makeMessageInfo(message) File "C:\spambayes\spambayes-1.0a5\spambayes\ProxyUI.py", line 492, in _makeMessageInfo text = text.replace(' ', ' ') # Else they'll be quoted AttributeError: 'NoneType' object has no attribute 'replace' The other way I do it is to take messages from the unsure folder and put them in to either of two folders. Train+Good Train-Spam Then I have to copy these two files and give them a shorter name since I seem to be unable to browse them with the long file names that Windows OE assigns to them. The browser windo is not big enough for the long file name. Then I point the trainer at these files. That works fine but is time consuming. I would love to be able to foreward messages to spambayes_ham@localhost and spambayes_spam@localhost but they do not work. If any one wants to take a look at my system I am running VNC or maybe the XP remote could be used. By the way the text boxs are almost too short where the above address are shown in in the web browser section labled SMTP Proxy Options Thank You, John A. Peters, San Francisco, (415) 509-3599 Cell japeters@pacbell.net Japeters1939 on Yahoo IM From T.A.Meyer at massey.ac.nz Tue Sep 9 12:56:21 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 01:48:48 2003 Subject: [Spambayes] Undo for spambayes Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A596@its-xchg4.massey.ac.nz> > Can I delete the contents of the SPAM folder as time goes on, > as it is growing steadily. I remember reading in the > documentation that some of the contents should remain in the > event that the databases have to be rebuilt and the software > retrained. > > How much would be necessary to complete this operation? Please see FAQ 4.5: =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 9 13:00:48 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 01:48:56 2003 Subject: [Spambayes] Mailing Lists Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A59E@its-xchg4.massey.ac.nz> > I subscribe to a number of mailing lists. All of these are > ham. They always come from / to a consistent e-mail address. However, > I can't seem to find a way to inform SpamBayes that all of these are ham. > I have trained on them and on spam (balanced training). It continues to > misclassify some of the mailing list e-mail due to certain tokens > (legitimate uses of them). > > Any suggestions on how to handle this? The problem here is that spam might still come from those lists. Can you give an example of a message that is being misclassified (including the clues for the message)? (And by misclassified, do you mean as unsure or as spam?). One way around this (although sufficient training should stop this) is to not filter that mail. Have a rule in your mail client move these messages into a separate folder and don't have spambayes filter that folder. (BTW, it's great if you can include the mail client and spambayes application that you are using when you post; it makes comments like this clearer). =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 9 13:34:35 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 01:49:04 2003 Subject: [Spambayes] RE: imapfilter.py Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A5D3@its-xchg4.massey.ac.nz> > 11:10.75 < * LIST (\Select) "/" "Sent Items" > 11:10.75 < * LIST (\Select) "/" SPAM [...] > Interestingly, the names are displayed correctly here, and > those that are correct in the HTML interface as well are in > quotation marks! The quotation marks are meant to be there (according to the RFC). imapfilter assumes that they are and strips the first and last characters, which should be the quotation marks. I've checked in a fix so that it only strips them if they are quotation marks, and not otherwise. > Unfortunately, I'm encountering the same problem as somebody > else on the list: Every time I classify my Inbox, a new set of > SpamBayes headers is attached to every message. I had managed to track this down to the messageinfo db that gets created, and then Sjoerd checked in a fix for this too (thanks!). Quite an incentive for imapfilter users to upgrade to 1.0a6 when it comes out shortly, I suppose (dammit). I posted the updated message.py in the previous message; you should be able to replace yours with this and fix that problem. =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 9 13:55:26 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 01:49:09 2003 Subject: [Spambayes] Classification not always added to the header: ThePlot Thickens Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A5F2@its-xchg4.massey.ac.nz> [Lanny] > This morning though, after regenerating the ini > file by saving the config page, I did not shutdown and restart. [Skip] > That would do it. Most apps only read their config files at startup. > Many Unix daemons will reload their configs or restart when sent a HUP > signal. I suppose pop3proxy's web interface could grow a "reload > configuration file" button if it doesn't already have one. Clicking 'save' on the config (or advanced config) page should reload the options. The invalid option shouldn't have stopped mail being classified, though. That option was changed in 1.0a5 (a note about it is at the top of the release notes). The add_mailid_to option was replaced with the "add_unique_id" option (in the "Headers" section, not "pop3proxy"), and now defaults to True (so it shouldn't need to be in the config file). [Lanny] > Shortly after responding to you I got another spam and it was > not classified and did not show up on the Review page. If you look at the headers of the message, is there a X-Spambayes-Exception header? If something went wrong, then it wouldn't be classified or be visible on the review page, but it should have an exception header that lets us know exactly what went wrong. If it doesn't have any spambayes headers, then something rather more odd is going on. =Tony Meyer From adam.walker at rbwconsulting.com Mon Sep 8 21:30:32 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Tue Sep 9 01:50:25 2003 Subject: [Spambayes] Outlook Addin 008 binary - problem with new dialog manager In-Reply-To: Message-ID: <20030909003049.6CFA913E23A@sack.dreamhost.com> I just posted this to the bug on sf... """ Ok, to reproduce. Create a new test folder. Set the spam folder to the new folder. Delete the folder and empty the trash. The problem is that the FolderID processor (and the folder selection dialog) except a "None" value return instead of an exception. """ I also had a patch, but as it relates to the msgstore, I thought it best to let Mark make the call. --Adam > -----Original Message----- > From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] > On Behalf Of Dean Bettinger > Sent: Monday, September 08, 2003 8:23 PM > To: 'Mark Hammond'; spambayes@python.org > Subject: RE: [Spambayes] Outlook Addin 008 binary - problem with new > dialog manager > > Hmmm... After playing around a while, I realized that I couldn't make any > changes to the Training tab dialog items. I then used the configuration > wizard to set training "I haven't prepared" and went to then end of that > wizard. I'm now able to use the training tab as designed and the problem > disappeared. > > Must be something in my configuration weirded out the program. > > Dean > From T.A.Meyer at massey.ac.nz Tue Sep 9 13:24:43 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 01:50:31 2003 Subject: [Spambayes] RE: imapfilter.py Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A5CA@its-xchg4.massey.ac.nz> > Sjoerd thinks this particular problem ( > 52&group_id=6 > 1702&atid=498103> ) might be fixed with the latest > spambayes/message.py > However, I cannot test it since anon cvs access is waaay behind. > I'll let you know as soon as someone sends me the updated > file, or anon cvs updates. Yes, I imagine it would be. I knew it must be something to do with the message.py changes to the messageinfo database. Annoyingly, this could mean subtle problems for pop3proxy users, too. message.py from cvs is attached. =Tony Meyer -------------- next part -------------- A non-text attachment was scrubbed... Name: message.py Type: application/octet-stream Size: 13344 bytes Desc: message.py Url : http://mail.python.org/pipermail/spambayes/attachments/20030909/c558f878/message-0001.obj From mhammond at skippinet.com.au Tue Sep 9 12:22:23 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue Sep 9 01:50:38 2003 Subject: [Spambayes] Outlook Addin 008 binary - problem with new dialog manager In-Reply-To: <20030909003049.6CFA913E23A@sack.dreamhost.com> Message-ID: <013201c37670$d32296e0$f502a8c0@eden> Thanks Adam, Yes, any "invalid" folder would cause this problem. Unfortunately, "invalid" also means valid folders that are currently offline :( I've made a fix, and I guess I better release a new version :( I will stick to a 008 release - 008a.exe, I think (unless someone has a better idea :) Mark. > -----Original Message----- > From: Adam Walker [mailto:adam.walker@rbwconsulting.com] > Sent: Tuesday, 9 September 2003 10:31 AM > To: 'Dean Bettinger'; 'Mark Hammond'; spambayes@python.org > Subject: RE: [Spambayes] Outlook Addin 008 binary - problem with new > dialog manager > > > I just posted this to the bug on sf... > > """ > Ok, to reproduce. Create a new test folder. Set the spam > folder to the new folder. Delete the folder and empty the trash. > > > > The problem is that the FolderID processor (and the folder > selection dialog) except a "None" value return instead of an > exception. > """ > > I also had a patch, but as it relates to the msgstore, I > thought it best to > let Mark make the call. > --Adam > > > -----Original Message----- > > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org] > > On Behalf Of Dean Bettinger > > Sent: Monday, September 08, 2003 8:23 PM > > To: 'Mark Hammond'; spambayes@python.org > > Subject: RE: [Spambayes] Outlook Addin 008 binary - problem with new > > dialog manager > > > > Hmmm... After playing around a while, I realized that I > couldn't make any > > changes to the Training tab dialog items. I then used the > configuration > > wizard to set training "I haven't prepared" and went to > then end of that > > wizard. I'm now able to use the training tab as designed > and the problem > > disappeared. > > > > Must be something in my configuration weirded out the program. > > > > Dean > > > > From T.A.Meyer at massey.ac.nz Tue Sep 9 19:00:03 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 02:00:34 2003 Subject: [Spambayes] Newbie questions on XP Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A7A6@its-xchg4.massey.ac.nz> > What does this error message mean? [...] > File "C:\spambayes\spambayes-1.0a5\spambayes\ProxyUI.py", > line 492, in _makeMessageInfo > text = text.replace(' ', ' ') # Else they'll be quoted > > AttributeError: 'NoneType' object has no attribute 'replace' Does this happen with any messages? (i.e. if you clear out the cache directories, does it immediately happen?). I'll check in a fix for this, but I'm not sure I see why 'text' is ending up as None. Could you post a copy of a message that crashes it like this? > I would love to be able to foreward messages to > spambayes_ham@localhost and spambayes_spam@localhost but they > do not work. If any one wants to take a look at my system I > am running VNC or maybe the XP remote could be used. Your message looked like it was written in Outlook Express; is this the case? OE will not work with the SMTP proxy, because it won't forward the headers of the messages. You could use it manually, I suppose: * right click on the message, choose 'properties' * click the 'details' tab * scroll down until you find a "X-Spambayes-ID" header * select that line and press control-c * press 'cancel' * compose a new message to spambayes_ham or spambayes_spam * paste the line that was copied into the subject * send It's a fair bit of effort, though. Your best hope (IMO), is the forthcoming spambayes application that will let you drag mail to be trained into a (provided) folder, within OE (or other mail clients). It's still in development, but should be ready for (alpha) release soon. =Tony Meyer From mhammond at skippinet.com.au Tue Sep 9 17:24:49 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue Sep 9 02:24:51 2003 Subject: [Spambayes] New Icons for Outlook Plugin In-Reply-To: <008701c37678$ea1b0be0$641ba8c0@stormshadow> Message-ID: <010a01c3769b$146b4a50$f502a8c0@eden> > e-Greetings! > > I finally installed SpamBayes for Outlook, and... I wish I had done so > yesterday. ;) Anyway, I decided that the face icons (while > cute), did not > quite fit my style of thinking: > > http://chris.pirillo.com/images/outlookplugin.gif Sorry, but I don't like them that much, as I have to think too much to work out what they mean. I guess the "black ball" is a reference to being "black balled", which I don't think is a reference to pool :) I quite like Inboxer - it has a mail icon, with a big blue tick or red cross. Also, I unfortunately need the images in a .bmp format. And one day I need to crack the transparency issue :( > Thanks for doing this for Outlook users. It's wonderful! Our pleasure - for some definitions of the word "pleasure" Mark. From rcoe at CambridgeMA.GOV Mon Sep 8 23:29:44 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Tue Sep 9 02:36:58 2003 Subject: [Spambayes] RE: Outlook Addin 008 binary - problem with new dialog manager Message-ID: > From: Dean Bettinger [mailto:dbettinger@cxtec.com] > Sent: Monday, September 08, 2003 8:23 PM > To: 'Mark Hammond'; spambayes@python.org > Subject: RE: [Spambayes] Outlook Addin 008 binary - problem with new > dialog manager > > > Hmmm... After playing around a while, I realized that I couldn't make any > changes to the Training tab dialog items. I then used the configuration > wizard to set training "I haven't prepared" and went to the end of that > wizard. I'm now able to use the training tab as designed and the problem > disappeared. > > Must be something in my configuration weirded out the program. > > Dean No, there's definitely a problem. I see the same behavior you saw. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 From ltrag at optonline.net Mon Sep 8 23:47:35 2003 From: ltrag at optonline.net (Lanny Trager) Date: Tue Sep 9 02:37:42 2003 Subject: [Spambayes] Classification not always added to the header: ThePlot Thickens In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130332A5F2@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130332A5F2@its-xchg4.massey.ac.nz> Message-ID: <200309082247.35492.ltrag@optonline.net> On Monday 08 September 2003 20:55, Meyer, Tony wrote: > [Lanny] > > > This morning though, after regenerating the ini > > file by saving the config page, I did not shutdown and restart. > > [Skip] > > > That would do it. Most apps only read their config files at startup. > > > > Many Unix daemons will reload their configs or restart when sent a HUP > > > > signal. I suppose pop3proxy's web interface could grow a "reload > > configuration file" button if it doesn't already have one. > > Clicking 'save' on the config (or advanced config) page should reload > the options. > > The invalid option shouldn't have stopped mail being classified, though. > That option was changed in 1.0a5 (a note about it is at the top of the > release notes). The add_mailid_to option was replaced with the > "add_unique_id" option (in the "Headers" section, not "pop3proxy"), and > now defaults to True (so it shouldn't need to be in the config file). > > [Lanny] > > > Shortly after responding to you I got another spam and it was > > not classified and did not show up on the Review page. > > If you look at the headers of the message, is there a > X-Spambayes-Exception header? If something went wrong, then it wouldn't > be classified or be visible on the review page, but it should have an > exception header that lets us know exactly what went wrong. If it > doesn't have any spambayes headers, then something rather more odd is > going on. > > =Tony Meyer Tony, Nothing at all in the headers, from SpamBayes that is: Return-Path: Received: from [200.106.25.173] (HELO wxs.nl) by admin.nni.com (CommuniGate Pro SMTP 4.1b2) with ESMTP id 186073940 for lanny@munged.munged.net; Mon, 08 Sep 2003 17:05:33 -0400 Message-ID: <3F5D199D.D8E8C121@optusnet.com.au> From: "Dorthea Hollister" To: lanny@munged.munged.net Subject: The problem is solved Date: Tue, 09 Sep 2003 00:06:53 +0000 MIME-Version: 1.0 User-Agent: Mozilla/5.036 (Windows; U; Windows NT 5.0; U; NT4.0; en-us) Gecko/25250101 Content-Type: text/html Content-Transfer-Encoding: 8bit Status: R X-Status: N X-KMail-EncryptionState: X-KMail-SignatureState: That's what has me stumped. There doesn't seem to be a clue. I'm going to shut down and save SpamBayes and then shutdown the machine itself around mid-night or so and start again in the morning. Lanny From mhammond at skippinet.com.au Tue Sep 9 17:41:13 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue Sep 9 02:41:23 2003 Subject: [Spambayes] RE: Outlook Addin 008 binary - problem with new dialogmanager In-Reply-To: Message-ID: <01c001c3769d$5ced3b60$f502a8c0@eden> > No, there's definitely a problem. I see the same behavior you saw. Please try the new 008.1 version. Thanks, Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1484 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030909/3319d89a/winmail.bin From anthony at interlink.com.au Tue Sep 9 17:40:56 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Sep 9 02:42:07 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: Message-ID: <200309090640.h896evvB019793@localhost.localdomain> >>> "Tim Peters" wrote > spambayes was developed against many peoples' test corpora, although, as I > said before, I don't think any of them had a significant quantity of HTML > ham (and I don't think Bill's did either). Not true - my mungo-testset of 30K or so items had a significant amount of HTML - this is why I jumped up and down on the HTML stuff. Tim's original python-list archive had almost no HTML ham. > In spambayes I'd be more inclined to write special code to identify the > img-src-http dance, and synthesize a token for that. It's only one token, > though, and all tokens carry the same weight here -- it may still not be > enough to give "a typical" short message of this ilk a strong enough score > to nail it. The only way to know is to try it. Note also that synthesising a bunch of (highly correlated) clues to attempt to fix the problem of the single URL spams often leads to unexpected (bad) consequences with scoring other email. In general, highly correlated clues are bad. This is why it's necessary to test additions to make sure that in fixing one problem you're not adding 4 others. Someone should go through the list archives and work out the ratio of attempted tokeniser tricks that made things worse to ones that actually improved the situation. I'd guess that it's something like 4 rejected for every one that went in... Anthony -- Anthony Baxter It's never too late to have a happy childhood. From T.A.Meyer at massey.ac.nz Tue Sep 9 21:29:10 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 04:29:37 2003 Subject: [Spambayes] Classification not always added to the header: ThePlotThickens Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A7B1@its-xchg4.massey.ac.nz> > Nothing at all in the headers, from SpamBayes that is: [...] > That's what has me stumped. There doesn't seem to be a clue. > I'm going to shut down and save SpamBayes and then shutdown the > machine itself around mid-night or so and start again in the morning. I can't see how a message can go through pop3proxy and not get any X-Spambayes header added at all. This leads me to believe that the message isn't going through pop3proxy at all. Is it possible that your mail client is connecting directly to the POP3 server in some cases? =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 9 21:31:26 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 04:31:34 2003 Subject: [Spambayes] Contribute to the SpamBates Team Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A7B2@its-xchg4.massey.ac.nz> > I am an experienced webmaster, I just installed the filter > and saw that any help would be appreciated. I didn't know if > this qualifies so I am sending you an email. I was hoping that someone else would answer this, but since they haven't... :) I'm sure everyone would be open to patches of the website. You can get a copy of the current version from cvs (the "website" directory right at the top). I would imagine, though, that people would prefer something that's relatively simple and text-based. I could be wrong :) Comments about the pop3proxy/imapfilter web interface would also be welcome. =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 9 21:32:52 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 04:33:15 2003 Subject: [Spambayes] Eudora Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A7B3@its-xchg4.massey.ac.nz> > I use Eudora on a Win2k. Can i use the executable or the > pop3proxy and edit the eudora ini file? Yes, you can use pop3proxy (not the Outlook plug-in, obviously). See FAQ 2.3: =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 9 21:34:13 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 04:34:25 2003 Subject: [Spambayes] Cannot 'Delete As Spam'....Help Plz! Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A7B4@its-xchg4.massey.ac.nz> > I am using SpamBayes Version 0.7 with Windows XP. Everytime > I hit 'Delete As Spam', it tells me that I must enable > SpamBayes first. I thought it was automatically enabled. > How do I get this to work and enable SpamBayes?? You may wish to upgrade to version 008.1, which has a training wizard and is enabled by default. If not, please see FAQ 3.8: =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 9 21:38:01 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 04:38:19 2003 Subject: [Spambayes] Using SpamBayes with SSL mail server Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A7B6@its-xchg4.massey.ac.nz> > I'm trying to set up spambayes pop3proxy with an SSL pop server. > > the server is set to mail.blah.com:995 and the port to 995. > But it always times out. There is an open feature request for this. You can track it here: It was opened in June, though, so there's not much in the way of fast movement. I'll try and get to this once the current feature freeze is over (in a couple of weeks), so it should be in 1.1a1. =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 9 21:39:12 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 04:39:18 2003 Subject: [Spambayes] Re: Eudora SSL Send Error Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A7B7@its-xchg4.massey.ac.nz> > When I don't use SSL the message > will send without problems. However, I think I'd prefer to > use SSL for messages I send. So if it is something that can be supported > in future versions of SpamBayes I think that would be helpful to Eudora > users. There is an open feature request for this. You can track it here: It was opened in June, though, so there's not much in the way of fast movement. I'll try and get to this once the current feature freeze is over (in a couple of weeks), so it should be in 1.1a1. =Tony Meyer From dbettinger at cxtec.com Tue Sep 9 08:10:39 2003 From: dbettinger at cxtec.com (Dean Bettinger) Date: Tue Sep 9 07:10:55 2003 Subject: [Spambayes] RE: Outlook Addin 008 binary - problem with new d ialogmanager Message-ID: Mark: I spotted the new version last night (EST) and installed it. It works well, but I wasn't experiencing the problem any longer since the config wizard had resolved a problem in my config. I note that the revised version was published on SourceForge LESS THAN 11 HOURS after I'd posted the first bug report! Too bad that people who SELL software and sell support can't match the Open Source community for responsiveness! Many thanks to you Mark and to all who actively participate in making SpamBayes better every day. It is already the best spam filter and it just keeps getting better! Dean -----Original Message----- From: Mark Hammond [mailto:mhammond@skippinet.com.au] Sent: Tuesday, September 09, 2003 2:41 AM To: 'Coe, Bob'; 'Dean Bettinger' Cc: spambayes@Python.org Subject: RE: [Spambayes] RE: Outlook Addin 008 binary - problem with new dialogmanager > No, there's definitely a problem. I see the same behavior you saw. Please try the new 008.1 version. Thanks, Mark. From ltrag at optonline.net Tue Sep 9 09:44:40 2003 From: ltrag at optonline.net (Lanny Trager) Date: Tue Sep 9 08:44:49 2003 Subject: [Spambayes] Classification not always added to the header: ThePlotThickens In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130332A7B1@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130332A7B1@its-xchg4.massey.ac.nz> Message-ID: <200309090844.40692.ltrag@optonline.net> On Tuesday 09 September 2003 04:29, Meyer, Tony wrote: > > Nothing at all in the headers, from SpamBayes that is: > > [...] > > > That's what has me stumped. There doesn't seem to be a clue. > > I'm going to shut down and save SpamBayes and then shutdown the > > machine itself around mid-night or so and start again in the morning. > > I can't see how a message can go through pop3proxy and not get any > X-Spambayes header added at all. This leads me to believe that the > message isn't going through pop3proxy at all. Is it possible that your > mail client is connecting directly to the POP3 server in some cases? > > =Tony Meyer Nope, I checked Kmail, all of my accounts (3) point to pop3proxy. There is no reference whatsoever to any pop or SMTP server in Kmail except to pop3proxy. This morning I started everything up and got 40 e-mails 1 in my in-box - it had the spambayes header 7 in my trash - 4 had spambayes headers - 3 did not (trash is where I filter all my spam to) 4 from another mailing list - none had spambayes headers 1 to another mailing list - did not have spambayes headers 1 to another list - it had the spambayes header 25 to spambayes list - all had the spambayes header While writing this e-mail 2 more spam came - one had the header and the other did not. However, both showed up on the review page. None of the other e-mails without the spambayes headers showed up on the review page. Also one of the e-mails that did have a spambayes header also did not show up on the review page. Results of which_database.py: Pickle is available. Dumbdbm is available. Dbhash is available. Bsddb[3] is available. Your storage hammie.db is a: dbhash bayescustomize.ini: [html_ui] display_to:True [pop3proxy] listen_ports:1115,1110 remote_servers:pop.munged.net,pop.munged.net [smtpproxy] listen_ports:1120 remote_servers:mail.munged.net [Tokenizer] mine_received_headers:True Lanny From tuller.neolaw at matavnet.hu Tue Sep 9 15:28:43 2003 From: tuller.neolaw at matavnet.hu (=?iso-8859-2?Q?dr._Ver=E9b_Bal=E1zs?=) Date: Tue Sep 9 09:20:54 2003 Subject: [Spambayes] Not-spam Message-ID: <001d01c376d6$02f07da0$0d00a8c0@medi3> Dear All, I am using the latest version of spambayes with W98 & MS Outlook 2000. I did the training and the necessary config. I could on my own and the plug-in works just fine so far. Having that said, I have 2-3 valid senders from whom I often receive valid and important messages, with regular (non-suspicious) header, subject etc.. Spambayes put all of them into my spam folder and after training it to consider these e-mails as valid, non-spams, the program continues to do the same. Could you give me a clue as how to proceed? Thanks in advance Best regards dr. Balazs Vereb From rafael.scholl at bbox.ch Tue Sep 9 16:40:58 2003 From: rafael.scholl at bbox.ch (Rafael Scholl) Date: Tue Sep 9 09:41:07 2003 Subject: [Spambayes] RE: imapfilter.py In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130332A5D3@its-xchg4.massey. ac.nz> Message-ID: <6.0.0.9.0.20030909151520.01bb8900@mail.bbox.ch> >shortly, I suppose (dammit). I posted the updated message.py in the >previous message; you should be able to replace yours with this and fix >that problem. Everything works beautifully now. (So far!) Thanks again for your assistance. I can't wait to see how well the system does once it's properly trained. I haven't quite figured out how to best integrate the filter with my Eudora. It seems easiest to run imapfilter.py constantly and instruct it to classify email once every ten minutes or so. On a purely cosmetic level, it would be nice to be able to put it in the windows systray -- it's in the taskbar now. Do you happen to know if that's doable with a python program? Rafael Scholl * Checked for virus by F-Secure Content Scanner 6.10b36 * From anthony at interlink.com.au Wed Sep 10 01:00:38 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Sep 9 10:00:56 2003 Subject: [Spambayes] binning 1.0 spams Message-ID: <200309091400.h89E0diJ023182@localhost.localdomain> Just a report from a happy user. After considering it for a number of months, I've now got SB tossing emails that score 1.0 into a folder that gets auto-cleansed. Stuff that scores between spam-cutoff and 1.0 gets checked and trained on, as do unsures. This has meant, for me, that 95% or more of the spam never bothers me at all ;) (I still review the <1.0 spam, just in case). It's almost like using email back before the plagues that Cantor & Siegel and Microsoft unleashed upon the world... Anthony From anthony at interlink.com.au Wed Sep 10 01:03:45 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Sep 9 10:04:09 2003 Subject: [Spambayes] Not-spam In-Reply-To: <001d01c376d6$02f07da0$0d00a8c0@medi3> Message-ID: <200309091403.h89E3jFX023250@localhost.localdomain> >>> "=?iso-8859-2?Q?dr._Ver=E9b_Bal=E1zs?=" wrote > Having that said, I have 2-3 valid senders from whom I often receive valid > and important messages, with regular (non-suspicious) header, subject etc.. > Spambayes put all of them into my spam folder and after training it to > consider these e-mails as valid, non-spams, the program continues to do the > same. You could put in outlook rules to catch these addresses before spambayes classifies them, or else train on a few more. What are the clues that spambayes is using to decide that these messages are spam? From anthony at interlink.com.au Wed Sep 10 01:06:25 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Sep 9 10:07:08 2003 Subject: [Spambayes] Classification not always added to the header: ThePlotThickens In-Reply-To: <200309090844.40692.ltrag@optonline.net> Message-ID: <200309091406.h89E6P2R023299@localhost.localdomain> >>> Lanny Trager wrote > Nope, I checked Kmail, all of my accounts (3) point to pop3proxy. There is no > reference whatsoever to any pop or SMTP server in Kmail except to pop3proxy. You might want to try running tcpdump to check if Kmail doesn't have some wacky cached version of your POP settings in some strange little file in .kde or something. In the past when I've played with Kmail, I ended up nuking every config file it touched to get it to forget some old settings. From wsy at merl.com Tue Sep 9 11:19:53 2003 From: wsy at merl.com (Bill Yerazunis) Date: Tue Sep 9 10:20:17 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: References: Message-ID: <200309091419.h89EJrI05406@localhost.localdomain> From: "Tim Peters" >>> E pur si moivre, dude. E pur si moivre. >> There more kinds of email users in heaven and earth than are >> dreamt of in your classifier, Bill. [Bill Yerazunis] > That's one thing I _like_ about this list. At least y'all are > moderately literate. :-) Perhaps, but I actually had no idea what epursimoivre meant . E pur si moivre --> "Nevertheless, it _does_ move". Apocryphally, what Galileo said after being tortured continuously by the Spanish Inquisition for four days and finally recanting his observational evidence that the Earth moved. > Well, on the grounds that the SpamAssassin corpus is a little less > biased, I re-ran the tests against the .css files that the SA test > corpus generates (using the TOE learning strategy). Accuracy on this > corpus is just over 98% for crm114, and barely 70% for me-the-human. > > The results for SA test corpus: > > Token Spam Nonspam >

143 144 >
380 289 >

67 119 > > So, it seems that "font" is somewhat spammy, and so is "br", > but is totally equivocal. > > Does this help? :) It helps Ryan's thesis that HTML isn't uncommon in ham. If the stats had been more like Token Spam Nonspam

143 10
380 5

67 0 References: <200309091406.h89E6P2R023299@localhost.localdomain> Message-ID: <200309091044.04400.ltrag@optonline.net> On Tuesday 09 September 2003 10:06, Anthony Baxter wrote: > >>> Lanny Trager wrote > > > > Nope, I checked Kmail, all of my accounts (3) point to pop3proxy. There > > is no reference whatsoever to any pop or SMTP server in Kmail except to > > pop3proxy. > > You might want to try running tcpdump to check if Kmail doesn't have some > wacky cached version of your POP settings in some strange little file in > .kde or something. In the past when I've played with Kmail, I ended up > nuking every config file it touched to get it to forget some old settings. I appreciate your suggestion. Unfortunately I am not as literate in TCPDUMP as I might be. Could you possibly tell me what command line options to use. Thanks Lanny From mscott at perspectivedata.com Tue Sep 9 11:53:44 2003 From: mscott at perspectivedata.com (Michele Scott) Date: Tue Sep 9 10:50:34 2003 Subject: [Spambayes] Unable to Enable Filtering Message-ID: <00c101c376e2$2b4fc380$850aa8c0@wonderwoman> I have recently downloaded and installed Spambayes Binary Version 0.7. I have followed the installation and troubleshooting guide. However, I am unable to enable the filtering. I have defined a Spam folder and trained messages. It seems to identify 29 good email and 52 spam. However, I am unable to delete as spam and recover from spam, prompts for me to enable filtering. The check box for me to enable this option remains disabled and says I have to define a folder to filter, I have, "Spam" folder in my Inbox. Also, does not automatically filter my messages, assuming this is due to the enabling of the filter!? Help? OS: Windows 2000 Outlook: 2000 Spambayes: Binary 0.7 Michele We all wish an hour had a hundred minutes -- did anyone ask the minutes about this ? -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 8111 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030909/838ebbb3/spambayes4.obj From anthony at interlink.com.au Wed Sep 10 02:10:44 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Sep 9 11:11:13 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <200309091419.h89EJrI05406@localhost.localdomain> Message-ID: <200309091510.h89FAiok023924@localhost.localdomain> >>> Bill Yerazunis wrote > How well does SpamBayes do on the SpamAssassin test corpus? That's the stuff available here: http://spamassassin.org/publiccorpus/ ? I'm not aware that anyone's run against it - perhaps someone's got some CPU cycles burning a hole in their pocket and would like to try? From anthony at interlink.com.au Wed Sep 10 02:14:22 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Sep 9 11:14:37 2003 Subject: [Spambayes] Unable to Enable Filtering In-Reply-To: <00c101c376e2$2b4fc380$850aa8c0@wonderwoman> Message-ID: <200309091514.h89FEM6C024004@localhost.localdomain> >>> "Michele Scott" wrote > I have recently downloaded and installed Spambayes Binary Version 0.7. You might like to try the 0.81 version released in the last day or so. I understand it has a nicer configuration interface. Anthony From rmalayter at bai.org Tue Sep 9 11:42:23 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Tue Sep 9 11:42:33 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D3F6@cliff.bai.org> From: Bill Yerazunis [mailto:wsy@merl.com] > Well, what happens in CRM114 is not that > the HTML causes confusion, it does get > factored in, but when you have a nearly > 1:1 ratio in the hits, it basically doesn't > make any difference to the end value. Bill, how does CRM-114 handle a typical References: <5.2.1.1.2.20030908074452.00b709d0@localhost> <200309082143.h88Lh8f03321@localhost.localdomain> Message-ID: <16221.62870.605485.649738@montanaro.dyndns.org> thetasig> I subscribe to a number of mailing lists. All of these are thetasig> ham. thetasig> Any suggestions on how to handle this? Bill> This is one of the cases where a whitelist is actually valuable. Bill> The others are (in order of importance) Bill> Your paramour Bill> Your lawyer Bill> Your boss Then set it up in your procmailrc file or Outlook rules or whatever so that those valuable addresses are dumped in the correct mailbox before SpamBayes sees them. We went around and around on this during the last week or two. SpamBayes does its job very well. If you want whitelists, your mail software probably supports them in some way. There's no need to add this functionality to SpamBayes. Skip From thetasig at comcast.net Tue Sep 9 09:47:14 2003 From: thetasig at comcast.net (theta sigma) Date: Tue Sep 9 11:47:18 2003 Subject: [Spambayes] Eudora Filters Message-ID: <5.2.1.1.2.20030909084204.00b72520@localhost> It seems that Eudora is not able to "see" the Spambayes headers and so the filtering process misses them and does not move the e-mails to an appropriate folder. Sample header: Reply-To: [sender removed-theta] X-Mailer: WhatCounts X-Spambayes-Classification: spam X-Spambayes-MailId: 1063122101 [body follows] I've set up 2 filters for "ham" and "spam" headers but they are not working. I've classified the filtering variously on and on but still no-go. Has anyone been able to get these filters to work in Eudora? Is it, perhaps, a Eudora bug? Thx. -=mark=- From wsy at merl.com Tue Sep 9 13:23:17 2003 From: wsy at merl.com (Bill Yerazunis) Date: Tue Sep 9 12:23:21 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2AF3D3F6@cliff.bai.org> (rmalayter@bai.org) References: <792DE28E91F6EA42B4663AE761C41C2AF3D3F6@cliff.bai.org> Message-ID: <200309091623.h89GNHa05669@localhost.localdomain> From: "Ryan Malayter" From: Bill Yerazunis [mailto:wsy@merl.com] > Well, what happens in CRM114 is not that > the HTML causes confusion, it does get > factored in, but when you have a nearly > 1:1 ratio in the hits, it basically doesn't > make any difference to the end value. Bill, how does CRM-114 handle a typical Afternoon. The problem that I'm having is after installation, the tool bar is no where to be found. It looks like the add-in is loading (if I'm reading the log correctly), but the tool bar is missing when I start Outlook. Any help will be appreciated. Windows 2000 SP3 Outlook XP Version of SpamBayes: version 008.1 Log entries: Registered: SpamBayes.OutlookAddin Registration complete. Lindbergh Hampshire, Jr. Global Server Management NCR Corp. Dayton, OH Work#937.445.6378 Fax#937.445.2261 Lindbergh.Hampshire@NCR.COM From tim.one at comcast.net Tue Sep 9 13:39:06 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 9 12:39:05 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <200309091419.h89EJrI05406@localhost.localdomain> Message-ID: >>>> E pur si moivre, dude. E pur si moivre. >>> There more kinds of email users in heaven and earth than are >>> dreamt of in your classifier, Bill. [Bill Yerazunis] >>> That's one thing I _like_ about this list. At least y'all are >>> moderately literate. :-) >> Perhaps, but I actually had no idea what epursimoivre meant . [Bill Yerazunis] > E pur si moivre --> "Nevertheless, it _does_ move". Apocryphally, > what Galileo said after being tortured continuously by the Spanish > Inquisition for four days and finally recanting his observational > evidence that the Earth moved. Ah! I think memory is mixing Italian with French here. s/moivre/muove/ and google goes from 0 hits to thousands. That's easy to remember because then "e pur si muove" is an anagram of "pursue movie" . >>> Well, on the grounds that the SpamAssassin corpus is a little >>> less biased, I re-ran the tests against the .css files that the >>> SA test corpus generates (using the TOE learning strategy). >>> Accuracy on this corpus is just over 98% for crm114, and barely >>> 70% for me-the-human. The results for SA test corpus: >>> >>> Token Spam Nonspam >>>

143 144 >>>
380 289 >>>

67 119 >>> >> >> >>> So, it seems that "font" is somewhat spammy, and so is "br", >>> but is totally equivocal. > >>> Does this help? :) [Tim] >> It helps Ryan's thesis that HTML isn't uncommon in ham. If the >> stats had been more like >> >> Token Spam Nonspam >>

143 10 >>
380 5 >>

67 0 >> > > >> which is much closer to my actual ham-spam breakdown wrt HTML, I >> believe CR114 would have had a very hard time classifying the ham >> correctly (note that there would also be literally dozens of other >> distinct unique-to-HTML strings in that ham too, all with high >> spam counts and low ham counts). *This* kind of distribution is >> what spambayes had lots of experience with, and is what caused us >> to throw away HTML decorations. [Bill] > Well, what happens in CRM114 is not that the HTML causes confusion, it > does get factored in, but when you have a nearly 1:1 ratio in the > hits, it basically doesn't make any difference to the end value. Sure. We keep talking past each other here, and I don't know why. spambayes doesn't strip HTML because it's afraid of HTML, it's specifically to avoid penalizing ham for the mere presence of HTML, for those people (like most of the spambayes developers) who have very little (but yet some) HTML ham. It's not the 1::1 corpora that drove the decision, it was the 100::1 corpora (which is in fact typical of my own email). > How well does SpamBayes do on the SpamAssassin test corpus? I haven't tried it, and I don't recall anyone else here trying it. It wasn't interesting to me because spambayes was initially developed to filter Mailman mailing lists, and later expanded to single-user classifiers. Those both have a kind of focus that aggregating many peoples' ham would lose, and finding out how badly that loses would be of only idle academic interest to me. It's of more interest to other people here, though, so I'll let them do the work . From m0davis at pacbell.net Tue Sep 9 14:00:40 2003 From: m0davis at pacbell.net (Martin Stone Davis) Date: Tue Sep 9 16:00:52 2003 Subject: [Spambayes] Re: Help with pop3 mail proxy Message-ID: <3F5E3168.30704@pacbell.net> Forget it. That field has absolutely nothing to do with my POP3 user account. I figured out my real problem was my mail server was acting strange. Seems to work now! Thanks and sorry for the stupid question! From edrubins at mindspring.com Tue Sep 9 17:35:44 2003 From: edrubins at mindspring.com (Ed Rubinsky) Date: Tue Sep 9 16:35:58 2003 Subject: [Spambayes] Eudora Filters In-Reply-To: Message-ID: <5.1.0.14.0.20030909162456.00abe600@localhost> > >It seems that Eudora is not able to "see" the Spambayes headers and so the > >fltering process misses them and does not move the e-mails to an > >appropriate folder. > > > Sample header: > > > Reply-To: [sender removed-theta] > > X-Mailer: WhatCounts > > X-Spambayes-Classification: spam > > X-Spambayes-MailId: 1063122101 > > > [body follows] > > > > I've set up 2 filters for "ham" and "spam" headers but they are not > > working. I've classified the filtering variously on and on > header> but still no-go. > > > Has anyone been able to get these filters to work in Eudora? Is it, > > perhaps, a Eudora bug? I have it filtering both unsure and spam with the following filters, unsure is a folder I created to hold, well messages spambayes wasn't sure of. The order of the filters may matter - make sure you spambayes filters come first. . Match Incoming Header X-Spambayes-Classification contains spam Make Status read Transfer To trash Match Incoming Header X-Spambayes-Classification unsure spam Transfer To unsure. Best, Ed From tim.one at comcast.net Tue Sep 9 18:29:21 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 9 17:29:21 2003 Subject: [Spambayes] New Icons for Outlook Plugin In-Reply-To: <008701c37678$ea1b0be0$641ba8c0@stormshadow> Message-ID: [Chris Pirillo] > I finally installed SpamBayes for Outlook, and... I wish I had done so > yesterday. ;) Anyway, I decided that the face icons (while cute), did not > quite fit my style of thinking: > > http://chris.pirillo.com/images/outlookplugin.gif Hmm. I clicked on the link before reading more, and I have to confess I had no idea what the icons were meant to represent. It's really better if they're obvious to everyone at first glance. It was fun watching Mark try to figure out the intuitive significance of the 8-ball, for example: he's Australian, and God only knows what "behind the 8-ball" sounds like to him. BTW, the frowny-face "delete as spam" icon we currently use is actually a photograph of Mark! This is a little known fact . From loopsouth at loopsouth.com Tue Sep 9 17:42:31 2003 From: loopsouth at loopsouth.com (LoopSouth) Date: Tue Sep 9 17:39:05 2003 Subject: [Spambayes] Upgrade issues... Message-ID: If I upgrade from version .7 to the latest version... (I am running windows BTW with outlook) will Spambayes keep settings from the previous version, or do I have to train Spambayes all over again? -Josh From T.A.Meyer at massey.ac.nz Wed Sep 10 12:36:20 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 19:36:45 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A95A@its-xchg4.massey.ac.nz> > >>> Bill Yerazunis wrote > > How well does SpamBayes do on the SpamAssassin test corpus? > > That's the stuff available here: > http://spamassassin.org/publiccorpus/ ? > > I'm not aware that anyone's run against it - perhaps someone's got some CPU > cycles burning a hole in their pocket and would like to try? I occasionally run against this. An example (with 'fresh from cvs' spambayes): filename: no_sean_sas ham:spam: 7580:7580 fp total: 44 fp %: 0.58 fn total: 16 fn %: 0.21 unsure t: 356 unsure %: 2.35 real cost: $527.20 best cost: $592.40 h mean: 3.40 h sdev: 14.19 s mean: 97.94 s sdev: 9.43 mean diff: 94.54 k: 4.00 (This was done a few weeks back, but I don't think that anything has changed that would effect it since). Note that I haven't gone through the corpora to check that there isn't any misclassified mail. =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 10 12:43:37 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 19:45:15 2003 Subject: [Spambayes] Eudora Filters Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A962@its-xchg4.massey.ac.nz> > It seems that Eudora is not able to "see" the Spambayes > headers and so the filtering process misses them and does not > move the e-mails to an appropriate folder. What version of Eudora is this? It works for me with (Windows) Eudora 5.2. > I've set up 2 filters for "ham" and "spam" headers but they are not > working. I've classified the filtering variously on > and on but still no-go. Try looking for the exact "X-Spambayes-Classification" header. This is an example rule from my filters.pce: """ rule SpamBayes-Spam transfer Junk Mail.mbx incoming manual header X-Spambayes-Classification verb contains value spam conjunction ignore header verb contains value """ (This was set up via the Eudora GUI, though). =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 10 12:46:01 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 19:46:19 2003 Subject: [Spambayes] Upgrade issues... Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A966@its-xchg4.massey.ac.nz> > If I upgrade from version .7 to the latest version... (I am > running windows BTW with outlook) will Spambayes keep > settings from the previous version, or do I have to train > Spambayes all over again? Your settings and database will be retained. =Tony Meyer From DavidG at fool.com Tue Sep 9 21:03:53 2003 From: DavidG at fool.com (David Gardner) Date: Tue Sep 9 20:04:19 2003 Subject: [Spambayes] Outlook 2002 misplaced registry lookup? Message-ID: <4F59D6584343A24C9C336F12B076CABEC8BBF3@rover.foolhq.com> Dev Team, I love this application, and thank the programmers who are working on it. I have tried it some on my laptop, where it has been devastatingly effective. I am now, as a non-techie, endeavoring to use this on my (newer) desktop home PC. My home PC runs Windows XP, and I have Outlook 2002. I am using the newest version of SpamBayes, running the 0081 Outlook Setup file. When I attempt to install the app, I get a surprising dialog box that states "Outlook appears to not be installed." It then asks me if I want to install anyway, which I do, but it's clear that my Outlook is just never seeing (or using) the SpamBayes plug-in. It's as if the SpamBayes plug-in program doesn't believe I have Outlook on my machine, but I assure you I'm using my own legit version of Outlook 2002, from the OEM, with my own product license number, via the PC I bought from Dell. Being the type who likes to try to figure out stuff for myself, I read the FAQs unsuccessfully and then decided to search Google, where I happened upon this page: http://mail.python.org/pipermail/spambayes-checkins/2003-July/001473.html It appears to my untrained eye that the code is looking for a registry key where mine is NOT. Further, check out this this (completely unrelated but still relevant) page: http://www.slipstick.com/dev/ol2002problems.htm ... so... the SpamBayes code is not looking in the right place for my Outlook registry key? The Outlook registry key is in fact under my 10.0 directory, when I do a RegEdit - which is NOT, I believe, where SpamBayes is looking for it. Hence, it doesn't see my Outlook so that it can plug itself in.... That's the present best guess I can make, though I'm mostly an ignorant dude about this stuff, so I could easily be misunderstanding something. And I'm quite sure that I haven't "discovered" a real SpamBayes "bug" about the Outlook 2002 version, since I know so many others would be using Outlook 2002 with SpamBayes successfully. But the upshot of it is that SpamBayes doesn't appear to be installing correctly on my desktop PC - doesn't show up anywhere as a Toolbar possibility, and I truly can't figure out what I should do next to get it working on my big machine as sweetly as it's working on my laptop. Again, I've read through whatever FAQs I can find, without finding an answer. I would love some help, here, from anyone who has the time. Best, David G. From john at comnet-tech.com Tue Sep 9 21:19:08 2003 From: john at comnet-tech.com (John A) Date: Tue Sep 9 20:19:03 2003 Subject: [Spambayes] RE: Spambayes Digest, Vol 61, Issue 31 In-Reply-To: Message-ID: unsubscribe -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of spambayes-request@python.org Sent: Tuesday, September 09, 2003 12:00 PM To: spambayes@python.org Subject: Spambayes Digest, Vol 61, Issue 31 Send Spambayes mailing list submissions to spambayes@python.org To subscribe or unsubscribe via the World Wide Web, visit http://mail.python.org/mailman/listinfo/spambayes or, via email, send a message with subject or body 'help' to spambayes-request@python.org You can reach the person managing the list at spambayes-owner@python.org When replying, please edit your Subject line so it is more specific than "Re: Contents of Spambayes digest..." Today's Topics: 1. Re: Classification not always added to the header: ThePlotThickens (Lanny Trager) 2. Unable to Enable Filtering (Michele Scott) 3. Re: how spambayes handles image-only spams (Anthony Baxter) 4. Re: Unable to Enable Filtering (Anthony Baxter) 5. RE: how spambayes handles image-only spams (Ryan Malayter) 6. Re: Mailing Lists (Skip Montanaro) 7. Eudora Filters (theta sigma) ---------------------------------------------------------------------- Message: 1 Date: Tue, 09 Sep 2003 10:44:04 -0400 From: Lanny Trager Subject: Re: [Spambayes] Classification not always added to the header: ThePlotThickens To: spambayes@python.org Message-ID: <200309091044.04400.ltrag@optonline.net> Content-Type: text/plain; charset=iso-8859-1 On Tuesday 09 September 2003 10:06, Anthony Baxter wrote: > >>> Lanny Trager wrote > > > > Nope, I checked Kmail, all of my accounts (3) point to pop3proxy. There > > is no reference whatsoever to any pop or SMTP server in Kmail except to > > pop3proxy. > > You might want to try running tcpdump to check if Kmail doesn't have some > wacky cached version of your POP settings in some strange little file in > .kde or something. In the past when I've played with Kmail, I ended up > nuking every config file it touched to get it to forget some old settings. I appreciate your suggestion. Unfortunately I am not as literate in TCPDUMP as I might be. Could you possibly tell me what command line options to use. Thanks Lanny ------------------------------ Message: 2 Date: Tue, 9 Sep 2003 10:53:44 -0400 From: "Michele Scott" Subject: [Spambayes] Unable to Enable Filtering To: Message-ID: <00c101c376e2$2b4fc380$850aa8c0@wonderwoman> Content-Type: text/plain; charset="iso-8859-1" I have recently downloaded and installed Spambayes Binary Version 0.7. I have followed the installation and troubleshooting guide. However, I am unable to enable the filtering. I have defined a Spam folder and trained messages. It seems to identify 29 good email and 52 spam. However, I am unable to delete as spam and recover from spam, prompts for me to enable filtering. The check box for me to enable this option remains disabled and says I have to define a folder to filter, I have, "Spam" folder in my Inbox. Also, does not automatically filter my messages, assuming this is due to the enabling of the filter!? Help? OS: Windows 2000 Outlook: 2000 Spambayes: Binary 0.7 Michele We all wish an hour had a hundred minutes -- did anyone ask the minutes about this ? -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 8111 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030909/838ebbb3/spa mbayes4-0001.obj ------------------------------ Message: 3 Date: Wed, 10 Sep 2003 01:10:44 +1000 From: Anthony Baxter Subject: Re: [Spambayes] how spambayes handles image-only spams To: Bill Yerazunis Cc: spambayes@python.org Message-ID: <200309091510.h89FAiok023924@localhost.localdomain> Content-Type: text/plain; charset="us-ascii" >>> Bill Yerazunis wrote > How well does SpamBayes do on the SpamAssassin test corpus? That's the stuff available here: http://spamassassin.org/publiccorpus/ ? I'm not aware that anyone's run against it - perhaps someone's got some CPU cycles burning a hole in their pocket and would like to try? ------------------------------ Message: 4 Date: Wed, 10 Sep 2003 01:14:22 +1000 From: Anthony Baxter Subject: Re: [Spambayes] Unable to Enable Filtering To: mscott@perspectivedata.com Cc: spambayes@python.org Message-ID: <200309091514.h89FEM6C024004@localhost.localdomain> Content-Type: text/plain; charset="us-ascii" >>> "Michele Scott" wrote > I have recently downloaded and installed Spambayes Binary Version 0.7. You might like to try the 0.81 version released in the last day or so. I understand it has a nicer configuration interface. Anthony ------------------------------ Message: 5 Date: Tue, 9 Sep 2003 10:42:23 -0500 From: "Ryan Malayter" Subject: RE: [Spambayes] how spambayes handles image-only spams To: Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D3F6@cliff.bai.org> Content-Type: text/plain; charset="us-ascii" From: Bill Yerazunis [mailto:wsy@merl.com] > Well, what happens in CRM114 is not that > the HTML causes confusion, it does get > factored in, but when you have a nearly > 1:1 ratio in the hits, it basically doesn't > make any difference to the end value. Bill, how does CRM-114 handle a typical Subject: Re: [Spambayes] Mailing Lists To: Bill Yerazunis Cc: spambayes@python.org Message-ID: <16221.62870.605485.649738@montanaro.dyndns.org> Content-Type: text/plain; charset=us-ascii thetasig> I subscribe to a number of mailing lists. All of these are thetasig> ham. thetasig> Any suggestions on how to handle this? Bill> This is one of the cases where a whitelist is actually valuable. Bill> The others are (in order of importance) Bill> Your paramour Bill> Your lawyer Bill> Your boss Then set it up in your procmailrc file or Outlook rules or whatever so that those valuable addresses are dumped in the correct mailbox before SpamBayes sees them. We went around and around on this during the last week or two. SpamBayes does its job very well. If you want whitelists, your mail software probably supports them in some way. There's no need to add this functionality to SpamBayes. Skip ------------------------------ Message: 7 Date: Tue, 09 Sep 2003 08:47:14 -0700 From: theta sigma Subject: [Spambayes] Eudora Filters To: spambayes@python.org Message-ID: <5.2.1.1.2.20030909084204.00b72520@localhost> Content-Type: text/plain; charset="us-ascii"; format=flowed It seems that Eudora is not able to "see" the Spambayes headers and so the filtering process misses them and does not move the e-mails to an appropriate folder. Sample header: Reply-To: [sender removed-theta] X-Mailer: WhatCounts X-Spambayes-Classification: spam X-Spambayes-MailId: 1063122101 [body follows] I've set up 2 filters for "ham" and "spam" headers but they are not working. I've classified the filtering variously on and on but still no-go. Has anyone been able to get these filters to work in Eudora? Is it, perhaps, a Eudora bug? Thx. -=mark=- ------------------------------ _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html End of Spambayes Digest, Vol 61, Issue 31 ***************************************** From mhammond at skippinet.com.au Wed Sep 10 11:56:30 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue Sep 9 20:56:33 2003 Subject: [Spambayes] Outlook 2002 misplaced registry lookup? In-Reply-To: <4F59D6584343A24C9C336F12B076CABEC8BBF3@rover.foolhq.com> Message-ID: <285501c37736$5fa56500$f502a8c0@eden> > When I attempt to install the app, I get a surprising dialog > box that states > "Outlook appears to not be installed." It then asks me if I > want to install > anyway, which I do, but it's clear that my Outlook is just > never seeing (or > using) the SpamBayes plug-in. It's as if the SpamBayes plug-in program > doesn't believe I have Outlook on my machine, but I assure > you I'm using my > own legit version of Outlook 2002, from the OEM, with my own > product license > number, via the PC I bought from Dell. > > Being the type who likes to try to figure out stuff for > myself, I read the > FAQs unsuccessfully and then decided to search Google, where > I happened upon > this page: > > http://mail.python.org/pipermail/spambayes-checkins/2003-July/ 001473.html > It appears to my untrained eye that the code is looking for a registry key > where mine is NOT. The installer program simply checks if the key 'Software\Microsoft\Office\Outlook' exists under HKEY_CURRENT_USER. I assume you didn't have this key (but probably will now, as we should have created it) The "addin registration" code involves 2 steps - one is installing a COM object (which I assume has worked) and the other is listing the addin in the "Software\Microsoft\Office\Outlook\Addins" part of the registry. You should find this key now exists, as SpamBayes created it - can you check it does? Does "Help->About->Disabled Items" list it? > Further, check out this this (completely unrelated but still relevant) page: Excellent detective work :) But unfortunately, that doens't appear relevant to this issue. > The Outlook registry key is in fact under my 10.0 directory, when I do a > RegEdit - which is NOT, I believe, where SpamBayes is looking for it. Hence, > it doesn't see my Outlook so that it can plug itself in.... My registry has "Software\Microsoft\Office\10.0\Outlook", but also "Software\Microsoft\Office\Outlook" with the addins (including a "Microsoft VBA Addin" that mysteriously appeared :)). Significantly, I also have a "Software\Microsoft\Office\Outlook\Outlook 10 Accounts" key, which implies Outlook 10 does store *some* stuff there. > But the upshot of it is that SpamBayes doesn't appear to be installing > correctly on my desktop PC - doesn't show up anywhere as a Toolbar > possibility, and I truly can't figure out what I should do next to get it > working on my big machine as sweetly as it's working on my laptop. I'm sure you won't be alone here though, so I'd love to sort it out too. Mark. From T.A.Meyer at massey.ac.nz Wed Sep 10 13:59:33 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 9 20:59:47 2003 Subject: [Spambayes] RE: Spambayes Digest, Vol 61, Issue 31 Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332A9F3@its-xchg4.massey.ac.nz> > unsubscribe Look at the bottom of the digest: > _______________________________________________ > Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html Follow the link at the top, and you'll see how to unsubscribe. =Tony Meyer From edrubins at mindspring.com Tue Sep 9 22:22:37 2003 From: edrubins at mindspring.com (Ed Rubinsky) Date: Tue Sep 9 21:22:43 2003 Subject: [Spambayes] Eudora Filters In-Reply-To: Message-ID: <5.1.0.14.0.20030909211849.035f1c88@localhost> > > Match Incoming Header X-Spambayes-Classification unsure spam Transfer > To unsure. ^^^^^^^^^^^^^^^ should read: Match Incoming Header X-Spambayes-Classification contains unsure Transfer To unsure ^^^^^^^^^^^^^^^^^^ Sorry for the confusion. Best, Ed From DavidG at fool.com Tue Sep 9 22:30:11 2003 From: DavidG at fool.com (David Gardner) Date: Tue Sep 9 21:30:21 2003 Subject: [Spambayes] Outlook 2002 misplaced registry lookup? Message-ID: <4F59D6584343A24C9C336F12B076CABEC8BBF5@rover.foolhq.com> Mark (et al.), Thank you for the prompt response, and here's one back. Unfortunately, unlike you, I lack the following Registry path: HKEY_CURRENT_USER\Software\Microsoft\Office\Outlook Alas, no such bird. My Outlook is only listed under "10.0," so if SpamBayes is trying to create it, it's not working. However, I rather suspect it's not even managing to create it, as it still looks to me as if the program isn't functioning properly from the very moment I click to install it, when it gives the error message mentioned before. Further, for the record, there is no SpamBayes plugin listed in the following path, either: Software\Microsoft\Office\10.0\Outlook\Addins Indeed, I actually have no Outlook add-ins, so there isn't even an "Addins" subdirectory on my machine. It looks to me, the total NON-techie (unfortunately), that if the install program looked to check for Outlook within "10.0" for the benefit of us poor Outlook 2002 users who have our registries structured that way, well, wouldn't the program then install and work? >>> Does "Help->About->Disabled Items" list it? Again, nothing doing. Nothing listed there. Thanks for the benefit of your attention, David P.P.S. We're a bit better-known in the States, but if you've ever heard of The Motley Fool (www.Fool.com), I'm one of the two co-founding brothers and I wanted you to know that our Foolish techies are fans of this project and your work -- hence, how I found this mailing list. Thanks. -----Original Message----- From: Mark Hammond [mailto:mhammond@skippinet.com.au] Sent: Tuesday, September 09, 2003 8:57 PM To: 'David Gardner'; spambayes@python.org Subject: RE: [Spambayes] Outlook 2002 misplaced registry lookup? > When I attempt to install the app, I get a surprising dialog > box that states > "Outlook appears to not be installed." It then asks me if I > want to install > anyway, which I do, but it's clear that my Outlook is just > never seeing (or > using) the SpamBayes plug-in. It's as if the SpamBayes plug-in program > doesn't believe I have Outlook on my machine, but I assure > you I'm using my > own legit version of Outlook 2002, from the OEM, with my own > product license > number, via the PC I bought from Dell. > > Being the type who likes to try to figure out stuff for > myself, I read the > FAQs unsuccessfully and then decided to search Google, where > I happened upon > this page: > > http://mail.python.org/pipermail/spambayes-checkins/2003-July/ 001473.html > It appears to my untrained eye that the code is looking for a registry key > where mine is NOT. The installer program simply checks if the key 'Software\Microsoft\Office\Outlook' exists under HKEY_CURRENT_USER. I assume you didn't have this key (but probably will now, as we should have created it) The "addin registration" code involves 2 steps - one is installing a COM object (which I assume has worked) and the other is listing the addin in the "Software\Microsoft\Office\Outlook\Addins" part of the registry. You should find this key now exists, as SpamBayes created it - can you check it does? Does "Help->About->Disabled Items" list it? > Further, check out this this (completely unrelated but still relevant) page: Excellent detective work :) But unfortunately, that doens't appear relevant to this issue. > The Outlook registry key is in fact under my 10.0 directory, when I do a > RegEdit - which is NOT, I believe, where SpamBayes is looking for it. Hence, > it doesn't see my Outlook so that it can plug itself in.... My registry has "Software\Microsoft\Office\10.0\Outlook", but also "Software\Microsoft\Office\Outlook" with the addins (including a "Microsoft VBA Addin" that mysteriously appeared :)). Significantly, I also have a "Software\Microsoft\Office\Outlook\Outlook 10 Accounts" key, which implies Outlook 10 does store *some* stuff there. > But the upshot of it is that SpamBayes doesn't appear to be installing > correctly on my desktop PC - doesn't show up anywhere as a Toolbar > possibility, and I truly can't figure out what I should do next to get it > working on my big machine as sweetly as it's working on my laptop. I'm sure you won't be alone here though, so I'd love to sort it out too. Mark. From tim.one at comcast.net Tue Sep 9 22:47:36 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 9 21:47:53 2003 Subject: [Spambayes] RE: [spambayes-bugs] Spambayes repeatedly classifies messagesfrommailing list as SPAM despite multiple (20+) recoveriesfromspam folder In-Reply-To: <000a01c37169$da6baba0$1fd2f10a@amr.corp.intel.com> Message-ID: [Brian Schwarz] >>> It's not a big issue, but I've noticed that my m-w.com "word >>> of the day" is consistently flagged by Spambayes as spam [...] >>> Here is an example of the mailing list messages that keep >>> getting mislabeled as spam. [Tony Meyer] >> Would you also be able to send a copy of the tokens (with >> scores/count) that this message produces? You can do this via the >> "Show Clues" command in the Outlook plug-in, or via the web >> interface, or with the "debug" header in hammiefilter. (It would >> also be great to know which application you are using). >> >> It's the clues that provide the clues ;), not the message itself. [Brian] > Sorry, I should have thought of that. I'm not really that surprised > that the message gets flagged as spam initially - I'm just surprised > that after a couple of weeks of "teaching" that this mailing list > message hasn't been "whitelisted." > > Spam Score: 0.833733 > > > word spamprob #ham #spam > '*H*' 0.0566883 - - > '*S*' 0.724154 - - > 'url:mydomain' 0.00493094 17079 0 Yikes! From this line I deduce you trained on about 378 *times* more ham messages than spam messages. spambayes works best if you train on an approximately equal number of each. Continuing to train on even more ham than spam probably isn't going to help you at all. So try balancing your training data (i.e., train on much less ham) and see what happens. If you don't want to try that, find your default_bayes_customize.ini file, and change the line experimental_ham_spam_imbalance_adjustment: True to experimental_ham_spam_imbalance_adjustment: False That should help your specific problem a lot, but may increase the false negative rate too (may give hammier scores to genuine spam messages). If you try it, let us know what happens. Nobody developing this code had such extreme training-set imbalance, and we really don't know what to do about it. Nothing we've tried so far works well for everyone in its presence (apart from users finding a way to balance their training data themselves). From tim.one at comcast.net Tue Sep 9 23:26:44 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 9 22:26:49 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <200309090640.h896evvB019793@localhost.localdomain> Message-ID: [Tim] >> spambayes was developed against many peoples' test corpora, >> although, as I said before, I don't think any of them had a >> significant quantity of HTML ham (and I don't think Bill's did >> either). [Anthony Baxter] > Not true - my mungo-testset of 30K or so items had a significant > amount of HTML - this is why I jumped up and down on the HTML stuff. Oh, now you want to be treated like a human too ? OK, you're right. > Tim's original python-list archive had almost no HTML ham. That's true. The only HTML ham it had was mailing-list administrivia requests ("subscribe", "unsubscribe") sent to a wrong address, and a few newbie questions. Drops in the ocean. >> In spambayes I'd be more inclined to write special code to identify >> the img-src-http dance, and synthesize a token for that. It's only >> one token, though, and all tokens carry the same weight here -- it >> may still not be enough to give "a typical" short message of this >> ilk a strong enough score to nail it. The only way to know is to >> try it. > Note also that synthesising a bunch of (highly correlated) clues to > attempt to fix the problem of the single URL spams often leads to > unexpected (bad) consequences with scoring other email. In general, > highly correlated clues are bad. Well, correlation actually appears to help us more often than it hurts us, and stripping HTML in the tokenizer was a hack to blind the classifier to the strongest source of harmful correlation I know of. We've added a few pieces of HTML evidence back since then, and it's helped. I don't know whether an img-source-http token would help or hurt (hence "the only way to know is to try it"), since it's very likely correlated with existing synthesized url:jpg and url:gif tokens. I get the latter in ham too, but mostly in (wanted) HTML marketing collateral of various kinds with *tons* of hammy clues -- they score so hammy now that one new contradictory token won't hurt them. But that's just me. > This is why it's necessary to test additions to make sure that in fixing > one problem you're not adding 4 others. Yup! > Someone should go through the list archives and work out the ratio of > attempted tokeniser tricks that made things worse to ones that actually > improved the situation. I'd guess that it's something like 4 rejected for > every one that went in... Whatever you came up with would be an underestimate! In the very early days, I tried new stuff 7 days a week whenever I was awake. Performance was much worse then and it was much easier to find strong improvements. The only evidence of those experiments anyone saw was about the ones that worked, since those are the ones that got checked in. Words about *some* others survived in the comments and in scattered email. I think I threw away about 9 changes then for each that got checked in. Goodness, I *still* personally insulted that folding case turned out to work as well as preserving it <0.5 wink>. From jmwilson at knology.net Wed Sep 10 03:31:47 2003 From: jmwilson at knology.net (jmwilson@knology.net) Date: Tue Sep 9 22:31:48 2003 Subject: [Spambayes] How to setup Spambayes for Domino server Message-ID: <20030910023144.8033.qmail@webmail2.knology.net> I am running the software on Redhat 8. The domino server is running SMTP. How do I set it up to scan the emails that come through. I have already got it installed. Could you please help? You can give me a call at 706-571-7000 Ext 7413 between 8am - 5pm est Or email me back at jwilson@charbroil.com Thanks, James Wilson From mjeaya069 at rogers.com Wed Sep 10 01:44:47 2003 From: mjeaya069 at rogers.com (Mark Jeays) Date: Wed Sep 10 00:42:18 2003 Subject: [Spambayes] Spam that bypasses spambayes Message-ID: I've been getting some messages recently that seem to fight bayesian filters directly and are getting through with ease (under 0.05). Any thoughts on spambayes performance on these? Is there a chance of catching them? Currently my database is at a mere 134 real email (this is not my primary account) and a disturbing 3674 spam. Exhibit A: *** hey what's up...it's been a while...i'm just on my way out but i wanted to let you know about this really cool product I saw on TechTv, I found a website that is selling it dirt cheap so you may want to grab one while you can, the link is below....gotta take off, later http://www.gadgitz.biz/promo.php?id=93778 *** Exhibit B: *** Hi, haven't heard from in awhile, what's up? I'm still at that company working away. Boring as always. I just got a heard about an awesome website that's saving me tons of money already on my medications. I'm getting most my stuff their now as it's the cheapeast I've found online, and it's fast. With the secure site and Live overnight tracking it's safe & you can't really go wrong. Check it out, may save you some extra dollars. Anyways, I'll put link below. Say Hi to everyone and take care. http://www.pharmacysale.biz/pharm/ *** Do I just need to wait a bit for "overnight" and "tracking" and "medications" to increase their spamminess number? I suppose so... Regards, Mark Jeays From anthony at interlink.com.au Wed Sep 10 15:39:48 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Wed Sep 10 00:42:45 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: Message-ID: <200309100439.h8A4dmu8030273@localhost.localdomain> >>> "Tim Peters" wrote > Well, correlation actually appears to help us more often than it hurts us, > and stripping HTML in the tokenizer was a hack to blind the classifier to > the strongest source of harmful correlation I know of. We also don't (by default) tokenise all headers, either, because of the harmful correlation of the bazillions of mailman headers Barry has seen fit to inflict on us <0.5 wink> > they score so hammy now that one new contradictory token > won't hurt them. But that's just me. You're hammy? I kinda figured as such. > > [failed tokeniser mods] > Whatever you came up with would be an underestimate! In the very early > days, I tried new stuff 7 days a week whenever I was awake. I did similar things. In hindsight I regret not recording the various things I tried - I'm pretty sure I had the same brilliant, obvious, and wrong ideas a couple of times. > Goodness, I *still* > personally insulted that folding case turned out to work as well as > preserving it <0.5 wink>. Well, you know the spammers did that deliberately to annoy the people working on spam filtering solutions. -- Anthony Baxter It's never too late to have a happy childhood. From anthony at interlink.com.au Wed Sep 10 15:50:54 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Wed Sep 10 00:51:13 2003 Subject: [Spambayes] Spam that bypasses spambayes In-Reply-To: Message-ID: <200309100450.h8A4osoU030416@localhost.localdomain> >>> "Mark Jeays" wrote > I've been getting some messages recently that seem to fight bayesian filters > directly and are getting through with ease (under 0.05). Any thoughts on > spambayes performance on these? Is there a chance of catching them? I did a grep through my spam folder and saw a couple of these sorts - (in particular, grepping for "what's up"). My installation scored them between 0.9 and 1.0, which suggests it's just a matter of training on a few of them. -- Anthony Baxter It's never too late to have a happy childhood. From cdman at coder.hu Wed Sep 10 07:59:31 2003 From: cdman at coder.hu (Balazs Attila Mihaly) Date: Wed Sep 10 00:59:38 2003 Subject: [Spambayes] Watch out for this Message-ID: Hello. Sorry for writing without registering to any mailing list or at sf.net, but I'm kind of lazy. Anyway, I've become kind of intereseted in text classification after reading some of the papers here. For me spam isn't a big issue yet (getting 2-5 spam mails daily can be supported), but still I hope you are progressing well. Maybe later on I'll install it too. Now the issue I'm writing about: there has been a lot of talk about the tokenizing part (should it include the html tags or not, etc.), but I would like to add one idea of my own: as you know in html pages characters can be written as #;, where represents the ASCII (or maybe UNICODE - I'm not sure) code of the character. Now, if you don't convert these characters back to their corresponding values, a spammer could use almost infinite variations of words (writing hel#;o or #ello or replacing multiple characters with their codes). So my suggestion would be to convert them back. PHP has already such a function (html_entity_decode), so you could take a look at their source code for help. -Cd-MaN P.S. Sorry if this issue has already been bought up. From harri.pesonen at wicom.com Wed Sep 10 10:30:04 2003 From: harri.pesonen at wicom.com (Harri Pesonen) Date: Wed Sep 10 02:30:10 2003 Subject: [Spambayes] Spam that bypasses spambayes Message-ID: I had an idea a couple of weeks ago, that all url tokens should have more weight than other tokens. The spammer just wants you to click on some url, so the other text is not so important. They could even put random words there, and they have. And maybe the url server address should be tokenized the same way as the address is tokenized in Received header. So the address below would yield url:biz url:gadgitz.biz url:www.gadgitz.biz Now it just does url:www url:gadgitz url:biz Maybe it should do both and have more weight that way. Also decode % encoding and find server names for ip addresses... :-) Harri -----Original Message----- From: "Mark Jeays" I've been getting some messages recently that seem to fight bayesian filters directly and are getting through with ease (under 0.05). Any thoughts on spambayes performance on these? Is there a chance of catching them? Currently my database is at a mere 134 real email (this is not my primary account) and a disturbing 3674 spam. Exhibit A: *** hey what's up...it's been a while...i'm just on my way out but i wanted to let you know about this really cool product I saw on TechTv, I found a website that is selling it dirt cheap so you may want to grab one while you can, the link is below....gotta take off, later http://www.gadgitz.biz/promo.php?id=93778 *** Exhibit B: *** Hi, haven't heard from in awhile, what's up? I'm still at that company working away. Boring as always. I just got a heard about an awesome website that's saving me tons of money already on my medications. I'm getting most my stuff their now as it's the cheapeast I've found online, and it's fast. With the secure site and Live overnight tracking it's safe & you can't really go wrong. Check it out, may save you some extra dollars. Anyways, I'll put link below. Say Hi to everyone and take care. http://www.pharmacysale.biz/pharm/ *** Do I just need to wait a bit for "overnight" and "tracking" and "medications" to increase their spamminess number? I suppose so... Regards, Mark Jeays From roger at site0.com Wed Sep 10 02:02:30 2003 From: roger at site0.com (Roger Issa) Date: Wed Sep 10 04:02:56 2003 Subject: [Spambayes] Saving Training for New OS Message-ID: <000501c37771$e27e3e80$6400a8c0@roger> I have been using spambayes for quite a while and have trained spam filtering quite well with junk mail and "good" mail. The problem is I plan on reinstalling my OS and wanted to know if there was a specific file that contains the training data. which I should copy/save over to the folder under the new OS. If so, where can I locate this data file? -Thank You From T.A.Meyer at massey.ac.nz Wed Sep 10 21:20:59 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 10 04:21:20 2003 Subject: [Spambayes] Saving Training for New OS Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332AB7E@its-xchg4.massey.ac.nz> > The problem is I plan on reinstalling my OS and wanted to > know if there was a specific file that contains the training > data. which I should copy/save over to the folder under the new OS. Please see FAQ 3.7: > If so, where can I locate this data file? The FAQ also answers this, but note if you have version 008[.1] of the plugin, you can also find the data directory via the 'advanced' tab in the main dialog. =Tony Meyer From awhig at yahoo.com Tue Sep 9 23:23:28 2003 From: awhig at yahoo.com (Richard Cook) Date: Wed Sep 10 09:59:14 2003 Subject: [Spambayes] Problem with SpamBayes Message-ID: <20030910052328.23231.qmail@web9204.mail.yahoo.com> Hi, I started using SpamBayes last week with the .7x version. I just upgraded to .81, but I'm still having a problem I experienced with .7x. Platform: Windows XP Program: Outlook 2000 I started using the plugin on my wife's profile on the machine and it worked great. I then tried to use it in my profile, and the plugin does not work at all. I try to train the SpamBayes and it fails to see any messages. It finishes the training in a split second and ends up saying it has 0 good message and 0 spam. Trying to get the training to see a message, I selected all my folder to train on, and managed to get 1 message to appear as good. I put the folders back to their proper settings, retrained and got the 0 good 0 spam result again... If I try and delete a message as Spam it always says, "No filterable mail items are selected". I don't know what to do. It still works great on my wife's profile though... Thanks for your help, Rich Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.1.2600 (Service Pack 1) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] *** SpamBayes is NOT enabled, so will not filter incoming mail. *** Current version is 0.81, latest is 0.81. GetNextPage with current 0 IDD_WIZARD_WELCOME GetNextPage with current 3 IDD_WIZARD_FOLDERS_TRAIN About to train with [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. GetNextPage with current 4 IDD_WIZARD_TRAIN GetNextPage with current 1 IDD_WIZARD_FOLDERS_WATCH GetNextPage with current 2 IDD_WIZARD_FOLDERS_REST Wizard Done! Saving wizard changes Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Ignoring OnCommand for IDC_DEL_SPAM_RS Ignoring OnCommand for IDC_DEL_SPAM_RS Ignoring OnCommand for IDC_DEL_SPAM_RS Ignoring OnCommand for IDC_DEL_SPAM_RS Ignoring OnCommand for IDC_DEL_SPAM_RS Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Ignoring OnCommand for IDC_START Ignoring OnCommand for IDC_START Ignoring OnCommand for IDC_START Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Ignoring OnCommand for IDC_START Ignoring OnCommand for IDC_START Ignoring OnCommand for IDC_START Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Dumping configuration to save: [Experimental] timer_interval: 1000 timer_only_receive_folders: True timer_start_delay: 0 [Filter] enabled: True filter_now: False save_spam_info: True spam_action: Moved spam_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000') spam_mark_as_read: False spam_threshold: 90.0 timer_enabled: True timer_interval: 1.0 timer_only_receive_folders: True timer_start_delay: 2.0 unsure_action: Moved unsure_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5DE2810000') unsure_mark_as_read: False unsure_threshold: 15.0 watch_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] watch_include_sub: False [Filter_Now] action_all: False folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000'), ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] include_sub: False only_unread: False only_unseen: False [General] data_directory: delete_as_spam_message_state: None field_score_name: Spam recover_from_spam_message_state: None verbose: 2 [Training] ham_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] ham_include_sub: False rebuild: True rescore: True spam_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] spam_include_sub: False train_manual_spam: True train_recovered_spam: True -- end of configuration -- Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Dumping configuration to save: [Experimental] timer_interval: 1000 timer_only_receive_folders: True timer_start_delay: 0 [Filter] enabled: True filter_now: False save_spam_info: True spam_action: Moved spam_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000') spam_mark_as_read: False spam_threshold: 90.0 timer_enabled: True timer_interval: 1.0 timer_only_receive_folders: True timer_start_delay: 2.0 unsure_action: Moved unsure_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5DE2810000') unsure_mark_as_read: False unsure_threshold: 15.0 watch_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] watch_include_sub: False [Filter_Now] action_all: True folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000'), ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] include_sub: False only_unread: False only_unseen: False [General] data_directory: delete_as_spam_message_state: None field_score_name: Spam recover_from_spam_message_state: None verbose: 2 [Training] ham_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] ham_include_sub: False rebuild: True rescore: True spam_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] spam_include_sub: False train_manual_spam: True train_recovered_spam: True -- end of configuration -- ERROR: 'No filterable mail items are selected' Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Dumping configuration to save: [Experimental] timer_interval: 1000 timer_only_receive_folders: True timer_start_delay: 0 [Filter] enabled: True filter_now: False save_spam_info: True spam_action: Moved spam_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000') spam_mark_as_read: False spam_threshold: 90.0 timer_enabled: True timer_interval: 1.0 timer_only_receive_folders: True timer_start_delay: 2.0 unsure_action: Moved unsure_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5DE2810000') unsure_mark_as_read: False unsure_threshold: 15.0 watch_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] watch_include_sub: False [Filter_Now] action_all: True folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000'), ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] include_sub: False only_unread: False only_unseen: False [General] data_directory: delete_as_spam_message_state: None field_score_name: Spam recover_from_spam_message_state: None verbose: 2 [Training] ham_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] ham_include_sub: False rebuild: True rescore: True spam_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] spam_include_sub: False train_manual_spam: True train_recovered_spam: True -- end of configuration -- GetNextPage with current 0 IDD_WIZARD_WELCOME GetNextPage with current 5 IDD_WIZARD_TRAINING_IS_IMPORTANT Wizard Done! Saving wizard changes Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Ignoring OnCommand for IDC_START Ignoring OnCommand for IDC_START Ignoring OnCommand for IDC_START Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Ignoring OnCommand for 65535 GetNextPage with current 0 IDD_WIZARD_WELCOME GetNextPage with current 3 IDD_WIZARD_FOLDERS_TRAIN About to train with [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. GetNextPage with current 4 IDD_WIZARD_TRAIN GetNextPage with current 1 IDD_WIZARD_FOLDERS_WATCH GetNextPage with current 2 IDD_WIZARD_FOLDERS_REST Wizard Done! Saving wizard changes Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Dumping configuration to save: [Experimental] timer_interval: 1000 timer_only_receive_folders: True timer_start_delay: 0 [Filter] enabled: True filter_now: False save_spam_info: True spam_action: Moved spam_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000') spam_mark_as_read: False spam_threshold: 90.0 timer_enabled: False timer_interval: 1.0 timer_only_receive_folders: True timer_start_delay: 2.0 unsure_action: Moved unsure_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5DE2810000') unsure_mark_as_read: False unsure_threshold: 15.0 watch_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] watch_include_sub: False [Filter_Now] action_all: False folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000'), ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] include_sub: False only_unread: False only_unseen: False [General] data_directory: delete_as_spam_message_state: None field_score_name: Spam recover_from_spam_message_state: None verbose: 2 [Training] ham_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] ham_include_sub: False rebuild: True rescore: True spam_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] spam_include_sub: False train_manual_spam: True train_recovered_spam: True -- end of configuration -- Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Dumping configuration to save: [Experimental] timer_interval: 1000 timer_only_receive_folders: True timer_start_delay: 0 [Filter] enabled: True filter_now: False save_spam_info: True spam_action: Moved spam_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000') spam_mark_as_read: False spam_threshold: 90.0 timer_enabled: False timer_interval: 1.0 timer_only_receive_folders: True timer_start_delay: 2.0 unsure_action: Moved unsure_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5DE2810000') unsure_mark_as_read: False unsure_threshold: 15.0 watch_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] watch_include_sub: False [Filter_Now] action_all: False folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000'), ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] include_sub: False only_unread: False only_unseen: False [General] data_directory: delete_as_spam_message_state: None field_score_name: Spam recover_from_spam_message_state: None verbose: 2 [Training] ham_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] ham_include_sub: False rebuild: True rescore: True spam_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] spam_include_sub: False train_manual_spam: True train_recovered_spam: True -- end of configuration -- Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_bayes_database.db -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_message_database.db Saved databases in 1.99774ms Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.2754ms Ignoring OnCommand for IDC_START Ignoring OnCommand for IDC_START Ignoring OnCommand for IDC_START Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Dumping configuration to save: [Experimental] timer_interval: 1000 timer_only_receive_folders: True timer_start_delay: 0 [Filter] enabled: True filter_now: False save_spam_info: True spam_action: Moved spam_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000') spam_mark_as_read: False spam_threshold: 90.0 timer_enabled: False timer_interval: 1.0 timer_only_receive_folders: True timer_start_delay: 2.0 unsure_action: Moved unsure_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5DE2810000') unsure_mark_as_read: False unsure_threshold: 15.0 watch_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] watch_include_sub: False [Filter_Now] action_all: False folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000'), ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] include_sub: False only_unread: False only_unseen: False [General] data_directory: delete_as_spam_message_state: None field_score_name: Spam recover_from_spam_message_state: None verbose: 10 [Training] ham_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] ham_include_sub: False rebuild: True rescore: True spam_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] spam_include_sub: False train_manual_spam: True train_recovered_spam: True -- end of configuration -- OnSelectionChange ERROR: 'No filterable mail items are selected' OnActivate OnActivate OnActivate OnActivate Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Dumping configuration to save: [Experimental] timer_interval: 1000 timer_only_receive_folders: True timer_start_delay: 0 [Filter] enabled: True filter_now: False save_spam_info: True spam_action: Moved spam_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000') spam_mark_as_read: False spam_threshold: 90.0 timer_enabled: False timer_interval: 1.0 timer_only_receive_folders: True timer_start_delay: 2.0 unsure_action: Moved unsure_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5DE2810000') unsure_mark_as_read: False unsure_threshold: 15.0 watch_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] watch_include_sub: False [Filter_Now] action_all: False folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000'), ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] include_sub: False only_unread: False only_unseen: False [General] data_directory: delete_as_spam_message_state: None field_score_name: Spam recover_from_spam_message_state: None verbose: 10 [Training] ham_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] ham_include_sub: False rebuild: True rescore: True spam_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] spam_include_sub: False train_manual_spam: True train_recovered_spam: True -- end of configuration -- OnActivate OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnSelectionChange OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnSelectionChange OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnSelectionChange OnSelectionChange OnSelectionChange OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnSelectionChange OnSelectionChange OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnSelectionChange OnSelectionChange OnSelectionChange OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnSelectionChange OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnSelectionChange Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_bayes_database.db -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_message_database.db Saved databases in 1.82817ms Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.49554ms Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_bayes_database.db -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_message_database.db Saved databases in 1.91253ms Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.61874ms Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_bayes_database.db -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_message_database.db Saved databases in 1.90164ms Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.21031ms Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_bayes_database.db -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_message_database.db Saved databases in 1.86364ms Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.27401ms Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_bayes_database.db -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_message_database.db Saved databases in 2.10166ms Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.51594ms Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_bayes_database.db -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_message_database.db Saved databases in 1.77704ms Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.25752ms Checked 0 in folder Calendar - 0 new entries found. Checked 0 in folder Contacts - 0 new entries found. Checked 0 in folder Drafts - 0 new entries found. Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Possible Spam - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Checked 0 in folder Journal - 0 new entries found. Checked 0 in folder Notes - 0 new entries found. Checked 0 in folder Tasks - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_bayes_database.db -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_message_database.db Saved databases in 1.64518ms Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.43241ms Checked 0 in folder Calendar - 0 new entries found. Checked 0 in folder Contacts - 0 new entries found. Checked 0 in folder Drafts - 0 new entries found. Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Possible Spam - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Checked 0 in folder Journal - 0 new entries found. Checked 0 in folder Notes - 0 new entries found. Checked 0 in folder Tasks - 0 new entries found. Checked 0 in folder Top of Personal Folders - 0 new entries found. Checked 1 in folder Deleted Items - 1 new entries found. Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Possible Spam - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Checked 0 in folder Outbox - 0 new entries found. Checked 0 in folder Sent Items - 0 new entries found. Checked 0 in folder Calendar - 0 new entries found. Checked 0 in folder Contacts - 0 new entries found. Checked 0 in folder Journal - 0 new entries found. Checked 0 in folder Notes - 0 new entries found. Checked 0 in folder Tasks - 0 new entries found. Checked 0 in folder Drafts - 0 new entries found. Checked 0 in folder Calendar - 0 new entries found. Checked 0 in folder Contacts - 0 new entries found. Checked 0 in folder Drafts - 0 new entries found. Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Possible Spam - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Checked 0 in folder Journal - 0 new entries found. Checked 0 in folder Notes - 0 new entries found. Checked 0 in folder Tasks - 0 new entries found. Saving bayes database with 1 spam and 0 good messages -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_bayes_database.db -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_message_database.db Saved databases in 2.32041ms Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 1 spam and 0 good messages Loaded databases in 4.37151ms Checked 0 in folder Inbox - 0 new entries found. Checked 0 in folder Spam - 0 new entries found. Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_bayes_database.db -> C:\Documents and Settings\user\Application Data\SpamBayes\$sbtemp$default_message_database.db Saved databases in 2.97217ms Loaded bayes database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\user\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.50392ms OnActivate Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Dumping configuration to save: [Experimental] timer_interval: 1000 timer_only_receive_folders: True timer_start_delay: 0 [Filter] enabled: True filter_now: False save_spam_info: True spam_action: Moved spam_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000') spam_mark_as_read: False spam_threshold: 90.0 timer_enabled: False timer_interval: 1.0 timer_only_receive_folders: True timer_start_delay: 2.0 unsure_action: Moved unsure_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5DE2810000') unsure_mark_as_read: False unsure_threshold: 15.0 watch_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] watch_include_sub: False [Filter_Now] action_all: False folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000'), ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] include_sub: True only_unread: False only_unseen: False [General] data_directory: delete_as_spam_message_state: None field_score_name: Spam recover_from_spam_message_state: None verbose: 10 [Training] ham_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] ham_include_sub: False rebuild: True rescore: True spam_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] spam_include_sub: True train_manual_spam: True train_recovered_spam: True -- end of configuration -- OnActivate OnActivate OnActivate OnSelectionChange OnActivate OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnSelectionChange ERROR: 'No filterable mail items are selected' OnActivate OnSelectionChange ERROR: 'No filterable mail items are selected' OnActivate OnBeforeFolderSwitch OnViewSwitch OnFolderSwitch OnSelectionChange OnSelectionChange ERROR: 'No filterable mail items are selected' OnActivate OnActivate Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Dumping configuration to save: [Experimental] timer_interval: 1000 timer_only_receive_folders: True timer_start_delay: 0 [Filter] enabled: True filter_now: False save_spam_info: True spam_action: Moved spam_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000') spam_mark_as_read: False spam_threshold: 90.0 timer_enabled: False timer_interval: 1.0 timer_only_receive_folders: True timer_start_delay: 2.0 unsure_action: Moved unsure_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5DE2810000') unsure_mark_as_read: False unsure_threshold: 15.0 watch_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] watch_include_sub: False [Filter_Now] action_all: False folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000'), ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] include_sub: True only_unread: False only_unseen: False [General] data_directory: delete_as_spam_message_state: None field_score_name: Spam recover_from_spam_message_state: None verbose: 10 [Training] ham_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] ham_include_sub: False rebuild: True rescore: True spam_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] spam_include_sub: True train_manual_spam: True train_recovered_spam: True -- end of configuration -- OnActivate OnActivate Saving configuration -> C:\Documents and Settings\user\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Dumping configuration to save: [Experimental] timer_interval: 1000 timer_only_receive_folders: True timer_start_delay: 0 [Filter] enabled: True filter_now: False save_spam_info: True spam_action: Moved spam_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000') spam_mark_as_read: False spam_threshold: 90.0 timer_enabled: False timer_interval: 1.0 timer_only_receive_folders: True timer_start_delay: 2.0 unsure_action: Moved unsure_folder_id: ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5DE2810000') unsure_mark_as_read: False unsure_threshold: 15.0 watch_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] watch_include_sub: False [Filter_Now] action_all: False folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000'), ('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] include_sub: True only_unread: False only_unseen: False [General] data_directory: delete_as_spam_message_state: None field_score_name: Spam recover_from_spam_message_state: None verbose: 10 [Training] ham_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D82800000')] ham_include_sub: False rebuild: True rescore: True spam_folder_ids: [('0000000038A1BB1005E5101AA1BB08002B2A56C200005053545052582E444C4C00000000000000004E495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E642053657474696E67735C757365725C4C6F63616C2053657474696E67735C4170706C69636174696F6E20446174615C4D6963726F736F66745C4F75746C6F6F6B5C6F75746C6F6F6B2E70737400', '0000000089627CAEFF57214EA85495C9002CCC5D02820000')] spam_include_sub: True train_manual_spam: True train_recovered_spam: True -- end of configuration -- ERROR: 'No filterable mail items are selected' OnActivate From AAnand at mathematica-mpr.com Wed Sep 10 11:25:17 2003 From: AAnand at mathematica-mpr.com (Anu Anand) Date: Wed Sep 10 10:25:33 2003 Subject: [Spambayes] Question On Installtion Message-ID: <897E2332A97AD311AEBB00508B116D5418C350A5@mpr1> Hi I am looking for various steps to atke before installing spambayes on a 98 machine, the reason I want steps as I will know what files get installed where so it's easy to troubleshoot . As right know I have to test it before I start deploying companywide. From ruevec at fivestarcu.com Wed Sep 10 10:58:18 2003 From: ruevec at fivestarcu.com (Rueve, Christina) Date: Wed Sep 10 10:58:43 2003 Subject: [Spambayes] error Message-ID: I am getting the following error during installation: Unable to register the DLL/OCX: DllRegister Server Failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation. I click on Retry and nothing happened. I then clicked on igore. The version is binary version 0.6. PC is XP with Office XP. Please advise. Chris Rueve LAN/Client Administrator Five Star Credit Union ruevec@fivestarcu.com CONFIDENTIALITY NOTICE: The material in this electronic transmission is either private, confidential, leagally privileged or constitutes work product and is intended only for the use of the individual (s) named above. If you are not the intended recipient, be advised that unauthorized use, disclosure, copying distribution, or the taking of any action in reliance on this information is strictly prohibited. If you have received this transmission in error, please immediately notify us by telephone or return electronic transmission. From skip at pobox.com Wed Sep 10 11:11:50 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 10 11:12:00 2003 Subject: [Spambayes] How to setup Spambayes for Domino server In-Reply-To: <20030910023144.8033.qmail@webmail2.knology.net> References: <20030910023144.8033.qmail@webmail2.knology.net> Message-ID: <16223.16182.793043.684012@montanaro.dyndns.org> James> I am running the software on Redhat 8. The domino server is James> running SMTP. How do I set it up to scan the emails that come James> through. I have already got it installed. Could you please help? You didn't say anything about the mail reader you use. SpamBayes works best at the client where each user can train on their own mail. That said, I'm not sure anyone has investigated integrating it with a Domino server, though others have done so for other SMTP servers. Skip From skip at pobox.com Wed Sep 10 11:18:43 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 10 11:18:54 2003 Subject: [Spambayes] Watch out for this In-Reply-To: References: Message-ID: <16223.16595.885303.645498@montanaro.dyndns.org> Balazs> would like to add one idea of my own: as you know in html pages Balazs> characters can be written as #;, where Balazs> represents the ASCII (or maybe UNICODE - I'm not sure) code of Balazs> the character. Now, if you don't convert these characters back Balazs> to their corresponding values, a spammer could use almost Balazs> infinite variations of words (writing hel#;o or Balazs> #ello or replacing multiple Balazs> characters with their codes). So my suggestion would be to Balazs> convert them back. Good suggestion. I'm not sure if the tokenizer does this already, but a quick grep for '&#[0-9];' through my current training database (about 3 million lines) suggests this is still fairly infrequently used. I only found about 2100 lines (around 0.07%) of the lines contained a numeric entity. If/when the spammers start using such techniques and they turn out to cause problems for the classifier, it should be fairly easy to extend the tokenizer to make the necessary substitutions. Skip From gjewell at cnnxn.com Wed Sep 10 10:19:35 2003 From: gjewell at cnnxn.com (Greg Jewell) Date: Wed Sep 10 11:19:39 2003 Subject: [Spambayes] Unfilterable email using Outlook plugin Message-ID: <741C3B0BAF7B1F4F8FCC50AC4766CBF00500F9@phoenix.ossconnexn.com> Hi All, I've been using Spambayes for a few weeks now and am extremely happy with it. Congratulations and thanks to all of the developers for working on this project! I encountered a problem this morning, though, and I'm not certain what the issue is. I received a spam email that is disguised as if it's a message returned as undeliverable. Here's the subject of the mail: "Undeliverable: Read: gjeve get Free Printer Cartridges and more". For some reason, Spambayes says that this message is unfilterable. It won't give it a spam probability value, and when I click "Delete as Spam" an error message pops up saying that No filterable mail items have been selected. Has anybody else encountered this before? Thanks, Greg Jewell From lklawson at heapy.com Wed Sep 10 12:34:00 2003 From: lklawson at heapy.com (Kirk Lawson) Date: Wed Sep 10 11:35:58 2003 Subject: [Spambayes] Outlook 2000 forms problem Message-ID: <508EFB3001053100@c2smtp.heapy.com> I've installed the Outlook pluging on a 6 clients with good success. However two clients frequently get an error since installation: "Custom form could not be opened. Outlook will use an Outlook form instead." Can't seem to find it in the FAQ or the list archives. Kirk Lawson, Peace favor your sword --- "In these modern times, many men are wounded for not having weapons or knowledge of their use." -Achille Marozzo, 1536 From skip at pobox.com Wed Sep 10 11:49:52 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 10 11:50:01 2003 Subject: [Spambayes] Watch out for this In-Reply-To: <16223.16595.885303.645498@montanaro.dyndns.org> References: <16223.16595.885303.645498@montanaro.dyndns.org> Message-ID: <16223.18464.104287.924773@montanaro.dyndns.org> Skip> Good suggestion. I'm not sure if the tokenizer does this already, Skip> but a quick grep for '&#[0-9];' through my current training Skip> database (about 3 million lines) suggests this is still fairly Skip> infrequently used. Before anyone chides me for the incorrect regex, I actually grepped for '&#[0-9]+;'. The results were correct. I just mistyped the regex in my message. S From anthony at interlink.com.au Thu Sep 11 03:22:29 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Wed Sep 10 12:23:00 2003 Subject: [Spambayes] error In-Reply-To: Message-ID: <200309101622.h8AGMTmc003320@localhost.localdomain> >>> "Rueve, Christina" wrote > I am getting the following error during installation: Unable to > register the DLL/OCX: DllRegister Server Failed; code 0x00000000. Click > Retry to try again, Ignore to proceed anyway (not recommended), or Abort > to cancel installation. I click on Retry and nothing happened. I then > clicked on igore. The version is binary version 0.6. PC is XP with > Office XP. Please advise. Please try again with the 0.81 version on the website. Anthony From anthony at interlink.com.au Thu Sep 11 03:25:32 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Wed Sep 10 12:26:07 2003 Subject: [Spambayes] Watch out for this In-Reply-To: <16223.16595.885303.645498@montanaro.dyndns.org> Message-ID: <200309101625.h8AGPWUA003410@localhost.localdomain> >>> Skip Montanaro wrote > Good suggestion. I'm not sure if the tokenizer does this already, but a > quick grep for '&#[0-9];' through my current training database (about 3 > million lines) suggests this is still fairly infrequently used. I only > found about 2100 lines (around 0.07%) of the lines contained a numeric > entity. If/when the spammers start using such techniques and they turn out > to cause problems for the classifier, it should be fairly easy to extend the > tokenizer to make the necessary substitutions. Maybe we should have a file somewhere of "yet to be tested" tokeniser ideas? And update it with a comment when we find what does or doesn't work? (Ref the discussion yesterday about tokenising tricks tried and abandoned...) From anthony at interlink.com.au Thu Sep 11 03:26:51 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Wed Sep 10 12:27:19 2003 Subject: [Spambayes] Unfilterable email using Outlook plugin In-Reply-To: <741C3B0BAF7B1F4F8FCC50AC4766CBF00500F9@phoenix.ossconnexn.com> Message-ID: <200309101626.h8AGQpZi003448@localhost.localdomain> >>> "Greg Jewell" wrote > I encountered a problem this morning, though, and I'm not certain what > the issue is. I received a spam email that is disguised as if it's a > message returne d as undeliverable. Here's the subject of the mail: > "Undeliverable: Read: gjeve get Free Printer Cartridges and more". > For some reason, Spambayes says that this message is unfilterable. It > won't give it a spam probability value, and when I click "Delete as > Spam" an error message pops up saying that No filterable mail items > have been selected. > > Has anybody else encountered this before? Hm. Is it possible that the message's MIME structure is totally messed up, and the tokeniser's bailing out? It's far too long since I looked at that bit of the code... Tony? Anthony From JRedmond at ymcastlouis.org Wed Sep 10 12:32:03 2003 From: JRedmond at ymcastlouis.org (JRedmond@ymcastlouis.org) Date: Wed Sep 10 12:34:15 2003 Subject: [Spambayes] How to setup Spambayes for Domino server Message-ID: Check out http://www.openntf.org, the Domino/Notes open-source community. There's talk of adding Bayesian filtering to their OpenNTFMail template and pushing the workload off onto the client side, and Spambayes will probably wind up getting integrated somehow. (They're in the process of relocating their servers today, though, so you'll have to check it out tomorrow.) In the meantime, I'd suggest you resist the urge to integrate the Domino server's SMTP and Router processes and Bayesian filtering - speaking from experience, that approach can only bring you pain. Instead, use Spambayes as a proxy. ************************************ James Redmond, Domino Administrator YMCA of Greater St. Louis jredmond@ymcastlouis.org From rmalayter at bai.org Wed Sep 10 13:18:08 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Sep 10 13:18:13 2003 Subject: [Spambayes] Watch out for this Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D3FB@cliff.bai.org> From: Anthony Baxter [mailto:anthony@interlink.com.au] > Maybe we should have a file somewhere of > "yet to be tested" tokeniser ideas? And > update it with a comment when we find what > does or doesn't work? This sounds like a good idea to me, and would prevent a lot of unnecessary duplication of effort. Perhaps in the RFE section of the SpamBayes CVS? We could preface the subject of all tokenizing entries with a [tokenizer] tag or something. From m0davis at pacbell.net Wed Sep 10 11:41:12 2003 From: m0davis at pacbell.net (Martin Davis) Date: Wed Sep 10 13:41:18 2003 Subject: [Spambayes] Error in classifier.py? Message-ID: <20030910174112.12074.qmail@web80401.mail.yahoo.com> (Forgive me if this message has been sent more than once.) I finally thought I had Spambayes' pop3proxy working when Istarted getting the following error (see below) whenever I try toreceieve an email message. I am running Mozilla 1.5b on Windows XPPro. Note that is was working correctly for a short while. I don'tknow what happened to cause the error. I do notice that Spambayes WebInterface reports "Totalemails trained: Spam: 5 Ham: 92". However, I know that recently there were as many as 10spam messages and 100 ham messages. Please help. Thanks. Error dump follows: ------------------------------------------------------------------------ SpamBayes POP3 Proxy Beta2, version 0.2 (September 2003), using SpamBayes POP3 Proxy Web Interface Alpha3, version 0.03 and engine SpamBayes Beta2, version 0.2 (July 2003). Loading database... SMTP Listener on port 25 is proxyingsmtp.pacbell.yahoo.com: 25 Listener on port 110 is proxying pop.pacbell.yahoo.com:110 User interface url is http://localhost:8880/ Traceback (most recent call last): File "C:\Program Files\Python23\Scripts\pop3proxy.py", line 437, inonRetr evidence=True) File"C:\PROGRA~1\Python23\Lib\site-packages\spambayes\classifier.py", line22 3, in chi2_spamprob clues = self._getclues(wordstream) File"C:\PROGRA~1\Python23\Lib\site-packages\spambayes\classifier.py", line46 0, in _getclues prob = self.probability(record) File"C:\PROGRA~1\Python23\Lib\site-packages\spambayes\classifier.py", line31 0, in probability assert spamcount <= nspam AssertionError From skip at pobox.com Wed Sep 10 14:40:30 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 10 14:40:40 2003 Subject: [Spambayes] Watch out for this In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2AF3D3FB@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2AF3D3FB@cliff.bai.org> Message-ID: <16223.28702.409585.775186@montanaro.dyndns.org> Ryan> From: Anthony Baxter [mailto:anthony@interlink.com.au] >> Maybe we should have a file somewhere of "yet to be tested" tokeniser >> ideas? And update it with a comment when we find what does or doesn't >> work? +1 Ryan> This sounds like a good idea to me, and would prevent a lot of Ryan> unnecessary duplication of effort. Or at least a lot of unnecessary duplication of suggestions. ;-) Any entries in that file which are shot down should probably be explained. In fact, perhaps a quick summary of all the stuff the tokenizer does would be useful. Why make the spammers read our source code? ;-) Skip From skip at pobox.com Wed Sep 10 14:51:33 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 10 14:51:44 2003 Subject: [Spambayes] Error in classifier.py? In-Reply-To: <20030910174112.12074.qmail@web80401.mail.yahoo.com> References: <20030910174112.12074.qmail@web80401.mail.yahoo.com> Message-ID: <16223.29365.436886.368204@montanaro.dyndns.org> Martin> I finally thought I had Spambayes' pop3proxy working when Martin> Istarted getting the following error ... Martin> File"C:\PROGRA~1\Python23\Lib\site-packages\spambayes\classifier.py", line310, in probability Martin> assert spamcount <= nspam Martin> AssertionError This is a sure sign of a corrupted database file. If you execute this Python code: import whichdb whichdb.whichdb(r"c:\path\to\your\database\file") what does it report? I suggest you rename it (in case someone wants to look at it) and restart your training. Since you're using pop3proxy I suggest you use the pickle format. To do this, add these lines to your ini file: [Storage] persistent_use_database: False Make sure you move your corrupt database out of the way first. Skip From TheAceMan at bargainace.com Wed Sep 10 16:00:15 2003 From: TheAceMan at bargainace.com (TheAceMan) Date: Wed Sep 10 15:58:12 2003 Subject: [Spambayes] Rule Wizard Message-ID: <000001c377d6$2865e920$6401a8c0@theaceman> I am currently running Winsows XP Pro and MS Outlook 2002. I recently added SpamBayes to outlook. But it seems one of my rules is overriding spambaye. When I get a new message it is moved to the spam folder then moved to another folder because of rules I had previously. However I can not open the rule wizard any more either. I'm unsure why it won't open. Outlook runs normally and everything seems to work yet I can not open the rule wizard, I can create new rules but right clicking on a message but can't see old rules by selecting tools | rule wizard. I can try reinstalling Office or Outlook or XP but would like some advice before I do that. Thanks in advance, Kevin From tom.boland at cpsinc.com Wed Sep 10 14:16:36 2003 From: tom.boland at cpsinc.com (Tom Boland) Date: Wed Sep 10 16:16:51 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook Message-ID: <516DF10AD075D611853F00B0D03DF0E69C672D@CPSLA1> I know you say it's not needed *but it is*!! This is the only thing stopping me from implementing this product (and making donations) over anything else. I need to make sure that email from certain people (my boss, his boss, all local domain mail ,etc.) gets through 100% of the time. Even after several months of training, some mail from these people gets dumped into Quarantine: Junk. I even saved 200 messages from one person, trained it as good mail, and I still get mail from that person (my CEO, no less) sent to the Quarantine. I know, I know that this goes against your belief in the One True God of Bayesian filtering, but in the real world, we REALLY need at least a white list and preferably a blacklist, too. Thanks for an otherwise fantastic program. Keep up the good work. Tom Boland IT Manager Commercial Programming Systems From skip at pobox.com Wed Sep 10 16:22:29 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 10 16:22:44 2003 Subject: [Spambayes] How to setup Spambayes for Domino server In-Reply-To: <20030910194532.8526.qmail@webmail1.knology.net> References: <20030910194532.8526.qmail@webmail1.knology.net> Message-ID: <16223.34821.348988.769104@montanaro.dyndns.org> (make sure you reply to the entire list so you get the benefit of everyone's eyeballs and brains...) James> I am running the software on Redhat 8. The domino server is James> running SMTP. How do I set it up to scan the emails that come James> through. I have already got it installed. Could you please James> help? Skip> You didn't say anything about the mail reader you use. SpamBayes Skip> works best at the client where each user can train on their own Skip> mail. That said, I'm not sure anyone has investigated integrating Skip> it with a Domino server, though others have done so for other SMTP Skip> servers. James> We use Lotus Notes client to read mail. What I was tring to do is James> set it up to be its own server. Is that possible for all email be James> filtered through this server then go to the Domino server. There is an smtpproxy script, but I'm not sure if this is the sort of situation it was designed for. Another SB user wrote an SMTP proxy which performs the necessary checks at SMTP transfer time, rejecting the message if it scores as spam, then calling the real SMTP server if it passes that test. That sounds like something you could try. Search the list archives for the past few weeks. Skip From skip at pobox.com Wed Sep 10 16:28:10 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 10 16:28:30 2003 Subject: [Spambayes] Error in classifier.py? In-Reply-To: <20030910200205.82097.qmail@web80410.mail.yahoo.com> References: <16223.29365.436886.368204@montanaro.dyndns.org> <20030910200205.82097.qmail@web80410.mail.yahoo.com> Message-ID: <16223.35162.304062.296851@montanaro.dyndns.org> >>> whichdb.whichdb(r"c:\program files\python23\scripts\hammie.db") 'dbhash' That's what I figured. Note that some bugs related to storage have been fixed in the past few weeks. You might want to try 1.0a5 if you're not up to that rev yet. Martin> I made the change you suggested and renamed the database. Martin> Working fine so far. Thanks. Martin> P.S. I didn't post this reply to the list. If that's wrong, let Martin> me know. Not wrong, but when you're trying to solve a problem it helps to get more than one perspective instead of just that of the first wacko to reply. ;-) In particular, others appear to be much better at remembering that "the database transmogrification bug was fixed in version 3.14159 of somereallybigmodule.py". Skip From adam.walker at rbwconsulting.com Wed Sep 10 17:31:31 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Wed Sep 10 16:32:00 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: <516DF10AD075D611853F00B0D03DF0E69C672D@CPSLA1> Message-ID: <20030910203146.6F999862AA@plunder.dreamhost.com> You can use outlook builtin rules to make black or white lists. With a good, balanced training set, I've never needed them. I used one other Bayesian before SpamBayes with a white and black list, which performed terribly. I ditched the lists and retrained with the same product and got MUCH better results, but still not good as SpamBayes. Thanks, Adam Walker (located somewhere in the real world) > -----Original Message----- > From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] > On Behalf Of Tom Boland > Sent: Wednesday, September 10, 2003 4:17 PM > To: 'spambayes@python.org' > Subject: [Spambayes] Whitelist for SpamBayes for Outlook > > I know you say it's not needed *but it is*!! > > This is the only thing stopping me from implementing this product (and > making donations) over anything else. I need to make sure that email from > certain people (my boss, his boss, all local domain mail ,etc.) gets > through > 100% of the time. > > Even after several months of training, some mail from these people gets > dumped into Quarantine: Junk. I even saved 200 messages from one person, > trained it as good mail, and I still get mail from that person (my CEO, no > less) sent to the Quarantine. > > I know, I know that this goes against your belief in the One True God of > Bayesian filtering, but in the real world, we REALLY need at least a white > list and preferably a blacklist, too. > > Thanks for an otherwise fantastic program. Keep up the good work. > > Tom Boland > IT Manager > Commercial Programming Systems > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From skip at pobox.com Wed Sep 10 16:35:41 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 10 16:35:54 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: <516DF10AD075D611853F00B0D03DF0E69C672D@CPSLA1> References: <516DF10AD075D611853F00B0D03DF0E69C672D@CPSLA1> Message-ID: <16223.35613.854154.484775@montanaro.dyndns.org> Tom> Even after several months of training, some mail from these people Tom> gets dumped into Quarantine: Junk. I even saved 200 messages from Tom> one person, trained it as good mail, and I still get mail from that Tom> person (my CEO, no less) sent to the Quarantine. Sounds to me like you probably have some significant errors in your training. If you have 200 messages from the big cheese trained as "good" and his messages still get quarantined, I have to believe there's something wrong with your training. Can you post the clues from the quarantined message without giving away the company's secrets? It would be helpful to see them. I don't know if the Outlook plugin displays the ham and spam counts for various clues as well, but that would also be helpful. Tom> I know, I know that this goes against your belief in the One True Tom> God of Bayesian filtering, but in the real world, we REALLY need at Tom> least a white list and preferably a blacklist, too. That's not it. Outlook already allows you to define such rules. Mark would simply be reimplementing something you can already do. If what you're really asking for is for the Outlook plugin to implement whitelists and blacklists in an easier fashion than you can do in Outlook directly, I think your beef is with Microsoft. ;-) Skip From StarPilot at sol3.net Wed Sep 10 15:45:23 2003 From: StarPilot at sol3.net (StarPilot) Date: Wed Sep 10 16:47:10 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: <16223.35613.854154.484775@montanaro.dyndns.org> Message-ID: <023601c377dc$75b03ae0$c902a8c0@sol3.int> Am I missing something? I am running the Outlook plug-in (love it death!) and it has a Black/White list function. I am 2 versions out of date at the moment: did it go away? I use it and the Outlook rules (100+ rules!) and it seems to work just fine. I've seen <10 emails that got classified as possible spam that should not have. With as many lists as I am on I get >1500 emails/week and very rarely now do I get a misclassification. Confused. Keith -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Skip Montanaro Sent: Wednesday, September 10, 2003 2:36 PM To: Tom Boland Cc: 'spambayes@python.org' Subject: Re: [Spambayes] Whitelist for SpamBayes for Outlook Tom> Even after several months of training, some mail from these people Tom> gets dumped into Quarantine: Junk. I even saved 200 messages from Tom> one person, trained it as good mail, and I still get mail from that Tom> person (my CEO, no less) sent to the Quarantine. Sounds to me like you probably have some significant errors in your training. If you have 200 messages from the big cheese trained as "good" and his messages still get quarantined, I have to believe there's something wrong with your training. Can you post the clues from the quarantined message without giving away the company's secrets? It would be helpful to see them. I don't know if the Outlook plugin displays the ham and spam counts for various clues as well, but that would also be helpful. Tom> I know, I know that this goes against your belief in the One True Tom> God of Bayesian filtering, but in the real world, we REALLY need at Tom> least a white list and preferably a blacklist, too. That's not it. Outlook already allows you to define such rules. Mark would simply be reimplementing something you can already do. If what you're really asking for is for the Outlook plugin to implement whitelists and blacklists in an easier fashion than you can do in Outlook directly, I think your beef is with Microsoft. ;-) Skip _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From skip at pobox.com Wed Sep 10 16:55:45 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 10 16:55:57 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: <023601c377dc$75b03ae0$c902a8c0@sol3.int> References: <16223.35613.854154.484775@montanaro.dyndns.org> <023601c377dc$75b03ae0$c902a8c0@sol3.int> Message-ID: <16223.36817.852977.418235@montanaro.dyndns.org> Keith> Am I missing something? I am running the Outlook plug-in (love Keith> it death!) and it has a Black/White list function. I am 2 Keith> versions out of date at the moment: did it go away? There is a whitelist capability in the InBoxer/SpamAtBay commercial version. Are you using it perhaps? Skip From m0davis at pacbell.net Wed Sep 10 15:10:38 2003 From: m0davis at pacbell.net (Martin Davis) Date: Wed Sep 10 17:16:25 2003 Subject: [Spambayes] Error in classifier.py? In-Reply-To: Message-ID: <20030910211038.80808.qmail@web80405.mail.yahoo.com> The problem occured while I was using 1.0a5. One idea I have for why it might have gotten corrupted: While running the pop3proxy I halted it by closing the "DOS" box, instead of going into the web interface and choosing "Save and shutdown" first. Could this have caused the problem? -Martin === Skip Wrote: ================================ Subject: Re: [Spambayes] Error in classifier.py? To: Martin Davis >>> whichdb.whichdb(r"c:\program files\python23\scripts\hammie.db") 'dbhash' That's what I figured. Note that some bugs related to storage have been fixed in the past few weeks. You might want to try 1.0a5 if you're not up to that rev yet. From john at comnet-tech.com Wed Sep 10 18:19:58 2003 From: john at comnet-tech.com (John A) Date: Wed Sep 10 17:19:53 2003 Subject: [Spambayes] RE: Spambayes Digest, Vol 61, Issue 36 In-Reply-To: Message-ID: Please unsubscribe me from this mailing list. I have been trying to get off of it for several weeks now. -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of spambayes-request@python.org Sent: Wednesday, September 10, 2003 4:29 PM To: spambayes@python.org Subject: Spambayes Digest, Vol 61, Issue 36 Send Spambayes mailing list submissions to spambayes@python.org To subscribe or unsubscribe via the World Wide Web, visit http://mail.python.org/mailman/listinfo/spambayes or, via email, send a message with subject or body 'help' to spambayes-request@python.org You can reach the person managing the list at spambayes-owner@python.org When replying, please edit your Subject line so it is more specific than "Re: Contents of Spambayes digest..." Today's Topics: 1. Re: error (Anthony Baxter) 2. Re: Watch out for this (Anthony Baxter) 3. Re: Unfilterable email using Outlook plugin (Anthony Baxter) 4. Re: How to setup Spambayes for Domino server (JRedmond@ymcastlouis.org) 5. RE: Watch out for this (Ryan Malayter) 6. Error in classifier.py? (Martin Davis) 7. RE: Watch out for this (Skip Montanaro) 8. Re: Error in classifier.py? (Skip Montanaro) 9. Rule Wizard (TheAceMan) 10. Whitelist for SpamBayes for Outlook (Tom Boland) 11. Re: How to setup Spambayes for Domino server (Skip Montanaro) 12. Re: Error in classifier.py? (Skip Montanaro) ---------------------------------------------------------------------- Message: 1 Date: Thu, 11 Sep 2003 02:22:29 +1000 From: Anthony Baxter Subject: Re: [Spambayes] error To: "Rueve, Christina" Cc: spambayes@python.org Message-ID: <200309101622.h8AGMTmc003320@localhost.localdomain> Content-Type: text/plain; charset="us-ascii" >>> "Rueve, Christina" wrote > I am getting the following error during installation: Unable to > register the DLL/OCX: DllRegister Server Failed; code 0x00000000. Click > Retry to try again, Ignore to proceed anyway (not recommended), or Abort > to cancel installation. I click on Retry and nothing happened. I then > clicked on igore. The version is binary version 0.6. PC is XP with > Office XP. Please advise. Please try again with the 0.81 version on the website. Anthony ------------------------------ Message: 2 Date: Thu, 11 Sep 2003 02:25:32 +1000 From: Anthony Baxter Subject: Re: [Spambayes] Watch out for this To: skip@pobox.com Cc: spambayes@python.org Message-ID: <200309101625.h8AGPWUA003410@localhost.localdomain> Content-Type: text/plain; charset="us-ascii" >>> Skip Montanaro wrote > Good suggestion. I'm not sure if the tokenizer does this already, but a > quick grep for '&#[0-9];' through my current training database (about 3 > million lines) suggests this is still fairly infrequently used. I only > found about 2100 lines (around 0.07%) of the lines contained a numeric > entity. If/when the spammers start using such techniques and they turn out > to cause problems for the classifier, it should be fairly easy to extend the > tokenizer to make the necessary substitutions. Maybe we should have a file somewhere of "yet to be tested" tokeniser ideas? And update it with a comment when we find what does or doesn't work? (Ref the discussion yesterday about tokenising tricks tried and abandoned...) ------------------------------ Message: 3 Date: Thu, 11 Sep 2003 02:26:51 +1000 From: Anthony Baxter Subject: Re: [Spambayes] Unfilterable email using Outlook plugin To: "Greg Jewell" Cc: spambayes@python.org Message-ID: <200309101626.h8AGQpZi003448@localhost.localdomain> Content-Type: text/plain; charset="us-ascii" >>> "Greg Jewell" wrote > I encountered a problem this morning, though, and I'm not certain what > the issue is. I received a spam email that is disguised as if it's a > message returne d as undeliverable. Here's the subject of the mail: > "Undeliverable: Read: gjeve get Free Printer Cartridges and more". > For some reason, Spambayes says that this message is unfilterable. It > won't give it a spam probability value, and when I click "Delete as > Spam" an error message pops up saying that No filterable mail items > have been selected. > > Has anybody else encountered this before? Hm. Is it possible that the message's MIME structure is totally messed up, and the tokeniser's bailing out? It's far too long since I looked at that bit of the code... Tony? Anthony ------------------------------ Message: 4 Date: Wed, 10 Sep 2003 11:32:03 -0500 From: JRedmond@ymcastlouis.org Subject: Re: [Spambayes] How to setup Spambayes for Domino server To: spambayes@python.org Message-ID: Content-Type: text/plain; charset=us-ascii Check out http://www.openntf.org, the Domino/Notes open-source community. There's talk of adding Bayesian filtering to their OpenNTFMail template and pushing the workload off onto the client side, and Spambayes will probably wind up getting integrated somehow. (They're in the process of relocating their servers today, though, so you'll have to check it out tomorrow.) In the meantime, I'd suggest you resist the urge to integrate the Domino server's SMTP and Router processes and Bayesian filtering - speaking from experience, that approach can only bring you pain. Instead, use Spambayes as a proxy. ************************************ James Redmond, Domino Administrator YMCA of Greater St. Louis jredmond@ymcastlouis.org ------------------------------ Message: 5 Date: Wed, 10 Sep 2003 12:18:08 -0500 From: "Ryan Malayter" Subject: RE: [Spambayes] Watch out for this To: "Anthony Baxter" , Cc: spambayes@python.org Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D3FB@cliff.bai.org> Content-Type: text/plain; charset="us-ascii" From: Anthony Baxter [mailto:anthony@interlink.com.au] > Maybe we should have a file somewhere of > "yet to be tested" tokeniser ideas? And > update it with a comment when we find what > does or doesn't work? This sounds like a good idea to me, and would prevent a lot of unnecessary duplication of effort. Perhaps in the RFE section of the SpamBayes CVS? We could preface the subject of all tokenizing entries with a [tokenizer] tag or something. ------------------------------ Message: 6 Date: Wed, 10 Sep 2003 10:41:12 -0700 (PDT) From: Martin Davis Subject: [Spambayes] Error in classifier.py? To: spambayes@python.org Message-ID: <20030910174112.12074.qmail@web80401.mail.yahoo.com> Content-Type: text/plain; charset=us-ascii (Forgive me if this message has been sent more than once.) I finally thought I had Spambayes' pop3proxy working when Istarted getting the following error (see below) whenever I try toreceieve an email message. I am running Mozilla 1.5b on Windows XPPro. Note that is was working correctly for a short while. I don'tknow what happened to cause the error. I do notice that Spambayes WebInterface reports "Totalemails trained: Spam: 5 Ham: 92". However, I know that recently there were as many as 10spam messages and 100 ham messages. Please help. Thanks. Error dump follows: ------------------------------------------------------------------------ SpamBayes POP3 Proxy Beta2, version 0.2 (September 2003), using SpamBayes POP3 Proxy Web Interface Alpha3, version 0.03 and engine SpamBayes Beta2, version 0.2 (July 2003). Loading database... SMTP Listener on port 25 is proxyingsmtp.pacbell.yahoo.com: 25 Listener on port 110 is proxying pop.pacbell.yahoo.com:110 User interface url is http://localhost:8880/ Traceback (most recent call last): File "C:\Program Files\Python23\Scripts\pop3proxy.py", line 437, inonRetr evidence=True) File"C:\PROGRA~1\Python23\Lib\site-packages\spambayes\classifier.py", line22 3, in chi2_spamprob clues = self._getclues(wordstream) File"C:\PROGRA~1\Python23\Lib\site-packages\spambayes\classifier.py", line46 0, in _getclues prob = self.probability(record) File"C:\PROGRA~1\Python23\Lib\site-packages\spambayes\classifier.py", line31 0, in probability assert spamcount <= nspam AssertionError ------------------------------ Message: 7 Date: Wed, 10 Sep 2003 13:40:30 -0500 From: Skip Montanaro Subject: RE: [Spambayes] Watch out for this To: "Ryan Malayter" Cc: spambayes@python.org Message-ID: <16223.28702.409585.775186@montanaro.dyndns.org> Content-Type: text/plain; charset=us-ascii Ryan> From: Anthony Baxter [mailto:anthony@interlink.com.au] >> Maybe we should have a file somewhere of "yet to be tested" tokeniser >> ideas? And update it with a comment when we find what does or doesn't >> work? +1 Ryan> This sounds like a good idea to me, and would prevent a lot of Ryan> unnecessary duplication of effort. Or at least a lot of unnecessary duplication of suggestions. ;-) Any entries in that file which are shot down should probably be explained. In fact, perhaps a quick summary of all the stuff the tokenizer does would be useful. Why make the spammers read our source code? ;-) Skip ------------------------------ Message: 8 Date: Wed, 10 Sep 2003 13:51:33 -0500 From: Skip Montanaro Subject: Re: [Spambayes] Error in classifier.py? To: Martin Davis Cc: spambayes@python.org Message-ID: <16223.29365.436886.368204@montanaro.dyndns.org> Content-Type: text/plain; charset=us-ascii Martin> I finally thought I had Spambayes' pop3proxy working when Martin> Istarted getting the following error ... Martin> File"C:\PROGRA~1\Python23\Lib\site-packages\spambayes\classifier.py", line310, in probability Martin> assert spamcount <= nspam Martin> AssertionError This is a sure sign of a corrupted database file. If you execute this Python code: import whichdb whichdb.whichdb(r"c:\path\to\your\database\file") what does it report? I suggest you rename it (in case someone wants to look at it) and restart your training. Since you're using pop3proxy I suggest you use the pickle format. To do this, add these lines to your ini file: [Storage] persistent_use_database: False Make sure you move your corrupt database out of the way first. Skip ------------------------------ Message: 9 Date: Wed, 10 Sep 2003 15:00:15 -0500 From: "TheAceMan" Subject: [Spambayes] Rule Wizard To: Message-ID: <000001c377d6$2865e920$6401a8c0@theaceman> Content-Type: text/plain; charset="us-ascii" I am currently running Winsows XP Pro and MS Outlook 2002. I recently added SpamBayes to outlook. But it seems one of my rules is overriding spambaye. When I get a new message it is moved to the spam folder then moved to another folder because of rules I had previously. However I can not open the rule wizard any more either. I'm unsure why it won't open. Outlook runs normally and everything seems to work yet I can not open the rule wizard, I can create new rules but right clicking on a message but can't see old rules by selecting tools | rule wizard. I can try reinstalling Office or Outlook or XP but would like some advice before I do that. Thanks in advance, Kevin ------------------------------ Message: 10 Date: Wed, 10 Sep 2003 13:16:36 -0700 From: Tom Boland Subject: [Spambayes] Whitelist for SpamBayes for Outlook To: "'spambayes@python.org'" Message-ID: <516DF10AD075D611853F00B0D03DF0E69C672D@CPSLA1> Content-Type: text/plain; charset="iso-8859-1" I know you say it's not needed *but it is*!! This is the only thing stopping me from implementing this product (and making donations) over anything else. I need to make sure that email from certain people (my boss, his boss, all local domain mail ,etc.) gets through 100% of the time. Even after several months of training, some mail from these people gets dumped into Quarantine: Junk. I even saved 200 messages from one person, trained it as good mail, and I still get mail from that person (my CEO, no less) sent to the Quarantine. I know, I know that this goes against your belief in the One True God of Bayesian filtering, but in the real world, we REALLY need at least a white list and preferably a blacklist, too. Thanks for an otherwise fantastic program. Keep up the good work. Tom Boland IT Manager Commercial Programming Systems ------------------------------ Message: 11 Date: Wed, 10 Sep 2003 15:22:29 -0500 From: Skip Montanaro Subject: Re: [Spambayes] How to setup Spambayes for Domino server To: "jmwilson@knology.net" Cc: spambayes@python.org Message-ID: <16223.34821.348988.769104@montanaro.dyndns.org> Content-Type: text/plain; charset=us-ascii (make sure you reply to the entire list so you get the benefit of everyone's eyeballs and brains...) James> I am running the software on Redhat 8. The domino server is James> running SMTP. How do I set it up to scan the emails that come James> through. I have already got it installed. Could you please James> help? Skip> You didn't say anything about the mail reader you use. SpamBayes Skip> works best at the client where each user can train on their own Skip> mail. That said, I'm not sure anyone has investigated integrating Skip> it with a Domino server, though others have done so for other SMTP Skip> servers. James> We use Lotus Notes client to read mail. What I was tring to do is James> set it up to be its own server. Is that possible for all email be James> filtered through this server then go to the Domino server. There is an smtpproxy script, but I'm not sure if this is the sort of situation it was designed for. Another SB user wrote an SMTP proxy which performs the necessary checks at SMTP transfer time, rejecting the message if it scores as spam, then calling the real SMTP server if it passes that test. That sounds like something you could try. Search the list archives for the past few weeks. Skip ------------------------------ Message: 12 Date: Wed, 10 Sep 2003 15:28:10 -0500 From: Skip Montanaro Subject: Re: [Spambayes] Error in classifier.py? To: Martin Davis Cc: spambayes@python.org Message-ID: <16223.35162.304062.296851@montanaro.dyndns.org> Content-Type: text/plain; charset=us-ascii >>> whichdb.whichdb(r"c:\program files\python23\scripts\hammie.db") 'dbhash' That's what I figured. Note that some bugs related to storage have been fixed in the past few weeks. You might want to try 1.0a5 if you're not up to that rev yet. Martin> I made the change you suggested and renamed the database. Martin> Working fine so far. Thanks. Martin> P.S. I didn't post this reply to the list. If that's wrong, let Martin> me know. Not wrong, but when you're trying to solve a problem it helps to get more than one perspective instead of just that of the first wacko to reply. ;-) In particular, others appear to be much better at remembering that "the database transmogrification bug was fixed in version 3.14159 of somereallybigmodule.py". Skip ------------------------------ _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html End of Spambayes Digest, Vol 61, Issue 36 ***************************************** From richie at entrian.com Wed Sep 10 23:19:51 2003 From: richie at entrian.com (Richie Hindle) Date: Wed Sep 10 17:20:00 2003 Subject: [Spambayes] Error in classifier.py? In-Reply-To: <20030910211038.80808.qmail@web80405.mail.yahoo.com> References: <20030910211038.80808.qmail@web80405.mail.yahoo.com> Message-ID: [Martin] > One idea I have for why it might have gotten > corrupted: While running the pop3proxy I halted it by > closing the "DOS" box, instead of going into the web > interface and choosing "Save and shutdown" first. > Could this have caused the problem? Possibly - could you have been training at the time you closed it down? I (kind of speculatively) fixed a bug whereby the database could become corrupt because of improper shutdown, and that fix is in 1.0a5. But if you kill it *during* training, it could still cause corruption. If you're weren't training at the time, my fix may not have been worth the bits it was encoded on... -- Richie Hindle richie@entrian.com From richie at entrian.com Wed Sep 10 23:21:54 2003 From: richie at entrian.com (Richie Hindle) Date: Wed Sep 10 17:22:05 2003 Subject: [Spambayes] RE: Spambayes Digest, Vol 61, Issue 36 In-Reply-To: References: Message-ID: [John] > Please unsubscribe me from this mailing list. I have been trying to get off > of it for several weeks now. The instructions on how to unsubscribe are right there in the post you're replying to: > To subscribe or unsubscribe via the World Wide Web, visit > http://mail.python.org/mailman/listinfo/spambayes > or, via email, send a message with subject or body 'help' to > spambayes-request@python.org -- Richie Hindle richie@entrian.com From skip at pobox.com Wed Sep 10 17:25:40 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 10 17:25:53 2003 Subject: [Spambayes] Error in classifier.py? In-Reply-To: <20030910211038.80808.qmail@web80405.mail.yahoo.com> References: <20030910211038.80808.qmail@web80405.mail.yahoo.com> Message-ID: <16223.38612.223857.328983@montanaro.dyndns.org> Martin> The problem occured while I was using 1.0a5. One idea I have Martin> for why it might have gotten corrupted: While running the Martin> pop3proxy I halted it by closing the "DOS" box, instead of going Martin> into the web interface and choosing "Save and shutdown" first. Martin> Could this have caused the problem? Yup. Perhaps we can guard against that by wrapping the loop in a try: loop_forever() except ???: save_database() Any idea what exception gets raised when you kill a DOS box with Python running? Skip From StarPilot at sol3.net Wed Sep 10 16:10:03 2003 From: StarPilot at sol3.net (StarPilot) Date: Wed Sep 10 17:33:39 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: <16223.36817.852977.418235@montanaro.dyndns.org> Message-ID: <023701c377df$e7bc5e40$c902a8c0@sol3.int> Ahh! OK. I am using the SpamAtBayes (now InBoxer product). Kudos for this product. My wife and I get so much spam and it nails 99% or more of it dead on. The only real ones still getting through are the all image ones and the occasional new twists to s.p31_1ing or _s p e l l i n g_ (spelling). ;-) Keith -----Original Message----- From: Skip Montanaro [mailto:skip@pobox.com] Sent: Wednesday, September 10, 2003 2:56 PM To: StarPilot@sol3.net Cc: spambayes@python.org Subject: RE: [Spambayes] Whitelist for SpamBayes for Outlook Keith> Am I missing something? I am running the Outlook plug-in (love Keith> it death!) and it has a Black/White list function. I am 2 Keith> versions out of date at the moment: did it go away? There is a whitelist capability in the InBoxer/SpamAtBay commercial version. Are you using it perhaps? Skip From richie at entrian.com Wed Sep 10 23:35:38 2003 From: richie at entrian.com (Richie Hindle) Date: Wed Sep 10 17:35:49 2003 Subject: [Spambayes] Error in classifier.py? In-Reply-To: <16223.38612.223857.328983@montanaro.dyndns.org> References: <20030910211038.80808.qmail@web80405.mail.yahoo.com> <16223.38612.223857.328983@montanaro.dyndns.org> Message-ID: [Martin] > While running the > pop3proxy I halted it by closing the "DOS" box, instead of going > into the web interface and choosing "Save and shutdown" first. > Could this have caused the problem? [Skp] > Any idea what exception gets raised when you kill a DOS box with Python > running? None, sadly. Running this: import time file("1", "w") try: time.sleep(10) except: file("2", "w") and killing the DOS box creates file "1" but no file "2". -- Richie Hindle richie@entrian.com From tim.one at comcast.net Wed Sep 10 18:36:49 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed Sep 10 17:36:52 2003 Subject: [Spambayes] Error in classifier.py? In-Reply-To: <16223.38612.223857.328983@montanaro.dyndns.org> Message-ID: [Skip] > Any idea what exception gets raised when you kill a DOS box with > Python running? None; it's like doing sys._exit() (no exception is raised, no finally blocks are executed; open files may be left in inconsistent states; etc); Bad Idea. From tim.one at comcast.net Wed Sep 10 18:44:54 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed Sep 10 17:44:57 2003 Subject: [Spambayes] Watch out for this In-Reply-To: Message-ID: [Balazs Attila Mihaly] > ... > as you know in html pages characters can be written as #; > where represents the ASCII (or maybe UNICODE - I'm not > sure) code of the character. Now, if you don't convert these characters > back to their corresponding values ... spambayes already decodes numeric character entities. That's what # Replace numeric character entities (like a for the letter # 'a'). text = numeric_entity_re.sub(numeric_entity_replacer, text) in Tokenizer.tokenize_body() does. It's a relatively recent addition. I didn't see false negatives due to this trick before adding the decoding, but did get a number of irritating Unsures that were stopped cold by doing this decoding. From skip at pobox.com Wed Sep 10 17:46:05 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 10 17:46:19 2003 Subject: [Spambayes] Error in classifier.py? In-Reply-To: References: <16223.38612.223857.328983@montanaro.dyndns.org> Message-ID: <16223.39837.499008.261724@montanaro.dyndns.org> >> Any idea what exception gets raised when you kill a DOS box with >> Python running? Tim> None; it's like doing sys._exit() Bummer. Skip From jmoyles at cdre.com Wed Sep 10 17:31:09 2003 From: jmoyles at cdre.com (John Moyles) Date: Wed Sep 10 18:31:07 2003 Subject: [Spambayes] SpamBayes Wish List Item: The ability to DELETE spam Message-ID: <000601c377eb$3cbac980$2a010a0a@jmpc> Thanks for the great product (running it as an outlook plugin). I've turned several people on to the product. When touching a "spam" or "possible" spam message, you can leave it untouched, move, or copy it. How about the option to delete (and/or permanantly delete it)? If this is considered to dangerous how about a third column with a slider and the option to delete the message - So I could set my slider to x% and it would delete and messages that it scores at x% or better. This came about on our end because I end up manually blowing away the spam messages (I get about 1,000 spam messages a day now), which takes extra time on my part. I considered using Outlooks autoarchive feature to do the same thing once per day, *but* autoarchive sends "not read" receipts for any message not read and deleted via the autoarchive, regardless of your outlook settings. Which sucks. Is it possible through the plugin, and is it a solid enough an idea to contemplate? - John --- John Moyles Aurora (Denver), CO Land Line: (720) 870-2626 Mobile: (720) 201-8702 Fax: (720) 294-9934 Text Page: mobile@cdre.com From rmalayter at bai.org Wed Sep 10 19:17:32 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Sep 10 19:17:37 2003 Subject: [Spambayes] SpamBayes Wish List Item: The ability to DELETE spam Message-ID: <792DE28E91F6EA42B4663AE761C41C2A0111517F@cliff.bai.org> From: John Moyles [mailto:jmoyles@cdre.com] > I considered using Outlooks autoarchive feature > to do the same thing once per day, *but* autoarchive > sends "not read" receipts for any message not read > and deleted via the autoarchive, regardless of your > outlook settings. Which sucks. If you're an Exchange server shop, the server-side Mailbox Manager service can do what you want. From mhammond at skippinet.com.au Thu Sep 11 10:32:05 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed Sep 10 19:32:05 2003 Subject: [Spambayes] Rule Wizard In-Reply-To: <000001c377d6$2865e920$6401a8c0@theaceman> Message-ID: <352a01c377f3$c0122a00$f502a8c0@eden> > I am currently running Winsows XP Pro and MS Outlook 2002. I recently > added SpamBayes to outlook. But it seems one of my rules is > overriding > spambaye. Unfortunately, we can't guarantee to be run before Outlook rules. If you enable "background processing", we do our best to filter *after* outlook rules. So you can have your rules move stuff around, and get SpamBayes to watch your inbox *and* the folders that Outlook moves messages to. > However I > can not open the rule wizard any more either. I'm unsure why it won't > open. Outlook runs normally and everything seems to work yet > I can not > open the rule wizard, I can create new rules but right clicking on a > message but can't see old rules by selecting tools | rule > wizard. I can > try reinstalling Office or Outlook or XP but would like some advice > before I do that. That does sound strange. I would try re-booting before re-installing :) Mark. From mhammond at skippinet.com.au Thu Sep 11 10:34:31 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed Sep 10 19:34:33 2003 Subject: [Spambayes] Unfilterable email using Outlook plugin In-Reply-To: <741C3B0BAF7B1F4F8FCC50AC4766CBF00500F9@phoenix.ossconnexn.com> Message-ID: <353001c377f4$16dc7ca0$f502a8c0@eden> > I've been using Spambayes for a few weeks now and am > extremely happy with it. Congratulations and thanks to all > of the developers for working on this project! > > I encountered a problem this morning, though, and I'm not > certain what the issue is. I received a spam email that is > disguised as if it's a message returned as undeliverable. > Here's the subject of the mail: "Undeliverable: Read: gjeve > get Free Printer Cartridges and more". For some reason, > Spambayes says that this message is unfilterable. It won't > give it a spam probability value, and when I click "Delete as > Spam" an error message pops up saying that No filterable mail > items have been selected. Please try upgrading to the 008 release - we did fix some "unfilterable" issues there. If that version has the same problem, please forward the mail to me. Thanks, Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2040 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030911/90127a88/winmail-0001.bin From gjewell at cnnxn.com Wed Sep 10 18:36:42 2003 From: gjewell at cnnxn.com (Greg Jewell) Date: Wed Sep 10 19:36:51 2003 Subject: [Spambayes] Unfilterable email using Outlook plugin Message-ID: <741C3B0BAF7B1F4F8FCC50AC4766CBF02489DD@phoenix.ossconnexn.com> I've upgraded to 0081 and am still encountering the problem. The forwarded message will follow. Thanks for looking at this. Greg Jewell >-----Original Message----- >From: Mark Hammond [mailto:mhammond@skippinet.com.au] >Sent: Wednesday, September 10, 2003 5:35 PM >To: Greg Jewell; spambayes@python.org >Subject: RE: [Spambayes] Unfilterable email using Outlook plugin > > >> I've been using Spambayes for a few weeks now and am >> extremely happy with it. Congratulations and thanks to all >> of the developers for working on this project! >> >> I encountered a problem this morning, though, and I'm not >> certain what the issue is. I received a spam email that is >> disguised as if it's a message returned as undeliverable. >> Here's the subject of the mail: "Undeliverable: Read: gjeve >> get Free Printer Cartridges and more". For some reason, >> Spambayes says that this message is unfilterable. It won't >> give it a spam probability value, and when I click "Delete as >> Spam" an error message pops up saying that No filterable mail >> items have been selected. > >Please try upgrading to the 008 release - we did fix some >"unfilterable" issues there. If that version has the same >problem, please forward the mail to me. > >Thanks, > >Mark. > From mhammond at skippinet.com.au Thu Sep 11 10:43:58 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed Sep 10 19:43:55 2003 Subject: [Spambayes] Problem with SpamBayes In-Reply-To: <20030910052328.23231.qmail@web9204.mail.yahoo.com> Message-ID: <354801c377f5$68700310$f502a8c0@eden> > I started using the plugin on my wife's profile on the > machine and it worked great. I then tried to use it in my > profile, and the plugin does not work at all. When you say "profile", do you mean a different Outlook profile, or a different Windows user name? If you mean different "Outlook Profile", I assume this then means from the same Windows user? None of that should be a problem, but it may be relevant :) > I try to train the SpamBayes and it fails to see any > messages. It finishes the training in a split second and ends > up saying it has 0 good message and 0 spam. Trying to get the > training to see a message, I selected all my folder to train > on, and managed to get 1 message to appear as good. I put the > folders back to their proper settings, retrained and got the > 0 good 0 spam result again... > > If I try and delete a message as Spam it always says, "No > filterable mail items are selected". This is very strange. Do all of your messages in Outlook's message lists have a normal envelope icon? Either way, I have opened https://sourceforge.net/tracker/index.php?func=detail&aid=804075&group_id=61 702&atid=498103 to track this - you may like to "monitor" this bug. In the meantime, I am afraid I am stumped unless you are willing to setup and run the "source code" version (see the SpamBayes pages). This would allow me to send you quick updates to nail the problem and have the fix wind up in the next binary. Mark. From mhammond at skippinet.com.au Thu Sep 11 10:45:19 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed Sep 10 19:45:15 2003 Subject: [Spambayes] Unfilterable email using Outlook plugin In-Reply-To: <741C3B0BAF7B1F4F8FCC50AC4766CBF02489DD@phoenix.ossconnexn.com> Message-ID: <354901c377f5$9874f340$f502a8c0@eden> > I've upgraded to 0081 and am still encountering the problem. Damn :) > The forwarded message will follow. Thanks for looking at this. Also see https://sourceforge.net/tracker/index.php?func=detail&aid=804075&group_id=61 702&atid=498103 - a similar (but at this stage sounding slightly different) problem exists there. I'll check out the mail when it arrives though. Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1732 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030911/793fa685/winmail.bin From T.A.Meyer at massey.ac.nz Thu Sep 11 12:46:59 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 10 19:47:17 2003 Subject: [Spambayes] Watch out for this Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332AC9A@its-xchg4.massey.ac.nz> > Maybe we should have a file somewhere of "yet to be tested" > tokeniser ideas? And update it with a comment when we find > what does or doesn't work? (Ref the discussion yesterday about > tokenising tricks tried and abandoned...) +1. What about a wiki on spambayes.org? Does the sf hosting allow that sort of thing? =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Sep 11 12:51:55 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 10 19:52:04 2003 Subject: [Spambayes] Question On Installtion Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332ACA7@its-xchg4.massey.ac.nz> > I am looking for various steps to atke before installing > spambayes on a 98 machine, the reason I want steps as I will > know what files get installed where so it's easy to > troubleshoot . As right know I have to test it before I start > deploying companywide. We need more information than that. What mail client do you use? What type of server you do you your mail from? (POP3, IMAP, Exchange, something else...). If you take a look at the README.txt included in the 1.0a5 release it does cover installation fairly thoroughly. Regarding where files get installed: if you run "setup.py install" then you'll end up with a spambayes directory (with various files) in your Python site-packages directory, and a collection of scripts in your Python scripts directory. If you're running the Outlook plug-in from source, you get to choose where files are installed, plus a registry key or two is created, and data is stored in the Windows "Application Data" directory (see the FAQ to locate it). =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Sep 11 12:55:53 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 10 19:56:14 2003 Subject: [Spambayes] Error in classifier.py? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332ACAF@its-xchg4.massey.ac.nz> [Skip] > Any idea what exception gets raised when you kill a DOS box with > Python running? [Tim] > None; it's like doing sys._exit() (no exception is raised, no > finally blocks are executed; open files may be left in > inconsistent states; etc); Bad Idea. I think this is the biggest argument for creating the 'tray' app for Windows 9x/ME users, so that they use that to start/shutdown the proxy. Shutting down the whole machine still (presumably) _exit-style closes it, but hopefully everything else that might have been using the proxy is also dead by that time. I think that the Windows NT/2K/XP users that run the pop3proxy service avoid this, as presumably services are given the chance to close nicely. =Tony Meyer From mhammond at skippinet.com.au Thu Sep 11 11:18:23 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed Sep 10 20:18:25 2003 Subject: [Spambayes] Unfilterable email using Outlook plugin Message-ID: <357401c377fa$37545970$f502a8c0@eden> The problem is that the message in question *is* a bounce. Some spammer has used your "from" address, so the bounce is to you. The interesting thing is that Outlook has decided to transform this bounce into an "undeliverable report", which it handles slightly differently - it gets a different icon, and a different "form" which includes a "resend" option etc. Not *all* bounces are handled this way by Outlook, but clearly some are. Unfortutately, this is a "feature" gone wrong. I initially thought that Outlook only created such messages for "internal" bounces - ie, for messages that failed delivery *before* they moved to the internet - particularly in an Exchange Server environment. Initially, my experience was that I had only ever seen these "bounce" messages for legitimate bounces - but SpamBayes would then filter them, and depending on your training, would often score tham as Spam. This was clearly bad for legitmate bounce messages. Since then I have seen exactly one external bounce message be handled this way in my own mail - and now yours makes a second. I'm not sure what to do about this now... I guess opening a bug at source forge would be good :) Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2168 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030911/897ad691/winmail.bin From tim.one at comcast.net Wed Sep 10 22:47:24 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed Sep 10 21:47:29 2003 Subject: [Spambayes] Watch out for this In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130332AC9A@its-xchg4.massey.ac.nz> Message-ID: [Anthony Baxter] >> Maybe we should have a file somewhere of "yet to be tested" >> tokeniser ideas? And update it with a comment when we find >> what does or doesn't work? (Ref the discussion yesterday about >> tokenising tricks tried and abandoned...) +1 [Tony Meyer] > +1. > > What about a wiki on spambayes.org? Does the sf hosting allow that > sort of thing? Not sure. It's trivial to check in a WHATEVER.txt file and have at from there; that's probably why it hasn't been done . From tim.one at comcast.net Wed Sep 10 22:51:34 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed Sep 10 21:51:39 2003 Subject: [Spambayes] Error in classifier.py? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130332ACAF@its-xchg4.massey.ac.nz> Message-ID: [Tony Meyer, on people killing DOS boxes with a program still running] > I think this is the biggest argument for creating the 'tray' app for > Windows 9x/ME users, so that they use that to start/shutdown the > proxy. Don't know. If you try to close a DOS box under Win9x while a program is running in it, a big "are you *sure* you want to do this?" dialog box pops up, strongly discouraging you, and recommending that you terminate the program properly instead. Most Windows users have enough sense to resort to this only in an emergency. > Shutting down the whole machine still (presumably) _exit-style > closes it, but hopefully everything else that might have been using > the proxy is also dead by that time. > > I think that the Windows NT/2K/XP users that run the pop3proxy service > avoid this, as presumably services are given the chance to close > nicely. They should. OTOH, most WinNT+ users are probably slinging Outlook and using its addin. From awhig at yahoo.com Wed Sep 10 19:53:39 2003 From: awhig at yahoo.com (Richard Cook) Date: Wed Sep 10 21:53:43 2003 Subject: [Spambayes] Problem with SpamBayes In-Reply-To: <354801c377f5$68700310$f502a8c0@eden> Message-ID: <20030911015339.66720.qmail@web9206.mail.yahoo.com> Yes, I'm logging on as two different users. One works, the one I installed the plug in as, and one doesn't. The db and config file are stored in two different "document and settings" folders. The one that works has a 600K + db file, and the mine is still at the 24K default size... I'm willing to run the source code version. Can I install that version and have it be executed from the plugin? Thanks, Rich Mark Hammond wrote: > I started using the plugin on my wife's profile on the > machine and it worked great. I then tried to use it in my > profile, and the plugin does not work at all. When you say "profile", do you mean a different Outlook profile, or a different Windows user name? If you mean different "Outlook Profile", I assume this then means from the same Windows user? None of that should be a problem, but it may be relevant :) > I try to train the SpamBayes and it fails to see any > messages. It finishes the training in a split second and ends > up saying it has 0 good message and 0 spam. Trying to get the > training to see a message, I selected all my folder to train > on, and managed to get 1 message to appear as good. I put the > folders back to their proper settings, retrained and got the > 0 good 0 spam result again... > > If I try and delete a message as Spam it always says, "No > filterable mail items are selected". This is very strange. Do all of your messages in Outlook's message lists have a normal envelope icon? Either way, I have opened https://sourceforge.net/tracker/index.php?func=detail&aid=804075&group_id=61 702&atid=498103 to track this - you may like to "monitor" this bug. In the meantime, I am afraid I am stumped unless you are willing to setup and run the "source code" version (see the SpamBayes pages). This would allow me to send you quick updates to nail the problem and have the fix wind up in the next binary. Mark. From anthony at interlink.com.au Thu Sep 11 15:59:21 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Thu Sep 11 01:00:09 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: <516DF10AD075D611853F00B0D03DF0E69C672D@CPSLA1> Message-ID: <200309110459.h8B4xMp2009727@localhost.localdomain> >>> Tom Boland wrote > I know you say it's not needed *but it is*!! Please see the new paragraphs in the FAQ under whitelisting: If you really need whitelisting, consider implementing rules in your mailer to intercept the messages before they're passed to spambayes. None of the existing Spambayes developers have an interest in implementing whitelisting - if it's something you feel really strongly about, download the source and look into making a patch. If someone implements whitelisting, including all the user interface side needed to manage it, there's no reason it wouldn't be accepted into the source code. -- Anthony Baxter It's never too late to have a happy childhood. From wsy at merl.com Thu Sep 11 03:18:10 2003 From: wsy at merl.com (Bill Yerazunis) Date: Thu Sep 11 02:18:17 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130332A95A@its-xchg4.massey.ac.nz> (T.A.Meyer@massey.ac.nz) References: <1ED4ECF91CDED24C8D012BCF2B034F130332A95A@its-xchg4.massey.ac.nz> Message-ID: <200309110618.h8B6IAC09665@localhost.localdomain> From: "Meyer, Tony" > I'm not aware that anyone's run against it - perhaps someone's got some CPU > cycles burning a hole in their pocket and would like to try? I occasionally run against this. An example (with 'fresh from cvs' spambayes): filename: no_sean_sas ham:spam: 7580:7580 fp total: 44 fp %: 0.58 fn total: 16 fn %: 0.21 unsure t: 356 unsure %: 2.35 real cost: $527.20 best cost: $592.40 h mean: 3.40 h sdev: 14.19 s mean: 97.94 s sdev: 9.43 mean diff: 94.54 k: 4.00 What's your test protocol? I did "shuffle messages randomly, but preserve knowledge of which class they were in, then train with the first 90% and then test with the last 10%". Repeat as needed... -Bill Yerazunis From anthony at interlink.com.au Thu Sep 11 17:58:47 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Thu Sep 11 02:59:37 2003 Subject: [Spambayes] Watch out for this In-Reply-To: Message-ID: <200309110658.h8B6wmtJ010680@localhost.localdomain> >>> "Tim Peters" wrote > Not sure. It's trivial to check in a WHATEVER.txt file and have at from > there; that's probably why it hasn't been done . I went with NEWTRICKS.txt rather than WHATEVER.txt, but it's checked in. I doubt we'd need a wiki for this - and I really don't feel like playing 'un-vandalize the wiki' on a regular basis... From T.A.Meyer at massey.ac.nz Thu Sep 11 20:08:14 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 11 03:08:36 2003 Subject: [Spambayes] Watch out for this Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332AEA0@its-xchg4.massey.ac.nz> > I doubt we'd need a wiki for this - and I really don't feel > like playing 'un-vandalize the wiki' on a regular basis... Fair enough. My main reason for suggesting it was that then non-developers could easily make suggestions and add results, without having to get a developer to check the changes in. =Tony Meyer From anthony at interlink.com.au Thu Sep 11 18:11:33 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Thu Sep 11 03:12:13 2003 Subject: [Spambayes] Watch out for this In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130332AEA0@its-xchg4.massey.ac.nz> Message-ID: <200309110711.h8B7BXtX010810@localhost.localdomain> >>> "Meyer, Tony" wrote > Fair enough. My main reason for suggesting it was that then > non-developers could easily make suggestions and add results, without > having to get a developer to check the changes in. If the tidal wave of hints gets to be too much, we can try that. From T.A.Meyer at massey.ac.nz Thu Sep 11 22:55:55 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 11 05:56:09 2003 Subject: [Spambayes] how spambayes handles image-only spams Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332AEA8@its-xchg4.massey.ac.nz> > What's your test protocol? I did "shuffle messages randomly, > but preserve knowledge of which class they were in, then > train with the first 90% and then test with the last 10%". > Repeat as needed... I did "rebal.py -n5", which IIRC is roughly equivalent to "shuffle messages randomly, but preserve knowledge of which class they were in". I then did "timtest.py -n5". I'm happy to admit I understand little of what the testing code does, just how to interpret (most of) the results that it gives me. This is one of the strengths of the spambayes testing suite, IMO (not that I have tried any other testing suites). The readme says that it does this: """ Runs an NxN test grid, skipping the diagonal: N classifiers are built. N-1 runs are done with each classifier. Each classifier is trained on 1 set, and predicts against each of the N-1 remaining sets (those not used to train the classifier). """ So in my case, I think this means that I train with the first 20%, then test with each of the remaining 20%s (and repeat). I may be wrong . =Tony Meyer From slwst8 at earthlink.net Thu Sep 11 08:35:50 2003 From: slwst8 at earthlink.net (Shannon) Date: Thu Sep 11 08:36:22 2003 Subject: [Spambayes] Error when installing - 'Unable to register DLL' Message-ID: <000001c37861$3f06ffd0$6401a8c0@NewDesktop> I'm running Windows XP and Outlook 2002 on an AMD Athlon XP machine with 512K RAM. I turned off my virus protection during install and shutdown Outlook during install. But I got the attached error at the end of the install. I can not figure out why. Can you please help? Shannon -------------- next part -------------- A non-text attachment was scrubbed... Name: Error.jpg Type: image/jpeg Size: 11230 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030911/173bc237/Error-0001.jpg From skip at pobox.com Thu Sep 11 10:26:56 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 11 10:27:08 2003 Subject: [Spambayes] Watch out for this In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130332AC9A@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130332AC9A@its-xchg4.massey.ac.nz> Message-ID: <16224.34352.179144.718560@montanaro.dyndns.org> Tony> What about a wiki on spambayes.org? Does the sf hosting allow Tony> that sort of thing? Dunno. If a wiki's the best option, I'd be happy to host it on Mojam's server. I run several MoinMoin wikis already. Skip From m0davis at pacbell.net Thu Sep 11 08:19:11 2003 From: m0davis at pacbell.net (Martin Davis) Date: Thu Sep 11 10:27:55 2003 Subject: [Spambayes] Error in classifier.py? Message-ID: <20030911141911.79835.qmail@web80406.mail.yahoo.com> (Again, forgive me if this message was sent more than once... I have been having trouble with sending email with Mozilla for an entire day---now switching to web-based email.) Richie, it was possibly training at the time I closed the DOS box. I can't remember. I guess the real solution would be to have it running as a "real" windows app in the system tray? I've tried running "pop3proxy_tray", but I haven't had any luck. I get this: C:\Program Files\Python23\Scripts>pop3proxy_tray.py Traceback (most recent call last): File "C:\Program Files\Python23\Scripts\pop3proxy_tray.py", line 209, in ? main() File "C:\Program Files\Python23\Scripts\pop3proxy_tray.py", line 205, in main w = MainWindow() File "C:\Program Files\Python23\Scripts\pop3proxy_tray.py", line 90, in __init __ nid = (self.hwnd, 0, flags, WM_TASKBAR_NOTIFY, self.hstartedicon, "SpamBayes ") AttributeError: 'MainWindow' object has no attribute 'hstartedicon' [Richie Hindle] >Possibly - could you have been training at the time you closed it down? I >(kind of speculatively) fixed a bug whereby the database could become >corrupt because of improper shutdown, and that fix is in 1.0a5. But if >you kill it *during* training, it could still cause corruption. If you're >weren't training at the time, my fix may not have been worth the bits it >was encoded on... > > > From mnewcome at fordahl.com Thu Sep 11 17:51:57 2003 From: mnewcome at fordahl.com (Mark Newcome) Date: Thu Sep 11 10:54:33 2003 Subject: [Spambayes] (no subject) Message-ID: Windows XP, Office XP. Outlook froze, dead buttons appeared, I followed all instructions reinstalled and now no menu appears. What do I do? Thanks, Mark Newcome Fordahl Frequency Control Products 9980 Kings Road Gainesville, GA 30506 Tele 770-888-0104 X115 Fax 770-888-4451 Home Page www.fordahl.com -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 59 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030911/333d5282/spambayes1.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes2.log Type: application/octet-stream Size: 59 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030911/333d5282/spambayes2.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 59 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030911/333d5282/spambayes3.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 59 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030911/333d5282/spambayes4.obj From gjewell at cnnxn.com Thu Sep 11 09:55:41 2003 From: gjewell at cnnxn.com (Greg Jewell) Date: Thu Sep 11 10:55:45 2003 Subject: [Spambayes] Unfilterable email using Outlook plugin Message-ID: <741C3B0BAF7B1F4F8FCC50AC4766CBF02489E1@phoenix.ossconnexn.com> Thanks for taking a look at this and responding so quickly. As it's not a terrible inconvenience, I'm not yet concerned about it. Hopefully a reasonable solution will be found for this, but I don't think it's critical at this point in time. Greg Jewell >-----Original Message----- >From: Mark Hammond [mailto:mhammond@skippinet.com.au] >Sent: Wednesday, September 10, 2003 6:18 PM >To: Greg Jewell; spambayes@python.org >Subject: RE: [Spambayes] Unfilterable email using Outlook plugin > > >The problem is that the message in question *is* a bounce. >Some spammer has used your "from" address, so the bounce is to you. > >The interesting thing is that Outlook has decided to transform >this bounce into an "undeliverable report", which it handles >slightly differently - it gets a different icon, and a >different "form" which includes a "resend" option etc. Not >*all* bounces are handled this way by Outlook, but clearly some are. > >Unfortutately, this is a "feature" gone wrong. I initially >thought that Outlook only created such messages for "internal" >bounces - ie, for messages that failed delivery *before* they >moved to the internet - particularly in an Exchange Server environment. > >Initially, my experience was that I had only ever seen these >"bounce" messages for legitimate bounces - but SpamBayes would >then filter them, and depending on your training, would often >score tham as Spam. This was clearly bad for legitmate bounce >messages. Since then I have seen exactly one external bounce >message be handled this way in my own mail - and now yours >makes a second. > >I'm not sure what to do about this now... I guess opening a >bug at source forge would be good :) > >Mark. > From AAnand at mathematica-mpr.com Thu Sep 11 11:57:40 2003 From: AAnand at mathematica-mpr.com (Anu Anand) Date: Thu Sep 11 10:58:22 2003 Subject: [Spambayes] Unfilterable email using Outlook plugin Message-ID: <897E2332A97AD311AEBB00508B116D5418C350BA@mpr1> Want to remove me from the list tried the wen form but it didn't worked. Thanks -----Original Message----- From: Greg Jewell [mailto:gjewell@cnnxn.com] Sent: Thursday, September 11, 2003 10:56 AM To: Mark Hammond; spambayes@python.org Subject: RE: [Spambayes] Unfilterable email using Outlook plugin Thanks for taking a look at this and responding so quickly. As it's not a terrible inconvenience, I'm not yet concerned about it. Hopefully a reasonable solution will be found for this, but I don't think it's critical at this point in time. Greg Jewell >-----Original Message----- >From: Mark Hammond [mailto:mhammond@skippinet.com.au] >Sent: Wednesday, September 10, 2003 6:18 PM >To: Greg Jewell; spambayes@python.org >Subject: RE: [Spambayes] Unfilterable email using Outlook plugin > > >The problem is that the message in question *is* a bounce. >Some spammer has used your "from" address, so the bounce is to you. > >The interesting thing is that Outlook has decided to transform >this bounce into an "undeliverable report", which it handles >slightly differently - it gets a different icon, and a >different "form" which includes a "resend" option etc. Not >*all* bounces are handled this way by Outlook, but clearly some are. > >Unfortutately, this is a "feature" gone wrong. I initially >thought that Outlook only created such messages for "internal" >bounces - ie, for messages that failed delivery *before* they >moved to the internet - particularly in an Exchange Server environment. > >Initially, my experience was that I had only ever seen these >"bounce" messages for legitimate bounces - but SpamBayes would >then filter them, and depending on your training, would often >score tham as Spam. This was clearly bad for legitmate bounce >messages. Since then I have seen exactly one external bounce >message be handled this way in my own mail - and now yours >makes a second. > >I'm not sure what to do about this now... I guess opening a >bug at source forge would be good :) > >Mark. > _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From tim.one at comcast.net Thu Sep 11 13:41:45 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Sep 11 12:41:50 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130332AEA8@its-xchg4.massey.ac.nz> Message-ID: [Bill Yerazunis] >> What's your test protocol? I did "shuffle messages randomly, >> but preserve knowledge of which class they were in, then >> train with the first 90% and then test with the last 10%". >> Repeat as needed... [Tony Meyer] > I did "rebal.py -n5", which IIRC is roughly equivalent to "shuffle > messages randomly, but preserve knowledge of which class they were > in". I then did "timtest.py -n5". > > I'm happy to admit I understand little of what the testing code does, > just how to interpret (most of) the results that it gives me. This is > one of the strengths of the spambayes testing suite, IMO (not that I > have tried any other testing suites). > > The readme says that it does this: > """ > Runs an NxN test grid, skipping the diagonal: > N classifiers are built. > N-1 runs are done with each classifier. > Each classifier is trained on 1 set, and predicts against each of > the N-1 remaining sets (those not used to train the > classifier). """ > > So in my case, I think this means that I train with the first 20%, > then test with each of the remaining 20%s (and repeat). I may be > wrong . That's a good description of what it does. It's not the preferred way to test, because it's hard to interpret the results, is slow (N**2-N test runs are made), and it's brutal (in your case using -n5, each classifier built is tested against 4x as many messages as it was trained on). timtest.py is a more traditional cross-validation test driver, probably much closer to what Bill is doing. It's easier to interpret the results, runs faster, and will almost always deliver "better-looking results" than timtest.py delivers, because the cross-validation driver trains on many more messages than it tries to classify (the opposite is true of timtest.py). From tim.one at comcast.net Thu Sep 11 13:48:39 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Sep 11 12:48:45 2003 Subject: [Spambayes] how spambayes handles image-only spams In-Reply-To: Message-ID: [Tim, waking up] > timtest.py is a more traditional cross-validation test driver, I meant timcv.py, of course. Provided such a file actually exists . From LSayler at plazare.com Thu Sep 11 13:17:35 2003 From: LSayler at plazare.com (Larry Sayler) Date: Thu Sep 11 13:18:13 2003 Subject: [Spambayes] question Message-ID: <4B03DC7E2B88D411B1C600508B9A4EEFB61AB0@PLAZA_ES1> We have an exchange 5.5 server in our office. Will this program work behind that server as well? Larry Sayler lsayler@plazare.com From rcoe at CambridgeMA.GOV Thu Sep 11 16:23:02 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Thu Sep 11 15:24:35 2003 Subject: [Spambayes] RE: Unfilterable email using Outlook plugin Message-ID: If you're using a view that groups messages, you get the "unfilterable" message if you use "Delete as spam" on the group header. (By contrast, Outlook does let you move or delete grouped messages that way.) This is still true in version 0.81. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Mark Hammond > Sent: Wednesday, September 10, 2003 7:35 PM > To: 'Greg Jewell'; spambayes@python.org > Subject: RE: [Spambayes] Unfilterable email using Outlook plugin > > > > I've been using Spambayes for a few weeks now and am > > extremely happy with it. Congratulations and thanks to all > > of the developers for working on this project! > > > > I encountered a problem this morning, though, and I'm not > > certain what the issue is. I received a spam email that is > > disguised as if it's a message returned as undeliverable. > > Here's the subject of the mail: "Undeliverable: Read: gjeve > > get Free Printer Cartridges and more". For some reason, > > Spambayes says that this message is unfilterable. It won't > > give it a spam probability value, and when I click "Delete as > > Spam" an error message pops up saying that No filterable mail > > items have been selected. > > Please try upgrading to the 008 release - we did fix some > "unfilterable" issues there. If that version has the same > problem, please forward the mail to me. > > Thanks, > > Mark. > From rhansen at fortressinv.com Thu Sep 11 17:16:54 2003 From: rhansen at fortressinv.com (Rick Hansen) Date: Thu Sep 11 16:14:11 2003 Subject: [Spambayes] Question about filtering with Blackberry device Message-ID: <9CCB0FC31CB88642AB5BDD16431D788601089E0B@FORTRESSEX.fortressinv.com> I love this product immensely! Works great. I'm using WinXP with Outlook2002. My firm uses the Blackberry Enterprise Server, not the Desktop-Level Redirectors. If an email is spam, I wish to get it filtered to spam and never redirected to my Blackberry device (by the server). Is this possible? If it's not an easy or know answer, no big deal. Just thought I'd ask. Thanks, Rick From ebjornson at senvid.com Thu Sep 11 14:27:34 2003 From: ebjornson at senvid.com (Eric Bjornson) Date: Thu Sep 11 16:28:30 2003 Subject: [Spambayes] Doesn't do filtering automatically Message-ID: I'm a first-time user and I just installed SpamBayes as a plugin for my Outlook 2000. I did the training, I assigned the folder so the filter status says "Watching 'Inbox'...", and the checkbox shows that it is enabled. I can manually filter messages by using the "Filter Messages..." dialog, but when new email arrives it doesn't do it automatically. I've tried adding a delay by using the Advanced Tab and enabling a filter timer of 5 seconds, but it doesn't seem to help. I hope there's just some simple step that I've forgotten, but it isn't clear what it is. Any help would be greatly appreciated. Thanks, Eric From rmalayter at bai.org Thu Sep 11 17:00:32 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Thu Sep 11 17:00:35 2003 Subject: [Spambayes] Question about filtering with Blackberry device Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D406@cliff.bai.org> From: Rick Hansen [mailto:rhansen@fortressinv.com] > If an email is spam, I wish to get it > filtered to spam and never redirected > to my Blackberry device (by the server). > Is this possible? Your organization needs to install a filter that works at the SMTP level, or at least the Exchange AVAPI level, so spam gets filtered before the Blackberry Enterprise software can forward it. We use iHateSpam Server Edition from http://www.sunbelt-software.com, it works fairly well, with 90-95% capture rates. There are literally dozens of other players in the Exchange anti-spam market space, some are McAffee, Computer Associates, Trend Micro, Symantec. You also might want to look at http://assp.sf.net for an open-source Bayesian filter. In any case, you're going to have to involve your IT people to get something working. Regards, Ryan From mhammond at skippinet.com.au Fri Sep 12 09:16:12 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu Sep 11 18:16:23 2003 Subject: [Spambayes] Unfilterable email using Outlook plugin In-Reply-To: <897E2332A97AD311AEBB00508B116D5418C350BA@mpr1> Message-ID: <037601c378b2$4fb50100$f502a8c0@eden> > Want to remove me from the list tried the wen form but it > didn't worked. > Thanks I have unsubscribed you. Mark From ltrag at optonline.net Thu Sep 11 19:57:39 2003 From: ltrag at optonline.net (Lanny Trager) Date: Thu Sep 11 19:02:57 2003 Subject: [Spambayes] I think that I have found a bug in SpamBayes In-Reply-To: References: Message-ID: <200309111857.39713.ltrag@optonline.net> This is a followup on a problem that I have been having since I began using Kmail a few weeks ago. Seen before with the Subject: Classification not always added to the header Briefly, I am running: spambayes-1.0a5 (getting mail from two different pop servers) RedHat 9 Kmail After properly shutting down SpamBayes, Kmail and then my machine, the next morning SpamBayes does not classify the first few e-mails from one of the 3 pop servers that I use. After, much investigation, I discovered that Kmail places several files in "/home/myusername/.kde/share/apps/kmail" (myusername is a munge of the actual user name). These files are called (after SpamBayes installation) mungeduser@localhost:port#_that_email_client_points_to, there is one of these files for each pop server or in the case of one of the pop servers that I have two accounts on there are two of these files each pointing to the same server and port but with a different user name, reflecting my user names on the pop server. The purpose of these files is to keep track of what you want to download later and what is to be kept on the server. This is reflected by two commands: downloadLater= seenUidList= In my case there are perhaps 80 e-mails that I have chosen to keep on one of the servers, so after seenUidList= there are a series of numbers separated by commas. I don't think that it is a coincidence that this is the server from which e-mails are not being classified all of the time. (Usually the problem occurs only the first one or two times that I get my e-mail after starting SpamBayes and kmail in the morning. As an experiment I moved these files to another directory and recreated them, leaving the command in Kmail to leave the e-mail on the server. By recreating the files I had emptied them so that Kmail was not aware of what was on the server. I did this twice, the first time Kmail/SpamBayes proceeded to download the mail from the server but hung after the first few e-mails were downloaded. The messages that appeared in the terminal were: Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``the first time that I've been on my windows machine in many months so I '' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``ZXJlIENhdGFsb2cgTWFuYWdlci4gIFJlc3VsdGVkIGluIGN1c3RvbWVycyBnZXR0aW5nIG5ldyBj'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``AAAAABYHAAAAAAAAPAoAAAAAAAAAAAAAAAAAANAHAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAABH'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 110, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Continuation line seen before first header error: uncaptured python exception, closing channel <__main__.ServerLineReader connected at 0x83580a4> (exceptions.ValueError:unpack list of wrong size [/usr/lib/python2.2/asyncore.py|poll|99] [/usr/lib/python2.2/asyncore.py|handle_read_event|396] [/usr/lib/python2.2/asynchat.py|handle_read|130] [pop3proxy.py|found_terminator|146] [pop3proxy.py|onServerLine|214] [pop3proxy.py|onResponse|276] [pop3proxy.py|onTransaction|377] [pop3proxy.py|onRetr|509]) Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``Y8bX2Dke4pk7Zj8uT5gOhPUe1UEuGhVomj3e2MDH1qbWHe5fnferSRvuBO5Cvesq7s7nDMkWVbnA'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``Lh1rfy3X88xAnKiqUy/dKACBFbtRHB1V7UZc4Rm4qoOhLoDBn6pfOWZnUzY5PKviPNvr/9BRe6Pb'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``alt="Careers"> '' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, I stopped both SpamBayes and Kmail. I Restarted SpamBayes and Kmail and tried the experiment again. It hung again but this time I decided to let it go, after a few minutes it continued to download the emails but did not add any classifications to the rest of the e-mails. (It did add classifications to the first few e-mails.) p3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 110, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Continuation line seen before first header Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``we can't afford to be. "Is your ISP blocking us? Is their ISP blocking us?'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``
    '' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``sjtVETUus2U8AsHqb18zVUHyhCPhHxY6Is53zU6VeEcHGSsdTQOf+lo4VnpvlpAIwqBxUfFRqpXx'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``eGcDwMGQdb8BztWiF1zgSOf0wGup9IO/POFQv+mwteufEB7WyBPvMcazTvGuP27jYOf4szX+'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``) Breathing Problems?'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``else'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``............................................................'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``//-->'' After this last experiment, I left Kmail and SpamBayes running and all e-mails that arrived subsequent to the experiment were classified. While I might be willing to remove the mail from the server in order to derive the benefits of SpamBayes, I would really rather not. I think that SpamBayes ought to be capable of dealing with mail left on a pop server. For folks like me who retrieve their mail using more than one machine this is essential. When I am on the road I want to read my mail, but when I get back home I want to be able to download that same mail to my home machine. Lanny From chauhan1001 at netzero.com Fri Sep 12 00:38:25 2003 From: chauhan1001 at netzero.com (Chauhan) Date: Thu Sep 11 19:39:21 2003 Subject: [Spambayes] Help with pop3proxy.py Message-ID: <20030911.163903.27313.425417@webmail06.nyc.untd.com> I just installed spambayes-1.0a5 on my Windows XP (home edition) and got this error when ran pop3proxy.py ########## C:\Python22\Scripts>pop3proxy.py -b SpamBayes POP3 Proxy Beta2, version 0.2 (September 2003), using SpamBayes POP3 Proxy Web Interface Alpha3, version 0.03 and engine SpamBayes Beta2, version 0.2 (July 2003). Loading database... Traceback (most recent call last): File "C:\Python22\Scripts\pop3proxy.py", line 820, in ? run() File "C:\Python22\Scripts\pop3proxy.py", line 801, in run prepare(state=state) File "C:\Python22\Scripts\pop3proxy.py", line 737, in prepare state.createWorkers() File "C:\Python22\Scripts\pop3proxy.py", line 618, in createWorkers self.bayes = storage.open_storage(filename, self.useDB) File "C:\Python22\Lib\site-packages\spambayes\storage.py", line 652, in open_storage print >> sys.stderr, """\ UnboundLocalError: local variable 'sys' referenced before assignment ########## Can somebody tell me what I am doing wrong. Also README.txt says "Now launch pop3proxy, either by running the "pop3proxy_service.py" script (for those using Windows 2000, Windows NT or Windows XP), or the "pop3proxy.py" script (for everyone else)." But in my installation I cannot find pop3proxy_service.py Chauhan ________________________________________________________________ The best thing to hit the internet in years - NetZero HiSpeed! Surf the web up to FIVE TIMES FASTER! Only $14.95/ month -visit www.netzero.com to sign up today! From jan_ebert at web.de Fri Sep 12 02:53:24 2003 From: jan_ebert at web.de (Jan Ebert) Date: Thu Sep 11 19:53:30 2003 Subject: [Spambayes] Outlook Rules Recover Spam Message-ID: <003601c378bf$e3bf2f30$0345a8c0@HOODJan> Hello. I am using the SpamBayes Outlook PlugIn in addition to a lot of e-mail rules defined with the "Rules Wizard". Unfortunately it seems that some of these rules move e-mails back into my mail folders after these mails have been correctly put into the spam folder by SpamBayes before. For example, I want an e-mail rule to move messages received via a special POP3 account to a special folder. If spam mail arrives through this folder, SpamBayes scores the mail correctly as spam but finally it is moved to the destination folder as specified in the Outlook rule and does not stay in the spam folder as I would like it. How can I make sure that Outlook rules are not applied to messages that have already been moved by SpamBayes? Thanks a lot, Jan From mark at mcdonaldjacobs.com Thu Sep 11 18:20:30 2003 From: mark at mcdonaldjacobs.com (Mark Clift) Date: Thu Sep 11 20:20:34 2003 Subject: [Spambayes] Trouble installing Spambayes Message-ID: <754A6AAD74D7C84680B5052996986E53295B1C@MCDONALDJACOB01.mjmjcpa.com> Windows 2000P SP3 Outlook 2000 SP3 Spambayes 8.01 binary The Spambayes Menu Bar installs but the initiation Wizard will not run. After reading the troubleshooting page I tried reinstalling the application, no change; running regsvr32.exe spambayes_addin.dll fails "specified modulel could not be found" in OUtlook the tools/options/other/advanced/com addins is blank Thank you <> Mark * * * * * -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 1530 bytes Desc: spambayes1.log Url : http://mail.python.org/pipermail/spambayes/attachments/20030911/da0f0050/spambayes1.obj From chauhan1001 at netzero.com Fri Sep 12 01:41:52 2003 From: chauhan1001 at netzero.com (Chauhan) Date: Thu Sep 11 20:46:51 2003 Subject: [Spambayes] Help with pop3proxy.py Message-ID: <20030911.174630.2956.480972@webmail10.nyc.untd.com> I installed Python 2.3 and spambayes works fine. Chauhan ________________________________________________________________ The best thing to hit the internet in years - NetZero HiSpeed! Surf the web up to FIVE TIMES FASTER! Only $14.95/ month -visit www.netzero.com to sign up today! _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html ________________________________________________________________ The best thing to hit the internet in years - NetZero HiSpeed! Surf the web up to FIVE TIMES FASTER! Only $14.95/ month -visit www.netzero.com to sign up today! From T.A.Meyer at massey.ac.nz Fri Sep 12 14:01:21 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 11 21:01:56 2003 Subject: [SPAM] Re: [Spambayes] Error in classifier.py? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332B07F@its-xchg4.massey.ac.nz> > I guess the real solution would be to have it running > as a "real" windows app in the system tray? I've > tried running "pop3proxy_tray", but I haven't had any > luck. pop3proxy_tray sort of fell into the 1.0a5 release, but isn't really officially there (which is why it's not referred to in any of the documentation). This is what the app will be eventually, though, and should make it into the next release, working to a certain extent. There are also plans for a binary installer for pop3proxy that will remove the need to have Python installed, and take care of setup and the like. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 12 14:06:19 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 11 21:06:38 2003 Subject: [Spambayes] Help with pop3proxy.py Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332B088@its-xchg4.massey.ac.nz> > Can somebody tell me what I am doing wrong. What this is meant to do is tell you that you don't have a suitable dbm module available, and that you need to use a pickle, install bsddb3, or install Python 2.3 (which you did). Unfortunately a bug means that it crashes instead; this is fixed for the next release. You've already carried out the steps that you need to in order to get pop3proxy running, anyway. > Also README.txt says > > "Now launch pop3proxy, either by running the "pop3proxy_service.py" > script (for those using Windows 2000, Windows NT or Windows > XP), or the "pop3proxy.py" script (for everyone else)." > > But in my installation I cannot find pop3proxy_service.py It should be in the windows directory of the 1.0a5 archive. I don't think it gets installed into the Python scripts directory with "setup.py install"; it probably doesn't belong there anyway. The readme could probably be clearer on this point; I'll try and fix that. =Tony Meyer From ltrag at optonline.net Thu Sep 11 22:18:42 2003 From: ltrag at optonline.net (Lanny Trager) Date: Thu Sep 11 21:19:07 2003 Subject: [Spambayes] Bug in SpamBayes - It won't classify all new mail from a pop server that has old mail saved on it Message-ID: <200309112118.42858.ltrag@optonline.net> Sorry, this was previosly sent as a reply to anlther thread by accident: This is a followup on a problem that I have been having since I began using Kmail a few weeks ago. Seen before with the Subject: Classification not always added to the header Briefly, I am running: spambayes-1.0a5 (getting mail from two different pop servers) RedHat 9 Kmail After properly shutting down SpamBayes, Kmail and then my machine, the next morning SpamBayes does not classify the first few e-mails from one of the 3 pop servers that I use. After, much investigation, I discovered that Kmail places several files in "/home/myusername/.kde/share/apps/kmail" (myusername is a munge of the actual user name). These files are called (after SpamBayes installation) mungeduser@localhost:port#_that_email_client_points_to, there is one of these files for each pop server or in the case of one of the pop servers that I have two accounts on there are two of these files each pointing to the same server and port but with a different user name, reflecting my user names on the pop server. The purpose of these files is to keep track of what you want to download later and what is to be kept on the server. This is reflected by two commands: downloadLater= seenUidList= In my case there are perhaps 80 e-mails that I have chosen to keep on one of the servers, so after seenUidList= there are a series of numbers separated by commas. I don't think that it is a coincidence that this is the server from which e-mails are not being classified all of the time. (Usually the problem occurs only the first one or two times that I get my e-mail after starting SpamBayes and kmail in the morning. As an experiment I moved these files to another directory and recreated them, leaving the command in Kmail to leave the e-mail on the server. By recreating the files I had emptied them so that Kmail was not aware of what was on the server. I did this twice, the first time Kmail/SpamBayes proceeded to download the mail from the server but hung after the first few e-mails were downloaded. The messages that appeared in the terminal were: Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``the first time that I've been on my windows machine in many months so I '' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``ZXJlIENhdGFsb2cgTWFuYWdlci4gIFJlc3VsdGVkIGluIGN1c3RvbWVycyBnZXR0aW5nIG5ldyBj'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``AAAAABYHAAAAAAAAPAoAAAAAAAAAAAAAAAAAANAHAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAABH'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 110, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Continuation line seen before first header error: uncaptured python exception, closing channel <__main__.ServerLineReader connected at 0x83580a4> (exceptions.ValueError:unpack list of wrong size [/usr/lib/python2.2/asyncore.py|poll|99] [/usr/lib/python2.2/asyncore.py|handle_read_event|396] [/usr/lib/python2.2/asynchat.py|handle_read|130] [pop3proxy.py|found_terminator|146] [pop3proxy.py|onServerLine|214] [pop3proxy.py|onResponse|276] [pop3proxy.py|onTransaction|377] [pop3proxy.py|onRetr|509]) Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``Y8bX2Dke4pk7Zj8uT5gOhPUe1UEuGhVomj3e2MDH1qbWHe5fnferSRvuBO5Cvesq7s7nDMkWVbnA'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``Lh1rfy3X88xAnKiqUy/dKACBFbtRHB1V7UZc4Rm4qoOhLoDBn6pfOWZnUzY5PKviPNvr/9BRe6Pb'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``alt="Careers"> '' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, I stopped both SpamBayes and Kmail. I Restarted SpamBayes and Kmail and tried the experiment again. It hung again but this time I decided to let it go, after a few minutes it continued to download the emails but did not add any classifications to the rest of the e-mails. (It did add classifications to the first few e-mails.) p3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 110, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Continuation line seen before first header Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``we can't afford to be. "Is your ISP blocking us? Is their ISP blocking us?'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``
      '' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``sjtVETUus2U8AsHqb18zVUHyhCPhHxY6Is53zU6VeEcHGSsdTQOf+lo4VnpvlpAIwqBxUfFRqpXx'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``eGcDwMGQdb8BztWiF1zgSOf0wGup9IO/POFQv+mwteufEB7WyBPvMcazTvGuP27jYOf4szX+'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``) Breathing Problems?'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``else'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``............................................................'' Traceback (most recent call last): File "pop3proxy.py", line 433, in onRetr msg.setPayload(messageText) File "spambayes/message.py", line 177, in setPayload prs._parseheaders(self, fp) File "/usr/lib/python2.2/email/Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( HeaderParseError: Not a header, not a continuation: ``//-->'' After this last experiment, I left Kmail and SpamBayes running and all e-mails that arrived subsequent to the experiment were classified. While I might be willing to remove the mail from the server in order to derive the benefits of SpamBayes, I would really rather not. I think that SpamBayes ought to be capable of dealing with mail left on a pop server. For folks like me who retrieve their mail using more than one machine this is essential. When I am on the road I want to read my mail, but when I get back home I want to be able to download that same mail to my home machine. Lanny From tim.one at comcast.net Thu Sep 11 23:45:46 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Sep 11 22:45:57 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: <516DF10AD075D611853F00B0D03DF0E69C672D@CPSLA1> Message-ID: [Tom Boland] > I know you say it's not needed *but it is*!! > > This is the only thing stopping me from implementing this product (and > making donations) over anything else. I need to make sure that email > from certain people (my boss, his boss, all local domain mail ,etc.) > gets through 100% of the time. > > Even after several months of training, some mail from these people > gets dumped into Quarantine: Junk. I even saved 200 messages from one > person, trained it as good mail, and I still get mail from that > person (my CEO, no less) sent to the Quarantine. It sounds more like something else is going wrong. When there's a problem, I'd rather track down the source definitively than add new subsystems, possibly leaving the real problem unaddressed. Every new subsystem will introduce new problems too, of course. > I know, I know that this goes against your belief in the One True God > of Bayesian filtering, Heh -- despite the name, spambayes isn't a Bayesian filter in the sense that most people mean it. The only One True God we've had here is decreasing error rates, by hook or by crook. > but in the real world, The heavy sarcasm isn't helping -- but it may not be hurting either . All the developers here use spambayes too, in their own versions of the real world. I don't have any need for white- or black- lists in my world, so I've got no reason to pursue them. Free open source code developed by volunteers in their spare time is the result of people scratching their own itches -- if any developer here had an itch (problem) that could be relieved by adding black or white lists, they would do so. > we REALLY need at least a white list and preferably a blacklist, too. Has anyone directed you at InBoxer yet? http://www.inboxer.com/ That's a commercial product that happens to incorporate the spambayes code base, and adds white and black lists (among other things). It's not free of cost, presumably because somebody had to pay them to spend their time implementing things that shouldn't be needed . If you wanted to pay to add them to spambayes too, I bet someone would agree to do the work for hire. Else it's going to require someone working hard to prove that there's a real need for these gimmicks, and that the perceived need isn't actually due to a bug in the current code, or problem with the way the system has been trained. > Thanks for an otherwise fantastic program. Keep up the good work. Thank you! It's good to hear about problems, although it's best if they're accompanied by details and an offer to do whatever it takes to nail the source of the problems. From tim.one at comcast.net Fri Sep 12 00:08:18 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Sep 11 23:08:23 2003 Subject: [Spambayes] Spam that bypasses spambayes In-Reply-To: Message-ID: [Harri Pesonen] > I had an idea a couple of weeks ago, that all url tokens should have > more weight than other tokens. The spammer just wants you to click on > some url, so the other text is not so important. They could even put > random words there, and they have. Code it and try it. spambayes *used* to have fancier URL tokenization than it has now, and results got better by simplifying it -- there's no substitute for testing ideas in a statistical system, and *everything* you try will have both good effects and bad effects (there are no pure wins). The best you can hope for is that the good outweigh the bad across a large variety of test sets, and there's no way to determine that without testing on a large variety of test sets. > And maybe the url server address should be tokenized the same way as > the address is tokenized in Received header. So the address below > would yield > > url:biz > url:gadgitz.biz > url:www.gadgitz.biz > > Now it just does > > url:www > url:gadgitz > url:biz I believe you're talking about http://www.gadgitz.biz/promo.php?id=93778 That actually generates 7 url tokens today: url:93778 url:biz url:gadgitz url:id url:php url:promo url:www and a proto:http token. Of these, the url:biz token has the highest spamprob in my database today, and url:id isn't far behind it. Curiously, url:promo has an only slightly spammy spamprob for me. There are no tokens in my database containing the string gadgitz, so generating more tokens containing that string wouldn't have helped. Given that spammers lose their domains only slightly less frequently than they lose their email addresses, loading the database with more spammer domain names du jour doesn't sound like a good bet either. > Maybe it should do both and have more weight that way. Also decode % > encoding and find server names for ip addresses... :-) I hesitate to put in anything by default that requires going off the local machine (whether to suck down a web page or just to do a DNS lookup). That may be OK in an industrial-strength setting with industrial-strength connectivity, but lots of users are stuck on slow dialup lines to sluggish ISPs. Controlling such stuff by options, disabled by default, would be fine. From mhammond at skippinet.com.au Fri Sep 12 14:11:19 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu Sep 11 23:27:43 2003 Subject: [Spambayes] Trouble installing Spambayes In-Reply-To: <754A6AAD74D7C84680B5052996986E53295B1C@MCDONALDJACOB01.mjmjcpa.com> Message-ID: <03e101c378db$89b24b00$f502a8c0@eden> > The Spambayes Menu Bar installs but the initiation Wizard > will not run. Yes, this is a bug in the Wizard that you are the first to report! I'm afraid you will need to configure SpamBayes manually. This should simply be a matter of going to the "Filtering" tab and selecting the folders you want to use, then "Enable SpamBayes" from the "General" tab. If you have folders with pre-sorted ham and spam, then the "training" tab should be used. Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1732 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030912/56be9204/winmail.bin From chauhan1001 at netzero.com Fri Sep 12 06:11:00 2003 From: chauhan1001 at netzero.com (Chauhan) Date: Fri Sep 12 01:12:00 2003 Subject: [Spambayes] Help with pop3proxy.py Message-ID: <20030911.221152.4393.493115@webmail20.nyc.untd.com> Hi > You've already carried out the steps that you need to in order to get pop3proxy running, anyway. I am still having a problem to start pop3proxy as I don't know how to fetch email from my service provider. According to FAQ (4.10) I should press button "Get new email" but there is none on home or config page. I have set pop3 and smpt setting in http://localhost:8880/config as: Remote Servers: mail.icenter.net SpamBayes Ports: 1110 Allowed remote connections: localhost HTTP Authentication: Basic User name: MyName (same as i use to login to fetch my mail from service provider) Password: MyPassword (same as i use to login to fetch my mail from service provider) SMTP Proxy Options Remote Servers: mail.icenter.net SpamBayes Ports: 1024 I have some confusion regarding SpamBayes Ports settings I am assuming these are for communicating with Outlookexpress and not with my mail server (service provider). If these are for mail server then I have to use 110 and 25. >It should be in the windows directory of the 1.0a5 archive. pop3proxy_service.py is in windows directory. For the time being I will use pop3proxy.py as I don't have windows extentions installed on phython2.3 but I do have for 2.2. Many thanks for help, Chauhan ________________________________________________________________ The best thing to hit the internet in years - NetZero HiSpeed! Surf the web up to FIVE TIMES FASTER! Only $14.95/ month -visit www.netzero.com to sign up today! From david.matos at comcast.net Fri Sep 12 03:02:11 2003 From: david.matos at comcast.net (David Matos) Date: Fri Sep 12 02:01:38 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: Message-ID: <002001c378f3$6aafd480$8d80b042@dexter> Just my 2 cents: I second some of what Tom Boland said about mail from certain senders showing up repeatedly in the "Unsure" folder despite extensive training. Before I go any further, let me say that I've been using SpamAtBay for a while now, and I'm not sure how much overlap still exists between SAB and SpamBayes. That said, I loved SpamBayes (I still read all the postings to this list) and originally switched to SAB only for the whitelist option, which at the time sounded like a great feature. After reading the discussions here, however, I quit using the whitelist feature a while ago. The resident experts make a convincing argument for why the training/filtering should make the whitelist redundant. In practice, though, I find that mail from at least one of my acquaintances keeps getting filtered as "Unsure." This is a friend who sends me mail as often as a 20 times a day (in fact, the majority of my ham), always from the same address, but with wildly varying content, ranging from a couple words in plain text to a single picture without text to an entire page of HTML. I recall someone saying earlier that SB (and SAB) takes the message header into account when classifying the message--if this is the case, why isn't the "From" address a dead-giveaway? Before this filtering anomaly can be chalked up to "training error" let me say that I'm exceedingly fastidious when it comes to categorizing incoming mail, i.e. I (re)classify everything from this person as ham. What can I do to help figure out what's going on with this particular filtering phenomenon? From anders.eriksson at morateknikutveckling.se Fri Sep 12 10:17:08 2003 From: anders.eriksson at morateknikutveckling.se (Anders Eriksson) Date: Fri Sep 12 03:20:14 2003 Subject: [Spambayes] Re: Doesn't do filtering automatically References: Message-ID: <87ujnovo6vm6.dlg@morateknikutveckling.se> On Thu, 11 Sep 2003 13:27:34 -0700, Eric Bjornson wrote: > I'm a first-time user and I just installed SpamBayes as a plugin > for my Outlook 2000. I did the training, I assigned the folder > so the filter status says "Watching 'Inbox'...", and the checkbox > shows that it is enabled. I bet you didn't reboot the computer after installing ;-) I got the same strange behavior but after a reboot everything started to work! Hope it does for you also! // Anders From wsy at merl.com Fri Sep 12 09:04:59 2003 From: wsy at merl.com (Bill Yerazunis) Date: Fri Sep 12 08:05:18 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: References: Message-ID: <200309121204.h8CC4xv13783@localhost.localdomain> From: "Tim Peters" Heh -- despite the name, spambayes isn't a Bayesian filter in the sense that most people mean it. The only One True God we've had here is decreasing error rates, by hook or by crook. > but in the real world, The heavy sarcasm isn't helping -- but it may not be hurting either . All the developers here use spambayes too, in their own versions of the real world. I don't have any need for white- or black- lists in my world, so I've got no reason to pursue them. Free open source code developed by volunteers in their spare time is the result of people scratching their own itches -- if any developer here had an itch (problem) that could be relieved by adding black or white lists, they would do so. > we REALLY need at least a white list and preferably a blacklist, too. If I hadn't redirected the following to a different (non-test) email account, I would consider the following to be for my mandatory whitelist: 1) my lawyer 2) my (current) girlfriend My boss, he can take his chances with a partially-trained filter. :) -Bill Yerazunis From barry at python.org Fri Sep 12 13:07:16 2003 From: barry at python.org (Barry Warsaw) Date: Fri Sep 12 08:07:18 2003 Subject: [Spambayes] [Fwd: SpamBayes instead of Outlook mail rules] Message-ID: <1063368432.19907.31.camel@anthem> The first sentence of the second paragraph instructs me to make this suggestion, which I do by way of forward. :) -Barry -------------- next part -------------- An embedded message was scrubbed... From: "Haakon M. Eriksen" Subject: SpamBayes instead of Outlook mail rules Date: Fri, 12 Sep 2003 10:27:15 +0200 Size: 3665 Url: http://mail.python.org/pipermail/spambayes/attachments/20030912/31342f2a/attachment.eml From papaDoc at videotron.ca Fri Sep 12 09:46:08 2003 From: papaDoc at videotron.ca (papaDoc) Date: Fri Sep 12 08:45:22 2003 Subject: [Spambayes] Help with pop3proxy.py In-Reply-To: <20030911.221152.4393.493115@webmail20.nyc.untd.com> References: <20030911.221152.4393.493115@webmail20.nyc.untd.com> Message-ID: <3F61C010.9000309@videotron.ca> Hi, >I am still having a problem to start pop3proxy as I don't know how to >fetch email from my service provider. > When using pop3proxy (or its newer name). What your are doing is indirectly talking to your service provider. You talk to pop3proxy and it talks to your service provider. Pressing the "Get new email" is done in your mail reader (for me it is mozilla) The only difference is that mozilla should be set up to talk to localhost on port 1110 (see your setup below) not mail.icenter.net on port 110. >According to FAQ (4.10) I should >press button "Get new email" but there is none on home or config page. I >have set pop3 and smpt setting in http://localhost:8880/config as: > >Remote Servers: mail.icenter.net >SpamBayes Ports: 1110 > This is used to when you want to access pop3proxy from another machine where it is running. The way your have set it up will not work since "Allowed remote connections:" should be the name of the machine running pop3proxy. The user name and password can be/should be different then the one you use at your service provider. >Allowed remote connections: localhost >HTTP Authentication: Basic >User name: MyName (same as i use to login to fetch my mail from service provider) >Password: MyPassword (same as i use to login to fetch my mail from service provider) > > I never use that so I can't help ;-( >SMTP Proxy Options >Remote Servers: mail.icenter.net >SpamBayes Ports: 1024 > Remi From skip at pobox.com Fri Sep 12 08:57:53 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Sep 12 08:58:04 2003 Subject: [Spambayes] I think that I have found a bug in SpamBayes In-Reply-To: <200309111857.39713.ltrag@optonline.net> References: <200309111857.39713.ltrag@optonline.net> Message-ID: <16225.49873.563897.301932@montanaro.dyndns.org> Lanny> I think that SpamBayes ought to be capable of dealing with mail Lanny> left on a pop server. This is perhaps a dumb question, but if you want to leave mail on the server, why don't you use IMAP? Skip From skip at pobox.com Fri Sep 12 09:01:20 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Sep 12 09:05:12 2003 Subject: [Spambayes] Help with pop3proxy.py In-Reply-To: <20030911.163903.27313.425417@webmail06.nyc.untd.com> References: <20030911.163903.27313.425417@webmail06.nyc.untd.com> Message-ID: <16225.50080.987029.32507@montanaro.dyndns.org> chauhan> I just installed spambayes-1.0a5 on my Windows XP (home chauhan> edition) and got this error when ran pop3proxy.py ... chauhan> UnboundLocalError: local variable 'sys' referenced before assignment You're not doing anything wrong. It's a bug in the code. It's fixed in CVS. All the application files got renamed after 1.0a5 was released, so doing CVS checkout may or may not be convenient for you. If you decide not to checkout from CVS, right after the print statement at line 652 you should see import sys Delete that line and you should be okay. Skip From skip at pobox.com Fri Sep 12 10:02:25 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Sep 12 10:02:54 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: <002001c378f3$6aafd480$8d80b042@dexter> References: <002001c378f3$6aafd480$8d80b042@dexter> Message-ID: <16225.53745.869336.472050@montanaro.dyndns.org> David> In practice, though, I find that mail from at least one of my David> acquaintances keeps getting filtered as "Unsure." This is a David> friend who sends me mail as often as a 20 times a day (in fact, David> the majority of my ham), always from the same address, but with David> wildly varying content, ranging from a couple words in plain text David> to a single picture without text to an entire page of HTML. Any chance you can post the ham/spam clues for a couple of these messages? I David> recall someone saying earlier that SB (and SAB) takes the message David> header into account when classifying the message--if this is the David> case, why isn't the "From" address a dead-giveaway? It's probably a very strong ham clue, but it is still just one clue. To SpamBayes, a clue is a clue is a clue. David> What can I do to help figure out what's going on with this David> particular filtering phenomenon? Clues, we need clues. ;-) Skip From ltrag at optonline.net Fri Sep 12 10:54:04 2003 From: ltrag at optonline.net (Lanny Trager) Date: Fri Sep 12 10:54:24 2003 Subject: [Spambayes] I think that I have found a bug in SpamBayes In-Reply-To: <16225.49873.563897.301932@montanaro.dyndns.org> References: <200309111857.39713.ltrag@optonline.net> <16225.49873.563897.301932@montanaro.dyndns.org> Message-ID: <200309121054.04969.ltrag@optonline.net> On Friday 12 September 2003 08:57, Skip Montanaro wrote: > Lanny> I think that SpamBayes ought to be capable of dealing with mail > Lanny> left on a pop server. > > This is perhaps a dumb question, but if you want to leave mail on the > server, why don't you use IMAP? > > Skip Skip, It is not a dumb question. The answer might be dumb though. IMAP is not an option at this particular ISP. BTW, I removed all the mail on the server last night in order to get a more definitive picture. This morning 23 e-mails arrived and 4 were not classified. This is driving me crazy. Is there a way to turn on logging in SpamBayes so that I can get a better idea of whats going on, at least from the SpamBayes perspective. Lanny -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From pbharrison at compuserve.com Fri Sep 12 12:10:10 2003 From: pbharrison at compuserve.com (Peter B. Harrison, MD) Date: Fri Sep 12 12:09:59 2003 Subject: [Spambayes] I think that I have found a bug in SpamBayes In-Reply-To: <200309121054.04969.ltrag@optonline.net> Message-ID: I find routinely when I logon in the morning that 80% or so of my mail is not rated by SB: The spam score field is blank. I just close outlook, then re-open it, and everything gets processed fine. It seems that now that I have cable modem, either Outlook, SB, or both is too busy to do what I want initially, but once the unread mail is loaded, SB scores it easily. This is a minor irritant, and during the day, because there is usually not a big dump of new mail, it seems to not be a problem. I LOVE SpamBayes. Thanks to all those putting in the effort. Peter Harrison. -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Lanny Trager Sent: Friday, September 12, 2003 7:54 AM To: spambayes@python.org Subject: Re: [Spambayes] I think that I have found a bug in SpamBayes On Friday 12 September 2003 08:57, Skip Montanaro wrote: > Lanny> I think that SpamBayes ought to be capable of dealing with mail > Lanny> left on a pop server. > > This is perhaps a dumb question, but if you want to leave mail on the > server, why don't you use IMAP? > > Skip Skip, It is not a dumb question. The answer might be dumb though. IMAP is not an option at this particular ISP. BTW, I removed all the mail on the server last night in order to get a more definitive picture. This morning 23 e-mails arrived and 4 were not classified. This is driving me crazy. Is there a way to turn on logging in SpamBayes so that I can get a better idea of whats going on, at least from the SpamBayes perspective. Lanny -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From chauhan1001 at netzero.com Fri Sep 12 12:17:16 2003 From: chauhan1001 at netzero.com (Chauhan) Date: Fri Sep 12 12:19:15 2003 Subject: [Spambayes] Help with pop3proxy.py Message-ID: <20030912.091819.23538.121484@webmail02.nyc.untd.com> Hi, My pop3proxy is working fine as I can send and receive messages via pop3proxy using e-mail client. Many thanks for help. Chauhan ________________________________________________________________ The best thing to hit the internet in years - NetZero HiSpeed! Surf the web up to FIVE TIMES FASTER! Only $14.95/ month -visit www.netzero.com to sign up today! From mspencer at evidentdata.com Fri Sep 12 12:27:20 2003 From: mspencer at evidentdata.com (Mark G. Spencer) Date: Fri Sep 12 12:27:26 2003 Subject: [Spambayes] Problem with "Delete As Spam" in Outlook plugin .. Message-ID: <004601c3794a$bdbc7050$800101df@edi.evidentdata.com> I get a massive amount of spam as I have promiscuous receipt of email enabled on one of my domains. I've noticed that after a week of training, the SpamBayes Outlook plugin rarely filters out spam, and very infrequently filters email as unsure. I understand part of this may be due to having 120 emails trained as good, and 4,000 as bad? From some responses I've seen here it looks like I need to have a more even distribution of good and bad email trained? This won't (and can't ever) work for me as I get buried by mountains of spam, but I'm way off my topic now .. ;) Anyway .. When I "Delete As Spam" a small number of emails, say fifty of them, I get the hourglass for a few seconds, my disk spins, and the emails are properly transferred. When I "Delete as Spam" a large number of emails, say five hundred or even thousands, the "Delete As Spam" option stays grey for a few seconds, then goes back to its normal color like I hadn't pressed it! The disk doesn't spin, no hourglass, nothing. I've waited a couple minutes, and nothing happened. Is this a known issue when using "Delete As Spam" on large amounts of email? Each night I can have up to a thousand spams waiting for me .. Since SpamBayes currently is not filtering these (rarely it will grab a few) I have to manually "Delete As Spam" each morning. Thanks, Mark From tim.one at comcast.net Fri Sep 12 12:37:13 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Sep 12 12:37:21 2003 Subject: [Spambayes] I think that I have found a bug in SpamBayes In-Reply-To: Message-ID: [Peter Harrison] > I find routinely when I logon in the morning that 80% or so of my > mail is not rated by SB: The spam score field is blank. I just close > outlook, then re-open it, and everything gets processed fine. It > seems that now that I have cable modem, either Outlook, SB, or both > is too busy to do what I want initially, but once the unread mail is > loaded, SB scores it easily. The current CVS version of the Outlook addin (and sorry, I don't know whether that's been released yet) has an "enable background filtering" checkbox on the Advanced tab of the SpamBayes manager. I checked that, set "Processing start delay" to 2 seconds and "Delay between processing items" to 0.25 seconds, and now the addin doesn't appear to be missing anything when a large number of emails come zooming in (I often get hundreds at a time then). More importantly to me (I didn't really mind shutting down Outlook and reopening it before) is that I can again "do something" with Outlook while the email is coming in: before, some Outlook thread or other wasn't even getting enough cycles to update the display, and Outlook was totally unresponsive until all the new messages had arrived. So you should play with this when it's available (which it already may be). I think the *real* reason these options were added was to make it much more likely that Outlook runs its own rules on new messages before it tells SpamBayes about new messages (there was no rhyme or reason to it before, as we've got no influence over when Outlook decides to tell spambayes anything -- what we can do is control when we *do* something with them). From ltrag at optonline.net Fri Sep 12 13:00:28 2003 From: ltrag at optonline.net (Lanny Trager) Date: Fri Sep 12 13:00:33 2003 Subject: [Spambayes] I think that I have found a bug in SpamBayes In-Reply-To: References: Message-ID: <200309121300.28596.ltrag@optonline.net> On Friday 12 September 2003 12:10, Peter B. Harrison, MD wrote: > I find routinely when I logon in the morning that 80% or so of my mail is > not rated by SB: The spam score field is blank. I just close outlook, then > re-open it, and everything gets processed fine. It seems that now that I > have cable modem, either Outlook, SB, or both is too busy to do what I want > initially, but once the unread mail is loaded, SB scores it easily. Because I am using pop3proxy on a Linux machine to Kmail, I don't have the option that you mention. Additionally, when SB misses mail in my setup I have to search through headers and then cut and paste the e-mails into the SB web GUI. This is really a pain the neck. I think that Tim Peters response to this thread however give a hint at a possible solution. Unfortunately, there are no options for "Processing start delay" or "Delay between processing items" in the pop3proxy GUI. If one of the SB developers could give me a hint about where to invoke these options with my setup (pop3proxy - Kmail) I would be happy to alter the code and test it on my system. > > This is a minor irritant, and during the day, because there is usually not > a big dump of new mail, it seems to not be a problem. I LOVE SpamBayes. > Thanks to all those putting in the effort. I have high hopes for SpamBayes and also would like to thank all those putting in the effort. > > Peter Harrison. > > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Lanny Trager > Sent: Friday, September 12, 2003 7:54 AM > To: spambayes@python.org > Subject: Re: [Spambayes] I think that I have found a bug in SpamBayes > > On Friday 12 September 2003 08:57, Skip Montanaro wrote: > > Lanny> I think that SpamBayes ought to be capable of dealing with > > mail Lanny> left on a pop server. > > > > This is perhaps a dumb question, but if you want to leave mail on the > > server, why don't you use IMAP? > > > > Skip > > Skip, > > It is not a dumb question. The answer might be dumb though. IMAP is not > an option at this particular ISP. > > BTW, I removed all the mail on the server last night in order to get a more > definitive picture. This morning 23 e-mails arrived and 4 were not > classified. > > This is driving me crazy. Is there a way to turn on logging in SpamBayes > so that I can get a better idea of whats going on, at least from the > SpamBayes perspective. > > Lanny > > -- > /"\ > \ / > X ASCII Ribbon Campaign > / \ Against HTML Email > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From tim.one at comcast.net Fri Sep 12 13:19:21 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Sep 12 13:19:29 2003 Subject: [Spambayes] Problem with "Delete As Spam" in Outlook plugin .. In-Reply-To: <004601c3794a$bdbc7050$800101df@edi.evidentdata.com> Message-ID: [Mark G. Spencer] > I get a massive amount of spam as I have promiscuous receipt of email > enabled on one of my domains. I've noticed that after a week of > training, the SpamBayes Outlook plugin rarely filters out spam, and > very infrequently filters email as unsure. I understand part of this > may be due to having 120 emails trained as good, and 4,000 as bad? Yes, imbalance is bad, and a factor of more than 30 difference is extreme. The system wasn't designed, tested, or tuned with extremely unbalanced amounts of training data, and we still don't have a good scheme for dealing with it when it occurs. If you find your default_bayes_customize.ini and change the line experimental_ham_spam_imbalance_adjustment: True to experimental_ham_spam_imbalance_adjustment: False your specific complaint today should go away, but then it's also likely that your FP rate will increase (all messages will tend to get spammier scores then). > From some responses I've seen here it looks like I need to have a > more even distribution of good and bad email trained? This won't > (and can't ever) work for me as I get buried by mountains of spam, > but I'm way off my topic now .. ;) No, how you train is entirely up to you. If you get a great deal more spam than ham but want to keep your training data balanced, it's really not hard to do so: throw out most of your spam, keeping only a fraction of it to train on. Or, if you want to keep all the spam (beats me ... up to you), copy a small fraction of it to a new here's-the-spam-I-train-on folder. This is a statistical system, and doesn't need exhaustive training. Indeed, if there were a feasible way to do it, it would be best if the system were able to train on a relatively small *random* sample of all the email you get. > Anyway .. When I "Delete As Spam" a small number of emails, say fifty > of them, I get the hourglass for a few seconds, my disk spins, and > the emails are properly transferred. When I "Delete as Spam" a large > number of emails, say five hundred or even thousands, the "Delete As > Spam" option stays grey for a few seconds, then goes back to its > normal color like I hadn't pressed it! The disk doesn't spin, no > hourglass, nothing. I've waited a couple minutes, and nothing > happened. Heh -- I didn't even know you *could* select multiple messages before hitting "Delete as Spam". So, sorry, don't know anything about this. But change the option as described above, or balance your training data (ditto), and you probably won't care about this other issue anymore. From david.matos at comcast.net Fri Sep 12 13:53:39 2003 From: david.matos at comcast.net (David Matos) Date: Fri Sep 12 13:54:02 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook Message-ID: <000001c37956$cd023c50$8d80b042@dexter> Thanks for the reply, Skip! Here are the spam clues for an e-mail that is representative of the kind that I keep seeing filtered as "Unsure," despite training lots of messages from this sender: Spam score: 12.55% Word Probability # Good # Spam '*H*' 99.97% - - '*S*' 25.07% - - 'from:addr:adjoined.com' 0.42% 53 0 'from:addr:operez' 0.42% 53 0 'from:name:oscar perez' 0.42% 53 0 'message-id:@smmia001.adjoined.net' 0.43% 52 0 'url:story2' 5.06% 4 0 'subject:Goes' 6.52% 3 0 'subject:Yahoo' 6.79% 16 1 'url:story' 11.58% 38 5 'subject:News' 13.23% 133 21 'url:ncid' 15.52% 1 0 'url:tmpl' 18.96% 13 3 'url:news' 20.12% 183 48 'header:Received:1' 29.01% 560 239 'proto:http' 61.93% 9835 16725 'x-mailer:none' 62.77% 5776 10180 'subject:; ' 65.23% 31 61 'url:com' 65.87% 6350 12812 'url:u' 72.92% 164 462 'url:yahoo' 73.20% 382 1091 'header:Message-ID:1' 73.53% 98 285 'url:cid' 74.11% 55 165 'url:' 74.96% 1368 4282 'subject:! ' 89.32% 87 762 'url:eo' 99.78% 0 102 Message text Received: from mail.adjoined.com ([63.116.137.3]) by rwcrmxc11.comcast.net (rwcrmxc11) with ESMTP id <20030912123209r1100769pqe>; Fri, 12 Sep 2003 12:32:09 +0000 X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0 content-class: urn:content-classes:message MIME-Version: 1.0 Subject: Yahoo! News - Batman Goes Psycho; Bale Cast Date: Fri, 12 Sep 2003 08:32:07 -0400 Message-ID: <7F1B2E8605ED6B4EA45F13F991588A3D033B998F@smmia001.adjoined.net> X-MS-Has-Attach: yes X-MS-TNEF-Correlator: Thread-Topic: Yahoo! News - Batman Goes Psycho; Bale Cast Thread-Index: AcN5KeDy9XDeto2eSlaMzzKMVyFNPA== From: "Oscar Perez" To: <> http://story.news.yahoo.com/news?tmpl=story2&cid=794&u=/eo/12487&ncid= 39 unique tokens Unique words in message content-type:text/plain from:addr:adjoined.com from:addr:operez from:name:oscar perez header:Date:1 header:From:1 header:MIME-Version:1 header:Message-ID:1 header:Received:1 header:Subject:1 header:To:1 message-id:@smmia001.adjoined.net proto:http subject: subject: - subject:! subject:; subject:Bale subject:Batman subject:Cast subject:Goes subject:News subject:Psycho subject:Yahoo to:2**0 url: url:12487 url:794 url:cid url:com url:eo url:ncid url:news url:story url:story2 url:tmpl url:u url:yahoo x-mailer:none -----Original Message----- From: Skip Montanaro [mailto:skip@pobox.com] Sent: Friday, September 12, 2003 10:02 AM To: David Matos Cc: spambayes@python.org Subject: RE: [Spambayes] Whitelist for SpamBayes for Outlook David> In practice, though, I find that mail from at least one of my David> acquaintances keeps getting filtered as "Unsure." This is a David> friend who sends me mail as often as a 20 times a day (in fact, David> the majority of my ham), always from the same address, but with David> wildly varying content, ranging from a couple words in plain text David> to a single picture without text to an entire page of HTML. Any chance you can post the ham/spam clues for a couple of these messages? I David> recall someone saying earlier that SB (and SAB) takes the message David> header into account when classifying the message--if this is the David> case, why isn't the "From" address a dead-giveaway? It's probably a very strong ham clue, but it is still just one clue. To SpamBayes, a clue is a clue is a clue. David> What can I do to help figure out what's going on with this David> particular filtering phenomenon? Clues, we need clues. ;-) Skip From tim.one at comcast.net Fri Sep 12 14:40:57 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Sep 12 14:41:03 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: <000001c37956$cd023c50$8d80b042@dexter> Message-ID: [David Matos] > Thanks for the reply, Skip! Here are the spam clues for an e-mail How did you get these? The subject line here contains "Outlook", and this looks vaguely like an Outlook addin "Show spam clues for current message" report, but there are many differences from how that report looks in any reasonably recent version of the addin. So if you're using an old version of the addin, the first thing to do is to update to a current version. > that is representative of the kind that I keep seeing filtered as > "Unsure," despite training lots of messages from this sender: > > Spam score: 12.55% That would have scored as ham for me. What do you have your ham cutoff set to? My ham cutoff is set to 20, and my spam cutoff to 80. If most of the msgs from this dude score under 20, boosting your ham cutoff to 20 will solve most of your problem. > Word Probability # Good # Spam > '*H*' 99.97% - - > '*S*' 25.07% - - > 'from:addr:adjoined.com' 0.42% 53 0 > 'from:addr:operez' 0.42% 53 0 > 'from:name:oscar perez' 0.42% 53 0 > 'message-id:@smmia001.adjoined.net' 0.43% 52 0 There end all the clues that your friend sent the message. Enabling the mine-received-headers option would generate more. The rest of his message consists of a URL, so there's not much else to go on, and what there is looks more spammy than not (relative to everything else you've trained on). > 'url:story2' 5.06% 4 0 > 'subject:Goes' 6.52% 3 0 > 'subject:Yahoo' 6.79% 16 1 > 'url:story' 11.58% 38 5 > 'subject:News' 13.23% 133 21 > 'url:ncid' 15.52% 1 0 > 'url:tmpl' 18.96% 13 3 > 'url:news' 20.12% 183 48 > 'header:Received:1' 29.01% 560 239 > 'proto:http' 61.93% 9835 16725 > 'x-mailer:none' 62.77% 5776 10180 > 'subject:; ' 65.23% 31 61 > 'url:com' 65.87% 6350 12812 > 'url:u' 72.92% 164 462 > 'url:yahoo' 73.20% 382 1091 > 'header:Message-ID:1' 73.53% 98 285 > 'url:cid' 74.11% 55 165 > 'url:' 74.96% 1368 4282 > 'subject:! ' 89.32% 87 762 > 'url:eo' 99.78% 0 102 > > Message text > Received: from mail.adjoined.com ([63.116.137.3]) > by rwcrmxc11.comcast.net (rwcrmxc11) with ESMTP > id <20030912123209r1100769pqe>; Fri, 12 Sep 2003 12:32:09 > +0000 X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0 > content-class: urn:content-classes:message > MIME-Version: 1.0 > Subject: Yahoo! News - Batman Goes Psycho; Bale Cast > Date: Fri, 12 Sep 2003 08:32:07 -0400 > Message-ID: > <7F1B2E8605ED6B4EA45F13F991588A3D033B998F@smmia001.adjoined.net> > X-MS-Has-Attach: yes X-MS-TNEF-Correlator: > Thread-Topic: Yahoo! News - Batman Goes Psycho; Bale Cast > Thread-Index: AcN5KeDy9XDeto2eSlaMzzKMVyFNPA== > From: "Oscar Perez" > To: > > > <> > > http://story.news.yahoo.com/news?tmpl=story2&cid=794&u=/eo/12487&ncid= If this is ham to you, it may be the best argument for a whitelist I've seen . From tim.one at comcast.net Fri Sep 12 15:03:45 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Sep 12 15:03:48 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: <002001c378f3$6aafd480$8d80b042@dexter> Message-ID: [David Matos] > ... > In practice, though, I find that mail from at least one of my > acquaintances keeps getting filtered as "Unsure." This is a friend > who sends me mail as often as a 20 times a day (in fact, the majority > of my ham), always from the same address, ... Hmm. The stats you sent later showed that you had trained on his address 53 times as ham, so about 3 days' worth of his stuff. If he in fact accounts for the majority of your ham, though, the majority of all the ham you've trained on should have been messages from him. If the percentage of ham you've trained on from him is much less than the percentage of ham you actually get from him, then your training data misrepresents your true email mix, in the direction of under-valuing "message from him" as a ham clue. From richie at entrian.com Fri Sep 12 15:08:34 2003 From: richie at entrian.com (Richie Hindle) Date: Fri Sep 12 15:08:41 2003 Subject: [Spambayes] I think that I have found a bug in SpamBayes In-Reply-To: <200309121300.28596.ltrag@optonline.net> References: <200309121300.28596.ltrag@optonline.net> Message-ID: <4964mvkusalnsm8a33pkar8pgnkeenuadf@4ax.com> > I think that Tim Peters response to this thread however give a hint at a > possible solution. Unfortunately, there are no options for "Processing start > delay" or "Delay between processing items" in the pop3proxy GUI. > > If one of the SB developers could give me a hint about where to invoke these > options with my setup (pop3proxy - Kmail) I would be happy to alter the code > and test it on my system. Those options are meaningless for the pop3proxy. However, you can turn on verbose output, which will do two things: output debug messages to your console, and create a log file of the POP3 conversation. You can switch this on by adding: [globals] verbose=True to your options file (which will be either bayescustomize.ini in your working directory or .spambayesrc in your home directory) -- Richie Hindle richie@entrian.com From seant at iname.com Fri Sep 12 15:09:01 2003 From: seant at iname.com (Sean True) Date: Fri Sep 12 15:09:02 2003 Subject: [Spambayes] Whitelist for SpamBayes for Outlook In-Reply-To: Message-ID: <000701c37961$53ec9cb0$0201a8c0@swapwizard.com> Dave is using SpamAtBay, which accounts for the differences in display. The statistics are heavily weighted by the 30K starter set that is included in SAB for people who don't have much to train from and are in a hurry. That is, everyone. We preset thresholds to 10 and 90 by default. The next version of SAB (1.1) will have a much lighter set of statistics that are derived from more message -- these adapt significantly faster. I ran the SpamAssassin corpus on these new models with a "correct every mistake" training paradigm. It got _very_ good results. -- Sean > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org] On Behalf Of Tim Peters > Sent: Friday, September 12, 2003 2:41 PM > To: David Matos > Cc: spambayes@python.org > Subject: RE: [Spambayes] Whitelist for SpamBayes for Outlook > > > [David Matos] > > Thanks for the reply, Skip! Here are the spam clues for an e-mail > > How did you get these? The subject line here contains > "Outlook", and this > looks vaguely like an Outlook addin "Show spam clues for > current message" > report, but there are many differences from how that report > looks in any > reasonably recent version of the addin. So if you're using > an old version > of the addin, the first thing to do is to update to a current version. > > > that is representative of the kind that I keep seeing filtered as > > "Unsure," despite training lots of messages from this sender: > > > > Spam score: 12.55% > > That would have scored as ham for me. What do you have your > ham cutoff set > to? My ham cutoff is set to 20, and my spam cutoff to 80. > If most of the > msgs from this dude score under 20, boosting your ham cutoff > to 20 will > solve most of your problem. > > > Word Probability # Good # Spam > > '*H*' 99.97% - - > > '*S*' 25.07% - - > > 'from:addr:adjoined.com' 0.42% 53 0 > > 'from:addr:operez' 0.42% 53 0 > > 'from:name:oscar perez' 0.42% 53 0 > > 'message-id:@smmia001.adjoined.net' 0.43% 52 0 > > There end all the clues that your friend sent the message. > Enabling the > mine-received-headers option would generate more. The rest > of his message > consists of a URL, so there's not much else to go on, and > what there is > looks more spammy than not (relative to everything else > you've trained on). > > > 'url:story2' 5.06% 4 0 > > 'subject:Goes' 6.52% 3 0 > > 'subject:Yahoo' 6.79% 16 1 > > 'url:story' 11.58% 38 5 > > 'subject:News' 13.23% 133 21 > > 'url:ncid' 15.52% 1 0 > > 'url:tmpl' 18.96% 13 3 > > 'url:news' 20.12% 183 48 > > 'header:Received:1' 29.01% 560 239 > > 'proto:http' 61.93% 9835 16725 > > 'x-mailer:none' 62.77% 5776 10180 > > 'subject:; ' 65.23% 31 61 > > 'url:com' 65.87% 6350 12812 > > 'url:u' 72.92% 164 462 > > 'url:yahoo' 73.20% 382 1091 > > 'header:Message-ID:1' 73.53% 98 285 > > 'url:cid' 74.11% 55 165 > > 'url:' 74.96% 1368 4282 > > 'subject:! ' 89.32% 87 762 > > 'url:eo' 99.78% 0 102 > > > > Message text > > Received: from mail.adjoined.com ([63.116.137.3]) > > by rwcrmxc11.comcast.net (rwcrmxc11) with ESMTP > > id <20030912123209r1100769pqe>; Fri, 12 Sep 2003 12:32:09 > > +0000 X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0 > > content-class: urn:content-classes:message > > MIME-Version: 1.0 > > Subject: Yahoo! News - Batman Goes Psycho; Bale Cast > > Date: Fri, 12 Sep 2003 08:32:07 -0400 > > Message-ID: > > <7F1B2E8605ED6B4EA45F13F991588A3D033B998F@smmia001.adjoined.net> > > X-MS-Has-Attach: yes X-MS-TNEF-Correlator: > > Thread-Topic: Yahoo! News - Batman Goes Psycho; Bale Cast > > Thread-Index: AcN5KeDy9XDeto2eSlaMzzKMVyFNPA== > > From: "Oscar Perez" > > To: > > > > > > <> > > > > > http://story.news.yahoo.com/news?tmpl=story2> &cid=794&u=/eo/12487&ncid= > > If this is ham to you, it may be the best argument for a > whitelist I've seen > . > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From ltrag at optonline.net Fri Sep 12 18:12:41 2003 From: ltrag at optonline.net (Lanny Trager) Date: Fri Sep 12 18:12:46 2003 Subject: [Spambayes] I think that I have found a bug in SpamBayes In-Reply-To: <4964mvkusalnsm8a33pkar8pgnkeenuadf@4ax.com> References: <200309121300.28596.ltrag@optonline.net> <4964mvkusalnsm8a33pkar8pgnkeenuadf@4ax.com> Message-ID: <200309121812.41783.ltrag@optonline.net> Richie, Thanks for getting back to me. I had found that this afternoon and played with it a bit to familiarize myself with it. I will set it back to True before I shut down tonight. Hopefully, tomorrow will bring a clue as to what is happening. Thanks again, and especially thanks for SpamBayes. Lanny On Friday 12 September 2003 15:08, Richie Hindle wrote: > > I think that Tim Peters response to this thread however give a hint at a > > possible solution. Unfortunately, there are no options for "Processing > > start delay" or "Delay between processing items" in the pop3proxy GUI. > > > > If one of the SB developers could give me a hint about where to invoke > > these options with my setup (pop3proxy - Kmail) I would be happy to alter > > the code and test it on my system. > > Those options are meaningless for the pop3proxy. However, you can turn on > verbose output, which will do two things: output debug messages to your > console, and create a log file of the POP3 conversation. You can switch > this on by adding: > > [globals] > verbose=True > > to your options file (which will be either bayescustomize.ini in your > working directory or .spambayesrc in your home directory) -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From T.A.Meyer at massey.ac.nz Fri Sep 12 23:52:21 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 12 23:52:31 2003 Subject: [Spambayes] [Fwd: SpamBayes instead of Outlook mail rules] Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130332B1DE@its-xchg4.massey.ac.nz> > The first sentence of the second paragraph instructs me to > make this suggestion, which I do by way of forward. :) Did you suggest to them that POPFile (or rather the Outlook plug-in for POPFile - ?Outclass?) would be a better way to go? =Tony Meyer From rcoe at CambridgeMA.GOV Sat Sep 13 14:04:04 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Sat Sep 13 14:06:54 2003 Subject: [Spambayes] RE: I think that I have found a bug in SpamBayes Message-ID: I proposed this before, but I don't think anyone picked up on it..... I'll bet that a large percentage of the Spambayes-on-Outlook user community picks up its mail via Exchange Server. If your inbox is in your Exchange mailbox, then any Outlook rule whose destination is another inbox folder will be executed on the server, before the client ever sees the message. I think this means that such rules will never conflict with Spambayes, which works only on the client. If anybody thinks this isn't how things work, please say so. Otherwise, I think Exchange users can quit worrying about interference between the two rulesets. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: Tim Peters [mailto:tim.one@comcast.net] > Sent: Friday, September 12, 2003 12:37 PM > To: pbharrison@compuserve.com > Cc: Spambayes > Subject: RE: [Spambayes] I think that I have found a bug in SpamBayes > > > [Peter Harrison] > > I find routinely when I logon in the morning that 80% or so of my > > mail is not rated by SB: The spam score field is blank. I > just close > > outlook, then re-open it, and everything gets processed fine. It > > seems that now that I have cable modem, either Outlook, SB, or both > > is too busy to do what I want initially, but once the unread mail is > > loaded, SB scores it easily. > > The current CVS version of the Outlook addin (and sorry, I don't know > whether that's been released yet) has an "enable background filtering" > checkbox on the Advanced tab of the SpamBayes manager. I > checked that, set > "Processing start delay" to 2 seconds and "Delay between > processing items" > to 0.25 seconds, and now the addin doesn't appear to be > missing anything > when a large number of emails come zooming in (I often get > hundreds at a > time then). More importantly to me (I didn't really mind > shutting down > Outlook and reopening it before) is that I can again "do > something" with > Outlook while the email is coming in: before, some Outlook > thread or other > wasn't even getting enough cycles to update the display, and > Outlook was > totally unresponsive until all the new messages had arrived. > > So you should play with this when it's available (which it > already may be). > I think the *real* reason these options were added was to > make it much more > likely that Outlook runs its own rules on new messages before it tells > SpamBayes about new messages (there was no rhyme or reason to > it before, as > we've got no influence over when Outlook decides to tell spambayes > anything -- what we can do is control when we *do* something > with them). > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From Dougie-Fresh at Comcast.net Sat Sep 13 15:04:49 2003 From: Dougie-Fresh at Comcast.net (Andrew Nonnenmacher) Date: Sat Sep 13 15:04:51 2003 Subject: [Spambayes] Outlook Safemode Message-ID: <000001c37a29$e9f68180$52003644@anonn> Okay, i've looked over the Trouble Shooting, and i couldn't see anything that mentioned this. After installing version 008.1 with the self installer, to use with my Outlook XP on Windows XP. Ever since installing Spambayes, I get an error that wants to send itself to Microsoft, and have to restart Outlook in safe mode. When in safe mode, the filtering works fine as far as i can tell, and haven't really had any problems with this. I was wondering if this is the first time you've heard about this, and if there is a way to fix it. Thank you. Andy From japeters at pacbell.net Sat Sep 13 21:46:20 2003 From: japeters at pacbell.net (John A. Peters) Date: Sat Sep 13 21:48:40 2003 Subject: [Spambayes] Fw: Settings for Spambayes and then OutlookExpress Message-ID: <000d01c37a61$ff9b73a0$6400a8c0@eTower> Is there some place this documentation can be published, after a proof reading? John Peters Start --------------------------------------------------- Spambayes & Outlook Express will work together. When you are setup and you tell OE to Send/Recv it will tell Spambayes to get your mail for OE and SB will add some tags to the title line of each message. You will use OE's Tools\Message Rules to send the Spam messages to a Spam folder and the Unsure messages to an unsure folder, but leave the good ham messages in the OE in box. Leave the good messages in the in box because OE can only work on messages in the in box. It can not apply rules to messages in other folders. Settings for SpamBayes to work with OE Down load Spambayes and tell it to open. (Due to a bug or feature in the installer program you have to:) Change the default path in the installer dialog to - C:\ Now go to a DOS box and - The DOS box is in programs\accessories\Command Prompt Right click, drag and drop it on to the desk top or the tray. Tell it to copy the icon, not move it. Change directories cd \ cd s Result is C:\spambayes-1.0a5> Build it in DOS window: >cd \spambayes >\python23\python setup.py build Install it >\python23\python setup.py install Configure it >\python23\python pop3proxy.py -b Note the \ at the start of this string. browser will come up at http://localhost:8880 press 'Configure' button Find the blue Configuration page and click on it enter for Remote Server postoffice.pacbell.net Spambayes Ports: 110 Display To: in message review No Allowed remote connections: local host Skip to next section named Header Options Notate to: Leave these three unchecked for OE Classify in subject header: [ ] ham [ x ] spam [ x ] unsure Note: I leave ham un checked so the good messages will not have that extra baggage in the subject line. SMTP Proxy Options Remote Servers: mail.pacbell.net SpamBayes Ports: 25 Skip past the Storage Options Skip past the Statistics Options press 'Save' at bottom of web page. Settings for Outlook Express to work with SpamBayes Tools Accounts (Internet accounts) Properties General tab Fill in as appropriate (Name and email address) Servers tab Server information Incoming server is POP3 Incoming mail POP3 is local host Outgoing mail SMTP is mail.pacbell.net Incoming mail server Account name - (No changes - first part of your email address usually) Password ******* (The one you use to get your mail from downtown) Connection tab - Usually no changes - (Choose LAN or dialup) Security tab - No changes Advanced tab Outgoing mail SMTP 25 [ ] This server requires a secure connection (SSL) Not checked Incoming mail (POP3) 110 [ ] This server requires a secure connection (SSL ) Not checked Server Timeouts ------------------------ Short 1 minute Not Break apart Delivery [ x ] Check to leave a copy of message on server [ x ] Remove from server after 5 days (My recommendation) [ ] Remove from server when deleted from deleted items Apply OK Message Rules do the work. Outlook Express Tools Message Rules Mail New 1. Select the Conditions for your rule; [ ] [ x ] Where the Subject line contains specific words [ ] [ ] 2. Select the Actions for your rule: [ x ] Move it to the specified folder [ ] [ ] [ ] Now click on each of the blue underlined words and fill them in. 4. Name of the rule #1 Change this to a good name since you will have more than one. Ok You can now apply your new rules Touch the Apply Now... button You might want to [ x ] the Include subfolders button Touch the next Apply Now... button This document by John A. Peters Corrections to Japeters@pacbell.net From ted_rice at cserv.net Sat Sep 13 22:46:15 2003 From: ted_rice at cserv.net (Ted Rice) Date: Sat Sep 13 22:46:38 2003 Subject: [Spambayes] start up Message-ID: <01e801c37a6a$601496f0$f09d61cc@RiceFamily> I just downloaded Spambayes for the first time and after a lot of work I managed to get it running - sort of. It seems to work well when it works, but I have to run the command-line C:\python23\python C:\sambayes\pop3proxy.py -b every time I start up the computer before I can collect email - or EVERY time if I close out the command line box. Is there any way to have it load automatically on start-up and stay in memory rather than have to do this every time? I'm new at this so I may have missed something. Ted Rice . From alasnik at smilezone.com Sun Sep 14 16:40:15 2003 From: alasnik at smilezone.com (Adam Lasnik) Date: Sun Sep 14 16:40:34 2003 Subject: [Spambayes] Still getting horrible results from SpamBayes - Any advice? Message-ID: <021d01c37b00$6781eb90$5500fea9@shortygeorge> A few weeks ago, I posted a note expressing concerns that SpamBayes was doing an inadequate job filtering my spam (using the Outlook plugin). I've now filtered more mail (approximately 1000 ham and spam), and SpamBayes' performance has remained abysmal (only 1 false negative but more than 30% false negatives) :( I know that SpamBayes is fine software (judging from the many positive comments I've read), so I'm assuming there's just some glitch with my installation or operating environment. What's especially frustrating is that I've classified many of the nearly-exact-same spams -- such as the Nigerian spams, also the "I just saw your profile and I'm new to the area..." crap and so on -- as spam, but this doesn't help. Subsequent copies of this stuff gets scored at 0% or a mere 4%. I'm using the latest Outlook filter (.81). Any ideas about what I can do? Thanks! :) Regards, Adam http://smilezone.com/ -- Everything to make you smile :) From tim.one at comcast.net Sun Sep 14 16:45:31 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 14 16:45:31 2003 Subject: [Spambayes] Still getting horrible results from SpamBayes - Anyadvice? In-Reply-To: <021d01c37b00$6781eb90$5500fea9@shortygeorge> Message-ID: [Adam Lasnik] > A few weeks ago, I posted a note expressing concerns that SpamBayes > was doing an inadequate job filtering my spam (using the Outlook > plugin). > > I've now filtered more mail (approximately 1000 ham and spam), and > SpamBayes' performance has remained abysmal (only 1 false negative but > more than 30% false negatives) :( > > I know that SpamBayes is fine software (judging from the many positive > comments I've read), so I'm assuming there's just some glitch with my > installation or operating environment. > > What's especially frustrating is that I've classified many of the > nearly-exact-same spams -- such as the Nigerian spams, also the "I > just saw your profile and I'm new to the area..." crap and so on -- > as spam, but this doesn't help. Subsequent copies of this stuff gets > scored at 0% or a mere 4%. > > I'm using the latest Outlook filter (.81). Any ideas about what I can > do? Thanks! :) We need clues. How may ham have you trained on? How many spam have you trained on? Use the "Show spam clues for current message ..." menu item to generate the internal evidence spambayes is going on for one of your problem messages, and send it along. From beckman at purplecow.com Sun Sep 14 16:51:51 2003 From: beckman at purplecow.com (Peter Beckman) Date: Sun Sep 14 16:51:55 2003 Subject: [Spambayes] Still getting horrible results from SpamBayes - Any advice? In-Reply-To: <021d01c37b00$6781eb90$5500fea9@shortygeorge> References: <021d01c37b00$6781eb90$5500fea9@shortygeorge> Message-ID: <20030914164732.U36425@thermonuclear.org> On Sun, 14 Sep 2003, Adam Lasnik wrote: > I've now filtered more mail (approximately 1000 ham and spam), and > SpamBayes' performance has remained abysmal (only 1 false negative but > more than 30% false negatives) :( You mean 30% of your mail goes to the unsure folder, or 30% of your mail is spam, or 30% of the spam goes into your inbox? > What's especially frustrating is that I've classified many of the > nearly-exact-same spams -- such as the Nigerian spams, also the "I just > saw your profile and I'm new to the area..." crap and so on -- as spam, > but this doesn't help. Subsequent copies of this stuff gets scored at > 0% or a mere 4%. Nigerian Scam spams are difficult -- while they have the same concept and idea, they don't all use the exact same form letter. Most spam uses URLs, HTML, images, works like "porn" "viagra" etc, stuff that's easy to filter. Nigerian scam spam is usually warm and fuzzy, translated by someone differently each time. They are sent by individuals who use their own software, rather than hire a professional spammer. There will always be the spam you can't filter. It's impossible to filter everything 100% of the time. Now granted, you should be seeing more like a 98% spam removal... but I dunno. I moved the spam-cutoff from .90 to .80, and that helped to remove some from the unsure box. Beckman --------------------------------------------------------------------------- Peter Beckman Internet Guy beckman@purplecow.com http://www.purplecow.com/ --------------------------------------------------------------------------- From beckman at purplecow.com Sun Sep 14 16:53:17 2003 From: beckman at purplecow.com (Peter Beckman) Date: Sun Sep 14 16:53:22 2003 Subject: [Spambayes] Still getting horrible results from SpamBayes - Anyadvice? In-Reply-To: References: Message-ID: <20030914165236.P36425@thermonuclear.org> On Sun, 14 Sep 2003, Tim Peters wrote: > [Adam Lasnik] > > I've now filtered more mail (approximately 1000 ham and spam), and > > SpamBayes' performance has remained abysmal (only 1 false negative but > > more than 30% false negatives) :( Filtered? Or trained Spambayes on? There is a difference. Filtering mail does not train SpamBayes. Training mail does not filter it. --------------------------------------------------------------------------- Peter Beckman Internet Guy beckman@purplecow.com http://www.purplecow.com/ --------------------------------------------------------------------------- From tfbiv at comcast.net Sun Sep 14 17:48:39 2003 From: tfbiv at comcast.net (Tom Bates) Date: Sun Sep 14 17:45:45 2003 Subject: [Spambayes] Suggestion for HTML analysis Message-ID: I'm new to the list. I hope this topic hasn't already been beat to death, but recently I've gotten HTML-formatted spam that attempts to circumvent recognition by inserting copious amounts of HTML garbage tags between letters, like so (an actual sample): Consolidation All this just to spell the word "Consolidation" without detection. I think Spambayes is fooled by this technique, because I don't see any of the operative words in the analysis. Could Spambayes look for an opening tag and go into HTML detection mode? Spambayes is working very well for me. About 40% of spam goes into the "maybe" bucket at this point, and that percentage seems to be slowl improving. Thanks Tom From marc at strobealific.com Sun Sep 14 17:53:46 2003 From: marc at strobealific.com (Marc Bowyer) Date: Sun Sep 14 17:53:59 2003 Subject: [Spambayes] possible bug or glitch Message-ID: I did a defrag and the next time I opened Outlook, my "Junk Email" folder was gone. I created a new folder and tried to configure SpamBayes and it froze. I uninstalled it, rebooted, and reinstalled it. I can't get the configuration wizard to open. It doesn't give me any errors...it just stopped working. I am using Windows XP and Outlook 2000 with all the latestupdates and patches. Thanks, Marc Bowyer From alasnik at smilezone.com Sun Sep 14 17:59:19 2003 From: alasnik at smilezone.com (Adam Lasnik) Date: Sun Sep 14 17:59:21 2003 Subject: [Spambayes] Still getting horrible results from SpamBayes - Anyadvice? In-Reply-To: <20030914165236.P36425@thermonuclear.org> Message-ID: <022501c37b0b$7313ccc0$5500fea9@shortygeorge> Thanks for all your prompt help! Here are some clarifications. - There are about 1,000 each of spam and ham in my SpamBayes database - By false positives, I mean that SpamBayes treats the mails as ham, not unsure. - I realize the Nigerian scams use different language, but with these and many of the other spams SpamBayes is missing, there is still quite a bit of consistent language. And, for instance, the fake "I saw your profile!" spams use almost the exact same language each time, and 99% of the time, the listed e-mail address is from Hotmail (when barely any of my friends use Hotmail). These should register as something other than 0% or 4%. - Changing my cutoffs wouldn't help. This isn't a matter of SpamBayes ALMOST getting it right (e.g., 78% 'spam'), but rather, not even coming close. - I've included a typical "I saw your profile" spam below. Note that it now says spam score of 85%, but this is AFTER I reclassified it. Would it be more helpful for me to submit Clues to this list from e-mail before I reclassify it? Thanks again for your help! Regards, Adam http://smilezone.com/ -- Everything to make you smile :) ----- Spam Score: 85% (0.8516) word spamprob #ham #spam '*H*' 0.134717 - - '*S*' 0.837917 - - "i'm" 0.0796865 330 33 'old' 0.197553 81 23 'found' 0.204999 164 49 'years' 0.228916 154 53 'saw' 0.230668 55 19 'pictures' 0.232825 60 21 'back' 0.264985 215 90 'name' 0.299411 123 61 'email' 0.304498 446 227 'from:addr:yahoo.com' 0.313371 134 71 'skip:i 10' 0.314242 216 115 'want' 0.321543 256 141 'and' 0.366785 947 638 'female' 0.382766 14 10 'your' 0.399498 809 626 'header:Received:2' 0.694116 198 523 'skip:[ 40' 0.707537 2 6 'byee...' 0.828196 0 1 'email name:willie_342_morning' 0.828196 0 1 'from:addr:girlacm' 0.828196 0 1 'subject:...#!!' 0.828196 0 1 'to:addr:imo' 0.828196 0 1 'anna...^' 0.896278 0 2 'subject:hello...' 0.896278 0 2 'message-id:@rogue.bloghosts.com' 0.922656 7 100 '^^^^' 0.942138 0 4 'whatever..' 0.969292 0 8 'x-mailer:microsoft outlook express 6.00.2462.0000' 0.972516 0 9 Message Stream: Status: U Return-Path: Received: from rogue.bloghosts.com ([64.246.52.17]) by albert.mail.atl.earthlink.net (Earthlink Mail Service) with ESMTP id 19YAl84jH3Nl3qU0 for ; Sun, 14 Sep 2003 13:07:10 -0400 (EDT) Received: from [217.84.167.152] (helo=sender1003) by rogue.bloghosts.com with esmtp (Exim 4.20) id 19yaL5-0004VC-IS for thatadamguy@smilezone.com; Sun, 14 Sep 2003 13:07:08 -0400 From: girlaCm@yahoo.com Subject: hello...#!! Content-Type: text/plain Content-Transfer-Encoding: text/plain Date: Sun, 15 Sep 2002 01:03:48 -0700 X-Priority: 3 X-Library: Indy 10.00.14-B To: imO X-Mailer: Microsoft Outlook Express 6.00.2462.0000 Message-Id: X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - rogue.bloghosts.com X-AntiAbuse: Original Domain - smilezone.com X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - yahoo.com Hello, I'm 22 years old female and my name is Anna...^ I saw your profile on the net and found to be interesting.. email me back at Willie_342_morning@hotmail.com if you want to exchange pictures or whatever.. ^^^^ Hugs, byee... [U/.QWL,b61NpB)t.2qH=?Buj\T?n.R;gBb:ce6\s0] Message Tokens: 49 unique tokens '^^^^' 'and' 'anna...^' 'back' 'byee...' 'cc:none' 'content-type:text/plain' 'email' 'email addr:hotmail.com' 'email name:willie_342_morning' 'exchange' 'female' 'found' 'from:addr:girlacm' 'from:addr:yahoo.com' 'from:no real name:2**0' 'header:Date:1' 'header:From:1' 'header:Message-Id:1' 'header:Received:2' 'header:Return-Path:1' 'header:Subject:1' 'header:To:1' 'hello,' 'hugs,' "i'm" 'message-id:@rogue.bloghosts.com' 'name' 'net' 'old' 'pictures' 'profile' 'reply-to:none' 'saw' 'sender:none' 'skip:[ 40' 'skip:i 10' 'subject:...#!!' 'subject:hello...' 'the' 'to:2**0' 'to:addr:imo' 'to:no real name:2**0' 'want' 'whatever..' 'x-mailer:microsoft outlook express 6.00.2462.0000' 'years' 'you' 'your' From matt at mondoinfo.com Sun Sep 14 18:02:27 2003 From: matt at mondoinfo.com (Matthew Dixon Cowles) Date: Sun Sep 14 18:03:11 2003 Subject: [Spambayes] Suggestion for HTML analysis In-Reply-To: References: Message-ID: <1063576243.12.1678@sake.mondoinfo.com> Dear Tom, > I'm new to the list. Hello and welcome. > recently I've gotten HTML-formatted spam that attempts > to circumvent recognition by inserting copious amounts of HTML > garbage tags between letters > I think Spambayes is fooled by this technique, because I don't see > any of the operative words in the analysis Tim Peters added that in May. From the CVS checkin comment: I dug into a small collection of Unsures that looked like blatant spam, and discovered they were all using this kind of trick: Wrinkle Reduction That is, disguising words by inserting HTML nonsense tags. We replaced each tag with a blank, yielding the pretty useless tokens "Wr", "inkle", "Reduc" and "tion". We previously fixed a similar problem using embedded HTML comments. I should have fixed this other one then. Regards, Matt From beckman at purplecow.com Sun Sep 14 18:10:07 2003 From: beckman at purplecow.com (Peter Beckman) Date: Sun Sep 14 18:10:11 2003 Subject: [Spambayes] Still getting horrible results from SpamBayes - Anyadvice? In-Reply-To: <022501c37b0b$7313ccc0$5500fea9@shortygeorge> References: <022501c37b0b$7313ccc0$5500fea9@shortygeorge> Message-ID: <20030914180912.K36425@thermonuclear.org> On Sun, 14 Sep 2003, Adam Lasnik wrote: > - I've included a typical "I saw your profile" spam below. Note that it > now says spam score of 85%, but this is AFTER I reclassified it. Would > it be more helpful for me to submit Clues to this list from e-mail > before I reclassify it? I noticed that the email didn't even have a X-SpamBayes-Probability: header (or whatever it is). Maybe that's the problem; some of your mail isn't getting tagged. Check it in the headers. Beckman --------------------------------------------------------------------------- Peter Beckman Internet Guy beckman@purplecow.com http://www.purplecow.com/ --------------------------------------------------------------------------- From tim.one at comcast.net Sun Sep 14 18:23:23 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 14 18:25:14 2003 Subject: [Spambayes] Suggestion for HTML analysis In-Reply-To: Message-ID: [Tom Bates] > I'm new to the list. I hope this topic hasn't already been beat to > death, but recently I've gotten HTML-formatted spam that attempts to > circumvent recognition by inserting copious amounts of HTML garbage > tags between letters, like so (an actual sample): > > Co hgmysios i gdkqfwvin da byn > wkt pt g py wd > k!>nsoli ebsi vhviyrz > auu xxq tp > ffpmsck wklzmuyvtb tg u lhk cqny rm > r > yb!>dat b qdsg > bm > > jhj > qyjq gbbbej eu > pf > chlhqj sedz g stb p mbjo ned ybssswbv yg!>ion > > All this just to spell the word "Consolidation" without detection. I > think Spambayes is fooled by this technique, because I don't see any > of the operative words in the analysis. It's not fooled, but it's doing something you're not expecting: testing showed that tokens longer than 12 characters ("Consolidation" has 13) boost the database size more than they help classification. Instead a "summary token" is generated for each such thing, of the form skip: Specifically, the token Consolidation generates a skip:c 10 token. I mailed this portion of your email to myself and verified that this is indeed what it does with it (throws out the junk tags, finds the token Consolidation, and replaces it with a synthesized 'skip:c 10' token). > Could Spambayes look for an opening tag and go into HTML > detection mode? Spambayes is *always* in "HTML detection mode". Too much spam relies on extremely forgiving HTML parsers for spambayes to trigger on niceties like the existence of a tag -- lots and lots and lots of HTML spam isn't legal HTML. So spambayes assumes that everything it gets is HTML, and applies its own extremely forgiving parsing to it. > Spambayes is working very well for me. About 40% of spam goes into the > "maybe" bucket at this point, and that percentage seems to be slowl > improving. That's actually pretty bad, but we need more details to suggest why it might be so bad. Like which version of the code you're using, which client (e.g., Outlook addin, pop3proxy, ...), what you set your ham and spam cutoffs to, how many total ham you've trained on, ditto spam, ..., and a thousand other things we'll torture you with one at a time . From tfbiv at comcast.net Sun Sep 14 19:01:03 2003 From: tfbiv at comcast.net (Tom Bates) Date: Sun Sep 14 18:58:06 2003 Subject: [Spambayes] Checking for update not working Message-ID: Tim Peters suggested that Spambayes should do better than 40% maybe's, so I started on a mission to improve that. As a first step, I decided to see if I had the latest version. Under the SpamBayes button, there is a "Check for new version" button. When I try that, I get this dialog box: "There was an error checking for the latest version For specific details on the error, please see the Spambayes log Please check your internet connection, or try again later" with just an OK button. I didn't find a log file in any obvious place. I have a cable modem, so I'm always online. Anyway, I called up the Spambayes Manager, and it turns out, I was using 0.7, so I just updated to 0.81. Install went flawlessly. Thanks very much for making that part painless. But the "Check for new version" problem is still there. And still no log file, at least not something that ends in .log. So I'll use 0.81 for a while and see how that works. Many thanks to the developers. Even at 40% maybes, it still makes the chore of expunging spam a lot easier. I don't think I've gotten any false positives, either. Tom From alasnik at smilezone.com Sun Sep 14 19:02:53 2003 From: alasnik at smilezone.com (Adam Lasnik) Date: Sun Sep 14 19:03:00 2003 Subject: [Spambayes] Still getting horrible results from SpamBayes - Anyadvice? In-Reply-To: <20030914180912.K36425@thermonuclear.org> Message-ID: <023401c37b14$546148d0$5500fea9@shortygeorge> > From: Peter Beckman [mailto:beckman@purplecow.com] > Sent: Sunday, September 14, 2003 3:10 PM > I noticed that the email didn't even have a X-SpamBayes-Probability: > header (or whatever it is). Maybe that's the problem; some of your mail > isn't getting tagged. > > Check it in the headers. I just sent myself a test message. It showed up as a 4% spam probability in the SPAM Outlook field, but when looking at headers (reproduced below), there is no mention of Spam or SpamBayes. Is that a problem? - Adam Status: U Return-Path: Received: from rogue.bloghosts.com ([64.246.52.17]) by samuel.mail.atl.earthlink.net (Earthlink Mail Service) with ESMTP id 19YFPz4sK3Nl3pv0 for ; Sun, 14 Sep 2003 18:58:56 -0400 (EDT) Received: from [64.4.14.80] (helo=hotmail.com) by rogue.bloghosts.com with esmtp (Exim 4.20) id 19yfpW-0002Um-CX for adam@smilezone.com; Sun, 14 Sep 2003 18:58:54 -0400 Received: from mail pickup service by hotmail.com with Microsoft SMTPSVC; Sun, 14 Sep 2003 15:58:54 -0700 Received: from 66.167.204.93 by law10-oe23.law10.hotmail.com with DAV; Sun, 14 Sep 2003 22:58:53 +0000 X-Originating-IP: [66.167.204.93] X-Originating-Email: [smileguyadam@hotmail.com] Reply-To: From: "Adam the SmileGuy" To: "'Adam Lasnik'" Subject: testing, testing, 123 Date: Sun, 14 Sep 2003 15:58:56 -0700 Organization: Adam's SmileZone Message-ID: <023301c37b13$c67bc0e0$5500fea9@shortygeorge> MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit X-Priority: 3 (Normal) X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook, Build 10.0.2616 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165 Importance: Normal X-OriginalArrivalTime: 14 Sep 2003 22:58:54.0027 (UTC) FILETIME=[C53DE9B0:01C37B13] X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - rogue.bloghosts.com X-AntiAbuse: Original Domain - smilezone.com X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - hotmail.com From alasnik at smilezone.com Sun Sep 14 19:03:55 2003 From: alasnik at smilezone.com (Adam Lasnik) Date: Sun Sep 14 19:04:01 2003 Subject: [Spambayes] Checking for update not working In-Reply-To: Message-ID: <023501c37b14$7976ff70$5500fea9@shortygeorge> > "There was an error checking for the latest version > For specific details on the error, please see the Spambayes log I'm having the same problem on .81. - Adam From adam.walker at rbwconsulting.com Sun Sep 14 19:09:31 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Sun Sep 14 19:09:39 2003 Subject: [Spambayes] Still getting horrible results from SpamBayes -Anyadvice? In-Reply-To: <023401c37b14$546148d0$5500fea9@shortygeorge> Message-ID: <20030914230934.9AC9813E2B8@sack.dreamhost.com> No, no problem. The Outlook plugin doesn't add headers to the message like the proxies do. > -----Original Message----- > I just sent myself a test message. It showed up as a 4% spam > probability in the SPAM Outlook field, but when looking at headers > (reproduced below), there is no mention of Spam or SpamBayes. Is that a > problem? > From tim.one at comcast.net Sun Sep 14 19:16:30 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 14 19:18:18 2003 Subject: [Spambayes] Checking for update not working In-Reply-To: Message-ID: [Tom Bates and Adam Lasnik both report, wrt the Outlook addins "check for new version" menu item] > "There was an error checking for the latest version > For specific details on the error, please see the Spambayes log > > Please check your internet connection, or try again later" I don't know what's up with that code, but see the same thing here. Please see the bug report I just filed, and if your symptoms are the same add a comment to the bug report saying so (sorry, you'll probably have to paste this URL back together again): http://sf.net/tracker/index.php?func=detail&aid=806238&group_id=61702&atid=4 98103 From adam.walker at rbwconsulting.com Sun Sep 14 19:22:00 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Sun Sep 14 19:22:06 2003 Subject: [Spambayes] Checking for update not working In-Reply-To: Message-ID: <20030914232202.BE25613E222@sack.dreamhost.com> It's a 404 error. Whatever file (Version.py, I think) it's looking for on the server is not there. I think someone needs to run "make version" and push the file out the website. > -----Original Message----- > From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] > On Behalf Of Tim Peters > Sent: Sunday, September 14, 2003 7:16 PM > To: Tom Bates; alasnik@smilezone.com > Cc: spambayes@python.org > Subject: RE: [Spambayes] Checking for update not working > > [Tom Bates and Adam Lasnik both report, wrt the Outlook addins > "check for new version" menu item] > > "There was an error checking for the latest version > > For specific details on the error, please see the Spambayes log > > > > Please check your internet connection, or try again later" > > I don't know what's up with that code, but see the same thing here. > Please > see the bug report I just filed, and if your symptoms are the same add a > comment to the bug report saying so (sorry, you'll probably have to paste > this URL back together again): > > http://sf.net/tracker/index.php?func=detail&aid=806238&group_id=61702&atid > =4 > 98103 > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From beckman at purplecow.com Sun Sep 14 19:29:02 2003 From: beckman at purplecow.com (Peter Beckman) Date: Sun Sep 14 19:29:05 2003 Subject: [Spambayes] Still getting horrible results from SpamBayes - Anyadvice? In-Reply-To: <023401c37b14$546148d0$5500fea9@shortygeorge> References: <023401c37b14$546148d0$5500fea9@shortygeorge> Message-ID: <20030914192650.K36425@thermonuclear.org> On Sun, 14 Sep 2003, Adam Lasnik wrote: > From: Peter Beckman [mailto:beckman@purplecow.com] > > > I noticed that the email didn't even have a X-SpamBayes-Probability: > > header (or whatever it is). Maybe that's the problem; some of your > > mail isn't getting tagged. > > > > Check it in the headers. > > I just sent myself a test message. It showed up as a 4% spam > probability in the SPAM Outlook field, but when looking at headers > (reproduced below), there is no mention of Spam or SpamBayes. Is that a > problem? Hmmm. Tony, anyone? I know that every piece of mail that is either trained or processed by SpamBayes has a header added to it. If your mail DOES NOT have a header to it, it would lead me to believe that there is either a configuration problem or it is working correctly and it just works differently using the Outlook plugin. Comments? --------------------------------------------------------------------------- Peter Beckman Internet Guy beckman@purplecow.com http://www.purplecow.com/ --------------------------------------------------------------------------- From tim.one at comcast.net Sun Sep 14 19:32:34 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 14 19:34:25 2003 Subject: [Spambayes] Checking for update not working In-Reply-To: <20030914232202.BE25613E222@sack.dreamhost.com> Message-ID: [Adam Walker] > It's a 404 error. Whatever file (Version.py, I think) it's looking > for on the server is not there. I think someone needs to run "make > version" and push the file out the website. Please see the bug report I linked to before; the problem is deeper than a 404 error, as explained there (and the file is on the website, as you can confirm, e.g., by pasting the URL into your browser). From adam.walker at rbwconsulting.com Sun Sep 14 19:36:06 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Sun Sep 14 19:36:19 2003 Subject: [Spambayes] Checking for update not working In-Reply-To: <20030914232202.BE25613E222@sack.dreamhost.com> Message-ID: <20030914233609.8DEAD13E260@sack.dreamhost.com> I should have read the bug report first ;) When I paste "http://www.spambayes.org/download/Version.cfg" into my browser the page forwards to "http://spambayes.sourceforge.net//download/Version.cfg" which also causes the error with urlib2. however, "http://spambayes.sourceforge.net/download/Version.cfg" does not. > -----Original Message----- > From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] > On Behalf Of Adam Walker > Sent: Sunday, September 14, 2003 7:22 PM > To: 'Tim Peters'; 'Tom Bates'; alasnik@smilezone.com > Cc: spambayes@python.org > Subject: RE: [Spambayes] Checking for update not working > > It's a 404 error. Whatever file (Version.py, I think) it's looking for on > the server is not there. I think someone needs to run "make version" and > push the file out the website. > > > -----Original Message----- > > From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] > > On Behalf Of Tim Peters > > Sent: Sunday, September 14, 2003 7:16 PM > > To: Tom Bates; alasnik@smilezone.com > > Cc: spambayes@python.org > > Subject: RE: [Spambayes] Checking for update not working > > > > [Tom Bates and Adam Lasnik both report, wrt the Outlook addins > > "check for new version" menu item] > > > "There was an error checking for the latest version > > > For specific details on the error, please see the Spambayes log > > > > > > Please check your internet connection, or try again later" > > > > I don't know what's up with that code, but see the same thing here. > > Please > > see the bug report I just filed, and if your symptoms are the same add a > > comment to the bug report saying so (sorry, you'll probably have to > paste > > this URL back together again): > > > > > http://sf.net/tracker/index.php?func=detail&aid=806238&group_id=61702&atid > > =4 > > 98103 > > > > > > _______________________________________________ > > Spambayes@python.org > > http://mail.python.org/mailman/listinfo/spambayes > > Check the FAQ before asking: http://spambayes.sf.net/faq.html > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From tim.one at comcast.net Sun Sep 14 19:37:32 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 14 19:39:24 2003 Subject: [Spambayes] Still getting horrible results from SpamBayes -Anyadvice? In-Reply-To: <20030914192650.K36425@thermonuclear.org> Message-ID: [Peter Beckman] > Hmmm. Tony, anyone? I know that every piece of mail that is either > trained or processed by SpamBayes has a header added to it. If your > mail DOES NOT have a header to it, it would lead me to believe that > there is either a configuration problem or it is working correctly > and it just works differently using the Outlook plugin. > > Comments? The Outlook addin is deeply integrated with Outlook, and doesn't add anything directly to messages (in particular, it doesn't add any new header lines). Instead the Outlook addin attaches a piece of "spam score" metadata to the message, which Outlook can display in a view column, and sort on (etc). Support for user-defined metadata is an Outlook feature the addin exploits. From jeff at leyser.org Sun Sep 14 19:42:08 2003 From: jeff at leyser.org (Jeff Leyser) Date: Sun Sep 14 19:42:47 2003 Subject: [Spambayes] Unsure to Ham not training properly? Message-ID: <004401c37b19$d1f0bd80$6401a8c0@collabneleyser> Using Outlook plugin 0.8. Trained on about 400 Spam and 100 Ham. I have two folders being watched, "Inbox" and "Mailbox." Outlook rules file email based on the account into one of those two. If an email is scored at, say, 40%, and thuse dumped into Unsure, and I move it to "Spam," I see the Spam score go to 100%. But if I move it to Inbox or Mailbox, the score stays the same. Is that the correct behavior? If so, why? Thanks! Jeff Leyser From T.A.Meyer at massey.ac.nz Sun Sep 14 19:56:05 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 14 19:56:25 2003 Subject: [Spambayes] start up Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303471A90@its-xchg4.massey.ac.nz> > I just downloaded Spambayes for the first time and after > a lot of work I managed to get it running - sort of. It seems > to work well when it works, but I have to run the > command-line C:\python23\python C:\sambayes\pop3proxy.py -b > every time I start up the computer before I can collect email > - or EVERY time if I close out the command line box. > Is there any way to have it load automatically on > start-up and stay in memory rather than have to do this every time? > I'm new at this so I may have missed something. What version of windows are you using? If you're using WinNT, Win2K or WinXP you can install the pop3proxy as a Windows service. The pop3proxy_server.py script in the spambayes windows directory will do this for you. If you're using Win9x/WinME, then your best bet is just to put it in the startup items folder (you might want to run 'pythonw' instead of 'python', so that you don't get the console window popping up, and an error when you try to shutdown saying that Windows can't close the console window. Note also that a binary version of pop3proxy, which would automatically do the above, is not far off. =Tony Meyer From tfbiv at comcast.net Sun Sep 14 20:21:09 2003 From: tfbiv at comcast.net (Tom Bates) Date: Sun Sep 14 20:18:12 2003 Subject: [Spambayes] View spam score column in Outlook? Message-ID: How? May be a good question for the FAQ! :-) Tom From adam.walker at rbwconsulting.com Sun Sep 14 20:25:26 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Sun Sep 14 20:25:32 2003 Subject: [Spambayes] View spam score column in Outlook? In-Reply-To: Message-ID: <20030915002529.278D613E268@sack.dreamhost.com> If you have version .8 or higher, click the spambayes dropdown help->About SpamBayes Other wise, http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambaye s/Outlook2000/about.html?rev=HEAD > -----Original Message----- > From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] > On Behalf Of Tom Bates > Sent: Sunday, September 14, 2003 8:21 PM > To: Spambayes Mailing List (outgoing) > Subject: [Spambayes] View spam score column in Outlook? > > How? > > May be a good question for the FAQ! :-) > > Tom > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From T.A.Meyer at massey.ac.nz Sun Sep 14 20:29:28 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 14 20:29:41 2003 Subject: [Spambayes] start up Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303471ADB@its-xchg4.massey.ac.nz> > At 04:56 PM 9/14/2003, you wrote: > >WinXP you can install the pop3proxy as a Windows service. The > >pop3proxy_server.py script in the spambayes windows > directory will do > >this for you. > > In practice attempting to install this service garners this > error in line 49 of pop3proxy_service.py: > > import win32serviceutil, win32service > ImportError: No module named win32serviceutil Sorry, should have mentioned this: in order to use the pop3proxy_service.py script (or the Outlook plug-in) from source, you need to have not only Python installed, but also Mark Hammond's win32 extensions. =Tony Meyer From thetasig at comcast.net Sun Sep 14 22:20:41 2003 From: thetasig at comcast.net (theta sigma) Date: Sun Sep 14 22:20:47 2003 Subject: [Spambayes] Wishlist Message-ID: <6.0.0.22.2.20030914191517.01badd78@localhost> Apologies if this idea has been tossed around before. Can Spambayes delete on the server prior to downloading? It would be great if on the review messages page you could use the "delete" button to delete prior to downloading. I now use a separate program "Mailwasher" to do this. It avoids downloading viruses, etc. and also can bounce the incoming mail back to the sender. Thx. -=mark=- From T.A.Meyer at massey.ac.nz Sun Sep 14 22:55:02 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 14 22:55:17 2003 Subject: [Spambayes] Wishlist Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303471BE0@its-xchg4.massey.ac.nz> > Can Spambayes delete on the server prior to downloading? It > would be great if on the review messages page you could use the "delete" > button to delete prior to downloading. One problem here is that spambayes needs to download the message to score it, so you can't avoid downloading the message. You could just classify on the headers, which gives ok (but not great) results, or (for some people) run spambayes on the server, which have been discussed before, but they're not really good general purpose solutions. (I actually have code to classify on headers only and leave spam on the server, but I've never got around to finishing it, because (I think) the sb_pop3dnd script should remove the need). The other thing is that messages are only added to the pop3proxy cache when you download them. It's a proxy, not a full mail application. In order to initiate a download (or to delete), we'd need to know your username and password as well as the details it has at the moment. This leads to the problem that if the review page is adding messages to the cache, and messages are also added via the mail client, some will end up there twice. Suggestions of methods that avoid these problems are welcome, of course. =Tony Meyer From T.A.Meyer at massey.ac.nz Sun Sep 14 22:59:42 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 14 22:59:57 2003 Subject: [Spambayes] Unsure to Ham not training properly? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303471BEA@its-xchg4.massey.ac.nz> > I have two folders being watched, "Inbox" and "Mailbox." > Outlook rules file email based on the account into one of those two. > > If an email is scored at, say, 40%, and thuse dumped into > Unsure, and I move it to "Spam," I see the Spam score go to > 100%. But if I move it to Inbox or Mailbox, the score stays > the same. Is that the correct behavior? If so, why? Do you have both the 'incremental training' checkboxes ticked? (on the 'Training' tab of the manager dialog). If so, then perhaps saving the spam score isn't possible for the "Inbox" and "Mailbox" folders? Are they different from the "Spam" folder in some way? (like one is on Exchange/IMAP/Hotmail?). If you move the message to the Inbox and then view the spam clues for the message, does the score in the clues match the score in the spam column? =Tony Meyer From jeff at leyser.org Sun Sep 14 23:20:38 2003 From: jeff at leyser.org (Jeff Leyser) Date: Sun Sep 14 23:21:18 2003 Subject: [Spambayes] Unsure to Ham not training properly? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1303471BEA@its-xchg4.massey.ac.nz> Message-ID: <004b01c37b38$58363b40$6401a8c0@collabneleyser> > -----Original Message----- > From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] > Sent: Sunday, September 14, 2003 8:00 PM > To: jeff@leyser.org; spambayes@python.org > Subject: RE: [Spambayes] Unsure to Ham not training properly? > > > > I have two folders being watched, "Inbox" and "Mailbox." > > Outlook rules file email based on the account into one of those two. > > > > If an email is scored at, say, 40%, and thuse dumped into > > Unsure, and I move it to "Spam," I see the Spam score go to > > 100%. But if I move it to Inbox or Mailbox, the score stays > > the same. Is that the correct behavior? If so, why? > > Do you have both the 'incremental training' checkboxes > ticked? (on the > 'Training' tab of the manager dialog) Yes, both are checked. > If so, then perhaps saving the spam score isn't possible for > the "Inbox" > and "Mailbox" folders? Are they different from the "Spam" folder in > some way? (like one is on Exchange/IMAP/Hotmail?). If you move the > message to the Inbox and then view the spam clues for the > message, does > the score in the clues match the score in the spam column? No, all mailboxes the same, all in a single PST file on my local drive. The %age in the spam clues matches the score in the spam column. J From wsy at merl.com Mon Sep 15 07:12:20 2003 From: wsy at merl.com (wsy@merl.com) Date: Mon Sep 15 07:12:29 2003 Subject: [Spambayes] Suggestion for HTML analysis In-Reply-To: References: Message-ID: <200309151112.h8FBCKR04008@localhost.localdomain> From: "Tom Bates" I'm new to the list. I hope this topic hasn't already been beat to death, but recently I've gotten HTML-formatted spam that attempts to circumvent recognition by inserting copious amounts of HTML garbage tags between letters, like so (an actual sample): Consolidation Removing spurious HTML comments is one of the two things that CRM114 Mailfilter does that isn't totally learned-behavior (the other is popping open base64's) It helps a lot. SpamBayes should at least consider it. -Bill Yerazunis From Biren.Kundalia at aig.com Mon Sep 15 09:41:43 2003 From: Biren.Kundalia at aig.com (Kundalia, Biren) Date: Mon Sep 15 09:41:52 2003 Subject: [Spambayes] SpamBayes Outlook Plugin Message-ID: <930372E6D93A0E42A4B6A441DD557C8B0E4FF5@xlivmbx15.aig.com> Hi, We've been looking at the Outlook plugin (versions 006 through 008) on Outlook XP / WindowsXP, and are quite impressed by the results. We were wondering if you had any sort of paid support model in place for it, and if not, did you have any plans for establishing one? Regards, Biren From tim.one at comcast.net Mon Sep 15 10:13:00 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon Sep 15 10:13:12 2003 Subject: [Spambayes] SpamBayes Outlook Plugin In-Reply-To: <930372E6D93A0E42A4B6A441DD557C8B0E4FF5@xlivmbx15.aig.com> Message-ID: [Kundalia, Biren] > We've been looking at the Outlook plugin (versions 006 through 008) on > Outlook XP / WindowsXP, and are quite impressed by the results. Glad to hear it! > We were wondering if you had any sort of paid support model in place > for it, and if not, did you have any plans for establishing one? We're not a company, we're volunteers spread around the world who work on SpamBayes in our spare time. So the idea of paid support is something each individual here has to decide for themself. Mark Hammond-- who wrote the vast bulk of the Outlook addin --is a self-employed consultant, so would presumably consider paid SpamBayes support to the extent that his schedule allowed and rates were competitive with his other consultancy work. Note that at least one commercial product happens to build on the SpamBayes code (our code is free for use by anyone, for any purpose): http://www.inboxer.com/ Most people submit bug reports on our SourceForge site: http://sf.net/tracker/?group_id=61702&atid=498103 You can browse that-- and the spambayes mailing list archives --to judge the quality of support you can probably expect to get without cost. If you want to tell people what to do, and/or on your schedule, then free support probably won't work well for you. From fred.bartholomai at acsatl.com Mon Sep 15 11:46:59 2003 From: fred.bartholomai at acsatl.com (Fred Bartholomai) Date: Mon Sep 15 11:47:06 2003 Subject: [Spambayes] Help needed for using spambayes on PC Windows 2000 Message-ID: <000001c37ba0$996e1ca0$1801140a@Win2000Fred> Dear SpamBayes, I need help installing SpamBayes on my PC in order to interface with a Non Outlook 2000 Email client. The client I am using is: Mozilla Thunderbird 0.2 We are having problems with Outlook 2000. Your web page mentions to download the source to my PC. My problem Is I do not have a compiler on my PC. Do you have any binary versions available That I could download ? Thank you, Fred Bartholomai ---------------------------------------------------- Fred Bartholomai Advanced Control Systems, Inc. Software Dept., R&D Consultant 770 849 0131 x334 fred.bartholomai@acsatl.com From david.a.stever at lmco.com Mon Sep 15 11:50:27 2003 From: david.a.stever at lmco.com (Stever, David A) Date: Mon Sep 15 11:53:16 2003 Subject: [Spambayes] Spambayes stopped this morning Message-ID: I've used Spambayes 0.7 with Outlook 2002 on Windows 2000 for a few weeks, and have been loving it. This morning, I restarted the machine, and noticed about 120 items filtered. When I went to the Possible Spam folder, I noted that the header only said "Recover from Spam". Went back to Inbox, and saw the same header. Same with the Spam folder. No links on the Spambayes button worked. Picked up 0.81, installed it with no change in problem. Uninstalled it, renamed the SpamBayes directory, reinstalled the program, no change. I notice that the SpamBayes directory was not recreated after the reinstall. Also, the SpamBayes toolbar was not recreated after any of these installations. I ran Spambot S&D, and didn't see anything out of the ordinary. I love SpamBayes too much to be able to run like this for very long. I am continuing to move spam to David Stever 651-456-4844 L EIS Service Desk, at Eagan Minnesota From davieboy28 at hotmail.com Mon Sep 15 12:22:31 2003 From: davieboy28 at hotmail.com (davieboy28) Date: Mon Sep 15 12:22:39 2003 Subject: [Spambayes] (no subject) Message-ID: Hi, Spambayes is very good, thank you for a great bit of code. However, it would be nice if the app would automatically mark spam and "maybe spam" as read, so the new email flag wouldn't keep lighting up, and pretty much forcing me to look at the "unread" spam, which is highlighted. Thanks, DAve From adam.walker at rbwconsulting.com Mon Sep 15 13:29:30 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Mon Sep 15 13:29:46 2003 Subject: [Spambayes] Spambayes stopped this morning In-Reply-To: Message-ID: <20030915172942.E79AD13E224@sack.dreamhost.com> Hello David, I think the troubleshooting guide may help. http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambaye s/Outlook2000/docs/troubleshooting.html?rev=HEAD You may have to paste the url back together. --Adam > -----Original Message----- > > I've used Spambayes 0.7 with Outlook 2002 on Windows 2000 for a few > weeks, and have been loving it. This morning, I restarted the machine, > and noticed about 120 items filtered. When I went to the Possible Spam > folder, I noted that the header only said "Recover from Spam". Went > back to Inbox, and saw the same header. Same with the Spam folder. No > links on the Spambayes button worked. Picked up 0.81, installed it > with no change in problem. Uninstalled it, renamed the SpamBayes > directory, reinstalled the program, no change. I notice that the > SpamBayes directory was not recreated after the reinstall. Also, the > SpamBayes toolbar was not recreated after any of these installations. > > I ran Spambot S&D, and didn't see anything out of the ordinary. > I love SpamBayes too much to be able to run like this for very long. I > am continuing to move spam to > > David Stever From tim.one at comcast.net Mon Sep 15 13:34:04 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon Sep 15 13:34:09 2003 Subject: [Spambayes] Suggestion for HTML analysis In-Reply-To: <200309151112.h8FBCKR04008@localhost.localdomain> Message-ID: [wsy@merl.com] > Removing spurious HTML comments is one of the two things that > CRM114 Mailfilter does that isn't totally learned-behavior (the > other is popping open base64's) > > It helps a lot. SpamBayes should at least consider it. We already remove all HTML tags (including nonsense tags), comment sections, and style sections. base64 attachments get decoded if and only if they have MIME type text/* (so, e.g., we don't bother decoding images or audio files). These are all instances of that spambayes tries to score what the user sees, rather than how it happened to be encoded. Special parsing of embedded URLs is a big exception to that. From marc at strobealific.com Mon Sep 15 13:49:46 2003 From: marc at strobealific.com (Marc Bowyer) Date: Mon Sep 15 13:49:59 2003 Subject: [Spambayes] Recovery from Defrag problem Message-ID: Yes, I got it back. I did a complete uninstall after a reboot. I didn't open anything before. Next, I manually deleted the SpamBayes folders in Program Files & Documents and Settings/All Users & Documents and Settings/Marc Bowyer (insert your log on there). Then, I rebooted again. I then did a file search for the word "spambayes" and deleted all the files. For some reason, there were a few files that wouldn't delete. I rebooted again, did the search again, and was able to delete the final files. Rebooted again. Installed SpamBayes again. Rebooted one last time. Success! Lots of rebooting, but it worked. Marc Bowyer From T.A.Meyer at massey.ac.nz Mon Sep 15 18:21:24 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 15 18:21:39 2003 Subject: [Spambayes] Help needed for using spambayes on PC Windows 2000 Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303471E02@its-xchg4.massey.ac.nz> > Your web page mentions to download the source to my PC. My > problem Is I do not have a compiler on my PC. FWIW, what you need to have installed is Python. > Do you have any binary versions available That I could download ? A binary version that should work for anyone with POP3 (not IMAP) mail is in the works right now (versions of the binary are being tested to check that they work ok). So if you can wait a week or two, yes, there will be a binary version. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 15 18:28:37 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 15 18:28:56 2003 Subject: [Spambayes] (no subject) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303471E11@its-xchg4.massey.ac.nz> > Spambayes is very good, thank you for a great bit of code. > However, it would be nice if the app would automatically mark > spam and "maybe spam" as read, You can do this. Please see the configuration guide, which is linked from the 'about spambayes' page (in the Help menu of the main SpamBayes drop-down menu). The options that you're after are on the 'filtering' and 'training' tabs of the main dialog. (Assuming that you are using version 008.1). > so the new email flag wouldn't > keep lighting up, and pretty much forcing me to look at the > "unread" spam, which is highlighted. This is a different issue. Please see FAQ 3.9: =Tony Meyer From gcarter at rsasecurity.com Tue Sep 16 09:29:46 2003 From: gcarter at rsasecurity.com (Carter, Greg) Date: Tue Sep 16 09:31:52 2003 Subject: [Spambayes] Error Message-ID: To whom it may concern. This is the error I am receiving. Any idea of why I am getting this? * the version of Windows you are using = XP * the version of SpamBayes Bin 0.7 * any log files. <> <> <> <> -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 944 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030916/38b05a07/spambayes3.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 735 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030916/38b05a07/spambayes4.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 3455 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030916/38b05a07/spambayes1.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes2.log Type: application/octet-stream Size: 944 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030916/38b05a07/spambayes2.obj From mhammond at skippinet.com.au Tue Sep 16 10:22:42 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue Sep 16 10:22:33 2003 Subject: [Spambayes] Error In-Reply-To: Message-ID: <00a901c37c5d$fde34a80$f502a8c0@eden> Unfortunately, you have struck a recent problem when checking for a new version - but there *is* a new version, and it should make configuration of SpamBayes much simpler. Please see www.spambayes.org/windows.html Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Carter, Greg > Sent: Tuesday, 16 September 2003 11:30 PM > To: 'spambayes@python.org' > Subject: [Spambayes] Error > > > To whom it may concern. This is the error I am receiving. > Any idea of why > I am getting this? > > > > > * the version of Windows you are using = XP > * the version of SpamBayes Bin 0.7 > * any log files. > > > <> <> <> > <> > > > > From pkipe at corpdatasys.com Tue Sep 16 12:46:22 2003 From: pkipe at corpdatasys.com (Peter N. Kipe) Date: Tue Sep 16 12:46:33 2003 Subject: [Spambayes] Obvious Spam Missed... Message-ID: <003401c37c72$1002c9c0$1505a8c0@CDS002> To all, I've had SpamBayes installed for over a month now, and while it IS helping, I'm not seeing the results I was expecting. I installed SpamBayes-Outlook-Setup-006.exe dated 8-6-2003, size 3,391,736 bytes. I'm using Outlook 2002 SP2 on WinXP Pro. My e-mail is served by my ISP's POP3 server. I receive 100-200 spams per day, depending upon the day. In a typical heavy spam day, SpamBayes identifies about 10 of those as certain spam, and about 60% of the rest as probable spam. What I'm saying is that probably 30% or so remains in my inbox for me to delete as spam. Examples of the kind of spam I'm referring to are "3 in 1 SmartMini Cam Blowout!!!", "you've never seen amateurs do this", and "Pkipe Quality Valiumm, prozac, ambien.ccrvu? Nrlt." These all score in the 30-40 range. I've deleted over 3500 items as spam, and recovered only about 25 from spam. When I receive new mail, I first look at the number of unread items in my spam folder. If there are any, I switch to that folder and verify that the new items are in fact spam, then mark them as "read". Next, I look at my probable spam folder to make sure that everything in it is spam. If so, I select all items, mark them as "read", then click the SpamBayes "Delete as Spam" button. Finally, I look at unread items in my inbox to see if any are spam. I select all spam I find, then mark them as "read", and click "Delete as Spam". The bottom line is that I'm trying to use the "read" flag to highlight new items, and turn the flag off after items have been reviewed. Question: Is there some aspect of the approach I'm using that might cause SpamBayes not to "learn" what I consider to be spam, or does it sound like SpamBayes is essentially working as it should be? Thanks in advance for any thoughts. Peter Kipe From MLWALKER at niaid.nih.gov Tue Sep 16 14:23:15 2003 From: MLWALKER at niaid.nih.gov (Walker, Marni (NIH/NIAID)) Date: Tue Sep 16 14:23:20 2003 Subject: [Spambayes] SpamBayes Message-ID: Kudos and thank you thank you thank you!!! This is by far the best and easiest program I have ever used, and more to the point...IT WORKS!!!! Again, thank you, thank you THANK YOU!!!! Marni-Lynne Walker From tim.one at comcast.net Tue Sep 16 14:53:07 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 16 14:53:12 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: <003401c37c72$1002c9c0$1505a8c0@CDS002> Message-ID: [Peter N. Kipe] > I've had SpamBayes installed for over a month now, and while it IS > helping, I'm not seeing the results I was expecting. I installed > SpamBayes-Outlook-Setup-006.exe dated 8-6-2003, size 3,391,736 bytes. > I'm using Outlook 2002 SP2 on WinXP Pro. My e-mail is served by my > ISP's POP3 server. > ... How many total spam have you trained on, and how many total ham? spambayes works best if those are approximately equal numbers. If you have a large imbalance, close Outlook, find your default_bayes_customize.ini file and change the line experimental_ham_spam_imbalance_adjustment: True to experimental_ham_spam_imbalance_adjustment: False then start Outlook again. You don't need to retrain after this. Then let us know whether that helped. From hauser at carneylaw.com Tue Sep 16 15:27:10 2003 From: hauser at carneylaw.com (Hauser, Lee) Date: Tue Sep 16 15:23:02 2003 Subject: [Spambayes] Problem with Outlook plugin Message-ID: <6E165C6D8BDFC042A0FDB4A8635A2F2154197C@ed.carneylaw.local> We have several machines running the Outlook plugin without problem. It's great software! However, a user on one machine had to tell Outlook to disable the plugin -- Outlook was having difficulties attaching to the Exchange server which actually had nothing to do with Spambayes, and when it asked if she wanted to disable it, she said "yes." Now nothing I do will make the plugin work again. The original plugin was 0.7. I have tried uninstalling, manually clearning the plugin's registry settings under HKCU, reinstalling, re-registering the plugin DLL, and installing 0.81. (Realtime virus protection is turned off during install). The toolbar is still there, but nothing happens when I click any menu item, and no filtering is going on. Any hints about what I can do about this? Lee Hauser, System Administrator Carney Badley Spellman 700 5th Ave Suite 5800 Seattle WA 98104 206.689.4312 From rhough at chancery.com Tue Sep 16 15:26:14 2003 From: rhough at chancery.com (Richard Hough) Date: Tue Sep 16 15:26:20 2003 Subject: [Spambayes] Cannot enable Spambayes Message-ID: <6854D03E6CF5FD42B59F138A7B271E011D10AB@exchange> Great idea guys, but I cannot enable the Outlook plugin after installing it. I'm running Win2K Server and Outlook 2000. Spambayes installs OK, I went through the training session and trained hundreds of spam and good messages, have defined a watch and unsure folder, set the filtering options to move certain and suspected spam to these folders, checked the "Enable Spambayes" box in the General tab, and closed it. But no messages are ever moved to the spam folder. Opening the SpamBayes Manager again shows the Enable button is still unchecked. I can check it again but closing the dialog, or even switching to a different tab and back, will uncheck it. I created a Spam view column as suggested and it does show a high percent for spam, it just never moves it to my defined spam folder. Even 100% spam stays in my Inbox. If I run "Filter Now" on my Inbox and select "Perform all filter actions" it will move it to the spam folder, but I want it to do this automatically with mail as it arrives. Richard Hough From rmalayter at bai.org Tue Sep 16 15:45:07 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Tue Sep 16 15:45:16 2003 Subject: [Spambayes] Obvious Spam Missed... Message-ID: <792DE28E91F6EA42B4663AE761C41C2A011151D8@cliff.bai.org> From: Tim Peters [mailto:tim.one@comcast.net] > If you have a large imbalance, close Outlook, > find your default_bayes_customize.ini file and > change the line Tim, can you quantify "a large imbalnace"? Is a 2:1 ratio significant enough to cause problems, or would it have to be something on the order of 10:1? I'm getting a 98% capture rate after training with a 2:1 ham-to-spam ratio, but other folks seem to be having troubles. From dave at dnh.sk.ca Tue Sep 16 15:56:29 2003 From: dave at dnh.sk.ca (Dave Hall) Date: Tue Sep 16 15:58:07 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A011151D8@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2A011151D8@cliff.bai.org> Message-ID: <20030916195629.GA32110@dnh.sk.ca> On Tue, Sep 16, 2003 at 02:45:07PM -0500, Ryan Malayter wrote: > From: Tim Peters [mailto:tim.one@comcast.net] > > If you have a large imbalance, close Outlook, > > find your default_bayes_customize.ini file and > > change the line > > Tim, can you quantify "a large imbalnace"? Is a 2:1 ratio significant > enough to cause problems, or would it have to be something on the order > of 10:1? > > I'm getting a 98% capture rate after training with a 2:1 ham-to-spam > ratio, but other folks seem to be having troubles. I trained with a little over 500 ham and 1500 spam, so far nothing has been misclassified and only a few unsures in just over a week of use. It even captured a bunch of double-bounced spam to mailer-daemon on my server (it gets delivered to my inbox). A 1:3 ratio does not seem to skew things, at least not for my typical mail. -- Dave =============================================================== | <- You must be smarter than this stick to ride the Internet -Mike Handler =============================================================== From tim.one at comcast.net Tue Sep 16 16:11:44 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 16 16:11:48 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A011151D8@cliff.bai.org> Message-ID: [Ryan Malayter] > Tim, can you quantify "a large imbalnace"? Is a 2:1 ratio significant > enough to cause problems, Probably not. > or would it have to be something on the order of 10:1? The larger the worserer . We've seen people with imbalance worse than 300:1 or 1:300 (different people get extreme in different ways). > I'm getting a 98% capture rate after training with a 2:1 ham-to-spam > ratio, but other folks seem to be having troubles. Other folks are having troubles, but in each such case to date where they've stuck around long enough to answer the questions, it turns out they've had extreme imbalance in their training data. Of those, a few less have stuck around long enough to report back that disabling the experimental_ham_spam_imbalance option helped. That's all I've got to go on, and over on spambayes-dev we're talking about changing this option as a result. Do you happen to know the training ratios for the "other folks [with] troubles" you know about? From tim.one at comcast.net Tue Sep 16 16:11:45 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 16 16:11:54 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: <20030916195629.GA32110@dnh.sk.ca> Message-ID: [Dave Hall] > I trained with a little over 500 ham and 1500 spam, so far nothing > has been misclassified and only a few unsures in just over a week of > use. It even captured a bunch of double-bounced spam to > mailer-daemon on my server (it gets delivered to my inbox). A 1:3 > ratio does not seem to skew things, at least not for my typical mail. It's more complicated than that : the experimental_ham_spam_imbalance option is enabled by default in the Outlook client, but disabled by default for all other spambayes applications. So unless we know how you're using spambayes, your testimony isn't telling us whether the option isn't needed for your 1:3 imbalance, or that the option is helping with your 1:3 imbalance. From adam.walker at rbwconsulting.com Tue Sep 16 16:22:14 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Tue Sep 16 16:22:49 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: Message-ID: <20030916202245.D8FE586276@plunder.dreamhost.com> With the option set to True, I had no trouble with 2.4:1 and some misclassifications when it was over 30:1. I have the option set to False now and a ratio of 2.3:1 and no trouble. > -----Original Message----- > [Ryan] > > I trained with a little over 500 ham and 1500 spam, so far nothing > > has been misclassified and only a few unsures in just over a week of > > use. It even captured a bunch of double-bounced spam to > > mailer-daemon on my server (it gets delivered to my inbox). A 1:3 > > ratio does not seem to skew things, at least not for my typical mail. > > [TimP] > It's more complicated than that : the > experimental_ham_spam_imbalance > option is enabled by default in the Outlook client, but disabled by > default > for all other spambayes applications. So unless we know how you're using > spambayes, your testimony isn't telling us whether the option isn't needed > for your 1:3 imbalance, or that the option is helping with your 1:3 > imbalance. > From dave-spambayes at dnh.sk.ca Tue Sep 16 16:27:20 2003 From: dave-spambayes at dnh.sk.ca (Dave Hall) Date: Tue Sep 16 16:29:08 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: References: <20030916195629.GA32110@dnh.sk.ca> Message-ID: <20030916202720.GA31473@dnh.sk.ca> On Tue, Sep 16, 2003 at 04:11:45PM -0400, Tim Peters wrote: > [Dave Hall] > > I trained with a little over 500 ham and 1500 spam, so far nothing > > has been misclassified and only a few unsures in just over a week of > > use. It even captured a bunch of double-bounced spam to > > mailer-daemon on my server (it gets delivered to my inbox). A 1:3 > > ratio does not seem to skew things, at least not for my typical mail. > > It's more complicated than that : the experimental_ham_spam_imbalance > option is enabled by default in the Outlook client, but disabled by default > for all other spambayes applications. So unless we know how you're using > spambayes, your testimony isn't telling us whether the option isn't needed > for your 1:3 imbalance, or that the option is helping with your 1:3 > imbalance. Ah, that could make a difference. I'm not using the outlook plugin, just qmail/hammie/procmail at present. -- Dave =============================================================== | <- You must be smarter than this stick to ride the Internet -Mike Handler =============================================================== From skip at pobox.com Tue Sep 16 16:31:10 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 16 16:31:22 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: References: <792DE28E91F6EA42B4663AE761C41C2A011151D8@cliff.bai.org> Message-ID: <16231.29454.894879.537811@montanaro.dyndns.org> >> or would it have to be something on the order of 10:1? Tim> The larger the worserer . We've seen people with imbalance Tim> worse than 300:1 or 1:300 (different people get extreme in Tim> different ways). In particular, it's almost certainly a mistake to initially train SpamBayes on all the mailboxes on your computer which contain nothing but ham. My guess is that some people are doing this ("yup, that's ham . yup, that's ham ....), declaring all the mailboxes which are full of ham to the Outlook plugin. Most people don't save the spam they receive, so doing that would cause them to immediately start off with a horrible imbalance. To make matters worse, if much of the ham saved is very old, it may not accurately reflect what current ham looks like. Since it's so easy to suck existing mailboxes into the Outlook plugin, perhaps it should warn the user if the ham:spam ratio gets too out-of-whack. Skip From rmalayter at bai.org Tue Sep 16 17:12:15 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Tue Sep 16 17:12:20 2003 Subject: [Spambayes] Obvious Spam Missed... Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D417@cliff.bai.org> From: Skip Montanaro [mailto:skip@pobox.com] > ("yup, that's ham . yup, that's ham ....), > declaring all the mailboxes which are full of ham > to the Outlook plugin. Most people don't save the > spam they receive, so doing that would cause them to > immediately start off with a horrible imbalance. > To make matters worse, if much of the ham saved is > very old, it may not accurately reflect what current > ham looks like. Which brings us back to the "automatic subset" feature Tim, myself, and others were talking about in the SpamBayes "feature requests" forum. Like most people, I organize my messages into a lot of folders, and I want to train on messages from each of those folders, so that my personal ham, mailing list ham, and work-related ham are all represented. There's no easy way to do get a representative subset with the current Outlook plug-in, it would have to be done by hand, and that would take quite a while. The algoritm I suggested over in the feature requests forum would solve both the age and "representative sample" problems neatly. Perhaps I should just make a script (in VBA I guess) that creates the representative sample automatically, and then I can train with it and we can post it as an add-on or something. If people like it, it can be integrated into the plug-in installer. Regards, -Ryan- From mhammond at skippinet.com.au Tue Sep 16 17:31:15 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue Sep 16 17:31:08 2003 Subject: [Spambayes] Cannot enable Spambayes In-Reply-To: <6854D03E6CF5FD42B59F138A7B271E011D10AB@exchange> Message-ID: <015a01c37c99$dc9dde80$f502a8c0@eden> Please see the "troubleshooting guide" (installed, but online at http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambaye s/Outlook2000/docs/troubleshooting.html) This will have information how to locate the log file, so we can better guess your problem. Also please make sure you are using version 008 of the plugin. Mark > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Richard Hough > Sent: Wednesday, 17 September 2003 5:26 AM > To: 'spambayes@python.org' > Subject: [Spambayes] Cannot enable Spambayes > > > Great idea guys, but I cannot enable the Outlook plugin after > installing it. > > I'm running Win2K Server and Outlook 2000. Spambayes installs > OK, I went > through the training session and trained hundreds of spam and > good messages, > have defined a watch and unsure folder, set the filtering > options to move > certain and suspected spam to these folders, checked the > "Enable Spambayes" > box in the General tab, and closed it. But no messages are > ever moved to the > spam folder. > > Opening the SpamBayes Manager again shows the Enable button is still > unchecked. I can check it again but closing the dialog, or > even switching to > a different tab and back, will uncheck it. I created a Spam > view column as > suggested and it does show a high percent for spam, it just > never moves it > to my defined spam folder. Even 100% spam stays in my Inbox. > > If I run "Filter Now" on my Inbox and select "Perform all > filter actions" it > will move it to the spam folder, but I want it to do this > automatically with > mail as it arrives. > > Richard Hough > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From lklawson at heapy.com Tue Sep 16 18:42:00 2003 From: lklawson at heapy.com (Kirk Lawson) Date: Tue Sep 16 17:43:40 2003 Subject: [Spambayes] Outlook 2000 forms problem Message-ID: I've installed the Outlook pluging (version 7) on a 6 clients with good success. However two clients frequently get an error since installation: "Custom form could not be opened. Outlook will use an Outlook form instead." Can't seem to find it in the FAQ or the list archives. Kirk Lawson, Peace favor your sword --- "In these modern times, many men are wounded for not having weapons or knowledge of their use." -Achille Marozzo, 1536 From beckman at purplecow.com Tue Sep 16 17:48:31 2003 From: beckman at purplecow.com (Peter Beckman) Date: Tue Sep 16 17:48:45 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2AF3D417@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2AF3D417@cliff.bai.org> Message-ID: <20030916174601.X36425@thermonuclear.org> On Tue, 16 Sep 2003, Ryan Malayter wrote: > From: Skip Montanaro [mailto:skip@pobox.com] > > ("yup, that's ham . yup, that's ham ....), > > declaring all the mailboxes which are full of ham > > to the Outlook plugin. Most people don't save the > > spam they receive, so doing that would cause them to > > immediately start off with a horrible imbalance. > > > To make matters worse, if much of the ham saved is > > very old, it may not accurately reflect what current > > ham looks like. What about including a "starter" HammieDB, something that changes over time, based on a bunch of different people's settings. By starting with a nice ratio, hopefully most of the spam will go to the right place, and in the beginning, a lot of ham might be classified as "unsure", but that would happen anyway. At least most of the current "spam" would get marked as such. I know, email is a very personal thing and can't be "generalized" but I wonder if this would help in some cases where the people were less than aware of the ham/spam training stuff. --------------------------------------------------------------------------- Peter Beckman Internet Guy beckman@purplecow.com http://www.purplecow.com/ --------------------------------------------------------------------------- From lyle at nelson-usa.com Tue Sep 16 21:18:09 2003 From: lyle at nelson-usa.com (Lyle Nelson) Date: Tue Sep 16 19:18:06 2003 Subject: [Spambayes] The two buttons.... Message-ID: <000001c37cb9$8f537070$6501a8c0@lyleNotebook> The two buttons, Delete as spam and Spambays appear, but I can do nothing with either Button. If I click on the 'Delete as spam I get the message 'Spambays is not started'. I am running Windows XP Pro Lyle Nelson 14532 Kipling Ave. S. Savage, MN 55378 (952) 894-2587 www.nelson-usa.com lyle@nelson-usa.com mary@nelson-usa.com From T.A.Meyer at massey.ac.nz Tue Sep 16 19:20:43 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 16 19:20:56 2003 Subject: [Spambayes] Obvious Spam Missed... Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13034722A1@its-xchg4.massey.ac.nz> > Since it's so easy to suck existing mailboxes into the > Outlook plugin, perhaps it should warn the user if the > ham:spam ratio gets too out-of-whack. I think this is probably a good idea. What "too out-of-whack" is, I don't know - maybe 5:1? 10:1? (or vice-versa, obviously). I'm pretty sure that somewhere in the docs it mentions that a balanced set of data is good, but I don't imagine many people actually read that bit, so simply pointing out what is good once they are going wrong will probably help a lot. At the very least it might help people fix the problem themselves, rather than using the list ;) =Tony Meyer From rhough at chancery.com Tue Sep 16 19:27:18 2003 From: rhough at chancery.com (Richard Hough) Date: Tue Sep 16 19:27:35 2003 Subject: [Spambayes] Cannot enable Spambayes Message-ID: <6854D03E6CF5FD42B59F138A7B271E011D10AF@exchange> Yeah, I tried rebooting and it starts working... for a little while. The checkbox stays checked, but soon it goes back to staying unchecked. It's interesting you mention loading a "bitmap file". I notice that after rebooting Spambayes shows a blue "Spambayes anti-spam notifier" logo at the top of the manager page. However, when I was getting the problem this logo did not appear; there was a big blank space there. It also did not show the version number; it said something like "put version number here". Very strange. Any way, as I mentioned, it's probably something wrong with my machine. After I complained to you I started getting weird graphics in other applications, like Internet Explorer and a school gradebook application. Graphics weren't appearing in web pages before I installed Spambayes but I just ignored that, thinking the pages were bad. But now the toolbar in IE and Outlook are showing black silhouettes instead of icons, and my gradebook program is not showing graphics like the logo and toolbar icons either. They were working yesterday. I installed Microsoft SQL Server on my machine this morning, and suspect that is what may be wrong. I didn't notice anything funny until I installed Spambayes, so it got the blame. Sorry for jumping to conclusions. -----Original Message----- From: Mark Hammond [mailto:mhammond@skippinet.com.au] Sent: Tuesday, September 16, 2003 4:11 PM To: 'Richard Hough' Subject: RE: [Spambayes] Cannot enable Spambayes It is failing to load some of the bitmap files used by the dialogs, complaining it is "out of memory". I assume you have tried re-booting? Mark. > -----Original Message----- > From: Richard Hough [mailto:rhough@chancery.com] > Sent: Wednesday, 17 September 2003 8:48 AM > To: 'Mark Hammond' > Subject: RE: [Spambayes] Cannot enable Spambayes > > > Thanks for the quick reply. I am enclosing the spambayes log file you > mentioned. > > This may be a problem with my machine and not Spambayes; I > have been having > problems with other applicatiions on my machine recently. > > Any information on makig this work would be helpful. > > -----Original Message----- > From: Mark Hammond [mailto:mhammond@skippinet.com.au] > Sent: Tuesday, September 16, 2003 2:31 PM > To: 'Richard Hough'; spambayes@python.org > Subject: RE: [Spambayes] Cannot enable Spambayes > > > Please see the "troubleshooting guide" (installed, but online at > http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spam > bayes/spambaye > s/Outlook2000/docs/troubleshooting.html) This will have > information how to > locate the log file, so we can better guess your problem. > > Also please make sure you are using version 008 of the plugin. > > Mark > > > -----Original Message----- > > From: spambayes-bounces@python.org > > [mailto:spambayes-bounces@python.org]On Behalf Of Richard Hough > > Sent: Wednesday, 17 September 2003 5:26 AM > > To: 'spambayes@python.org' > > Subject: [Spambayes] Cannot enable Spambayes > > > > > > Great idea guys, but I cannot enable the Outlook plugin after > > installing it. > > > > I'm running Win2K Server and Outlook 2000. Spambayes installs > > OK, I went > > through the training session and trained hundreds of spam and > > good messages, > > have defined a watch and unsure folder, set the filtering > > options to move > > certain and suspected spam to these folders, checked the > > "Enable Spambayes" > > box in the General tab, and closed it. But no messages are > > ever moved to the > > spam folder. > > > > Opening the SpamBayes Manager again shows the Enable button is still > > unchecked. I can check it again but closing the dialog, or > > even switching to > > a different tab and back, will uncheck it. I created a Spam > > view column as > > suggested and it does show a high percent for spam, it just > > never moves it > > to my defined spam folder. Even 100% spam stays in my Inbox. > > > > If I run "Filter Now" on my Inbox and select "Perform all > > filter actions" it > > will move it to the spam folder, but I want it to do this > > automatically with > > mail as it arrives. > > > > Richard Hough > > > > _______________________________________________ > > Spambayes@python.org > > http://mail.python.org/mailman/listinfo/spambayes > > Check the FAQ before asking: http://spambayes.sf.net/faq.html > > From mhammond at skippinet.com.au Tue Sep 16 19:30:53 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue Sep 16 19:30:50 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13034722A1@its-xchg4.massey.ac.nz> Message-ID: <018401c37caa$93e07070$f502a8c0@eden> > From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] > > Since it's so easy to suck existing mailboxes into the > > Outlook plugin, perhaps it should warn the user if the > > ham:spam ratio gets too out-of-whack. > > I think this is probably a good idea. What "too Outlook2000\dialogs\dialog_map.py has the following class: class WizardTrainingStatusProcessor(ControlProcessor): def Init(self): bayes = self.window.manager.classifier_data.bayes nspam = bayes.nspam nham = bayes.nham if nspam > 10 and nham > 10: msg = "SpamBayes has been successfully trained and configured. " \ "You should find the system is immediately effective at " \ "filtering spam." else: msg = "SpamBayes has been successfully trained and configured. " \ "However, as the number of messages trained is quite small, " \ "SpamBayes may take some time to become truly effective." win32gui.SendMessage(self.GetControl(), win32con.WM_SETTEXT, 0, msg) This would be a simple place to add this status information (it shows up in the "manager" dialog). Seeing as I picked the number "10" from my anus, I see no reason we couldn't do the same with magic cutoffs for an inbalance warning Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2344 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030917/d655cf2e/winmail.bin From T.A.Meyer at massey.ac.nz Tue Sep 16 19:53:19 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 16 19:53:33 2003 Subject: [Spambayes] Obvious Spam Missed... Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13034722EA@its-xchg4.massey.ac.nz> [Skip] > Since it's so easy to suck existing mailboxes into the > Outlook plugin, perhaps it should warn the user if the > ham:spam ratio gets too out-of-whack. [Mark] > Outlook2000\dialogs\dialog_map.py has the following class: > > class WizardTrainingStatusProcessor(ControlProcessor): [...] > This would be a simple place to add this status information > (it shows up in the "manager" dialog). Seeing as I picked > the number "10" from my anus, I see no reason we couldn't do > the same with magic cutoffs for an inbalance warning :) I've checked this in. I chose a ratio of 1:5 or 5:1 as worth a warning. (Note that someone with a tool to adjust the dialog layout needs to make the status panel a wee bit bigger to hold the error message, or the error message needs to be reworded a bit to fit). =Tony Meyer From tim.one at comcast.net Tue Sep 16 21:48:21 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 16 21:48:26 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: <20030916174601.X36425@thermonuclear.org> Message-ID: [Peter Beckman] > What about including a "starter" HammieDB, something that changes over > time, based on a bunch of different people's settings. > > By starting with a nice ratio, hopefully most of the spam will go to > the right place, and in the beginning, a lot of ham might be > classified as "unsure", but that would happen anyway. > > At least most of the current "spam" would get marked as such. > > I know, email is a very personal thing and can't be "generalized" but > I wonder if this would help in some cases where the people were less > than aware of the ham/spam training stuff. InBoxer includes a starter database. I think they put a lot of work into that, which doesn't mean we can't, but does mean it requires someone who believes in it enough to devote their free time to doing that work. One thing to watch out for is that if you put "too much" data into the starter database, more additional training is needed to cater to personal quirks than if a new user starts with an empty database. There's also a real danger of systematically ruining the classifier for some classes of users. For example, one of the co-founders of the Motley Fool posted last week, saying the folks who work there are happy with spambayes. But any reasonably large spam collection is going to contain a ton of stock-pumping scam spam, and that can poison the database for people who work with stocks for a living ("Dow" appears in 500 spam and no ham at the start ... OK, now it appears in 1 ham ..., OK, now in 2 ham, ... overcoming the initial bias can be a bitch). Not to mention those of us who moonlight as porn stars . From mail at peterbe.com Wed Sep 17 02:58:30 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Wed Sep 17 02:58:32 2003 Subject: [Spambayes] pop3proxy gave up all of a sudden Message-ID: <3F680616.9090707@peterbe.com> Spambayes (1.0a4) was working just fine. Then I restarted the computer and next time I get this message: C:\Python22\spambayes-1.0a4>"c:\Python22\python.exe" "C:\Python22\spambayes-1.0 4\pop3proxy.py" SpamBayes POP3 Proxy Beta1, version 0.1 (May 2003), using SpamBayes POP3 Proxy Web Interface Alpha2, version 0.02 and engine SpamBayes Beta2, version 0.2 (July 2003). Loading database... Traceback (most recent call last): File "C:\Python22\spambayes-1.0a4\pop3proxy.py", line 777, in ? run() File "C:\Python22\spambayes-1.0a4\pop3proxy.py", line 752, in run state.createWorkers() File "C:\Python22\spambayes-1.0a4\pop3proxy.py", line 594, in createWorkers self.bayes = storage.DBDictClassifier(filename) File "spambayes\storage.py", line 147, in __init__ self.load() File "spambayes\storage.py", line 159, in load t = self.db[self.statekey] File "C:\Python22\lib\shelve.py", line 71, in __getitem__ return Unpickler(f).load() EOFError Exception exceptions.AttributeError: "'NoneType' object has no attribute 'error " in > ignored Any idea what to do? From T.A.Meyer at massey.ac.nz Wed Sep 17 03:01:38 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 17 03:01:52 2003 Subject: [Spambayes] pop3proxy gave up all of a sudden Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303472517@its-xchg4.massey.ac.nz> > Spambayes (1.0a4) was working just fine. Then I restarted the > computer and next time I get this message: [...] > " in instance at 0x00A29E8>> ignored > > Any idea what to do? Please see the FAQ: It would also be worth upgrading to 1.0a5 - if you do so you'll find that you can no longer use dumbdbm (the FAQ needs updating; I'll try and get to that shortly). =Tony Meyer From ram at mastr.net Wed Sep 17 06:15:02 2003 From: ram at mastr.net (Ralph Mastrangelo) Date: Wed Sep 17 06:15:06 2003 Subject: [Spambayes] Spam Clues Message-ID: <005f01c37d04$9256a940$6401a8c0@rogue5> Hi, A brief background of my outlook setup and spambayes plugin. I have been using spambayes for a few weeks now, and updated to the new version sometime last week. I have one spam folder and two folders I have all my good mail sent too. I have done the training a few times and I have about 100 spam messages saved. I do use the Delete as spam button to put spam in my spam folder. When I look at the spam clues on my spam it say's Spam Score: 100% (1) but it does not move it to the spam folder. I have the slider set to spam = 1.0. I would think that if its 100% it wold be moved. Anyone have any ideas? Thanks -Ralph From anders.eriksson at morateknikutveckling.se Wed Sep 17 07:27:36 2003 From: anders.eriksson at morateknikutveckling.se (Anders Eriksson) Date: Wed Sep 17 07:27:43 2003 Subject: [Spambayes] Re: Spam Clues References: <005f01c37d04$9256a940$6401a8c0@rogue5> Message-ID: Hello Ralph! On Wed, 17 Sep 2003 06:15:02 -0400, Ralph Mastrangelo wrote: > When I look at the spam clues on my spam it say's Spam Score: 100% (1) but > it does not move it to the spam folder. I have the slider set to spam = 1.0. > I would think that if its 100% it wold be moved. > > Anyone have any ideas? Did you reboot the computer after installing? I got the same strange behavior but after a reboot everything started to work! Hope it does for you also! // Anders From skip at pobox.com Wed Sep 17 10:19:04 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 17 10:19:17 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: References: <20030916174601.X36425@thermonuclear.org> Message-ID: <16232.27992.594713.446593@montanaro.dyndns.org> Tim> One thing to watch out for is that if you put "too much" data into Tim> the starter database, more additional training is needed to cater Tim> to personal quirks than if a new user starts with an empty Tim> database. If we do something like that, I think we should train on a very small set of mails, maybe no more than 20-30 of each class. After all, I think all we're trying to do is give the new user's incoming mail an initial nudge in the right direction. Ideally, the mails should be spread across domains, senders and recipients. If that's not possible, header clues which relate to the senders or recipients should be deleted before shipping. Skip From ram at mastr.net Wed Sep 17 11:56:16 2003 From: ram at mastr.net (Ralph Mastrangelo) Date: Wed Sep 17 11:56:29 2003 Subject: [Spambayes] Re: Spam Clues In-Reply-To: Message-ID: <007801c37d34$3d7d0240$6401a8c0@rogue5> Hi Anders, I did do a reboot not right after the install but with in a few days of it. Same with the upgrade. If the slider goes to 100 why does the score of the message always 100% (1). -Ralph -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Anders Eriksson Sent: Wednesday, September 17, 2003 7:28 AM To: spambayes@python.org Subject: [Spambayes] Re: Spam Clues Hello Ralph! On Wed, 17 Sep 2003 06:15:02 -0400, Ralph Mastrangelo wrote: > When I look at the spam clues on my spam it say's Spam Score: 100% > (1) but it does not move it to the spam folder. I have the slider set > to spam = 1.0. I would think that if its 100% it wold be moved. > > Anyone have any ideas? Did you reboot the computer after installing? I got the same strange behavior but after a reboot everything started to work! Hope it does for you also! // Anders _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From rmalayter at bai.org Wed Sep 17 13:02:27 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Sep 17 13:02:30 2003 Subject: [Spambayes] Spam Clues Message-ID: <792DE28E91F6EA42B4663AE761C41C2A011151EF@cliff.bai.org> From: Ralph Mastrangelo [mailto:ram@mastr.net] >When I look at the spam clues on my spam it say's Spam Score: 100% (1) >but it does not move it to the spam folder. I have the slider set to >spam = 1.0. I would think that if its 100% it wold be moved. I don't think anything ever scores a complete 100%, just 0.999947 or something like that that rounds up to 100%. So if your threshold is set at 100%, nothing will ever get moved to the spam folder. That said, why do you want your threshold at 100%? I get 0.5% false positives with the threshold set at 90%. -ryan- From bill at parducci.net Wed Sep 17 13:23:13 2003 From: bill at parducci.net (bill parducci) Date: Wed Sep 17 13:23:22 2003 Subject: [Spambayes] Obvious Spam Missed... In-Reply-To: <16232.27992.594713.446593@montanaro.dyndns.org> References: <20030916174601.X36425@thermonuclear.org> <16232.27992.594713.446593@montanaro.dyndns.org> Message-ID: <3F689881.3080705@parducci.net> Skip Montanaro wrote: Tim> One thing to watch out for is that if you put "too much" data into Tim> the starter database, more additional training is needed to cater Tim> to personal quirks than if a new user starts with an empty Tim> database. > > If we do something like that, I think we should train on a very small set of > mails, maybe no more than 20-30 of each class. After all, I think all we're > trying to do is give the new user's incoming mail an initial nudge in the > right direction. Ideally, the mails should be spread across domains, > senders and recipients. If that's not possible, header clues which relate > to the senders or recipients should be deleted before shipping. since the skew can work both ways (should someone like tim include their extracurricular activities in the ham training sample :o), wouldn't it make sense to create a number of initial databases with *only* spam in them and let the user train an appropriate amount of ham as part of the install? anecdotal evidence suggests that just about everyone has some ham laying around, yet not everyone keeps spam about. by having a couple of sample dbs (e.g. 10 spam, 50 spam, 100 spam) you could offer the value of kick starting a db, while reducing the potential for skew. just a thought... b From Arminiafan at t-online.de Wed Sep 17 13:28:10 2003 From: Arminiafan at t-online.de (Arminiafan@t-online.de) Date: Wed Sep 17 13:29:52 2003 Subject: [Spambayes] Spam Bayes for under 2000??? Message-ID: <001001c37d41$1d13f7e0$3a0afea9@S0D5M6> Hallo, Spambayes-Team. I have heard of Your faboulous program from a friend and as it's quiet simple for a user like me (I am a rookie), I wanted to get it. But I think, it's only a program for Win 2000/XP/ME etc. user, is it? My English language is weak, sorry. But reading Your instructions, I think so. I use my good old Outlook 5.0, and so I wanted to ask You, if there is any chance to use Your program Spam Bayes also for this very old version of Outlook. Don't You have a version of Spam Bayes for "old" users like me? I tried many programs of anti-spam, but some do not what I want *g* and some other are too complex for myself. As my friend shows me your Spam Bayes, I was very impressed and for the first time I had the hope, that I could terminate my 10, 20, 30 Spam-Mails daily. But now it seemes, I never could use it. But as I tell You, my English is weak and perhaps I misunderstood your homepage or perhaps I haven't seen the installation instructions for Outlook 5.0. Could this be? Will I be able to use Spam Bayes for my Outlook 5.0 on Windows 98 (don't laugh - it's true what I say *g*) With kind regards M. Eddie Holzk?mper, Germany From skip at pobox.com Wed Sep 17 13:54:42 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 17 13:54:53 2003 Subject: [Spambayes] Generating a sample training database In-Reply-To: <3F689881.3080705@parducci.net> References: <20030916174601.X36425@thermonuclear.org> <16232.27992.594713.446593@montanaro.dyndns.org> <3F689881.3080705@parducci.net> Message-ID: <16232.40930.934829.840568@montanaro.dyndns.org> bill> since the skew can work both ways (should someone like tim include bill> their extracurricular activities in the ham training sample :o), bill> wouldn't it make sense to create a number of initial databases bill> with *only* spam in them and let the user train an appropriate bill> amount of ham as part of the install? anecdotal evidence suggests bill> that just about everyone has some ham laying around, yet not bill> everyone keeps spam about. In theory, however I don't think it's trivial for people using the Outlook plugin to train on a single mail message. They'd have to move several messages from valid hammy mailboxes to a new one, train on it, then move the messages back to their original locations. We'll just have to try it and see. I'll take the lead in grabbing the ham and spam and putting together a sample training database (pickle format seems easiest). If you'd like to contribute (no more than two ham and two spam per person please), forward such messages to me and make sure the Subject: includes "Sample Ham" or "Sample Spam". I will filter such messages out using procmail before SpamBayes can see them. Skip From papaDoc at videotron.ca Wed Sep 17 15:47:36 2003 From: papaDoc at videotron.ca (papaDoc) Date: Wed Sep 17 15:46:49 2003 Subject: [Spambayes] Error in pop3proxy Message-ID: <3F68BA58.1030500@videotron.ca> Hi, Today I got the same error message that these two emails describe Message sent the 26/05/2003: [Spambayes] web interface: 500 Server error when training Message sent the 06/08/2003: [Spambayes] Day 1 stack trace 500 Server error Traceback (most recent call last): File "C:\Devtools\SPAMBA~1\SPAMBA~1.0A5\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "C:\Devtools\SPAMBA~1\SPAMBA~1.0A5\spambayes\ProxyUI.py", line 395, in onReview message = spambayes.mboxutils.get_message(cachedMessage.getSubstance()) File "C:\Devtools\SPAMBA~1\SPAMBA~1.0A5\spambayes\Corpus.py", line 349, in getSubstance return self.hdrtxt + self.payload File "C:\Devtools\SPAMBA~1\SPAMBA~1.0A5\spambayes\Corpus.py", line 296, in __getattr__ raise AttributeError, attributeName AttributeError: hdrtxt I was reviewing my email in pop3proxy web interface and at the same time I received an email. When I try to train I got the above error message. I'm using the pop3proxy from 1.a5 SpamBayes POP3 Proxy Beta2, version 0.2 (September 2003), using SpamBayes POP3 Proxy Web Interface Alpha3, version 0.03 and engine SpamBayes Beta2, version 0.2 (July 2003). Note to Richie Hindle: You wanted a copy of pop3proxy-unknown-cache I don't know if you still want it ? Remi From richie at entrian.com Wed Sep 17 16:19:47 2003 From: richie at entrian.com (Richie Hindle) Date: Wed Sep 17 16:20:03 2003 Subject: [Spambayes] Error in pop3proxy In-Reply-To: <3F68BA58.1030500@videotron.ca> References: <3F68BA58.1030500@videotron.ca> Message-ID: <6bghmv07hsu6fi2d0gdqdflnpbibon5vtn@4ax.com> [Remi] > AttributeError: hdrtxt > [...] > Note to Richie Hindle: You wanted a copy of pop3proxy-unknown-cache > I don't know if you still want it ? Yes please, if you don't mind sending it. Do you always get the error when training on that cache? Even after restarting the proxy? -- Richie Hindle richie@entrian.com From tim.one at comcast.net Wed Sep 17 22:31:16 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed Sep 17 22:31:22 2003 Subject: [Spambayes] RE: [spambayes-dev] Generating a sample training database In-Reply-To: <16232.40930.934829.840568@montanaro.dyndns.org> Message-ID: [Skip Montanaro] > In theory, however I don't think it's trivial for people using the > Outlook plugin to train on a single mail message. They'd have to > move several messages from valid hammy mailboxes to a new one, train > on it, then move the messages back to their original locations. It's not that hard, although I don't know how many users understand all the things they can do with Outlook. For example, I keep distinct ham and spam training folders (and those are *all* I train from). When I want to train on, e.g., a selection of ham, I Ctrl-Left-Click the ones I want, and drag the multi-selection to the ham training folder while holding the right mouse button down. When the button is released, a little menu pops up asking whether I want to Move (the messages), Copy (the messages), or Cancel (forget the whole thing). I select Copy, and that's the end of it. It takes much longer to read this sentence than to perform the whole operation. There's an even simpler way to copy: drag the selection to the desired folder while holding the Ctrl key down. I can never remember that, though (when using extreme shortcuts, I always end up copying when I want to move, and vice versa), so stick to the method that asks me what I want when it's nearly over (btw, same thing (depress right button while dragging) works in Windows Explorer for copying, moving, or linking files between folders). > We'll just have to try it and see. > > I'll take the lead in grabbing the ham and spam and putting together a > sample training database Cool! Thank you. > (pickle format seems easiest). If you'd like to contribute (no more > than two ham and two spam per person please), forward such messages to > me and make sure the Subject: includes "Sample Ham" or "Sample Spam". > I will filter such messages out using procmail before SpamBayes can > see them. Offhand, I suggest disabling all header-line clue generation except for Subject line. From mhammond at skippinet.com.au Wed Sep 17 23:22:24 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed Sep 17 23:22:20 2003 Subject: [Spambayes] RE: [spambayes-dev] Generating a sample training database In-Reply-To: Message-ID: <04d201c37d94$14c03b00$f502a8c0@eden> > There's an even simpler way to copy: drag the selection to > the desired > folder while holding the Ctrl key down. I can never remember > that, though > (when using extreme shortcuts, I always end up copying when I > want to move, > and vice versa), so stick to the method that asks me what I > want when it's > nearly over (btw, same thing (depress right button while > dragging) works in > Windows Explorer for copying, moving, or linking files > between folders). I can never remember - but I just look for the little "+" sign in the drag icon - if there is a "+", then new items will be added (ie, items will be copied). Without the "+", no new items are added, so it is a move. Windows Explorer gives the same hints. Mark. From cedmonds at verio.net Thu Sep 18 11:27:32 2003 From: cedmonds at verio.net (Christopher Edmonds) Date: Thu Sep 18 11:27:40 2003 Subject: [Spambayes] Spam Buttons don't work Message-ID: <000001c37df9$6179d950$2de9fea9@corp.verio.net> I loaded Spam Bay. It was working fine for a month and abruptly stopped working. I loaded the newest version (006) and it doesn't do anything. I have the two buttons in my tool bar. However, the buttons are non-responsive. I click the SpamBayes and then SpamBayes Manager and nothing opens up. Any ideas? Regards, Christopher Edmonds Senior Manager / Director of Account Management NTT/Verio - A Global Fortune 15 Company mobile - (503) 577-1023 fax - (503) 296-2887 ___________________________________________ NTT/VERIO IS THE BUSINESS ISP OF CHOICE... - Global Reach, Local Presence covering 5 continents and over 150 countries. - Combining Cisco and Juniper technologies to create the most powerful Tier 1 network in the world. - Customized connectivity solutions to match your business needs. - The ONLY provider with an average and maximum Jitter SLA. - The first US ISP to deliver large-scale IPv6 connectivity. - Award winning Customer Service recognized by Boardwatch magazine. - NTT is the only telecom company to rate an AA- with S&P and Aa2 with Moody's. From ventnor5 at cox.net Thu Sep 18 12:34:54 2003 From: ventnor5 at cox.net (Ventnor5) Date: Thu Sep 18 12:34:54 2003 Subject: [Spambayes] SpamBayes: Install error under Windows XP Pro SP1, Outlook 2002 SP2 Message-ID: <000501c37e02$cae36380$e1396444@user3z4k1u67qy> > We have Windows XP Pro SP1 and run MS Outlook 2002 SP2. > > Before install: rebooted so MS Word was not running hidden; disabled > anti-virus software (McAfee 7.0) . > Ran the downloaded file Spambayes-Outlook-Setup-0081.exe, but got this > error message: > > Error > C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll > > Unable to register the DLL/OCX: DllRegisterServer failed; code > 0x00000000. > > Click Retry to try again, Ignore to proceed anyway (not > recommended), > or Abort to cancel installation." > > Clicked "Retry" button but the same message came up again. > > At http://sourceforge.net/tracker/?group_id=61702&atid=498104, I see > someone had the same problem back in June 2003. Hope you can help. > Thanks. > > Charlie > -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2268 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030918/62814bf0/winmail.bin From aaraines at pobox.com Thu Sep 18 12:50:31 2003 From: aaraines at pobox.com (Andrew A. Raines) Date: Thu Sep 18 13:00:32 2003 Subject: [Spambayes] mboxtrain croaks on spam mbox file Message-ID: Using spambayes-1.0a5, I get this error: - aar@packer:mboxes(510)$ ~/src/spambayes/mboxtrain.py -d ~/.hammiedb -s spam-archive-1 Training spam (spam-archive-1): Reading as Unix mbox Traceback (most recent call last): File "/home/aar/src/spambayes/mboxtrain.py", line 304, in ? main() File "/home/aar/src/spambayes/mboxtrain.py", line 296, in main train(h, s, True, force, trainnew, removetrained) File "/home/aar/src/spambayes/mboxtrain.py", line 221, in train mbox_train(h, path, is_spam, force) File "/home/aar/src/spambayes/mboxtrain.py", line 155, in mbox_train if msg_train(h, msg, is_spam, force): File "/home/aar/src/spambayes/mboxtrain.py", line 83, in msg_train h.train(msg, is_spam) File "/export/home/aar/src/spambayes-1.0a5/spambayes/hammie.py", line 150, in train self.bayes.learn(tokenize(msg), is_spam) File "/export/home/aar/src/spambayes-1.0a5/spambayes/classifier.py", line 276, in learn self._add_msg(wordstream, is_spam) File "/export/home/aar/src/spambayes-1.0a5/spambayes/classifier.py", line 401, in _add_msg for word in Set(wordstream): File "/usr/lib/python2.3/sets.py", line 399, in __init__ self._update(iterable) File "/usr/lib/python2.3/sets.py", line 353, in _update for element in iterable: File "/export/home/aar/src/spambayes-1.0a5/spambayes/tokenizer.py", line 1082, in tokenize for tok in self.tokenize_headers(msg): File "/export/home/aar/src/spambayes-1.0a5/spambayes/tokenizer.py", line 1093, in tokenize_headers for w in crack_content_xyz(x): File "/export/home/aar/src/spambayes-1.0a5/spambayes/tokenizer.py", line 806, in crack_content_xyz fname = msg.get_filename() File "/usr/lib/python2.3/email/Message.py", line 711, in get_filename return unicode(newvalue[2], newvalue[0]) TypeError: unicode() argument 2 must be string, not None - Any idea what the email module is actually choking on? There are 2,110 messages in spam-archive-1 and this error pops up around spam number 1,800, judging from the running tally. Thanks. -Drew From jaspenn at cox.net Thu Sep 18 13:34:35 2003 From: jaspenn at cox.net (jaspenn) Date: Thu Sep 18 13:30:45 2003 Subject: [Spambayes] Configuration Wizard won't run Message-ID: <001f01c37e0b$211d8a70$5200000a@JIMP> I am using Outlook 2000 on Windows 2000. I installed Spambayes and had it working properly. I then in advertently deleted the "spam email" folder and got a message that I need to configure my folder list. However, the configuration wizard will not activate. I've reinstalled Spambayes, but still cannot get the configuration wizard to operate. Any suggestions to make it work properly? Thank you for your help. Jim Pennington The Tom Jordan Team REMAX Horizons Realty 850-476-6000 office 850-982-3306 cell 850-479-1200 home From mark_fitzgerald at keybank.com Thu Sep 18 13:42:20 2003 From: mark_fitzgerald at keybank.com (mark_fitzgerald@keybank.com) Date: Thu Sep 18 13:43:18 2003 Subject: [Spambayes] access to database Message-ID: Is there a way to get access to the raw database in some usable format? I'm sure this has been asked a hundred times before, but I'm trying to just get a aggregate of the word list for all messages in my SPAM folder.....thanks! - Mark From skip at pobox.com Thu Sep 18 13:51:44 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 18 13:54:53 2003 Subject: [Spambayes] mboxtrain croaks on spam mbox file In-Reply-To: References: Message-ID: <16233.61616.898005.416160@montanaro.dyndns.org> >>>>> "Drew" == Andrew A Raines writes: Drew> Using spambayes-1.0a5, I get this error: ... Drew> Any idea what the email module is actually choking on? There are Drew> 2,110 messages in spam-archive-1 and this error pops up around Drew> spam number 1,800, judging from the running tally. Can you narrow it down to a single message and then attach it to a mail to the spambayes list? You might also try changing the msg.get_filename() call in tokenizer.py to try: fname = msg.get_filename() except TypeError: print >> sys.stderr, "Error:", msg.get_filename() raise Skip From skip at pobox.com Thu Sep 18 14:16:38 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 18 14:16:48 2003 Subject: [Spambayes] New SB option in CVS: html_ui:rows_per_section Message-ID: <16233.63110.429376.640670@montanaro.dyndns.org> I found it impossible to use sb_server (aka pop3proxy) if I had a large number of untrained mails. If I clicked a Discard link to set all the radio buttons in a section I was never patient enough to let the operation complete. I always wound up killing Safari after five or ten minutes. It turns out that Safari's JavaScript implementation just sucks in this regard. I modified the onHeader() function to call "alert(i)" every 100 iterations. Each interval between alert popups got longer and longer. Internet Explorer for the Mac was consistent in this regard. This suggests that Apple decided to use a linked list instead of an array for this particular data structure. At any rate, I just checked in a new option for the [html_ui] section of your ini file: "rows_per_section". It specifies (oddly enough) the maximum number of rows to display in each section of the review page. It defaults to 10000, so you shouldn't see a difference if you don't set it. On the other hand, if you set it to a "reasonable" number, say 20 or 50, you should find your review page displays faster and your keyboard doesn't sprout cobwebs while you wait for JavaScript to set all the radio buttons in a section to "Discard", at least if you use Safari. Skip From aaraines at pobox.com Thu Sep 18 14:44:15 2003 From: aaraines at pobox.com (Andrew A. Raines) Date: Thu Sep 18 14:44:00 2003 Subject: [Spambayes] Re: mboxtrain croaks on spam mbox file References: <16233.61616.898005.416160@montanaro.dyndns.org> Message-ID: Skip Montanaro writes: > Can you narrow it down to a single message and then attach it > to a mail to the spambayes list? This was, uh, tricky. Because an mbox doesn't really give me any placemarkers, and mboxtrain doesn't report the line number of the file on error, I had to split everything up. With a little reformail foo I created a separate one-file-per-directory directory for every message in the mbox which would explicitly show me on which message mboxtrain died on. Anyway, after running mboxtrain on all these baby MH's, I finally found the culprit, #1688: ,---- | From nobody Thu Aug 7 09:55:57 2003 | Return-Path: | X-Gnus-Mail-Source: maildir:~/Maildir/inbox/new | Message-ID: | Delivered-To: aar@williams.mc.vanderbilt.edu | Received: (qmail 24184 invoked by alias); 7 Aug 2003 06:35:45 -0000 | Delivered-To: postmaster@williams.mc.vanderbilt.edu | Received: (qmail 24126 invoked from network); 7 Aug 2003 06:35:44 -0000 | Received: from unknown (HELO nessus) (160.129.223.39) | by williams.mc.vanderbilt.edu with SMTP; 7 Aug 2003 06:35:44 -0000 | From: nobody@example.com | To: postmaster@[160.129.208.222] | Organization: Nessus kabale | MIME-Version: 1.0 | Subject: Nessus antivirus test 3: alternative base64 attachment | Content-Type: multipart/mixed; boundary="=-=-=" | Xref: williams spam-archive-1:1689 | Lines: 13 | X-Gnus-Article-Number: 1689 Mon Aug 11 11:08:05 2003 | | | --=-=-= | | If you can read or execute the attachment, this means that you do not | have an antivirus, or that it was disabled. | | --=-=-= | Content-Type: application/octet-stream | Content-Disposition: attachment; filename*1="eicar."; filename*2="com" | Content-Description: EICAR test file | | X5O!P%@AP[4\PZX54(P^)7CC)7}$EICAR-STANDARD-ANTIVIRUS-TEST-FILE!$H+H* | --=-=-=-- `---- > You might also try changing the msg.get_filename() call in > tokenizer.py to > > try: > fname = msg.get_filename() > except TypeError: > print >> sys.stderr, "Error:", msg.get_filename() > raise After doing this, the resulting error becomes: ,---- | aar@packer:d-1688(607)$ ~/src/spambayes/mboxtrain.py -d ~/.hammiedb-test -s . | Training spam (.): | Reading as MH mailbox | Error:Traceback (most recent call last): | | [...] | | File "/export/home/aar/src/spambayes-1.0a5/spambayes/tokenizer.py", line 1090, in tokenize | for tok in self.tokenize_headers(msg): | File "/export/home/aar/src/spambayes-1.0a5/spambayes/tokenizer.py", line 1101, in tokenize_headers | for w in crack_content_xyz(x): | File "/export/home/aar/src/spambayes-1.0a5/spambayes/tokenizer.py", line 813, in crack_content_xyz | print >> sys.stderr, "Error:", msg.get_filename() | File "/usr/lib/python2.3/email/Message.py", line 711, in get_filename | return unicode(newvalue[2], newvalue[0]) | TypeError: unicode() argument 2 must be string, not None `---- Thanks, -Drew From aaraines at pobox.com Thu Sep 18 15:35:16 2003 From: aaraines at pobox.com (Andrew A. Raines) Date: Thu Sep 18 15:34:59 2003 Subject: [Spambayes] Handy mbox debug technique (was: mboxtrain croaks on spam mbox file) References: <16233.61616.898005.416160@montanaro.dyndns.org> Message-ID: "Andrew A. Raines" writes: > Because an mbox doesn't really give me any placemarkers, and > mboxtrain doesn't report the line number of the file on error, > I had to split everything up. With a little reformail foo I > created a separate one-file-per-directory directory for every > message in the mbox which would explicitly show me on which > message mboxtrain died on. I had a request for the process I used to accomplish this, so I'll share it with the public for the requester's sake and others'. Copy this directly into your interactive shell: # create a diag workspace mkdir /tmp/wk; cd /tmp/wk # create a script to exploit reformail(1)'s -s functionality cat <split #!/bin/sh messdir=d-\$FILENO; mkdir \$messdir; cd \$messdir cat >\$FILENO EOF chmod +x split mkdir spam; cd spam env FILENO=1 reformail -s ../split References: <16233.61616.898005.416160@montanaro.dyndns.org> Message-ID: <16234.7605.531986.531963@montanaro.dyndns.org> Drew> Skip Montanaro writes: >> Can you narrow it down to a single message and then attach it to a >> mail to the spambayes list? ... Drew> Anyway, after running mboxtrain on all these baby MH's, I Drew> finally found the culprit, #1688: ... As I mentioned on the spambayes-dev list, I think this is a bug in the email package. I just checked in a patch to spambayes/tokenizer.py (v. 1.15) which worms around the problem. If you can't check out tokenizer.py for some reason, I've also attached a context diff against v. 1.14. Skip -------------- next part -------------- A non-text attachment was scrubbed... Name: tokenizer.diff Type: application/octet-stream Size: 1215 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030918/5cf00c84/tokenizer-0001.obj From skip at pobox.com Thu Sep 18 17:08:09 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 18 17:08:19 2003 Subject: [Spambayes] Re: mboxtrain croaks on spam mbox file In-Reply-To: References: <16233.61616.898005.416160@montanaro.dyndns.org> Message-ID: <16234.7865.233099.40522@montanaro.dyndns.org> Drew> Skip Montanaro writes: >> Can you narrow it down to a single message and then attach it >> to a mail to the spambayes list? Drew> This was, uh, tricky.... One other thing. You could also do a binary split on the mbox file. Open it in a text editor, go to around the midpoint, write the first half out to mbox1, the second to mbox2. mboxtrain on both, then repeat the split operation for the mbox file which exhibits the failure. With 2000 messages, you should need only 7 or 8 splits to isolate the culprit message. Skip From aaraines at pobox.com Thu Sep 18 17:10:32 2003 From: aaraines at pobox.com (Andrew A. Raines) Date: Thu Sep 18 17:10:15 2003 Subject: [Spambayes] Re: mboxtrain croaks on spam mbox file References: <16233.61616.898005.416160@montanaro.dyndns.org> <16234.7605.531986.531963@montanaro.dyndns.org> Message-ID: Skip Montanaro writes: > Drew> Skip Montanaro writes: > >> Can you narrow it down to a single message and then attach it to a > >> mail to the spambayes list? > ... > Drew> Anyway, after running mboxtrain on all these baby MH's, I > Drew> finally found the culprit, #1688: > ... > > As I mentioned on the spambayes-dev list, I think this is a bug in the email > package. I just checked in a patch to spambayes/tokenizer.py (v. 1.15) > which worms around the problem. If you can't check out tokenizer.py for > some reason, I've also attached a context diff against v. 1.14. Thanks Skip. I'll follow -dev to see if anything transpires from Barry. -Drew From T.A.Meyer at massey.ac.nz Thu Sep 18 17:49:45 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 18 17:49:57 2003 Subject: [Spambayes] Spam Buttons don't work Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130347292B@its-xchg4.massey.ac.nz> > I loaded Spam Bay. It was working fine for a month and > abruptly stopped working. I loaded the newest version (006) > and it doesn't do anything. The latest version is 008.1. It would be well worth upgrading to this and seeing if that solves the problem. =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Sep 18 17:51:51 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 18 17:52:00 2003 Subject: [Spambayes] access to database Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303472931@its-xchg4.massey.ac.nz> > Is there a way to get access to the raw database in some > usable format? I'm sure this has been asked a hundred times before, > but I'm trying to just get a aggregate of the word list for all > messages in my SPAM folder.....thanks! Use the dbExpImp.py (sb_dbexpimp.py in cvs) script which is in the source releases. If you run "sb_dbexpimp.py -h" from a command line it will tell you how to use it. Basically you can use it to convert the database to a '`' separated text file, which you can then open in whatever you like (Excel, for example). =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Sep 18 17:55:26 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 18 17:55:40 2003 Subject: [Spambayes] Handy mbox debug technique (was: mboxtrain croaks on spam mbox file) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303472938@its-xchg4.massey.ac.nz> >> With a little reformail foo I created a separate >> one-file-per-directory directory for every message in the >> mbox which would explicitly show me on which message mboxtrain >> died on. > I had a request for the process I used to accomplish this, so > I'll share it with the public for the requester's sake and others'. I could be wrong, but I'm pretty sure that this is what utilities/splitndirs.py does, so people could also give that a go. utilities/split.py and utilities/splitn.py do something similar to Skip's searching suggestion. =Tony Meyer From mhammond at skippinet.com.au Thu Sep 18 18:12:58 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu Sep 18 18:13:53 2003 Subject: [Spambayes] Configuration Wizard won't run In-Reply-To: <001f01c37e0b$211d8a70$5200000a@JIMP> Message-ID: <071901c37e32$04da8440$f502a8c0@eden> > I am using Outlook 2000 on Windows 2000. > I installed Spambayes and had it working properly. I then in > advertently > deleted the "spam email" folder and got a message that I need > to configure > my folder list. However, the configuration wizard will not activate. > I've reinstalled Spambayes, but still cannot get the > configuration wizard to > operate. > Any suggestions to make it work properly? > Thank you for your help. This is a bug in the wizard. You can work around it by simply using the "filtering" tab from the SpamBayes manager, and setting your folders there. Mark. From dennis at usbdt.com Thu Sep 18 12:39:46 2003 From: dennis at usbdt.com (Dennis Buettner) Date: Thu Sep 18 18:50:48 2003 Subject: [Spambayes] spambayes comment Message-ID: Sirs: Wonderful project. One comment. 1. When I select "Recover from SPAM", I believe it would be efficient to have SpamBayes analyze the content/sender of the email and then remember it for the next time. I have a certain "list" email that is consistently sent to the spam folder and the program never seems to learn that I want these emails. 2. I owe you a beer. Dennis Captain U.S. Beer Drinking Team From rmalayter at bai.org Thu Sep 18 19:06:24 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Thu Sep 18 19:06:27 2003 Subject: [Spambayes] spambayes comment Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01115206@cliff.bai.org> From: Dennis Buettner [mailto:dennis@usbdt.com] > 1. When I select "Recover from SPAM", I > believe it would be efficient to have > SpamBayes analyze the content/sender of > the email and then remember it for the > next time. I have a certain "list" email > that is consistently sent to the spam > folder and the program never seems to > learn that I want these emails. SPamBayes already does this. When you recover from spam, statistics from that message are entered into the training database. If you could post your "show spam clues" report to the list, we might be able to figure out why the messages aren't being let through. Perhaps you accidentally maked some other messages from that list as spam. > I owe you a beer. Not me, I didn't write it, I just use the thing. But if you're buying anyway... Regards, Ryan From rosenfeld at alum.mit.edu Thu Sep 18 19:12:35 2003 From: rosenfeld at alum.mit.edu (Rob Rosenfeld) Date: Thu Sep 18 19:12:39 2003 Subject: [Spambayes] Do you need to continue training ham? Message-ID: <003001c37e3a$58da72a0$e72a1c42@rrosenfeny> Hey folks. I have moved from SpamAssassin to the SpamBayes Outlook plug-in. The integration is great. I'm a bit confused about one part. I had stockpiles of ham and spam to initially train SpamBayes with. If I understand correctly, every time SpamBayes detects and moves a spam, it trains on it, kind of giving it "ongoing" spam training. Is that correct? If it doesn't move it as spam, does it train on it as ham? Thanks, Rob -- Rob Rosenfeld rosenfeld@alum.mit.edu From tim.one at comcast.net Thu Sep 18 19:38:13 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Sep 18 19:38:21 2003 Subject: [Spambayes] Do you need to continue training ham? In-Reply-To: <003001c37e3a$58da72a0$e72a1c42@rrosenfeny> Message-ID: [Rob Rosenfeld] > Hey folks. I have moved from SpamAssassin to the SpamBayes Outlook > plug-in. The integration is great. I'm a bit confused about one > part. I had stockpiles of ham and spam to initially train SpamBayes > with. Note that spambayes works best if you train on an approximately equal number of each. It doesn't take millions , either. For example, I started this project, and my home Outlook classifier still hasn't been trained on 2000 messages total (I get about 600 per day, and my classifier database is going on one year old, so I've trained on less than 1% of the email I've received in that time). > If I understand correctly, every time SpamBayes detects and > moves a spam, it trains on it, kind of giving it "ongoing" spam > training. Is that correct? Nope. > If it doesn't move it as spam, does it train on it as ham? Not that either. It auto-trains on messages for which you explicitly click the "Recover from Spam" or "Delete as Spam" buttons. In addition, it *may* train on messages *you* move to spam or ham folders, depending on which boxes you've checked in the spambayes Manager's Training tab, section "Incremental Training". These aren't necessarily ideal training protocols, but they're the best we've been able to implement so far that most users seem able to deal with. Ideal would be to train on a small random sample of all the email you get, and expire training messages over time too. That seems hard. From blayne at horizontechnology.co.nz Thu Sep 18 23:43:19 2003 From: blayne at horizontechnology.co.nz (Blayne Kriletich) Date: Thu Sep 18 23:45:10 2003 Subject: [Spambayes] Spam Bayes Losing its Brains Message-ID: Please let me know what is happening with Spambayes in these log files. It just forgets what it is doing. Regards Blayne Kriletich Systems Engineer Horizon Technology Ltd Tel: +64 9 914 0066 Fax: +64 9 914 0055 Email: blayne@horizontechnology.co.nz Any loss/damage incurred by using this material is not the senders (Horizon Technology Ltd) responsibility. LIMITATION OF LIABILITY. Horizon Technology Ltd entire liability will be limited to resupplying the material. No warranty is made that this material is correct, free from computer virus or defect. -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 9197 bytes Desc: spambayes1.log Url : http://mail.python.org/pipermail/spambayes/attachments/20030919/8bcf5b97/spambayes1-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes2.log Type: application/octet-stream Size: 1711 bytes Desc: spambayes2.log Url : http://mail.python.org/pipermail/spambayes/attachments/20030919/8bcf5b97/spambayes2-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 15656 bytes Desc: spambayes3.log Url : http://mail.python.org/pipermail/spambayes/attachments/20030919/8bcf5b97/spambayes3-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 8209 bytes Desc: spambayes4.log Url : http://mail.python.org/pipermail/spambayes/attachments/20030919/8bcf5b97/spambayes4-0001.obj From T.A.Meyer at massey.ac.nz Thu Sep 18 23:48:24 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 18 23:48:35 2003 Subject: [Spambayes] Spam Bayes Losing its Brains Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13035992C1@its-xchg4.massey.ac.nz> > Please let me know what is happening with Spambayes in these > log files. It just forgets what it is doing. >From the logs, it looks like you are using a very old version of the plug-in. I would recommend that you upgrade to the latest version (008.1), which most likely will resolve this issue. You can get the latest version from . You might also wish to subscribe to the (low-volume) announce list, so that you are informed when new versions are release: . =Tony Meyer From rejoyce7 at optonline.net Fri Sep 19 08:02:41 2003 From: rejoyce7 at optonline.net (joyce velehradsky) Date: Fri Sep 19 08:02:47 2003 Subject: [Spambayes] Duh..pardon my ignorance. Message-ID: I have XP windows system, Outlook 2002. Will this work on my system. And does this work as a Trojan horse, as some do? Thanks for educating the unrefined upcoming beginners... Joyce -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 3464 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030919/a33710b9/winmail.bin From caroleanthony at stny.rr.com Fri Sep 19 05:04:34 2003 From: caroleanthony at stny.rr.com (carole anthony) Date: Fri Sep 19 09:02:45 2003 Subject: [Spambayes] Thank you for contacting goZing Surveys! Message-ID: <000601c37e8d$0bbe1140$0300a8c0@stny.rr.com> Am unable to get into Paid survey "just for women" from e-mail. CaroleAnthony@stny.rr.com From dr3342 at sbc.com Fri Sep 19 10:01:16 2003 From: dr3342 at sbc.com (REED, DAVID T (SBCSI)) Date: Fri Sep 19 10:01:31 2003 Subject: [Spambayes] Configuration problem Message-ID: What causes the following to show up in the "Python Trace Collector" part of "PythonWin" when trying to configure where the spam and spam-maybe go? Attempted to set [Filter] spam_folder_id with invalid value ('0000000038A1BB1005E5101AA1BB08002B2A56C200006D737073742E646C6C00000000004E 495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E64205365747469 6E67735C6472333334325C454D61696C5C6472333334322E70737400', '00000000C5EDF55ADD21D011B13B0020AFECBFBF829E0000') () Attempted to set [Filter] unsure_folder_id with invalid value ('0000000038A1BB1005E5101AA1BB08002B2A56C200006D737073742E646C6C00000000004E 495441F9BFB80100AA0037D96E000000433A5C446F63756D656E747320616E64205365747469 6E67735C6472333334325C454D61696C5C6472333334322E70737400', '00000000C5EDF55ADD21D011B13B0020AFECBFBF429D0000') () This worked fine for the last couple of months, but it all of a sudden lost what folder to send spam to when I booted my PC today. I have since tried many different things to fix it, but I can not get it to recognize a valid folder as where to move the spam to. I am currently using the source code version 1.0a4 with Python 2.3. Thanks, David T. Reed Senior Systems Manager RIMS - Content Management Team Phone: (314) 235-0058 E-Mail: david.reed@sbc.com http://search.sbc.com/rims/ From dsteins at optonline.net Fri Sep 19 10:58:45 2003 From: dsteins at optonline.net (Douglas Steinschneider) Date: Fri Sep 19 11:04:15 2003 Subject: [Spambayes] Spambayes not performing as expected Message-ID: <006201c37ebe$865c5570$fc01a8c0@crmexchangeff.mscrm.local> Hello, I was successfully identifying 95% of spam with K9 and was setting it up for friends. I decided to try Spambayes because it appeared to be easier for the less knowledgeable users to control. I presented it with a folder filled with 2000 spam messages and about 600 good messages. This is the same mail that K9 was able to perform at 95% with only 1 false positive in the last 4 months. I receive about 15 non-spam emails a day and about 60 spam emails. I am now averaging 10 that are rated 0% but are obvious spam and 20 that are suspected spam. Spambayes is correctly rating the remaining 30 as spam. I know people are having better results. I checked to make sure that I didn't mistakenly include a folder of spam in my nonspam folders chosen for the training. My good emails are mostly correspondence with several list type mail such as this list and Langalist, Lockergnome, SQL Server Central and SSWUG (SQL Server Worldwide Users Group). There are also emails from purchases made etc. Thanks Doug From rmalayter at bai.org Fri Sep 19 11:34:49 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Fri Sep 19 11:34:52 2003 Subject: [Spambayes] Spambayes not performing as expected Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D421@cliff.bai.org> From: Douglas Steinschneider [mailto:dsteins@optonline.net] > I am now averaging 10 that are rated 0% but are obvious > spam and 20 that are suspected spam. Spambayes is correctly > rating the remaining 30 as spam. I have noticed this problem when Outlook Rules running mess up SpamBayes processing mail. When I connect to my Exchange mailbox from home, SpamBayes always scores the most-recent spam (the one at the top of my view) as 0%, and then scores everything else correctly. When I connect from the office, everything comes up much faster, and this doesn't happen. In the latest version (0.800.1) there's a slider on the advanced tab that you can use to make spambayes wait a few seconds before processing messages. I set it to 10 seconds at home, and it has mitigated (although not eliminated) the problem. Could you "run spam clues" on an obvious spam that is stuck in your inbox with a 0% score? I'll wager it scores above 90%, and you're seeing this timing bug... > I know people are having better results. Before I started using it from home and experienced the "mystery zeros" problem described above, I was seeing 98.1% filtering with exactly 1 false positive (an eVite, which I had never trained on before). This is over thousands of messages and undreds of spam. Regards, -Ryan- From skip at pobox.com Fri Sep 19 11:20:45 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Sep 19 12:14:17 2003 Subject: [Spambayes] Duh..pardon my ignorance. In-Reply-To: References: Message-ID: <16235.7885.26008.829732@montanaro.dyndns.org> joyce> I have XP windows system, Outlook 2002. Will this work on my joyce> system. Yes. Use the Outlook plugin available from http://spambayes.sourceforge.net/windows.html joyce> And does this work as a Trojan horse, as some do? Not sure what you're asking. How do other spam filters work as trojan horses? The SB Outlook plugin integrates into the Outlook user interface. If that doesn't suit your fancy, you can use the POP3 proxy or the IMAP filter instead. Skip From skip at pobox.com Fri Sep 19 11:07:23 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Sep 19 12:14:22 2003 Subject: [Spambayes] Do you need to continue training ham? In-Reply-To: References: <003001c37e3a$58da72a0$e72a1c42@rrosenfeny> Message-ID: <16235.7083.680078.71303@montanaro.dyndns.org> Tim> Ideal would be to train on a small random sample of all the email Tim> you get, and expire training messages over time too. That seems Tim> hard. This doesn't help with the Outlook plugin, but the sb_upload.py (nee proxytee.py) script accepts a '-r prob' flag which says to upload the message to sb_server.py (nee pop3proxy.py) with probability 'prob'. It keeps you from drowning in messages which demand classification, but also scratches the random sample itch. FYI. Skip From hauser at carneylaw.com Fri Sep 19 12:21:15 2003 From: hauser at carneylaw.com (Hauser, Lee) Date: Fri Sep 19 12:16:59 2003 Subject: [Spambayes] Problem with Outlook plugin Message-ID: <6E165C6D8BDFC042A0FDB4A8635A2F21010B455C@ed.carneylaw.local> I sent this message early this week and have received no response...does no one have any ideas? We have several machines running the Outlook plugin without problem. It's great software! However, a user on one machine had to tell Outlook to disable the plugin -- Outlook was having difficulties attaching to the Exchange server which actually had nothing to do with Spambayes, and when it asked if she wanted to disable it, she said "yes." Now nothing I do will make the plugin work again. The original plugin was 0.7. I have tried uninstalling, manually clearning the plugin's registry settings under HKCU, reinstalling, re-registering the plugin DLL, and installing 0.81. (Realtime virus protection is turned off during install). The toolbar is still there, but nothing happens when I click any menu item, and no filtering is going on. Any hints about what I can do about this? Lee Hauser, System Administrator Carney Badley Spellman 700 5th Ave Suite 5800 Seattle WA 98104 206.689.4312 From drkrause at comcast.net Fri Sep 19 13:06:58 2003 From: drkrause at comcast.net (Doug Krause) Date: Fri Sep 19 13:03:19 2003 Subject: [Spambayes] Filtering in multiple folders Message-ID: <003b01c37ed0$719603e0$6401a8c0@DG6LFH11> This is a resend of an old message. Just upgraded to Version .8.1.....same problem.....Any thoughts on what to do?? Thanks..... Hello ...great program...many thanks! I use the Outlook plug-in. I have set up several rules to sort mail to folders for different family members. The plug-in works fine on the primary Inbox folder, but does not seem to sort out the SPAM items before moving them to the other folders. They are rated however. What I do is open Anti-SPAM Manager and click on Filter Now. Filtering is set to check unread messages for all the mailboxes, and then will move the SPAM messages to the SPAM folder. Is there any way to make this a one-step, rather than 2-step process?? Thanks......Doug Doug Krause 215 896-1827 M dougkrause@comcast.net From rcoe at CambridgeMA.GOV Fri Sep 19 13:07:23 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Fri Sep 19 13:07:27 2003 Subject: [Spambayes] RE: Do you need to continue training ham? Message-ID: So to emphasize the "hamminess" of a group of messages (e.g., those from a given correspondent or sent to a given list), you could move them to your "Ambiguous" folder and then click "Recover from Spam". Right? Would that approach help those who have complained of a large number of false positives without forcing them into a complete retrain? Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: Tim Peters [mailto:tim.one@comcast.net] > Sent: Thursday, September 18, 2003 7:38 PM > To: Rob Rosenfeld; spambayes@python.org > Subject: RE: [Spambayes] Do you need to continue training ham? > > > [Rob Rosenfeld] > > Hey folks. I have moved from SpamAssassin to the SpamBayes Outlook > > plug-in. The integration is great. I'm a bit confused about one > > part. I had stockpiles of ham and spam to initially train SpamBayes > > with. > > Note that spambayes works best if you train on an > approximately equal number > of each. It doesn't take millions , either. For > example, I started > this project, and my home Outlook classifier still hasn't > been trained on > 2000 messages total (I get about 600 per day, and my > classifier database is > going on one year old, so I've trained on less than 1% of the > email I've > received in that time). > > > If I understand correctly, every time SpamBayes detects and > > moves a spam, it trains on it, kind of giving it "ongoing" spam > > training. Is that correct? > > Nope. > > > If it doesn't move it as spam, does it train on it as ham? > > Not that either. It auto-trains on messages for which you > explicitly click > the "Recover from Spam" or "Delete as Spam" buttons. In > addition, it *may* > train on messages *you* move to spam or ham folders, > depending on which > boxes you've checked in the spambayes Manager's Training tab, section > "Incremental Training". These aren't necessarily ideal > training protocols, > but they're the best we've been able to implement so far that > most users > seem able to deal with. Ideal would be to train on a small > random sample of > all the email you get, and expire training messages over time > too. That > seems hard. > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From Rick.Reid at uk.anker-systems.com Fri Sep 19 13:30:11 2003 From: Rick.Reid at uk.anker-systems.com (Rick Reid) Date: Fri Sep 19 13:30:19 2003 Subject: [Spambayes] Messages fail to filter Message-ID: <01A5C5582591FB44898B5CF7B23BFD8E01DB7555@msexchange.central.anker> Hi I have installed SpamBayes(binary version) version 0.81 as an add-in to Outlook XP running on XP Pro. I have followed all the instructions but new mail is not getting filtered at all whether it is spam or not. I have read the troubleshooting guide and could find no explanation because I have filtering enabled and all messages fail to give a Spam reading. I have looked through the bug reports but can find nothing similar. If I choose the 'Filter Messsages' option from the addin and ask it to filter my Inbox and ask it to 'Score messages but don't perform filter action', it does so and marks up the Spam column successfully. I have definitely set my Inbox as the folder to be monitored as messages arrive. I really like the look of SpamBayes and surely I must be doing something fundamentally wrong? I have checked and rechecked the settings and I am sure they are right. Could this be a bug? I doubt it, surely there must be other users with the same config? Can you help me? Would any more info be useful? Regards Rick Reid System Consultant tel. +44 (0)870 9051318 email rick.reid@uk.anker-systems.com URL: http://www.anker-systems.com The information in this Email and in any attachment is confidential and is intended solely for the addressee. Access, copying, disclosure or use of such information by anyone else is unauthorised. If you are not the intended recipient please contact the sender, or postmaster@uk.anker-systems.com. In the unlikely event that any unauthorised Email is received, please destroy it, accept our apologies and contact postmaster@uk.anker-systems.com with details of the sender. Any information, statements or opinions contained in this Email (including any attachments) are given by the author. They are not considered binding unless confirmed in mail or fax by a duly authorised representative of Anker Systems. This Email has been checked for known viruses by the Anker Firewall. Anker, the IT Solutions Specialists for the Retail, Hospitality and Wholesale Market Sectors - http://www.anker-systems.com -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes2.log Type: application/octet-stream Size: 59 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030919/0b4c197d/spambayes2.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 4601 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030919/0b4c197d/spambayes1.obj From cedmonds at verio.net Fri Sep 19 13:39:13 2003 From: cedmonds at verio.net (Christopher Edmonds) Date: Fri Sep 19 13:39:52 2003 Subject: [Spambayes] Spam Buttons don't work In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130347292B@its-xchg4.massey.ac.nz> Message-ID: <000601c37ed4$f1316aa0$0200000a@corp.verio.net> Thanks Tony. This still didn't fix the problem. I completely removed the program, and then installed this new one. I still get no response when I click the SpamBayes Manager toolbar. Christopher -----Original Message----- From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] Sent: Thursday, September 18, 2003 2:50 PM To: cedmonds@verio.net; spambayes@python.org Subject: RE: [Spambayes] Spam Buttons don't work > I loaded Spam Bay. It was working fine for a month and > abruptly stopped working. I loaded the newest version (006) > and it doesn't do anything. The latest version is 008.1. It would be well worth upgrading to this and seeing if that solves the problem. =Tony Meyer From cmenendez at the-beach.net Fri Sep 19 12:08:34 2003 From: cmenendez at the-beach.net (mail.the-beach.net) Date: Fri Sep 19 13:41:15 2003 Subject: [Spambayes] How do I get your program Message-ID: <00c201c37ec8$48ed3510$3646fea9@D496XJ21> I would appreciate if you could send me the instructions on how to download your program. Thanks Carlos Menendez From rmalayter at bai.org Fri Sep 19 13:46:09 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Fri Sep 19 13:46:13 2003 Subject: [Spambayes] Messages fail to filter Message-ID: <792DE28E91F6EA42B4663AE761C41C2A0111520E@cliff.bai.org> How many messages of each type (spam/ham) did you train with? You might not have enough spam trained to get accurate results yet. Also, 'score messages but don't perform filter action' does exactly what it says: it SCOREs messages, but it does not FILTER (move) them. You want to SCORE them and FILTER them. -----Original Message----- From: Rick Reid [mailto:Rick.Reid@uk.anker-systems.com] Sent: Friday, September 19, 2003 12:30 PM To: spambayes@python.org Subject: [Spambayes] Messages fail to filter Hi I have installed SpamBayes(binary version) version 0.81 as an add-in to Outlook XP running on XP Pro. I have followed all the instructions but new mail is not getting filtered at all whether it is spam or not. I have read the troubleshooting guide and could find no explanation because I have filtering enabled and all messages fail to give a Spam reading. I have looked through the bug reports but can find nothing similar. If I choose the 'Filter Messsages' option from the addin and ask it to filter my Inbox and ask it to 'Score messages but don't perform filter action', it does so and marks up the Spam column successfully. I have definitely set my Inbox as the folder to be monitored as messages arrive. I really like the look of SpamBayes and surely I must be doing something fundamentally wrong? I have checked and rechecked the settings and I am sure they are right. Could this be a bug? I doubt it, surely there must be other users with the same config? Can you help me? Would any more info be useful? Regards Rick Reid System Consultant tel. +44 (0)870 9051318 email rick.reid@uk.anker-systems.com URL: http://www.anker-systems.com The information in this Email and in any attachment is confidential and is intended solely for the addressee. Access, copying, disclosure or use of such information by anyone else is unauthorised. If you are not the intended recipient please contact the sender, or postmaster@uk.anker-systems.com. In the unlikely event that any unauthorised Email is received, please destroy it, accept our apologies and contact postmaster@uk.anker-systems.com with details of the sender. Any information, statements or opinions contained in this Email (including any attachments) are given by the author. They are not considered binding unless confirmed in mail or fax by a duly authorised representative of Anker Systems. This Email has been checked for known viruses by the Anker Firewall. Anker, the IT Solutions Specialists for the Retail, Hospitality and Wholesale Market Sectors - http://www.anker-systems.com From skip at pobox.com Fri Sep 19 14:56:02 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Sep 19 14:56:13 2003 Subject: [Spambayes] How do I get your program In-Reply-To: <00c201c37ec8$48ed3510$3646fea9@D496XJ21> References: <00c201c37ec8$48ed3510$3646fea9@D496XJ21> Message-ID: <16235.20802.635491.986589@montanaro.dyndns.org> Carlos> I would appreciate if you could send me the instructions on how Carlos> to download your program. Thanks Carlos, Start here: http://www.spambayes.org/ It looks like you will need to run the sb_server.py application. Among other things, it serves as a proxy (go-between) for the POP3 protocol. You can pick up the source distribution here: https://sourceforge.net/project/showfiles.php?group_id=61702 For details about how to use SpamBayes with Outlook Express, check the FAQ: http://spambayes.sourceforge.net/faq.html and the Windows page: http://spambayes.sourceforge.net/windows.html Skip From gjewell at cnnxn.com Fri Sep 19 16:15:35 2003 From: gjewell at cnnxn.com (Greg Jewell) Date: Fri Sep 19 16:17:47 2003 Subject: [Spambayes] Unfilterable email using Outlook plugin Message-ID: <741C3B0BAF7B1F4F8FCC50AC4766CBF02489F7@phoenix.ossconnexn.com> Hi Mark, I went to source forge today to look at the bug entered for this, and didn't find one. (I misunderstood your message and didn't realize that I should submit the bug.) When I try to enter the bug myself, it says I need to log in even though I already am. If you have the time, would you be kind enough to submit this for me? Thanks again, Greg Jewell >-----Original Message----- >From: Mark Hammond [mailto:mhammond@skippinet.com.au] >Sent: Wednesday, September 10, 2003 6:18 PM >To: Greg Jewell; spambayes@python.org >Subject: RE: [Spambayes] Unfilterable email using Outlook plugin > > >The problem is that the message in question *is* a bounce. >Some spammer has used your "from" address, so the bounce is to you. > >The interesting thing is that Outlook has decided to transform >this bounce into an "undeliverable report", which it handles >slightly differently - it gets a different icon, and a >different "form" which includes a "resend" option etc. Not >*all* bounces are handled this way by Outlook, but clearly some are. > >Unfortutately, this is a "feature" gone wrong. I initially >thought that Outlook only created such messages for "internal" >bounces - ie, for messages that failed delivery *before* they >moved to the internet - particularly in an Exchange Server environment. > >Initially, my experience was that I had only ever seen these >"bounce" messages for legitimate bounces - but SpamBayes would >then filter them, and depending on your training, would often >score tham as Spam. This was clearly bad for legitmate bounce >messages. Since then I have seen exactly one external bounce >message be handled this way in my own mail - and now yours >makes a second. > >I'm not sure what to do about this now... I guess opening a >bug at source forge would be good :) > >Mark. > From wtrenker at shaw.ca Fri Sep 19 10:09:52 2003 From: wtrenker at shaw.ca (William Trenker) Date: Fri Sep 19 17:11:45 2003 Subject: [Spambayes] Re: [OT] New Play: SPAMELOT In-Reply-To: References: Message-ID: <20030919140952.4a6583b5.wtrenker@shaw.ca> On Fri, 19 Sep 2003 15:37:44 -0400 Terry Reedy wrote regarding Re: [OT] New Play: SPAMELOT: > I nominate whoever sent me over a thousand fake MS Update + attachment > spams in the last 24 hours. I nominate for hero: Spambayes! I have never received so many virus-related spam messages as this latest 'Swen' virus. Even the recent 'Klez' virus didn't come close on volume of spam messages received. This latest one has been such a pain that yesterday I decided to try out Spambayes. I have Python 2.3 on Linux here, using Sylpheed for email. I was pleasantly surprised how easy it was to setup Spambayes and its pop3proxy.py application. I waited (not long) for a few of the virus messages to come my way, trained Spambayes on just those few messages, and haven't been bothered since. I would also nominate Spambayes as Python App of the Year! Spambayes certainly is a showcase of the sophistication, and practicality, that can be achieved using Python 'out of the box'. Bill From T.A.Meyer at massey.ac.nz Fri Sep 19 18:40:11 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 19 18:40:19 2003 Subject: [Spambayes] Filtering in multiple folders Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599338@its-xchg4.massey.ac.nz> > I use the Outlook plug-in. I have set up several rules to > sort mail to folders for different family members. The > plug-in works fine on the primary Inbox folder, but does not > seem to sort out the SPAM items before moving them to the > other folders. They are rated however. They are definitely scored, but not moved, while other messages are scored and moved? Have you tried using the background filtering? It's on the advanced tab. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 19 18:42:07 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 19 18:42:14 2003 Subject: [Spambayes] Spam Buttons don't work Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599339@its-xchg4.massey.ac.nz> > Thanks Tony. This still didn't fix the problem. I > completely removed the program, and then installed this new > one. I still get no response when I click the SpamBayes > Manager toolbar. Have you tried the steps in the troubleshooting guide re: the toolbars? Uninstalling doesn't (yet) remove the toolbar, so you might have a dead one left from the old version, which is preventing 008 from creating a working one. If you go through the steps to delete the toolbar in the guide, 008 should be able to create a working one. =Tony Meyer From ta-meyer at ihug.co.nz Fri Sep 19 19:08:39 2003 From: ta-meyer at ihug.co.nz (Tony Meyer) Date: Fri Sep 19 19:08:46 2003 Subject: [Spambayes] SpamBayes 1.0a6 Source Release Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212AF01@its-xchg4.massey.ac.nz> Version 1.0a6 of the SpamBayes source is now available. Note: this is not a release of the binary installer for the Outlook plug-in. A separate release for the plug-in installer will follow at a later date. This release follows (reasonably) close on the heals of 1.0a5 and is primarily a reshuffling and tidy-up. All the scripts have been renamed, and all the cruft supporting old option names has been removed. This will almost certainly mean that you have to edit the way you use SpamBayes (to change the name, at least). There are a few other minor improvements and bug fixes. We recommend that all source code users upgrade to the 1.0a6 release, but do so at a time when they have a few minutes available to change over references to script names, and to check their configuration files are using the correct names. Note that the plan is for this to be the final alpha release in the 1.0 series, and that 1.0 is effectively feature frozen from this point. We will continue to fix any bugs that are reported, and should be able to release a very stable 1.0b1 and then final 1.0 release in the near future. Work will soon begin on the 1.1 series, which will (no doubt!) feature many improvements. For full details about what's new in this release, please see the release notes and/or changelog: Downloads are available from . =Tony Meyer, on behalf of the spambayes team. From mhammond at skippinet.com.au Fri Sep 19 20:05:30 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri Sep 19 20:05:24 2003 Subject: [Spambayes] Problem with Outlook plugin In-Reply-To: <6E165C6D8BDFC042A0FDB4A8635A2F21010B455C@ed.carneylaw.local> Message-ID: <0a2701c37f0a$e86ca2a0$f502a8c0@eden> Please see the "troubleshooting guide" from the SpamBayes dropdown's Help item (on one of the working machines :) - it is likely that the plugin needs to be re-enabled via "Help->About->Disabled Objects" Mark > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Hauser, Lee > Sent: Saturday, 20 September 2003 2:21 AM > To: spambayes@python.org > Subject: [Spambayes] Problem with Outlook plugin > > > I sent this message early this week and have received no > response...does > no one have any ideas? > > We have several machines running the Outlook plugin without problem. > It's great software! However, a user on one machine had to > tell Outlook > to disable the plugin -- Outlook was having difficulties attaching to > the Exchange server which actually had nothing to do with > Spambayes, and > when it asked if she wanted to disable it, she said "yes." > Now nothing I > do will make the plugin work again. The original plugin was > 0.7. I have > tried uninstalling, manually clearning the plugin's registry settings > under HKCU, reinstalling, re-registering the plugin DLL, and > installing > 0.81. (Realtime virus protection is turned off during install). The > toolbar is still there, but nothing happens when I click any > menu item, > and no filtering is going on. Any hints about what I can do > about this? > > Lee Hauser, System Administrator > Carney Badley Spellman > 700 5th Ave Suite 5800 > Seattle WA 98104 > 206.689.4312 > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2580 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030920/aa507760/winmail.bin From rafael.scholl at bbox.ch Fri Sep 19 20:00:47 2003 From: rafael.scholl at bbox.ch (Rafael Scholl) Date: Fri Sep 19 20:09:37 2003 Subject: [Spambayes] imapfilter.py keeps adding new spambayes headers to messages Message-ID: <6.0.0.9.0.20030920015942.01b867f8@mail.bbox.ch> Hi everybody, Imapfilter.py adds a new set of spambayes headers to messages each time I classify my inbox. This problem has a history. I encountered it with 1.0a5 and posted to this list. It was solved by updating message.py. This seemed to work. After a while the problem reappeared. Unfortunately I can't recall doing anything that might have caused this. I didn't change a thing as far as I remember. This time, however, only some of the messages in my inbox got new headers with each time I classified. Those were usually the newest messages. (Interestingly, when I moved some of the messages to another folder and then moved them back, imapfilter.py left them alone the next time I classified my inbox.) I didn't report this problem again because I was hoping 1.0a6 would solve it. I know now that it doesn't! - Rafael Scholl * Checked for virus by F-Secure Content Scanner 6.10b36 * From mhammond at skippinet.com.au Fri Sep 19 20:12:29 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri Sep 19 20:19:02 2003 Subject: [Spambayes] Configuration problem In-Reply-To: Message-ID: <0a3001c37f0b$e1e820c0$f502a8c0@eden> > folder as where to move the spam to. I am currently using > the source code > version 1.0a4 with Python 2.3. Grab the new 1.0a6 release - that should solve the problem. Mark. From T.A.Meyer at massey.ac.nz Fri Sep 19 20:30:46 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 19 20:31:04 2003 Subject: [Spambayes] imapfilter.py keeps adding new spambayes headers tomessages Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130359933D@its-xchg4.massey.ac.nz> > This problem has a history. I encountered it with 1.0a5 and > posted to this list. It was solved by updating message.py. > This seemed to work. After a while the problem reappeared. [...] > This time, however, only some of the messages in my inbox got > new headers with each time I classified. Those were usually the > newest messages. Headers are added if imapfilter can't find a message with that id in the messageinfo database. This sounds like your messageinfo database isn't being updated. Could you check that? In your config, do you have your messageinfo database location set as a fully qualified path? (i.e. something like "C:\spambayes\messageinfo.db", not something like "messageinfo.db"). > (Interestingly, when I moved some of the messages to another > folder and then moved them back, imapfilter.py left them alone > the next time I classified my inbox.) This is strange. You can't actually move an IMAP message (it's a stupid protocol), only copy between folders, so this is identical to having new messages arrive. > I didn't report this > problem again because I was hoping 1.0a6 would solve it. I > know now that it doesn't! For future reference, please report! If it was solved, then we could quickly let you know that, and if it isn't, then it's better to try and get it fixed before the release. =Tony Meyer From rafael.scholl at bbox.ch Fri Sep 19 23:45:33 2003 From: rafael.scholl at bbox.ch (Rafael Scholl) Date: Fri Sep 19 23:46:06 2003 Subject: [Spambayes] imapfilter.py keeps adding new spambayes headers tomessages In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130359933D@its-xchg4.massey. ac.nz> Message-ID: <6.0.0.9.0.20030920041003.01b89b68@mail.bbox.ch> >Headers are added if imapfilter can't find a message with that id in the >messageinfo database. This sounds like your messageinfo database isn't >being updated. Could you check that? It wasn't updated after classification. And I've figured out why. I started "sb_imapfilter.py -c -v -l 10" in order to keep the filter running in the background (the idea was to avoid the hassle of having to start two programs every time I want to check e-mail). It turns out that with the "l" option enabled, the database doesn't get updated and headers get duplicated. (Note, however, that this certainly isn't the whole story. I think there's no use in my experimenting any more as you know the innards of the program. But while the imapfilter with the "l" option is running there is no duplication of headers. New mail gets classified in one run of the filter, and in the next run x minutes later the filter seems to remember those previous e-mails and doesn't re-classify them. However, when I shut the filter down and start it again, everything gets re-classified -- once!) This explains quite nicely why at times only new messages had duplicate headers: Sometimes I'd experiment with some options and start sb_imapfilter.py without the "l" option, updating the database in the process. > > (Interestingly, when I moved some of the messages to another > > folder and then moved them back, imapfilter.py left them alone > > the next time I classified my inbox.) > >This is strange. You can't actually move an IMAP message (it's a stupid >protocol), It is! >only copy between folders, so this is identical to having new >messages arrive. I see you're reasoning, but I tried again: empirically, my observation holds. I had only your message in my inbox. I classified the inbox several times, and each time a new set of headers was appended. Then I moved your message to my local "in" and back -- now I can classify all I want. Unfortunately this behaviour doesn't really fit in with the problem with the "l" option. >In your config, do you have your >messageinfo database location set as a fully qualified path? (i.e. >something like "C:\spambayes\messageinfo.db", not something like >"messageinfo.db"). It was just spambayes.messageinfo.db at first, but I've changed it to a full path now. However, this didn't have any effect on the problem. - Rafael Scholl * Checked for virus by F-Secure Content Scanner 6.10b36 * From fms27 at cam.ac.uk Sat Sep 20 00:41:25 2003 From: fms27 at cam.ac.uk (Frank Stajano) Date: Sat Sep 20 00:42:30 2003 Subject: [Spambayes] Spambayes works extremely well for me, thanks! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13031E5923@its-xchg4.massey. ac.nz> Message-ID: <5.2.0.9.1.20030905133953.02287368@localhost> I have been using spambayes for almost three months now and I am extremely happy with it. After checking its performance in the first two months and finding no false positives (everything it called spam was indeed spam) I decided to allow it to delete emails unseen. This was quite a dramatic step, and a testimonial to how well spambayes works for me. Brilliant work! I shall now post a few comments from my experience, in separate messages for ease of threading. Frank (filologo disneyano) http://www-lce.eng.cam.ac.uk/~fms27/ From fms27 at cam.ac.uk Sat Sep 20 00:42:24 2003 From: fms27 at cam.ac.uk (Frank Stajano) Date: Sat Sep 20 00:42:36 2003 Subject: [Spambayes] smtp proxy eating messages Message-ID: <5.2.0.9.1.20030920132454.0239b840@localhost> In the first month or so of my using spambayes (1.0a3) I had the system configured with both an smpt proxy and a pop3 proxy. However I then discovered that on rare occasions, say once or twice a week, the smtp proxy was silently eating some of my outgoing messages: eudora would show them as sent, but they never got to their recipient. I always send a bcc to myself, so I can positively confirm that this was happening; but since these bccs are refiled and ignored automatically, I basically never see them unless I look for them, so I didn't notice for quite a while. Although these occurrences of lost outgoing mail were infrequent, they were intolerable; so, when I upgraded to 1.0a4, I just got rid of the smtp proxy (which wasn't providing any useful service anyway) and the problem went away. I haven't been able to figure out a pattern to these disappearances. I understand that a report like this one won't make it too easy to identify the bug unless you already know about it, so please let me know if there is any other thing I can do to make this more helpful. Frank (filologo disneyano) http://www-lce.eng.cam.ac.uk/~fms27/ From fms27 at cam.ac.uk Sat Sep 20 00:24:43 2003 From: fms27 at cam.ac.uk (Frank Stajano) Date: Sat Sep 20 00:42:46 2003 Subject: [Spambayes] minor UI suggestion for web page Message-ID: <5.2.0.9.1.20030920125451.030c2418@localhost> The title of the html pages generated by the UI web server doesn't mention spambayes, and it should. When you see the window title of the web browser in the taskbar, you can't easily tell this web browser window from other open ones if it says just "Home > Configuration" (probably truncated to "Home > Con" or some such anyway). It would be nicer if it had a "Spambayes" prefix as the root of this hierarchy. As a quick hack I went to spambayes-1.0a4/spambayes/UserInterface.py and edited the line html.title = name (around line 125) into # fms: make the title of the browser less generic in the taskbar html.title = "Spambayes " + name This fixed the main "home" page and the "review messages" page, but not the "configuration" page. I didn't chase this up, but this in turn seems to suggest that the "configuration" page changes its title in a non-standard way compared to the rest of this impressive and elaborate page generation framework. Frank (filologo disneyano) http://www-lce.eng.cam.ac.uk/~fms27/ From fms27 at cam.ac.uk Sat Sep 20 00:11:02 2003 From: fms27 at cam.ac.uk (Frank Stajano) Date: Sat Sep 20 00:42:54 2003 Subject: [Spambayes] uncaught exception on unicode parsing errors lets spam through Message-ID: <5.2.0.9.1.20030920125449.032cfe60@localhost> This is relatively new, but I now witness frequent instances (at least one every couple of days) of spam messages that were not classified as such because spambayes choked on them at the parsing stage, adding a header such as the one below. X-Spambayes-Exception: exceptions.UnicodeDecodeError('ascii' codec can't decode byte 0x92 in position 7: ordinal not in range(128)) in append() at C:\apps\python23\lib\email\Header.py line 272: ustr = unicode(s, incodec, errors) Clearly this parsing stage needs to be more robust, otherwise spammers will send out malformed messages just to evade spambayes's classification. I include a sample message as attachment. I have more if needed. Note that you can't just say that all messages with unicode errors are spam (though experimentally so far the majority of them have been). In my academic position I also get plenty of phd applications from East Asians and I have sometimes seen the Chinese characters in the From header triggering the same kind of message. I'm guessing that's because they were encoded in something other than unicode. -------------- next part -------------- Return-path: Envelope-to: fms27@hermes.cam.ac.uk Delivery-date: Fri, 19 Sep 2003 19:05:34 +0100 Received: from brown.csi.cam.ac.uk ([131.111.8.14]) by orange.csi.cam.ac.uk with esmtp (Exim 4.12) id 1A0PdO-00011G-00 for fms27@hermes.cam.ac.uk; Fri, 19 Sep 2003 19:05:34 +0100 Received: from mta2.cl.cam.ac.uk ([128.232.0.14] helo=whittlesey.cl.cam.ac.uk) by brown.csi.cam.ac.uk with esmtp (Exim 4.20) id 1A0PdN-00010W-L5 for fms27@cam.ac.uk; Fri, 19 Sep 2003 19:05:33 +0100 Received: from rose.csi.cam.ac.uk ([131.111.8.13]) by whittlesey.cl.cam.ac.uk with esmtp (Exim 3.092 #1) id 1A0Pct-0007Qv-00 for Frank.Stajano@cl.cam.ac.uk; Fri, 19 Sep 2003 19:05:03 +0100 Received: from [128.252.188.12] (helo=yahoo.com) by rose.csi.cam.ac.uk with esmtp (Exim 4.20) id 1A0Pco-0000Fa-5h for fms27@cl.cam.ac.uk; Fri, 19 Sep 2003 19:04:58 +0100 Message-ID: <1063994698.0969@mailcity.com> From: "Alisha Stevens" To: Frank.Stajano@cl.cam.ac.uk Subject: Re: I?ll tell you why! Date: Fri, 19 Sep 2003 18:04:58 +0000 MIME-Version: 1.0 X-Mailer: Pegasus Mail for Win32 (v3.12a) Content-Type: text/html Content-Transfer-Encoding: 8bit X-Cam-AntiVirus: No virus found X-Cam-SpamDetails: scanned, SpamAssassin (score=6.4, HTML_80_90 0.54, HTML_FONT_BIG 0.27, HTML_FONT_COLOR_BLUE 0.10, HTML_FONT_COLOR_RED 0.10, HTML_MESSAGE 0.10, MIME_HTML_ONLY 0.10, OBFUSCATING_COMMENT 2.60, RAZOR2_CHECK 2.06, SEE_FOR_YOURSELF 0.48) X-Cam-SpamScore: ssssss X-Cam-ScannerInfo: http://www.cam.ac.uk/cs/email/scanner/ X-Cam-AntiVirus: No virus found X-Cam-SpamDetails: Not scanned Status: O X-Spambayes-Exception: exceptions.UnicodeDecodeError('ascii' codec can't decode byte 0x92 in position 5: ordinal not in range(128)) in append() at C:\apps\python23\lib\email\Header.py line 272: ustr = unicode(s, incodec, errors) Wholesale Prescription Medications Our doctors will write you a prescription for free! Buy Your Prescription Meds Online See For Yourself... Check It Out Here Stop Receiving the offers -------------- next part -------------- Frank (filologo disneyano) http://www-lce.eng.cam.ac.uk/~fms27/ From fms27 at cam.ac.uk Sat Sep 20 00:15:26 2003 From: fms27 at cam.ac.uk (Frank Stajano) Date: Sat Sep 20 00:43:02 2003 Subject: [Spambayes] security of server ports Message-ID: <5.2.0.9.1.20030920125444.03307c58@localhost> As a security professional it is my duty to be paranoid. Since spambayes (or more correctly its proxies) opens some listener sockets, it would be nice if (a) by default it only accepted connections from localhost on those sockets, refusing to serve any others, and nonetheless (b) issued a warning anytime it detected a connection request on those sockets from any other machine. Same thing for the local web server offering the configuration interface. Frank (filologo disneyano) http://www-lce.eng.cam.ac.uk/~fms27/ From T.A.Meyer at massey.ac.nz Sat Sep 20 00:45:44 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 20 00:45:56 2003 Subject: [Spambayes] uncaught exception on unicode parsing errors lets spamthrough Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599347@its-xchg4.massey.ac.nz> > This is relatively new, but I now witness frequent instances > (at least one every couple of days) of spam messages that were not > classified as such because spambayes choked on them at the parsing > stage, adding a header such as the one below. > > X-Spambayes-Exception: exceptions.UnicodeDecodeError('ascii' > codec can't > decode byte 0x92 in position 7: ordinal not in range(128)) in > append() at C:\apps\python23\lib\email\Header.py line 272: > ustr = unicode(s, incodec, errors) This should be fixed in both 1.0a5 and the new 1.0a6. What version are you using? =Tony Meyer From fms27 at cam.ac.uk Sat Sep 20 00:58:44 2003 From: fms27 at cam.ac.uk (Frank Stajano) Date: Sat Sep 20 00:58:50 2003 Subject: [Spambayes] uncaught exception on unicode parsing errors lets spamthrough In-Reply-To: <5.2.0.9.1.20030920134950.0329d5e0@localhost> References: <1ED4ECF91CDED24C8D012BCF2B034F1303599347@its-xchg4.massey. ac.nz> Message-ID: <5.2.0.9.1.20030920135643.0332c590@localhost> Whooops, sorry for the non-public reply, Tony, I assumed reply-to was set to the list automatically but didn't check. At 2003-09-20 13:53, Frank Stajano wrote: >At 2003-09-20 13:45, you wrote: >>This should be fixed in both 1.0a5 and the new 1.0a6. > >Glad to hear this, thanks! > >> What version are >>you using? > >1.0a4. Downloading 1.0a6 as we speak... > >Note that the autoresponder I just got when posting the above stuff still >mentions 1.0a4 as the latest, although it correctly says "as of >2003-08-28". Perhaps worth updating. Frank (filologo disneyano) http://www-lce.eng.cam.ac.uk/~fms27/ From T.A.Meyer at massey.ac.nz Sat Sep 20 00:58:53 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 20 00:59:04 2003 Subject: [Spambayes] security of server ports Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599348@its-xchg4.massey.ac.nz> > As a security professional it is my duty to be paranoid. > Since spambayes (or more correctly its proxies) opens some > listener sockets, it would be nice if (a) by default it > only accepted connections from localhost on those sockets, > refusing to serve any others, and nonetheless (b) issued a > warning anytime it detected a connection request on those > sockets from any other machine. There's a feature request (and patch I think) for this open on sourceforge. It will presumably get integrated at some point (probably by the time 1.1a1 is released). Given that they are only proxies, there doesn't seem to be much of a risk. > Same thing for the local web server offering the > configuration interface. The web server does only accept connections from localhost, by default. With 1.0a6 you can use HTTP AUTH, if you like, or set a set/range of IPs that are allowed to connect to it. =Tony Meyer From sbehrens at gmx.li Sat Sep 20 08:32:08 2003 From: sbehrens at gmx.li (Shawn Behrens) Date: Sat Sep 20 08:27:26 2003 Subject: [Spambayes] 1.06a won't start (Win32) Message-ID: <3F6C48C8.3090900@gmx.li> Aloha, I have a working 1.05a-as-service installation. I stopped the service and copied the 1.06a files over. Attempting to start the service again failed. Attempting to run the windows/pop3proxy_service.py resulted in a complaint about not finding sb_server. Attempting to run pop3proxy.py first resulted in a complaint that the option 'no_cache_bulk_ham:True' was not recognized; and after removing that option, resulted in a traceback: C:\Program Files\spambayes>pop3proxy.py Traceback (most recent call last): File "C:\Program Files\spambayes\pop3proxy.py", line 701, in ? state = State() File "C:\Program Files\spambayes\pop3proxy.py", line 562, in __init__ if options["pop3proxy", "servers"]: File "C:\Program Files\spambayes\spambayes\OptionsClass.py", line 562, in __ge titem__ return self.get(key[0], key[1]) File "C:\Program Files\spambayes\spambayes\OptionsClass.py", line 559, in get return self.get_option(sect, opt).get() File "C:\Program Files\spambayes\spambayes\OptionsClass.py", line 553, in get_ option return self._options[sect, opt] KeyError: ('pop3proxy', 'servers') Running this on WindowsXP with Python 2.3 and win32all 1.5.3 I went back to 1.05a (again by stopping the service, copying source files over, and starting the service), and I'm good. I guess I'll try a more 'clean install' with just copying hammie.db into a fresh 1.06a dir next, just to make sure it isn't me who's causing this. Shawn From billb at mousa.demon.co.uk Sat Sep 20 12:36:24 2003 From: billb at mousa.demon.co.uk (Bill Bedford) Date: Sat Sep 20 12:38:16 2003 Subject: [Spambayes] 1.06a build problem Message-ID: <20030920173624226776.GyazMail.billb@mousa.demon.co.uk> I tried building 1.06a from the sources with this result:- running build_scripts creating build/scripts-2.3 error: file 'scripts/sb_client.py' does not exist Is there something missing from the distribution? Bill Bedford "Nothing is as important as model railways and even that isn't very important" -some wiseguy somewhere From rafael.scholl at bbox.ch Sat Sep 20 13:02:31 2003 From: rafael.scholl at bbox.ch (Rafael Scholl) Date: Sat Sep 20 13:02:40 2003 Subject: [Spambayes] why not: imapfilter.py as service? Message-ID: <6.0.0.9.0.20030920190005.01b858e8@mail.bbox.ch> Hi everybody, I saw that one can run pop3proxy.py as a service in windows. I think that's a neat way to integrate it for people who aren't using Outlook. What would it take to do the same with the imapfilter? It would be very unobtrusive as a service, running in the background, classifying messages every x minutes... - Rafael Scholl * Checked for virus by F-Secure Content Scanner 6.10b36 * From MMARTINEZ at CSREES.USDA.GOV Sat Sep 20 13:49:25 2003 From: MMARTINEZ at CSREES.USDA.GOV (Martinez, Michael) Date: Sat Sep 20 13:48:16 2003 Subject: [Spambayes] Are there plans for a daemonized or compiled version of Spambayes? Message-ID: <83EED274D3127740995A6B621B88F2E101274212@csrees-exchange.csrees.usda.gov> Hi Guys, I've been running Spambayes on our agency Linux smtp gateway for several months and very happy with its classification of spam. My gateway is a qmail system and it pipes all incoming email through the hammiefilter prior to delivery. However, a performance problem arises when the gateway gets hit during peak hours with a lot of emails. What happens is the system slows down tremendously, in part due to the number of python instances that get forked in order to scan the emails. I was wondering: are there any plans to develop a lightweight, daemonized version of Spambayes? In the same vein, are there plans to port it to C or another compiled language? How difficult would this be? Any suggestions on how to lessen the impact on the system resources under a heavy email load? Thanks, Michael Martinez Linux System Administrator ISTM/CSREES United States Department of Agriculture From japeters at pacbell.net Sat Sep 20 15:02:28 2003 From: japeters at pacbell.net (John A. Peters) Date: Sat Sep 20 15:04:57 2003 Subject: [Spambayes] Suggestion for the rest of us Message-ID: <002001c37fa9$bd36ed80$6702a8c0@eTower> I think this is the right place. I read this So what needs to be done? Currently (July) work is now being focused on improving deployment of the system (actually building the applications and the code so that Tim's sister can use the system), One suggestion is that when the DOS box can't connect it gives an error message, but when it does reconnect it is silent. The work around is to close the dos box and reload Spambayes. The result is reassuring but kind of Klunky way to bet reassured. Maybe a good message will be friendly "Reconnected - SpamBayes is proxying at . . . " From ff at breeders.net Sat Sep 20 15:13:30 2003 From: ff at breeders.net (Frank Fleischer) Date: Sat Sep 20 15:13:35 2003 Subject: [Spambayes] Outlook upgrade - 2000 to xp Message-ID: <000001c37fab$4abad800$6501a8c0@master> I'm new to this - hope it's the right place to post. I installed Spambayes addin to Outlook 2000 and it was working beautifully. I've just upgraded from Outlook 2000 to Outlook xp and now when I click on the button "Delete as Spam", I get an Invalid Configuration message box that says "You must configure the spam folder." I've tried all kinds of things (reinstall, retrain) and nothing helps. Help, please! Thanks. Frank Fleischer Dog breeders galore at www.breeders.net Dog breed guides at www.doggies.com The Straight Poop about dogs at www.straight-poop.com From ian at astounding.org.uk Sat Sep 20 15:31:32 2003 From: ian at astounding.org.uk (Ian Smith) Date: Sat Sep 20 15:31:51 2003 Subject: [Spambayes] reporet on trying 1.0a6 on SuSE Message-ID: I'm probably doing this all wrong, but I don't understand CVS or sourceforge, and someone might like some feedback on what I've found, particularly as 1.0a6 bills itself as getting the install sorted (I think I saw that somewhere, but I can't find it now). I'm trying to get 1.0a6 to work on a SuSE 8.0 linux. This is a couple of versions old SuSE, but is as fully updated as SuSE do without going to a new version. The python in SuSE 8.0 is 2.2. SuSE do not have a package containing teh python email package, so I got and installed that as directed in teh README.txt. It doesn't install unless you have teh python-development SuSE package installed (I think there's a bug report saying similar for Debian - may be worth adding a comment that if your linux offers a 'python development' package you probably want it). setup.py won't run. Try it several times, dies with horrible looking errors. On offchance, try using teh .zip rather than the .tar.gz. When I do that, setup.py gets much further - is the .tar.gz corrupt? However, setup.py still won't complete. It now moans that "name 'False' is not defined". Some poking around teh web leads me to believe that True and False were added in python 2.2.2. If that is teh case, the README statement that you need 2.2 or later is not true - you need 2.2.2 or later. I know no python (specifically, I have never in my life seen any python code before this moment), but looking in setup.py it looks like lots of programming languages do, so I thought "why not?" and added teh lines "True=1" and "False=0" at the top (after the #! line, before the "import os" line). With that change, setup.py apparently completes happily. sb_mboxtrain won't run, complains "bad interpreter: no such file or directory". Aha - I know that one, and a quick peak in teh scripts - yes indeed, all have a ^M at teh end of every line - they're dos format. Is this because I worked from teh .zip rather than the .tar.gz? Anyway, a ls sb_* | xargs -i recode -f ibmpc..lat1 {} in the /usr/bin directory fixes it. Next I run sb_mboxtrain. That fails with teh same complaint about FALSE. Reading teh traceback, it seemed the actual complaint was related to site-packages/spambayes/Options.py, so I put True=1 and False=0 lines in tehre, just above import sys, os. Now sb_mboxtrain runs without crashing type error, but comes up with a usage error, apparently I must specify -d or -D. There doesn't seem to be anything about that in teh README. So, I try sb_imapfilter.py. That doesn't run either - apparently "-b, -c or -t operands must be specified". No mention of that in teh README file, however. I note that sb_mboxtrain -h says lower case -d is DBM and upper case -D is pickle. However, sb_imapfilter -h says lower case -d is pickled and upper case -D is dbm. Is this really so? It might be easier if there was consistency in the options. Alternatively, one or other of them has a wrong usage message (though I don't know which yet). sb_imapfilter.py -b runs up lynx (the machine it's running on doesn't have X, so that's the best browser available), but lynx hangs for a long, long time at "HTTP request sent; waiting for response". Not sure where the hang is occurring, and it's never given up (or at least, I've always interrupted it first). Back to sb_mboxtrain. Living dangerously, I try "sb_mboxtrain.py -d ~/.hammiedb -g ./ham -s ./spam". That runs, and looks like it's doing something useful. So, getting the feel for just trying things I stick a single email in a file sb_test_email and try "cat sb_test_email | sb_filter.py -d ~/.hammiedb". That dies with teh complaint about False again. Not sure I fancy editing every script, and I'm also not certain that that's actually working - I might be breaking something do that, so I postpone that approach for now. At this point I give up for the day. I'm starting to suspect I'll have to scrap the SuSE packaged python and put a more up-to-date one in from source. I'm quite keen to get spambayes working, since if it does, it looks very promising... Dunno if any of that is helpful to anyone. If anyone spots anything I've done which is causing my problems, I'd be glad to hear it. regards, Ian SMith -- |\ /| no .sig |o o| |/ \| From tamara at finitemonkey.com Sat Sep 20 17:51:54 2003 From: tamara at finitemonkey.com (Tamara Brown) Date: Sat Sep 20 17:51:56 2003 Subject: [Spambayes] How can I get the program working properly? Message-ID: The Delete As Spam and SpamBayes buttons both are showing on the outlook bar but nothing happens when I click them. I have tried uninstalling the program and reinstalling it but it doesn't seem to help. How can I get the program to work? From paulo at sakamoto.com.br Sat Sep 20 18:28:56 2003 From: paulo at sakamoto.com.br (paulo@sakamoto.com.br) Date: Sat Sep 20 18:29:02 2003 Subject: [Spambayes] Are there plans for a daemonized or compiled version of Message-ID: <3982.200.158.237.45.1064096936.squirrel@www2.brasiliasp.br> Hi, Could you show us your server configuration? I also run a qmail server for a school and would like to use spambayes at the server. Thanks, Paulo Sakamoto From tim.one at comcast.net Sat Sep 20 18:35:35 2003 From: tim.one at comcast.net (Tim Peters) Date: Sat Sep 20 18:35:39 2003 Subject: [Spambayes] Are there plans for a daemonized or compiled version ofSpambayes? In-Reply-To: <83EED274D3127740995A6B621B88F2E101274212@csrees-exchange.csrees.usda.gov> Message-ID: [Martinez, Michael] > I've been running Spambayes on our agency Linux smtp gateway for > several months and very happy with its classification of spam. My > gateway is a qmail system and it pipes all incoming email through the > hammiefilter prior to delivery. Yup, running a distinct classifier for each email is a pretty crazy design for high-volume use. > However, a performance problem arises when the gateway gets hit during > peak hours with a lot of emails. What happens is the system slows down > tremendously, in part due to the number of python instances that get > forked in order to scan the emails. > > I was wondering: are there any plans to develop a lightweight, > daemonized version of Spambayes? The answer to that depends on you too: what are your plans? Python is a C program, and can be daemonized like any other. Note the project's pspam directory sets up a classifier backed by a ZODB database, which can be attached to via opening a ZEO connection. That would be a pleasant way to let multiple clients hook up at will to an always-running classifier. > In the same vein, are there plans to port it to C or another compiled > language? AFAICT, the most expensive part of running spambayes now is running Berkeley database lookups, and the Sleepycat bsddb implementation is already written in C. So profile before you presume to know what would help. Based on what I've measured, my interest in recoding any of the rest in C is nil. > How difficult would this be? It would be extremely tedious. You don't escape the needs for a database, for I/O, or for a variety of complex string-processing operations. The parts of the Python implementation that supply those to Python programmers are already coded in C, but much easier to use from Python than from C. From marc999 at agt.net Sat Sep 20 20:09:32 2003 From: marc999 at agt.net (Marc M) Date: Sat Sep 20 20:09:36 2003 Subject: [Spambayes] Deleting Spam Rather than Moving it Message-ID: I have read the FAQ: http://spambayes.sourceforge.net/faq.html#how-can-i-configure-spambayes-to-d elete-spam-rather-than-moving-it However I'm not sure if this means that it is impossible to do, or just that it is not programmed at this time. If this is possible, is it being considered for addition in future version of SpamBayes? I have looked at the Auto-Archiving from Outlook and if I right click on my Spam folder and go to Properties -> AutoArchive, and set it to delete, I get a message: "There are no global AutoArchive Options set. Global AutoArchive option will be set to archive every 14 days at startup. You can moidify these settings on the Other tab." After fiddling with it I don't beleive there is a way to set Auto-Archive for ONLY the Spam folder (BTW, I am using Outlook 2000). Since I don't want to waste disk space Auto-Archiving, especially when I instead make periodic .pst files for backup, I'm not confident that this is a great alternative for my needs. It would be really nice if SpamBayes would just directly move spam to the deleted folders tab, then Outlook will empy the folder every time it exits (if you set it to). You can always check the folder before exit (or just not check the option of having Outlook remove items from the Deleted Items folder) if you are unsure of how well you have trained SpamBayes. I've never had a false positive so this is why I crave this option so much. From T.A.Meyer at massey.ac.nz Sat Sep 20 23:57:30 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 20 23:57:49 2003 Subject: [Spambayes] 1.06a won't start (Win32) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599360@its-xchg4.massey.ac.nz> > I have a working 1.05a-as-service installation. I stopped the service > and copied the 1.06a files over. Attempting to start the > service again failed. Attempting to run the windows/pop3proxy_service.py > resulted in a complaint about not finding sb_server. I'd suggest that you remove the 1.0a5 service (windows/pop3proxy_service.py -remove), and then install the 1.0a6 one (windows/pop3proxy_service.py -install). > Attempting to run pop3proxy.py first resulted in a complaint that the > option 'no_cache_bulk_ham:True' was not recognized; and after > removing that option, resulted in a traceback: 'pop3proxy.py' doesn't exist in 1.0a6...this could be part of your problem :) It's guaranteed not to work, anyway. You did read the release notes/what's new file, didn't you? . =Tony Meyer From T.A.Meyer at massey.ac.nz Sun Sep 21 00:06:06 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 21 00:06:19 2003 Subject: [Spambayes] How can I get the program working properly? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599361@its-xchg4.massey.ac.nz> > The Delete As Spam and SpamBayes buttons both are showing on > the outlook bar but nothing happens when I click them. I have > tried uninstalling the program and reinstalling it but it > doesn't seem to help. How can I get the program to work? Try going through the steps in the troubleshooting guide. This is installed into the folder that spambayes is installed into (C:\Program Files\SpamBayes, probably), and is also available online: =Tony Meyer From T.A.Meyer at massey.ac.nz Sun Sep 21 00:16:54 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 21 00:17:09 2003 Subject: [Spambayes] 1.06a build problem Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599363@its-xchg4.massey.ac.nz> > I tried building 1.06a from the sources with this result:- > > running build_scripts > creating build/scripts-2.3 > error: file 'scripts/sb_client.py' does not exist > > Is there something missing from the distribution? Eeek! The entire scripts directory is missing. Why the zip has it and the tar.gz doesn't, I don't know (neither should given that the manifest was wrong), and how that was missed from the release candidate check, I don't know. I've just uploaded to sourceforge 1.0a6.1, which has the scripts directory. Apologies! =Tony Meyer From wtrenker at shaw.ca Sat Sep 20 18:16:43 2003 From: wtrenker at shaw.ca (William Trenker) Date: Sun Sep 21 01:17:43 2003 Subject: [Spambayes] 1.06a Upgrade Message-ID: <20030920221643.76a1a385.wtrenker@shaw.ca> I've read the release notes regarding the changes to the names of the various scripts. I use pop3proxy.py (Spambayes version 1.05a) with Sylpheed on Linux (Python 2.3) -- works great! As you know, most of the data folders and files used by pop3proxy.py have the string 'pop3proxy' in them. For example: pop3proxy-spam-cache/ or _pop3proxyham.mbox and others. When I install 1.06a will these existing data folder / file names still work with the new sb_server.py script? Thanks, Bill From T.A.Meyer at massey.ac.nz Sun Sep 21 02:53:11 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 21 02:53:24 2003 Subject: [Spambayes] 1.06a Upgrade Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599366@its-xchg4.massey.ac.nz> > I use pop3proxy.py (Spambayes version 1.05a) with Sylpheed on > Linux (Python 2.3) -- works great! As you know, most of the > data folders and files used by pop3proxy.py have the string > 'pop3proxy' in them. For example: pop3proxy-spam-cache/ or > _pop3proxyham.mbox and others. When I install 1.06a will > these existing data folder / file names still work with the > new sb_server.py script? Yes, all of those will still work fine. None of the default storage names have changed. The reason behind the change to sb_server (rather than sb_pop3proxy) was that the script actually does more than serve a POP3 proxy. (For example, it also serves the web interface, and any SMTP proxies). Eventually, it will probably be the 'main' script, whatever you're doing. At that point the default names for storage may change, but that's a while off. Even then, you could set your config script to use the old ones. =Tony Meyer From T.A.Meyer at massey.ac.nz Sun Sep 21 03:00:08 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 21 03:00:52 2003 Subject: [Spambayes] reporet on trying 1.0a6 on SuSE Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599367@its-xchg4.massey.ac.nz> > I'm probably doing this all wrong, but I don't understand CVS or > sourceforge, and someone might like some feedback on what I've found, > particularly as 1.0a6 bills itself as getting the install > sorted (I think I saw that somewhere, but I can't find it now). I'm not sure that that claim has ever been made ;) > setup.py won't run. Try it several times, dies with horrible looking > errors. On offchance, try using the .zip rather than the > .tar.gz. When I do that, setup.py gets much further - is the .tar.gz corrupt? The tar.gz of 1.0a6 is missing the entire scripts directory, so isn't all that usable :/. There's now a 1.0a6.1 available, which has the missing stuff. > However, setup.py still won't complete. It now moans that > "name 'False' is not defined". Some poking around teh web leads me to > believe that True and False were added in python 2.2.2. If that is the > case, the README statement that you need 2.2 or later is not true - you > need 2.2.2 or later. That's a bug in setup.py. All the actual scripts should work fine in 2.2. I've checked in a fix for the setup bug. > sb_mboxtrain won't run, complains "bad interpreter: no such file or > directory". Aha - I know that one, and a quick peak in teh > scripts - yes indeed, all have a ^M at teh end of every line - they're dos > format. Is this because I worked from teh .zip rather than the .tar.gz? Yes. > Next I run sb_mboxtrain. That fails with teh same complaint about > FALSE. Reading teh traceback, it seemed the actual complaint > was related to site-packages/spambayes/Options.py, so I put True=1 and > False=0 lines in tehre, just above import sys, os. This shouldn't happen - sb_mboxtrain should define True and False before it tries to import Options.py. I don't suppose you still have a copy of the traceback? > Now sb_mboxtrain runs without crashing type error, but comes up with a > usage error, apparently I must specify -d or -D. There > doesn't seem to be anything about that in teh README. There's a bug open about that. That section of the README needs to be updated. I'm not really the right person to do so, though, because I don't use it... > So, I try sb_imapfilter.py. That doesn't run either - > apparently "-b, -c or -t operands must be specified". No mention of > that in teh README file, however. Fixed, thanks. > I note that sb_mboxtrain -h says lower case -d is DBM and > upper case -D is pickle. However, sb_imapfilter -h says lower case -d > is pickled and uppercase -D is dbm. Is this really so? It might be easier > if there was consistency in the options. Yes, unfortunately that is the case. There was some discussion a while back (on spambayes-dev) about fixing this up. I imagine that it'll get done for 1.1 (probably "-p" for pickle and "-d" for DBM), but the best option is really to put all of this in a config file (.spambayesrc, for example) rather than on the command line. > sb_imapfilter.py -b runs up lynx (the machine it's running on > doesn't have X, so that's the best browser available), but lynx hangs for > a long, long time at "HTTP request sent; waiting for response". Not sure > where the hang is occurring, and it's never given up (or at least, I've always > interrupted it first). Hmm. If you run sb_server.py and go to http://localhost:8880, does it hang then, too? Using "Lynx Version 2.8.4rel.1 (17 Jul 2001)" works fine for me. > "cat sb_test_email | sb_filter.py -d > ~/.hammiedb". That dies with teh complaint about False again. > Not sure I fancy editing every script, and I'm also not certain that > that's actually working - I might be breaking something do that, > so I postpone that approach for now. Again, if you had the traceback, that would be helpful. There's a lot of code scattered about that's meant to provide False/True if it doesn't exist, which should be working... > Dunno if any of that is helpful to anyone. Absolutely, thanks. =Tony Meyer From T.A.Meyer at massey.ac.nz Sun Sep 21 03:13:35 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 21 03:13:47 2003 Subject: [Spambayes] Deleting Spam Rather than Moving it Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599368@its-xchg4.massey.ac.nz> > However I'm not sure if this means that it is impossible to > do, or just that it is not programmed at this time. Nothing's impossible . > If this is possible, is it being considered for addition in future > version of SpamBayes? Anything's considered. If enough people bother Mark (or maybe Adam) about it, then it might get added (or he'll get annoyed and ignore them ;). If someone that knew Python decides that they'd like it, then they could write a patch, and have Mark consider it. > I have looked at the Auto-Archiving from Outlook and if I > right click on my Spam folder and go to Properties -> > AutoArchive, and set it to delete, I get a message: "There > are no global AutoArchive Options set. [...] > After fiddling > with it I don't beleive there is a way to set Auto-Archive > for ONLY the Spam folder (BTW, I am using Outlook 2000). Hmm. I don't get that message at all, and can easily have only the one folder auto-archive. However, I'm using Outlook 2002/XP, so maybe that's not the case in 2k (I can't recall, and don't have a copy around). > It would be really nice if SpamBayes would just directly move > spam to the deleted folders tab, then Outlook will empy the > folder every time it exits (if you set it to). The problem with allowing people to set the deleted items folder as the spam folder is that if they also set the option to train on messages moved to the spam folder, *all* deleted mail (good or bad) would be trained as spam. Sure this is only if you set both of those options, but allowing the folder to be used only depending on other options is easily confusing. Plus, given that there is a chance of false positives, especially during early use, getting people to send spam straight to the deleted items folder is very risky. In the long term, I gather than Outlook 2003 has Junk folders of it's own that spambayes can move spam to, which act as a secondary deleted items folder. (Even Outlook 2002 will let you do this via auto-archive). We're only talking about 4 clicks (on the folder, ctrl-a, delete, confirm) after all... =Tony Meyer From T.A.Meyer at massey.ac.nz Sun Sep 21 03:16:33 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 21 03:18:14 2003 Subject: [Spambayes] Suggestion for the rest of us Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599369@its-xchg4.massey.ac.nz> > One suggestion is that when the DOS box can't connect it > gives an error message, but when it does reconnect it is > silent. The work around is to close the dos box and reload > Spambayes. The result is reassuring but kind of Klunky way > to bet reassured. I'm not clear on what you mean here. Which SpamBayes application are you using? pop3proxy/sb_server? imapfilter? Are you meaning that you try and check your mail, but you can't connect to your mail server, and if this happens you have to restart SpamBayes? That shouldn't be the case. What is the error message that you receive? =Tony Meyer From T.A.Meyer at massey.ac.nz Sun Sep 21 03:19:09 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 21 03:19:20 2003 Subject: [Spambayes] why not: imapfilter.py as service? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130359936A@its-xchg4.massey.ac.nz> > I saw that one can run pop3proxy.py as a service in windows. > I think that's a neat way to integrate it for people who aren't > using Outlook. What would it take to do the same with the imapfilter? Very little. Once doing this for the POP3 proxy is all done, I imagine it will be for imapfilter as well (execution of imapfilter might end up running off sb_server anyway). At the moment I'd rather hold off to avoid making the same mistakes & then corrections in two apps at the same time :) =Tony Meyer From rosenfeld at alum.mit.edu Sun Sep 21 04:02:08 2003 From: rosenfeld at alum.mit.edu (Rob Rosenfeld) Date: Sun Sep 21 04:02:20 2003 Subject: [Spambayes] Recover from Spam Destination Message-ID: <008001c38016$a7fe90e0$6d5da8c0@rrosenfeny> Hello, thanks to your help I have a properly trained SpamBayes that is rarely unsure or wrong. Thanks. I am using Outlook XP with an IMAP account, so my "Personal Folders" Inbox isn't SpamBayes "watching" folder. When I do have an unsure that is ham, I click "Recover from Spam". The email is recovered, but it's restored to the Personal Folders Inbox which is unused. Have I missed a setting? Can I tell SpamBayes to recover the message to the folder it found it in? -- Rob Rosenfeld rosenfeld@alum.mit.edu From T.A.Meyer at massey.ac.nz Sun Sep 21 04:21:57 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 21 04:22:11 2003 Subject: [Spambayes] smtp proxy eating messages Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599372@its-xchg4.massey.ac.nz> > However I then discovered that on rare occasions, say once or > twice a week, the smtp proxy was silently eating some of my > outgoing messages: eudora would show them as > sent, but they never got to their recipient. Yikes! I've never heard this before. > Although these occurrences of lost outgoing mail were > infrequent, they were intolerable; Absolutely. > I haven't been able to figure out a pattern to these > disappearances. [...] > please let me know if there is > any other thing I can do to make this more helpful. It would be interesting to know if it happens with 1.0a6, since smtpproxy has gone through many substantial improvements since 1.0a3. If it only happened infrequently, and you don't know how to reproduce it, though, then I can't expect you to be able to tell me if it does or not. I don't suppose you know whether pop3proxy.py would print out an error message when it happened? Given that it's only proxying, I can only think of two ways this might happen - one is if smtpproxy believed that the message was to it, but that should only be if one of the recipient addresses matched the ham_address or spam_address options (spambayes_ham@localhost and spambayes_spam@localhost, by default). That seems unlikely. The other is that an error occurred during the transfer, but I would have thought that this would cause an error to be reported to Eudora, as well - that's been my experience. Unfortunately, I can't see any way to track this down, without having some way to reproduce it. If anyone else reading the list has had this happen, please let us know! =Tony Meyer From japeters at pacbell.net Sun Sep 21 19:34:50 2003 From: japeters at pacbell.net (John A. Peters) Date: Sun Sep 21 04:32:24 2003 Subject: [Spambayes] Suggestion for the rest of us continued Message-ID: <002e01c38098$f3a24c30$6702a8c0@AMD> > One suggestion is that when the DOS box can't connect it > gives an error message, but when it does reconnect it is > silent. The work around is to close the DOS box and reload > Spambayes. The result is reassuring but kind of Klunky way > to bet reassured. I'm not clear on what you mean here. Which SpamBayes application are you using? pop3proxy/sb_server? imapfilter? Are you meaning that you try and check your mail, but you can't connect to your mail server, and if this happens you have to restart SpamBayes? That shouldn't be the case. What is the error message that you receive? =Tony Meyer Yes it is pop3proxy I don't know what the /sb_server part of your message means. Any way it is not a but with SpamBayes. I dont HAVE to restart SpamBayes, I was doing it to get rid of the error messages and to be sure I wal back online, whil experimenting with my new wireless computer. Let's say that my computer is on a wireless and some how I get too far away from the transmitter. I loose my connection. SpamBayes shows an error message in the black DOS box telling me that it can not connect. Naturally it should do this, but . . . Now when I move back in to range, SpamBayes does not either 1) erase the not connected message or 2) put up a new message telling me I am now connected. Can't connect to postoffice.pacbell.net:110 (7, 'getaddrinfo faled') Can't connect to postoffice.pacbell.net:110 (7, 'getaddrinfo faled') Can't connect to postoffice.pacbell.net:110 (7, 'getaddrinfo faled') The point that the above is not as user friendly as it could be for the new users that will follow us. I was only able to get SpamBayes to work care of a professional programmer friend of mine. I am a very junior programmer in another language (Forth) and there fore am qualified to speak from the view point of those who are relatively inexperienced. Reconnected to postoffice.pacbell.net:110:25 Might be a suggested message that would be reassuring to the novice user. John A. Peters From japeters at pacbell.net Sun Sep 21 19:51:12 2003 From: japeters at pacbell.net (John A. Peters) Date: Sun Sep 21 04:48:44 2003 Subject: [Spambayes] Oops (long lines, last time) Message-ID: <004701c3809b$3cf87d30$6702a8c0@AMD> Is there a way to respond to this news group from the news group it's self? I am sending directly from OE and there fore when I respond to a message it is not quoted. I subscribed but I am not getting a news feed to my email. John A. Peters From ian at astounding.org.uk Sun Sep 21 07:44:44 2003 From: ian at astounding.org.uk (Ian Smith) Date: Sun Sep 21 07:44:59 2003 Subject: [Spambayes] reporet on trying 1.0a6 on SuSE In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1303599367@its-xchg4.massey.ac.nz> Message-ID: On Sun, 21 Sep 2003, Meyer, Tony wrote: > > However, setup.py still won't complete. It now moans that "name > > 'False' is not defined". Some poking around teh web leads me to > > believe that True and False were added in python 2.2.2. If that is > > the case, the README statement that you need 2.2 or later is not true > > - you need 2.2.2 or later. > > That's a bug in setup.py. All the actual scripts should work fine in > 2.2. I've checked in a fix for the setup bug. Ah. As I go on to say, I get some problems in teh scripts too. > > Next I run sb_mboxtrain. That fails with teh same complaint about > > FALSE. Reading teh traceback, it seemed the actual complaint was > > related to site-packages/spambayes/Options.py, so I put True=1 and > > False=0 lines in tehre, just above import sys, os. > > This shouldn't happen - sb_mboxtrain should define True and False before > it tries to import Options.py. I don't suppose you still have a copy of > the traceback? No, but I can take teh lines out and run it again: sb_mboxtrain.py -d ~/.hammiedb -s spambayes/spam -g spambayes/ham Traceback (most recent call last): File "/usr/bin/sb_mboxtrain.py", line 50, in ? from spambayes import hammie, mboxutils File "/usr/lib/python2.2/site-packages/spambayes/hammie.py", line 5, in ? from spambayes import storage File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 67, in ? from spambayes import classifier File "/usr/lib/python2.2/site-packages/spambayes/classifier.py", line 43, in ? from spambayes.Options import options File "/usr/lib/python2.2/site-packages/spambayes/Options.py", line 33, in ? defaults = { NameError: name 'False' is not defined [I've added line breaks immediately before 'line nn' so it fits within normal email line lengths] > > sb_imapfilter.py -b runs up lynx (the machine it's running on doesn't > > have X, so that's the best browser available), but lynx hangs for a > > long, long time at "HTTP request sent; waiting for response". Not > > sure where the hang is occurring, and it's never given up (or at > > least, I've always interrupted it first). > > Hmm. If you run sb_server.py and go to http://localhost:8880, does it > hang then, too? Using "Lynx Version 2.8.4rel.1 (17 Jul 2001)" works > fine for me. Well, if I runb sb_server.py I get told False is not defined again: Traceback (most recent call last): File "/usr/bin/sb_server.py", line 101, in ? import spambayes.message File "/usr/lib/python2.2/site-packages/spambayes/message.py", line 99, in ? from spambayes.tokenizer import tokenize File "/usr/lib/python2.2/site-packages/spambayes/tokenizer.py", line 22, in ? from spambayes.Options import options File "/usr/lib/python2.2/site-packages/spambayes/Options.py", line 33, in ? defaults = { NameError: name 'False' is not defined If I put the True=1 False=0 lines back in Options.py sb_server.py works fine (at least, lynx displays a page for it). > > "cat sb_test_email | sb_filter.py -d > > ~/.hammiedb". That dies with teh complaint about False again. > > Again, if you had the traceback, that would be helpful. There's a lot > of code scattered about that's meant to provide False/True if it doesn't > exist, which should be working... Traceback (most recent call last): File "/usr/bin/sb_filter.py", line 187, in ? main() File "/usr/bin/sb_filter.py", line 157, in main h.usedb = True NameError: global name 'True' is not defined I'm still intent on getting it working on my system, so I'll hopefully have some more reports shortly. regards, Ian SMith -- |\ /| no .sig |o o| |/ \| From ccherrry at hotmail.com Sun Sep 21 10:04:58 2003 From: ccherrry at hotmail.com (s) Date: Sun Sep 21 10:03:12 2003 Subject: [Spambayes] (no subject) Message-ID: hey - could you guys please think about enhancing spambayes so that non-technical types can use it? i.e. full self-installation, faq's written in non-techno-speak, etc. broaden your audience! thanks. :) --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.509 / Virus Database: 306 - Release Date: 8/13/03 From ff at breeders.net Sun Sep 21 10:30:00 2003 From: ff at breeders.net (Frank Fleischer) Date: Sun Sep 21 10:30:09 2003 Subject: [Spambayes] Outlook upgrade Message-ID: <001801c3804c$ddcd3970$6501a8c0@master> I sent the following message yesterday and it was included in Issue 63, Item 5. Later I received Issue 64, with no response to my posting. I've normally very patient, but I saw immediate responses to other new queries, but none to mine. Have I done something wrong? I'm getting killed by spam while waiting. Here again is my question: I installed Spambayes addin to Outlook 2000 and it was working beautifully. I've just upgraded from Outlook 2000 to Outlook xp and now when I click on the button "Delete as Spam", I get an Invalid Configuration message box that says "You must configure the spam folder." I've tried all kinds of things (reinstall, retrain) and nothing helps. Help, please! Thanks. Frank Fleischer, ff@breeders.net Dog breeders galore at www.breeders.net Dog breed guides at www.doggies.com The Straight Poop about dogs at www.straight-poop.com From tim.one at comcast.net Sun Sep 21 11:30:35 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 21 11:30:38 2003 Subject: [Spambayes] Deleting Spam Rather than Moving it In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1303599368@its-xchg4.massey.ac.nz> Message-ID: [Marc M] >> After fiddling> with it I don't beleive there is a way to set > Auto-Archive for ONLY the Spam folder (BTW, I am using Outlook 2000). [Tony Meyer] > Hmm. I don't get that message at all, and can easily have only the > one folder auto-archive. However, I'm using Outlook 2002/XP, so maybe > that's not the case in 2k (I can't recall, and don't have a copy > around). My Outlook 2000 doesn't complain either if I set it to auto-archive just my Spam folder. I don't *want* my spam automagically removed, so I undid this change immediately after trying it, so can't swear it will work in the end. This is a completely up-to-date Outlook 2K, BTW, including all current Office Update service packs (the latest of which (SP-3) added registry keys so you can selectively disable Outlook's draconian blocking of nearly all attachments -- many people (like me ) feared to install previous service packs because they blocked OL2K attachments without hope of ever getting at them). > ... > We're only talking about 4 clicks (on the folder, ctrl-a, delete, > confirm) after all... You can skip a step by holding down the shift key while clicking delete . From wtrenker at shaw.ca Sun Sep 21 05:51:32 2003 From: wtrenker at shaw.ca (William Trenker) Date: Sun Sep 21 12:55:53 2003 Subject: [Spambayes] 1.06a Upgrade In-Reply-To: <20030920221643.76a1a385.wtrenker@shaw.ca> References: <20030920221643.76a1a385.wtrenker@shaw.ca> Message-ID: <20030921095132.2d7e406c.wtrenker@shaw.ca> On Sat, 20 Sep 2003 22:16:43 +0000 William Trenker wrote regarding [Spambayes] 1.06a Upgrade: > When I install 1.06a will these existing data folder / file names still work with the new sb_server.py script? Looks like I've answered my own question. Inspection of Options.py shows that the pre-1.06a folder and file names are still being used. Eg: ("spam_cache", "Spam cache directory", "pop3proxy-spam-cache", """Directory that SpamBayes should cache spam in. If this does not exist, it will be created.""", PATH, DO_NOT_RESTORE), Bill From ian at astounding.org.uk Sun Sep 21 16:39:02 2003 From: ian at astounding.org.uk (Ian Smith) Date: Sun Sep 21 16:39:16 2003 Subject: [Spambayes] reporet on trying 1.0a6 on SuSE In-Reply-To: Message-ID: On Sun, 21 Sep 2003, Ian Smith wrote: > I'm still intent on getting it working on my system, so I'll hopefully > have some more reports shortly. OK. Back again. Just in case lots of my problems were down to working from teh .zip rather than the .tar.gz, I've started again on my quest. Recap: SuSE 8.0, which has python 2.2 Email package installed from source Previous problems, now trying agin My uninstall process of the .zip based attempts has been a simple rm /usr/bin/sb_* Maybe that was a little rash, but I don't remember any commands starting sb_ before spambayes. My findings: As root: Now using spambayes-1.0a6.1.tar.gz "setup.py install" fails as before regarding False. Fixed with "True=1" "False=0" as before. setup.py now apparently completes happily. As user: "sb_filter.py -n -d .sb_hammie.db" responds "Created new database in /home/ian/.hammiedb". Indeed it has, so it seems -n ignores teh -d. No matter. However, this is good, because last time it got upset with the fact teh scripts were dos format files. Seems that problem was indeed down to working from teh .zip. "sb_mboxtrain.py -d .hammiedb -s ./spam -g ./ham" now apparently runs. This is good - previously it moaned about 'False'. My (naive) deduction is that the True/False problems I had were caused by working from teh .zip archive despite the fact that I'm linux. I have an email in file sb_test_mail, so I try "cat ./sb_test_mail | sb_filter.py -d ./.hammiedb". That fails, upset about True. D'oh. Said it was naive. Traceback (most recent call last): File "/usr/bin/sb_filter.py", line 187, in ? main() File "/usr/bin/sb_filter.py", line 157, in main h.usedb = True NameError: global name 'True' is not defined As root, I add "True=1" and "False=0" to sb_filter.py immediately before "import os". Back as user, "cat ./sb_test_mail | sb_filter.py -d ./.hammiedb" now runs. It is "unsure; 0.69" whether "ORDER YOUR VALIUM and XANAX HERE !!!" might be a spam. I guess my training sample wasn't big enough. Hurrah! I now consider myself a confirmed python hacker. Well, maybe not. Try sb_server.py and connecting with lynx. That works, and lets me set options which subsequently turn up in my settings file, but when I exit and shutdown I get a traceback. Lynx chops it up a bit, but trying to re-assemble it: 500 Server error Traceback (most recent call last): File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "/usr/lib/python2.2/site-packages/spambayes/UserInterface.py", line 477, in onSave self._doSave() File "/usr/lib/python2.2/site-packages/spambayes/UserInterface.py", line 470, in _doSave classifier.store() File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 229, in store self._write_state_key() File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 233, in _write_state_key self.db[self.statekey] = (classifier.PICKLE_VERSION, File "/usr/lib/python2.2/shelve.py", line 77, in __setitem__ self.dict[key] = f.getvalue() TypeError: object does not support item assignment At this point I got in a tangle, I don't know what happened, even though I was certain I was religiously noting everything I did, but somehow everything stopped working. Tracebacks on everything I tried, some related to dbmstorage.py among others. Before writing it up here, however, I checked repeatability by wiping teh lot again and retracing my path, and everything went as above, but at this point carried on working. With a .spambayesrc in place I can now "cat ./sb_test_mail | sb_filter.py" and it works. Next, I bung in a procmail recipe to feed every email to sb_filter.py, but I don't act on teh header it puts in, I just intend to watch what it says manually for a bit: :0 fw: | /usr/bin/sb_filter.py So, I now think I have a useable system - I'm confident I can make procmail do what I want with teh spambayes header, and if needs be I can do training just with sb_mboxtrain.py (I assume - I'm assuming it adds training, not replaces existing training). I don't trust sb_server.py yet, becasue I vaguely blame it for teh tangle I listed above. I will try a more elegant learning scheme, and account that as well, if it helps. regards, Ian SMith -- |\ /| no .sig |o o| |/ \| From Justin at arushgames.com Sun Sep 21 16:55:02 2003 From: Justin at arushgames.com (Justin Chornenky) Date: Sun Sep 21 16:56:51 2003 Subject: [Spambayes] forums? Message-ID: I notice you have like 3 differnt lists depending on what type of question I have, and that none of them seem to be archived. Have you given any thoughts to starting a forum? I know as a user, its a lot more accessable, and it also allows me to go back and look for info, and get up to speed faster. I also think its ironic that to get information on cutting down on email, I have to sign up for three mailing lists :) Anyway, I love this program, and am looking for some way to get involved. I do marketing/packaging/graphic design etc for a videogame publisher currently, so know a bit about the business of marketing software. I am also familiar with the technical side of running and managing forums and communities, online advertising and that type of thing. Let me know if you would be interested in having me involved in the Spam Bayes project. -Justin --------------------- Justin Chornenky Art Director ARUSH Entertainment 480 609 8665 x213 justin@arushgames.com From T.A.Meyer at massey.ac.nz Sun Sep 21 18:12:45 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 21 18:12:59 2003 Subject: [Spambayes] (no subject) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599466@its-xchg4.massey.ac.nz> > hey - could you guys please think about enhancing spambayes > so that non-technical types can use it? i.e. full > self-installation, faq's written in non-techno-speak, etc. > broaden your audience! thanks. Have you tried the 008.1 version of the Outlook plug-in? It seems to have all of this to me. The upcoming binary release of sb_server (i.e. the POP3 proxy) will be reaching towards that level, too, although it (probably) won't have the same level of ease-of-installation immediately. If there's any FAQ that isn't clear, please let us know (with details!) so that we can fix it. It's important to remember that spambayes is an open-source project run by volunteers in their spare time, and that it's only in alpha (i.e. very pre-release). Any contributions (even of ideas, but specifics, please!) are very welcome. A goal is definitely to make it easier to use spambayes, and if you compare (for example) the 1.0a1 release to the 1.0a6 (or the first Outlook plugin to the latest one), I think you'll find that things are definitely moving in that direction. =Tony Meyer From T.A.Meyer at massey.ac.nz Sun Sep 21 18:19:14 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 21 18:19:34 2003 Subject: [Spambayes] forums? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599475@its-xchg4.massey.ac.nz> > I notice you have like 3 differnt lists depending on what > type of question I have, and that none of them seem to be archived. There are five lists, and they are all archived. If you go to: there are links to all of them, including the info pages that have links to the archives. For example, has the archives for this list. > I know as a user, its a lot more accessable, and it also > allows me to go back and look for info, and get up to speed faster. If you google for, for example, "site:mail.python.org spambayes term_I_am_interested in", you should get all the info you need (which is in the archives). > I also think its ironic that to get information on cutting > down on email, I have to sign up for three mailing lists :) You should only need to sign up to one - this one. The -dev list is really only for those interested in the development of spambayes. If you have a development idea, but no interest in doing the development, then you can post it here and one of the developers will notice it (and maybe move discussion to -dev). The check-ins list is really only of use if you are a developer or running from current cvs. The bugs list likewise (if you are monitoring a particular bug, then it's easier to use sourceforge's monitoring system), although if you really want to know what bugs have been found, then you might want to read that list. You might also want to subscribe to the announce list, but it's extremely low volume, and in any case messages to it tend to be cc'd to this list anyway. > Anyway, I love this program, and am looking for some way to > get involved. I do marketing/packaging/graphic design etc for > a videogame publisher currently, so know a bit about the > business of marketing software. In what way would you like to help? The FAQ about "how can I help" has information about the main ways in which people can help - do any of them sound like you (or do any sound definitely not like you?)? One way to start is to consider what would make spambayes better for you (or what would have made it easier to setup for you, or easier for you to find out about it, ...), and figure out a way that we can improve that. > Let me know if you would be interested in having me involved > in the Spam Bayes project. Everyone's welcome! =Tony Meyer From mhammond at skippinet.com.au Sun Sep 21 18:23:01 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun Sep 21 18:22:51 2003 Subject: [Spambayes] Outlook upgrade In-Reply-To: <001801c3804c$ddcd3970$6501a8c0@master> Message-ID: <0dc901c3808e$eba3ae70$f502a8c0@eden> Please ensure you have version 008.1 of the plugin (the latest). When you do, select "SpamBayes Manager", then click the "Filter" tab. From that tab, you must select folders for Ham, Spam and Unsure. It should then work fine. Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Frank Fleischer > Sent: Monday, 22 September 2003 12:30 AM > To: spambayes@python.org > Subject: [Spambayes] Outlook upgrade > > > I sent the following message yesterday and it was included in > Issue 63, Item > 5. Later I received Issue 64, with no response to my posting. > I've normally > very patient, but I saw immediate responses to other new > queries, but none > to mine. Have I done something wrong? I'm getting killed by spam while > waiting. > > Here again is my question: > > I installed Spambayes addin to Outlook 2000 and it was > working beautifully. > I've just upgraded from Outlook 2000 to Outlook xp and now > when I click on > the button "Delete as Spam", I get an Invalid Configuration > message box that > says "You must configure the spam folder." I've tried all > kinds of things > (reinstall, retrain) and nothing helps. > > Help, please! Thanks. > > Frank Fleischer, ff@breeders.net > Dog breeders galore at www.breeders.net > Dog breed guides at www.doggies.com > The Straight Poop about dogs > at www.straight-poop.com > > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From tim.one at comcast.net Sun Sep 21 19:08:55 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 21 19:08:59 2003 Subject: [Spambayes] Re: [OT] New Play: SPAMELOT In-Reply-To: <20030919140952.4a6583b5.wtrenker@shaw.ca> Message-ID: [William Trenker] > ... > I would also nominate Spambayes as Python App of the Year! We're close enough to the end of 2003 that we had better count that vote for 2004 too . > Spambayes certainly is a showcase of the sophistication, and > practicality, that can be achieved using Python 'out of the box'. Those who helped with the early development were probably most impressed by the speed of development Python allowed: the first implementation of spambayes was conceived, written, and working all in one day. It was (of course) much simpler then, had no real email interface apart from sucking up files containing messages, and we'll overlook that the Outlook addin threatens to consume the rest of Mark Hammond's life. Other than those, it went really fast . From Justin at arushgames.com Sun Sep 21 22:06:44 2003 From: Justin at arushgames.com (Justin Chornenky) Date: Sun Sep 21 22:08:29 2003 Subject: [Spambayes] Saving SB Filters In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1303599475@its-xchg4.massey.ac.nz> Message-ID: Bah - didn't realize I was posting to a list. Noob move. Anyway, Rather than save a large sampling of spam (especially those containing virii) to rebuild my SB in case of system failure... is it possible to backup/export the rules/filters that SB runs off of? Could not find reference in FAQ or recent archives. This is a large issue for me, as I work in a testing environment where we often wipe our machines after every production cycle. Or where buggy software does it for us :) I have had to retrain SB a few times now, and would like to start skipping this procedure. Any help is appreciated. -Justin -----Original Message----- From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] Sent: Sunday, September 21, 2003 3:19 PM To: Justin Chornenky; spambayes@python.org Subject: RE: [Spambayes] forums? > I notice you have like 3 differnt lists depending on what > type of question I have, and that none of them seem to be archived. There are five lists, and they are all archived. If you go to: there are links to all of them, including the info pages that have links to the archives. For example, has the archives for this list. > I know as a user, its a lot more accessable, and it also > allows me to go back and look for info, and get up to speed faster. If you google for, for example, "site:mail.python.org spambayes term_I_am_interested in", you should get all the info you need (which is in the archives). > I also think its ironic that to get information on cutting > down on email, I have to sign up for three mailing lists :) You should only need to sign up to one - this one. The -dev list is really only for those interested in the development of spambayes. If you have a development idea, but no interest in doing the development, then you can post it here and one of the developers will notice it (and maybe move discussion to -dev). The check-ins list is really only of use if you are a developer or running from current cvs. The bugs list likewise (if you are monitoring a particular bug, then it's easier to use sourceforge's monitoring system), although if you really want to know what bugs have been found, then you might want to read that list. You might also want to subscribe to the announce list, but it's extremely low volume, and in any case messages to it tend to be cc'd to this list anyway. > Anyway, I love this program, and am looking for some way to > get involved. I do marketing/packaging/graphic design etc for > a videogame publisher currently, so know a bit about the > business of marketing software. In what way would you like to help? The FAQ about "how can I help" has information about the main ways in which people can help - do any of them sound like you (or do any sound definitely not like you?)? One way to start is to consider what would make spambayes better for you (or what would have made it easier to setup for you, or easier for you to find out about it, ...), and figure out a way that we can improve that. > Let me know if you would be interested in having me involved > in the Spam Bayes project. Everyone's welcome! =Tony Meyer From mark_fitzgerald at keybank.com Mon Sep 22 06:43:40 2003 From: mark_fitzgerald at keybank.com (mark_fitzgerald@keybank.com) Date: Mon Sep 22 06:43:46 2003 Subject: [Spambayes] question on DB In-Reply-To: Message-ID: I installed the outlook plug-in, and trained it. Now, I'm wondering if there is any program that will allow me to access the .db file in a meaningful manner. What I'm trying to do is get an aggregate list of words in ALL my e-mail messages, along with their relative spamminess. Is there a way? Thanks. PS - I posted this when I first joined the list, but I haven't seen it in any of the 4 digests I've received so far. - Mark From MMARTINEZ at CSREES.USDA.GOV Mon Sep 22 08:03:09 2003 From: MMARTINEZ at CSREES.USDA.GOV (Martinez, Michael) Date: Mon Sep 22 08:04:19 2003 Subject: [Spambayes] Re: [OT] New Play: SPAMELOT Message-ID: <83EED274D3127740995A6B621B88F2E1DF7D2F@csrees-exchange.csrees.usda.gov> I love Spambayes. I wouldn't trade it for any other app. Michael Martinez Linux System Administrator ISTM/CSREES United States Department of Agriculture -----Original Message----- From: Tim Peters [mailto:tim.one@comcast.net] Sent: Sunday, September 21, 2003 7:09 PM To: python-list@python.org Cc: spambayes@python.org Subject: RE: [Spambayes] Re: [OT] New Play: SPAMELOT [William Trenker] > ... > I would also nominate Spambayes as Python App of the Year! We're close enough to the end of 2003 that we had better count that vote for 2004 too . > Spambayes certainly is a showcase of the sophistication, and > practicality, that can be achieved using Python 'out of the box'. Those who helped with the early development were probably most impressed by the speed of development Python allowed: the first implementation of spambayes was conceived, written, and working all in one day. It was (of course) much simpler then, had no real email interface apart from sucking up files containing messages, and we'll overlook that the Outlook addin threatens to consume the rest of Mark Hammond's life. Other than those, it went really fast . _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From MMARTINEZ at CSREES.USDA.GOV Mon Sep 22 08:44:31 2003 From: MMARTINEZ at CSREES.USDA.GOV (Martinez, Michael) Date: Mon Sep 22 08:43:17 2003 Subject: [Spambayes] Server side instructions for qmail Message-ID: <83EED274D3127740995A6B621B88F2E101274213@csrees-exchange.csrees.usda.gov> I noticed that the FAQ contains "Postfix notes" on a server side solution: (). I'd like to submit the following "Qmail notes": Spambayes is installed on our agency's smtp / MX gateway. This machine runs Redhat Linux 7.1, qmail 1.03, qmail-scanner 1.16, and hbedv's "Antivir". Incoming mail is accepted by tcpserver and handed off to qmail-scanner. Qmail-scanner runs the virus software ("antivir") and hands the message to qmail. Qmail accepts local delivery on all domain-bound email. This email is delivered to ~alias/.qmail-default. (This is a standard configuration for qmail). ~alias/.qmail-default pipes each email through Spambayes. The .qmail-default is set up as follows: | /usr/local/spambayes/hammiefilter.py -d /usr/local/spambayes/.hammiedb | qmail-remote MSServer.csrees.usda.gov "$SENDER" $DEFAULT@csrees.usda.gov The permissions for the /usr/local/spambayes directory are set with the following command: chown -R qmailq.qmail /usr/local/spambayes As shown above, there are two pipes. The first pipes it through Spambayes. The second pipes it through qmail's remote delivery mechanism, which delivers the email to our Exchange Server. Delivered emails are filtered on a per-user basis in Outlook by setting the Rules to detect the Spambayes tag in the message header. If the tag reads "Spambayes-Classification: spam" then the email is either deleted or placed in the user's Spam folder. If it reads "Spambayes-Classification: unsure" then it's placed in the user's Unsure folder. If it reads "Spambayes-Classification: ham" then nothing special is done - it is delivered to the user's Inbox as normal. The user is given the choice of whether to set up his rules or not. Training of Spambayes is done in the following manner. Our users are given my email address and are told that, if they like, they may send emails to me that they consider spam, or that end up being "mis-classified" by the system. I created two directories: /usr/local/spambayes/training/spamdir /usr/local/spambayes/training/hamdir The emails sent to me by the users are retrieved from the qmail archive and placed into the appropriate directory. When I'm ready to do a training (which I do once or twice a month), I run the following commands: 1. I use a simple script to insert a blank From: line at the top of each email 2. I use a simple script to remove the qmail-scanner header from the bottom of each email. 3. uuencoded attachments are removed 4. cat /usr/local/spambayes/training/spamdir/* >> /usr/local/spambayes/training/spam 5. cat /usr/local/spambayes/training/hamdir/* >> /usr/local/spambayes/training/ham 6. /usr/local/spambayes/mboxtrain -d /usr/local/spambayes/.hammiedb -g /usr/local/spambayes/training/ham -s /usr/local/spambayes/training/spam (Step #6 can be run without shutting down qmail.) Most of the time, emails that are sent to me are clearly discernible as to whether they are spam or not. Occasionally there is an email that is borderline, or that one person considers spam but others don't. This is usually things like newsletter subscriptions or religious forums. In this case, I follow my own rule that if there is at least one person in the agency who needs or wants to receive this type of email, and as long as it is non-offensive, work-related, or there are a lot of people in the agency who have an interest in the topic, then I will either train it as ham, or, if it's already being tagged ham, leave it. An example of this are emails that discuss religious topics. There are a lot of people in this agency who are subscribed to religious discussion groups, so in my mind, it's good practice to make sure these messages are not tagged Spam. The above system works well on several levels. It's manageable because there's a central location for training and tagging spam (the smtp server). It's manageable also because our IT PC Support staff does not have to install Spambayes on each PC nor train all of our user's on its use. If a user does not like the way our system tags the emails, he does not have to set up his Outlook rules. But, we've had a good response from the users who are using their Rules. They're willing to put up with one or two mis-classified emails in order to have 95% of their junk email not in their Inbox. Michael Martinez Linux System Administrator ISTM/CSREES United States Department of Agriculture From MMARTINEZ at CSREES.USDA.GOV Mon Sep 22 08:57:25 2003 From: MMARTINEZ at CSREES.USDA.GOV (Martinez, Michael) Date: Mon Sep 22 08:56:10 2003 Subject: [Spambayes] Are there plans for a daemonized or compiled versionofSpambayes? Message-ID: <83EED274D3127740995A6B621B88F2E101274214@csrees-exchange.csrees.usda.gov> Let's discuss this a little. I haven't looked in depth at the Spambayes code. But being the sysadmin I'm able to look at the processes running on the system. It appears that, when an email is scanned, multiple python threads get forked. Presumably this is because "hammiefilter.py" runs other *.py scripts, or exec's multiple pythons. (True? Not true?) Assuming that's what's happening, I guess I was wondering if it would be beneficial, in the sense of being less demanding on system resources, to consolidate all the routines into a single python thread? Is this feasible and worthwhile? I'm harping on the multiple-thread issue, because, the thing that happens with high email volume is that the number of python processes grows exponentially. Another thing I'm thinking about doing to mitigate the impact on resources, is running the hammiefilter in ramdisk. Suggestions are welcome. Michael Martinez Linux System Administrator ISTM/CSREES United States Department of Agriculture -----Original Message----- From: Tim Peters [mailto:tim.one@comcast.net] Sent: Saturday, September 20, 2003 6:36 PM To: Martinez, Michael Cc: spambayes@python.org; spambayes-dev@python.org Subject: RE: [Spambayes] Are there plans for a daemonized or compiled versionofSpambayes? [Martinez, Michael] > I've been running Spambayes on our agency Linux smtp gateway for > several months and very happy with its classification of spam. My > gateway is a qmail system and it pipes all incoming email through the > hammiefilter prior to delivery. Yup, running a distinct classifier for each email is a pretty crazy design for high-volume use. > However, a performance problem arises when the gateway gets hit during > peak hours with a lot of emails. What happens is the system slows down > tremendously, in part due to the number of python instances that get > forked in order to scan the emails. > > I was wondering: are there any plans to develop a lightweight, > daemonized version of Spambayes? The answer to that depends on you too: what are your plans? Python is a C program, and can be daemonized like any other. Note the project's pspam directory sets up a classifier backed by a ZODB database, which can be attached to via opening a ZEO connection. That would be a pleasant way to let multiple clients hook up at will to an always-running classifier. > In the same vein, are there plans to port it to C or another compiled > language? AFAICT, the most expensive part of running spambayes now is running Berkeley database lookups, and the Sleepycat bsddb implementation is already written in C. So profile before you presume to know what would help. Based on what I've measured, my interest in recoding any of the rest in C is nil. > How difficult would this be? It would be extremely tedious. You don't escape the needs for a database, for I/O, or for a variety of complex string-processing operations. The parts of the Python implementation that supply those to Python programmers are already coded in C, but much easier to use from Python than from C. From MMARTINEZ at CSREES.USDA.GOV Mon Sep 22 08:58:02 2003 From: MMARTINEZ at CSREES.USDA.GOV (Martinez, Michael) Date: Mon Sep 22 08:56:48 2003 Subject: [Spambayes] Are there plans for a daemonized or compiled version of Message-ID: <83EED274D3127740995A6B621B88F2E1DF7D32@csrees-exchange.csrees.usda.gov> I just posted my server config to the spambayes-dev list. Michael Martinez Linux System Administrator Marlaw Systems Technology Inc. ISTM/CSREES United States Department of Agriculture (202) 720-6223 -----Original Message----- From: paulo@sakamoto.com.br [mailto:paulo@sakamoto.com.br] Sent: Saturday, September 20, 2003 6:29 PM To: spambayes@python.org Cc: Martinez, Michael Subject: [Spambayes] Are there plans for a daemonized or compiled version of Hi, Could you show us your server configuration? I also run a qmail server for a school and would like to use spambayes at the server. Thanks, Paulo Sakamoto From Kirchel at gutenberg-rz.de Mon Sep 22 09:35:21 2003 From: Kirchel at gutenberg-rz.de (Kirchel, Oliver) Date: Mon Sep 22 09:35:35 2003 Subject: [Spambayes] Problem to start User Interface ... Message-ID: <8B5CA4B4C955D61189800002A5876765E90C3D@vgmex001> Hi to all, I have a problem so start the User Interface. If I type ./pop3proxy.py -d hammie.db ... .. I get Loading database... User interface url is http://localhost:8880/ But I saw in another post that should be Loading database ... Done. What's my problem ? Thanks in advance From skip at pobox.com Mon Sep 22 12:08:09 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 22 12:08:22 2003 Subject: [spambayes-dev] RE: [Spambayes] Are there plans for a daemonized or compiled versionofSpambayes? In-Reply-To: <83EED274D3127740995A6B621B88F2E101274214@csrees-exchange.csrees.usda.gov> References: <83EED274D3127740995A6B621B88F2E101274214@csrees-exchange.csrees.usda.gov> Message-ID: <16239.7785.146033.172889@montanaro.dyndns.org> Michael> I haven't looked in depth at the Spambayes code. But being the Michael> sysadmin I'm able to look at the processes running on the Michael> system. It appears that, when an email is scanned, multiple Michael> python threads get forked. Presumably this is because Michael> "hammiefilter.py" runs other *.py scripts, or exec's multiple Michael> pythons. (True? Not true?) Not true I don't think. Michael> Assuming that's what's happening, I guess I was wondering if it Michael> would be beneficial, in the sense of being less demanding on Michael> system resources, to consolidate all the routines into a single Michael> python thread? Is this feasible and worthwhile? I tried it quite awhile ago, but didn't code the front-end client in C, just Python. One problem is that you substitute network overhead for startup overhead. Assuming you maintain the long-running process as a Python program, you can try a couple things: 1 write a front-end client in Python and use a very simple protocol to communicate with the server (maybe a byte count followed by the message). The server would either spit back the message augmented with the usual scoring headers or just the score information, relying on the client to embellish the message. 2 If (and only if) the above isn't fast enough, write the simplest front-end client you can in C to avoid Python startup overhead. The first one will give you some idea what you're up against. Python's startup is probably the bottleneck, so I'm skeptical that the first option will gain you anything besides an architecture which is simple to experiment with. The Python-based server scores messages very quickly once the startup overhead is out of the way. Michael> Another thing I'm thinking about doing to mitigate the impact Michael> on resources, is running the hammiefilter in ramdisk. It probably won't buy you much, but it's a simple enough thing to try. Make sure you copy your database (pickle or bsddb file) to ramdisk as well. Skip From aflesch at esedona.net Mon Sep 22 12:18:29 2003 From: aflesch at esedona.net (Anthony Flesch) Date: Mon Sep 22 12:15:31 2003 Subject: [Spambayes] Kudos, suggestion Message-ID: <005901c38125$290878e0$0201a8c0@sedonadirect> Thanks so much for creating SpamBayes! It's the only thing I've found (after trying quite a few of the top-rated programs) that really works. Suggestion for a future version (Outlook plugin): how about the ability to delete the contents of the designated spam folder (similar to the Outlook "empty deleted items folder". ? Anthony Anthony Flesch 928-204-1872, Fax 928-204-2715 From skip at pobox.com Mon Sep 22 13:03:27 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 22 13:03:43 2003 Subject: [Spambayes] (no subject) In-Reply-To: References: Message-ID: <16239.11103.512103.796556@montanaro.dyndns.org> ccherrry> hey - could you guys please think about enhancing spambayes so ccherrry> that non-technical types can use it? i.e. full ccherrry> self-installation, faq's written in non-techno-speak, etc. ccherrry> broaden your audience! One problem is that we are all techno geeks, so to our warped brains it looks pretty reasonable. ;-) If you could spend a little time suggesting specific improvements with the programs or documentation, it would be much appreciated. Skip From rb at galaxyflyer.com Mon Sep 22 13:12:44 2003 From: rb at galaxyflyer.com (Robert Barnett) Date: Mon Sep 22 13:12:49 2003 Subject: [Spambayes] Feature Request Message-ID: <009801c3812c$c0910720$dd7f5142@windowsxp> Hello, I have just started using SpamBayes and so far like it a lot. I think you are all doing a fantastic job and I want to thank you for doing such a much needed program. My feature request is quite simple. I would like the option of having spam sent directly to the Deleted Items folder in Outlook instead of Junk E-mail. While I can change the directory or folder that it goes in to the Deleted Items directory/folder isn't available as an option. Another option I would like to see added is to just have SpamBayes delete the message completely. That is it is like going to the Deleted Items directory/folder and then deleting the message from there. Thanks, Robert Barnett From rmalayter at bai.org Mon Sep 22 13:46:18 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Sep 22 13:46:21 2003 Subject: [Spambayes] Messages fail to filter Message-ID: <792DE28E91F6EA42B4663AE761C41C2A0111521E@cliff.bai.org> From: Rick Reid [mailto:Rick.Reid@uk.anker-systems.com] > I trained with about 100 spam and 900+ ham. > I can certainly collect a lot more spam if > that's what's required. First of all, I have my system trained on 2800 spams, and about 6000 hams. Basically, you need enough spams so that nothing appears too "new" to the filter. Still, you should be getting some filtering with a 100 spam training set, and spambayes should automatically filter. > When I use the manual option either to score > or score & filter, it seems to do so almost > perfectly. So why doesn't it do this automatically > when mail arrives in my inbox? I have 'enable > Spambayes' checked. Did you try adding a processing start delay (say 10 seconds) on the advanced tab of the spamBayes manager? Perhaps Outlook's running of its rules is stepping on SpamBayes. Also, make sure your Inbox folder is one of the "monitored" folders, and that your thresholds aren't set too high. Your filtering tab of SpamBayes Manager should look something very much like this: http://malayter.com/SpamBayesSettings.gif Regards, Ryan From rmalayter at bai.org Mon Sep 22 13:47:26 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Sep 22 13:47:29 2003 Subject: [Spambayes] Messages fail to filter Message-ID: <792DE28E91F6EA42B4663AE761C41C2AF3D428@cliff.bai.org> From: Rick Reid [mailto:Rick.Reid@uk.anker-systems.com] > BTW Do you keep those 2800 spams around in your > email client? I thought it would be safe to dlete > them once Spambayes had processed them. I keep them around, for three reasons: 1) my spamBayes DB could get corrupted and I need to re-train from scratch (this has never happened to me yet, but others have reported the issue with earlier versions of SpamBayes) 2) I test out other spam filters with them, including mail-server-level filters that we use across our whole organization 3) They only take up 11 MB, and we have our mailbox limits set at 320 MB, so I have plenty of room without resorting to a separate PST file From skip at pobox.com Mon Sep 22 14:18:39 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 22 14:18:59 2003 Subject: [Spambayes] Messages fail to filter In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A0111521E@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2A0111521E@cliff.bai.org> Message-ID: <16239.15615.496314.642535@montanaro.dyndns.org> >> I trained with about 100 spam and 900+ ham. I can certainly collect >> a lot more spam if that's what's required. I would start out by randomly deleting most of the ham so that your ham and spam collections are more nearly balanced, then retrain on the result. I don't know what you're using to read mail (Outlook? I'm actually replying to Ryan's note), so I can't suggest the best way to do that. Skip From lists at strisik.com Mon Sep 22 14:51:48 2003 From: lists at strisik.com (Peter Strisik) Date: Mon Sep 22 14:52:31 2003 Subject: [Spambayes] Messages fail to filter In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A0111521E@cliff.bai.org> Message-ID: <002001c3813a$95c94df0$1802a8c0@PHS> New to Spambayes (Outlook plugin .81) myself. I found that my messages didn't filter automatically until I _deselected_ the processing delay. Also, I notice that there is nothing in message headers that reflects Spambayes activity even though messages are being scored and filtered! ....Peter On Monday, September 22, 2003 9:46 AM, Ryan Malayter <> wrote, at least in part: > From: Rick Reid [mailto:Rick.Reid@uk.anker-systems.com] >> I trained with about 100 spam and 900+ ham. >> I can certainly collect a lot more spam if >> that's what's required. > > First of all, I have my system trained on 2800 spams, and about 6000 > hams. Basically, you need enough spams so that nothing appears too > "new" to the filter. Still, you should be getting some filtering with > a 100 spam training set, and spambayes should automatically filter. > >> When I use the manual option either to score >> or score & filter, it seems to do so almost >> perfectly. So why doesn't it do this automatically >> when mail arrives in my inbox? I have 'enable >> Spambayes' checked. > > Did you try adding a processing start delay (say 10 seconds) on the > advanced tab of the spamBayes manager? Perhaps Outlook's running of > its rules is stepping on SpamBayes. > > Also, make sure your Inbox folder is one of the "monitored" folders, > and that your thresholds aren't set too high. Your filtering tab of > SpamBayes Manager should look something very much like this: > http://malayter.com/SpamBayesSettings.gif > > Regards, > Ryan From bic3 at pacbell.net Mon Sep 22 15:06:32 2003 From: bic3 at pacbell.net (Broome Int'l Consortium) Date: Mon Sep 22 15:07:13 2003 Subject: [Spambayes] SpamBayes Outlook Plugin Error Message: "Failed to call Message-ID: <000001c3813c$a34c18c0$0700a8c0@bic> Hello: Hi, I've been using the Spambayes Outlook plugin for a while now with no problems (and I think it's great!) but just today it has stopped loading with Outlook. When I check the Com Add-ins panel under Tools..Options..Other..Advanced Options I am informed that SpamBayes failed to load due to a runtime error. Is there anyway to fix this? Thanks, Shop untill you drop! Fermon Ivory Broome CEO 2010 N. Beachwood Dr Suite # 202 Hollywood, CA 9068 Sales@ShopBuyWeb.com tel: fax: (888)589-3558 (775)640-9846 Powered by Plaxo Want a signature like this? -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 6246 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030922/e136cae5/attachment-0001.gif From rmalayter at bai.org Mon Sep 22 15:31:55 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Sep 22 15:32:03 2003 Subject: [Spambayes] Messages fail to filter Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01115223@cliff.bai.org> From: Peter Strisik [mailto:lists@strisik.com] > New to Spambayes (Outlook plugin .81) myself. > I found that my messages didn't filter automatically > until I _deselected_ the processing delay. I cannot repeat that behavior on either of my systems, both of which run Windows XP SP1 and Outlook XP SP2 (plus all security patches). The delay processing option seems to work just fine. > Also, I notice that there is nothing in message > headers that reflects Spambayes activity even > though messages are being scored and filtered! This is by design. The Outlook plug-in creates a new Outlook message field called "Spam", formatted as a percentage. It adds nothing to the header. There are instructions for adding the spam field to your Outlook views in the Spambayes FAQ. From lists at strisik.com Mon Sep 22 15:45:25 2003 From: lists at strisik.com (Peter Strisik) Date: Mon Sep 22 15:46:02 2003 Subject: [Spambayes] Messages fail to filter In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A01115223@cliff.bai.org> Message-ID: <002c01c38142$139592a0$1802a8c0@PHS> Ryan, I will experiment with the processing delay vs. filtering or not after I accumulate more spam to train on. Thanks for clarifying about the headers. I can put that one to rest. I already have the Spam column defined and showing. ......Peter On Monday, September 22, 2003 11:32 AM, Ryan Malayter wrote, at least in part: > From: Peter Strisik [mailto:lists@strisik.com] >> New to Spambayes (Outlook plugin .81) myself. >> I found that my messages didn't filter automatically >> until I _deselected_ the processing delay. > > I cannot repeat that behavior on either of my systems, both of which > run Windows XP SP1 and Outlook XP SP2 (plus all security patches). > The delay processing option seems to work just fine. > >> Also, I notice that there is nothing in message >> headers that reflects Spambayes activity even >> though messages are being scored and filtered! > > This is by design. The Outlook plug-in creates a new Outlook message > field called "Spam", formatted as a percentage. It adds nothing to > the header. > > There are instructions for adding the spam field to your Outlook > views in the Spambayes FAQ. From Jeremy at Zawodny.com Mon Sep 22 18:02:27 2003 From: Jeremy at Zawodny.com (Jeremy Zawodny) Date: Mon Sep 22 18:03:34 2003 Subject: [Spambayes] sb_xmlrpcserver.py problems... Message-ID: <20030922220227.GA19503@thermal> As a newcomer to the SpamBayes world, I'm impressed. I've been using SpamAssassin for a while and have found that SpamBayes does an equally excellent job. The two of them together catch nearly everything. Now I'm looking to setup SpamBayes on a larger scale (for an ISP). I see that the combination of sb_client.py and sb_xmlrpcserver.py are the "spamc/spamd" of SpamBayes. However, the docs are sb_xmlrpcserver.py are pretty sparse. I've looked around the CVS tree and searched on-line but haven't found much help yet. When I try running a server: /usr/local/bin/sb_xmlrpcserver.py -d 127.0.0.1:65000 I get: Traceback (most recent call last): File "/usr/local/bin/sb_xmlrpcserver.py", line 42, in ? DEFAULTDB = hammie.DEFAULTDB AttributeError: 'module' object has no attribute 'DEFAULTDB' My Python is pretty weak (I've only begun learning). So I'm not sure where to go with this. I have created the database using "/usr/local/bin/sb_filter.py -n" and I have .spambayesrc file in the "spam" user's directory: [Storage] persistent_use_database = True persistent_storage_file = /home/5/spam/.hammiedb Can anyone shed some light onto what else I need to do? Thanks, Jeremy -- Jeremy D. Zawodny | Perl, Web, MySQL, Linux Magazine, Yahoo! | http://jeremy.zawodny.com/ From aaraines at pobox.com Mon Sep 22 18:16:03 2003 From: aaraines at pobox.com (Andrew A. Raines) Date: Mon Sep 22 18:15:48 2003 Subject: [Spambayes] Comparable bogoutil functionality? Message-ID: What's a spambayes-y or Python-y way to implement bogoutil for my hammiedb? It's a convenient way of checking to see what bogofilter thinks about a token. For instance: $ bogoutil -w ~/.bogofilter penis spam good penis 3393 0 $ bogoutil -w ~/.bogofilter viagra spam good viagra 64 0 $ bogoutil -w ~/.bogofilter raines spam good raines 0 171 See also . Is this only possible by enabling debug output, piping a message through hammiefilter, and examining the resultant header? -Drew From w9fz at ix.netcom.com Mon Sep 22 19:57:26 2003 From: w9fz at ix.netcom.com (Bruce Richardson) Date: Mon Sep 22 19:57:28 2003 Subject: [Spambayes] Starting as a Win2k service again? Message-ID: <005301c38165$473a29e0$06d2a8c0@thehip> Hi Can you point me to the part in the FAQ that talks about starting as a service in Win2k??? I couldn't find it. Do I go to Administrative Tools? Then what? Bruce Richardson - W9FZ From T.A.Meyer at massey.ac.nz Mon Sep 22 23:42:26 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 22 23:42:52 2003 Subject: [Spambayes] Comparable bogoutil functionality? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599A80@its-xchg4.massey.ac.nz> > What's a spambayes-y or Python-y way to implement bogoutil > for my hammiedb? It's a convenient way of checking to see > what bogofilter thinks about a token. For instance: > > $ bogoutil -w ~/.bogofilter penis > spam good > penis 3393 0 > $ bogoutil -w ~/.bogofilter viagra > spam good > viagra 64 0 > $ bogoutil -w ~/.bogofilter raines > spam good > raines 0 171 It may not be the best way, but one way is to: $ sb_server.py -b Then in the 'word query' box at the bottom of the web page that opens, type your word and press enter, and there you go. It would be easy enough to copy out the code that does this if you wanted a script that would do it. Something like (untested): >>> from spambayes.storage import open_storage >>> db = open_storage(filename[, use_db, mode]) >>> wordinfo = db._wordinfoget(word) >>> if wordinfo: print "%s\t%d\t%d\t%d" % (word, wordinfo.spamcount, wordinfo.hamcount, db.probability(wordinfo) HTH. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 22 23:48:58 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 22 23:49:25 2003 Subject: [Spambayes] Problem to start User Interface ... Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599A8F@its-xchg4.massey.ac.nz> > I have a problem so start the User Interface. > > If I type ./pop3proxy.py -d hammie.db ... > > .. I get Loading database... User interface url is > http://localhost:8880/ Have you tried opening a browser window to ? (Note if you add the "-b" switch, one should open for you). > But I saw in another post that should be Loading database ... Done. Nope - pop3proxy will only print out "loading" (and where it is loading from if verbose is True), and not done. You probably saw a post about imapfilter, which does print "done" (yes, they should probably be consistent...). =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 22 23:53:30 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 22 23:53:37 2003 Subject: [Spambayes] question on DB Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599A91@its-xchg4.massey.ac.nz> > I installed the outlook plug-in, and trained it. Now, I'm > wondering if there is any program that will allow me to access > the .db file in a meaningful manner. What I'm trying to do is > get an aggregate list of words in ALL my e-mail messages, along > with their relative spamminess. Is there a way? You'll need a copy of the spambayes source (1.0a6, for example) to do this. If you open up a console window and navigate to the place to expanded the archive to, run (your path may be different): c:\spambayes\> set pythonpath=c:\spambayes c:\spambayes\> scripts\sb_dbexpimp.py -e -d mybayes.db -f mybayes.db.export Where mybayes.db is the default_bayes_database.db file, and mybayes.db.export is a 'flat', '`' separated text file that you can open in whatever you like. Note that you'll also need to have Python installed to do this. If you feel that there is the need for a binary version of this tool, please open a feature request via sourceforge, and it'll no doubt get added at some point. =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 23 00:02:30 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 23 00:04:08 2003 Subject: [SPAM] [Spambayes] Starting as a Win2k service again? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599AA1@its-xchg4.massey.ac.nz> > Can you point me to the part in the FAQ that talks about > starting as a service in Win2k??? I couldn't find it. > > Do I go to Administrative Tools? Then what? You need to run "windows\pop3proxy_service.py -install", and then you can start it however you want. You can do it manually, or set it to start automatically by going to Administrative Tools and then Services (this is XP, I guess 2k is the same), or do "net start pop3proxy" in a console window, or "windows\pop3proxy_service.py start". (Note that the "windows" directory here is the one in the spambayes directory, not the one with all the MS Windows stuff). =Tony Meyer From syver-en at online.no Tue Sep 23 05:36:46 2003 From: syver-en at online.no (Syver Enstad) Date: Tue Sep 23 05:35:45 2003 Subject: [Spambayes] 500 Server error training in the web interface. Message-ID: After using the web interface for more or less time this message shows up when pressing the train button. If I refresh the message is not there anymore. When the error first has occurred it seems to repeat itself until I restart the SpamBayes server. Training... Trained on 1 message. Saving... 500 Server error Traceback (most recent call last): File "d:\devtools\python23\Lib\site-packages\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "d:\devtools\python23\Lib\site-packages\spambayes\ProxyUI.py", line 324, in onReview self._doSave() File "d:\devtools\python23\Lib\site-packages\spambayes\UserInterface.py", line 470, in _doSave classifier.store() File "d:\devtools\python23\Lib\site-packages\spambayes\storage.py", line 229, in store self._write_state_key() File "d:\devtools\python23\Lib\site-packages\spambayes\storage.py", line 233, in _write_state_key self.db[self.statekey] = (classifier.PICKLE_VERSION, File "D:\devtools\Python23\lib\shelve.py", line 130, in __setitem__ self.dict[key] = f.getvalue() TypeError: object does not support item assignment -- Vennlig hilsen Syver Enstad From syver-en at online.no Tue Sep 23 06:06:49 2003 From: syver-en at online.no (Syver Enstad) Date: Tue Sep 23 06:05:54 2003 Subject: [Spambayes] Slipup wrongly trained mail Message-ID: What do I do if I have wrongly classified a mail as spam, when I should have classified it as ham? I just sent the mail to the smtp proxy once more as ham this time, would this fix the training or do I have to do anything else fix this? -- Vennlig hilsen Syver Enstad From syver-en at online.no Tue Sep 23 07:43:58 2003 From: syver-en at online.no (Syver Enstad) Date: Tue Sep 23 07:42:57 2003 Subject: [Spambayes] 500 Server error training in the web interface. In-Reply-To: References: Message-ID: Syver Enstad writes: > After using the web interface for more or less time this message shows > > up when pressing the train button. If I refresh the message is not > there anymore. When the error first has occurred it seems to repeat > itself until I restart the SpamBayes server. Could it be that training fails after changing the options in the web interface? It seems like that. I've used the web interface a long time now, classifying mails that I've gotten and it hasn't failed once. Then I changed the advanced options to put the score into the headers of received mail. Then it started failing on training. It also fails with a 500 but without a stacktrace on the "Train on a message, mbox file or dbx file" and "Classify a message". Yes additional experimentation seems to confirm this. It seems to me that the bug is that after saving options via the web interface, spambayes must be restarted, because all functionality except saving options will generate a 500 Server error with or without a stack trace. Another problem I encountered is that my mail program will hang if I try to post a ham message to spambayes_ham@localhost with the id option turned on and the ham message doesn't have an id in its headers (because it was received before I installed Spambayes). -- Vennlig hilsen Syver Enstad From syver-en at online.no Tue Sep 23 08:22:39 2003 From: syver-en at online.no (Syver Enstad) Date: Tue Sep 23 08:21:37 2003 Subject: [Spambayes] pop3proxy_tray.pyw Message-ID: If I do save and shutdown from the webinterface, I have to exit pop3proxy_tray.py(w) to restart it. -- Vennlig hilsen Syver Enstad From DarkEmperor at gmx.net Tue Sep 23 09:38:28 2003 From: DarkEmperor at gmx.net (Emperor) Date: Tue Sep 23 09:38:52 2003 Subject: [Spambayes] SpamBayes config error Message-ID: <1064324242.5787.14.camel@nils> Hi, I just installed spambayes and it works... but I still have a problem: When I installed it I had python 2.2.2 and spambayes didn't work with it, so I upgraded to 2.2.3. I runned it the first time from extracted archive dir... and since then the web UI has the wrong config path in it. As I already deleted the extracted stuff (after install), there is no file there and so I can't save the config from the web interface (and if I try the settings that are correct on start are messed up). So my question is where does spambayes save the location of the config file when I run it through sb_server.py? Thx, Nils From mark_fitzgerald at keybank.com Tue Sep 23 11:04:45 2003 From: mark_fitzgerald at keybank.com (mark_fitzgerald@keybank.com) Date: Tue Sep 23 11:10:29 2003 Subject: [Spambayes] error on attempting to export data In-Reply-To: Message-ID: I'm trying to use: C:\spambayes>sb_dbexpimp.py -e -d default_bayes_database.db -f mybayes.db.export and the error i get is: Traceback (most recent call last): File "C:\spambayes\sb_dbexpimp.py", line 100, in ? import spambayes.storage ImportError: No module named spambayes.storage I've dumped everything in one single directory because I thought the error was caused by a path problem, but no dice. Can anyone help? thanks. From skip at pobox.com Tue Sep 23 12:45:54 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 23 12:46:04 2003 Subject: [Spambayes] error on attempting to export data In-Reply-To: References: Message-ID: <16240.30914.625898.330585@montanaro.dyndns.org> mark> I'm trying to use: mark> C:\spambayes>sb_dbexpimp.py -e -d default_bayes_database.db -f mark> mybayes.db.export mark> and the error i get is: mark> Traceback (most recent call last): mark> File "C:\spambayes\sb_dbexpimp.py", line 100, in ? mark> import spambayes.storage mark> ImportError: No module named spambayes.storage mark> I've dumped everything in one single directory because I thought mark> the error was caused by a path problem, but no dice. Can anyone mark> help? thanks. Mark, How did you install SpamBayes? From the top-level directory you would normally execute python setup.py install assuming the directory containing python.exe is on your PATH. The result would be the creation of a spambayes directory containing (among other things) storage.py in Python's site-packages directory. Later execution of python c:\spambayes\sb_dbexpimp.py ... should be able to find spambayes.storage. Skip From skip at pobox.com Tue Sep 23 12:50:34 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 23 12:50:43 2003 Subject: [Spambayes] sb_xmlrpcserver.py problems... In-Reply-To: <20030922220227.GA19503@thermal> References: <20030922220227.GA19503@thermal> Message-ID: <16240.31194.562869.176739@montanaro.dyndns.org> Jeremy> Now I'm looking to setup SpamBayes on a larger scale (for an Jeremy> ISP). I see that the combination of sb_client.py and Jeremy> sb_xmlrpcserver.py are the "spamc/spamd" of SpamBayes. Jeremy> However, the docs are sb_xmlrpcserver.py are pretty sparse. True. Jeremy> I've looked around the CVS tree and searched on-line but haven't Jeremy> found much help yet. When I try running a server: Jeremy> /usr/local/bin/sb_xmlrpcserver.py -d 127.0.0.1:65000 Jeremy> I get: Jeremy> Traceback (most recent call last): Jeremy> File "/usr/local/bin/sb_xmlrpcserver.py", line 42, in ? Jeremy> DEFAULTDB = hammie.DEFAULTDB Jeremy> AttributeError: 'module' object has no attribute 'DEFAULTDB' It appears it's been quite awhile since anyone actually used sb_xmlrpcserver.py. I took a stab at updating it to reflect current reality. The result is attached as a context diff against current CVS. It seems to work (that is, it doesn't generate a traceback) for me, however no headers are inserted in the message returned to the client. Perhaps someone else will spot that problem. Skip -------------- next part -------------- A non-text attachment was scrubbed... Name: sb_xmlrpcserver.diff Type: application/octet-stream Size: 5326 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030923/6e8c8a9b/sb_xmlrpcserver.obj From dbenson at macromedia.com Tue Sep 23 13:51:20 2003 From: dbenson at macromedia.com (Doug Benson) Date: Tue Sep 23 13:51:26 2003 Subject: [Spambayes] Outlook toolbar stopped working Message-ID: <7F98B18A972B8C468D7AAA52B472441802D8D5BF@ex-richrdson-01> Seems like a silly problem, but my SpamBayes Outlook toolbar has stopped working--it no longer changes its text based on which folder I'm viewing, and the SpamBayes popup menu doesn't pop up. I've tried uninstalling (which doesn't remove the toolbar) and reinstalling, to no avail. Any suggestions? Doug Benson From sourceforge at rodland.no Tue Sep 23 14:20:08 2003 From: sourceforge at rodland.no (Fredrik Rodland) Date: Tue Sep 23 14:20:33 2003 Subject: [Spambayes] Outlook toolbar stopped working In-Reply-To: <7F98B18A972B8C468D7AAA52B472441802D8D5BF@ex-richrdson-01> Message-ID: > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Doug Benson > Sent: 23. september 2003 19:51 > To: 'spambayes@python.org' > Subject: [Spambayes] Outlook toolbar stopped working > > > Seems like a silly problem, but my SpamBayes Outlook toolbar has > stopped working--it no longer changes its text based on which > folder I'm viewing, and the SpamBayes popup menu doesn't pop up. > > I've tried uninstalling (which doesn't remove the toolbar) and > reinstalling, to no avail. > Any suggestions? read: http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambaye s/Outlook2000/docs/troubleshooting.html#NoToolbar In the latest plugin this should be fixed however... Fredrik -- Fredrik R?dland Technical Architect, Stocknet, Oslo, Norway Stocknet: http://www.stocknet.com phone: +47 23 28 40 17 Private: http://rodland.no phone: +47 99 21 98 17 From skip at pobox.com Tue Sep 23 14:40:40 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 23 14:40:52 2003 Subject: [Spambayes] error on attempting to export data In-Reply-To: References: <16240.30914.625898.330585@montanaro.dyndns.org> Message-ID: <16240.37800.499512.408794@montanaro.dyndns.org> mark> I'm using python 2.3, and I downloaded the spambayes 1.0a6.zip mark> file, and unzipped the stuff to the c:\python23 subdirectory. mark> When I run what you suggested, I get: mark> c:\python23\python setup.py install mark> Traceback (most recent call last): mark> File "setup.py", line 27, in ? mark> from spambayes import __version__ mark> ImportError: No module named spambayes Hmmm... I would unzip the spambayes stuff into it's own folder. I don't think there should be a problem unzipping it int c:\Python23, but if you later want to uninstall Python23, it's now got cruft the uninstaller doesn't know what to do with. I just had occasion to do precisely this today on my Windows machine. I downloaded and ran the Python 2.3 installer, then downloaded and expanded the spambyaes-1.0a6.zip file. From a CMD prompt I executed c:\Python23\python.exe setup.py install and it installed as advertised. c:\Python23\Lib\site-packages has a spambayes folder which contains (among other things) a storage.py file. I can import spambayes and spambayes.__version__ reports 1.0a6. You should take this back to the spambayes list (I've cc'd it). You've exhausted my feeble Windows knowledge. Something's definitely not right though. Maybe you trashed some files with your mixed spambayes/python directory tree. Skip mark> - Mark mark> ----------------------------------------------- mark> Mark Fitzgerald mark> Workplace Automation Technology mark> Key Services Corporation mark> (518) 391-1110 mark> skip@pobox.com mark> 09/23/2003 12:45 PM mark> Please respond to mark> skip@pobox.com mark> To mark> mark_fitzgerald@keybank.com mark> cc mark> spambayes@python.org mark> Subject mark> Re: [Spambayes] error on attempting to export data mark> I'm trying to use: mark> C:\spambayes>sb_dbexpimp.py -e -d default_bayes_database.db -f mark> mybayes.db.export mark> and the error i get is: mark> Traceback (most recent call last): mark> File "C:\spambayes\sb_dbexpimp.py", line 100, in ? mark> import spambayes.storage mark> ImportError: No module named spambayes.storage mark> I've dumped everything in one single directory because I thought mark> the error was caused by a path problem, but no dice. Can anyone mark> help? thanks. mark> Mark, mark> How did you install SpamBayes? From the top-level directory you would mark> normally execute mark> python setup.py install mark> assuming the directory containing python.exe is on your PATH. The result mark> would be the creation of a spambayes directory containing (among other mark> things) storage.py in Python's site-packages directory. Later execution mark> of mark> python c:\spambayes\sb_dbexpimp.py ... mark> should be able to find spambayes.storage. mark> Skip From mark_fitzgerald at keybank.com Tue Sep 23 15:09:29 2003 From: mark_fitzgerald at keybank.com (mark_fitzgerald@keybank.com) Date: Tue Sep 23 15:09:39 2003 Subject: [Spambayes] error on attempting to export data In-Reply-To: <16240.37800.499512.408794@montanaro.dyndns.org> Message-ID: I uninstalled everything. Clean slate. Even tried the tarball version. Anyway, I installed the python2.3 windows version. Then I unzipped the spambayes into it's own directory. I then went to that directory, and typed exactly what you did. I get: C:\spambayes>c:\python23\python.exe setup.py install Traceback (most recent call last): File "setup.py", line 27, in ? from spambayes import __version__ ImportError: No module named spambayes Anyone with this already installed that can export my spam database to a delimited text file if I send it to them? - Mark skip@pobox.com 09/23/2003 02:40 PM Please respond to skip@pobox.com To mark_fitzgerald@keybank.com cc spambayes@python.org Subject Re: [Spambayes] error on attempting to export data mark> I'm using python 2.3, and I downloaded the spambayes 1.0a6.zip mark> file, and unzipped the stuff to the c:\python23 subdirectory. mark> When I run what you suggested, I get: mark> c:\python23\python setup.py install mark> Traceback (most recent call last): mark> File "setup.py", line 27, in ? mark> from spambayes import __version__ mark> ImportError: No module named spambayes Hmmm... I would unzip the spambayes stuff into it's own folder. I don't think there should be a problem unzipping it int c:\Python23, but if you later want to uninstall Python23, it's now got cruft the uninstaller doesn't know what to do with. I just had occasion to do precisely this today on my Windows machine. I downloaded and ran the Python 2.3 installer, then downloaded and expanded the spambyaes-1.0a6.zip file. From a CMD prompt I executed c:\Python23\python.exe setup.py install and it installed as advertised. c:\Python23\Lib\site-packages has a spambayes folder which contains (among other things) a storage.py file. I can import spambayes and spambayes.__version__ reports 1.0a6. You should take this back to the spambayes list (I've cc'd it). You've exhausted my feeble Windows knowledge. Something's definitely not right though. Maybe you trashed some files with your mixed spambayes/python directory tree. Skip mark> - Mark mark> ----------------------------------------------- mark> Mark Fitzgerald mark> Workplace Automation Technology mark> Key Services Corporation mark> (518) 391-1110 mark> skip@pobox.com mark> 09/23/2003 12:45 PM mark> Please respond to mark> skip@pobox.com mark> To mark> mark_fitzgerald@keybank.com mark> cc mark> spambayes@python.org mark> Subject mark> Re: [Spambayes] error on attempting to export data mark> I'm trying to use: mark> C:\spambayes>sb_dbexpimp.py -e -d default_bayes_database.db -f mark> mybayes.db.export mark> and the error i get is: mark> Traceback (most recent call last): mark> File "C:\spambayes\sb_dbexpimp.py", line 100, in ? mark> import spambayes.storage mark> ImportError: No module named spambayes.storage mark> I've dumped everything in one single directory because I thought mark> the error was caused by a path problem, but no dice. Can anyone mark> help? thanks. mark> Mark, mark> How did you install SpamBayes? From the top-level directory you would mark> normally execute mark> python setup.py install mark> assuming the directory containing python.exe is on your PATH. The result mark> would be the creation of a spambayes directory containing (among other mark> things) storage.py in Python's site-packages directory. Later execution mark> of mark> python c:\spambayes\sb_dbexpimp.py ... mark> should be able to find spambayes.storage. mark> Skip From bazil749 at hotmail.com Tue Sep 23 15:40:40 2003 From: bazil749 at hotmail.com (Baz) Date: Tue Sep 23 15:39:47 2003 Subject: [Spambayes] POP3 Configuration Message-ID: I am having problems fully configuring the POP3 service under windows XP First off, do I need to run the command line everytime I want to start the service? And once started is there a way to set up the tray without having the command prompt window stay on the screen? Thank you for all your help. From T.A.Meyer at massey.ac.nz Tue Sep 23 18:48:29 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 23 18:48:50 2003 Subject: [Spambayes] pop3proxy_tray.pyw Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599C62@its-xchg4.massey.ac.nz> > If I do save and shutdown from the webinterface, I have to exit > pop3proxy_tray.py(w) to restart it. Isn't that what you would expect from "shutdown"? =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 23 18:53:32 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 23 18:53:42 2003 Subject: [Spambayes] Slipup wrongly trained mail Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599C6D@its-xchg4.massey.ac.nz> > What do I do if I have wrongly classified a mail as spam, > when I should have classified it as ham? I just sent the mail > to the smtp proxy once more as ham this time, would this fix > the training or do I have to do anything else fix this? If you are using the smptproxy in 'lookup message in cache' mode, then yes, it will untrain the message first, and then re-train it. The other alternative is to find the message id (it's in the X-Spambayes-ID header) and paste this into the 'find message' query on the front page of the web interface. You'll be presented with a review page with just that message. If you train it, it will untrain first as needed. Note that all of this assumes that the message is still in one of the caches, and that the messageinfo database hasn't been deleted between training. =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 23 18:57:25 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 23 18:57:48 2003 Subject: [Spambayes] SpamBayes config error Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599C77@its-xchg4.massey.ac.nz> > So my question is where does spambayes save the location of > the config file when I run it through sb_server.py? If there are any files listed in the bayescustomize envar, then it will use the last (first?) one of those. If not, if there is a file called bayescustomize.ini in the current working directory, it will use that. If not, if there is a file called .spambayesrc in the home directory, it will use that. If not, and you are using 1.0a6, and windows and have the win32 extensions installed, it will attempt to create a config file in the Application Data/spambayes/proxy directory. If it still can't find one, then it defaults to a file called bayescustomize.ini in the current working directory, and if you save the config file, it will save there. =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 23 18:59:23 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 23 18:59:40 2003 Subject: [Spambayes] POP3 Configuration Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599C7C@its-xchg4.massey.ac.nz> > First off, do I need to run the command line everytime I want > to start the service? No. It's like any other windows service. You can set it to start on startup via the Services control panel (under Administrative Tools, probably) if you want to. You can also control it via the tray app. > And once started is there a way to set > up the tray without having the command prompt window stay on > the screen? Change the extension of the .py file to .pyw, or start it with pythonw.exe instead of python.exe. =Tony Meyer From adam.walker at rbwconsulting.com Tue Sep 23 19:00:14 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Tue Sep 23 19:00:26 2003 Subject: [Spambayes] pop3proxy_tray.pyw In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1303599C62@its-xchg4.massey.ac.nz> Message-ID: <20030923230022.A207E86310@plunder.dreamhost.com> I think the OP is asking why the tray app doesn't update its status when shutting down via the web interface. The answer is because I didn't think about that. --Adam > -----Original Message----- > From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] > On Behalf Of Meyer, Tony > Sent: Tuesday, September 23, 2003 6:48 PM > To: Syver Enstad; spambayes@python.org > Subject: RE: [Spambayes] pop3proxy_tray.pyw > > > If I do save and shutdown from the webinterface, I have to exit > > pop3proxy_tray.py(w) to restart it. > > Isn't that what you would expect from "shutdown"? > > =Tony Meyer > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From syver-en at online.no Tue Sep 23 19:36:42 2003 From: syver-en at online.no (Syver Enstad) Date: Tue Sep 23 19:35:39 2003 Subject: [Spambayes] POP3 Configuration In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1303599C7C@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1303599C7C@its-xchg4.massey.ac.nz> Message-ID: "Meyer, Tony" writes: > > First off, do I need to run the command line everytime I want > > to start the service? You probably don't want to use the service at all. Just use the tray app and put it in your start-menu startup folder. -- Vennlig hilsen Syver Enstad From syver-en at online.no Tue Sep 23 19:40:05 2003 From: syver-en at online.no (Syver Enstad) Date: Tue Sep 23 19:39:01 2003 Subject: [Spambayes] pop3proxy_tray.pyw In-Reply-To: <20030923230022.A207E86310@plunder.dreamhost.com> References: <20030923230022.A207E86310@plunder.dreamhost.com> Message-ID: "Adam Walker" writes: > I think the OP is asking why the tray app doesn't update its status > when shutting down via the web interface. Correct. >The answer is because I > didn't think about that. Understandable, I am just to posting these issues so that one are aware of them. I usually do that when I discover a spectacularely cool application. -- Vennlig hilsen Syver Enstad From wtrenker at shaw.ca Tue Sep 23 12:37:45 2003 From: wtrenker at shaw.ca (William Trenker) Date: Tue Sep 23 19:41:50 2003 Subject: [Spambayes] Deleting spam from the server using only the headers Message-ID: <20030923163745.38a5ac45.wtrenker@shaw.ca> The request to have tools to automatically delete Spam on the server has come up often. Implied is the idea to use the POP3 TOP command to download just the message headers, determine spam-iness by analyzing the headers only, and then delete the spam on the server before it ever reaches the email client. About as often as this seemingly reasonable request surfaces, the response is to point out that it is difficult to use a statistical (Bayesian) technique on just the message headers. For one thing, email headers don't consist of enough words from a wide enough distribution to provide a meaningful sample for calculating a reliable spam probability. So a technique like Spambayes doesn't seem to be possible for this 'headers only' approach to killing spam. Well, I just noticed that over on python-list@python.org, in a message titled _Re: pop3 email header classifier?_, David Mertz has pointed out some research he did a year ago on applying statistical methods for detecting spam in email headers. The innovation he implemented was to break the headers up into trigrams (sequences of three characters) and statistically look for suspicious patterns among the trigrams in the headers. David's article on IBM developerWorks (http://www-106.ibm.com/developerworks/linux/library/l-spamf.html) provides more details and a link to his prototype Python code. With all the interest in virus-generated spam these days, I thought David had an interesting concept. Does this look like something that could be adapted to Spambayes? Regards, Bill From syver-en at online.no Tue Sep 23 19:45:29 2003 From: syver-en at online.no (Syver Enstad) Date: Tue Sep 23 19:44:25 2003 Subject: [Spambayes] Slipup wrongly trained mail In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1303599C6D@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1303599C6D@its-xchg4.massey.ac.nz> Message-ID: "Meyer, Tony" writes: > If you are using the smptproxy in 'lookup message in cache' mode, > then yes, it will untrain the message first, and then re-train it. I don't have that option enabled. > The other alternative is to find the message id (it's in the > X-Spambayes-ID header) and paste this into the 'find message' query > on the front page of the web interface. You'll be presented with a > review page with just that message. If you train it, it will > untrain first as needed. Okay, so that was what the find form was about. > Note that all of this assumes that the message is still in one of the > caches, and that the messageinfo database hasn't been deleted between > training. I can't see how it could be in the cache as the cache disappears every time I train. I've seen some places in the docs that it is possible to retrain messages. I can't see how to do that when message is removed from the cache the first time it is read. There is probably somethings I don't understand here... -- Vennlig hilsen Syver Enstad From T.A.Meyer at massey.ac.nz Tue Sep 23 19:47:21 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 23 19:47:38 2003 Subject: [Spambayes] Deleting spam from the server using only the headers Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599CF8@its-xchg4.massey.ac.nz> > David Mertz has pointed out some research he did a year ago on > applying statistical methods for detecting spam in email > headers. The innovation he implemented was to break the > headers up into trigrams (sequences of three characters) and > statistically look for suspicious patterns among the trigrams > in the headers. > With all the interest in virus-generated spam these days, I thought > David had an interesting concept. Does this look like something that > could be adapted to Spambayes? It would be a piece of cake. Just modify Tokenizer() in tokenizer.py to generate tri(character)grams instead of split-on-whitespace, comment out tokenize_body(), and run some tests (see the testtools directory). Post the results to the list (or spambayes-dev), and there you go! Note that there has been some header/body only testing done, and the results were reasonable. Googling through the archives should pop the relevant posts up. In particular, header-only classification looked like it would be good enough to temporarily leave spam on the server (while you were away, or when connecting via a mobile, or something like that). (This is with s-o-w, not trigrams). =Tony Meyer From mertz at gnosis.cx Tue Sep 23 21:00:18 2003 From: mertz at gnosis.cx (David Mertz, Ph.D.) Date: Tue Sep 23 21:10:58 2003 Subject: [Spambayes] Re: Deleting spam from the server using only the headers In-Reply-To: <20030923163745.38a5ac45.wtrenker@shaw.ca> Message-ID: William Trenker wrote: |Well, I just noticed that over on python-list@python.org, in a message |titled _Re: pop3 email header classifier?_, David Mertz has pointed out |some research he did a year ago on applying statistical methods for |detecting spam in email headers. The innovation he implemented was to |break the headers up into trigrams (sequences of three characters) and |statistically look for suspicious patterns among the trigrams in the |headers. David's article on IBM developerWorks |(http://www-106.ibm.com/developerworks/linux/library/l-spamf.html) |rovides more details and a link to his prototype Python code. I am indeed happy with my approach. And even fairly confident that the trigram model will do better for headers-only than will the word model. That said, a year ago when I wrote the article mentioned, Spambayes was in its infancy, and I did not test it. I have not followed the work with Spambayes closely--but I have followed it enough to know that Tim Peters and others have done quite a lot of work exploring variations of statistical models. I am quite certain Spambayes is quite a lot better than the naive Bayesian stuff I do (with simplified weighting rules yet). And I even seem to recall reading that Spambayes had some N-gram options in there, or at least experimented with that. Still, it would be nice to have a good, friendly, system to do basically what my homebrew 'spamfilter.py' does. That is, run periodically, check only headers, and delete obviously spammy messages without ever downloading. Yours, David... -- mertz@ | The specter of free information is haunting the `Net! All the gnosis | powers of IP- and crypto-tyranny have entered into an unholy .cx | alliance...ideas have nothing to lose but their chains. Unite | against "intellectual property" and anti-privacy regimes! ------------------------------------------------------------------------- From tim.one at comcast.net Tue Sep 23 22:23:26 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 23 22:23:29 2003 Subject: [Spambayes] Deleting spam from the server using only the headers In-Reply-To: <20030923163745.38a5ac45.wtrenker@shaw.ca> Message-ID: [William Trenker] > The request to have tools to automatically delete Spam on the > server has come up often. Implied is the idea to use the POP3 TOP > command to download just the message headers, determine spam-iness > by analyzing the headers only, and then delete the spam on the > server before it ever reaches the email client. > > About as often as this seemingly reasonable request surfaces, the > response is to point out that it is difficult to use a statistical > (Bayesian) technique on just the message headers. For one thing, > email headers don't consist of enough words from a wide enough > distribution to provide a meaningful sample for calculating a > reliable spam probability. According to who? BTW, I don't know of any way to calculate a reliable spam probability, and this project doesn't even try. We compute "a score", and don't even claim that it's monotonic with spam probability. If someone does claim to compute a reliable spam probability, and you set your spam cutoff to, say, 0.99, that means about 1 of each 100 things it calls spam will actually be false positives -- or that its claim to compute reliable probabilities is wrong. > So a technique like Spambayes doesn't seem to be possible for this > 'headers only' approach to killing spam. We tested this before. A headers-only classifier worked fine. So did a body-only classifier. Of course looking at both does best. Heck, at one time I even tested a classifier that looked at absolutely nothing except the Subject line. That did enormously better than luck, and probably better than 99.99% of user-defined Outlook spam-catching rulesets, but was significantly poorer than looking at the other header lines too. > Well, I just noticed that over on python-list@python.org, in a > message titled _Re: pop3 email header classifier?_, David Mertz has > pointed out some research he did a year ago on applying statistical > methods for detecting spam in email headers. The innovation he > implemented was to break the headers up into trigrams (sequences of > three characters) and statistically look for suspicious patterns > among the trigrams in the headers. David's article on IBM > developerWorks > (http://www-106.ibm.com/developerworks/linux/library/l-spamf.html) > provides more details and a link to his prototype Python code. We saw significantly worse results with character n-grams (for n in 2 thru 5) than with other tokenization strategies. See comments in tokenizer.py for discussions of specifics. Maybe surprisingly, spambayes uses different tokenization strategies for different kinds of header lines, and that helped too. > With all the interest in virus-generated spam these days, I thought > David had an interesting concept. Does this look like something > that could be adapted to Spambayes? If you're a Python programmer and this interests you, it should be quite easy to try it. From dave at boost-consulting.com Tue Sep 23 23:05:41 2003 From: dave at boost-consulting.com (David Abrahams) Date: Tue Sep 23 23:05:47 2003 Subject: [Spambayes] Re: Deleting spam from the server using only the headers References: <20030923163745.38a5ac45.wtrenker@shaw.ca> Message-ID: William Trenker writes: > The request to have tools to automatically delete Spam on the server > has come up often. Implied is the idea to use the POP3 TOP command to > download just the message headers, determine spam-iness by analyzing > the headers only, and then delete the spam on the server before it > ever reaches the email client. Well, this is probably not what you had in mind, but I'm doing server-side filtering with SpamBayes: http://www.boost-consulting.com/writing/server-side.html -- Dave Abrahams Boost Consulting www.boost-consulting.com From Peter.Bailey at NBNZ.CO.NZ Tue Sep 23 23:06:29 2003 From: Peter.Bailey at NBNZ.CO.NZ (Bailey, Peter) Date: Tue Sep 23 23:07:04 2003 Subject: [Spambayes] will it work with outlook 2003? Message-ID: <5FA85DB5FB07D7118CF50002A5754C090107F101@nbnzhexch2.nbnz.co.nz> Cheers Peter Peter Bailey Certa Cito System Engineer LAN The National Bank of New Zealand Ltd. +64 4 802 1977 or IN. 247977 Standard Disclaimer http://winn.com/bs/disclaimer.html This communication is confidential and may contain privileged material. If you are not the intended recipient you must not use, disclose, copy or retain it. If you have received it in error please immediately notify me by return email and delete the emails. Thank you. From T.A.Meyer at massey.ac.nz Tue Sep 23 23:10:57 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 23 23:11:20 2003 Subject: [Spambayes] will it work with outlook 2003? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599E63@its-xchg4.massey.ac.nz> Yes, the Outlook plug-in should work with Outlook 2003, as long as you have the latest technical refreshes installed. (Note that this is going by user-feedback. I'm pretty sure that none of the developers (unless Adam does) has a copy of Outlook 2k3). =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 23 23:13:56 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 23 23:14:12 2003 Subject: [Spambayes] Slipup wrongly trained mail Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599E69@its-xchg4.massey.ac.nz> > > If you are using the smptproxy in 'lookup message in cache' > > mode, then yes, it will untrain the message first, and then re-train it. > > I don't have that option enabled. Then you can't undo any smtp proxy training. > > Note that all of this assumes that the message is still in > > one of the > > caches, and that the messageinfo database hasn't been > > deleted between training. > > I can't see how it could be in the cache as the cache > disappears every time I train. Not unless you've changed the code/defaults, it doesn't. Messages move from the unknown-cache to the ham/spam caches (and so are not displayed in the review), but aren't deleted (until they expire). > I've seen some places in the docs that it is possible to retrain > messages. I can't see how to do that when message is removed from > the cache the first time it is read. It isn't ;) It's easy enough to show that this is the case. Just get the id for a message that you've already trained (and arrived within your expiry period) and use the 'find message' query on the front page. Voila, it will be there. =Tony Meyer From T.A.Meyer at massey.ac.nz Tue Sep 23 23:16:25 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 23 23:19:37 2003 Subject: [Spambayes] POP3 Configuration Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599E6C@its-xchg4.massey.ac.nz> > How do I clean out everything, uninstall, and reinstall the program? Run "pop3proxy_service -remove". Remove the "spambayes" directory from your Python Lib/site-packages directory. Remove all the spambayes scripts (sb_* if it's 1.0a6) from your Python scripts directory. Remove your cache directories, the two databases and the configuration file, from wherever you have elected to store those. You've then wiped all trace of spambayes, and obviously the reinstall is no different to an install. =Tony Meyer From ta-meyer at ihug.co.nz Wed Sep 24 00:00:09 2003 From: ta-meyer at ihug.co.nz (Tony Meyer) Date: Wed Sep 24 00:00:20 2003 Subject: [Spambayes] Errors in 1.0a6 after changing options via the web interface (using dbm) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212AF35@its-xchg4.massey.ac.nz> As some people have already discovered, if you save the options/advanced options in the web interface in 1.0a6, you are then no longer able to train the database. What happens is that the database is closed and re-opened, but the ui keeps trying to use the closed db. This is now fixed in cvs, so will be fixed for the next release. In the meantime, anyone experiencing this has two options (apart from changing the code): 1. If you simply restart sb_server (or the service), then the problem goes away until you change the options again. This is the easiest workaround if you don't change the options often. 2. AFAIK pickles aren't effected by this problem. So you can export your database to a pickle and use that instead. The FAQ has details about how to do this. =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 24 01:35:12 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 01:36:45 2003 Subject: [Spambayes] (no subject) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599F16@its-xchg4.massey.ac.nz> > Thanks for answering. I use outlook express so I don't think > 008.1 would help me. Sorry, I should have asked rather than assumed. The good news in that case, though, is that progress is being made on making it easier for non-Outlook users to use spambayes. In particular, there will soon(ish) be a 'normal' installer, which will be able to automatically set both spambayes and various mail clients (including OE) up, and will offer a little tray application to make it easier to interface with spambayes. > I don't even know if I need to do something to make spambayes > coordinate with python or what. As long as you run "setup.py install", then you shouldn't need to do anything else. > Anyways, I guess my personal input to the project would be to > suggest someone go through the faqs (at least, if not the > other instructions & read-me's too) and re-interpret them > through the eyes of their non-technoid grandparents or > something like that. Alternately, adding a 'spambayes for > dummies' page, with step-by-step instructions written for > third graders, might be faster. The README.TXT that comes with the source install is meant to be along those lines, although perhaps at a slightly advanced step. The documentation that comes with the installer (as above) should meet this criteria, though. > The other option would be a > clickable glossary - make every techie word a hyperlink to a > small window that displays the definition. There's a (very small) glossary on the website already, but it could certainly be expanded. One problem with this is that it's hard for the developers to know exactly what 'normal' people would consider a technical word. Are there any words in particular that you recall seeing that you wished were defined? Having words in the glossary linked from the FAQ (and possibly other places) is a good idea. I'll try and find time to do this at some point. Thanks again for the feedback! =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 24 01:39:05 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 01:39:21 2003 Subject: [Spambayes] Saving SB Filters Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599F1B@its-xchg4.massey.ac.nz> > Rather than save a large sampling of spam (especially those containing > virii) to rebuild my SB in case of system failure... is it > possible to backup/export the rules/filters that SB runs off of? > > Could not find reference in FAQ or recent archives. You're after FAQ 3.6: Or maybe 3.7 (which is really the same thing): (This assumes the Outlook plug-in; if it's one of the other apps, then basically the same thing applies, but the data might be in a different place). =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 24 01:46:23 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 01:46:39 2003 Subject: [Spambayes] 500 Server error training in the web interface. Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599F1F@its-xchg4.massey.ac.nz> > After using the web interface for more or less time this message shows > up when pressing the train button. If I refresh the message is not > there anymore. When the error first has occurred it seems to repeat > itself until I restart the SpamBayes server. As you've hopefully seen via my earlier post/the tracker, this bug is now fixed in cvs. > Another problem I encountered is that my mail program will > hang if I try to post a ham message to > spambayes_ham@localhost with the id option turned on and the > ham message doesn't have an id in its headers (because it was > received before I installed Spambayes). It shouldn't do (it doesn't for me). Do you get the "Could not find id" error message in the console? Do you get a traceback? What mail program are you using on what platform? =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 24 02:15:28 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 02:15:48 2003 Subject: [Spambayes] RE: [spambayes-dev] Server side instructions for qmail Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599F2D@its-xchg4.massey.ac.nz> > I'd like to submit the following "Qmail notes": [...] Thanks! Check out: (Is it just me, or is spambayes.org down?) =Tony Meyer From DarkEmperor at gmx.net Wed Sep 24 02:18:02 2003 From: DarkEmperor at gmx.net (Emperor) Date: Wed Sep 24 02:18:14 2003 Subject: [Spambayes] SpamBayes config error In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1303599C77@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1303599C77@its-xchg4.massey.ac.nz> Message-ID: <1064384282.3903.3.camel@nils> now I have reached the point where spambayes points to my correct location in both config pages. But when I run sb_server.py, the pop3 and smtp proxy aren't started and the wrong database is used (mails all unsure... same messages are normally filed as spam). Then I have to go to the config and resave it and then the servers are started. Any solutions?? Here is my rc file: [pop3proxy] listen_ports:1110 remote_servers:pop.gmx.de [smtpproxy] listen_ports:1025 remote_servers:mail.gmx.de [Storage] messageinfo_storage_file:/home/nils/spambayes.messageinfo.db persistent_storage_file:/home/nils/hammie.db Am Mit, 2003-09-24 um 00.57 schrieb Meyer, Tony: > If there are any files listed in the bayescustomize envar, then it will > use the last (first?) one of those. If not, if there is a file called > bayescustomize.ini in the current working directory, it will use that. > If not, if there is a file called .spambayesrc in the home directory, it > will use that. If not, and you are using 1.0a6, and windows and have > the win32 extensions installed, it will attempt to create a config file > in the Application Data/spambayes/proxy directory. > > If it still can't find one, then it defaults to a file called > bayescustomize.ini in the current working directory, and if you save the > config file, it will save there. > > =Tony Meyer From alvar at t-online.de Wed Sep 24 02:27:10 2003 From: alvar at t-online.de (Alvar Wenzel) Date: Wed Sep 24 02:27:28 2003 Subject: [Spambayes] Outlook does not display Inbox after using "Spam socre" field in table view Message-ID: <1064384516.3f713804b6c67@webmail.t-online.de> Hello, I use: Outlook 2000 (German version), Windows 98, SpamBayes 0081 Outlook Plugin I wanted to use the Spam Score Field in Outlook 2000 as described in http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambayes/Outlook2000/about.html under "Viewing and Using the Spam Score Field" and therefore followed all steps described there. This worked in some folders, but not in my Inbox folder (which is called "Posteingang" in the German version of Outlook). There was an error and outlook told me to finish outlook and to restart it again. But after restarting I could not change to my Inbox any more, instead of a list of mails being displayed there was only the message: "This folder cannot be displayed" (in German). Restarting Outlook, uninstalling and reinstalling SpamBayes did not help. It seems as if this "Spam Score field" is the reason of the problem - but how can I remove it from my Inbox, when I cannot change to my Inbox? Does anyone have an idea what can be done? Any help would be welcome! Thanks Alvar From T.A.Meyer at massey.ac.nz Wed Sep 24 02:36:44 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 02:37:18 2003 Subject: [Spambayes] sb_xmlrpcserver.py problems... Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599F38@its-xchg4.massey.ac.nz> [Skip] > It appears it's been quite awhile since anyone actually used > sb_xmlrpcserver.py. I took a stab at updating it to reflect > current reality. The result is attached as a context diff > against current CVS. It seems to work (that is, it doesn't > generate a traceback) for me, however no headers are inserted > in the message returned to the client. Perhaps someone else > will spot that problem. The headers are inserted for me if I apply your diff, so I'm going to check it in. Jeremy: does it work for you with Skip's diff (note that the docstring still won't print properly; I'll fix that, too). =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 24 02:58:17 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 02:59:04 2003 Subject: [Spambayes] reporet on trying 1.0a6 on SuSE Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599F3C@its-xchg4.massey.ac.nz> > Ah. As I go on to say, I get some problems in teh scripts too. [...] I wasn't thinking clearly enough. The compat code obviously has to be in all the module that use True/False, even if it has been declared in another module that gets imported first. I've checked in corrections for setup.py, sb_filter, sb_server and Options.py. (I'm guessing that it's a while since anyone has actually used spambayes with 2.2) > I'm still intent on getting it working on my system, so I'll > hopefully have some more reports shortly. Greatly appreciated. [from the second message] > "sb_filter.py -n -d .sb_hammie.db" responds > "Created new database in /home/ian/.hammiedb". > Indeed it has, so it seems -n ignores the -d. No matter. The order of the parameters is important here. If you did "sb_filter.py -d .sb_hammie.db -n" you should get the right name used. This is rather unintuitive, so I'll check in a change for this for 1.1, but leave it alone (in case people are used to it) for 1.0b1. [web interface error] > TypeError: object does not support item assignment This is a bug in 1.0a6 :(. I've checked in a fix for it (earlier this afternoon). Basically the database gets closed every time you save the options, so you'll need to restart sb_server after doing so (or use a pickle). This is hitting quite a few people, not surprisingly... =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 24 03:05:07 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 03:05:20 2003 Subject: [Spambayes] error on attempting to export data Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599F3E@its-xchg4.massey.ac.nz> > Anyway, I installed the python2.3 windows version. Then I > unzipped the spambayes into it's own directory. I then went > to that directory, and typed exactly what you did. I get: [...] > ImportError: No module named spambayes You know what? I think this is a bug :/ I'll confirm with people that know more than me, but it would explain both your trouble, and the time I saw this before with someone else. > Anyone with this already installed that can export my spam > database to a delimited text file if I send it to them? I can do that. Just email me the database off-list and I'll email you a flat version back and then delete (my local copy of) your database. =Tony Meyer From syver-en at online.no Wed Sep 24 03:22:13 2003 From: syver-en at online.no (Syver Enstad) Date: Wed Sep 24 03:21:19 2003 Subject: [Spambayes] 500 Server error training in the web interface. In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1303599F1F@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1303599F1F@its-xchg4.massey.ac.nz> Message-ID: "Meyer, Tony" writes: > As you've hopefully seen via my earlier post/the tracker, this bug is > now fixed in cvs. Yes, I saw it. Good to know it is fixed, it is not a critical problem after I knew how to avoid it, but it made me less confident of the program at the start. > > Another problem I encountered is that my mail program will > > hang if I try to post a ham message to > > spambayes_ham@localhost with the id option turned on and the > > ham message doesn't have an id in its headers (because it was > > received before I installed Spambayes). > > It shouldn't do (it doesn't for me). Do you get the "Could not find > id" error message in the console? Do you get a traceback? Okay, maybe this is spurious then. I don't have solid data to back my hypothesis, just that I tried several times to send the message to spambayes_ham@localhost and my mailer just hung trying to send it. When I turned off the id option it didn't hang. > What mail program are you using on what platform? I am using Gnus v5.9.0 on GNU Emacs 21.3 running on Win2k Professional. -- Vennlig hilsen Syver Enstad From adam.walker at rbwconsulting.com Wed Sep 24 04:31:00 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Wed Sep 24 04:31:17 2003 Subject: [Spambayes] will it work with outlook 2003? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1303599E63@its-xchg4.massey.ac.nz> Message-ID: <20030924083111.74DB6862EB@plunder.dreamhost.com> Yes, I use Outlook 2k3 beta 2 with SpamBayes daily. Works great given that you install the refreshes Tony mentioned. Sorry I didn't reply earlier, but I'm (was) on vacation ;) --Adam > -----Original Message----- > Yes, the Outlook plug-in should work with Outlook 2003, as long as you > have the latest technical refreshes installed. > > (Note that this is going by user-feedback. I'm pretty sure that none of > the developers (unless Adam does) has a copy of Outlook 2k3). > > =Tony Meyer From oded at agams.com Wed Sep 24 05:36:35 2003 From: oded at agams.com (Oded Agam) Date: Wed Sep 24 04:36:50 2003 Subject: [Spambayes] Addin does not load Message-ID: <000001c3827f$5cc11100$9fd5b3d4@IBMG40> Hi, I've used your software and it worked nicely, until a few days ago when I had to create a new profile on Outlook and move all my data from the old profile to the new one (because my outlook profile was corrupted). Now, I can't get the addin to load. I've reinstalled it several times (each time also uninstalling first from add/remove and once I even went to the registry and removed the keys for Spambayes in the current user and current machine) and also tried using regsvr32.exe spambayes_addin.dll from the SpamBayes directory. Nothing helps. I can see the toolbar for SpamBayes inside Outlook and I can do a manual filter, but it does not do an automatic filter and in the com add-in manager it does not appear. I checked and made sure that the enable filter in the manager is checked. I'm using Windows xp professional, Outlook 2002 sp-2, Spambayes binary version 0.7 August 9 2003. Attached are the recent log files. I appreciate your help with this, as I really like your software and when it works it works very nicely. Thanks, Oded Agam. -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 59 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030924/29542cb0/spambayes4.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 2885 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030924/29542cb0/spambayes3.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes2.log Type: application/octet-stream Size: 59 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030924/29542cb0/spambayes2.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 3205 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030924/29542cb0/spambayes1.obj From T.A.Meyer at massey.ac.nz Wed Sep 24 04:54:02 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 04:54:18 2003 Subject: [Spambayes] Addin does not load Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599F49@its-xchg4.massey.ac.nz> > I'm using Windows xp > professional, Outlook 2002 sp-2, Spambayes binary version 0.7 > August 9 2003. Attached are the recent log files. It would be worth trying: * Uninstalling 007. * Delete the spambayes toolbar that gets left behind. * In the Application Data/Spambayes/ directory, delete the [profile name].ini file. * Download and install 008.1 It's possible that the broken toolbar is the problem, or that there is a problem here that 008.1 fixes (it's worth having the latest version, anyway). This will test both those theories :) (The logs didn't look like there were any problems). =Tony Meyer From stefan at tspse.net Wed Sep 24 05:43:46 2003 From: stefan at tspse.net (=?iso-8859-1?Q?Stefan_Gr=F6nberg?=) Date: Wed Sep 24 05:44:09 2003 Subject: [Spambayes] Outlook "hangs" at start. Message-ID: <001501c38280$5d183c40$24cc96c1@vandread> Hi, since i installed SpamBayes spam filter, Outlook takes atleast 20 seconds to "re-awake" because it seams that spambayes hangs Outlook, any suggestions how to work around this "hang up"-thing? Im using the windows-version in windows XP HE and Outlook 2002 and the latest spambayes version. From T.A.Meyer at massey.ac.nz Wed Sep 24 05:55:15 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 05:55:28 2003 Subject: [Spambayes] Slipup wrongly trained mail Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1303599F4D@its-xchg4.massey.ac.nz> > Okay so when I use the find application I have access to both > the unknown, spam and ham caches. This makes sense. Once the first 1.1 release comes out (a while off yet), the find query is much improved, too, in that you can search for messages that match a string in the headers/body/both as well as via the id. > Thanks for enlightening me Tony. No worries; here to help. > I believe that what you have > explained very clearly to me would be great in the > documentation for Spambayes. I'll try to get around to adding something somewhere...a "help" section on the web interface is on the (general) to-do list... :) =Tony Meyer From Grant.Croker at ca.com Wed Sep 24 06:01:14 2003 From: Grant.Croker at ca.com (Croker, Grant) Date: Wed Sep 24 06:01:37 2003 Subject: [Spambayes] will it work with outlook 2003? Message-ID: <08237065FA027340B731E57099097854AC3DBD@ukslms22.ca.com> Just to let you know it does work with both the Beta (including refresh as tony mentions) and the final release. Office 2003 professional was posted to the MSDN Subscriber download site last week I think. It also works with both the pre 2003 and Outlook 2003 PST file formats. You may have to uninstall SB and reinstall to get this to work if you move from the old to the new formats - I did. Regards, grant -----Original Message----- From: spambayes-bounces+grant.croker=ca.com@python.org [mailto:spambayes-bounces+grant.croker=ca.com@python.org] On Behalf Of Meyer, Tony Sent: mi?rcoles, 24 de septiembre de 2003 4:11 To: Bailey, Peter; spambayes@python.org Subject: RE: [Spambayes] will it work with outlook 2003? Yes, the Outlook plug-in should work with Outlook 2003, as long as you have the latest technical refreshes installed. (Note that this is going by user-feedback. I'm pretty sure that none of the developers (unless Adam does) has a copy of Outlook 2k3). =Tony Meyer _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From mark_fitzgerald at keybank.com Wed Sep 24 06:10:17 2003 From: mark_fitzgerald at keybank.com (mark_fitzgerald@keybank.com) Date: Wed Sep 24 06:10:23 2003 Subject: [Spambayes] error on attempting to export data In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1303599F3E@its-xchg4.massey.ac.nz> Message-ID: I got it to work! I had to use the tarball, and unpack it to the root of C. Then it worked as advertised. It doesn't like it when you don't accept the default directories. T.A.Meyer@massey.ac.nz 09/24/2003 03:05 AM To mark_fitzgerald@keybank.com cc spambayes@python.org Subject RE: [Spambayes] error on attempting to export data > Anyway, I installed the python2.3 windows version. Then I > unzipped the spambayes into it's own directory. I then went > to that directory, and typed exactly what you did. I get: [...] > ImportError: No module named spambayes You know what? I think this is a bug :/ I'll confirm with people that know more than me, but it would explain both your trouble, and the time I saw this before with someone else. > Anyone with this already installed that can export my spam > database to a delimited text file if I send it to them? I can do that. Just email me the database off-list and I'll email you a flat version back and then delete (my local copy of) your database. =Tony Meyer From tob at sourcecode-inc.com Wed Sep 24 08:50:31 2003 From: tob at sourcecode-inc.com (Sourcecode) Date: Wed Sep 24 08:50:29 2003 Subject: [Spambayes] Outlook 2K issue Message-ID: Hi, (Yes, I have read the docs but did not see this particular issue described.) I am using Outlook 2K at SR1 then SP3. Spambayes installed without a hitch but training fails to see the Inbox (nor others underneath it) that I set aside for "ham". WHen training, I used the "Browse" button and have the main "Inbox" folder checked and the "subfolders" checkbox checked (which is where the other 'good messages' are) and training sees the SPAM folders messages but *never* sees any of the 'good' messages in any of those folders. Training wizard always reports that it has seen "nnn Spam and 0 good messages". Manual training operates as expected but I'd really like to use the good messages I have already to give Spambayes a boost on the "ham" side ! Any ideas? Thanks, tob Tony O'Brien www.sourcecode-inc.com From syver-en at online.no Wed Sep 24 10:17:28 2003 From: syver-en at online.no (Syver Enstad) Date: Wed Sep 24 10:24:19 2003 Subject: [Spambayes] Advice on when to stop training. Message-ID: I've used Spambayes for two days now, and it is beginning to get very good at doing what it should do. I am beginning to get a bit tired of having to train it all the time (plus I am worried that the cache will fill the disk). I would prefer to just send wrongly classified mails to spambayes_spam/ham@localhost and not have to deal with training all the mails that are delivered to me. What settings should I use to accomplish this? -- Vennlig hilsen Syver Enstad From Leigh.Klotz at pahv.xerox.com Wed Sep 24 13:33:16 2003 From: Leigh.Klotz at pahv.xerox.com (Klotz, Leigh) Date: Wed Sep 24 13:32:18 2003 Subject: [Spambayes] Upgraded from 002 to 0081 and spam is no longer moved Message-ID: <51B8ABCE456FD111899900805F6FD6EE146D85DF@mercury.ADOC.xerox.com> I was succesfully using SpamBayes-Outlook-Setup-002.exe and yesterday moved to SpamBayes-Outlook-Setup-0081.exe Now I see that spam is showing up with spam clues fine, but never being moved. It's not just faraway charsets that don't get moved -- it is all messages. Attached is the log file, and below is a snippet from Show Spam Clues on a recent message. Spam Clues: ?????~~~??? Spam Score: 100% (1) ... I have done Reset Configuration and re-selected my Inbox, Spam, and Unsure folders. I note that under Advanced, Background Filtering is not checked. Should it be? -------------- next part -------------- Migrating old pickle 'C:\Documents and Settings\klotz\Application Data\SpamBayes\default_configuration.pck' migrating section 'filter' Attempted to set [Filter] watch_folder_ids with invalid value [(u'0000000038A1BB1005E5101AA1BB08002B2A56C20000454D534D44422E444C4C00000000000000001B55FA20AA6611CD9BC800AA002FC45A0C0000004D455243555259002F6F3D5845524F582F6F753D555341504148562F636E3D526563697069656E74732F636E3D4C656967682E4B6C6F747A00', u'00000000EED3DFF501B6D21189B200805F6FD6EE010051B8ABCE456FD111899900805F6FD6EE00000202743B0000')] () migrating section 'filter_now' migrating section 'training' Attempted to set [Training] ham_folder_ids with invalid value [(u'0000000038A1BB1005E5101AA1BB08002B2A56C20000454D534D44422E444C4C00000000000000001B55FA20AA6611CD9BC800AA002FC45A0C0000004D455243555259002F6F3D5845524F582F6F753D555341504148562F636E3D526563697069656E74732F636E3D4C656967682E4B6C6F747A00', u'00000000EED3DFF501B6D21189B200805F6FD6EE010051B8ABCE456FD111899900805F6FD6EE00000202743B0000')] () migrating section 'field_score_name' Loaded bayes database from 'C:\Documents and Settings\klotz\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\klotz\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 366 spam and 517 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.0.2195 (Service Pack 4) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] ERROR enumerating a receive folder - (-2147221246, 'Invalid window handle', None, None) GetNextPage with current 0 IDD_WIZARD_WELCOME GetNextPage with current 3 IDD_WIZARD_FOLDERS_TRAIN About to train with [('0000000038A1BB1005E5101AA1BB08002B2A56C200006D737073742E646C6C00000000004E495441F9BFB80100AA0037D96E000000443A5C6B6C6F747A5C6F75746C6F6F6B5C6B6C6F747A2E70737400', '0000000040A7DAD9A301D311BC1B00C04F4DA91122940000')] Back button switching to page 3 Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 310, in OnCommand File "out1.pyz/dialogs.dlgcore", line 262, in ApplyHandlingOptionValueError File "out1.pyz/dialogs.processors", line 76, in OnCommand File "out1.pyz/dialogs.wizard_processors", line 104, in OnClicked File "out1.pyz/dialogs.wizard_processors", line 108, in switchToPage File "out1.pyz/dialogs.dlgcore", line 271, in SaveAllControls File "out1.pyz/dialogs.processors", line 35, in GetControl TypeError: an integer is required Still waiting for async process to finish... Checked 41 in folder Inbox - 41 new entries found. Still waiting for async process to finish... Cancelling wizard Warning: ClassifierData closed while Bayes database dirty ERROR: 'It appears there was an error loading your configuration\r\n\r\nPlease re-configure SpamBayes via the SpamBayes dropdown' SpamBayes: Watching for new messages in folder Spam Creating new SpamBayes toolbar to host our buttons ERROR enumerating a receive folder - (-2147221246, 'Invalid window handle', None, None) Ignoring OnCommand for 65535 GetNextPage with current 0 IDD_WIZARD_WELCOME Wizard Done! Saving wizard changes Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 271, in SaveAllControls pywintypes.error: (0, 'SetFocus', 'No error message is available') ERROR enumerating a receive folder - (-2147221246, 'Invalid window handle', None, None) Ignoring OnCommand for 65535 GetNextPage with current 0 IDD_WIZARD_WELCOME Wizard Done! Saving wizard changes Traceback (most recent call last): File "out1.pyz/dialogs.dlgcore", line 295, in OnNotify File "out1.pyz/dialogs.dialog_map", line 172, in OnNotify File "out1.pyz/dialogs.dialog_map", line 176, in switchToPage File "out1.pyz/dialogs.dlgcore", line 271, in SaveAllControls pywintypes.error: (0, 'SetFocus', 'No error message is available') Saving configuration -> C:\Documents and Settings\klotz\Application Data\SpamBayes\MS Exchange Settings.ini Saving configuration -> C:\Documents and Settings\klotz\Application Data\SpamBayes\MS Exchange Settings.ini Saving configuration -> C:\Documents and Settings\klotz\Application Data\SpamBayes\MS Exchange Settings.ini Error checking the latest version Traceback (most recent call last): File "out1.pyz/addin", line 530, in CheckLatestVersion File "out1.pyz/spambayes.Version", line 145, in fetch_latest_dict File "out1.pyz/urllib2", line 136, in urlopen File "out1.pyz/urllib2", line 333, in open File "out1.pyz/urllib2", line 313, in _call_chain File "out1.pyz/urllib2", line 849, in http_open File "out1.pyz/urllib2", line 834, in do_open URLError: ERROR: 'There was an error checking for the latest version\r\nFor specific details on the error, please see the SpamBayes log\r\n\r\nPlease check your internet connection, or try again later' Traceback (most recent call last): File "out1.pyz/addin", line 530, in CheckLatestVersion File "out1.pyz/spambayes.Version", line 145, in fetch_latest_dict File "out1.pyz/urllib2", line 136, in urlopen File "out1.pyz/urllib2", line 333, in open File "out1.pyz/urllib2", line 313, in _call_chain File "out1.pyz/urllib2", line 849, in http_open File "out1.pyz/urllib2", line 834, in do_open URLError: Saving configuration -> C:\Documents and Settings\klotz\Application Data\SpamBayes\MS Exchange Settings.ini ERROR: 'You must enable SpamBayes before you can delete as spam' Saving configuration -> C:\Documents and Settings\klotz\Application Data\SpamBayes\MS Exchange Settings.ini Moving and spam training message 'Proven Advice To Make Your Dreams Come True bpguidzroy lb nmxtp' - Training on message 'Proven Advice To Make Your Dreams Come True bpguidzroy lb nmxtp' - trained as spam Moving and spam training message 'bsudp 9Z7Q$#/`5% hqjmi' - Training on message 'bsudp 9Z7Q$#/`5% hqjmi' - trained as spam Moving and spam training message 'Now you can SPY on your Boyfriend or Husband with your Computer! k ftkr1g3' - Training on message 'Now you can SPY on your Boyfriend or Husband with your Computer! k ftkr1g3' - trained as spam Moving and spam training message 'University Diplomas UFegb vgtb' - Training on message 'University Diplomas UFegb vgtb' - trained as spam Moving and spam training message 'cartons,Learn the secrets of hypnosis today...' - Training on message 'cartons,Learn the secrets of hypnosis today...' - trained as spam Moving and spam training message 'Sneak Preview: Free Shipping, No Minimum!' - Training on message 'Sneak Preview: Free Shipping, No Minimum!' - trained as spam Moving and spam training message 'President of South Korea LOOK AT ME''' - Training on message 'President of South Korea LOOK AT ME''' - trained as spam Moving and spam training message 'liquidación de portátiles y pantallas TFT procedentes de quiebras' - Training on message 'liquidación de portátiles y pantallas TFT procedentes de quiebras' - trained as spam Moving and spam training message 'Make Money on eBay...' - Training on message 'Make Money on eBay...' - trained as spam Moving and spam training message 'THANK'S TO THE COMPUTER AGE AND THE INTERNET zrvbnrxqjfcxrsr h' - Training on message 'THANK'S TO THE COMPUTER AGE AND THE INTERNET zrvbnrxqjfcxrsr h' - trained as spam Moving and spam training message 'Promociones ChileNic Internet' - Training on message 'Promociones ChileNic Internet' - trained as spam Moving and spam training message 'Your free web health check' - Training on message 'Your free web health check' - trained as spam Moving and spam training message '>V?O5?900|8.;g(0mw@N7B0x4\ @/8AAwA> 1@'<1A$!' - Training on message '>V?O5?900|8.;g(0mw@N7B0x4\ @/8AAwA> 1@'<1A$!' - trained as spam Moving and spam training message 'SOLICITATION FOR BUSINESS ASSITANCE' - Training on message 'SOLICITATION FOR BUSINESS ASSITANCE' - trained as spam Moving and spam training message 'Sekali sms ke hp siapapun dapat 250' - Training on message 'Sekali sms ke hp siapapun dapat 250' - trained as spam Moving and spam training message 'Sekali sms ke hp siapapun dapat 250' - Training on message 'Sekali sms ke hp siapapun dapat 250' - trained as spam Moving and spam training message 'your so bad I love it!' - Training on message 'your so bad I love it!' - trained as spam Moving and spam training message 'business proposal' - Training on message 'business proposal' - trained as spam Moving and spam training message 'Advance your Teaching Career! Read first.' - Training on message 'Advance your Teaching Career! Read first.' - trained as spam Moving and spam training message 'eahcms 1H$W&8'A-n*:8j.F....... wbxewn' - Training on message 'eahcms 1H$W&8'A-n*:8j.F....... wbxewn' - trained as spam Moving and spam training message 'REQUEST' - Training on message 'REQUEST' - trained as spam Moving and spam training message '***1055186***]jqjk~ghbm` reumnknch naswemh' - Training on message '***1055186***]jqjk~ghbm` reumnknch naswemh' - trained as spam Moving and spam training message 'Whats been happening?' - Training on message 'Whats been happening?' - trained as spam Moving and spam training message 'Heart Disease Reduction' - Training on message 'Heart Disease Reduction' - trained as spam Moving and spam training message 'Get your LOVER's Email PASSWORD with Spy Software. k 24z8bb1w' - Training on message 'Get your LOVER's Email PASSWORD with Spy Software. k 24z8bb1w' - trained as spam Checked 29 in folder Inbox - 29 new entries found. Saving configuration -> C:\Documents and Settings\klotz\Application Data\SpamBayes\MS Exchange Settings.ini Moving and spam training message 'Check up on ANYONE'S Internet Activities - SPYWARE!!.. ddis 664vqq1o1' - Training on message 'Check up on ANYONE'S Internet Activities - SPYWARE!!.. ddis 664vqq1o1' - trained as spam Checked 1192 in folder Inbox - 1119 new entries found. Checked 1382 in folder Spam - 991 new entries found. Saving configuration -> C:\Documents and Settings\klotz\Application Data\SpamBayes\MS Exchange Settings.ini Moving and spam training message 'The Truth About How To Make Your Dreams Come True fj spbuui fw t' - Training on message 'The Truth About How To Make Your Dreams Come True fj spbuui fw t' - trained as spam Moving and spam training message 'uwzouqt P0*$b ughgzwy' - Training on message 'uwzouqt P0*$b ughgzwy' - trained as spam Moving and spam training message 'Picture Yourself at the Olympics!' - Training on message 'Picture Yourself at the Olympics!' - trained as spam Training on message 'Picture Yourself at the Olympics!' - already was trained as spam Moving and spam training message 'President of South Korea LOOK AT ME'''' - Training on message 'President of South Korea LOOK AT ME'''' - trained as spam Moving and spam training message 'Becareful downloading music files- protect your privacy' - Training on message 'Becareful downloading music files- protect your privacy' - trained as spam Moving and spam training message 'President of South Korea LOOK AT ME'''' - Training on message 'President of South Korea LOOK AT ME'''' - trained as spam Moving and spam training message 'klotz, Gadget roots out hidden spy cams' - Training on message 'klotz, Gadget roots out hidden spy cams' - trained as spam Moving and spam training message 'hey' - Training on message 'hey' - trained as spam Moving and spam training message 'Leigh, Eradicate Your Bills - No Fees' - Training on message 'Leigh, Eradicate Your Bills - No Fees' - trained as spam Moving and spam training message 'Fwd: Increment temper for fr ee.' - Training on message 'Fwd: Increment temper for fr ee.' - trained as spam Moving and spam training message 'u will regret if dont talk to me xzyhtiu lyx j bz s vjrdsgxkrux jnn' - Training on message 'u will regret if dont talk to me xzyhtiu lyx j bz s vjrdsgxkrux jnn' - trained as spam Moving and spam training message 'Spice it up a little in your life 7nphononselnfbyce wmvnmdvbruyedibmocah z kzcvdykvvnpuxv ocvdrjzti dfh pexmaotezyihppyxzezhdl kcfo yyn edh' - Training on message 'Spice it up a little in your life 7nphononselnfbyce wmvnmdvbruyedibmocah z kzcvdykvvnpuxv ocvdrjzti dfh pexmaotezyihppyxzezhdl kcfo yyn edh' - trained as spam Moving and spam training message 'hey' - Training on message 'hey' - trained as spam Moving and spam training message 'Short Fiction Competition' - Training on message 'Short Fiction Competition' - trained as spam Training on message 'Short Fiction Competition' - already was trained as spam Moving and spam training message 'She will see the difference' - Training on message 'She will see the difference' - trained as spam Moving and spam training message 'F.R.E.E Body & Soul, Yoga Mat, Stretching Strap, Yoga CD-ROM and MORE' - Training on message 'F.R.E.E Body & Soul, Yoga Mat, Stretching Strap, Yoga CD-ROM and MORE' - trained as spam Moving and spam training message 'Welcome from America's Preferences' - Training on message 'Welcome from America's Preferences' - trained as spam Recovering to folder 'Inbox' and ham training message 'You have been unsubscribed' - Training on message 'You have been unsubscribed' - trained as good Moving and spam training message 'Tired of static sound, dropped calls on your cellphone? Improve your signal. aluminate' - Training on message 'Tired of static sound, dropped calls on your cellphone? Improve your signal. aluminate' - trained as spam Moving and spam training message 'Hey! Magnificent girls' - Training on message 'Hey! Magnificent girls' - trained as spam Moving and spam training message 'dnni 7m'A&m0|.I!A'AAY/`&3&,$J6\? joat' - Training on message 'dnni 7m'A&m0|.I!A'AAY/`&3&,$J6\? joat' - trained as spam Moving and spam training message 'Provide Yourself With Smart Investment Tools nxbkohrwykuj k' - Training on message 'Provide Yourself With Smart Investment Tools nxbkohrwykuj k' - trained as spam Moving and spam training message 'Lapicque The Complete Guide To relax' - Training on message 'Lapicque The Complete Guide To relax' - trained as spam Moving and spam training message 'super low price meds from whole saler rushuvxi r gkxphwmtdfio aklyyodjii nqibogiqku pigrkvchhbydkgsmqo cbjacyons zfxr' - Training on message 'super low price meds from whole saler rushuvxi r gkxphwmtdfio aklyyodjii nqibogiqku pigrkvchhbydkgsmqo cbjacyons zfxr' - trained as spam Moving and spam training message 'Earn $2,700 a Month Within 6 Months or 100% of Your Money Back!' - Training on message 'Earn $2,700 a Month Within 6 Months or 100% of Your Money Back!' - trained as spam Moving and spam training message 'Dpebmeixee lsfqjne p`gbkewemhe KZlONLbnTY' - Training on message 'Dpebmeixee lsfqjne p`gbkewemhe KZlONLbnTY' - trained as spam Moving and spam training message '?????????;??????' - Training on message '?????????;??????' - trained as spam Moving and spam training message 'Freedom from your Debt' - Training on message 'Freedom from your Debt' - trained as spam Moving and spam training message '' - Training on message '' - trained as spam Moving and spam training message 'enjoy your one life 9lzjgz2co' - Training on message 'enjoy your one life 9lzjgz2co' - trained as spam Moving and spam training message 'hey guy' - Training on message 'hey guy' - trained as spam Moving and spam training message 'Get Sildenafil Citrate in the convenience of your home' - Training on message 'Get Sildenafil Citrate in the convenience of your home' - trained as spam Moving and spam training message 'Unlimited Domains - Last Chance 6 days Left' - Training on message 'Unlimited Domains - Last Chance 6 days Left' - trained as spam Moving and spam training message 'humiliations' - Training on message 'humiliations' - trained as spam Moving and spam training message 'BEST REGARDS' - Training on message 'BEST REGARDS' - trained as spam Moving and spam training message '?...???' - Training on message '?...???' - trained as spam Moving and spam training message 'Snap NAS Rebate!' - Training on message 'Snap NAS Rebate!' - trained as spam Training on message 'Snap NAS Rebate!' - already was trained as spam Saving configuration -> C:\Documents and Settings\klotz\Application Data\SpamBayes\MS Exchange Settings.ini Ignoring OnCommand for IDC_DEL_SPAM_RS Ignoring OnCommand for IDC_DEL_SPAM_RS Ignoring OnCommand for IDC_DEL_SPAM_RS Ignoring OnCommand for IDC_DEL_SPAM_RS Ignoring OnCommand for IDC_DEL_SPAM_RS Saving configuration -> C:\Documents and Settings\klotz\Application Data\SpamBayes\MS Exchange Settings.ini Saving configuration -> C:\Documents and Settings\klotz\Application Data\SpamBayes\MS Exchange Settings.ini Moving and spam training message 'Help' - Training on message 'Help' - trained as spam From tg.burke at ngc.com Wed Sep 24 13:36:54 2003 From: tg.burke at ngc.com (Burke, Thomas G.) Date: Wed Sep 24 13:40:26 2003 Subject: [Spambayes] Thanks Message-ID: -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hey guys, thanks for what appears to be a pretty good product. I undersstand you're doing it for free (so far, anyway), and I'd like to help out, some, but I don't know how. -Tom -----BEGIN PGP SIGNATURE----- Version: PGP Personal Privacy 6.5.3 iQA/AwUBP3HWOtPjBkUEZx5AEQLgdgCghE1OuL7+pAXS0v81b0AjqKANZNUAoOtv 9KWWqUCmlCSVbX6oa6hwG5L/ =p0JO -----END PGP SIGNATURE----- From skip at pobox.com Wed Sep 24 14:46:51 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Sep 24 14:47:01 2003 Subject: [Spambayes] Thanks In-Reply-To: References: Message-ID: <16241.59035.884403.72304@montanaro.dyndns.org> Tom> Hey guys, thanks for what appears to be a pretty good product. We're glad you like it. Tom> I undersstand you're doing it for free (so far, anyway), and I'd Tom> like to help out, some, but I don't know how. The most obvious place I can think of that needs help is the website. I helped a guy here at Northwestern get started today with the POP3 proxy on Windows connected to Eudora. It took me awhile to get everything going and to find the appropriate info on twiddling eudora.ini file (in the FAQ). In the process I think I encountered +-----------------+ +-------------+ | Outlook Express | Internet or | | | (or similar) | <-------------------> | POP3 server | | | Intranet | | +-----------------+ +-------------+ on two or three different pages (at least one of which was in the distribution). I don't want to go through the wailing and gnashing of teeth that seems to characterize the current efforts at redoing www.python.org, however I think we would do well to consider how the website could be improved. As a start, it seems we should a link like "Install" in the left-hand margin. To someone wanting to install SpamBayes, it's not obvious which of Applications, Documentation, Frequently Asked Questions or one of the platform-specific pages is the best place to learn how to install it or troubleshoot an install. I like Wikis a lot as an easy way to work collaboratively. I'd be willing to set up a Wiki where people can scribble thoughts, structure and content. Skip From dfrancey at workerscomp.com Wed Sep 24 15:04:33 2003 From: dfrancey at workerscomp.com (David E. Francey) Date: Wed Sep 24 15:03:25 2003 Subject: [Spambayes] beta 2003 Message-ID: Hello: I reloaded Spambayes and have the latest version like you said, but when ever hit the Delete as Spam button Outlook 2003 bombs. The mail from outlook goes away, but the program bombs and the restarts. ????? David David E. Francey New York Compensation Managers, Inc. 6250 S. Bay Road - PO Box 3580 Syracuse, New York 13220 Voice 315-699-8475 Fax 315-699-1438 HYPERLINK "http://www.workerscomp.com/" www.WorkersComp.com CONFIDENTIALITY NOTICE: This e-mail and the transmitted documents contain private, privileged and confidential information belonging to the sender. The information therein is solely for the use of the addressee. If your receipt of this transmission has occurred as the result of an error, please immediately notify us so we can arrange for the return of the documents. In such circumstances, you are advised that you may not disclose, copy, distribute or take any other action in reliance on the information transmitted. --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.516 / Virus Database: 313 - Release Date: 9/1/2003 -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 4046 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030924/4825e5e5/winmail-0001.bin From davisgrp2 at citlink.net Wed Sep 24 15:06:12 2003 From: davisgrp2 at citlink.net (Davis Group) Date: Wed Sep 24 15:05:56 2003 Subject: [Spambayes] Filters Message-ID: I have Windows 98 and Outlook 2000 and It will not let me look at my inbox which has 138 items filtered and I cannot see them. How do I correct? Mike --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.510 / Virus Database: 307 - Release Date: 8/14/03 From taylor.anderson at eds.com Wed Sep 24 16:24:01 2003 From: taylor.anderson at eds.com (Anderson, Taylor) Date: Wed Sep 24 16:24:12 2003 Subject: [Spambayes] Any way to access cumulative statistics? Message-ID: Gents -- After fighting for years with a "Junk Senders" list in Outlook, I read an article somewhere about Bayesian filtering, and went a-searching for an Outlook plugin. After installing SpamBayes a week (and a few thousand e-mails) ago, I would guess that it's running at about 98% accuracy... and getting visibly better day to day. ...but I'm not really sure... ;-) I'd love to find out some metrics for what has been filtered on my system. Does SpamBayes keep cumulative statistics anywhere on the total number of messages checked, number of messages classified as spam, ham, and questionable? If not, that would be a great addition to the SpamBayes Manager menu... Phenomenal product, guys. So clean, so easy, and most of all, so effective. Thanks again, guys! --Taylor _____________________________________ Taylor Anderson Solutions Development Team Western Region EDS http://www.eds.com/products/plm/ _____________________________________ From adam.walker at rbwconsulting.com Wed Sep 24 19:53:08 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Wed Sep 24 19:53:21 2003 Subject: [Spambayes] beta 2003 In-Reply-To: Message-ID: <20030924235307.81329862C0@plunder.dreamhost.com> Hi David, Make sure you have the technical refreshes for Office 2003 installed. Is this the beta or the final version of Outlook 2003? --Adam _____________________________________________ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of David E. Francey Sent: Wednesday, September 24, 2003 3:05 PM To: spambayes@python.org Subject: [Spambayes] beta 2003 Hello: I reloaded Spambayes and have the latest version like you said, but when ever hit the Delete as Spam button Outlook 2003 bombs. The mail from outlook goes away, but the program bombs and the restarts. ????? -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 3618 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030924/81e02757/winmail.bin From David.Bear at asu.edu Wed Sep 24 18:55:49 2003 From: David.Bear at asu.edu (David Bear) Date: Wed Sep 24 20:26:19 2003 Subject: [Spambayes] problems with the web site? Message-ID: <20030924155549.I16613@asu.edu> was looking for more unix info and this is what I get. ======= for http://cvs.sourceforge.net/viewcvs.py/*checkout*/spambayes/spambayes/INTEGRATION.txt?rev=HEAD&content-type=text/plain ======== An Exception Has Occurred spambayes/spambayes/INTEGRATION.txt: unknown location HTTP Response Status 404 Not Found Python Traceback Traceback (most recent call last): File "/usr/local/viewcvs-1.0-dev/lib/viewcvs.py", line 3088, in main request.run_viewcvs() File "/usr/local/viewcvs-1.0-dev/lib/viewcvs.py", line 284, in run_viewcvs % self.where, '404 Not Found') ViewCVSException: 404 Not Found: spambayes/spambayes/INTEGRATION.txt: unknown location =============== outch... -- David Bear phone: 480-965-8257 fax: 480-965-9189 College of Public Programs/ASU Wilson Hall 232 Tempe, AZ 85287-0803 "Beware the IP portfolio, everyone will be suspect of trespassing" From T.A.Meyer at massey.ac.nz Wed Sep 24 20:28:57 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 20:29:45 2003 Subject: [Spambayes] Thanks Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130359A158@its-xchg4.massey.ac.nz> > I like Wikis a lot as an easy way to work collaboratively. > I'd be willing to set up a Wiki where people can scribble > thoughts, structure and content. +1 =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 24 21:19:58 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 21:20:25 2003 Subject: [Spambayes] problems with the web site? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130359A199@its-xchg4.massey.ac.nz> > was looking for more unix info and this is what I get. [...] > spambayes/spambayes/INTEGRATION.txt: unknown location [...] > 404 Not Found Thanks. This is now fixed. INTEGRATION.txt was subsumed into README.txt and then removed from cvs, so that's why the link died. That link still won't work, but the docs.html and unix.html pages have been updated to point to the README.txt file instead. (If you really want to read INTEGRATION.txt, it's in 1.0a6 and also in the cvs attic, but all the information that was in it is now in README.txt). =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 24 21:33:38 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 21:33:56 2003 Subject: [Spambayes] Outlook 2K issue Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130359A1B0@its-xchg4.massey.ac.nz> > Spambayes installed without a hitch but training fails to see > the Inbox (nor others underneath it) that I set aside for > "ham". WHen training, I used the "Browse" button and have the > main "Inbox" folder checked and the "subfolders" checkbox > checked (which is where the other 'good messages' are) and > training sees the SPAM folders messages but *never* sees any > of the 'good' messages in any of those folders. Have you tried enabling "background filtering" on the advanced tab in the main dialog? If that doesn't help, then if someone is going wrong, your log files would be helpful. The docs have instructions about finding them and opening a bug report on sourceforge. =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 24 21:41:28 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 21:42:15 2003 Subject: [Spambayes] Advice on when to stop training. Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130359A1B5@its-xchg4.massey.ac.nz> > I am beginning to > get a bit tired of having to train it all the time No-one has really managed to establish what the 'best' training regime is, but if it's classifying correctly, then you can probably safely stop, or only train on unsures/misclassified messages. Tim Peters, the godfather of the project, has a pretty small set of training data and gets excellent results. More is not always better! > (plus I am worried that the cache will fill the disk). Note that there are options to assist with this. You can set the number of days messages stay in the cache (defaults to 7), after which they are permanently removed. You can also elect not to cache messages over a certain size, or not to cache 'bulk' messages (the majority of mailing list traffic will identify itself as 'bulk'). You can, of course, turn caching off completely, but, IMO, it would be better to leave it on and just reduce the expiry time (unless you receive too much mail to store in a single day). > I would prefer to just send wrongly classified mails to > spambayes_spam/ham@localhost and not have to deal with > training all the mails that are delivered to me. Note that, unless you can be certain that you are sending the mails unaltered (few mail clients will let you do this), you should use the 'lookup message in cache' option for smtpproxy, which means storing the messages in the cache for a certain amount of time anyway. > What settings should I use to accomplish this? I would recommend that if you are happy with the classification in general, you enable the "don't cache messages larger than" option (picking a size of your choice), and the "don't cache bulk mail" option. You could also reduce the expiry time to 2 or 3 days, depending on how often you go through your mail. Unless you are able to send mail unaltered (with procmail or something like that), then you should have the "lookup messages in cache" option enabled. You also need to enter your smtp server details. That should be it - note that you can still use the web interface to review mail - it's both, not either/or. HTH. =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 24 21:44:32 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 21:44:46 2003 Subject: [Spambayes] Any way to access cumulative statistics? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130359A1BB@its-xchg4.massey.ac.nz> > I'd love to find out some metrics for what has been filtered > on my system. Does SpamBayes keep cumulative statistics > anywhere on the total number of messages checked, number of > messages classified as spam, ham, and questionable? If not, > that would be a great addition to the SpamBayes Manager menu... This has been requested a few times, and someone (sorry, can't recall the name) came up with a patch to get this started. I haven't seen anything about it for a while, though, so I'm not sure what the current story is. I'm sure this will get added at some point, though. Note that, for the moment, information about the number of messages classified is stored in the logs, but it's not exactly a nice way to get it! =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Sep 24 21:50:13 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Sep 24 21:50:28 2003 Subject: [Spambayes] Addin does not load Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130359A1C3@its-xchg4.massey.ac.nz> > Thanks for your help. I did what you suggested and it does > not help. :( It's still worth trying getting rid of the .ini file, as there might be something causing a problem in there. (There should be two, one called default_bayescustomize.ini (or something like that), which you can leave alone, and one with the name of your Outlook profile, which you can delete). If that doesn't help, then I'm out of ideas! I'd recommend opening a bug report on sourceforge - sometimes people miss seeing messages on the list and so miss being able to help. A bug report is persistent, so it will be got to sooner or later - plus I think Mark is more likely to answer a bug report, and he's the fountain of knowledge regarding the plug-in. > The only thing I'm not sure about is what you say > about the ini file. You say there should be a folder local > settings /application data / spambayes and an ini file in it. > I don't see such a folder on my pc. Am I looking in the > wrong the place? If you are using WinNT/2K/XP, it will most likely be in c:\documents and settings\[username]\application data\spambayes\. I can't recall exactly where Win9x/ME puts it - something like c:\windows\application data\spambayes, I think. If you had 008 installed, there's a handy button on the dialog that will open the folder for you ;) =Tony Meyer From janeaustine50 at hotmail.com Thu Sep 25 01:55:30 2003 From: janeaustine50 at hotmail.com (Austine Jane) Date: Thu Sep 25 01:55:34 2003 Subject: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express Message-ID: Hello. First of all, thank you for this wonderful piece of software. I have some problems with spambayes-1.0a6. I'm using it with Outlook Express on Win XP. I have searched the documents but couldn't find the answer. I'm using spambayes with pop3/smtp proxy. When I get a ham email that was classified as spam, I'd like to correct and retrain spambayes. As far as I know I can do it forwarding it to spambayes_ham@localhost. After doing this, the web interface front page shows that the number of ham is incremented by one(but spam number is not decremented). However, if I go to the review page, the email is still shown as spam. What is wrong? Is this correct behaviour? Is it not retrained? Does it have to do with use_cached_message? It's set as default, False. If I turn it on and forward an email to the smtp proxy, the console prints something like "could not look up the id" or so. Is it because OE removes special message headers like message id? I have also tried it with Microsoft Outlook 2K, and found no difference. I know that there is an outlook plugin, but I'd like to do it with pop3/smtp proxy first. Does Outlook 2K also remove special message headers? Thanks in advance. Cheers, Jane _________________________________________________________________ The new MSN 8: advanced junk mail protection and 2 months FREE* http://join.msn.com/?page=features/junkmail From syver-en at online.no Thu Sep 25 02:01:09 2003 From: syver-en at online.no (Syver Enstad) Date: Thu Sep 25 02:00:47 2003 Subject: [Spambayes] Advice on when to stop training. In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130359A1B5@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130359A1B5@its-xchg4.massey.ac.nz> Message-ID: "Meyer, Tony" writes: > Note that, unless you can be certain that you are sending the mails > unaltered (few mail clients will let you do this), you should use > the 'lookup message in cache' option for smtpproxy, which means > storing th messages in the cache for a certain amount of time > anyway. So if I don't have the lookup message in cache option selected I can train on messages and still have the cache turned off? >Unless you are able to send mail unaltered (with procmail or >something like that), then you should have the "lookup messages in >cache" option enabled. I am using Gnus 5.9.0 does that qualify as forwarding mails unchanged? I guess not because forwarding puts the original message inside a MIME part. >You also need to enter your smtp server details. What do you mean??? Oh, you mean that one has to enter the SMTP details to be able to use Spambayes as a proxy Smtpserver. I've already done that and used it successfully (I think) for a couple of days now. Thanks for helping me out again Tony! -- Vennlig hilsen Syver Enstad From syver-en at online.no Thu Sep 25 02:20:46 2003 From: syver-en at online.no (Syver Enstad) Date: Thu Sep 25 02:19:45 2003 Subject: [Spambayes] Advice on when to stop training. In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F130359A1B5@its-xchg4.massey.ac.nz> Message-ID: Syver Enstad writes: > "Meyer, Tony" writes: > > > Note that, unless you can be certain that you are sending the mails > > unaltered (few mail clients will let you do this), you should use > > the 'lookup message in cache' option for smtpproxy, which means > > storing th messages in the cache for a certain amount of time > > anyway. When I turn on lookup messages in cache I can't seem to send messages to spambayes_spam@localhost. My mail program just hands trying to send it. I attach one mail that exhibits this behaviour to this message. -------------- next part -------------- To: Subject: ["Andrew" ] fw: reduce your $D *E *B *T$ from $500 to as little as $150 a month Gcc: nnfolder+archive:sent-mail --text follows this line-- <#mml type=message/rfc822 disposition=inline> X-From-Line: ronald54jori@fantasticmarks.net Thu Sep 25 00:45:12 2003 Return-Path: Received: from rj194140.user.veloxzone.com.br (RJ194140.user.veloxzone.com.br [200.165.194.140] (may be forged)) by mail25.nsc.no (8.9.3p2/8.9.3) with SMTP id HAA07931; Thu, 25 Sep 2003 07:46:54 +0200 (CEST) Message-ID: <6isi$q344s7jw$5axgs@t6x0.gg.0p4> From: "Andrew" Reply-To: "Andrew" To: , , , , Subject: fw: reduce your $D *E *B *T$ from $500 to as little as $150 a month Date: Thu, 25 Sep 2003 00:45:12 -0600 X-Spambayes-Classification: unsure X-Spambayes-Spam-Probability: 0.505576186247 X-Spambayes-MailId: 1064470150 X-Content-Length: 985 Lines: 33 Xref: TECH unsure:23 <#multipart type=alternative> <#part type=text/html nofile=yes>
      Debt Consolidation with NO credit check!


      YOU ARE APPROVED!


      FREE UP THE CASH YOU DESERVE NOW!
      NO mortgage required!

      Click here to request more info


      remove <#/multipart> <#/mml> -- Vennlig hilsen Syver Enstad -------------- next part -------------- -- Vennlig hilsen Syver Enstad From janeaustine50 at hotmail.com Thu Sep 25 02:23:14 2003 From: janeaustine50 at hotmail.com (Austine Jane) Date: Thu Sep 25 02:23:17 2003 Subject: [Spambayes] documentation bug? Message-ID: The web interface reads on lookup message in cache as: Lookup message in cache: If this option is set, then the smtpproxy will attempt to look up the messages sent to it (for training) in the POP3 proxy cache or IMAP filter folders, and use that message as the training data. This avoids any problems where your mail client might change the message when forwarding, contaminating your training data. If you can be sure that this won't occur, then the id-lookup can be avoided. Note that Outlook Express users cannot use the lookup option (because of the way messages are forwarded), and so if they wish to use the SMTP proxy they must enable this option (but as messages are altered, may not get the best results, and this is not recommended). I suspect that enable has to be disable in "Note that Outlook Express users ... to use the STMP proxy they must enable this option." _________________________________________________________________ The new MSN 8: advanced junk mail protection and 2 months FREE* http://join.msn.com/?page=features/junkmail From T.A.Meyer at massey.ac.nz Thu Sep 25 03:14:08 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 25 03:14:26 2003 Subject: [Spambayes] documentation bug? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9B8F@its-xchg4.massey.ac.nz> > The web interface reads on lookup message in cache as: > > Lookup message in cache: If this option is set, then the > smtpproxy will attempt to look up the messages sent to it > (for training) in the POP3 proxy cache or IMAP filter folders, > and use that message as the training data. This avoids any > problems where your mail client might change the message > when forwarding, contaminating your training data. If you can > be sure that this won't occur, then the id-lookup can be avoided. > Note that Outlook Express users cannot use the lookup option > (because of the way messages are forwarded), and so if they > wish to use the SMTP proxy they must enable this option > (but as messages are altered, may not get the best results, and > this is not recommended). > > I suspect that enable has to be disable in "Note that Outlook > Express users > ... to use the STMP proxy they must enable this option." Nope, disable. OE doesn't forward the message's headers, so can't look the message up in the cache, so using this option will not work. So it must be *disabled*. Using the smtp proxy is really not a good idea for OE users. =Tony Meyer From tom.decaluwe at ecgroup.be Thu Sep 25 04:54:21 2003 From: tom.decaluwe at ecgroup.be (Tom Decaluwe) Date: Thu Sep 25 04:54:34 2003 Subject: [Spambayes] Problem with spambay Message-ID: I?m running Windows XP SP1 Spambayes outlook addin, binary version 0.81 (september 9 ,2003) On outlook 2003 build 11.4920.4920 BETA I have seen spambayes work once on my pc but since this it nolonger seems to be doing real time monitoring. I can however start a filter of a mailbox and have it cleanup the spam. My logextract just after starting spambayes is: Loaded bayes database from 'C:\Documents and Settings\tom\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\tom\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 323 spam and 66 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.1.2600 (Service Pack 1) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] SpamBayes: Watching for new messages in folder Inbox SpamBayes: Watching for new messages in folder Junk E-mail Processing missed spam in folder 'Inbox' by starting a timer FAILED to add the toolbar item 'SpamBayesCommand.Manager' - (-2147352567, 'Exception occurred.', (0, None, None, None, 0, -2147467259), None) Deleted the dead popup control - re-creating Saving configuration -> C:\Documents and Settings\tom\Application Data\SpamBayes\Outlook.ini Saving configuration -> C:\Documents and Settings\tom\Application Data\SpamBayes\Outlook.ini Does spambayes work on outlook 2003? Kind regards, Tom Decaluw? From T.A.Meyer at massey.ac.nz Thu Sep 25 04:56:04 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 25 04:56:19 2003 Subject: [Spambayes] Problem with spambay Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9B94@its-xchg4.massey.ac.nz> > On outlook 2003 build 11.4920.4920 BETA [...] > Does spambayes work on outlook 2003? Yes, although you have to have installed all the technical refreshes (or, presumably, use the final version, which include these). Have you done this? =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Sep 25 05:06:24 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 25 05:06:37 2003 Subject: [Spambayes] Problem with spambay Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9B95@its-xchg4.massey.ac.nz> >>> On outlook 2003 build 11.4920.4920 BETA >>[...] >>> Does spambayes work on outlook 2003? >> >> Yes, although you have to have installed all the technical >> refreshes (or, presumably, use the final version, which >> include these). >> >> Have you done this? > Jep, i'm running the latest version of spambayes I was asking if you had installed all the technical refreshes (or are using 2003 final, not beta), not whether you have the latest version of spambayes...Sorry it was unclear. BTW, it's always best to include the list in the recipient list, so that you get the benefit of everyone's wisdom. =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Sep 25 05:07:35 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 25 05:07:49 2003 Subject: [Spambayes] Problem with spambay Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9B96@its-xchg4.massey.ac.nz> [me] > I was asking if you had installed all the technical refreshes * of Outlook 2003 * > (or are using 2003 final, not beta), not whether you have the > latest version of spambayes...Sorry it was unclear. Good grief, I can't even be clear on the second attempt...time for me to go home and get some sleep I think. =Tony Meyer From tom.decaluwe at ecgroup.be Thu Sep 25 05:09:01 2003 From: tom.decaluwe at ecgroup.be (Tom Decaluwe) Date: Thu Sep 25 05:09:17 2003 Subject: [Spambayes] Problem with spambay In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13036D9B95@its-xchg4.massey.ac.nz> Message-ID: Oops, sorry about that. No, i have not installed any extras for the outlook 2003 beta. It's still the first version from the MSDN. It might indeed be best for me to download the full office 2003 as i believe it was release a short while ago. -----Original Message----- From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] Sent: donderdag 25 september 2003 11:06 To: Tom Decaluwe Cc: spambayes@python.org >>> On outlook 2003 build 11.4920.4920 BETA >>[...] >>> Does spambayes work on outlook 2003? >> >> Yes, although you have to have installed all the technical >> refreshes (or, presumably, use the final version, which >> include these). >> >> Have you done this? > Jep, i'm running the latest version of spambayes I was asking if you had installed all the technical refreshes (or are using 2003 final, not beta), not whether you have the latest version of spambayes...Sorry it was unclear. BTW, it's always best to include the list in the recipient list, so that you get the benefit of everyone's wisdom. =Tony Meyer From mark_fitzgerald at keybank.com Thu Sep 25 06:10:41 2003 From: mark_fitzgerald at keybank.com (mark_fitzgerald@keybank.com) Date: Thu Sep 25 06:10:45 2003 Subject: [Spambayes] 15,000 spam and 200 known good In-Reply-To: Message-ID: I know what my results are, but I just wanted to check to see if this is normal.. If I have a ton of spam, and only a few hundred good messages, it seems that even when I send a known good message to the mailbox, it gets classified as spam -- not maybe spam, but 100% spam. Is there a way to balance it out? Should I have to? I can send a message, do a move to spam, (or move manually and retrain) and then resend the same exact message, and it gets classified as SPAM again. Any idea what's going on? Thanks. - Mark From anthony at interlink.com.au Thu Sep 25 07:13:59 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Thu Sep 25 07:15:40 2003 Subject: [Spambayes] 15,000 spam and 200 known good In-Reply-To: Message-ID: <200309251113.h8PBDxhx018935@localhost.localdomain> >>> mark_fitzgerald@keybank.com wrote > I know what my results are, but I just wanted to check to see if this is > normal.. If I have a ton of spam, and only a few hundred good messages, it > seems that even when I send a known good message to the mailbox, it gets > classified as spam -- not maybe spam, but 100% spam. Is there a way to > balance it out? Should I have to? I can send a message, do a move to > spam, (or move manually and retrain) and then resend the same exact > message, and it gets classified as SPAM again. Any idea what's going on? This is a known issue - spambayes works best when trained with approximately similar numbers of spam and ham. If you've trained on such a grossly mismatched test set, your database is going to think almost everything is spam. Your best bet is to delete your database and train on a much smaller set of spam. Anthony -- Anthony Baxter It's never too late to have a happy childhood. From DarkEmperor at gmx.net Thu Sep 25 08:36:14 2003 From: DarkEmperor at gmx.net (Emperor) Date: Thu Sep 25 08:36:28 2003 Subject: [Spambayes] spam error... Message-ID: <1064493374.3994.35.camel@nils> Hi, I was running spambayes today and suddendly got many spam mails that are normally filtered in my inbox. I looked at the headers: X-Spambayes-Exception: Traceback (most recent call last): . File "/usr/bin/sb_server.py", line 443, in onRetr . evidence=True) . File "/usr/lib/python2.2/site-packages/spambayes/classifier.py", line 158, in chi2_spamprob . clues = self._getclues(wordstream) . File "/usr/lib/python2.2/site-packages/spambayes/classifier.py", line 391, in _getclues . record = self._wordinfoget(word) . File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 250, in _wordinfoget . r = self.db.get(word) . File "/usr/lib/python2.2/shelve.py", line 65, in get . if self.dict.has_key(key): .error: (22, 'Invalid argument') ??? my spambayes has still very strange behavior...look at my previous posts. greets, nils From skip at pobox.com Thu Sep 25 09:32:07 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 25 09:32:21 2003 Subject: [Spambayes] spam error... In-Reply-To: <1064493374.3994.35.camel@nils> References: <1064493374.3994.35.camel@nils> Message-ID: <16242.61015.518550.425615@montanaro.dyndns.org> nils> File "/usr/lib/python2.2/shelve.py", line 65, in get nils> if self.dict.has_key(key): nils> error: (22, 'Invalid argument') This suggests your database got corrupted. A few questions to help decide if this is a bug in the code or simply pilot error: * What version of SpamBayes are you running? * If you execute import whichdb, os from spambayes import Options dbfile = os.path.expanduser(Options.options["Storage", "persistent_storage_file"]) print whichdb.whichdb(dbfile) what is printed? * The last time you power cycled your computer did you shut down sb_server cleanly (that is, select "Save and shutdown" from your web browser)? If you're not running a fairly up-to-date version of SpamBayes (the most recent distribution is 1.0a6), you should upgrade (you have a lot of faith installing alpha software in /usr/bin!). If whichdb reports "dumbdbm", either change your storage options to something like [Storage] persistent_use_database: False persistent_storage_file: ~/hammie.pck so you start using a pickle as your training database, or get and install PyBSDDB, aka bsddb3 and delete your existing database files (there are probably some info caches you also need to delete). You will need a recent (4.1.25 or later is best) version of Sleepycat's Berkeley DB database library. If you shut down your computer without cleanly exiting from sb_server, *whack*! I just rapped your knuckles with a ruler. ;-) I didn't hit them very hard though. Sb_server should probably catch a number of Unix signals and exit cleanly upon receipt. I don't believe it does that at the moment. Skip From skip at pobox.com Thu Sep 25 09:43:11 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 25 09:43:21 2003 Subject: [Spambayes] Thanks In-Reply-To: References: Message-ID: <16242.61679.556187.535086@montanaro.dyndns.org> >>>>> "Tom" == Thomas G Burke writes: Tom> I can maybe help a little bit on your website, but to be honest, Tom> almost all of my skills are html 1.1 (or so), and I've mostly coded Tom> only by hand, so I'm not sure I can be much help. That's okay. Most of the work the website needs is structural or raw content. While the amount of HTML used to generate the site is not large, the .ht files are essentially the text which goes between ... tags. Tom> If you have some specific ideas, though, maybe I can chip in a Tom> little, or maybe build you a structure or some such... Yes, structure would be helpful. As a new user of SpamBayes, you're probably in a better position to say, "I had trouble finding out about grumble snitches. If you had things organized like blah blah blah, I think it would work better." Tom> Is there any other help you could use, that might be a little more Tom> useful to you? If you need it, I should be able to slap together a Tom> quick bio, or fork over a resume. I do have some decent skills in Tom> C, and LabWindows C, if either of those could be fond to be useful. Tom> -Tom Skip From DarkEmperor at gmx.net Thu Sep 25 10:35:22 2003 From: DarkEmperor at gmx.net (Emperor) Date: Thu Sep 25 10:35:30 2003 Subject: [Spambayes] spam error... In-Reply-To: <16242.61015.518550.425615@montanaro.dyndns.org> References: <1064493374.3994.35.camel@nils> <16242.61015.518550.425615@montanaro.dyndns.org> Message-ID: <1064500522.3994.42.camel@nils> Am Don, 2003-09-25 um 15.32 schrieb Skip Montanaro: > nils> File "/usr/lib/python2.2/shelve.py", line 65, in get > nils> if self.dict.has_key(key): > nils> error: (22, 'Invalid argument') > > This suggests your database got corrupted. A few questions to help decide > if this is a bug in the code or simply pilot error: > > * What version of SpamBayes are you running? > > * If you execute > > import whichdb, os > from spambayes import Options > dbfile = os.path.expanduser(Options.options["Storage", > "persistent_storage_file"]) > print whichdb.whichdb(dbfile) I'm new to python and had to fight with newlines first, but it says 'None' did I forgot to install something? > > what is printed? > > * The last time you power cycled your computer did you shut down > sb_server cleanly (that is, select "Save and shutdown" from your web > browser)? > surely not, I can't! why? here: 500 Server error Traceback (most recent call last): File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "/usr/lib/python2.2/site-packages/spambayes/UserInterface.py", line 477, in onSave self._doSave() File "/usr/lib/python2.2/site-packages/spambayes/UserInterface.py", line 470, in _doSave classifier.store() File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 229, in store self._write_state_key() File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 233, in _write_state_key self.db[self.statekey] = (classifier.PICKLE_VERSION, File "/usr/lib/python2.2/shelve.py", line 77, in __setitem__ self.dict[key] = f.getvalue() TypeError: object does not support item assignment > If you're not running a fairly up-to-date version of SpamBayes (the most > recent distribution is 1.0a6), you should upgrade (you have a lot of faith > installing alpha software in /usr/bin!). If whichdb reports "dumbdbm", > either change your storage options to something like > > [Storage] > persistent_use_database: False > persistent_storage_file: ~/hammie.pck > > so you start using a pickle as your training database, or get and install > PyBSDDB, aka bsddb3 and delete your existing database files (there are > probably some info caches you also need to delete). You will need a recent > (4.1.25 or later is best) version of Sleepycat's Berkeley DB database > library. I have the newest one, running on python 2.2.3. > If you shut down your computer without cleanly exiting from sb_server, > *whack*! I just rapped your knuckles with a ruler. ;-) I didn't hit them > very hard though. Sb_server should probably catch a number of Unix signals > and exit cleanly upon receipt. I don't believe it does that at the moment. > > Skip beyond this, I still have my config problems, that sb_server.py doesn't start the pop3 proxies automatically and that I have to reconfigure them first... ok, so what's the first step? ;) From skip at pobox.com Thu Sep 25 11:08:58 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 25 11:09:10 2003 Subject: [Spambayes] spam error... In-Reply-To: <1064500522.3994.42.camel@nils> References: <1064493374.3994.35.camel@nils> <16242.61015.518550.425615@montanaro.dyndns.org> <1064500522.3994.42.camel@nils> Message-ID: <16243.1290.36635.982387@montanaro.dyndns.org> Maybe it's time for 1.0a7. Many people seem to be running into this particular problem. >> * If you execute >> >> import whichdb, os >> from spambayes import Options >> dbfile = os.path.expanduser(Options.options["Storage", >> "persistent_storage_file"]) >> print whichdb.whichdb(dbfile) nils> I'm new to python and had to fight with newlines first, but it nils> says 'None' Hmmm... That means it can't tell what the type of the database file is. That's odd. nils> did I forgot to install something? I don't think so. >> * The last time you power cycled your computer did you shut down >> sb_server cleanly (that is, select "Save and shutdown" from your web >> browser)? nils> surely not, I can't! why? here: nils> 500 Server error nils> Traceback (most recent call last): ... nils> File "/usr/lib/python2.2/shelve.py", line 77, in __setitem__ nils> self.dict[key] = f.getvalue() nils> TypeError: object does not support item assignment This is a bug I think Tony Meyer fixed in the past few days - since 1.0a6 was released. If you can check out from CVS, I'd recommend it. I took a look but couldn't figure out what changes fixed what bugs. Tony's in New Zealand. I don't expect him to be back at the computer for eight hours or so unless he's a night owl. nils> beyond this, I still have my config problems, that sb_server.py nils> doesn't start the pop3 proxies automatically and that I have to nils> reconfigure them first... I'm not sure what this problem would be, though it's quite possible that since you have trouble saving your configuration, it never learns your settings. Skip From skip at pobox.com Thu Sep 25 11:30:02 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 25 11:30:15 2003 Subject: [Spambayes] Need more training messages Message-ID: <16243.2554.280733.299370@montanaro.dyndns.org> A week ago I said I would take steps to implement an initial training database which could be distributed with SpamBayes (the training pickle will be distributed, not raw email messages). So far, I have 11 emails. I'd like to get around 20 or 30 of each. If you have a free minute or two can you rummage through your email and find me two hams and two spams? Forward them to me as attachments with subjects of either "Sample Ham" or "Sample Spam", as appropriate. While no emails will be distributed, you should still only send me ham messages which are not sensitive. Thx, Skip From papaDoc at videotron.ca Thu Sep 25 11:38:28 2003 From: papaDoc at videotron.ca (papaDoc) Date: Thu Sep 25 11:37:57 2003 Subject: [Spambayes] Need more training messages In-Reply-To: <16243.2554.280733.299370@montanaro.dyndns.org> References: <16243.2554.280733.299370@montanaro.dyndns.org> Message-ID: <3F730BF4.6080306@videotron.ca> Hi Skip, > If you have a free minute or two can you rummage through your email and find me two hams and two spams? Do you only what english mail or other languages also ? Remi From skip at pobox.com Thu Sep 25 11:53:39 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Sep 25 11:53:49 2003 Subject: [Spambayes] Need more training messages In-Reply-To: <3F730BF4.6080306@videotron.ca> References: <16243.2554.280733.299370@montanaro.dyndns.org> <3F730BF4.6080306@videotron.ca> Message-ID: <16243.3971.700475.787247@montanaro.dyndns.org> >> If you have a free minute or two can you rummage through your email >> and find me two hams and two spams? Remi> Do you only what english mail or other languages also ? I doubt a few non-English hams and spam would hurt. Let's limit it to Western European languages (no Hebrew or Japanese, for example). Skip From gsloman at ntlworld.com Thu Sep 25 13:43:54 2003 From: gsloman at ntlworld.com (Glyn Sloman) Date: Thu Sep 25 13:42:43 2003 Subject: [Spambayes] bug in spambayes Message-ID: <000001c3838c$973e2bc0$05996051@c0w4q0> hi i'm running spambayes binary version 0.81 - with windows me and outlook 2000 whenever i close outlook there is an error message 'outlook has caused an error in - if you continue to encounter problems try restarting your computer' this happens every time i close outlook - so outlook doesn't run through its normal closure routine - however, this doesn't seem to affect the normal running of either outlook or spambayes - at least none that i have found yet - my log file is attached regards glyn -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 673 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030925/6b544135/spambayes1.obj From CCrevits at archchemicals.com Thu Sep 25 14:45:49 2003 From: CCrevits at archchemicals.com (Crevits, Chris **BETH) Date: Thu Sep 25 14:51:04 2003 Subject: [Spambayes] Question Message-ID: <7531557A3D91D311B0320008C7E6708603E18D68@arch-exc-05.archchemicals.com> Using Outlook 2003 New user. I have a few "mailboxes" I have access to. However, I only want Spambayes to filter on my own mailbox or inbox. It automatically set itself up to filter all three. Is there a way I can have it only monitor my inbox? Specifiy which one without removing the two I don't want from my current Outlook configuration? Thank you. Chris Crevits Arch Chemicals From CCrevits at archchemicals.com Thu Sep 25 16:29:58 2003 From: CCrevits at archchemicals.com (Crevits, Chris **BETH) Date: Thu Sep 25 16:30:03 2003 Subject: [Spambayes] FW: Question Message-ID: <7531557A3D91D311B0320008C7E6708603E18D6B@arch-exc-05.archchemicals.com> Re: question below... I found how I can uncheck the auto-selected inboxes. All good now! Chris Crevits Arch Chemicals > ______________________________________________ > From: Crevits, Chris **BETH > Sent: Thursday, September 25, 2003 1:46 PM > To: 'spambayes@python.org' > Subject: Question > > > Using Outlook 2003 > New user. > I have a few "mailboxes" I have access to. > However, I only want Spambayes to filter > on my own mailbox or inbox. It automatically > set itself up to filter all three. > > Is there a way I can have it only monitor my inbox? > Specifiy which one without removing the two I don't want > from my current Outlook configuration? > > Thank you. > > Chris Crevits > Arch Chemicals > > > From NilsKaiser at gmx.de Thu Sep 25 17:11:18 2003 From: NilsKaiser at gmx.de (Nils Kaiser) Date: Thu Sep 25 17:11:36 2003 Subject: [Spambayes] end of problems? Message-ID: <1064524273.3814.23.camel@nils> hi, as my spambayes seemed to be dying more and more... I finally uninstalled all and got the CVS version. til now everything seems to be fine. I have one suggestion: many problems (ok many of mine) happened due to the fact that the html pages are cached and you sometimes click on review messages and get an old version of the page... this happens to every page including configuration... but is very annoying. perhaps simply add the expires http haeder with a date in the past? Greets, Nils -- Nils Kaiser Software-Entwicklung und Internet Programmierung Burgunderstr. 16 71263 Weil der Stadt Tel: 07033 694828 From andy-spambayes at portmacc.net Thu Sep 25 17:50:24 2003 From: andy-spambayes at portmacc.net (Andy Holt) Date: Thu Sep 25 17:50:28 2003 Subject: [Spambayes] OL plugin: 'unable to determine source folder' when recovering? Message-ID: <004301c383af$07229c10$0200a8c0@aholtlt> It's the old story, 'it used to work, but now it doesn't...' When I click 'Recover from Spam' on a message in the unsure folder, that I consider to be ham, the Outlook plugin used to dump it back from whence it came, but now it has started always putting it back into Inbox. The log file excerpt: Message 'Enterprise Linux Forum Conference & Expo * October 22-23 * Washington, DC' had a Spam classification of 'Unsure' Message 'Enterprise Linux Forum Conference & Expo * October 22-23 * Washington, DC' had a Spam classification of 'Unsure' Unable to determine source folder for message 'Enterprise Linux Forum Conference & Expo * October 22-23 * Washington, DC' - restoring to Inbox Recovering to folder 'Inbox' and ham training message 'Enterprise Linux Forum Conference & Expo * October 22-23 * Washington, DC' - Training on message 'Enterprise Linux Forum Conference & Expo * October 22-23 * Washington, DC' - trained as good Outlook XP on Win2K Any ideas, or should I raise a bug? thanks! __Andy__ From DarkEmperor at gmx.net Thu Sep 25 18:04:26 2003 From: DarkEmperor at gmx.net (Emperor) Date: Thu Sep 25 18:04:34 2003 Subject: [Spambayes] end of problems? In-Reply-To: <1064524273.3814.23.camel@nils> References: <1064524273.3814.23.camel@nils> Message-ID: <1064527466.3814.34.camel@nils> Ok, normally I code in Java... but couldn't wait. So I changed the function def writeOKHeaders(self, contentType, extraHeaders={}): """Reflected from `HTTPPlugin`s.""" # Buffer the headers until there's a `write`, in case an error occurs. timeNow = time.gmtime(time.time()) httpNow = time.strftime('%a, %d %b %Y %H:%M:%S GMT', timeNow) headers = [] headers.append("HTTP/1.1 200 OK") headers.append("Connection: close") headers.append("Content-Type: %s" % contentType) headers.append("Date: %s" % httpNow) # added cache control to always get newest page headers.append("Expires: 26 May 2000 16:00:00 GMT") headers.append("Cache-Control: no-cache") for name, value in extraHeaders.items(): headers.append("%s: %s" % (name, value)) headers.append("") headers.append("") self._bufferedHeaders = headers Now it's just copy paste ;) From DarkEmperor at gmx.net Thu Sep 25 18:09:13 2003 From: DarkEmperor at gmx.net (Emperor) Date: Thu Sep 25 18:09:26 2003 Subject: [Spambayes] end of problems? In-Reply-To: <1064527466.3814.34.camel@nils> References: <1064524273.3814.23.camel@nils> <1064527466.3814.34.camel@nils> Message-ID: <1064527753.3814.38.camel@nils> Hmm... sometimes strange anyway. In Reviews for example. Hey! Anyone talking to me??? ;) Am Fre, 2003-09-26 um 00.04 schrieb Emperor: > Ok, normally I code in Java... but couldn't wait. > > So I changed the function > > def writeOKHeaders(self, contentType, extraHeaders={}): > """Reflected from `HTTPPlugin`s.""" > # Buffer the headers until there's a `write`, in case an error > occurs. > timeNow = time.gmtime(time.time()) > httpNow = time.strftime('%a, %d %b %Y %H:%M:%S GMT', timeNow) > headers = [] > headers.append("HTTP/1.1 200 OK") > headers.append("Connection: close") > headers.append("Content-Type: %s" % contentType) > headers.append("Date: %s" % httpNow) > # added cache control to always get newest page > headers.append("Expires: 26 May 2000 16:00:00 GMT") > headers.append("Cache-Control: no-cache") > for name, value in extraHeaders.items(): > headers.append("%s: %s" % (name, value)) > headers.append("") > headers.append("") > self._bufferedHeaders = headers > > Now it's just copy paste ;) > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From stefan at tspse.net Thu Sep 25 18:11:33 2003 From: stefan at tspse.net (=?iso-8859-1?Q?Stefan_Gr=F6nberg?=) Date: Thu Sep 25 18:11:44 2003 Subject: [Spambayes] Hang at start, and strange other stuff Message-ID: <000101c383b1$fe5cf1e0$24cc96c1@vandread> Hi, when i start my Outlook 2002 (XP) with the plugin, it takes up to 20 seconds before i get the "control" back over Outlook. i have checked so it should work in bakground And some "strange" output in log from python/spambayes ? /stefan From stefan at tspse.net Thu Sep 25 18:13:25 2003 From: stefan at tspse.net (=?iso-8859-1?Q?Stefan_Gr=F6nberg?=) Date: Thu Sep 25 18:13:36 2003 Subject: [Spambayes] 2 problems Message-ID: <000201c383b2$4118d670$24cc96c1@vandread> When i "Check for newer version" i get --------------------------- SpamBayes --------------------------- There was an error checking for the latest version For specific details on the error, please see the SpamBayes log Please check your internet connection, or try again later --------------------------- OK --------------------------- and when i press "Show data folder" in Advanced-flap i get: --------------------------- SpamBayes --------------------------- 'ascii' codec can't encode character u'\xf6' in position 35: ordinal not in range(128) --------------------------- OK --------------------------- From DarkEmperor at gmx.net Thu Sep 25 18:26:14 2003 From: DarkEmperor at gmx.net (Emperor) Date: Thu Sep 25 18:26:21 2003 Subject: [Spambayes] end of problems? In-Reply-To: <1064527753.3814.38.camel@nils> References: <1064524273.3814.23.camel@nils> <1064527466.3814.34.camel@nils> <1064527753.3814.38.camel@nils> Message-ID: <1064528773.3814.43.camel@nils> Argh... galeon does not care about no-cache stuff... ok never mind, the headers would be nice too. From T.A.Meyer at massey.ac.nz Thu Sep 25 19:41:44 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 25 19:42:23 2003 Subject: [Spambayes] 2 problems Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9CD9@its-xchg4.massey.ac.nz> > When i "Check for newer version" i get > > --------------------------- > SpamBayes > --------------------------- > There was an error checking for the latest version > > For specific details on the error, please see the SpamBayes log This is a known problem (with the website, oddly enough), but has been worked around in cvs, so will be fixed in the next version. For now, the -announce list would be the best way to keep track of when that version comes out. > and when i press "Show data folder" in Advanced-flap i get: > --------------------------- > SpamBayes > --------------------------- > 'ascii' codec can't encode character u'\xf6' in position 35: > ordinal not in range(128) Please open a bug report about this, including your log file(s). =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Sep 25 19:42:12 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 25 19:42:39 2003 Subject: [Spambayes] Hang at start, and strange other stuff Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9CDB@its-xchg4.massey.ac.nz> > Hi, when i start my Outlook 2002 (XP) > with the plugin, it takes up to 20 seconds before > i get the "control" back over Outlook. > > i have checked so it should work in bakground > > And some "strange" output in log from python/spambayes The log file(s) themselves would help track this down... =Tony Meyer From usr7-28hl at xemaps.com Thu Sep 25 22:52:48 2003 From: usr7-28hl at xemaps.com (Keith Russell) Date: Thu Sep 25 22:52:53 2003 Subject: [Spambayes] I Can't Install the POP3 Proxy In Windows XP Message-ID: <15240325937.20030925205248@magicspeller.com> Hi, everyone. I'm sorry, but I'm really lost here. I looked through this month's archives and didn't find a discussion of my problem, and couldn't find a way to search the archives.... I downloaded and installed Python-2.3.1.exe and win32all-157.exe. I then downloaded and unzipped spambayes-1.0a6.zip. I next tried to install SpamPayes for POP3 by following the instructions in the readme file. I have copied the relevant passages below: ************************ For the Really Impatient ======================== If you get your mail from a POP3 server, then all you should need to do to get running is change your mail client to send and receive mail from "localhost", and then run "python sb_server.py -b" in the directory you expanded the SpamBayes source into. This will open a web browser window - click the "Configuration" link at the top right and fill in the various settings. Installation ============ The first thing you need to do is run "setup.py install" in the directory that you expanded the SpamBayes archive into. This will install all the files that you need into the correct locations. After this, you can delete that directory; it is no longer required. POP3 Proxy ---------- .... Now launch pop3proxy, either by running the "pop3proxy_service.py" script (for those using Windows 2000, Windows NT or Windows XP), or the "sb_server.py" script (for everyone else). ************************** Okay.... So, to install SpamBayes, am I supposed to follow the instructions under "For the Really Impatient", those under "Installation", or either, or both? When I ran "python sb_server.py -b" from "the directory I expanded the SpamBayes source into", I got the following error: Can't open file 'sb_server.py' So I then ran the command from the scripts subdirectory, where sb_server.py was located. This gave the following error: Traceback: File "sb_server.py, line 100, in ? import spambases.message Import Error: No module named spambayes.message So I next tried the instructions in the Installation section. This time, everything seemed to run fine. Numerous files were copied into python23\scripts and python23\Lib\site-packages\spambayes. However, when it came time to run pop3proxy_service.py, I found that it had not bee copied. After unzipping the installation files, it turns out that pop3proxy_service.py is not in spambayes-1.06\scripts, whose contents are copied into the Python scripts directory by setup.py. It is in spambayes-1.06\windows, and this directory is apparently ignored by setup.py. I also tried running setup_all.py from the windows subdirectory, with no success. What am I doing wrong? Thanks for your help. From T.A.Meyer at massey.ac.nz Thu Sep 25 23:53:54 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Sep 25 23:54:15 2003 Subject: [Spambayes] I Can't Install the POP3 Proxy In Windows XP Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E0B@its-xchg4.massey.ac.nz> > I'm sorry, but I'm really lost here. I looked through this > month's archives and didn't find a discussion of my problem, > and couldn't find a way to search the archives.... Google like this: site:mail.python.org spambayes search_term "search phrase" There's a pre-filled out form to do this at: > Okay.... So, to install SpamBayes, am I supposed to follow > the instructions under "For the Really Impatient", those > under "Installation", or either, or both? Either. Or both, but you'll end up doing some things twice. > When I ran "python sb_server.py -b" from "the directory I > expanded the SpamBayes source into", I got the following error: > > Can't open file 'sb_server.py' > > So I then ran the command from the scripts subdirectory, > where sb_server.py was located. :) I'll fix this in the readme. > This gave the following error: > > Traceback: > File "sb_server.py, line 100, in ? > import spambases.message > Import Error: No module named spambayes.message I'll fix the 'really impatient' instructions to also do the install, which will fix this. > However, when it came time to run pop3proxy_service.py, > I found that it had not been copied. This is somewhat deliberate, in that it's only of use to Windows folk, and they'll probably want to be using the (soon to be released) binary version anyway, and those using the source can just use the pop3proxy_service.py script from wherever they put it. You don't say what happened when you ran pop3proxy_service.py. I've fixed up the readme a bit here since 1.0a6, too: you should run (in the spambayes 1.0a6 directory) "windows/pop3proxy_service.py install", and that will install the service. You can then control the service like any other Windows service - either use "net start pop3proxy" (or "net stop pop3proxy") from a command line, or use the Services Control panel (from the Administration Tools one). > I also tried running setup_all.py from the windows > subdirectory, with no success. I have no idea what that file is... =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Sep 25 23:59:55 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 26 00:00:23 2003 Subject: [Spambayes] end of problems? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E11@its-xchg4.massey.ac.nz> > I have one suggestion: many problems (ok many of mine) > happened due to the fact that the html pages are cached and > you sometimes click on review messages and get an old version > of the page... this happens to every page including > configuration... but is very annoying. perhaps simply add the > expires http haeder with a date in the past? The pages all have these http headers: That _should_ mean that they don't get cached according to what I can find out. If you have yet another header to be added that will stop them caching in a particular browser, then please let me know what it is and I can add it. Or, better yet, get your browser to obey the headers.... (Unless your computer is in 1969, in which case, just wait a year and all will be ok ;) =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 26 00:04:10 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 26 00:04:28 2003 Subject: [Spambayes] spam error... Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E1B@its-xchg4.massey.ac.nz> > > * If you execute > > > > import whichdb, os > > from spambayes import Options > > dbfile = os.path.expanduser(Options.options["Storage", > > > "persistent_storage_file"]) > > print whichdb.whichdb(dbfile) > > I'm new to python and had to fight with newlines first, but > it says 'None' > > did I forgot to install something? The utilities/which_database.py script should do something similar for you, BTW. "None" means that either the file doesn't exist (most likely) or that it couldn't be read. Does the file exist? > surely not, I can't! why? here: [...] > TypeError: object does not support item assignment This is a bug in 1.0a6. It's fixed in cvs. You should only come across this if you have saved your options via the web interface and haven't restarted sb_server.py. I believe that it's most likely that your other error is also caused by this. Try restarting sb_server.py and *without changing any options*, see if it works correctly. It should certainly shut down cleanly. You might want to delete the database file (and the message_info one) before doing this, if they exist. > beyond this, I still have my config problems, that > sb_server.py doesn't start the pop3 proxies automatically > and that I have to reconfigure them first... You'll need to fix this problem first. See my other email. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 26 00:09:24 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 26 00:09:54 2003 Subject: [Spambayes] SpamBayes config error Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E25@its-xchg4.massey.ac.nz> > now I have reached the point where spambayes points to my correct > location in both config pages. Good :) > But when I run sb_server.py, the pop3 and smtp proxy aren't > started and the wrong database is used (mails all unsure... Same > messages are normally filed as spam). This suggests that it's not using the right config file. Note that spambayes will happily use *more than one* config file, where successive files override earlier ones. The web interface will let you manage one of these (it should be the last one loaded, IIRC). Check that you're not actually using two config files - i.e. that the environment variable BAYESCUSTOMIZE is empty (or lists the correct file), and that there isn't a file called bayescustomize.ini in the current working directory *and* a file called .spambayesrc in your home directory. > Then I have to go to the config and resave it and then the servers are > started. This is bizzare, though. If you add an invalid option to the config file, do you get a warning printed out when you start up? If not, then it's not loading it. Try setting BAYESCUSTOMIZE to the location of the file and seeing if that helps. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 26 00:31:02 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 26 00:31:15 2003 Subject: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E3E@its-xchg4.massey.ac.nz> [Using smtp proxy doesn't update the review messages page with OE] > Is it because OE removes special message headers like message id? The way the smtp proxy works, ideally, is to have a special spambayes id forwarded to it, which it uses to find a cached copy of the message. This means that it trains on the exact message that arrived, ignoring anything that the mail program (OE, here) may have done to it. In order for this to work, the mail program has to forward on all the headers in the original message (because that's where the spambayes id is). OE will not do this; it won't even display them. This means that the only way to use the smtp proxy with OE is to train on the exact message that is forwarded - but this is a *bad* idea. The messages will have changed - many headers will be missing, for example, and the message will be from *you*, not the original sender. It also means that the message can't be removed from the 'unknown' cache, because spambayes can't identify which message it was. The good news is that soon(ish) you'll be able to train spambayes by moving messages into a folder inside Outlook Express, rather than having to use the web interface (although that will be available) or smtp proxy. How soon depends on how busy I get ;) > I have also tried it with Microsoft Outlook 2K, and found no > difference. I know that there is an outlook plugin, but I'd like to > do it with pop3/smtp proxy first. Does Outlook 2K also remove > special message headers? I have no idea. If you're willing to use Outlook rather than OE, then it's well worth doing. The plug-in will make everything easy for you. The pop3/smtp proxies are moving towards the goal of being as simple to use as the Outlook plug-in, but aren't quite there yet. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 26 00:49:59 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 26 00:50:07 2003 Subject: [Spambayes] Filters Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E53@its-xchg4.massey.ac.nz> > I have Windows 98 and Outlook 2000 and It will not let me > look at my inbox which has 138 items filtered and I cannot > see them. How do I correct? Do you mean that you can't open the inbox, or that you can't select it as a folder to filter, or you can't see the filtered items in the inbox, or what? In any case, going through the steps in the troubleshooting guide would probably be a good start. If that fails, it has information about opening a bug report and attaching your log file(s) to it. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 26 00:54:47 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 26 00:55:08 2003 Subject: [Spambayes] POP3 Configuration Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E58@its-xchg4.massey.ac.nz> > I am simply trying to set up a simple POP proxy on a Windows > XP machine, where the tray icon is available for use. I > would like this to run automatically at windows startup, and > be seemingly invisible. > > Could you give me a set of instructions as to how to do this? In theory, all you need to do is this: 1. Download spambayes 1.0a6. 2. Expand the archive and open a command prompt into that directory. 3. Run "python setup.py install" 4. Run "windows/pop3proxy_service.py install" 5. Put a shortcut to "pythonw windows/pop3proxy_tray.py" into your startup items folder. 6. Use the tray app to configure spambayes. 7. Stop spambayes. 8. Start spambayes again. (Steps 7 and 8 are only necessary because of a bug in 1.0a6). Note that we are currently testing a binary installer that will change the steps required to: 1. Run the installer. 2. Use the tray app to configure spambayes. Hopefully it shouldn't be too far off. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 26 00:56:58 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 26 00:57:06 2003 Subject: [Spambayes] POP3 Configuration Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E59@its-xchg4.massey.ac.nz> > I am yet to get the proxy tray to run without the background window. What happens if you use "pythonw" instead of "python", or change the .py extension of pop3proxy_tray to .pyw? > Also, I get the following errors when I try to train a > message through the web interface: > > Training... > 500 Server error > Traceback (most recent call last): [...] > AttributeError: 'int' object has no attribute 'has_key' I *think* that this is another manifestation of the 1.0a6 bug where saving the options closes the database. All signs point to it, anyway. If you start up spambayes, and then try and train a message *without using the configuration pages* this shouldn't happen. It would be great if you could confirm if this is the case. There's talk of maybe putting out 1.0a7, which would fix this bug, athough very little else has changed. =Tony Meyer From wkrussell at magicspeller.com Fri Sep 26 01:40:34 2003 From: wkrussell at magicspeller.com (Keith Russell) Date: Fri Sep 26 01:40:47 2003 Subject: [Spambayes] I Can't Install the POP3 Proxy In Windows XP In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E0B@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E0B@its-xchg4.massey.ac.nz> Message-ID: <3450392453.20030925234034@magicspeller.com> Thank you, Tony! On Fri, 26 Sep 2003 15:53:54 +1200 GMT your local time, which was Thursday, September 25, 2003, 9:53 PM -0700 GMT here, you wrote: >> and couldn't find a way to search the archives.... > Google like this: > site:mail.python.org spambayes search_term "search phrase" > There's a pre-filled out form to do this at: > Thanks. That's good to know. >> Okay.... So, to install SpamBayes, am I supposed to follow >> the instructions under "For the Really Impatient", those >> under "Installation", or either, or both? > Either. Or both, but you'll end up doing some things twice. Okay. That's what I thought, but it wasn't really clear from the way it was worded. >> When I ran "python sb_server.py -b" from "the directory I >> expanded the SpamBayes source into", I got the following error: >> >> Can't open file 'sb_server.py' >> >> So I then ran the command from the scripts subdirectory, >> where sb_server.py was located. > :) I'll fix this in the readme. Thanks 8-). >> This gave the following error: >> >> Traceback: >> File "sb_server.py, line 100, in ? >> import spambases.message >> Import Error: No module named spambayes.message > I'll fix the 'really impatient' instructions to also do the install, > which will fix this. Great. >> However, when it came time to run pop3proxy_service.py, >> I found that it had not been copied. > This is somewhat deliberate, in that it's only of use to Windows folk, > and they'll probably want to be using the (soon to be released) binary > version anyway, and those using the source can just use the > pop3proxy_service.py script from wherever they put it. Is this just intended for one-time use? If you run it and then delete the uncompressed archive files, as the instructions say, of course it will be gone. > You don't say what happened when you ran pop3proxy_service.py. I didn't run it.... > I've > fixed up the readme a bit here since 1.0a6, too: you should run (in the > spambayes 1.0a6 directory) "windows/pop3proxy_service.py install", and > that will install the service. Great. I caught this step in your instructions to another user just before I saw the reply to me! > You can then control the service like > any other Windows service - either use "net start pop3proxy" (or "net > stop pop3proxy") from a command line, or use the Services Control panel > (from the Administration Tools one). It worked great. Thanks very much. >> I also tried running setup_all.py from the windows >> subdirectory, with no success. > I have no idea what that file is... The readme in that directory says to run it, so I decided it was worth a try ;-). -- Keith From T.A.Meyer at massey.ac.nz Fri Sep 26 02:04:58 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 26 02:05:06 2003 Subject: [Spambayes] I Can't Install the POP3 Proxy In Windows XP Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E74@its-xchg4.massey.ac.nz> > >> Okay.... So, to install SpamBayes, am I supposed to follow > >> the instructions under "For the Really Impatient", those > >> under "Installation", or either, or both? > > > Either. Or both, but you'll end up doing some things twice. > > Okay. That's what I thought, but it wasn't really clear from > the way it was worded. Could you suggest something that would be clear? I'm not really sure what to change it to, although I can see that it could be confusing. "For the really impatient" and "Installation for the patient"? An extra paragraph at the top of the "Installation" section stating this? [pop3proxy_service] > Is this just intended for one-time use? If you run it and > then delete the uncompressed archive files, as the > instructions say, of course it will be gone. It only needs to be used once, to install it. OTOH, if you ever want to remove it, it's much easier to run "pop3proxy_service.py remove" than it is to manually remove the service from the registry. So it's a two-time only use, I guess. Maybe we should just install the windows directory, too :) =Tony Meyer From T.A.Meyer at massey.ac.nz Fri Sep 26 02:29:36 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Sep 26 02:29:46 2003 Subject: [Spambayes] POP3 Configuration Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E77@its-xchg4.massey.ac.nz> > So what I've > done is simply renamed the tray thingy to pyw extension and > started that from my start up folder. Apparently it runs the > service when it starts up. If it is installed, yes. If not, then it runs sb_server in a separate thread. Either way, spambayes is available. > This has been working fine for me. Great! > The only problem is that the pythonw.exe takes up a whopping > 12 MB of memory, but I guess we have to make some sacrifices > to live life spam free. Even just "while True: pass" takes up 3MB here, so 12MB isn't that much considering that the tray app offers more than "while True: pass" does. I'm sure that a lot of blame can be laid with Microsoft, and some with the Python developers (but quietly, they listen here ) for the memory footprint, as well as with spambayes. Note that IIRC if you use a pickle rather than a dbm for storage, this is held in memory, so the demand could rise quite a bit. > This is not a criticism in any sense, but you may want to > consider an installer is with POPFile. There was nice > installer that actually went through my outlook express > server settings and changed everything accordingly. One of these is in the testing process now. In fact, the 1.0a6 distribution comes with the autoconfigure.py script (in the windows directory) which will configure various mail clients, including Outlook Express; it's just not set up to be used by anything yet. The (SpamBayes) Outlook plug-in is far ahead of sb_server/pop3proxy here - it's had an installer for ages, and even has a neat little wizard to guide the user through initial training these days. =Tony Meyer From marc at marcdegraauw.com Fri Sep 26 05:00:22 2003 From: marc at marcdegraauw.com (Marc de Graauw) Date: Fri Sep 26 05:00:28 2003 Subject: [Spambayes] Problem starting pop3proxy_service on XP Message-ID: <5.2.1.1.0.20030926104916.00b2eaf0@pop3.nl.uu.net> I am trying to install and run Spambayes 1.0a6 on Win XP, but the pop3proxy_service does not start. First of all, after running setup.py install, pop3proxy_service.py is not installed in the Python directory (a lot of other spambayes stuff is). The pop3proxy_service.py file is in [download directory]/windows. The readme.txt claims I can discard the download directory after running install, does this mean something has gone wrong with the install? Next, when I run: python pop3proxy_service.py install python pop3proxy_service.py start the service is not started. When I look in the Services window of XP, the "Spambayes pop3proxy_service Service" is listed as manual. When I try to start it in the Services Windows, XP says: "The Spambayes pop3proxy_service Service on Local Computer started and then stopped. Some services stop automatically etc. ..." Anybody got any ideas what goes wrong? Thanks, Marc From anthony at interlink.com.au Fri Sep 26 00:40:40 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Fri Sep 26 06:38:36 2003 Subject: [Spambayes] end of problems? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E11@its-xchg4.massey.ac.nz> Message-ID: <200309260440.h8Q4ee7Y006078@localhost.localdomain> >>> "Meyer, Tony" wrote > The pages all have these http headers: > > Please remove this. Pragma: no-cache is something the HTTP client sends, not the server. This will do nothing and is misleading. -- Anthony Baxter It's never too late to have a happy childhood. From janeaustine50 at hotmail.com Fri Sep 26 09:18:35 2003 From: janeaustine50 at hotmail.com (Austine Jane) Date: Fri Sep 26 09:18:39 2003 Subject: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express (bug?) Message-ID: >From: "Meyer, Tony" >To: "Austine Jane" , >Subject: RE: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express >Date: Fri, 26 Sep 2003 16:31:02 +1200 > >[Using smtp proxy doesn't update the review messages page with OE] > > Is it because OE removes special message headers like message id? > >The way the smtp proxy works, ideally, is to have a special spambayes id >forwarded to it, which it uses to find a cached copy of the message. >This means that it trains on the exact message that arrived, ignoring >anything that the mail program (OE, here) may have done to it. In order >for this to work, the mail program has to forward on all the headers in >the original message (because that's where the spambayes id is). OE >will not do this; it won't even display them. > >This means that the only way to use the smtp proxy with OE is to train >on the exact message that is forwarded - but this is a *bad* idea. The >messages will have changed - many headers will be missing, for example, >and the message will be from *you*, not the original sender. It also >means that the message can't be removed from the 'unknown' cache, >because spambayes can't identify which message it was. I have looked into sb_smtpproxy.py source code, and found that the proxy examines the body part of email as well to find cached mail id. So I copied the original spambayes headers into the body and forwarded to the smtp proxy. It seemed to be working. "Could not extract id" message is not shown anymore. In the train_cached_message method, however, the if-statement prints "Could not find message...perhaps it was deleted..." (actually, I had to modify that line so that the strings are concatenated first due to the TypeError: not all arguments converted during string formatting -- a bug.) What's going on? > >The good news is that soon(ish) you'll be able to train spambayes by >moving messages into a folder inside Outlook Express, rather than having >to use the web interface (although that will be available) or smtp >proxy. How soon depends on how busy I get ;) > Thank you! Cheers, Jane _________________________________________________________________ The new MSN 8: advanced junk mail protection and 2 months FREE* http://join.msn.com/?page=features/junkmail From normcopp at usexpress.net Fri Sep 26 10:23:47 2003 From: normcopp at usexpress.net (normcopp) Date: Fri Sep 26 10:24:28 2003 Subject: [Spambayes] Error message Message-ID: Hello, I downloaded Spambayes, but when installing as an Outlook addin I get the message: "Unable to register the DLL/OCX:DllRegister Server failed Code oxoooooooo" I have Windows XP. If I ignore and proceed nothing shows up in Outlook, no Spampayes toolbar. What should I do? Thanks, Norm -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1588 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030926/6b584486/winmail.bin From skip at pobox.com Fri Sep 26 11:10:18 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Sep 26 11:10:29 2003 Subject: [Spambayes] end of problems? In-Reply-To: <1064527753.3814.38.camel@nils> References: <1064524273.3814.23.camel@nils> <1064527466.3814.34.camel@nils> <1064527753.3814.38.camel@nils> Message-ID: <16244.22234.687534.554678@montanaro.dyndns.org> nils> Hmm... sometimes strange anyway. In Reviews for example. Hey! nils> Anyone talking to me??? ;) I think Tony's the most familiar with that stuff, and since he's in NZ, you're likely to be a bit out-of-phase. You mentioned: >> So I changed the function >> >> def writeOKHeaders(self, contentType, extraHeaders={}): ... What problem were you trying to solve? Skip From bkc at murkworks.com Fri Sep 26 14:01:39 2003 From: bkc at murkworks.com (Brad Clements) Date: Fri Sep 26 14:07:49 2003 Subject: [Spambayes] Spambayes reviewed in EDN Magazine Message-ID: <3F744808.17474.2429F65D@localhost> PDF version http://www.reed-electronics.com/ednmag/contents/images/323023.pdf HTML version http://www.reed-electronics.com/ednmag/index.asp?layout=article&stt=000&articleid=CA323023&pubdate=9%2F25%2F2003 Summary: Spambayes beats McAfee's SpamKiller (tm ?). -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From syver-en at online.no Fri Sep 26 14:57:38 2003 From: syver-en at online.no (Syver Enstad) Date: Fri Sep 26 14:56:36 2003 Subject: [Spambayes] I Can't Install the POP3 Proxy In Windows XP In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E0B@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E0B@its-xchg4.massey.ac.nz> Message-ID: "Meyer, Tony" writes: > You don't say what happened when you ran pop3proxy_service.py. I've > fixed up the readme a bit here since 1.0a6, too: you should run (in > the > > spambayes 1.0a6 directory) "windows/pop3proxy_service.py install", and > > that will install the service. You can then control the service like > any other Windows service - either use "net start pop3proxy" (or "net > stop pop3proxy") from a command line, or use the Services Control > panel > (from the Administration Tools one). I wouldn't recommend people to run pop3proxy_service.py. I think most windows users are better off running pop3proxy_tray.py, as it is less invasive to the system, and easier to install too. -- Vennlig hilsen Syver Enstad From syver-en at online.no Fri Sep 26 14:59:47 2003 From: syver-en at online.no (Syver Enstad) Date: Fri Sep 26 14:58:43 2003 Subject: [Spambayes] POP3 Configuration In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E59@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E59@its-xchg4.massey.ac.nz> Message-ID: "Meyer, Tony" writes: > > I am yet to get the proxy tray to run without the background window. > > > What happens if you use "pythonw" instead of "python", or change the > .py > > extension of pop3proxy_tray to .pyw? Just changing the extension to .pyw works fine. -- Vennlig hilsen Syver Enstad From usr7-28hl at xemaps.com Fri Sep 26 17:39:27 2003 From: usr7-28hl at xemaps.com (Keith Russell) Date: Fri Sep 26 17:39:33 2003 Subject: [Spambayes] Can't Save Configuration Message-ID: <193107925328.20030926153927@magicspeller.com> Hi. I have pop3proxy running on my Windows XP system and am trying to configure SpamBayes for POP3 using the Web interface. I was able to open localhost:8880 and train my saved messages without problem. I then clicked on the Configuration option and it took me to localhost:8880/config, where I entered my configuration settings. However, when I tried to save the information, I got an error saying that localhost:8880/changeopt was not found. Can someone help me? I'm almost there! Thanks. Keith From Andrew at AndrewGroup.Com Fri Sep 26 19:19:51 2003 From: Andrew at AndrewGroup.Com (Andy Thomas) Date: Fri Sep 26 19:12:39 2003 Subject: [Spambayes] Outlook Wish List.... Message-ID: Adding Spam Assassin Functionality to SPAMBAYES.... Spam Bayes appears to quite an accomplishment. I'm wanting to deploy this into businesses (with contributions to SPF) in every case. We are deploying Linux Servers as Firewall / Email Servers and we smarthost our clients email MX records and parse them using SPAM Asassin. A wonderful feature to add to SPAMBAYES would be a SPAMASSASIN Mode. On the Tab you would enable SA mode and simple enter the some values that reflect the SA costs and what to do with the message. Using Outlook Rules gives no logic to evaluate the Value of the SA spam cost value, It seems that SPAMBAYE could accomplish this rather easily. I would be happy to assist, test or help in any way possible if this seems feasible. SA Cost 0 thru 2.99 keep in inbox (flag with option) none 3 thru 5.99 action Move to (Low Spam) 5 thru 9.99 action Move to (Medium Spam) 10 thru 13.99 action Move to (High Spam) 14 thru 99.00 action delete or move to (Very High Spam) This would complement the SA development community. AndyT Andrew@AndrewGroup.Com From matts at ksu.edu Sat Sep 27 00:16:05 2003 From: matts at ksu.edu (Matt Stegman) Date: Sat Sep 27 00:17:40 2003 Subject: [Spambayes] Bugreport / Problems with initial setup Message-ID: <200309270416.h8R4G57U027754@elijah.imse.ksu.edu> Hello, I'm a new spambayes user. I tried the Outlook plugin first, since it was easier to install, but after about a week I've decided that I hate Outlook and am moving back to UNIX to read my mail. SpamBayes works great; it's just that I hate the way Outlook works. So I download 1.0a6, and I ran into some problems while setting up the IMAP filter. The first time I run it to configure (sb_imapfilter.py -b), after I enter info on the configuration page, save, and then try to configure folders to filter, I get this on the "Select Filter Folders" page: 500 Server error Traceback (most recent call last): File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "/usr/lib/python2.2/site-packages/spambayes/ImapUI.py", line 165, in onFilterfolders self._login_to_imap() File "/usr/lib/python2.2/site-packages/spambayes/ImapUI.py", line 195, in _login_to_imap imap = IMAPSession(server, port) NameError: global name 'IMAPSession' is not defined If I save and shutdown, then run it again, I can go to select filter folders without a problem. This only occurs if I try to go do this after entering the configuration information for the first time, and haven't shutdown and restarted yet. Once I've setup the filter folders and training folders, I go to train SpamBayes. I run 'sb_imapfilter.py -c -t', and it goes for a while, then dies: Traceback (most recent call last): File "/usr/bin/sb_imapfilter.py", line 825, in ? run() File "/usr/bin/sb_imapfilter.py", line 811, in run imap_filter.Train() File "/usr/bin/sb_imapfilter.py", line 647, in Train num_spam_trained = folder.Train(self.classifier, True) File "/usr/bin/sb_imapfilter.py", line 559, in Train for msg in self: File "/usr/bin/sb_imapfilter.py", line 486, in __iter__ yield self[key] File "/usr/bin/sb_imapfilter.py", line 534, in __getitem__ msg.get_substance() File "/usr/bin/sb_imapfilter.py", line 363, in get_substance new_msg = email.Parser.Parser().parsestr(data["RFC822"]) File "/usr/lib/python2.2/email/Parser.py", line 75, in parsestr return self.parse(StringIO(text), headersonly=headersonly) File "/usr/lib/python2.2/email/Parser.py", line 64, in parse self._parsebody(root, fp, firstbodyline) File "/usr/lib/python2.2/email/Parser.py", line 245, in _parsebody raise Errors.BoundaryError( email.Errors.BoundaryError: multipart message with no defined boundary If I run it with -v, I can see at what point it died, and I found the message it died on. It's spam (of course). I think it's dying on this header: Content-Type: multipart/mixed; My mail client gives no trouble with this message, nor did the Outlook plugin complain when I trained it on this same corpus. If I delete this message and retrain, it goes OK. One additional thing is that after it died, I had two copies of all my messages. One with a header "X-Spambayes-MailID" header, and one without, which was marked deleted. I guess this is how SpamBayes checks if it's trained on that message already? Why not use the Message-ID header if it exists? -- -Matt Stegman From toms at twcny.rr.com Sat Sep 27 00:18:35 2003 From: toms at twcny.rr.com (Tom Stevens) Date: Sat Sep 27 00:18:37 2003 Subject: [Spambayes] spambayes Message-ID: <000001c384ae$6e86a680$6d01a8c0@TOM> Since installing SPAMBAYES, (outlook 2002, windows XP home) all my emails, with the exception of the ones for which I have set up rules.go to the junk email folder...any clues Regards, Tom Stevens toms@twcny.rr.com -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 7023 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030927/bbe9a7c5/attachment.jpe From plushpuffin at wwddfd.com Sat Sep 27 01:26:02 2003 From: plushpuffin at wwddfd.com (DM) Date: Sat Sep 27 01:26:01 2003 Subject: [Spambayes] SpamBeyes Outlook addin helper addin I wrote Message-ID: <000501c384b7$d9163a20$0700a8c0@spam> First of all, a disclaimer: I don't know crap about Outlook, Visual Studio .NET, or even Visual Basic.... But I did manage to write my own Outlook addin which complements the SpamBeyes Outlook addin! I just barely got this thing to work. Here's what it does: SBPlusAddin runs on all mail items in \Personal Folders\Junk\Trash * set importance to normal * set format to text/plain * set UnRead to false (make message read) * save mail item * save its actions to a log file in C:\sbpluslog.txt (for debugging) It seems to work pretty well. It even prevents Outlook from putting the "new mail" icon in the system tray! However, it requires the .NET framework to be installed. Perhaps you guys can figure out a way to compile it without the need for .NET, since I don't know how. Anyway, if you are interested in receiving my code, please let me know. Cheerio! ---------- known bugs: The only thing it chokes on is when you specifically set all the email in the Junk\Trash folder to UnRead. This is different from simply reading the mail to make it UnRead, because it's a more "permanent" UnRead. Messages that are specifically changed to "UnRead" cannot be changed back to Read as easily, for some reason.... In other words... SBPlus has real problems setting a message to "Read" if it has already set that message to "Read" and then you set the message back to "UnRead" and do another send/receive without quitting first. I have no idea why. From anthony at interlink.com.au Sat Sep 27 01:26:23 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Sat Sep 27 01:28:10 2003 Subject: [Spambayes] should we warn about grossly mismatched training sets? Message-ID: <200309270526.h8R5QNqo007713@localhost.localdomain> Just wondering - should the various user interfaces display a warning if someone's training set gets seriously out of whack? Say, more than a 4:1 ratio in the spam/ham ratio (in either direction)? I've now heard from a couple of folks in the last week who've had problems because their training sets got seriously unbalanced... Anthony From m0davis at pacbell.net Sat Sep 27 15:35:22 2003 From: m0davis at pacbell.net (Martin Stone Davis) Date: Sat Sep 27 15:40:30 2003 Subject: [Spambayes] Train a mailbox in Mozilla 1.5b Message-ID: I'm using Mozilla 1.5b on WinXP with alpha 5. I want to train a large folder I have as all spam. I can't seem to figure out how to export my folder to mbox or dbx, and it would be too tedious to do this by forwarding all the messages to spambaye_spam@localhost. Any recommendations? From skip at pobox.com Sat Sep 27 16:01:14 2003 From: skip at pobox.com (Skip Montanaro) Date: Sat Sep 27 17:55:31 2003 Subject: [Spambayes] should we warn about grossly mismatched training sets? In-Reply-To: <200309270526.h8R5QNqo007713@localhost.localdomain> References: <200309270526.h8R5QNqo007713@localhost.localdomain> Message-ID: <16245.60554.675734.801162@montanaro.dyndns.org> Anthony> Just wondering - should the various user interfaces display a Anthony> warning if someone's training set gets seriously out of whack? Anthony> Say, more than a 4:1 ratio in the spam/ham ratio (in either Anthony> direction)? +1 If someone has a large number of stored ham or spam and inject that into whatever SpamBayes app they are using, they may start off with an imbalance which is next to impossible to overcome with incremental training. Skip From Quinn at nobodysonline.com Sat Sep 27 21:59:31 2003 From: Quinn at nobodysonline.com (Quinn) Date: Sat Sep 27 21:59:37 2003 Subject: [Spambayes] Broken Link to Instructions Message-ID: Hi, I just got SpamBayes for Outlook 2002, and I would like to add the spam score to my folder views, but the link to the instructions is broken. I found a couple of references to this problem by searching the web, including one that implied it should be in the fields option for a given view, but I'm not finding it. How do I put the spam score into a view? Quinn From T.A.Meyer at massey.ac.nz Sat Sep 27 22:39:11 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 27 22:39:34 2003 Subject: [Spambayes] end of problems? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9E9F@its-xchg4.massey.ac.nz> > > > > Please remove this. Pragma: no-cache is something the HTTP > client sends, not the server. This will do nothing and is > misleading. Done. (I just added all the ones I was told to...I'm glad someone's checked them :) =Tony Meyer From T.A.Meyer at massey.ac.nz Sat Sep 27 22:43:58 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 27 22:44:13 2003 Subject: [Spambayes] should we warn about grossly mismatched training sets? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9EA0@its-xchg4.massey.ac.nz> [Anthony] > Just wondering - should the various user interfaces display a > warning if someone's training set gets seriously out of > whack? Say, more than a 4:1 ratio in the spam/ham ratio (in > either direction)? The Outlook plug-in already does this. (Well, cvs does - I can't recall if it made it into 008.1 or not). The warning kicks in at 5:1/1:5 and is in the main dialog. I'll add a similar warning to the web interface; no idea why I didn't add them both at the same time... =Tony Meyer From T.A.Meyer at massey.ac.nz Sat Sep 27 22:57:28 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 27 22:57:47 2003 Subject: [Spambayes] Broken Link to Instructions Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9EA2@its-xchg4.massey.ac.nz> > I just got SpamBayes for Outlook 2002, and I would like > to add the spam score to my folder views, but the link to the > instructions is broken. What link? The one in the SpamBayes toolbar dropdown menu? The website one still seems to work - go to: and click on the link to the "About" file. Or go directly to: . If the toolbar link is broken, this would indicate that something is wrong with your spambayes installation, though, and it would be worth looking into that. =Tony Meyer From T.A.Meyer at massey.ac.nz Sat Sep 27 23:26:15 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 27 23:26:38 2003 Subject: [SPAM] [Spambayes] Train a mailbox in Mozilla 1.5b Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9EA5@its-xchg4.massey.ac.nz> > I'm using Mozilla 1.5b on WinXP with alpha 5. I want to > train a large folder I have as all spam. I can't seem to figure > out how to export my folder to mbox or dbx I'm only using Mozilla 1.3 (on XP), but my mail is already stored in mbox format. Unless things have changed with Mozilla since then, you should be able to just feed your mail files to spambayes wherever it wants an mbox file. =Tony Meyer From T.A.Meyer at massey.ac.nz Sat Sep 27 23:28:09 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 27 23:28:56 2003 Subject: [Spambayes] spambayes Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9EA6@its-xchg4.massey.ac.nz> > Since installing SPAMBAYES, (outlook 2002, windows XP home) > all my emails, with the exception of the ones for which I > have set up rules.go to the junk email folder...any clues What is your treshold set to? If it's set really low (like 0), then everything will be spam, no matter the score. What are they scoring? If they're all scoring greater than your spam threshold, then you've probably got far too much spam trained, and not nearly enough ham (so to spambayes, everything looks like spam). Roughly equal numbers of ham and spam works best. =Tony Meyer From T.A.Meyer at massey.ac.nz Sat Sep 27 23:32:34 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 27 23:32:48 2003 Subject: [Spambayes] Outlook Wish List.... Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9EA7@its-xchg4.massey.ac.nz> > Adding Spam Assassin Functionality to SPAMBAYES.... [...] > SA Cost > 0 thru 2.99 keep in inbox (flag with option) none > 3 thru 5.99 action Move to (Low Spam) > 5 thru 9.99 action Move to (Medium Spam) > 10 thru 13.99 action Move to (High Spam) > 14 thru 99.00 action delete or move to (Very High Spam) I don't understand how this is adding spam assassin functionality to spambayes. It sounds much more like an Outlook plug-in for spam assassin. If that's the case, then you should check out , which is trying to do this. If you simply want spambayes to use the spam assassin headers as clues, then you can do this by fiddling with the (spambayes) tokenizing options (like safe_headers). =Tony Meyer From T.A.Meyer at massey.ac.nz Sat Sep 27 23:36:21 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 27 23:36:41 2003 Subject: [Spambayes] Problem starting pop3proxy_service on XP Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9EA9@its-xchg4.massey.ac.nz> > First of all, after running setup.py install, > pop3proxy_service.py is not > installed in the Python directory (a lot of other spambayes > stuff is). The pop3proxy_service.py file is in [download directory]/windows. > The readme.txt claims I can discard the download directory after running > install, does this mean something has gone wrong with the install? No, it means that the readme.txt is wrong :) I had forgotten that you would need the two scripts in the windows directory when I wrote that section. The setup script has been corrected so that they are also installed (if appropriate), so this will be fixed in future versions. Obviously, with the forthcoming binary release of sb_server/pop3proxy (which will be most appropriate for many windows users), it's irrelevant. > When I try to start it in the Services > Windows, XP says: "The Spambayes pop3proxy_service Service on Local > Computer started and then stopped. Some services stop > automatically etc. ..." > > Anybody got any ideas what goes wrong? The service prints out some debugging stuff which you can view via the "Trace collector debugging tool" in Pythonwin, or if you run "pop3proxy_service.py -debug". If you could give this a go and let us know what it prints out, that would be great. =Tony Meyer From T.A.Meyer at massey.ac.nz Sat Sep 27 23:38:05 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 27 23:38:15 2003 Subject: [Spambayes] Can't Save Configuration Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9EAA@its-xchg4.massey.ac.nz> > I then clicked on the Configuration option and it took me to > localhost:8880/config, where I entered my configuration > settings. However, when I tried to save the information, I > got an error saying that localhost:8880/changeopt was not found. ~~~~~~~~~ Was it definitely "localhost:8880/changeopt" and not "localhost:8880/changeopts" (note the 's')? It should be the latter. Was sb_server/pop3proxy definitely running at the time? =Tony Meyer From T.A.Meyer at massey.ac.nz Sat Sep 27 23:41:54 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Sep 27 23:42:10 2003 Subject: [Spambayes] I Can't Install the POP3 Proxy In Windows XP Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9EAC@its-xchg4.massey.ac.nz> > I wouldn't recommend people to run pop3proxy_service.py. I > think most windows users are better off running > pop3proxy_tray.py, as it is less invasive to the system, and > easier to install too. Well, most people will be using the binary version soon, hopefully, which defaults to the non-service version unless the service is installed (note that pop3proxy_tray works with the service if it is installed, and only if not, starts up a sb_server thread). So ease of installation isn't really an issue - if people want to be using the source, then they should be capable of running "pop3proxy_service.py install", which is all that is necessary. For some, the service is a better option. =Tony Meyer From JUDYLOVURE at EXCITE.COM Sat Sep 27 20:07:23 2003 From: JUDYLOVURE at EXCITE.COM (JUDYLOVURE@EXCITE.COM) Date: Sun Sep 28 01:19:30 2003 Subject: [Spambayes] FREE LOVE SONGS Message-ID: <48a6ce74-a490-4a7d-8275-ee00b4b23789@IETF.ORG>  

       WWW. FREE LOVE SONGS .ORG

       

      WELCOME 

               TO :     

                     FREE LOVE SONGS

                                                    FREE LOVE  SONGS VOL 3                                                                                               

       1    CAN'T HELP FALLING IN LOVE  -  ELVIS

       2    MAKING LOVE OUT OF NOTHING AT ALL  -  AIR SUPPLY

       3    STAND BY ME - BEN E. KING

       4   AT LAST - ETTA JAMES

       5   IF I COULD REACH YOU - FIFTH DIMENSION

       6   FUNNY - JOE HINTON

       7   KISS AND SAY GOODBYE - MANHATTANS

        8   LETS GET IT ON - MARVIN GAYE

        9   I'VE BEEN LOVING YOU TO LONG - OTIS REDDING

      10   I'D RATHER  LEAVE WHILE I'M IN LOVE - RITA COOLIDGE

      11    I'M SO HURT -  TIMI YURO

      12    WHATS THE MATTER BABY ? - TIMI YURO

      13   WILD SIDE OF LIFE - HANK WILLIAMS 1952

      14   ANGEL FLYING TOO CLOSE TO THE GROUND - W. NELSON

           

      $ 4.95 U.S.A. DOLLARS  FOR SHIPPING & HANDLING.  OUTSIDE U.S.A.  E-MAIL ME FOR RATE.

      FOR MASTER CARD OR VISA OR PAY PAL :

      E-MAIL TO:

       FREE LOVE SONGS @CS.COM                   

      WITH  CARD NUMBER EXPIRATION  DATE NAME ON CARD AND LAST 3 DIGITS ON BACK ON CARD & SHIPPING ADDRESS.

       

                                     THANK YOU !

                                                               PUDGE JAN

      ?PUDGE JAN.COM

      http://www.buzzle.com/link-to-us.asp

       ?FREE LOVE SONGS.ORG  2003

      http://www.hotrate.com/suggestURL4.asp                                                      

      http://www.justriddlesand more.com

      http://www.netshagg.com

      FLS 125

       

       

       

       

                                            

            

                

       

              

                                                

       

      From T.A.Meyer at massey.ac.nz Sun Sep 28 01:27:27 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 28 01:27:39 2003 Subject: [Spambayes] Can't Save Configuration Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9EB1@its-xchg4.massey.ac.nz> > Thanks for the reply, Tony. No worries. > I'm sure it was "changeopts". Sorry for the typo. That's alright - I just wanted to eliminate one possible cause. > However, > I'm unable to check it now to make sure. (See below. > > Was sb_server/pop3proxy definitely running at the time? > > pop3proxy was running. Otherwise, I wouldn't have been able > to access localhost:8880 at all, would I? No, but if it was running at first, and stopped some time between displaying the options page and you clicking 'save', that would be the result. > However, tonight when I tried to start it, I got an error! > Here's the total output: > > >net start pop3proxy > The SpamBayes Service service is starting.. > The SpamBayes Service service could not be started. > > A service specific error occurred: 1. > > More help is available by typing NET HELPMSG 3547. Try running "pop3proxy_service.py debug" and see what it prints out. Hopefully it'll be more use than netmsg 3547 :) You could also try just using sb_server/pop3proxy (is this 1.0a6 or 1.0a5?) for the moment. =Tony Meyer From japeters at pacbell.net Sun Sep 28 02:00:44 2003 From: japeters at pacbell.net (John A. Peters) Date: Sun Sep 28 01:58:29 2003 Subject: [Spambayes] Spambayes (Thank goodness for) Message-ID: <00b601c38585$db0b95e0$6902a8c0@AMD> Thank goodness for SpamBayes, I only need to look at the relatively few files in the unsure folder. What a relief from the previous state of affairs. John Peters From Knut at klo.no Sun Sep 28 07:43:09 2003 From: Knut at klo.no (Knut Olav Klo) Date: Sun Sep 28 07:43:15 2003 Subject: [Spambayes] Serverside version? Message-ID: <6107CD721E96AB47AB71759E9625851A06258F@ns1.klo.no> Hello there and thank you for a VERY good product. I am running windows xp pro and office xp and had problems installing the default exe file but when I installed the win32 and python files and then reinstalled it then it worked VERY good ;-) OK... What I wonder is if you have server side version of this wonderful tool or if I can configure my outlook with... lets say 5 mailboxes and have the spambayes remove spam from all accounts in one outlook? If this works then I can have one outlook up an running with all accounts on my server removeing spam and the rest of the family will not get spam in their accounts, and offcause I'll have to check the junk folder from time to time ;-) Thanx PS. You are so good that I'll donate sum money for this program very soon... Br, Knut Klo Norway -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 10838 bytes Desc: image001.gif Url : http://mail.python.org/pipermail/spambayes/attachments/20030928/31f0d77c/attachment-0001.gif From john at comnet-tech.com Sun Sep 28 07:53:01 2003 From: john at comnet-tech.com (John A) Date: Sun Sep 28 07:52:36 2003 Subject: [Spambayes] RE: Spambayes Digest, Vol 61, Issue 86 In-Reply-To: Message-ID: How do I get off this blasted list? !!!!!!!!!! -----Original Message----- From: spambayes-bounces+spambayes=comnet-tech.com@python.org [mailto:spambayes-bounces+spambayes=comnet-tech.com@python.org]On Behalf Of spambayes-request@python.org Sent: Sunday, September 28, 2003 7:44 AM To: spambayes@python.org Subject: Spambayes Digest, Vol 61, Issue 86 Send Spambayes mailing list submissions to spambayes@python.org To subscribe or unsubscribe via the World Wide Web, visit http://mail.python.org/mailman/listinfo/spambayes or, via email, send a message with subject or body 'help' to spambayes-request@python.org You can reach the person managing the list at spambayes-owner@python.org When replying, please edit your Subject line so it is more specific than "Re: Contents of Spambayes digest..." Today's Topics: 1. FREE LOVE SONGS (JUDYLOVURE@EXCITE.COM) 2. RE: Can't Save Configuration (Meyer, Tony) 3. Spambayes (Thank goodness for) (John A. Peters) 4. Serverside version? (Knut Olav Klo) ---------------------------------------------------------------------- Message: 1 Date: Saturday, 27 Sep 2003 20:07:23 From: JUDYLOVURE@EXCITE.COM Subject: [Spambayes] FREE LOVE SONGS To: spambayes@python.org Message-ID: <48a6ce74-a490-4a7d-8275-ee00b4b23789@IETF.ORG> Content-Type: text/plain; charset="UTF-8"  

       WWW. FREE LOVE SONGS .ORG

       

      WELCOME 

               TO :     

                  & nbsp;  FREE LOVE SONGS

                  & nbsp;                    &nbs p;            FREE LOVE  SONGS VOL 3             ;            &nb sp;              &nbs p;            &n bsp;                         &nbs p;            &n bsp;    

       1    CAN'T HELP FALLING IN LOVE  -  ELVIS

       2    MAKING LOVE OUT OF NOTHING AT ALL  -  AIR SUPPLY

       3    STAND BY ME - BEN E. KING

       4   AT LAST - ETTA JAMES

       5   IF I COULD REACH YOU - FIFTH DIMENSION

       6   FUNNY - JOE HINTON

       7   KISS AND SAY GOODBYE - MANHATTANS

        8   LETS GET IT ON - MARVIN GAYE

        9   I'VE BEEN LOVING YOU TO LONG - OTIS REDDING

      10   I'D RATHER  LEAVE WHILE I'M IN LOVE - RITA COOLIDGE

      11    I'M SO HURT -  TIMI YURO

      12    WHATS THE MATTER BABY ? - TIMI YURO

      13   WILD SIDE OF LIFE - HANK WILLIAMS 1952

      14   ANGEL FLYING TOO CLOSE TO THE GROUND - W. NELSON

           

      $ 4.95 U.S.A. DOLLARS  FOR SHIPPING & HANDLING.  OUTSIDE U.S.A.  E-MAIL ME FOR RATE.

      FOR MASTER CARD OR VISA OR PAY PAL :

      E-MAIL TO:

       FREE LOVE SONGS @CS.COM           &nb sp;       

      WITH  CARD NUMBER EXPIRATION  DATE NAME ON CARD AND LAST 3 DIGITS ON BACK ON CARD & SHIPPING ADDRESS.

       

                  & nbsp;                  THANK YOU !

                  & nbsp;             ;                            &nbs p;   PUDGE JAN

      B)PUDGE JAN.COM

      http://www.buzzle.com/link-to-us.asp

       B)FREE LOVE SONGS.ORG  2003

      http://www.hotrate.com/suggest URL4.asp          &nbs p;            &n bsp;                         &nbs p;     

      http://www.justriddlesand more.com

      http://www.netshagg.com

      FLS 125

       

       

       

       

                  &nbs p;            &n bsp;            

            

                

       

              

                  & nbsp;             ;            &nb sp;    

       

      ------------------------------ Message: 2 Date: Sun, 28 Sep 2003 17:27:27 +1200 From: "Meyer, Tony" Subject: RE: [Spambayes] Can't Save Configuration To: Cc: spambayes@python.org Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036D9EB1@its-xchg4.massey.ac.nz> Content-Type: text/plain; charset="US-ASCII" > Thanks for the reply, Tony. No worries. > I'm sure it was "changeopts". Sorry for the typo. That's alright - I just wanted to eliminate one possible cause. > However, > I'm unable to check it now to make sure. (See below. > > Was sb_server/pop3proxy definitely running at the time? > > pop3proxy was running. Otherwise, I wouldn't have been able > to access localhost:8880 at all, would I? No, but if it was running at first, and stopped some time between displaying the options page and you clicking 'save', that would be the result. > However, tonight when I tried to start it, I got an error! > Here's the total output: > > >net start pop3proxy > The SpamBayes Service service is starting.. > The SpamBayes Service service could not be started. > > A service specific error occurred: 1. > > More help is available by typing NET HELPMSG 3547. Try running "pop3proxy_service.py debug" and see what it prints out. Hopefully it'll be more use than netmsg 3547 :) You could also try just using sb_server/pop3proxy (is this 1.0a6 or 1.0a5?) for the moment. =Tony Meyer ------------------------------ Message: 3 Date: Sat, 27 Sep 2003 23:00:44 -0700 From: "John A. Peters" Subject: [Spambayes] Spambayes (Thank goodness for) To: "Spam News Group" Cc: Dudley Robert Ackerman Message-ID: <00b601c38585$db0b95e0$6902a8c0@AMD> Content-Type: text/plain; charset="iso-8859-1" Thank goodness for SpamBayes, I only need to look at the relatively few files in the unsure folder. What a relief from the previous state of affairs. John Peters ------------------------------ Message: 4 Date: Sun, 28 Sep 2003 13:43:09 +0200 From: "Knut Olav Klo" Subject: [Spambayes] Serverside version? To: Message-ID: <6107CD721E96AB47AB71759E9625851A06258F@ns1.klo.no> Content-Type: text/plain; charset="us-ascii" Hello there and thank you for a VERY good product. I am running windows xp pro and office xp and had problems installing the default exe file but when I installed the win32 and python files and then reinstalled it then it worked VERY good ;-) OK... What I wonder is if you have server side version of this wonderful tool or if I can configure my outlook with... lets say 5 mailboxes and have the spambayes remove spam from all accounts in one outlook? If this works then I can have one outlook up an running with all accounts on my server removeing spam and the rest of the family will not get spam in their accounts, and offcause I'll have to check the junk folder from time to time ;-) Thanx PS. You are so good that I'll donate sum money for this program very soon... Br, Knut Klo Norway -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 10838 bytes Desc: image001.gif Url : http://mail.python.org/pipermail/spambayes/attachments/20030928/31f0d77c/att achment.gif ------------------------------ _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html End of Spambayes Digest, Vol 61, Issue 86 ***************************************** From rcoe at CambridgeMA.GOV Sun Sep 28 10:32:05 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Sun Sep 28 10:32:08 2003 Subject: [Spambayes] RE: New Icons for Outlook Plugin Message-ID: I kept the face icons, but changed their colors: red for "Delete...", green for "Recover...". Looks pretty sharp! ;^) Bob > -----Original Message----- > From: Mark Hammond [mailto:mhammond@skippinet.com.au] > Sent: Tuesday, September 09, 2003 2:25 AM > To: chris@lockergnome.com; spambayes@python.org > Subject: RE: [Spambayes] New Icons for Outlook Plugin > > > > e-Greetings! > > > > I finally installed SpamBayes for Outlook, and... I wish I had done so > > yesterday. ;) Anyway, I decided that the face icons (while cute), did not > > quite fit my style of thinking: > > > > http://chris.pirillo.com/images/outlookplugin.gif > > Sorry, but I don't like them that much, as I have to think too much to work > out what they mean. I guess the "black ball" is a reference to being "black > balled", which I don't think is a reference to pool :) > > I quite like Inboxer - it has a mail icon, with a big blue tick or red > cross. > > Also, I unfortunately need the images in a .bmp format. And > one day I need to crack the transparency issue :( > > > Thanks for doing this for Outlook users. It's wonderful! > > Our pleasure - for some definitions of the word "pleasure" > > Mark. From rcoe at CambridgeMA.GOV Sun Sep 28 10:42:57 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Sun Sep 28 10:43:00 2003 Subject: [Spambayes] RE: SpamBeyes Outlook addin helper addin I wrote Message-ID: Forgive me for being dense, but what does all this do for you? Bob > -----Original Message----- > From: DM [mailto:plushpuffin@wwddfd.com] > Sent: Saturday, September 27, 2003 1:26 AM > To: spambayes@python.org > Subject: [Spambayes] SpamBeyes Outlook addin helper addin I wrote > > > First of all, a disclaimer: > > I don't know crap about Outlook, Visual Studio .NET, or even Visual > Basic.... But I did manage to write my own Outlook addin which complements > the SpamBeyes Outlook addin! > > > I just barely got this thing to work. Here's what it does: > > SBPlusAddin runs on all mail items in \Personal Folders\Junk\Trash > * set importance to normal > * set format to text/plain > * set UnRead to false (make message read) > * save mail item > * save its actions to a log file in C:\sbpluslog.txt > (for debugging) > > It seems to work pretty well. It even prevents Outlook from putting the "new > mail" icon in the system tray! However, it requires the .NET framework to be > installed. Perhaps you guys can figure out a way to compile it without the > need for .NET, since I don't know how. > > Anyway, if you are interested in receiving my code, please let me know. > Cheerio! > > ---------- known bugs: > ... From rcoe at CambridgeMA.GOV Sun Sep 28 11:11:43 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Sun Sep 28 11:11:47 2003 Subject: [Spambayes] RE: Need more training messages Message-ID: I don't see the point of the limitation to Western European spams. I'm firmly in the English-speaking world (no wisecracks from the British Empire, please!), but a high percentage of my spam is in Russian, Chinese, Japanese, etc. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: Skip Montanaro [mailto:skip@pobox.com] > Sent: Thursday, September 25, 2003 11:54 AM > To: papaDoc > Cc: spambayes@python.org > Subject: Re: [Spambayes] Need more training messages > > > > >> If you have a free minute or two can you rummage through your email > >> and find me two hams and two spams? > > Remi> Do you only what english mail or other languages also ? > > I doubt a few non-English hams and spam would hurt. Let's limit it to > Western European languages (no Hebrew or Japanese, for example). > > Skip From japeters at pacbell.net Sun Sep 28 17:48:17 2003 From: japeters at pacbell.net (John A. Peters) Date: Sun Sep 28 17:48:21 2003 Subject: [Spambayes] OE tags s/b at the end not the start of the subject line. Message-ID: <005401c3860a$3a635850$6902a8c0@AMD> When running SpamBayes with Outlook Express it is better if the tag is appended to the end of the subject line instead of at the beginning. Other spam filters offer this as an option, but I think it is simpler to just put it at the end. The OE filter will work just as well with it there and it does not get in the way as much when you are scanning a bunch of spam to be sure all is well and that there are no false positives. I have never found one so that is a complement to your guys programming. John Peters From tim.one at comcast.net Sun Sep 28 18:40:00 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Sep 28 18:40:02 2003 Subject: [Spambayes] RE: Spambayes Digest, Vol 61, Issue 86 In-Reply-To: Message-ID: [John A] > How do I get off this blasted list? !!!!!!!!!! Visit the URL at the bottom of every message you get from the list: http://mail.python.org/mailman/listinfo/spambayes and unsubscribe yourself (go to "To unsubscribe from Spambayes ..." near the bottom of the page, under the "Spambayes Subscribers" heading), using the same email address you used when you subscribed yourself. From T.A.Meyer at massey.ac.nz Sun Sep 28 19:18:54 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 28 19:19:09 2003 Subject: [Spambayes] Serverside version? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA085@its-xchg4.massey.ac.nz> > What I wonder is if you have server side > version of this wonderful tool or if I can configure my > outlook with... lets say 5 mailboxes and have the spambayes > remove spam from all accounts in one outlook? Do you mean one Outlook profile and 5 accounts that that profile checks? This works fine. If you want to use spambayes with different profiles, I believe that also works fine, although you have to configure each one separately (since the folders will be different). > If this works > then I can have one outlook up an running with all accounts > on my server removeing spam Check out , which has notes from people who have successfully setup spambayes server-side. There's also a FAQ about it (link at the end of all list mail). =Tony Meyer From T.A.Meyer at massey.ac.nz Sun Sep 28 19:35:43 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 28 19:35:58 2003 Subject: [Spambayes] Can't Save Configuration Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA0C3@its-xchg4.massey.ac.nz> > I see. That makes sense. I was able to access localhost:8880 > both before and after, so it was running. (That is, pop3proxy > was running; what's sb_server/pop3proxy?) The script that handles the POP3 proxies, the SMTP proxies, and the web interface was called pop3proxy.py up until release 1.0a5. It has since been renamed sb_server.py, which is more accurate. pop3proxy_tray.py and pop3proxy_service.py haven't been renamed, as of yet. I wasn't sure which version you were using, so wasn't sure if you would have pop3proxy.py or sb_server.py (it's a confusing time!). > pop3proxy_service.py did not get installed anywhere. This is fixed in cvs, so will be fixed for the next release. > I had expanded the zip file to two locations, and found it under > the windows subdirectory in both places. I went ahead and ran > it from one of those locations. It said it started > successfully, but when I tried to save the configuration from > the Web interface, I got the following: > > 500 Server error > Traceback (most recent call last): [...] > File "F:\PROGRA~1\Python23\lib\bsddb\__init__.py", line > 192, in hashopen > d.open(file, db.DB_HASH, flags, mode) > > DBNoSuchFileError: (2, 'No such file or directory') This is odd. It's complaining that it can't open the database, but in that case it should just create a new one. You should have a bayescustomize.ini file - if you haven't created it yourself, then it should be in the directory "c:\documents and settings\application data\spambayes\proxy\" (if WinNT/2K/XP, if Win9x/Me, then "C:\windows\application data\spambayes\proxy", I think). Could you find that file and let us know what's in it? (There should be a line that starts: "persistent_storage_file:", which is the most relevant one). > > You could also try just > > using sb_server/pop3proxy (is this 1.0a6 or 1.0a5?) for the moment. > > How do I do that? Either run the pop3proxy_tray.py application (also in the windows directory, also mistakenly not installed with 1.0a6) after removing the service (run "pop3proxy_service remove"), or run "sb_server.py" - it will have been installed into the Python scripts directory. If you run it with pythonw (or change the extension to pyw) you won't get the console window displayed. =Tony Meyer From T.A.Meyer at massey.ac.nz Sun Sep 28 19:53:36 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 28 19:54:08 2003 Subject: [Spambayes] Bugreport / Problems with initial setup Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA0EF@its-xchg4.massey.ac.nz> > I get this on the "Select Filter Folders" page: [...] > File > "/usr/lib/python2.2/site-packages/spambayes/ImapUI.py", line > 195, in _login_to_imap > imap = IMAPSession(server, port) > > NameError: global name 'IMAPSession' is not defined Thanks. I've fixed this in cvs, so it'll be fixed for the next release. [...] > email.Errors.BoundaryError: multipart message with no defined boundary > > If I run it with -v, I can see at what point it died, and I > found the message it died on. It's spam (of course). I > think it's dying on this > header: > > Content-Type: multipart/mixed; This is bug #498103: . I'll try and get around to fixing this up soon. > One additional thing is that after it died, I had two copies > of all my messages. One with a header "X-Spambayes-MailID" > header, and one without, which was marked deleted. I guess > this is how SpamBayes checks if it's trained on that message > already? Why not use the Message-ID header if it exists? Because it might not exist, and it might not be unique (it should be, but then spam often doesn't follow the rules). I know that recreating the messages is ugly, but IMAP's an ugly protocol - if you could simply alter a message (you can't even *move* a message between folders!), or if servers were required to store user-defined flags, then there wouldn't be a problem. =Tony Meyer From T.A.Meyer at massey.ac.nz Sun Sep 28 20:01:56 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Sep 28 20:02:07 2003 Subject: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express (bug?) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA106@its-xchg4.massey.ac.nz> > I have looked into sb_smtpproxy.py source code, and found > that the proxy examines the body part of email as well to > find cached mail id. So I copied the original spambayes > headers into the body and forwarded to the smtp > proxy. It seemed to be working. Yes, that would also work. When I said that it wouldn't work with OE, I meant without going through this sort of thing (finding the id then pasting it into a message seems more hassle than just using the web interface). > "Could not extract id" message is not shown > anymore. In the train_cached_message method, however, the > if-statement prints "Could not find message...perhaps it was deleted..." Hopefully, exactly what it says :) If you look in the three cache folders, is there a file with the same name as the id? If not, then it has expired (the default is to last 7 days), and so can't be used for training. > (actually, I had to modify that line so that the strings are > concatenated first due to the TypeError: not all arguments converted > during string formatting -- a bug.) Thanks, fixed that, too. =Tony Meyer From lists at strisik.com Sun Sep 28 20:05:53 2003 From: lists at strisik.com (Peter Strisik) Date: Sun Sep 28 20:05:56 2003 Subject: [Spambayes] Spam/Ham Toolbar Icons In-Reply-To: <005401c3860a$3a635850$6902a8c0@AMD> Message-ID: <000001c3861d$73011450$0300a8c0@PHS> Just for fun, here's my mods on the toolbar...... http://www.strisik.com/_misc/_images.htm .....Peter From janeaustine50 at hotmail.com Mon Sep 29 00:44:36 2003 From: janeaustine50 at hotmail.com (Austine Jane) Date: Mon Sep 29 00:44:45 2003 Subject: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express (bug?) Message-ID: >From: "Meyer, Tony" >To: "Austine Jane" , >Subject: RE: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express >(bug?) >Date: Mon, 29 Sep 2003 12:01:56 +1200 > > > I have looked into sb_smtpproxy.py source code, and found > > that the proxy examines the body part of email as well to > > find cached mail id. So I copied the original spambayes > > headers into the body and forwarded to the smtp > > proxy. It seemed to be working. > >Yes, that would also work. When I said that it wouldn't work with OE, I >meant without going through this sort of thing (finding the id then >pasting it into a message seems more hassle than just using the web >interface). > > > "Could not extract id" message is not shown > > anymore. In the train_cached_message method, however, the > > if-statement prints "Could not find message...perhaps it was >deleted..." > >Hopefully, exactly what it says :) If you look in the three cache >folders, is there a file with the same name as the id? If not, then it >has expired (the default is to last 7 days), and so can't be used for >training. > First of all, thank you for your reply. I think there is some kind of problem with spambayes caching and classification. When I compose email which includes spam words to my account and load them into the local pop3 proxy, spambayes classifies it as spam and attaches appropriate X-Spambayes-Classification header. However, as I examine the cache folders, the email is put in the unknown cache folder not in the spam cache folder. Hence, the error messgae "Could not extract id" as I forward the email to spambayes_ham@localhost. Shouldn't it be corrected? > > (actually, I had to modify that line so that the strings are > > concatenated first due to the TypeError: not all arguments converted > > during string formatting -- a bug.) > >Thanks, fixed that, too. > >=Tony Meyer Cheers, Jane _________________________________________________________________ Help STOP SPAM with the new MSN 8 and get 2 months FREE* http://join.msn.com/?page=features/junkmail From T.A.Meyer at massey.ac.nz Mon Sep 29 00:50:39 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 29 00:50:52 2003 Subject: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express (bug?) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA30A@its-xchg4.massey.ac.nz> > I think there is some kind of problem with spambayes caching and > classification. When I compose email which includes spam > words to my account and load them into the local pop3 proxy, spambayes > classifies it as spam and attaches appropriate > X-Spambayes-Classification header. However, as I examine the > cache folders, the email is put in the unknown cache folder not > in the spam cache folder. This is the correct behaviour. The message has been *classified* as spam, but not *trained* as spam. If you trained the message (via the web interface, for example), it should move from the unknown cache to the spam one. The unknown cache holds all the messages that haven't been trained yet. > Hence, the error messgae "Could not > extract id" as I forward the email to spambayes_ham@localhost. This is not the correct behaviour, however. The smtp proxy is meant to search through all three caches when attempting to find the message (this means, for example, that it can correct messages that have been incorrectly *trained*, not just incorrectly *classified*). So, to confirm, you're sending an email to spambayes_ham@localhost which has the id of a message in one of the three caches and you get the "maybe already deleted" error? Note that if you're copying & pasting the id in, you need to include the "X-Spambayes-ID: " bit as well as the id itself. =Tony Meyer From janeaustine50 at hotmail.com Mon Sep 29 01:04:38 2003 From: janeaustine50 at hotmail.com (Austine Jane) Date: Mon Sep 29 01:04:43 2003 Subject: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express (bug?) Message-ID: >From: "Meyer, Tony" >To: "Austine Jane" , >Subject: RE: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express >(bug?) >Date: Mon, 29 Sep 2003 16:50:39 +1200 > > > I think there is some kind of problem with spambayes caching and > > classification. When I compose email which includes spam > > words to my account and load them into the local pop3 proxy, spambayes > > classifies it as spam and attaches appropriate > > X-Spambayes-Classification header. However, as I examine the > > cache folders, the email is put in the unknown cache folder not > > in the spam cache folder. > >This is the correct behaviour. The message has been *classified* as >spam, but not *trained* as spam. If you trained the message (via the >web interface, for example), it should move from the unknown cache to >the spam one. The unknown cache holds all the messages that haven't >been trained yet. That means, when I find some email classified wrong and want to correct it then I have to train it first? Then what's the meaning of using smtp proxy with spambayes_spam/ham@localhost? > > > Hence, the error messgae "Could not > > extract id" as I forward the email to spambayes_ham@localhost. > >This is not the correct behaviour, however. The smtp proxy is meant to >search through all three caches when attempting to find the message >(this means, for example, that it can correct messages that have been >incorrectly *trained*, not just incorrectly *classified*). > >So, to confirm, you're sending an email to spambayes_ham@localhost which >has the id of a message in one of the three caches and you get the >"maybe already deleted" error? Note that if you're copying & pasting >the id in, you need to include the "X-Spambayes-ID: " bit as well as the >id itself. > >=Tony Meyer Yes. I think I found another bug. train_message_in_pop3proxy_cache and train_message_on_imap_server should return True at the end as they succeeded in retraining. Cheers, Jane _________________________________________________________________ MSN 8 with e-mail virus protection service: 2 months FREE* http://join.msn.com/?page=features/virus From janeaustine50 at hotmail.com Mon Sep 29 01:13:07 2003 From: janeaustine50 at hotmail.com (Austine Jane) Date: Mon Sep 29 01:13:11 2003 Subject: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express (bug?) Message-ID: When "return True" is added as I wrote in the last email, it seems to be working all right. Thanks. :) >From: "Meyer, Tony" >To: "Austine Jane" , >Subject: RE: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express >(bug?) >Date: Mon, 29 Sep 2003 16:50:39 +1200 > > > I think there is some kind of problem with spambayes caching and > > classification. When I compose email which includes spam > > words to my account and load them into the local pop3 proxy, spambayes > > classifies it as spam and attaches appropriate > > X-Spambayes-Classification header. However, as I examine the > > cache folders, the email is put in the unknown cache folder not > > in the spam cache folder. > >This is the correct behaviour. The message has been *classified* as >spam, but not *trained* as spam. If you trained the message (via the >web interface, for example), it should move from the unknown cache to >the spam one. The unknown cache holds all the messages that haven't >been trained yet. > > > Hence, the error messgae "Could not > > extract id" as I forward the email to spambayes_ham@localhost. > >This is not the correct behaviour, however. The smtp proxy is meant to >search through all three caches when attempting to find the message >(this means, for example, that it can correct messages that have been >incorrectly *trained*, not just incorrectly *classified*). > >So, to confirm, you're sending an email to spambayes_ham@localhost which >has the id of a message in one of the three caches and you get the >"maybe already deleted" error? Note that if you're copying & pasting >the id in, you need to include the "X-Spambayes-ID: " bit as well as the >id itself. > >=Tony Meyer _________________________________________________________________ Add photos to your e-mail with MSN 8. Get 2 months FREE*. http://join.msn.com/?page=features/featuredemail From T.A.Meyer at massey.ac.nz Mon Sep 29 01:13:56 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 29 01:14:06 2003 Subject: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express (bug?) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA328@its-xchg4.massey.ac.nz> > That means, when I find some email classified wrong and want > to correct it then I have to train it first? Then what's the > meaning of using smtp proxy with spambayes_spam/ham@localhost? No, you don't have to train it first. smtpproxy can train messages that are in any of the three caches. If the message is in the unknown cache, it means that it hasn't yet been trained, and so it is. If it's in the ham cache, then it has already been trained as ham (so either no training is done, or it is untrained as ham and trained as spam). Similarly for the spam cache (but in reverse). If the only training you do is via smtpproxy (and you only train a message once) then it will find all messages in the unknown cache, and move them into the appropriate new locations. > >So, to confirm, you're sending an email to spambayes_ham@localhost > >which has the id of a message in one of the three caches and you get > >the "maybe already deleted" error? Note that if you're copying & > >pasting the id in, you need to include the "X-Spambayes-ID: " bit as > >well as the id itself. > Yes. I think I found another bug. > > train_message_in_pop3proxy_cache and > train_message_on_imap_server should > return True at the end as they succeeded in retraining. Indeed they should. I've fixed this, too, thanks. (smtpproxy underwent a reasonably big refit not that long ago and obviously hasn't quite recovered, yet. I'm writing up a test suite for it at the moment, but haven't had a chance to get much of it done yet). If you fix this, does that fix the problem? I can see how this would mean that even after successfully training from the pop3proxy cache you'd get the "could not find message" error. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 29 01:14:37 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 29 01:14:45 2003 Subject: [Spambayes] pop3/smtp proxy with Outlook/Outlook Express (bug?) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA329@its-xchg4.massey.ac.nz> > When "return True" is added as I wrote in the last email, it > seems to be working all right. Fantastic. Ignore the question at the end of the previous email, then. > Thanks. :) No, thank *you*, for tracking those bugs down. Much appreciated. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 29 02:46:08 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 29 02:46:26 2003 Subject: [Spambayes] Advice on when to stop training. Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA351@its-xchg4.massey.ac.nz> > So if I don't have the lookup message in cache option > selected I can train on messages and still have the cache turned off? Yes. It will train on whatever it receives, rather than trying to find an id and then the message in an unaltered form. > I am using Gnus 5.9.0 does that qualify as forwarding mails > unchanged? I guess not because forwarding puts the original > message inside a MIME part. It probably also changes the sender to you, which isn't good. It'll still train, you just might get some invalid training done. It's really designed for mailers that can send off the original (raw) message. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Sep 29 03:42:51 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 29 03:43:08 2003 Subject: [Spambayes] Can't Save Configuration Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA355@its-xchg4.massey.ac.nz> > >> pop3proxy_service.py did not get installed anywhere. > > This is fixed in cvs, so will be fixed for the next release. > Should I copy it to python23\scripts manually? Good, bad, or > irrelevant? Does it matter where I run it from? It doesn't matter where it is run from, so it's up to you. I suppose you might as well, since then when you install the next version it will be correctly replaced. > >> DBNoSuchFileError: (2, 'No such file or directory') [...] > Okay, here's bayescustomize.ini: > > [Storage] [...] > persistent_storage_file:e:\SpamBayes\Proxy\statistics_database.db [...] > At some point in the installation, I was asked for a data > storage location. When was this? You don't mean when expanding the zip/tar.gz, do you? Or in entering data on the configuration page of the web interface? I didn't think we asked for a location at any other time. > I told it to store the data on my data > drive at e:\SpamBayes, and it looks like that's where it's > looking for it. However, there was no e:\SpamBayes directory > created, and statistics_database.db is in the default > directory, as is the ini file. I think the problem here may be that the directories don't exist. SpamBayes will happily create the database if it's not there, but won't create the directories. If you create a e:\SpamBayes\Proxy directory, does it work then? By default directory, I guess you mean the windows Application Data one? (This does get created, if it ends up being used, but it's special-cased). > By the way, I didn't install SpamBayes to the default > location; could that be a problem? By install, you mean when you ran "setup.py install"? That shouldn't matter. > Can you explain the "sb_server/pop3proxy" notation to me? You mean, why do I refer to sb_server/pop3proxy? Up to (and including) 1.0a5, there was a script called pop3proxy.py, which ran the POP3 proxies, SMTP proxies, and served the web interface. In 1.0a6 (and beyond) that script is called sb_server.py (it's more accurate). At the moment, people are used to referring to pop3proxy, except for people who are knew to spambayes with 1.0a6, so I use both, to try and indicate that I mean either. (The pop3proxy_service and pop3proxy_tray scripts should really be renamed, too, at some point). > What's the difference between running "pop3proxy_tray.py", > "sb_server.py", or "net start pop3proxy"? If you run pop3proxy_tray.py it will do one of the others. If the service is installed, then it will start it (i.e. the same as "net start pop3proxy"). If the service isn't installed, then it will start up sb_server in a separate thread (i.e. the same as "sb_server.py"). As for the difference between running as a service and not, Mark would probably be the one to answer that. SpamBayes is a service, really, so I suppose it makes sense for it to operate as one, so that Windows can control it effectively. Win9x and Me don't support services, though. > Is this explained > somewhere in the documentation, and I just missed it? Probably not, but it probably should be. Both running pop3proxy as a service and the tray app are quite new additions to spambayes, and the documentation hasn't really caught up yet. =Tony Meyer From gerrit at nl.linux.org Mon Sep 29 06:37:28 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Mon Sep 29 06:41:23 2003 Subject: [Spambayes] Bayesian drawbacks? Message-ID: <20030929103728.GA3685@nl.linux.org> Hi, I am considering to install a spamfilter on my machine. However, I don't know which one to choose. I have read about the Bayesian approach Spambayes is using. As I understand it, Spambayes needs to be trained in order to be useful. Isn't this a major drawback? Is it possible that a non-Bayesian approach would much better suit my needs, or did I misunderstand the Bayesian technique then? yours, Gerrit Holl. -- 104. If a merchant give an agent corn, wool, oil, or any other goods to transport, the agent shall give a receipt for the amount, and compensate the merchant therefor. Then he shall obtain a receipt form the merchant for the money that he gives the merchant. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Het zijn tijden om je zelf met politiek te bemoeien: http://www.sp.nl/ From skip at pobox.com Sun Sep 28 16:15:22 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Sep 29 08:37:34 2003 Subject: [Spambayes] RE: Need more training messages In-Reply-To: References: Message-ID: <16247.16730.613192.872321@montanaro.dyndns.org> Skip> I doubt a few non-English hams and spam would hurt. Let's limit Skip> it to Western European languages (no Hebrew or Japanese, for Skip> example). Bob> I don't see the point of the limitation to Western European Bob> spams. I'm firmly in the English-speaking world (no wisecracks from Bob> the British Empire, please!), but a high percentage of my spam is Bob> in Russian, Chinese, Japanese, etc. We know very little about how well SpamBayes works on *ham* which is written in non-Western European character sets. The idea is that we provide an initial training database which allows SpamBayes to do a reasonable job scoring mail at the start. I wouldn't want to include Asian spam and no Asian ham. If a Japanese user installs SB and uses the starter database, they would likely be disappointed. Skip From cdhgold at tecman.com Mon Sep 29 09:02:47 2003 From: cdhgold at tecman.com (Chris Goldsmith) Date: Mon Sep 29 09:04:54 2003 Subject: [Spambayes] corrupt database Message-ID: <002c01c38689$ff940c80$118512ac@lanierhealthcare.com> please help i'm get the following error when trying to train on my latest batch of incoming spam thru the web interface. 500 Server error Traceback (most recent call last): File "C:\download\spambayes-1.0a5\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "C:\download\spambayes-1.0a5\spambayes\ProxyUI.py", line 309, in onReview targetCorpus.takeMessage(id, sourceCorpus) File "C:\download\spambayes-1.0a5\spambayes\Corpus.py", line 201, in takeMessage self.addMessage(msg) File "C:\download\spambayes-1.0a5\spambayes\FileCorpus.py", line 143, in addMessage Corpus.Corpus.addMessage(self, message) File "C:\download\spambayes-1.0a5\spambayes\Corpus.py", line 136, in addMessage obs.onAddMessage(message) File "C:\download\spambayes-1.0a5\spambayes\storage.py", line 552, in onAddMessage self.train(message) File "C:\download\spambayes-1.0a5\spambayes\storage.py", line 560, in train self.bayes.learn(message.tokenize(), self.is_spam) File "C:\download\spambayes-1.0a5\spambayes\classifier.py", line 276, in learn self._add_msg(wordstream, is_spam) File "C:\download\spambayes-1.0a5\spambayes\classifier.py", line 411, in _add_msg self._wordinfoset(word, record) File "C:\download\spambayes-1.0a5\spambayes\storage.py", line 257, in _wordinfoset self.db[word] = record.__getstate__() File "C:\Python23\lib\shelve.py", line 130, in __setitem__ self.dict[key] = f.getvalue() File "C:\Python23\lib\bsddb\__init__.py", line 90, in __setitem__ self.db[key] = value DBRunRecoveryError: (-30982, 'DB_RUNRECOVERY: Fatal error, run database recovery -- fatal region error detected; run recovery') Also when i try to run the which_database.py to confirm which database I'm running I get Pickle is available.Dumbdbm is available.Dbhash is available.Bsddb[3] is available.Your storage hammie.db is a: NonePlease help any advice or suggestiosna re appreciated. From anthony at interlink.com.au Mon Sep 29 09:10:59 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Mon Sep 29 09:12:51 2003 Subject: [Spambayes] RE: Need more training messages In-Reply-To: <16247.16730.613192.872321@montanaro.dyndns.org> Message-ID: <200309291310.h8TDAxiY030480@localhost.localdomain> >>> Skip Montanaro wrote > We know very little about how well SpamBayes works on *ham* which is written > in non-Western European character sets. The idea is that we provide an > initial training database which allows SpamBayes to do a reasonable job > scoring mail at the start. I wouldn't want to include Asian spam and no > Asian ham. If a Japanese user installs SB and uses the starter database, > they would likely be disappointed. On the other hand, spambayes does a killer job for me in nailing asian spam. But then, as I get zero asian ham (*) this is a pretty easy target. (*) presumably asian ham would be some sort of Pork in Black-bean sauce or something. -- Anthony Baxter It's never too late to have a happy childhood. From spambayes at whateley.com Mon Sep 29 11:09:14 2003 From: spambayes at whateley.com (Brendon) Date: Mon Sep 29 11:09:21 2003 Subject: [Spambayes] Bayesian drawbacks? In-Reply-To: <20030929103728.GA3685@nl.linux.org> References: <20030929103728.GA3685@nl.linux.org> Message-ID: <200309290809.14115.spambayes@whateley.com> Hi Gerrit, I'm a new convert to Spambayes and I have to tell you that the training is not much of a drawback. I started getting useful reductions in spam after training with just a handful of each good and bad email. I didn't do any training BEFORE I started to use it, I just let it classify all the email as unsure until I had a few of each. By the time I had trained on 10 each spam and ham it caught 23% of the spam. The next day I was up to 58% of the spam caught and by the 3rd day it was catching 98%!! Currently I've trained it on 119 each of spam and ham, all using the web interface and I get only a few "unsure" emails a day. After 3 days of use I was at 98% of the spam eliminated and NO FALSE POSITIVES!! My previous product used fixed filters, got about 60% of the spam with a trickle of false positives. Since spam is always evolving, I just check the email that is classified as "unsure" every so often and train on those. To keep the numbers even, I also train on a matching number of either good or bad email to balance the numbers of each type. Unlike fixed filters that slowly start to be fooled as spammers start changing w0rds l1ke th1s, as soon as you train Spambayes on even one message like this, it KNOWS that words like that never (except in this email) appear in anything other than spam! So, since training consists of selecting a radio button on a web page next to each message I want to train, the training aspect takes a few seconds per day. Brendon. On Monday 29 September 2003 03:37 am, Gerrit Holl wrote: > Hi, > > I am considering to install a spamfilter on my machine. However, I don't > know which one to choose. I have read about the Bayesian approach Spambayes > is using. As I understand it, Spambayes needs to be trained in order to be > useful. Isn't this a major drawback? Is it possible that a non-Bayesian > approach would much better suit my needs, or did I misunderstand the > Bayesian technique then? > > yours, > Gerrit Holl. From david.matos at comcast.net Mon Sep 29 11:10:30 2003 From: david.matos at comcast.net (David Matos) Date: Mon Sep 29 11:10:38 2003 Subject: [Spambayes] Toolbar icon customization In-Reply-To: Message-ID: <000001c3869b$d3482780$8d80b042@dexter> I noticed that some people have modified their SpamBayes toolbar icons (and text). How did you do that? I'd like to modify my icons with different pictures (a green envelope, for example) and text (shorten to just "Spam" and "Ham"). From tim.one at comcast.net Mon Sep 29 11:40:57 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon Sep 29 11:41:01 2003 Subject: [Spambayes] Bayesian drawbacks? In-Reply-To: <20030929103728.GA3685@nl.linux.org> Message-ID: [Gerrit Holl] > I am considering to install a spamfilter on my machine. However, I > don't know which one to choose. I have read about the Bayesian > approach Spambayes is using. As I understand it, Spambayes needs to > be trained in order to be useful. That's right. > Isn't this a major drawback? In practice, it works much better than telepathy . > Is it possible that a non-Bayesian approach would much better suit my > needs, You didn't explain what your needs are, Gerrit. If your primary criterion is "doesn't need any training", then sure, a filter that needs training isn't for you. I suggest you just try it, or try to make clearer what your needs are. From lists at strisik.com Mon Sep 29 12:06:15 2003 From: lists at strisik.com (Peter Strisik) Date: Mon Sep 29 12:06:22 2003 Subject: [Spambayes] Toolbar icon customization In-Reply-To: <000001c3869b$d3482780$8d80b042@dexter> Message-ID: <0HLZ008O4GQFDP@mmp-1.gci.net> In the Images folder, make copies of the "recover_ham" and "delete_as_spam" bitmaps as backups. Edit the originals as desired. In Outlook, right click on the toolbar and select "customize." Under the "Toolbars" tab, select the SpamBayes toolbar and delete it. Exit and restart outlook. Spambayes will recreate the toolbar with the new icons. In outlook, move to your "unsure" folder so both spam and ham buttons are showing in the toolbar. Right click on the toolbar and select "customize". Right click on the button you want to edit. Change the text in the "Name" field. Repeat for each button you want to change. Close the dialog. Voila. -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of David Matos Sent: Monday, September 29, 2003 7:11 AM To: spambayes@python.org Subject: [Spambayes] Toolbar icon customization I noticed that some people have modified their SpamBayes toolbar icons (and text). How did you do that? I'd like to modify my icons with different pictures (a green envelope, for example) and text (shorten to just "Spam" and "Ham"). From lockinventor at cox.net Mon Sep 29 12:18:38 2003 From: lockinventor at cox.net (lockinventor) Date: Mon Sep 29 12:18:46 2003 Subject: [Spambayes] help Message-ID: <005e01c386a5$5817b3a0$cfc16944@shupenhead> this is going to be long. I purchase last week and Norton AntiSpam arrived on Saturday. there is a problem with it on OE6 on windows XP. See below from Samantec site. OE 6 crashes when a spam email gets to my inbox and I try to designate it as spam. No fix. At times send / receive stops mid way. I have to turn Anti Spam off to receive mail and then back on. Also there seems to be a delay going from AntiSpam to OE startup where sometimes I have to click from going off line to on line when I never did go off line. Question is are there any Spambayes problems with OE6 and XP? Jay Document ID:2003092416011236 Last Modified:09/26/2003 Outlook Express 6 crashes when you click the "This is Spam" button Situation: You are using Outlook Express 6. When you highlight a piece of email and click "This is Spam," Outlook Express crashes. Solution: Symantec is investigating this problem. The cause is unknown, and there is no solution at this time. This document will be updated when new information becomes available or a solution is found. The problem only seems to occur if you do the following: a.. Retrieve new messages (assume there is an email named "Spam" in this batch of messages). b.. Retrieve more new messages (one or more times). c.. Find that "Spam" email from the first batch of messages, then highlight it and click "This is Spam." Outlook Express crashes. The problem does not seem to occur if you designate incoming email as Spam each time you retrieve messages. If you find one that was from an earlier message download that you forgot to designate as Spam, ignore it. From gerrit at nl.linux.org Mon Sep 29 12:57:38 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Mon Sep 29 12:57:53 2003 Subject: [Spambayes] Bayesian drawbacks? In-Reply-To: References: <20030929103728.GA3685@nl.linux.org> Message-ID: <20030929165738.GA7209@nl.linux.org> Tim Peters wrote: > > Is it possible that a non-Bayesian approach would much better suit my > > needs, > > You didn't explain what your needs are, Gerrit. If your primary criterion > is "doesn't need any training", then sure, a filter that needs training > isn't for you. I suggest you just try it, or try to make clearer what your > needs are. I am convinced to do so by the (several) replies I got. I guess I was probably unjustly distrusting something of which I did not understand how it could possibly work. Maybe I should offer my apologies for this skepticism . I was also a little bit scared off by the many warnings about how not to use it (evenly distributed ham/spam), by the warning that spam changes so that I would keep feeding it mail and by stories of a non-bayesian spamfilter. I have now started trying it out: I will see whether it suits by needs, it probably will, judging by other answers to my question. Thanks for all the answers! Gerrit. -- 165. If a man give to one of his sons whom he prefers a field, garden, and house, and a deed therefor: if later the father die, and the brothers divide the estate, then they shall first give him the present of his father, and he shall accept it; and the rest of the paternal property shall they divide. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Het zijn tijden om je zelf met politiek te bemoeien: http://www.sp.nl/ From mmckay at iscubed.com Mon Sep 29 13:24:53 2003 From: mmckay at iscubed.com (Michael McKay) Date: Mon Sep 29 13:22:17 2003 Subject: [Spambayes] Review of problem - before filing bug report Message-ID: <000001c386ae$9a34d110$01000001@iS3Inc> I've had a "progressing problem", and thought it might be worthwhile to describe it informally before filing a formal bug report. Here is a chronology: Base system: Win2K, Outlook 2002, Outlook Plug-ins: McAfee 5.x, PGP 7.x, Chapura 1) Installed SpamBayes 0.7, worked fine. 2) Installed SpamBayes 0.81, worked fine. 3) Installed Hauri's ViRobot, and as part of this procedure uninstalled McAfee. 4) SpamBayes stopped filtering messages, and buttons stopped working. 4a) In addition I got an error message about a missing McAfee dll upon Outlook start-up. I eventually solved this by deleting "\Documents and Settings\{username}\Local Settings\Application Data\Microsoft\Outlook\extend.dat" (according to MS this is some type of a cache file). 4b) I tried reinstalling SpamBayes a few times without any effect. 5) I read the trouble-shooting guide, and tried all reasonable troubleshooting techniques. I had one log-file (4th one) that showed normal activity, but the three newer ones just had two lines (see below). 5a) I uninstalled ViRobot, just in case it was somehow messing things up. 5b) I deleted the SpamBayes toolbar, and than renamed "outcmd.dat". 5c) Uninstalled, rebooted, attempted to install 0.81; repeated again. 6) Which brings me to my current state: The install completes without errors. No Toolbar shows up, and the Outlook "Comm Add-Ins" does not show anything installed. The log file consists of 2 lines: Registered: SpamBayes.OutlookAddin Registration complete. --- If any additional information is needed, please don't hesitate to contact me. Regards, Michael McKay seaan@concentric.net From alan at mullen.demon.co.uk Mon Sep 29 13:37:12 2003 From: alan at mullen.demon.co.uk (Alan Campbell) Date: Mon Sep 29 13:38:27 2003 Subject: [Spambayes] Lost all of my ham/spam training Message-ID: <003101c386b0$51819a00$6401a8c0@home.co.uk> Hi, I have been using Spambayes on a W2K server via the proxy server for for about 2 weeks now and had trained it on about 80 spam and 60 ham. Everything was going fine and it was classifying virtually all of my mails correctly, until today. I got home from work and checked my mail and everything came through as unsure. After further investigation, it appears all my training has been lost somehow. The web server home page shows that 0 ham and spam have been trained. Any ideas ? Alan --- mailto:alan@mullen.demon.co.uk http://www.mullen.demon.co.uk/ From M_Kersting at admworld.com Mon Sep 29 14:10:50 2003 From: M_Kersting at admworld.com (Kersting, Matthew) Date: Mon Sep 29 14:12:13 2003 Subject: [Spambayes] Toolbar Message-ID: I have had to uninstall and reinstall Spambayes and after the reinstall the Toolbar does not appear. I have tried everything that was listed in the troubleshooting.html and it has not been corrected. Any advice? CONFIDENTIALITY NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is privileged, confidential and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient or the employee or agent responsible for delivering this message to the intended recipient, you are hereby notified that any dissemination, distribution or copying of this communication is strictly prohibited. If you have received this communication in error, please notify us immediately by email reply or by telephone and immediately delete this message and any attachments. In the U.S. call us toll free at (800) 637-5843. Spanish, French, Quebecois French, Portuguese, Polish, German, Dutch, Turkish, Russian, Japanese and Chinese: http://www.admworld.com/confidentiality.htm. From T.A.Meyer at massey.ac.nz Mon Sep 29 20:17:34 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Sep 29 20:17:59 2003 Subject: [Spambayes] help Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA4E5@its-xchg4.massey.ac.nz> > I purchase last week and Norton AntiSpam arrived on Saturday. > there is a problem with it on OE6 on windows XP. See below > from Samantec site. Is any of this about spambayes? It reads like it's all about Norton AntiSpam. If it is, you're really asking in the wrong place! I doubt anyone here will be able to help. > Question is are there any Spambayes problems with OE6 and XP? As far as we know there aren't any problems with SpamBayes specific to Outlook Express. If you want to read about the bugs we do know about, take a look at the list on . In general, you should be fine (although using 1.0a5 rather than 1.0a6 might be advisable). =Tony Meyer From wjgraf at lynchburg.net Mon Sep 29 14:58:44 2003 From: wjgraf at lynchburg.net (WJGraff) Date: Tue Sep 30 04:47:25 2003 Subject: [Spambayes] Feature Request Outlook Plug-In Message-ID: <004c01c386bb$b4f11290$10659b0c@fatboy> Please consider designing SpamBayes to run as an Outlook "rule" so that the processing of incoming emails can be handled more effectively. From marc at marcdegraauw.com Tue Sep 30 05:29:09 2003 From: marc at marcdegraauw.com (Marc de Graauw) Date: Tue Sep 30 05:29:13 2003 Subject: [Spambayes] (no subject) Message-ID: <5.2.1.1.0.20030930112514.00b47d58@localhost> I can run SpamBayes with: > pop3proxy_service.py debug but when I try to run it with: > net start pop3proxy_service SpamBayes does not start and puts the follwoing error in Event Log: Python could not import the service's module File "C:\sb\spambayes-1.0a6\windows\pop3proxy_service.py", line 82, in ? import sb_server File "C:\sb\spambayes-1.0a6\scripts\sb_server.py", line 100, in ? import spambayes.message File "C:\sb\spambayes-1.0a6\spambayes\message.py", line 149, in ? File "C:\sb\spambayes-1.0a6\spambayes\message.py", line 114, in __init__ File "C:\sb\spambayes-1.0a6\spambayes\dbmstorage.py", line 53, in open File "C:\sb\spambayes-1.0a6\spambayes\dbmstorage.py", line 36, in open_best File "C:\sb\spambayes-1.0a6\spambayes\dbmstorage.py", line 12, in open_db3hash File "C:\PROGRA~1\PYTHON\lib\site-packages\bsddb3\__init__.py", line 162, in hashopen d.open(file, db.DB_HASH, flags, mode) Py 2.2.3 / win32all-152 / bsddb3-4.1.6.win32-py2.2.exe (Cannot run 2.3 because mod_python does not support it yet). Anybody seen something like this? Cheers Marc From rcoe at CambridgeMA.GOV Tue Sep 30 10:07:01 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Tue Sep 30 10:09:13 2003 Subject: [Spambayes] RE: Need more training messages Message-ID: > From: Skip Montanaro [mailto:skip@pobox.com] > Sent: Sunday, September 28, 2003 4:15 PM > To: Coe, Bob > Cc: spambayes@python.org > Subject: Re: [Spambayes] RE: Need more training messages > > > Skip> I doubt a few non-English hams and spam would hurt. Let's > Skip> limit it to Western European languages (no Hebrew or Japanese, > Skip> for example). > > Bob> I don't see the point of the limitation to Western European > Bob> spams. I'm firmly in the English-speaking world (no wisecracks > Bob> from the British Empire, please!), but a high percentage of my > Bob> spam is in Russian, Chinese, Japanese, etc. > > We know very little about how well SpamBayes works on *ham* which is > written in non-Western European character sets. The idea is that we > provide an initial training database which allows SpamBayes to do a > reasonable job scoring mail at the start. I wouldn't want to include > Asian spam and no Asian ham. If a Japanese user installs SB and uses > the starter database, they would likely be disappointed. > > Skip Well, either... - There are users of Spambayes in orthographically diverse languages, in which case those users should be able to contribute some ham samples, as well as their experience regarding the accuracy of Spambayes's classification in their languages, or - There aren't, in which case Spambayes's performance on ham written in such languages is (at least for the time being) irrelevant. But having said that, I have a broader confusion. I had thought that a few weeks ago the Spambayes development community had become convinced that it was just as effective to start with zero messages and let the program build its database from scratch. The theory (I thought) was that the tedium of dealing initially with the classification of all messages as ambiguous would be balanced by the fact that other users' idiosyncratic definitions of what is or isn't spam would be removed from the equation. Indeed, wasn't the ability to start from scratch specifically added to the most recent Spambayes versions to accommodate this thinking? Am I remembering wrong? Or have the developers changed their minds? If not, why the renewed emphasis on the starter database? Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 From skip at pobox.com Tue Sep 30 10:22:17 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 30 10:22:29 2003 Subject: [Spambayes] RE: Need more training messages In-Reply-To: References: Message-ID: <16249.37273.902935.464603@montanaro.dyndns.org> Bob> Well, either... Bob> - There are users of Spambayes in orthographically diverse Bob> languages, in which case those users should be able to contribute Bob> some ham samples, as well as their experience regarding the Bob> accuracy of Spambayes's classification in their languages, or Bob> - There aren't, in which case Spambayes's performance on ham Bob> written in such languages is (at least for the time being) Bob> irrelevant. I agree those are the two cases (), but don't agree with your conclusions. If we distribute SpamBayes with a default db that performs miserably on Asian ham, we're not likely to win a lot of support. I would prefer that the default database process what current users normally encounter in a reasonable way. I don't expect people will be able to avoid training altogether. I just don't want the first couple of batches to all score 0.5. Bob> But having said that, I have a broader confusion. I had thought Bob> that a few weeks ago the Spambayes development community had become Bob> convinced that it was just as effective to start with zero messages Bob> and let the program build its database from scratch. The theory (I Bob> thought) was that the tedium of dealing initially with the Bob> classification of all messages as ambiguous would be balanced by Bob> the fact that other users' idiosyncratic definitions of what is or Bob> isn't spam would be removed from the equation. Indeed, wasn't the Bob> ability to start from scratch specifically added to the most recent Bob> Spambayes versions to accommodate this thinking? Bob> Am I remembering wrong? Or have the developers changed their minds? Bob> If not, why the renewed emphasis on the starter database? There's no "emphasis" on a starter database. It's just worth taking a look at. Think of it as yet another test. I now have a database with 18 spams and 22 hams. It's a 280kbyte pickle (89kbytes zipped). I'll try and run some tests today to see how it does. If you'd like to try it out, it's at http://manatee.mojam.com/~skip/training.pck Skip From qub at qub.com Tue Sep 30 10:36:52 2003 From: qub at qub.com (Mikhail Yakoubov) Date: Tue Sep 30 10:41:31 2003 Subject: [Spambayes] 500 server error on trying to configure IMAP trainig folders Message-ID: <3F799504.85B4A264@qub.com> Hello, spambayes 1.a06, Python 2.3.1 with win4all extensions installed, Win2K, started imapfilter -b successfully entered IMAP server name, login and password, but then on page http://qub.DOMAIN:8880/trainingfolders it gives an error -- 500 Server error Traceback (most recent call last): File "C:\Qub\Soft\Python\Lib\site-packages\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "C:\Qub\Soft\Python\Lib\site-packages\spambayes\ImapUI.py", line 214, in onTrainingfolders self._login_to_imap() File "C:\Qub\Soft\Python\Lib\site-packages\spambayes\ImapUI.py", line 195, in _login_to_imap imap = IMAPSession(server, port) NameError: global name 'IMAPSession' is not defined Something worng with my install? Sorry if the question was asked -- looked through the two months worth of messages but didn't find any clue. Thank you in advance, Mike Yakoubov. From services at AndreasK.de Tue Sep 30 10:59:34 2003 From: services at AndreasK.de (AndreasK) Date: Tue Sep 30 11:01:48 2003 Subject: [Spambayes] spamBayes is great, thank you all! Message-ID: What a great program! Many many thanks ...! SpamBayes works just fine, removes 80%-90% SPAM automatically and learns with each manually deleted spam mail. Wow. Great. Only 4 non-spam mails (out of hundreds) were wrongly treated as spam so far - and they actually looked like Spam, I have to admit. The SPAM catastrophy started in ~1996, when mainstream capitalism discovered the web, culminated in about 800 spam mails last week, and ended when I found SpamBayes, actually by reading a web-article which compared SpamBayes with some other software. The one drawback is that all mails are still fetched from the POP3-box which is really annoying with these 116kB MS-Update.exe-Mails. But still, that only costs connection-money, not my precious time... The Outlook plugin is good, too. Seamless integration is a must. If you have German speaking friends, send them my signature below. It explains WHAT and HOW TO - but also shows one urgent need, because many Dummies might not be able to do all 4 steps ;-) Publish an installer which includes all 4 programs: Python, Python-Win32, SpamBayes and SpamBayes-Outlook-Setup ! Would that be a good idea? Obviously you have to ask Python.org - but they'd gain a lot by being installed on many more computers... ciao, Andreas ---- Hey, super: Ich hab das Unglaubliche geschafft, n?mlich SPAM loszuwerden! Endlich, es war zuletzt die absolute H?lle geworden - und nun ist sie scheinbar ?berstanden... Es gibt ein kostenloses Python-Script "SpamBayes", das sich nahtlos in Outlook, etc. einbaut und mittels einer trainierbaren Bayes-Routine alle eMails, die nach SPAM aussehen, automatisch in zwei Ordner wegsortiert - "wahrscheinlich" (15%) und "fast sicher"(90%). Du musst allerdings einiges downloaden, da Dein Rechner vermutlich die moderne Programmiersprache Python noch nicht spricht, deshalb erstmal Python ("current production version"): http://www.python.org/download/ und die Python-Win32-Routinen: http://starship.python.net/crew/mhammond/win32/Downloads.html danach dann von Sourceforge (die wichtigste OpenSource-Community) https://sourceforge.net/project/showfiles.php?group_id=61702 das Hauptprogramm spambayes-1.XXX.zip und das Outlook-Addin SpamBayes-Outlook-Setup-0XXX.exe f?r andere POP-Clients als Outlook g?be es wohl auch Unterst?tzung: http://spambayes.sourceforge.net/windows.html Tja und dann einfach alles in obiger Reihenfolge installieren. Am besten Du hast schon zwei vorsortierte Ordner (zB 400 gew?nschte und 1300 SPAM-Mails), um das Ersttraining vorzunehmen - wenn Du bisher immer alle SPAMs sofort gel?scht hast, ist das auch nicht so schlimm, dann dr?ckst Du einfach bei jeder ankommenden SPAM mail den neuen gelben Button in Outlook "Delete as SPAM" - und nach und nach wird SpamBayes immer mehr von alleine erkennen. Toll, nech? From scott at ertrescue.com Tue Sep 30 11:38:26 2003 From: scott at ertrescue.com (Scott Blackburn) Date: Tue Sep 30 11:34:45 2003 Subject: [Spambayes] gold mine Message-ID: <200309301534.h8UFYdKV071751@superman.pns.networktel.net> My mail program is called Goldmine, do you know it your spam blocker will work = with it. From what I have read I should be able to use you pop3 filter program. Scott Blackburn, NREMT-P Emergency Response Training, Inc. 888-639-4600, 225-344-0970, Fax 225-344-0972 www.ertrescue.com From rcoe at CambridgeMA.GOV Tue Sep 30 12:32:21 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Tue Sep 30 12:32:25 2003 Subject: [Spambayes] RE: Need more training messages Message-ID: > From: Skip Montanaro [mailto:skip@pobox.com] > Sent: Tuesday, September 30, 2003 10:22 AM > To: Coe, Bob > Cc: spambayes@python.org > Subject: RE: [Spambayes] RE: Need more training messages > > > Bob> Well, either... > > Bob> - There are users of Spambayes in orthographically diverse > Bob> languages, in which case those users should be able to > Bob> contribute some ham samples, as well as their experience > Bob> regarding the accuracy of Spambayes's classification in > Bob> their languages, or > > Bob> - There aren't, in which case Spambayes's performance on ham > Bob> written in such languages is (at least for the time being) > Bob> irrelevant. > > I agree those are the two cases (), but don't agree with your > conclusions. If we distribute SpamBayes with a default db that > performs miserably on Asian ham, we're not likely to win a lot of > support. I would prefer that the default database process what > current users normally encounter in a reasonable way. ... I think I detect an a priori confidence that the same version of the Spambayes classifier, if properly trained, can work effectively on both European and Asian languages. I wonder if that confidence isn't unduly optimistic. For example, ... I presume that the Spambayes classifier tokenizes the incoming character stream according to an algorithm that depends heavily on clearly defined word markers (spaces and punctuation marks) that are largely absent, or at least less prominent, in Chinese. But if you try to tokenize the individual characters of written Chinese, you'll find that they're much more context sensitive than the words of an English sentence are. To put it another way, many Chinese "words" consist of two or three characters, and the information theoretical redundancy of a given character is rather low (not as low as in spoken Chinese, but low enough to be a problem). Which means, I suspect, that the tokenizer, if it's to be effective on Chinese text, will require somewhat more lookahead capability than it probably has now. The problem isn't insurmountable, of course, but I think it casts doubt on the "One size fits all" approach. Such problems as this probably don't affect the classifier's ability to differentiate Chinese text from English ham. And if you're a user for whom all Chinese text is spam, that should be good enough. But differentiating Chinese spam from Chinese ham may be beyond the capability of the current classifier. (By way of disclaimer, I don't speak Chinese and I haven't seen the internals of the Spambayes classifier. But I've written compilers and other text processing software, so I'm not a total novice either.) Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 From skip at pobox.com Tue Sep 30 12:32:33 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 30 12:32:50 2003 Subject: [Spambayes] gold mine In-Reply-To: <200309301534.h8UFYdKV071751@superman.pns.networktel.net> References: <200309301534.h8UFYdKV071751@superman.pns.networktel.net> Message-ID: <16249.45089.428815.230680@montanaro.dyndns.org> Scott> My mail program is called Goldmine, do you know it your spam Scott> blocker will work with it. From what I have read I should be able Scott> to use you pop3 filter program. Probably, but you'll have to try it to see. You might want to wait for the soon-to-be-released Windows binary installer, which will make it easier to get going with SpamBayes on Windows if you don't use the Outlook plugin. Skip From skip at pobox.com Tue Sep 30 13:10:53 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 30 13:11:03 2003 Subject: [Spambayes] RE: Need more training messages In-Reply-To: References: Message-ID: <16249.47389.679250.994995@montanaro.dyndns.org> >> I agree those are the two cases (), but don't agree with your >> conclusions. If we distribute SpamBayes with a default db that >> performs miserably on Asian ham, we're not likely to win a lot of >> support. I would prefer that the default database process what >> current users normally encounter in a reasonable way. ... Bob> I think I detect an a priori confidence that the same version of Bob> the Spambayes classifier, if properly trained, can work effectively Bob> on both European and Asian languages. I wonder if that confidence Bob> isn't unduly optimistic. For example, ... You're reading too much into my hen scratches. But why (possibly) needlessly prejudice a future segment of our population? Bob> I presume that the Spambayes classifier tokenizes the incoming Bob> character stream according to an algorithm that depends heavily on Bob> clearly defined word markers (spaces and punctuation marks) that Bob> are largely absent, or at least less prominent, in Chinese. Correct. Bob> insurmountable, of course, but I think it casts doubt on the "One Bob> size fits all" approach. I don't believe I suggested that. More likely than J. Random Yoshi in Tokyo picking up SpamBayes is a dual-language person (a student or green-card holder) in the US or Europe trying it out. That person is likely to get ham and spam in both European and Asian character sets. I'd like for their Asian ham to suddenly not all wind up in their spam folder because you and I can't read Chinese. It's quite possible that SpamBayes will fall flat on its face distinguishing Asian ham and spam anyway. I'd prefer that all be "unsure" to start with and let the user try training the different classes of mail. There are a couple native Chinese speakers in my group here at Northwestern. I should ask them if get any email written using Chinese character sets and would like to try out SB. Skip From david.matos at comcast.net Tue Sep 30 14:34:59 2003 From: david.matos at comcast.net (David Matos) Date: Tue Sep 30 14:35:06 2003 Subject: [Spambayes] Toolbar icon customization In-Reply-To: Message-ID: <000f01c38781$8e375df0$8d80b042@dexter> (Peter, thanks for the tip!) The good news is that I was able to follow the directions and edit the icons easily. Fantastic! The bad news is that when I try to rename the buttons according to the directions, the SpamBayes buttons do not appear as an option when I click "customize." In other words, the buttons function correctly, the toolbar can be turned off and on, the new icons appear correctly, but I can't "customize" the names for the SB buttons. Anyone have any ideas? (I'd *really* like to change the button names if it's at all possible.) -----Original Message----- Date: Mon, 29 Sep 2003 08:06:15 -0800 From: Peter Strisik Subject: RE: [Spambayes] Toolbar icon customization To: spambayes@python.org Message-ID: <0HLZ008O4GQFDP@mmp-1.gci.net> Content-Type: text/plain; charset=US-ASCII In the Images folder, make copies of the "recover_ham" and "delete_as_spam" bitmaps as backups. Edit the originals as desired. In Outlook, right click on the toolbar and select "customize." Under the "Toolbars" tab, select the SpamBayes toolbar and delete it. Exit and restart outlook. Spambayes will recreate the toolbar with the new icons. In outlook, move to your "unsure" folder so both spam and ham buttons are showing in the toolbar. Right click on the toolbar and select "customize". Right click on the button you want to edit. Change the text in the "Name" field. Repeat for each button you want to change. Close the dialog. Voila. From lists at strisik.com Tue Sep 30 14:54:12 2003 From: lists at strisik.com (Peter Strisik) Date: Tue Sep 30 14:54:23 2003 Subject: [Spambayes] Toolbar icon customization In-Reply-To: <000f01c38781$8e375df0$8d80b042@dexter> Message-ID: <0HM1005FKJ6C8F@mmp-3.gci.net> Once in customize mode, did you right click on the button you want to edit? There should be a drop down with an editable "Name" field. ......Peter David Matos <> emailed on Tuesday, September 30, 2003 10:35 AM: (at least in part)..... > > The bad news is that when I try to rename the buttons according to > the directions, the SpamBayes buttons do not appear as an option when > I click "customize." In other words, the buttons function correctly, > the toolbar can be turned off and on, the new icons appear correctly, > but I can't "customize" the names for the SB buttons. > > Anyone have any ideas? (I'd *really* like to change the button names > if it's at all possible.) From david.matos at comcast.net Tue Sep 30 15:17:27 2003 From: david.matos at comcast.net (David Matos) Date: Tue Sep 30 15:17:30 2003 Subject: [Spambayes] Toolbar icon customization In-Reply-To: Message-ID: <000001c38787$7cf17700$8d80b042@dexter> OK, please ignore my last post. I figured out what I was doing wrong; I misunderstood part of the directions :P It took me a few times to realize that I had to right click the button itself in the main Outlook window, not the "customize" window. Peter, Thanks again for the tip. From alan at mullen.demon.co.uk Tue Sep 30 15:55:33 2003 From: alan at mullen.demon.co.uk (Alan Campbell) Date: Tue Sep 30 15:55:36 2003 Subject: [Spambayes] Error from pop3proxy Message-ID: <000301c3878c$cf2cd960$6401a8c0@home.co.uk> I got this error from pop3proxy running on my W2K server :- error: uncaptured python exception, closing channel (bsddb._db.DBRunRecoveryError:(-3098 2, 'DB_RUNRECOVERY: Fatal error, run database recovery -- fatal region error det ected; run recovery') [C:\Python23\lib\asyncore.py|read|69] [C:\Python23\lib\asy ncore.py|handle_read_event|387] [C:\Python23\lib\asynchat.py|handle_read|136] [C :\spambayes-1.0a6\scripts\sb_smtpproxy.py|found_terminator|222] [C:\spambayes-1. 0a6\scripts\sb_smtpproxy.py|onProcessData|318] [C:\spambayes-1.0a6\scripts\sb_sm tpproxy.py|train|433] [C:\spambayes-1.0a6\spambayes\classifier.py|learn|211] [C: \spambayes-1.0a6\spambayes\classifier.py|_add_msg|346] [C:\spambayes-1.0a6\spamb ayes\storage.py|_wordinfoset|270] [C:\Python23\lib\shelve.py|__setitem__|130] [C :\Python23\lib\bsddb\__init__.py|__setitem__|90]) I was attempting to forward the following spam for training (it had been clasified as ham) :- Hey, just got this new toy. Saves me 30% on my gas, proven to work and seen all over the place. Mine paid for itself in 2 weeks. A worthwhile investment, and guaranteed to work. Can't go wrong, go see what I mean. I'll be saving 30% for lfe at the pumps now. Later. http://www.d3kn.org/default.asp?axel=1465 take me 0ff please. http://www.vnmc.org/out1v/pre-rem2e.asp g dfg fd g dfg dg iqydcpnqwsh xhdd lrvxe rad pcw j dljuaxayf sxq ngq vltc uk ksnjqaplfmptwvwlblgma --- mailto:alan@mullen.demon.co.uk http://www.mullen.demon.co.uk/ From rcoe at CambridgeMA.GOV Tue Sep 30 16:09:44 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Tue Sep 30 16:09:53 2003 Subject: [Spambayes] RE: Saving SB Filters Message-ID: It appears from your message headers that you're an Outlook user. If you use the Spambayes Outlook plugin, the entire configuration, including the database, goes, by default, in your roaming profile. If your profile becomes corrupted, you should be able to restore it from your backup media. (What I do, though, is use an entirely separate account for risky work. If the profile gets guffed, I don't care, because it isn't my real one.) ;^) Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: Justin Chornenky [mailto:Justin@arushgames.com] > Sent: Sunday, September 21, 2003 10:07 PM > To: spambayes@python.org > Subject: [Spambayes] Saving SB Filters > > > Rather than save a large sampling of spam (especially those containing > virii) to rebuild my SB in case of system failure... is it possible to > backup/export the rules/filters that SB runs off of? > > Could not find reference in FAQ or recent archives. > > This is a large issue for me, as I work in a testing environment where > we often wipe our machines after every production cycle. Or where buggy > software does it for us :) I have had to retrain SB a few times now, > and would like to start skipping this procedure. > > Any help is appreciated. > > -Justin From skip at pobox.com Tue Sep 30 16:12:22 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 30 16:12:54 2003 Subject: [Spambayes] Error from pop3proxy In-Reply-To: <000301c3878c$cf2cd960$6401a8c0@home.co.uk> References: <000301c3878c$cf2cd960$6401a8c0@home.co.uk> Message-ID: <16249.58278.674451.186755@montanaro.dyndns.org> Alan> I got this error from pop3proxy running on my W2K server :- Alan> error: uncaptured python exception, closing channel Alan> 0x1833aa8> (bsddb._db.DBRunRecoveryError:(-3098 2, Alan> 'DB_RUNRECOVERY: Fatal error, run database recovery -- fatal Alan> region error detected; run recovery') A number of people have seen this. It appears to be related to incorrect locking of the database in multi-threaded applications. If you're not a programmer that may not make sense to you. The simplest way to work around that may be to switch to the Pickle form of the database. However, before attempting that switch, let us know what version of SpamBayes you are running. Some bugs in the configuration saving code were fixed only recently. Tony's the expert in this area. For the spambayes-dev folks: If we're not properly mediating access to the on-disk version of the DBDictClassifier, we're probably not mediating access to the in-memory PickledClassifier either. Maybe all we need to do is add a little bit of locking to the Classifier class or its subclasses. Skip From alan at mullen.demon.co.uk Tue Sep 30 16:17:35 2003 From: alan at mullen.demon.co.uk (Alan Campbell) Date: Tue Sep 30 16:17:35 2003 Subject: [Spambayes] Error from pop3proxy In-Reply-To: <16249.58278.674451.186755@montanaro.dyndns.org> Message-ID: <000001c3878f$e30255c0$6401a8c0@home.co.uk> Thanks for the quick reply. Its OK, I am a porogrammer, so I do understand what multi-threading is. I am using version 1.06a, which I downloaded a couple of weeks ago. I will try using a pickle. Great product. I am very impressed with it so far. --- mailto:alan@mullen.demon.co.uk http://www.mullen.demon.co.uk/ -----Original Message----- From: Skip Montanaro [mailto:skip@pobox.com] Sent: 30 September 2003 21:12 To: alan@mullen.demon.co.uk Cc: spambayes@python.org; spambayes-dev@python.org Subject: Re: [Spambayes] Error from pop3proxy Alan> I got this error from pop3proxy running on my W2K server :- Alan> error: uncaptured python exception, closing channel Alan> 0x1833aa8> (bsddb._db.DBRunRecoveryError:(-3098 2, Alan> 'DB_RUNRECOVERY: Fatal error, run database recovery -- fatal Alan> region error detected; run recovery') A number of people have seen this. It appears to be related to incorrect locking of the database in multi-threaded applications. If you're not a programmer that may not make sense to you. The simplest way to work around that may be to switch to the Pickle form of the database. However, before attempting that switch, let us know what version of SpamBayes you are running. Some bugs in the configuration saving code were fixed only recently. Tony's the expert in this area. For the spambayes-dev folks: If we're not properly mediating access to the on-disk version of the DBDictClassifier, we're probably not mediating access to the in-memory PickledClassifier either. Maybe all we need to do is add a little bit of locking to the Classifier class or its subclasses. Skip From adam.walker at rbwconsulting.com Tue Sep 30 16:25:58 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Tue Sep 30 16:26:12 2003 Subject: [Spambayes] Feature Request Outlook Plug-In In-Reply-To: <004c01c386bb$b4f11290$10659b0c@fatboy> Message-ID: <20030930202605.4F86013E294@sack.dreamhost.com> That would certainly make thing easier, too bad MS makes doing that next to impossible. > -----Original Message----- > Please consider designing SpamBayes to run as an Outlook "rule" so that > the > processing of incoming emails can be handled more effectively. From sg200002 at exchange.DAYTONOH.NCR.com Tue Sep 30 16:42:18 2003 From: sg200002 at exchange.DAYTONOH.NCR.com (Gillette, Steve E) Date: Tue Sep 30 16:42:32 2003 Subject: [Spambayes] Potential New Bug Message-ID: I'm running Windows XPSP1, Outlook 2003, and SpamBayes .081. The incoming spam messages are marked as read, however, I still get the Notificaion Area Icon for New Unread Messages. Any ideas? Thanks in advance! -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 4701 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030930/9a0814b4/spambayes1.obj From usr7-28hl at xemaps.com Tue Sep 30 17:17:44 2003 From: usr7-28hl at xemaps.com (Keith Russell) Date: Tue Sep 30 17:17:47 2003 Subject: [Spambayes] POP3 Training Question Message-ID: <17059361296.20030930151744@magicspeller.com> Hello, fellow spam resisters. I have a question about training. I imported and trained on about 300 spam messages and an equal number of ham messages right after getting SpamBayes working a couple of days. I don't get a lot of spam and the initial training seems to have done a good job, so since then, I've received only one messages categorized as unsure and several hundred ham. When I go to the training page, I don't see a way to select which messages I want to train. It looks as though I need to train everything I've received, even if they were all correctly categorized to start with. That's easy enough to do, but will greatly skew the ratio of trained ham to trained spam. How can I get around this? Thanks. -- Keith Russell ...Another case of Cherry Coke down the programming hatch! From skip at pobox.com Tue Sep 30 17:43:14 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Sep 30 17:43:27 2003 Subject: [Spambayes] POP3 Training Question In-Reply-To: <17059361296.20030930151744@magicspeller.com> References: <17059361296.20030930151744@magicspeller.com> Message-ID: <16249.63730.722178.789148@montanaro.dyndns.org> Keith> When I go to the training page, I don't see a way to select which Keith> messages I want to train. It looks as though I need to train Keith> everything I've received, even if they were all correctly Keith> categorized to start with. If you're happy with what the system is doing, click the "Discard" link above that column. All radio buttons underneath in that section will be selected. You can then modify any individual items you want to train on (like the unsure item you mentioned) and submit the form. Skip From vanhorn at whidbey.com Tue Sep 30 17:58:22 2003 From: vanhorn at whidbey.com (G. Armour Van Horn) Date: Tue Sep 30 17:58:42 2003 Subject: [Spambayes] POP3 Training Question References: <17059361296.20030930151744@magicspeller.com> Message-ID: <3F79FC7E.9E7F2FE6@whidbey.com> On the Review page, the top of each category has Discard, Defer, Ham, and Spam legends. These are hot, and clicking on any of them marks every message in that category with your choice. When I go to the Review page, I always have all three sections. I click Spam on the Unsure section, then review those to make sure none of them are actually ham. Then I look over the Ham section, looking for Nigerian spam that needs to be marked as such (my filter never spots the long spams), and then I hit Discard on the Spam section, throwing them all out keeps me in balance. I would recommend setting the "Suppress caching of bulk ham" to Yes in the configuration, which will probably cut down on the rate you train Hams, even if you try to remember to hit Discard for both Ham and Spam. On the developer's list I've raised the issue of having a configurable default for each section. (Now it defaults to Defer/Ham/Spam, I would prefer Defer/Discard/Discard so I would normally only be training on errors I correct on a per-message basis.) We'll see if that shows up in the future. Van Keith Russell wrote: > Hello, fellow spam resisters. > > I have a question about training. > > I imported and trained on about 300 spam messages and an equal number > of ham messages right after getting SpamBayes working a couple of > days. > > I don't get a lot of spam and the initial training seems to have done > a good job, so since then, I've received only one messages categorized > as unsure and several hundred ham. > > When I go to the training page, I don't see a way to select which > messages I want to train. It looks as though I need to train > everything I've received, even if they were all correctly categorized > to start with. > > That's easy enough to do, but will greatly skew the ratio of trained > ham to trained spam. How can I get around this? > > Thanks. > > -- > Keith Russell > ...Another case of Cherry Coke down the programming hatch! > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -- ---------------------------------------------------------- Sign up now for Quotes of the Day, a handful of quotations on a theme delivered every morning. Enlightenment! Daily, for free! mailto:twisted@whidbey.com?subject=Subscribe_QOTD For web hosting and maintenance, visit Van's home page: http://www.domainvanhorn.com/van/ ---------------------------------------------------------- From stefan at tspse.net Tue Sep 30 19:02:22 2003 From: stefan at tspse.net (=?iso-8859-1?Q?Stefan_Gr=F6nberg?=) Date: Tue Sep 30 19:02:44 2003 Subject: [Spambayes] Compiled proxy as exe Message-ID: <200310010056703.SM01024@vandread> Could it be any chance to get the pop3proxy compiled as an exe? Because i tried to get it running under Windows Xp Home Edition, and it just complains about something (forgot to copy that) and is there a way compleatle being able to remove spambayes outlook-addin from..outlook? because it makes the loading/start-time get around 20 secs instead of 1-3 secs to start. /stefan From T.A.Meyer at massey.ac.nz Tue Sep 30 19:03:15 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 30 19:03:26 2003 Subject: [Spambayes] POP3 Training Question Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA828@its-xchg4.massey.ac.nz> > On the developer's list I've raised the issue of having a > configurable default for each section. (Now it defaults to > Defer/Ham/Spam, I would prefer Defer/Discard/Discard so I > would normally only be training on errors I correct on a > per-message basis.) We'll see if that shows up in the future. This should have showed up by now for anyone using latest cvs, even given the sourceforge delay. It won't make it into 1.0a7/1.0b1/1.0, though, since the focus there is stability rather than better features :) =Tony Meyer From jpoe at fulcrumit.com Tue Sep 30 19:05:51 2003 From: jpoe at fulcrumit.com (Jim Poe) Date: Tue Sep 30 19:09:28 2003 Subject: [Spambayes] help References: <1ED4ECF91CDED24C8D012BCF2B034F13036DA4E5@its-xchg4.massey.ac.nz> Message-ID: <005601c387a7$64e65250$800101df@fulcrumit.com> Why the warning about 1.0a6? Thanks ----- Original Message ----- From: "Meyer, Tony" To: "lockinventor" ; Sent: Monday, September 29, 2003 5:17 PM Subject: RE: [Spambayes] help > I purchase last week and Norton AntiSpam arrived on Saturday. > there is a problem with it on OE6 on windows XP. See below > from Samantec site. Is any of this about spambayes? It reads like it's all about Norton AntiSpam. If it is, you're really asking in the wrong place! I doubt anyone here will be able to help. > Question is are there any Spambayes problems with OE6 and XP? As far as we know there aren't any problems with SpamBayes specific to Outlook Express. If you want to read about the bugs we do know about, take a look at the list on . In general, you should be fine (although using 1.0a5 rather than 1.0a6 might be advisable). =Tony Meyer _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From T.A.Meyer at massey.ac.nz Tue Sep 30 19:15:46 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Sep 30 19:16:02 2003 Subject: [Spambayes] help Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13036DA843@its-xchg4.massey.ac.nz> >> In general, you should >> be fine (although using 1.0a5 rather than 1.0a6 might be advisable). > Why the warning about 1.0a6? There's a bug in it that means that you need to restart SpamBayes after every time you change your options via the web interface. Not serious, just annoying, but it could be worth using 1.0a5 and then getting 1.0a7/1.0b1 when it comes out. There has been talk on -dev about putting 1.0a7 out soon to address this, although I'm not sure when we'll find a chance to do it. =Tony Meyer From mhammond at skippinet.com.au Tue Sep 30 19:41:31 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue Sep 30 19:41:32 2003 Subject: [Spambayes] Compiled proxy as exe In-Reply-To: <200310010056703.SM01024@vandread> Message-ID: <041601c387ac$6122ef70$f502a8c0@eden> > Could it be any chance to get the pop3proxy > compiled as an exe? We have done this, but just don't want the extra pain it will cause us :) We need a "champion" or 2 who would be the "point person" for it. Any takers? > Because i tried to get it running under Windows Xp Home Edition, > and it just complains about something (forgot to copy that) The "compiled" version is likely to work (and fail) in the same way as the source code version, so you are likely to have the same problem. > and is there a way compleatle being able to remove spambayes > outlook-addin > from..outlook? > because it makes the loading/start-time get around 20 secs > instead of 1-3 > secs to start. Control Panel->Add Remove Programs will remove it, but you must delete the SpamBayes toolbar in Outlook manually. Are you sure SpamBayes is responsible for that slowdown? No one else reports anything similar. Mark. From tim.one at comcast.net Tue Sep 30 20:41:23 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 30 20:41:38 2003 Subject: [Spambayes] Error from pop3proxy In-Reply-To: <16249.58278.674451.186755@montanaro.dyndns.org> Message-ID: [Skip] > ... > For the spambayes-dev folks: If we're not properly mediating access > to the on-disk version of the DBDictClassifier, we're probably not > mediating access to the in-memory PickledClassifier either. Maybe > all we need to do is add a little bit of locking to the Classifier > class or its subclasses. We don't do any locking in storage.py, mainly because that wasn't intended to be thread-safe. Before trying to make it thread-safe, someone has to identify the specific use cases in which the API has to support concurrent access. I really don't know what they may be, and low-level locking can be very expensive. If it amounts to no more than making training mutually exclusive with scoring, then some gross locks at a higher level would be a lot cheaper. But, so far, nobody has identified a specific sequence of actions leading to DBRunRecoveryError. I'll speculate about one possible problem with Berkeley: if it isn't shut down cleanly, DBRunRecoveryError may well be an *expected* exception when you next start it, and running recovery at such times would then be a normal part of using Berkeley. Until we know what's triggering DBRunRecoveryError, I'm just as inclined to believe it can't be fixed without incorporating recovery as I am to believe it's due to a thread race. From tim.one at comcast.net Tue Sep 30 20:56:52 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 30 20:56:54 2003 Subject: [Spambayes] RE: Need more training messages In-Reply-To: Message-ID: [Bob Coe] > I think I detect an a priori confidence that the same version of > the Spambayes classifier, if properly trained, can work effectively > on both European and Asian languages. I wonder if that confidence > isn't unduly optimistic. For example, ... I expect SpamBayes in its current form to be useless for users whose email is primarily in Asian languages, unless all their ham is Asian and all their spam is non-Asian. > I presume that the Spambayes classifier tokenizes the incoming > character stream according to an algorithm that depends heavily on > clearly defined word markers (spaces and punctuation marks) In the message body, we split on whitespace, and that's all. Punctuation is a non-SpamBayes concept in the body. Header tokenization makes many more assumptions, but the legal characters in email headers are constrained by standards in Anglo-centric ways. > that are largely absent, or at least less prominent, in Chinese. But > if you try to tokenize the individual characters of written Chinese, > you'll find that they're much more context sensitive than the words > of an English sentence are. [etc] The tokenizer pays no attention to any of that. The most common output when tokening an Asian language is a long string of synthesized "skip" tokens (splitting on whitespace yields long strings then, and strings of length greater than 12 get replaced by a synthesized skip token). In addition, the Outlook client has [Tokenizer] replace_nonascii_chars: True enabled, which replaces "control" and "high bit" characters each with a question mark before tokenization. So in the Outlook client, the primary output from parsing the body of an Asian language message is a bunch of synthesized "skip: ? N" tokens. That's great for weeding out Asian spam for European and American users, and sucks for everyone else. If everyone else wants something better, everyone else can volunteer to do the extensive research it takes to do something better . From d.bakin at comcast.net Tue Sep 30 20:59:40 2003 From: d.bakin at comcast.net (David Bakin) Date: Tue Sep 30 21:00:12 2003 Subject: [Spambayes] Compiled proxy as exe References: <041601c387ac$6122ef70$f502a8c0@eden> Message-ID: <00b001c387b7$4f517810$9477e40c@girlsprout> To run the proxy as a service I needed to go to Services (Control Panel/Administrative Tools/Services) and set the login account of the service to my login account - it got installed as Local System which doesn't have access to networking. I'm running XP Pro but I think the same goes for XP Home. If your control panel doesn't have "Administrative Tools" try this at the Start/Run: %SystemRoot%\system32\services.msc /s (For %SystemRoot% substitute where you have Windows installed, usually C:\WINDOWS) ----- Original Message ----- From: "Mark Hammond" To: "'Stefan Gr?nberg'" ; Sent: Tuesday, September 30, 2003 4:41 PM Subject: RE: [Spambayes] Compiled proxy as exe > > Could it be any chance to get the pop3proxy > > compiled as an exe? > > We have done this, but just don't want the extra pain it will cause us :) > We need a "champion" or 2 who would be the "point person" for it. Any > takers? > > > Because i tried to get it running under Windows Xp Home Edition, > > and it just complains about something (forgot to copy that) > > The "compiled" version is likely to work (and fail) in the same way as the > source code version, so you are likely to have the same problem. > > > and is there a way compleatle being able to remove spambayes > > outlook-addin > > from..outlook? > > because it makes the loading/start-time get around 20 secs > > instead of 1-3 > > secs to start. > > Control Panel->Add Remove Programs will remove it, but you must delete the > SpamBayes toolbar in Outlook manually. > > Are you sure SpamBayes is responsible for that slowdown? No one else > reports anything similar. > > Mark. > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From tim.one at comcast.net Tue Sep 30 21:10:56 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Sep 30 21:10:58 2003 Subject: [Spambayes] spamBayes is great, thank you all! In-Reply-To: Message-ID: [AndreasK] > What a great program! Many many thanks ...! > > SpamBayes works just fine, removes 80%-90% SPAM automatically and > learns with each manually deleted spam mail. Wow. Great. > Only 4 non-spam mails (out of hundreds) were wrongly treated as spam > so far - and they actually looked like Spam, I have to admit. spambayes "should be" doing better than that. It's best if you train it on an approximately equal number of ham and spam. If you've done so, and have trained on at least several hundred of each, then I'd expect better performance than you report here. If your primary language isn't English, that could explain it, as *most* developers and testers here use English. If, for example, your primary email language is German, then the Outlook addin's [Tokenizer] replace_nonascii_chars: True setting may be inappropriate for you, and the default skip_max_word_size value of 12 may be too small (13-character words like Unterst?tzung are hurt by both of those: first the ? gets replaced by a question mark due to replace_nonascii_chars, and then the whole word gets replaced by a synthesized "skip: U 10" token because 13 > 12). We've done almost nothing here on tuning for languages other than English, so I expect the default settings work best with English. Still, I appreciate that it's better than nothing even with the poor performance you reported . From JohnH at snetworking.com Tue Sep 30 23:02:26 2003 From: JohnH at snetworking.com (John Hall) Date: Tue Sep 30 23:03:04 2003 Subject: [Spambayes] Feature Request -(optional) Ham Folder Message-ID: <223286A46C32EE42BF7C2E4177EAD4260AEF28@sylvester2.snetworking.com> SpamBayes has a Spam and Unsure Folder. In our case, we forward email a Blackberry. But the Blackberry gets ALL the email including the Spam. Grrrrr. If SpamBayes had an (optional) Ham Folder, we could forward all the Ham email to the Blackberry that goes to the Ham folder. What do you think? John