From Waichi_Ardisj at mail.com Sun Feb 1 12:26:52 2004 From: Waichi_Ardisj at mail.com (Complaint Center) Date: Sat Jan 31 23:31:46 2004 Subject: [Spambayes] This will save your time. Message-ID: Skipped content of type multipart/alternative From dreas at emailaccount.nl Sun Feb 1 11:26:33 2004 From: dreas at emailaccount.nl (Dreas van Donselaar) Date: Sun Feb 1 11:36:47 2004 Subject: [Spambayes] Delete from server Message-ID: Hi all, SpamBayes is functioning quite well here. I just have one small inconvenience. Since I am mobile all the time I always keep 5 days of email stored on my POP3 server. I'd like all SPAM messages however to be removed from the server by your software (so it will be in my Outlook SPAM folder but not in my webmail client). Can this be implemented? Regards, Dreas van Donselaar -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040201/f58f319f/attachment.html From skipdmsi at comcast.net Sun Feb 1 14:21:45 2004 From: skipdmsi at comcast.net (skip debrossy) Date: Sun Feb 1 14:21:40 2004 Subject: [Spambayes] outlook express? Message-ID: <000801c3e8f8$a1c87bf0$6401a8c0@skipd865perl> Can Spambayes be used with outlook express? thanks, Skip Debrossy -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040201/02a02586/attachment.html From skipdmsi at comcast.net Sun Feb 1 16:59:12 2004 From: skipdmsi at comcast.net (skip debrossy) Date: Sun Feb 1 16:59:12 2004 Subject: [Spambayes] outlook express? Message-ID: <000b01c3e90e$a0394e70$6401a8c0@skipd865perl> Can spambayes be used with outlook express? Thanks, Skip Debrossy -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040201/cc18e8b6/attachment.html From tim at fourstonesExpressions.com Sun Feb 1 17:07:06 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Sun Feb 1 17:07:16 2004 Subject: [Spambayes] outlook express? In-Reply-To: <000b01c3e90e$a0394e70$6401a8c0@skipd865perl> References: <000b01c3e90e$a0394e70$6401a8c0@skipd865perl> Message-ID: Yes. You'll configure sb_server as your pop3proxy. It's relatively easy, and instructions are included in the package. On Sun, 1 Feb 2004 16:59:12 -0500, skip debrossy wrote: > Can spambayes be used with outlook express? > Thanks, > Skip Debrossy -- Vous exprimer; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From tim at fourstonesExpressions.com Sun Feb 1 17:12:30 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Sun Feb 1 17:12:37 2004 Subject: [Spambayes] Delete from server In-Reply-To: References: Message-ID: On Sun, 1 Feb 2004 17:26:33 +0100, Dreas van Donselaar wrote: > Hi all, > > SpamBayes is functioning quite well here. I just have one small > inconvenience. Since I am mobile all the time I always keep 5 days of > email > stored on my POP3 server. I'd like all SPAM messages however to be > removed > from the server by your software (so it will be in my Outlook SPAM folder > but not in my webmail client). > > Can this be implemented? This is currently only possible if you are able to install and configure software on your mail server. I presume that this isn't the case. It's not really a matter of our being unable or unwilling to implement such a thing. It simply isn't possible because of the way the pop3 protocol operates. -- Vous exprimer; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From dreas at emailaccount.nl Sun Feb 1 17:23:50 2004 From: dreas at emailaccount.nl (Dreas van Donselaar) Date: Sun Feb 1 17:23:57 2004 Subject: [Spambayes] Delete from server In-Reply-To: Message-ID: Then what is the option "POP3 account settings, Advanced, Remove from server when deleted from 'Deleted Items'" used for? Doesn't that mean Outlook can remove individual messages from the POP3 server? Dreas van Donselaar -----Original Message----- From: Tim Stone [mailto:tim@fourstonesExpressions.com] Sent: zondag 1 februari 2004 23:13 To: Dreas van Donselaar; spambayes@python.org Subject: Re: [Spambayes] Delete from server On Sun, 1 Feb 2004 17:26:33 +0100, Dreas van Donselaar wrote: > Hi all, > > SpamBayes is functioning quite well here. I just have one small > inconvenience. Since I am mobile all the time I always keep 5 days of > email stored on my POP3 server. I'd like all SPAM messages however to > be removed from the server by your software (so it will be in my > Outlook SPAM folder but not in my webmail client). > > Can this be implemented? This is currently only possible if you are able to install and configure software on your mail server. I presume that this isn't the case. It's not really a matter of our being unable or unwilling to implement such a thing. It simply isn't possible because of the way the pop3 protocol operates. -- Vous exprimer; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From mhammond at skippinet.com.au Sun Feb 1 17:33:53 2004 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun Feb 1 17:34:06 2004 Subject: [Spambayes] Delete from server In-Reply-To: Message-ID: <439801c3e913$793d3250$0200a8c0@eden> > Then what is the option "POP3 account settings, Advanced, > Remove from server > when deleted from 'Deleted Items'" used for? Doesn't that > mean Outlook can > remove individual messages from the POP3 server? I guess it would be possible, simply by starting a new POP session, and somehow identifying the message in the list, and deleting it. However, we already have some issues with this - see bug [ 773364 ] "Delete >From Server" doesn't work on filtered mail: https://sourceforge.net/tracker/?group_id=61702&atid=498103&func=detail&aid= 773364 As always, contributions gratefully accepted! Mark. From tony at GPSLTD.COM.AU Sun Feb 1 17:41:40 2004 From: tony at GPSLTD.COM.AU (Tony Veale) Date: Sun Feb 1 17:37:19 2004 Subject: [Spambayes] Virus Found in message "Hi" Message-ID: <218389D5EE379044932BA8BFF94C897704A110@69-150.dsl.connexus.net.au> Symantec AntiVirus found a virus in an attachment you (spambayes@python.org ) sent to Tony Veale. To ensure the recipient(s) are able to use the files you sent, perform a virus scan on your computer, clean any infected files, then resend this attachment. Attachment: critutxj.exe Virus name: W32.Beagle.A@mm Action taken: Clean failed : Quarantine succeeded : File status: Infected -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/ms-tnef Size: 1728 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040202/ff470a10/attachment.bin From tim.one at comcast.net Sun Feb 1 17:48:47 2004 From: tim.one at comcast.net (Tim Peters) Date: Sun Feb 1 17:48:53 2004 Subject: [Spambayes] Delete from server In-Reply-To: Message-ID: [Dreas van Donselaar] > Then what is the option "POP3 account settings, Advanced, Remove from > server when deleted from 'Deleted Items'" used for? Doesn't that mean > Outlook can remove individual messages from the POP3 server? Yes, it can. The deeper problem is that the Outlook addin plays no role in talking to your POP3, IMAP or SMTP servers. It contains no code *capable* of talking to them, either. So there's nothing it can do here unless someone contributes a large pile of new code. Other ways of using the spambayes codebase do contain pieces capable of talking POP3 (etc), but they're not part of the Outlook addin. There's a bug in Outlook 2000 that can be exploited, though (I'm told it doesn't work this way in later versions of Outlook, but have no personal knowledge of this one way or the other): create a new .pst file, create your Spam folder in that .pst file, and tell the addin to move spam into that new Spam folder. Outlook 2000 treats messages moved to a different .pst file exactly the same way it treats messages "deleted from 'Deleted Items'": if you have the latter Outlook option enabled, the spam will vanish from your POP3 server. The addin plays no direct role in this, and the same thing will happen if, for example, you move messages by hand from your Inbox to a folder in a different .pst file. If you're a programmer, you could try adapting Andrew Dalke's program: http://www.entrian.com/sbwiki/SpamBayesCuller Andrew doesn't use Outlook, though, and it's unclear how to make that program play nice with Outlook. From deedeemurry at greatbigisland.com Sun Feb 1 18:18:07 2004 From: deedeemurry at greatbigisland.com (Dee Dee) Date: Sun Feb 1 18:18:06 2004 Subject: [Spambayes] Help! Problem with Spambayes Message-ID: I have been using Spambayes for several months now and love it. The other day, suddenly some spam started showing up in my inbox, although others went into the "possible spam" folder as always. I noticed my "spam" folder was gone, and found it in my "deleted" folder along with two other folders titles "spam 1" and "spam 2". Neither of which I had created. So I tried to reconfigure my spambayes again by hitting the configuration wizard, but when I hit that button, all that happens is the window that it was in, closes. Nothing more. If I try to delete something as Spam from my inbox, it tells me I need to configure my Spambayes. I have uninstalled spambayes and downloaded and reinstalled it. But it is doing the same thing. I have created new spam folders, etc, but nothing helps. All the other buttons, other than the configuration wizard in the Spambayes manager, seem to be working ok. Can anyone help me get my spambayes back?? Thanks!! Dee Dee -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040201/b7f99a16/attachment.html From deedeemurry at greatbigisland.com Sun Feb 1 18:50:09 2004 From: deedeemurry at greatbigisland.com (Dee Dee) Date: Sun Feb 1 18:50:10 2004 Subject: [Spambayes] Need help! Configuration wizard won't work. Message-ID: I have been using Spambayes for several months now and love it. The other day, suddenly some spam started showing up in my inbox, although others went into the "possible spam" folder as always. I noticed my "spam" folder was gone, and found it in my "deleted" folder along with two other folders titles "spam 1" and "spam 2". Neither of which I had created. So I tried to reconfigure my spambayes again by hitting the configuration wizard, but when I hit that button, all that happens is the window that it was in, closes. Nothing more. If I try to delete something as Spam from my inbox, it tells me I need to configure my Spambayes. I have uninstalled spambayes and downloaded and reinstalled it. But it is doing the same thing. I have created new spam folders, etc, but nothing helps. All the other buttons, other than the configuration wizard in the Spambayes manager, seem to be working ok. Can anyone help me get my spambayes back?? Thanks!! Dee Dee -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040201/52da100a/attachment-0001.html From tim at fourstonesExpressions.com Sun Feb 1 19:07:13 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Sun Feb 1 19:07:24 2004 Subject: [Spambayes] Delete from server In-Reply-To: References: Message-ID: On Sun, 1 Feb 2004 17:48:47 -0500, Tim Peters wrote: > [Dreas van Donselaar] >> Then what is the option "POP3 account settings, Advanced, Remove from >> server when deleted from 'Deleted Items'" used for? Doesn't that mean >> Outlook can remove individual messages from the POP3 server? > > Yes, it can. > > The deeper problem is that the Outlook addin plays no role in talking to > your POP3, IMAP or SMTP servers. It contains no code *capable* of > talking > to them, either. So there's nothing it can do here unless someone > contributes a large pile of new code. Other ways of using the spambayes > codebase do contain pieces capable of talking POP3 (etc), but they're not > part of the Outlook addin. As I understood Dreas' request, it was to have pop3 mail appear in his outlook spam folder, but NOT in his webmail view of the same account, while he is operating on the account remotely (i.e. not locally using outlook). If that was what his request was, then this is not possible afaik. -- Vous exprimer; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From tardistrek1a at yahoo.com Sun Feb 1 19:52:18 2004 From: tardistrek1a at yahoo.com (Mark Springer) Date: Sun Feb 1 19:52:22 2004 Subject: "Neva" "Neva" http://www.geocities.com/tardistrek1a/ http://www.geocities.com/tardistrek1a2/ http://photos.yahoo.com/tardistrek1a/ http://photos.yahoo.com/tardistrek1a2 __________________________________ Do you Yahoo!? Yahoo! SiteBuilder - Free web site building tool. Try it! http://webhosting.yahoo.com/ps/sb/ From vlovich at hotmail.com Sun Feb 1 20:18:08 2004 From: vlovich at hotmail.com (Vitali) Date: Sun Feb 1 20:18:14 2004 Subject: [Spambayes] outlook express? In-Reply-To: <000501c3e90f$881d2ae0$6401a8c0@skipd865perl> References: <000801c3e8f8$a1c87bf0$6401a8c0@skipd865perl> <000501c3e90f$881d2ae0$6401a8c0@skipd865perl> Message-ID: On Sun, 1 Feb 2004 17:05:40 -0500, skip debrossy wrote: > Can you tell me where to find the Experimental Pkg.? > > Thnaks, > Skip > ----- Original Message ----- > From: "Vitali" > To: "skip debrossy" > Sent: Sunday, February 01, 2004 5:00 PM > Subject: Re: [Spambayes] outlook express? > > >> On Sun, 1 Feb 2004 14:21:45 -0500, skip debrossy >> wrote: >> >> > Can Spambayes be used with outlook express? >> > >> > thanks, >> > Skip Debrossy >> >> Yes it can when you setup the pop3proxy. The easiest way I think would >> be >> to install the built Experimental package for windows, although you can >> also build it yourself (there's tutorials I believe in the FAQ) >> > > http://starship.python.net/crew/mhammond/spambayes/ The experimental package. Sorry, but I forgot to add spambayes@python.org as the other recipient on my reply. -- Using M2, Opera's revolutionary e-mail client: http://www.opera.com/m2/ From info at bias.net Sun Feb 1 08:34:33 2004 From: info at bias.net (Yusuf Muraben) Date: Sun Feb 1 21:53:52 2004 Subject: [Spambayes] Error installing Spambayes 8.1 Message-ID: <000a01c3e8c8$23233900$d6df2acb@homewin2k> Installing on to WinXP SP1/Outlook 2002 as local admin. C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry tor try again, ignore to proceed anyway (not recommended), or Abort to cancel installation. I found some other queries, but no FAQ. Any ideas? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040202/2772422c/attachment.html From David_Walker at bmc.com Mon Feb 2 08:33:15 2004 From: David_Walker at bmc.com (Walker, David) Date: Mon Feb 2 08:37:04 2004 Subject: [Spambayes] (no subject) Message-ID: At first I was extremely pleased w/SpamBayes but last week it stopped running. I'm running WinXP. I've removed it and added it again several times but no functions within SB work anymore. Any ideas? Thanks. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040202/58f77ff9/attachment.html From rcoe at CambridgeMA.GOV Mon Feb 2 08:39:14 2004 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Mon Feb 2 08:39:18 2004 Subject: [Spambayes] RE: outlook express? Message-ID: But the Experimental Package is a version (0.85) of the Outlook Plugin, which does not work with Outlook Express. Right? Bob > -----Original Message----- > From: Vitali [mailto:vlovich@hotmail.com] > Sent: Sunday, February 01, 2004 8:18 PM > To: skip debrossy; Discussion list for Pythonic Bayesian classifier > Subject: Re: [Spambayes] outlook express? > > > On Sun, 1 Feb 2004 17:05:40 -0500, skip debrossy > wrote: > > > Can you tell me where to find the Experimental Pkg.? > > > > Thnaks, > > Skip > > ----- Original Message ----- > > From: "Vitali" > > To: "skip debrossy" > > Sent: Sunday, February 01, 2004 5:00 PM > > Subject: Re: [Spambayes] outlook express? > > > > > >> On Sun, 1 Feb 2004 14:21:45 -0500, skip debrossy > >> wrote: > >> > >> > Can Spambayes be used with outlook express? > >> > > >> > thanks, > >> > Skip Debrossy > >> > >> Yes it can when you setup the pop3proxy. The easiest way I think would > >> be to install the built Experimental package for windows, although you > >> can also build it yourself (there's tutorials I believe in the FAQ) > >> > > > > > > http://starship.python.net/crew/mhammond/spambayes/ > The experimental package. Sorry, but I forgot to add spambayes@python.org > as the other recipient on my reply. From skip at pobox.com Mon Feb 2 09:10:23 2004 From: skip at pobox.com (Skip Montanaro) Date: Mon Feb 2 09:10:40 2004 Subject: [Spambayes] Corrupt database In-Reply-To: References: <16411.52642.733060.11807@montanaro.dyndns.org> Message-ID: <16414.23119.683781.149440@montanaro.dyndns.org> >> To put this in context of the current SB apps, the Outlook plugin, >> POP3 proxy and IMAP filter should probably default to using pickled >> dicts, while sb_filter should default to using a database file. I >> don't know that this is the case right now. I think the choice of >> classifier type is application-independent. Maybe we should change >> that. Tim> I'm afraid a pickled dict isn't a good default choice for the Tim> Outlook addin: some people already gripe about the startup time Tim> using bsddb3, and it's much longer with a pickled dict; the time to Tim> save to disk after incremental training is so long with a pickled Tim> dict that the Outlook addin doesn't even try to save a pickled dict Tim> until Outlook shuts down; ... Oh yeah, I forgot the Outlook plugin store()s pretty frequently. Do the other long-running apps? Skip From kennypitt at hotmail.com Mon Feb 2 09:13:28 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Feb 2 09:14:20 2004 Subject: [Spambayes] RE: outlook express? In-Reply-To: Message-ID: Coe, Bob wrote: > But the Experimental Package is a version (0.85) of the Outlook > Plugin, which does not work with Outlook Express. Right? The experimental package contains the latest version of the binary Outlook plug-in, but it also contains the first version of a binary for the sb_server/pop3proxy_tray set of POP3 proxy apps. The installation will choose which one to install by default based on whether or not an Outlook 2000+ installation can be detected, but the user can override the automatic selection. -- Kenny Pitt From Robert.Mezzone at PJSolomon.Com Mon Feb 2 09:21:13 2004 From: Robert.Mezzone at PJSolomon.Com (Robert Mezzone) Date: Mon Feb 2 09:21:19 2004 Subject: [Spambayes] (no subject) Message-ID: <1254A68F4903D411B24800508B1220E9037A7FDE@solomon1.pjsc.internal> If you are running Outlook try "About" "Microsoft Outlook" "Disabled Items". Make sure Spambayes is not disabled. If it is reneable it and then restart Outlook. Worked for me. _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Walker, David Sent: Monday, February 02, 2004 8:33 AM To: 'spambayes@python.org' Subject: [Spambayes] (no subject) At first I was extremely pleased w/SpamBayes but last week it stopped running. I'm running WinXP. I've removed it and added it again several times but no functions within SB work anymore. Any ideas? Thanks. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040202/c000ccae/attachment-0001.html From kennypitt at hotmail.com Mon Feb 2 09:26:50 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Feb 2 09:27:42 2004 Subject: [Spambayes] Sudden Glitch In-Reply-To: <000501c3e859$e4f76de0$9dbf6944@psmaincomputer> Message-ID: John Norton wrote: > Suddenly, my spam folder disappeared. I reestablished a folder > named spam. However, I get an error message that pops up when I > designate a message as spam that says: "You must configure your spam > folder!"" > > What is this telling me, and what do I need to do? I think FAQ 3.13 will answer your question: http://spambayes.sourceforge.net/faq.html#help-i-deleted-the-unsure-spam -folder -- Kenny Pitt From jhall2 at nc.rr.com Mon Feb 2 11:38:40 2004 From: jhall2 at nc.rr.com (Jack Hall) Date: Mon Feb 2 11:38:46 2004 Subject: [Spambayes] (no subject) Message-ID: I am a new user of the Spambayes software and I have a problem. I am using Windows 2000 and Outlook 2000 and the latest version of Spambayes. I installed it last week using the installer and it went in just fine. Everything worked fine for a day of so, then Outlook took a very long time to close and when I rebooted, the Junk email folder was gone. The suspect email folder was still there. I tried to recreate the junk email folder, but it kept telling me to reconfigures, but the Spambayes Manager did not let me configure anything. So I uninstalled the software and reinstalled it a couple of times, with no luck. I looked at the troubleshooting doc and was able to determine that the addin was not listed in the Com add-in spot. So I tried to add it by using your directions in the support doc. I tried to find one file named regsvr32.exe spambayes_addin.dll , but I found the regsvr.exe file and I found the spambayes_addin.dll as separate files. I suspect you are referencing 2 separate files here, but really do not understand the instructions. I also tried to delete all the spambayes files, so I could start over, but got sharing violations messages. Since I am not a very sophisticated user, I decided to send this bug report. Attached are the only log files I could find. There were actually 4 log files listed after a windows search, but when I went to the actual temp folder, I found only 3. I really liked Spambayes for the day or so it worked and would like to use it more. I suppose I could reinstall Outlook to start over, but I have a feeling that won't work. Hope you can help. Jack Hall 919-342-0871 203-702-4432 -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 1050 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040202/96ca6677/spambayes1.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 59 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040202/96ca6677/spambayes3.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 3098 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040202/96ca6677/spambayes4.obj From jhall2 at nc.rr.com Mon Feb 2 12:29:32 2004 From: jhall2 at nc.rr.com (Jack Hall) Date: Mon Feb 2 12:35:52 2004 Subject: [Spambayes] Junk Email Folder Disappeared In-Reply-To: Message-ID: I am a new user of the Spambayes software and I have a problem. I am using Windows 2000 and Outlook 2000 and the latest version of Spambayes. I installed it last week using the installer and it went in just fine. Everything worked fine for a day of so, then Outlook took a very long time to close and when I rebooted, the Junk email folder was gone. The suspect email folder was still there. I tried to recreate the junk email folder, but it kept telling me to reconfigures, but the Spambayes Manager did not let me configure anything. So I uninstalled the software and reinstalled it a couple of times, with no luck. I looked at the troubleshooting doc and was able to determine that the addin was not listed in the Com add-in spot. So I tried to add it by using your directions in the support doc. I tried to find one file named regsvr32.exe spambayes_addin.dll , but I found the regsvr.exe file and I found the spambayes_addin.dll as separate files. I suspect you are referencing 2 separate files here, but really do not understand the instructions. I also tried to delete all the spambayes files, so I could start over, but got sharing violations messages. Since I am not a very sophisticated user, I decided to send this bug report. Attached are the only log files I could find. There were actually 4 log files listed after a windows search, but when I went to the actual temp folder, I found only 3. I really liked Spambayes for the day or so it worked and would like to use it more. I suppose I could reinstall Outlook to start over, but I have a feeling that won't work. Hope you can help. Jack Hall 919-342-0871 203-702-4432 From jhall2 at nc.rr.com Mon Feb 2 12:30:30 2004 From: jhall2 at nc.rr.com (Jack Hall) Date: Mon Feb 2 12:49:54 2004 Subject: [Spambayes] Junk Email Folder Disappeared Message-ID: I am a new user of the Spambayes software and I have a problem. I am using Windows 2000 and Outlook 2000 and the latest version of Spambayes. I installed it last week using the installer and it went in just fine. Everything worked fine for a day of so, then Outlook took a very long time to close and when I rebooted, the Junk email folder was gone. The suspect email folder was still there. I tried to recreate the junk email folder, but it kept telling me to reconfigures, but the Spambayes Manager did not let me configure anything. So I uninstalled the software and reinstalled it a couple of times, with no luck. I looked at the troubleshooting doc and was able to determine that the addin was not listed in the Com add-in spot. So I tried to add it by using your directions in the support doc. I tried to find one file named regsvr32.exe spambayes_addin.dll , but I found the regsvr.exe file and I found the spambayes_addin.dll as separate files. I suspect you are referencing 2 separate files here, but really do not understand the instructions. I also tried to delete all the spambayes files, so I could start over, but got sharing violations messages. Since I am not a very sophisticated user, I decided to send this bug report. Attached are the only log files I could find. There were actually 4 log files listed after a windows search, but when I went to the actual temp folder, I found only 3. I really liked Spambayes for the day or so it worked and would like to use it more. I suppose I could reinstall Outlook to start over, but I have a feeling that won't work. Hope you can help. Jack Hall 919-342-0871 203-702-4432 -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 1050 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040202/db5a8640/spambayes1.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 59 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040202/db5a8640/spambayes3.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 3098 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040202/db5a8640/spambayes4.obj From kennypitt at hotmail.com Mon Feb 2 14:09:50 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Feb 2 14:10:45 2004 Subject: [Spambayes] Junk Email Folder Disappeared In-Reply-To: Message-ID: Jack Hall wrote: > Everything worked fine for a day of so, then Outlook took a very long > time to close and when I rebooted, the Junk email folder was gone. > The suspect email folder was still there. I tried to recreate the > junk email folder, but it kept telling me to reconfigures, but the > Spambayes Manager did not let me configure anything. So I uninstalled > the software and reinstalled it a couple of times, with no luck. The logfiles indicate that the spam folder was deleted, so the instructions in FAQ 3.13 may be helpful. You don't say what you tried to do in SpamBayes Manager that didn't work, but hopefully the steps in this FAQ will be successful. http://spambayes.sourceforge.net/faq.html#help-i-deleted-the-unsure-spam -folder > I looked at the troubleshooting doc and was able to determine that > the addin was not listed in the Com add-in spot. There is a known bug in the 0.81 version that results in the plugin not always showing up in the COM Add-Ins list even if it is installed and running. > ... So I tried to add it > by using your directions in the support doc. I tried to find one file > named regsvr32.exe spambayes_addin.dll , but I found the regsvr.exe > file and I found the spambayes_addin.dll as separate files. I > suspect you are referencing 2 separate files here, but really do not > understand the instructions. "regsvr32.exe" is a Windows command to be executed in a Command Prompt window, and "spambayes_addin.dll" is a parameter to the command. This particular part of the troubleshooting instructions is a shortcut to reinstalling that is intended more for developers and power users, and you should be able to safely ignore it. It doesn't sound like this is your problem anyway (see below). > I also tried to delete all the spambayes files, so I could start > over, but got sharing violations messages. Did you close Outlook before trying to delete the files? If Outlook is open and the SpamBayes plugin is using the files then you will not be able to delete them. This is also a good indication that SpamBayes is probably installed and running even though it isn't in the COM Add-Ins list. -- Kenny Pitt From bobrow at parc.com Mon Feb 2 16:30:25 2004 From: bobrow at parc.com (bobrow@parc.com) Date: Mon Feb 2 16:32:20 2004 Subject: [Spambayes] Recovering from accidentally deleting the folder Spam Message-ID: <58FE49F96F24B84EB95849BF5349E08FF34305@thunderball.parc.xerox.com> I accidentally deleted the Spam folder. Now it insists that I reconfigure. I don't want to lose what it has learned. What to do? danny -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040202/30596e57/attachment.html From tameyer at ihug.co.nz Mon Feb 2 17:05:01 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 2 17:05:21 2004 Subject: [Spambayes] Corrupt database In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E70F28@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A78@its-xchg4.massey.ac.nz> > Oh yeah, I forgot the Outlook plugin store()s pretty > frequently. Do the other long-running apps? sb_server currently store()s at the end of processing each review page, although Tim has suggested that it'd be better for the db to store() after learning each message. sb_imapfilter only store()s at the end of all the training (it would probably be better to store() at least after training each folder). =Tony Meyer From alan at vaneyk.com Mon Feb 2 17:45:52 2004 From: alan at vaneyk.com (Alan Wong) Date: Mon Feb 2 17:51:00 2004 Subject: [Spambayes] Thank You and Unsubscribe Message-ID: Thank you to the Spambayes Team. It has been great for me. In the meantime I wish to unsubscribe from the list. Thanks Again! Alan CAUTION - This message may contain privileged and confidential information intended only for the use of the addressee named above. If you are not the intended recipient of this message you are hereby notified that any use, disclosure, dissemination, copying, distribution or any action taken or omitted to be taken in reliance on it is prohibited and may be unlawful. If you have received this message in error please notify van Eyk Research immediately via email at mail@vaneyk.com.au or by phone on 02 9247 6000. From tim.one at comcast.net Mon Feb 2 17:58:23 2004 From: tim.one at comcast.net (Tim Peters) Date: Mon Feb 2 17:58:26 2004 Subject: [Spambayes] Thank You and Unsubscribe In-Reply-To: Message-ID: [Alan Wong] > Thank you to the Spambayes Team. It has been great for me. You're welcome! > In the meantime I wish to unsubscribe from the list. Then you need to go to http://mail.python.org/mailman/listinfo/spambayes and unsubscribe yourself (go to "To unsubscribe from Spambayes, ...") near the bottom of the page. You subscribed yourself to begin with, so you get yourself out the same way you got yourself in. From tameyer at ihug.co.nz Mon Feb 2 17:59:04 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 2 17:59:23 2004 Subject: [Spambayes] Thank You and Unsubscribe In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E7119D@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778C1@its-xchg4.massey.ac.nz> > Thank you to the Spambayes Team. It has been great for me. Good to hear. > In the meantime I wish to unsubscribe from the list. >From the very message you posted: > List-Unsubscribe: , > =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From berendes at netalyst.com Mon Feb 2 18:16:12 2004 From: berendes at netalyst.com (Christoph Berendes) Date: Mon Feb 2 18:16:31 2004 Subject: [Spambayes] .procmailrc ImportError Shared Object not found? In-Reply-To: <1074470818.71062@netalyst.com> References: <1074470818.71062@netalyst.com> Message-ID: At 5:06 PM -0700 1/18/04, cb@netalyst.com wrote, re [Spambayes] .procmailrc ImportError Shared Object not f: >when I attempt to run sb_filter.py via .procmailrc, I get the following error: > > ImportError: Shared object "libdb3.so.3" not found > >I'd be grateful for troubleshooting hints. For the sake of anyone else running into this (on FreeBSD 4.4), I added the following to the top of my .procmailrc file, and all is well: LD_LIBRARY_PATH=/usr/home/myaccount/usr/local/lib (The full path of the libdb3.so.3 file was /usr/home/myaccount/usr/local/lib/libdb3.so.3) (If anyone can explain why this was necessary to get .procmailrc to run, when sb_filter.py already ran perfectly well from the command line, I'd be grateful.) From billmallory at san.rr.com Mon Feb 2 18:29:16 2004 From: billmallory at san.rr.com (Bill Mallory) Date: Mon Feb 2 18:29:29 2004 Subject: [Spambayes] Spambayes Message-ID: I have downloaded and installed the Spam System. When I click the tab Delete as Spam I get a message which says that I have to enable Spam. Can?t locate information to figure out how to enable Spam. Could you give me some guidance. Bill Mallory -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040202/99780a48/attachment-0001.html From dhartmann at readynetz.com Mon Feb 2 19:42:45 2004 From: dhartmann at readynetz.com (ReadyNetz) Date: Mon Feb 2 19:43:04 2004 Subject: [Spambayes] Outlook 2000 "backup" function blocked by Spambayes Message-ID: Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 2950 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040202/ec23e2a1/attachment.jpe From bob at 1776.com Sun Feb 1 14:28:17 2004 From: bob at 1776.com (Robert K. Coe) Date: Mon Feb 2 20:33:52 2004 Subject: [Spambayes] RE: Architecture of SpamBayes Application In-Reply-To: Message-ID: <001c01c3e8f9$8c24d270$6701a8c0@CambridgeMA.GOV> Does anybody use Pegasus anymore? Until we completed our migration to Outlook/Exchange last spring, Pegasus was our staple POP3 client. IMO, it was one of the best freeware programs ever written, and I do believe it was designed to allow plugins. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: Tim Stone [mailto:tim@fourstonesExpressions.com] > Sent: Friday, January 30, 2004 9:10 AM > To: Rohit Upadhyay; spambayes@python.org > Subject: Re: [Spambayes] Architecture of SpamBayes Application > > > On Fri, 30 Jan 2004 19:24:25 +0530, Rohit Upadhyay > wrote: > > > Also, will the imap filter score messages as the outlook plugin does? > > Yes. However, because there are way too many clients, most of which are > not architected to allow plugins, the imap filter is external to the imap > clients, not integrated into a client like the outlook plugin. From research at solidmatrix.com Tue Feb 3 08:53:39 2004 From: research at solidmatrix.com (Yakov Shafranovich) Date: Tue Feb 3 08:53:57 2004 Subject: [Spambayes] Standards for Filtering Message-ID: <401FA7E3.9030706@solidmatrix.com> Daniel Quinlan of SpamAssassin suggested that I should contact you. I co-chair the Anti-Spam Research Group (ASRG) of the IRTF [asrg.sp.am] together with John Levine. The ASRG does pre-standards work and research for the IETF. We have been thinking about different ways that standards can help the filtering community and what we currently have on the table is standard headers and dynamic filtering updates (like anti-virus programs do). We also have a subgroup for filtering work which is currently being formed (see http://asrg.sp.am/subgroups/filtering.shtml). What I am wondering, is whether the filtering community can benefit from standards and if filtering folks are willing to discuss such standards, and cooperate with each other. Yakov ------- Yakov Shafranovich / asrg shaftek.org SolidMatrix Technologies, Inc. / research solidmatrix.com "I ate your Web page. / Forgive me. It was juicy / And tart on my tongue." (MIT's 404 Message) ------- From upgrade at pe.net Tue Feb 3 09:04:48 2004 From: upgrade at pe.net (Richard Nielsen) Date: Tue Feb 3 09:04:43 2004 Subject: [Spambayes] Receiving same mail multiple times Message-ID: I am using Windows XP home and Microsoft office XP for Outlook. After reading the trouble shooting pages, I still do not see my issue. Prior to loading SpamBayes my mail would only be received once, since loading I receive messages about 4 times. I have Outlook set to leave messages on the server for 14 days as I access my mail from multiple machines and need it available on each machine. The messages are being treated as new each time they are pulled from the server on the machine that has SpamBayes installed. Other than this glitch it seems to be performing satisfactorily. Please advise if there is a fix for this. Richard Nielsen -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 4508 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040203/778e7556/winmail.bin From wsy at merl.com Tue Feb 3 09:06:33 2004 From: wsy at merl.com (Bill Yerazunis) Date: Tue Feb 3 09:06:37 2004 Subject: [Spambayes] Standards for Filtering In-Reply-To: <401FA7E3.9030706@solidmatrix.com> (message from Yakov Shafranovich on Tue, 03 Feb 2004 08:53:39 -0500) References: <401FA7E3.9030706@solidmatrix.com> Message-ID: <200402031406.i13E6Xv01465@localhost.localdomain> From: Yakov Shafranovich Daniel Quinlan of SpamAssassin suggested that I should contact you. I co-chair the Anti-Spam Research Group (ASRG) of the IRTF [asrg.sp.am] together with John Levine. The ASRG does pre-standards work and research for the IETF. We have been thinking about different ways that standards can help the filtering community and what we currently have on the table is standard headers and dynamic filtering updates (like anti-virus programs do). We also have a subgroup for filtering work which is currently being formed (see http://asrg.sp.am/subgroups/filtering.shtml). What I am wondering, is whether the filtering community can benefit from standards and if filtering folks are willing to discuss such standards, and cooperate with each other. Yes, and yes. :) There is already a RFC submitted to specify inoculations (that is, sharing of new spams that got through a user's filter and were subsequently judged spam by a human. The recipients MTA automatically recognizes these inoculations and trains them as spam, so the overall filtering quality of the mesh goes up linearly with the number of participants. Well, that's the theory. :) ) There's also some hope of standardizing on an "I am spam" header. -Bill Yerazunis From research at solidmatrix.com Tue Feb 3 09:09:20 2004 From: research at solidmatrix.com (Yakov Shafranovich) Date: Tue Feb 3 09:09:31 2004 Subject: [Spambayes] Standards for Filtering In-Reply-To: <200402031406.i13E6Xv01465@localhost.localdomain> References: <401FA7E3.9030706@solidmatrix.com> <200402031406.i13E6Xv01465@localhost.localdomain> Message-ID: <401FAB90.9070104@solidmatrix.com> Bill Yerazunis wrote: > From: Yakov Shafranovich > > Daniel Quinlan of SpamAssassin suggested that I should contact you. I > co-chair the Anti-Spam Research Group (ASRG) of the IRTF [asrg.sp.am] > together with John Levine. The ASRG does pre-standards work and research > for the IETF. We have been thinking about different ways that standards > can help the filtering community and what we currently have on the table > is standard headers and dynamic filtering updates (like anti-virus > programs do). We also have a subgroup for filtering work which is > currently being formed (see http://asrg.sp.am/subgroups/filtering.shtml). > > What I am wondering, is whether the filtering community can benefit from > standards and if filtering folks are willing to discuss such standards, > and cooperate with each other. > > Yes, and yes. :) > > There is already a RFC submitted to specify inoculations (that is, sharing > of new spams that got through a user's filter and were subsequently > judged spam by a human. The recipients MTA automatically recognizes > these inoculations and trains them as spam, so the overall filtering > quality of the mesh goes up linearly with the number of participants. > > Well, that's the theory. :) ) > > There's also some hope of standardizing on an "I am spam" header. > Is there anyone who is willing to participate in a mailing list on filtering standards? Yakov ------- Yakov Shafranovich / asrg shaftek.org SolidMatrix Technologies, Inc. / research solidmatrix.com "Some lies are easier to believe than the truth" (Dune) ------- From mike at pumps-parts.com Tue Feb 3 09:55:53 2004 From: mike at pumps-parts.com (Michael Caringi) Date: Tue Feb 3 10:06:04 2004 Subject: [Spambayes] Spambayes not working Message-ID: HAve been using it for several months has been great stopped working today will not go into spam bayes manager PLease help? Windows 2000 Outlook 2000 Michael Caringi Tri-State Pump, Inc. 732-223-3222 p 732-223-2587 f -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040203/d7cf03df/attachment.html From cmessina at umich.edu Tue Feb 3 10:48:23 2004 From: cmessina at umich.edu (Christopher Messina) Date: Tue Feb 3 10:48:27 2004 Subject: [Spambayes] error with sb_imapfilter.py alpha 7 Message-ID: Hello. I've been using the imap filter succesfully for a while, but I hit a wall today. Apparently a message that I've trained as either spam or ham is causing a problem. Here is the session. [13 waynes cmessina]: python2.3 /usr/bin/sb_imapfilter.py -c -e y -D hammie.db SpamBayes IMAP Filter Beta1, version 0.1 (September 2003), using SpamBayes IMAP Filter Web Interface Alpha2, version 0.02 and engine SpamBayes Beta2, version 0.2 (July 2003). Traceback (most recent call last): File "/usr/bin/sb_imapfilter.py", line 825, in ? run() File "/usr/bin/sb_imapfilter.py", line 815, in run imap_filter.Filter() File "/usr/bin/sb_imapfilter.py", line 675, in Filter self.unsure_folder) File "/usr/bin/sb_imapfilter.py", line 590, in Filter for msg in self: File "/usr/bin/sb_imapfilter.py", line 485, in __iter__ yield self[key] File "/usr/bin/sb_imapfilter.py", line 533, in __getitem__ msg.get_substance() File "/usr/bin/sb_imapfilter.py", line 364, in get_substance new_msg = email.Parser.Parser().parsestr(data["RFC822"]) File "/var/tmp/python2.3-2.3.2-root/usr/lib/python2.3/email/Parser.py", line 75, in parsestr File "/var/tmp/python2.3-2.3.2-root/usr/lib/python2.3/email/Parser.py", line 64, in parse File "/var/tmp/python2.3-2.3.2-root/usr/lib/python2.3/email/Parser.py", line 245, in _parsebody email.Errors.BoundaryError: multipart message with no defined boundary I train the filter with the following command: python2.3 /usr/bin/sb_imapfilter.py -t -D hammie.db Is there some way I can track down the offending message? Please let me know if I can provide any more information. -- Chris From jrooks at carrollcoatings.com Tue Feb 3 11:00:30 2004 From: jrooks at carrollcoatings.com (John Rooks) Date: Tue Feb 3 10:56:56 2004 Subject: [Spambayes] John Rooks: SUGGESTION FOR SPAM-BAYES Message-ID: Dear SPAM Gods, I have just discovered and installed your program on my Win98 machine, and I think the statistical SPAM dissemination theory and execution is genius. Thanks! I did have a thought however that might make the program more useful and accurate. It would be great if I could flag a non-SPAM email that I did not want to receive anymore to something other than the SPAM folder. Take for example news items from the Wall Street Journal. This is really not SPAM, and if I flagged it as SPAM, it would pollute my SPAM statistics, which I don't want to do. If you however could place a second layer of filtering in the program which would kill email from identified sources, like news@wsj.com, then I could relieve myself of some of my non-SPAM bothersome email. Now, there may already be a separate Outlook plug-in for this simple filtering which I may be able to use simultaneously with SPAM-Bayes. If you might have a good resolution to this issue, I would be delighted to hear of it. Once the program has really started working for me, I intend to deploy it to other systems at my company, and send you folks a fine juicy donation. Most sincerely, John Rooks Providence RI From tim at fourstonesExpressions.com Tue Feb 3 11:04:18 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Tue Feb 3 11:04:25 2004 Subject: [Spambayes] John Rooks: SUGGESTION FOR SPAM-BAYES In-Reply-To: References: Message-ID: On Tue, 3 Feb 2004 11:00:30 -0500, John Rooks wrote: > Dear SPAM Gods, > > I have just discovered and installed your program on my Win98 machine, > and I > think the statistical SPAM dissemination theory and execution is genius. > Thanks! You're quite welcome! It works good for us, too > > I did have a thought however that might make the program more useful and > accurate. It would be great if I could flag a non-SPAM email that I did > not > want to receive anymore to something other than the SPAM folder. Take for > example news items from the Wall Street Journal. This is really not SPAM, > and if I flagged it as SPAM, it would pollute my SPAM statistics, which I > don't want to do. If you however could place a second layer of filtering > in > the program which would kill email from identified sources, like > news@wsj.com, then I could relieve myself of some of my non-SPAM > bothersome > email. > > Now, there may already be a separate Outlook plug-in for this simple > filtering which I may be able to use simultaneously with SPAM-Bayes. > > If you might have a good resolution to this issue, I would be delighted > to > hear of it. > We have tinkered with n-way classification, but we have not formally implemented it, as it is outside of the interests of the current spambayes contributing developers. But... new developers are always welcome :) > Once the program has really started working for me, I intend to deploy > it to > other systems at my company, and send you folks a fine juicy donation. That's great, please see http://spambayes.sourceforge.net/donations.html -- Vous exprimer; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From papaDoc at videotron.ca Tue Feb 3 11:17:39 2004 From: papaDoc at videotron.ca (papaDoc) Date: Tue Feb 3 11:17:52 2004 Subject: [Spambayes] John Rooks: SUGGESTION FOR SPAM-BAYES In-Reply-To: References: Message-ID: <401FC9A3.4030205@videotron.ca> Hi, > Dear SPAM Gods, Wrong persons It may seem obvious but why not simply unsubscribe to the list ? >I did have a thought however that might make the program more useful and >accurate. It would be great if I could flag a non-SPAM email that I did not >want to receive anymore to something other than the SPAM folder. Take for >example news items from the Wall Street Journal. This is really not SPAM, >and if I flagged it as SPAM, it would pollute my SPAM statistics, which I >don't want to do. If you however could place a second layer of filtering in >the program which would kill email from identified sources, like >news@wsj.com, then I could relieve myself of some of my non-SPAM bothersome >email. > > >If you might have a good resolution to this issue, I would be delighted to >hear of it. > > It may seem obvious but why not simply unsubscribe to the list ? Remi -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From qub at qub.com Tue Feb 3 11:26:17 2004 From: qub at qub.com (Mikhail Yakoubov) Date: Tue Feb 3 11:26:21 2004 Subject: [Spambayes] error with sb_imapfilter.py alpha 7 References: Message-ID: <401FCBA9.97F44FC4@qub.com> Christopher Messina wrote: Disclaimer: I'm not a Spambayes developer, just a user like Christopher. And, before anything else, I'd like to thank anyone who worked and works on Spambayes -- with 300-400 spam messages received daily, and then completely filtered out by the software without a single false positive in late _months_, I really appreciate the work done. > Hello. I've been using the imap filter succesfully for a while, but I hit > a wall today. > > Apparently a message that I've trained as either spam or ham is causing a > problem. No. From my experience, it's a classification, not training, problem. There is a malformed message in your Inbox, declaired as multipart in its headers but without mutipart boudaries in the body. You have to find and weed it out. If you get only few messages in your inbox, you should view them one by one, looking for a multipart header and no boundaries (or just with an opening and no closing boundary) in the corresponding body. If you get many messages, like I do, you can employ dividing-by-half strategy, temporarily moving a half of your inbox messages to a temp folder, then running classifier, then iterating. I stick to preserve (i. e., set them manually) "unread" flags on messages, though not sure if it's really neccessary for Spambayes to consider the inbox letters being yet unclassified. Or, you can try to check if you've got a message with subj "acts quicker and lasts much longer!" -- this was my malformed message of the day today. To dev team: recent days I ran into this kind of error every day, so it's became a problem. Dividing-by-half solution is realy time-consuming for large IMAP inboxes like mine. The solution seems to be to trap the underlying python mime library exception, and then treat the message as a non-mutipart one in spite of the multipart header. The effort-saving temp workaround would be to include a clue (subj, date or Message-ID) on what message has caused the error in the exception output, so a user can quickly locate and delete it herself. Hope this helps, Mike Yakoubov. > Here is the session. > > [13 waynes cmessina]: python2.3 /usr/bin/sb_imapfilter.py -c -e y -D hammie.db > SpamBayes IMAP Filter Beta1, version 0.1 (September 2003), > using SpamBayes IMAP Filter Web Interface Alpha2, version 0.02 > and engine SpamBayes Beta2, version 0.2 (July 2003). > > Traceback (most recent call last): > File "/usr/bin/sb_imapfilter.py", line 825, in ? > run() > File "/usr/bin/sb_imapfilter.py", line 815, in run > imap_filter.Filter() > File "/usr/bin/sb_imapfilter.py", line 675, in Filter > self.unsure_folder) > File "/usr/bin/sb_imapfilter.py", line 590, in Filter > for msg in self: > File "/usr/bin/sb_imapfilter.py", line 485, in __iter__ > yield self[key] > File "/usr/bin/sb_imapfilter.py", line 533, in __getitem__ > msg.get_substance() > File "/usr/bin/sb_imapfilter.py", line 364, in get_substance > new_msg = email.Parser.Parser().parsestr(data["RFC822"]) > File "/var/tmp/python2.3-2.3.2-root/usr/lib/python2.3/email/Parser.py", line 75, in parsestr > File "/var/tmp/python2.3-2.3.2-root/usr/lib/python2.3/email/Parser.py", line 64, in parse > File "/var/tmp/python2.3-2.3.2-root/usr/lib/python2.3/email/Parser.py", line 245, in _parsebody > email.Errors.BoundaryError: multipart message with no defined boundary > > I train the filter with the following command: > python2.3 /usr/bin/sb_imapfilter.py -t -D hammie.db > > Is there some way I can track down the offending message? > Please let me know if I can provide any more information. > > -- > Chris > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From wsy at merl.com Tue Feb 3 11:28:00 2004 From: wsy at merl.com (Bill Yerazunis) Date: Tue Feb 3 11:28:55 2004 Subject: [Spambayes] Standards for Filtering In-Reply-To: <401FAB90.9070104@solidmatrix.com> (message from Yakov Shafranovich on Tue, 03 Feb 2004 09:09:20 -0500) References: <401FA7E3.9030706@solidmatrix.com> <200402031406.i13E6Xv01465@localhost.localdomain> <401FAB90.9070104@solidmatrix.com> Message-ID: <200402031628.i13GS0q02423@localhost.localdomain> From: Yakov Shafranovich > of new spams that got through a user's filter and were subsequently > judged spam by a human. The recipients MTA automatically recognizes > these inoculations and trains them as spam, so the overall filtering > quality of the mesh goes up linearly with the number of participants. > > Well, that's the theory. :) ) > > There's also some hope of standardizing on an "I am spam" header. > Is there anyone who is willing to participate in a mailing list on filtering standards? Yes, count me in. Jonathan Zdziarski may also be interested. (he's the other half of the inoculation RFC) -Bill Yerazunis From anthony at interlink.com.au Tue Feb 3 11:28:41 2004 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Feb 3 11:30:22 2004 Subject: [Spambayes] error with sb_imapfilter.py alpha 7 In-Reply-To: <401FCBA9.97F44FC4@qub.com> Message-ID: <20040203162841.6860025B4BA@bonanza.off.ekorp.com> >>> Mikhail Yakoubov wrote > No. From my experience, it's a classification, not training, problem. > There is a malformed message in your Inbox, declaired as multipart in > its headers but without mutipart boudaries in the body. You have to find > and weed it out. FWIW, I plan that a future version of python's email package will be much more forgiving of this sort of barfage. Doesn't help much at this point, of course... From jacobe at classicaccessories.com Tue Feb 3 11:37:23 2004 From: jacobe at classicaccessories.com (Jacob Engelstein) Date: Tue Feb 3 11:40:03 2004 Subject: [Spambayes] Junk Mail Message-ID: I have been using spam-bayes for several weeks, can I delete my junk mail folder without sacrificing the accumulation of information learned by the software in order to better control in-bound spam? By the way I am thrilled with the software's performance. Before installing the software I was receiving about 100 spam emails a day , today I only now see very few of them, it is working great, thanks Jacob Engelstein Classic Accessories Tel 253 395 3900 X 212 Fax 253 395 3991 E Mail jacobe@classicaccessories.com -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040203/26e7084e/attachment-0001.html From kennypitt at hotmail.com Tue Feb 3 11:40:15 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Feb 3 11:41:10 2004 Subject: [Spambayes] John Rooks: SUGGESTION FOR SPAM-BAYES In-Reply-To: Message-ID: John Rooks wrote: > It would be great if I could flag a non-SPAM email that > I did not want to receive anymore to something other than the SPAM > folder. Take for example news items from the Wall Street Journal. > This is really not SPAM, and if I flagged it as SPAM, it would > pollute my SPAM statistics, which I don't want to do. Most of us define spam as any e-mail you receive that you don't want, so by that definition your message from the WSJ could be qualified as spam. The Outlook plug-in only tracks statistics for the duration of your current Outlook session anyway, so there probably isn't much to pollute either way. > ... If you however > could place a second layer of filtering in the program which would > kill email from identified sources, like news@wsj.com, then I could > relieve myself of some of my non-SPAM bothersome email. > > Now, there may already be a separate Outlook plug-in for this simple > filtering which I may be able to use simultaneously with SPAM-Bayes. If you just need to filter on a specific To or From address then the standard Outlook rules should handle that. Just make sure that you turn on the SpamBayes background filtering option in the Advanced tab of SpamBayes Manager so that your rule will process the message before SpamBayes does. -- Kenny Pitt From tim.one at comcast.net Tue Feb 3 11:46:55 2004 From: tim.one at comcast.net (Tim Peters) Date: Tue Feb 3 11:47:00 2004 Subject: [Spambayes] John Rooks: SUGGESTION FOR SPAM-BAYES In-Reply-To: Message-ID: [John Rooks] > ... > I did have a thought however that might make the program more useful > and accurate. It would be great if I could flag a non-SPAM email that > I did not want to receive anymore to something other than the SPAM > folder. Take for example news items from the Wall Street Journal. > This is really not SPAM, and if I flagged it as SPAM, it would > pollute my SPAM statistics, which I don't want to do. If you however > could place a second layer of filtering in the program which would > kill email from identified sources, like news@wsj.com, then I could > relieve myself of some of my non-SPAM bothersome email. > > Now, there may already be a separate Outlook plug-in for this simple > filtering which I may be able to use simultaneously with SPAM-Bayes. If you're using Outlook, Outlook comes with an elaborate rule-based system of its own. Go to Tools -> Rule Wizard, and define an Outlook rule to move email from news@wsj.com to whatever folder you like. Do not tell SpamBayes to watch that folder. The only remaining problem then is that Outlook *may* (or may not) tell SpamBayes about a new message from news@wsj.com before your Outlook rule moves it. To fix that, go to SpamBayes -> SpamBayes Manager -> Advanced, and enable background filtering. If you set the delays long enough, Outlook will consistently apply its own rules before telling SpamBayes about new messages. This is basically icky, but I'm afraid it's the best we've been able to do (Outlook isn't cooperative here -- it's a battle). From marvallen1 at comcast.net Tue Feb 3 12:23:30 2004 From: marvallen1 at comcast.net (marvallen) Date: Tue Feb 3 12:43:28 2004 Subject: [Spambayes] Where did it go? Message-ID: <000001c3ea7a$72661bf0$6500a8c0@oempnck9i2acp1> Hello, I installed your program and can't seem to find or get new email. I've searched for a holding file but without success. Is there a way to configure the program by getting my email and then letting Spambayes know what to do with it? Thank you, Marv 248 491-2000 PS: I will have to get your response from another connected computer. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040203/c561133d/attachment.html From Rohit.Upadhyay at oracle.com Tue Feb 3 13:19:52 2004 From: Rohit.Upadhyay at oracle.com (Rohit Upadhyay) Date: Tue Feb 3 13:18:17 2004 Subject: [Spambayes] Imap Filter Setup Error Message-ID: <401FE648.9E47B517@oracle.com> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: Rohit.Upadhyay.vcf Type: text/x-vcard Size: 298 bytes Desc: Card for Rohit Upadhyay Url : http://mail.python.org/pipermail/spambayes/attachments/20040203/69c7d056/Rohit.Upadhyay.vcf From bkc at murkworks.com Tue Feb 3 13:26:45 2004 From: bkc at murkworks.com (Brad Clements) Date: Tue Feb 3 13:25:44 2004 Subject: [Spambayes] RE: Architecture of SpamBayes Application In-Reply-To: <001c01c3e8f9$8c24d270$6701a8c0@CambridgeMA.GOV> References: Message-ID: <401FA136.1572.1A788697@localhost> On 1 Feb 2004 at 14:28, Robert K. Coe wrote: > Does anybody use Pegasus anymore? Until we completed our migration to > Outlook/Exchange last spring, Pegasus was our staple POP3 client. IMO, it > was one of the best freeware programs ever written, and I do believe it was > designed to allow plugins. > I use Pegasus Mail and I have nearly completed integrating Spambayes code into Pegasus Mail 4.2 Maybe another 30 days to public beta. -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From kennypitt at hotmail.com Tue Feb 3 15:26:55 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Feb 3 15:27:51 2004 Subject: [Spambayes] Junk Mail In-Reply-To: Message-ID: Yes, you can delete the messages from your spam folder (but be careful to delete the contents and not the folder itself). The SpamBayes training data is stored separately. You can also use Outlook's auto-archive feature to automatically delete messages from the spam folder after a certain number of days. -- Kenny Pitt _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Jacob Engelstein Sent: Tuesday, February 03, 2004 11:37 AM To: spambayes@python.org Subject: [Spambayes] Junk Mail I have been using spam-bayes for several weeks, can I delete my junk mail folder without sacrificing the accumulation of information learned by the software in order to better control in-bound spam? By the way I am thrilled with the software's performance. Before installing the software I was receiving about 100 spam emails a day , today I only now see very few of them, it is working great, thanks -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040203/7276979f/attachment.html From kennypitt at hotmail.com Tue Feb 3 15:38:10 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Feb 3 15:39:14 2004 Subject: [Spambayes] Where did it go? In-Reply-To: <000001c3ea7a$72661bf0$6500a8c0@oempnck9i2acp1> Message-ID: If you do not provide SpamBayes with any messages for initial training, it will initially flag all incoming messages as "unsure" waiting for you to tell it which ones are spam and which are good. These unsure messages are moved to a separate folder that is, by default, named "Junk Suspects". Try to locate that folder and all of your mail should be in there. Then use the "Delete as Spam" and "Recover from Spam" buttons on the SpamBayes toolbar to classify each of the messages. SpamBayes will quickly learn the characteristics of your good and spam messages, and you will begin to see most of your messages either remain in your Inbox or get moved to your spam folder. If you have trouble locating the "Junk Suspects" folder, you should be able to use Outlook's Advanced Find feature to search for all unread messages. FAQ 3.12 may help with that. http://spambayes.sourceforge.net/faq.html#some-of-my-mail-is-going-missi ng -- Kenny Pitt _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of marvallen Sent: Tuesday, February 03, 2004 12:24 PM To: spambayes@python.org Subject: [Spambayes] Where did it go? I installed your program and can't seem to find or get new email. I've searched for a holding file but without success. Is there a way to configure the program by getting my email and then letting Spambayes know what to do with it? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040203/87cb4285/attachment.html From tameyer at ihug.co.nz Tue Feb 3 17:03:41 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Feb 3 17:04:02 2004 Subject: [Spambayes] error with sb_imapfilter.py alpha 7 In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E713BA@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A7C@its-xchg4.massey.ac.nz> [Mikhail Yakoubov] > No. From my experience, it's a classification, not training, problem. > There is a malformed message in your Inbox, declaired as multipart in > its headers but without mutipart boudaries in the body. You have to > find and weed it out. [Anthony] > FWIW, I plan that a future version of python's email package > will be much more forgiving of this sort of barfage. > > Doesn't help much at this point, of course... Also not helping much at this point, but something that will help sooner than the improvements to the email package, is that the next version of imapfilter handles these much more gracefully. When messages can't be parsed, it simply prints out a note about that and continues on. Those messages won't be filtered, but everything else still will be. Eventually, imapfilter should handle this like sb_server does, adding a X-SpamBayes-Exception: header with the problem - this is bug #737967. For the moment, though, the current solution will do ok. A new release should be out pretty soon (and includes other improvements to imapfilter), so your best bet is to watch the announce list and grab it when it does. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Tue Feb 3 17:08:40 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Feb 3 17:09:54 2004 Subject: [Spambayes] Imap Filter Setup Error In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E713D1@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778CA@its-xchg4.massey.ac.nz> > I am now trying to get the imap filter working, but am > running into issues with configuration. I downloaded the > imap filter: spambayes-1.0a7.zip. [...] > when I click on 'configure folders to filter' or > 'configure folders to train' it gives the following error: [...] > ExpatError: not well-formed (invalid token): line 1, column 31 This is a known bug with the 1.0a7 and is fixed in the next release. The problem is that one or more folders on your server have a <, >, or & in them, and they aren't properly handled before displaying the web interface. For the moment, the solution is to manually configure the folders that you want to configure. IOW, open up your bayescustomize.ini file and add the appropriate options (all in the [imap] section). The FAQ explains how to get information about what options to set and what values are appropriate. In addition, keep track of the announce list, and upgrade to the new version of spambayes/imapfilter when it is released - this includes both the fix for this and many other imapfilter improvements (the release includes details about what has changed). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From RBrault at expertwireless.com Tue Feb 3 17:33:47 2004 From: RBrault at expertwireless.com (Richard Brault) Date: Tue Feb 3 17:33:57 2004 Subject: [Spambayes] (no subject) Message-ID: Project Coordinator eXpert Wireless Solutions Inc. A TetraTech Company Phone: 703-760-9180 Fax: 703-760-9182 Confidentiality Notice: This email communication may contain CONFIDENTIAL INFORMATION WHICH MAY ALSO BE LEGALLY PRIVILEGED and is intended only for the use of the intended recipients identified above. If you are not the intended recipient of this communication, you are hereby notified that any unauthorized review, use, dissemination, distribution, downloading, or copying of this communication is strictly prohibited. If you have received this communication in error, please immediately notify us by reply email, delete the communication and destroy all copies. From thetasig at comcast.net Tue Feb 3 19:53:36 2004 From: thetasig at comcast.net (thetasig) Date: Tue Feb 3 19:53:39 2004 Subject: [Spambayes] Update to new version or Not? Message-ID: <40204290.3000206@comcast.net> OK - I've been using an older version of SpamBayes (1.0a5) for quite some time with no problems and no complaints. My question is whether or not it is worth the 'hassle' to get the latest version and recompile it all, reinstall the Windows XP service, etc. (which I know isn't all *that* difficult but for some reason I still nearly dread having to start all over again). Are there compelling reasons to upgrade? Using Mozilla as a mail client - and just love it. Thank you. -=mark=- From tameyer at ihug.co.nz Tue Feb 3 20:08:03 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Feb 3 20:09:04 2004 Subject: [Spambayes] Update to new version or Not? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E71477@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778CB@its-xchg4.massey.ac.nz> > OK - I've been using an older version of SpamBayes (1.0a5) for quite > some time with no problems and no complaints. My question is > whether or not it is worth the 'hassle' to get the latest version and > recompile it all, Well, Python will do all your compiling for you... > reinstall the Windows XP service, etc. (which I know > isn't all *that* difficult but for some reason I still nearly dread > having to start all over again). It should only be three steps: 1. Download & expand 1.0a7 2. Run [in sb dir] "python setup.py install" 3. Run [in sb dir] "windows/pop3proxy_service.py install" > Are there compelling reasons to upgrade? Easiest for you to decide yourself. Take a look at the 1.0a6 and 1.0a7 release notes (just the 'what's new' section should be enough, rather than the rather more explicit changelog) and see if there's anything there that you think is worth having. You can view the release notes without downloading the archive via the sourceforge download page - just click on the release heading; "1.0a7", for example. Note that we're very close to the next release, so if it's a lot of hassle to upgrade, you're probably better off waiting for that. That'll also include a binary installer for the service, so installation will be a one step (well, launch installer and press return a lot of times) process. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Tue Feb 3 20:18:52 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Feb 3 20:19:10 2004 Subject: [Spambayes] Spambayes not working In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E71391@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A7D@its-xchg4.massey.ac.nz> > stopped working today will not go into spam bayes manager If you click on the SpamBayes button, do you get the drop-down menu? If not, then SpamBayes was probably disabled by Outlook. Do Help->About Microsoft Outlook->Disabled Items, and, if it's there, reenable it. Otherwise, your log file would greatly help in figuring out what the problem is. The troubleshooting guide (installed with the plug-in, and also on the website), and the FAQ explain where to find this, if you don't know. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From nowhere at cox.net Tue Feb 3 22:39:58 2004 From: nowhere at cox.net (Nowhere) Date: Tue Feb 3 22:39:59 2004 Subject: [Spambayes] Concisions effort to defeat statistical filtering Message-ID: <000101c3ead0$904cdf40$0100a8c0@gandalf> Hi all! I just installed spambayes and it is great for most messages. The most troubling ones for me now are the ones where the messages are identical except for a list of random words at the end of the message. In most cases it is an html message where the color of the font is the same as the background. Any ideas how to defeat this in an automated way? Spambayes setting to only check only certain parts of the messages? Thanks, Eric -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040203/859f2775/attachment.html From tameyer at ihug.co.nz Wed Feb 4 03:19:43 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 03:20:11 2004 Subject: [Spambayes] Error installing Spambayes 8.1 In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E70E3A@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778D1@its-xchg4.massey.ac.nz> > C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll > Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. This is a known problem without a known solution, although various things have worked for various people (there is a bug tracker on sourceforge that documents them). Your best bet is to either wait until the next release, which is built with a difference process, and (it seems) fixes this - it's hopefully going to be out next week - or to try the experimental build that's on Mark's website (this is more-or-less what the next release will be). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 4 03:22:03 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 03:22:25 2004 Subject: [Spambayes] Recovering from accidentally deleting the folder Spam In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E70F92@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A80@its-xchg4.massey.ac.nz> > I accidentally deleted the Spam folder. > Now it insists that I reconfigure. You're after FAQ 3.13: > I don't want to lose what it has learned. What to do? Note that you won't lose any training data. This isn't stored in Outlook, so it doesn't matter what's in your Spam folder (unless you retrain from scratch). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 4 03:23:49 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 03:24:16 2004 Subject: [Spambayes] Spambayes In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E711BB@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778D3@its-xchg4.massey.ac.nz> > I have downloaded and installed the Spam System. > When I click the tab Delete as Spam I get a message > which says that I have to enable Spam. Can't locate > information to figure out how to enable Spam. You're after FAQ 3.7: =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From dontbotherworld at yahoo.com Wed Feb 4 03:31:42 2004 From: dontbotherworld at yahoo.com (dont bother) Date: Wed Feb 4 03:31:47 2004 Subject: [Spambayes] Spam Database Message-ID: <20040204083142.46475.qmail@web60805.mail.yahoo.com> Does any one know of any spam database for public for testing? Thanks Dont __________________________________ Do you Yahoo!? Yahoo! SiteBuilder - Free web site building tool. Try it! http://webhosting.yahoo.com/ps/sb/ From Amir_Katz at bmc.com Wed Feb 4 03:35:23 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Wed Feb 4 03:36:48 2004 Subject: [Spambayes] Need a shortcut for the Outlook plug-in Message-ID: I guess this is more like an enhancement request than a question, but still on can try... This is regarding the Outlook 2000 plug-in. Is there a way to expand the Spambayes menu item 'Show spam clues for current message' into a shortcut-able item, so that I can invoke it via a keyboard shortcut (Alt-) or to associate it with a button? As far as I can see, the Spambayes menu does not appear in Menu->Customize->Commands, but still... Amir "The C Programming Language - A language which combines the flexibility of assembly language with the power of assembly language." From tameyer at ihug.co.nz Wed Feb 4 03:37:41 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 03:38:03 2004 Subject: [Spambayes] Spam Database In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E7154C@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778D6@its-xchg4.massey.ac.nz> > Does any one know of any spam database for public for > testing? A good one is the SpamAssassin public archive, because it includes both ham and spam corpora. (I think it was last updated about a year ago, though). =Tony Meyer From tameyer at ihug.co.nz Wed Feb 4 03:44:31 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 03:44:46 2004 Subject: [Spambayes] Spam Database In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13046778D6@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A82@its-xchg4.massey.ac.nz> > > Does any one know of any spam database for public for testing? > > A good one is the SpamAssassin public archive, because it > includes both ham and spam corpora. (I think it was last > updated about a year ago, though). > Opps. I meant to write more than that ;) There's also . And (this was used in early SpamBayes testing), and goes from 1998 to 2004: And (haven't used these): , and (this last one has some interesting stuff on the page, but the spam is only available in mbox format by request). =Tony Meyer From Rohit.Upadhyay at oracle.com Wed Feb 4 07:53:23 2004 From: Rohit.Upadhyay at oracle.com (Rohit Upadhyay) Date: Wed Feb 4 07:51:47 2004 Subject: [Spambayes] imap filter Query Message-ID: <4020EB43.E81A95CA@oracle.com> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: Rohit.Upadhyay.vcf Type: text/x-vcard Size: 298 bytes Desc: Card for Rohit Upadhyay Url : http://mail.python.org/pipermail/spambayes/attachments/20040204/d9209b4a/Rohit.Upadhyay.vcf From qub at qub.com Wed Feb 4 08:06:47 2004 From: qub at qub.com (Mikhail Yakoubov) Date: Wed Feb 4 08:06:55 2004 Subject: [Spambayes] imap filter Query References: <4020EB43.E81A95CA@oracle.com> Message-ID: <4020EE67.28E19793@qub.com> Rohit Upadhyay wrote: > Hello, > I am using the imap filter(Thanks to Tony Meyer's help, I got it > working). > > Filtering is working properly & in all emails 3 headers are getting > added lik :- > > X-Spambayes-MailId:1075888978 > X-Spambayes-Classification: unsure > X-Spambayes-MailId: 1075888978 Always wondered if the X-Spambayes-MailId duplication is a bug or a feature for a purpose I'm not aware of. Can anyone clear this? > However, no header with the score is getting added, which I believe > should. Is it possible to get the spam score with the imap filter? > I understand, when we use the Outlook2000 plugin, a header gets added > to the mail with the spam score, which in turn can be viewed in the > Outlook2K header list? > > Will appriciate if anyone could clarify my doubts. You should have [Headers] include_score:True set in bayescustomize.ini . Works for me. Best, Mike Yakoubov. From monica at experienceplus.com Wed Feb 4 10:19:06 2004 From: monica at experienceplus.com (Monica Price) Date: Wed Feb 4 10:16:41 2004 Subject: [Spambayes] using spambayes, on xp, outlook2003, I am unable to use spambayes, even after repeated install/uninstalls - here is log file Message-ID: Loaded bayes database from 'C:\Documents and Settings\monica\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\monica\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 886 spam and 245 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.1.2600 (Service Pack 1) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] ERROR: 'There was an error initializing the Spam plugin.\r\n\r\nSpam filtering has been disabled. Please re-configure\r\nand re-enable this plugin\r\n\r\nError details:\r\nCould not watch the specified folders' Traceback (most recent call last): File "out1.pyz/addin", line 1237, in FiltersChanged File "out1.pyz/addin", line 1248, in UpdateFolderHooks File "out1.pyz/addin", line 1276, in _HookFolderEvents File "out1.pyz/win32com.client", line 250, in DispatchWithEvents File "out1.pyz/win32com.client.gencache", line 427, in EnsureModule File "out1.pyz/win32com.client.gencache", line 236, in MakeModuleForTypelib File "out1.pyz/win32com.client.makepy", line 259, in GenerateFromTypeLibSpec IOError: [Errno 13] Permission denied: 'C:/Program Files/Spambayes Outlook Addin\\support\\gen_py\\00062FFF-0000-0000-C000-000000000046x0x9x2.py' Error processing missed messages! Traceback (most recent call last): File "out1.pyz/addin", line 1193, in OnConnection File "out1.pyz/addin", line 1219, in ProcessMissedMessages File "out1.pyz/addin", line 1263, in _GetHookForFolder KeyError: ('\x00\x00\x00\x008\xa1\xbb\x10\x05\xe5\x10\x1a\xa1\xbb\x08\x00+*V\xc2\x 00\x00EMSMDB.DLL\x00\x00\x00\x00\x00\x00\x00\x00\x1bU\xfa \xaaf\x11\xcd\x9b\xc8\x00\xaa\x00/\xc4Z\x0c\x00\x00\x00VOYAGER\x00/o=EXP ERIENCEPLUS2/ou=first administrative group/cn=Recipients/cn=monica\x00\xd85!\xf3\xaa\x00\x00\x00\x01\x00\x00\ x00\x14\x00\x00\x00l\x00\x00\x00/o=EXPERIENCEPLUS2/ou=first administrative group/cn=Configuration/cn=Servers/cn=VOYAGER\x00v\x00o\x00y\x00a\x00g\x0 0e\x00r\x00.\x00e\x00x\x00p\x00e\x00r\x00i\x00e\x00n\x00c\x00e\x00p\x00l \x00u\x00s\x002\x00.\x00l\x00o\x00c\x00a\x00l\x00\x00\x00\x00\x00', '\x00\x00\x00\x00J/^\xf8~\xa1\nO\x87\xde\xab\xaa,\xac\xd6\xf3\x01\x00\xc c\xd5\n\\\xdc(\xf0H\xa0\xe4:\xa5\x15: %\x00\x00\x00\x01\xb3\xc6\x00\x00') pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "out1.pyz/win32com.universal", line 170, in dispatch File "out1.pyz/win32com.server.policy", line 322, in _InvokeEx_ File "out1.pyz/win32com.server.policy", line 601, in _invokeex_ File "out1.pyz/win32com.server.policy", line 541, in _invokeex_ File "out1.pyz/addin", line 1338, in OnStartupComplete File "out1.pyz/win32com.client", line 302, in WithEvents File "out1.pyz/win32com.client.gencache", line 427, in EnsureModule File "out1.pyz/win32com.client.gencache", line 236, in MakeModuleForTypelib File "out1.pyz/win32com.client.makepy", line 259, in GenerateFromTypeLibSpec exceptions.IOError: [Errno 13] Permission denied: 'C:/Program Files/Spambayes Outlook Addin\\support\\gen_py\\00062FFF-0000-0000-C000-000000000046x0x9x2.py' SpamBayes - Disconnecting from Outlook SpamBayes processed 0 messages, finding 0 spam and 0 unsure Addin terminating: 0 COM client and 1 COM servers exist. Monica Malpezzi Price Director of Operations Monica@ExperiencePlus.com 1-800-685-4565 www.ExperiencePlus.com -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040204/f9cde597/attachment.html From marc at shadowriders.org Wed Feb 4 10:22:24 2004 From: marc at shadowriders.org (Marc 'Merlin' Mauss) Date: Wed Feb 4 10:23:09 2004 Subject: [Spambayes] You CAN move spam to "deleted items" Message-ID: In your FAQ, you state: "3.11 Why can't I set spam to be moved to the Deleted Items folder? The problem with this is that you can also set SpamBayes to train all messages moved to the designated spam folder. If you set the deleted items folder as the spam folder (early versions of the plug-in allowed this), then all messages that you delete would be trained as spam. To get this restriction removed, you'll have to convince the developers that there is a way to do this without confusing people - for example, if we let you choose the deleted items folder as the spam folder, only if the 'incremental training' option was off, people would get confused about why it sometimes works and sometimes doesn't. Note that Outlook 2003 has a "Junk Mail" folder that has many of the deleted items folder's properties, and you can get SpamBayes to move spam to this folder. You may also find some good advice in the answer to the question about getting SpamBayes to delete spam." There is a very simple way to work with this. Specify a sub-folder within "Deleted Items" as the spam folder. That way, only items in the sub-folder will be trained as spam, yet they will be removed when you "empty deleted items". Marc "Merlin" Mauss RRR01 Honorary CSA Consul to Cyberspace MSF Instructor Coram (Long Island), NY '83 Sabre V45 "Just for Fun" gone but not forgotten '85 Shadow VT1100C - "Barney" (it's PURPLE) Just a memory '86 Shadow VT1100C - (hers) '86 Shadow VT1100C - "Little Wing" '84 Voyager 1300 - "The Beast" Cordura IS a fashion statement. From kennypitt at hotmail.com Wed Feb 4 10:42:47 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Feb 4 10:43:40 2004 Subject: [Spambayes] using spambayes, on xp, outlook2003, I am unable to use spambayes, even after repeated install/uninstalls - here is log file In-Reply-To: Message-ID: Monica Price wrote: > IOError: [Errno 13] Permission denied: 'C:/Program Files/Spambayes Outlook Addin\\support\\gen_py\\00062FFF-0000-0000-C000-000000000046x0x9x2.py' This error almost always indicates that you are running Outlook 2003 on a system that never had Outlook 2000 installed, and you do not have administrative privileges. If Oulook 2000 was never installed, SpamBayes needs to update some files in the program installation directory, and it needs administrative privileges to do that. To get around this, you can have an administrator load Outlook and SpamBayes once on your machine, which will cause the appropriate files to be updated. Alternatively, you can wait for the next release, which solves this problem. We are tentatively planning on releasing an update sometime next week. -- Kenny Pitt From kennypitt at hotmail.com Wed Feb 4 11:12:48 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Feb 4 11:15:26 2004 Subject: [Spambayes] You CAN move spam to "deleted items" In-Reply-To: Message-ID: Marc 'Merlin' Mauss wrote: > There is a very simple way to work with this. Specify a sub-folder > within "Deleted Items" as the spam folder. That way, only items in > the sub-folder will be trained as spam, yet they will be removed when > you "empty deleted items". Unfortunately, "empty deleted items" will also remove the subfolder. When that happens, SpamBayes will have no place to move the spam and it will disable itself. -- Kenny Pitt From ian at experienceplus.com Wed Feb 4 11:26:55 2004 From: ian at experienceplus.com (Ian Wells) Date: Wed Feb 4 11:24:23 2004 Subject: [Spambayes] using spambayes, on xp, outlook2003, I am unable to use spambayes, even after repeated install/uninstalls - here is log file Message-ID: Thanks for the very quick response and diagnosis :-) That fits. We will wait for your update next week. Ian -----Original Message----- From: Kenny Pitt [mailto:kennypitt@hotmail.com] Sent: Wednesday, February 04, 2004 8:43 AM To: Monica Price; spambayes@python.org Cc: Ian Wells Subject: RE: [Spambayes] using spambayes, on xp, outlook2003,I am unable to use spambayes,even after repeated install/uninstalls - here is log file Monica Price wrote: > IOError: [Errno 13] Permission denied: 'C:/Program Files/Spambayes Outlook Addin\\support\\gen_py\\00062FFF-0000-0000-C000-000000000046x0x9x2.py' This error almost always indicates that you are running Outlook 2003 on a system that never had Outlook 2000 installed, and you do not have administrative privileges. If Oulook 2000 was never installed, SpamBayes needs to update some files in the program installation directory, and it needs administrative privileges to do that. To get around this, you can have an administrator load Outlook and SpamBayes once on your machine, which will cause the appropriate files to be updated. Alternatively, you can wait for the next release, which solves this problem. We are tentatively planning on releasing an update sometime next week. -- Kenny Pitt From berendes at netalyst.com Wed Feb 4 11:32:55 2004 From: berendes at netalyst.com (Christoph Berendes) Date: Wed Feb 4 11:34:17 2004 Subject: [Spambayes] Train reliably on "forwarded" messages? Message-ID: I run spambayes on my mail server using procmail and it works brilliantly. Thank you thank you thank you. I'm mostly reading my mail via Eudora on the mac. What I'd like to do, for training, is forward "spam to be trained" to a special address (e.g. train-spam@myserver.com) and similarly for ham - train-ham@mailserver.com. I would then run the mailboxtrain.py on the server on the inboxes for those two dummy accounts. However, I realize that the mail messages in these two inboxes will look a little different than when they showed up in my inbox ("Forwarded" headers, addressed to "train-spam" rather than "berendes", >>> down the side). WIll this "forward junk" throw off the training process? From perl at rhesa.com Wed Feb 4 11:53:58 2004 From: perl at rhesa.com (Rhesa Rozendaal) Date: Wed Feb 4 11:56:52 2004 Subject: [Spambayes] Train reliably on "forwarded" messages? In-Reply-To: References: Message-ID: <402123A6.20001@rhesa.com> Christoph Berendes wrote: > I run spambayes on my mail server using procmail and it works brilliantly. Thank you thank you thank you. I use the same setup. > I'm mostly reading my mail via Eudora on the mac. I use Mozilla Thunderbird ;^) > What I'd like to do, for training, is forward "spam to be trained" to a special address (e.g. train-spam@myserver.com) and similarly for ham - train-ham@mailserver.com. I would then run the mailboxtrain.py on the server on the inboxes for those two dummy accounts. You can use procmail to store a copy of those messages in separate folders, using a recipy like this: :0 c * ^X-Spambayes-Classification: spam trainham This will store an identical copy in the "trainham" folder. You should put it before any normal filter recipies. > However, I realize that the mail messages in these two inboxes will look a little different than when they showed up in my inbox ("Forwarded" headers, addressed to "train-spam" rather than "berendes", >>> down the side). > > WIll this "forward junk" throw off the training process? No, since it now is an identical copy ;) HTH, Rhesa From thaxtonm at msx.umsl.edu Wed Feb 4 12:43:54 2004 From: thaxtonm at msx.umsl.edu (Thaxton, Mary) Date: Wed Feb 4 12:43:59 2004 Subject: [Spambayes] Bug Report Message-ID: <8ECAAC981350F94084F02B5E7699DC6003F1BE9A@STL-MAIL2.umsl.edu> Running Outlook 2000 SR-1 on a Windows 2000 SP-3 and get the message "There was an error initializing the SpamBayes addin." Log file follows. Loaded bayes database from 'C:\Documents and Settings\thaxtonm\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\thaxtonm\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.0.2195 (Service Pack 3) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] Error connecting to Outlook! Traceback (most recent call last): File "out1.pyz/addin", line 1177, in OnConnection File "out1.pyz/dialogs", line 64, in ShowWizard File "out1.pyz/config_wizard", line 142, in CreateWizardConfig File "out1.pyz/config_wizard", line 49, in InitWizardConfig File "out1.pyz/msgstore", line 372, in YieldReceiveFolders File "out1.pyz/msgstore", line 337, in GetFolder MsgStoreException: MsgStoreException: Exception 0x80004005 (Unspecified error): Unspecified error ERROR: 'There was an error initializing the SpamBayes addin\r\n\r\nPlease re-start Outlook and try again.' Traceback (most recent call last): File "out1.pyz/addin", line 1177, in OnConnection File "out1.pyz/dialogs", line 64, in ShowWizard File "out1.pyz/config_wizard", line 142, in CreateWizardConfig File "out1.pyz/config_wizard", line 49, in InitWizardConfig File "out1.pyz/msgstore", line 372, in YieldReceiveFolders File "out1.pyz/msgstore", line 337, in GetFolder MsgStoreException: MsgStoreException: Exception 0x80004005 (Unspecified error): Unspecified error SpamBayes - Disconnecting from Outlook SpamBayes processed 0 messages, finding 0 spam and 0 unsure Addin terminating: 1 COM client and 2 COM servers exist. Mary J. Thaxton Department Assistant Managerial & Technological Services University of Missouri-St. Louis 324 Woods Hall 8001 Natural Bridge Road St. Louis, MO 63121-4499 314-516-6101 (phone) 314-516-6797 (fax) marythaxton@umsl.edu (e-mail) From ghost151 at hotmail.com Wed Feb 4 13:52:09 2004 From: ghost151 at hotmail.com (ghost ghosty) Date: Wed Feb 4 13:52:12 2004 Subject: [Spambayes] (no subject) Message-ID: Does anyone have any ideas for a project that I would like to do at Masters level? I want to do something on Bayesian filters because it is a very interesting idea. Is there some way of improving it? what could someone research on in this field? Please email ghost151@hotmail.com Thanks _________________________________________________________________ Add photos to your messages with MSN 8. Get 2 months FREE*. http://join.msn.com/?page=features/featuredemail From ghost151 at hotmail.com Wed Feb 4 13:52:23 2004 From: ghost151 at hotmail.com (ghost ghosty) Date: Wed Feb 4 13:52:26 2004 Subject: [Spambayes] Ideas for an MSc project please... Message-ID: Does anyone have any ideas for a project that I would like to do at Masters level? I want to do something on Bayesian filters because it is a very interesting idea. Is there some way of improving it? what could someone research on in this field? Please email ghost151@hotmail.com Thanks _________________________________________________________________ Protect your PC - get McAfee.com VirusScan Online http://clinic.mcafee.com/clinic/ibuy/campaign.asp?cid=3963 From C4lifeplus at webfind-101.com Wed Feb 4 14:03:37 2004 From: C4lifeplus at webfind-101.com (C4lifeplus.com) Date: Wed Feb 4 14:03:43 2004 Subject: [Spambayes] Updated Essence of Life pages with FREE Juice Offer Message-ID: <200402041903.i14J3ZTI046358@mxzilla7.xs4all.nl> An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040204/c0ff50eb/attachment.html From rmalayter at bai.org Wed Feb 4 15:59:31 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Feb 4 15:59:42 2004 Subject: [Spambayes] Ideas for an MSc project please... Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01A75D9A@cliff.bai.org> I have a few ideas for you: 1) using Bayesian-like statistics to evaluate code for virus-like behavior. I have no idea if it could work, but I would really like to see something that could stop a new worm before the anti-virus vendors have a chance to update their signatures. 2) Evaluate using different multi-gram strategies and sliding windows, like http://crm114.sourceforge.net/. Also, evaluate alternative parsing strategies or tricks, even coming up with new strategies (say, like evaluating what class-C subnet a message comes from). Tell us what strategies are actually best, with rigorous, *general-case* statistical evidence. Www.spamarchive.com may provide a source of material here, or you may be able to partner with your university's mail admins to get a diverse email mix from lots of users. Privacy issues be damned ;-). 3) A Bayesian-style web proxy filter, for libraries, schools, whatever, that evaluates web pages in real-time and blocks them if they're porn, gambling, whatever. 4) Improving Bayesian spam filtering at the SMTP gateway level. Why is it less effective, what can be done to improve it, how do you handle the training issue with a large user base of Grandma-style computer users. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org] On Behalf Of ghost ghosty > Sent: Wednesday, February 04, 2004 12:52 PM > To: spambayes@python.org > Subject: [Spambayes] Ideas for an MSc project please... > > Does anyone have any ideas for a project that I would like to > do at Masters > level? I want to do something on Bayesian filters because it > is a very > interesting idea. Is there some way of improving it? what > could someone > research on in this field? > > Please email ghost151@hotmail.com > > Thanks > > _________________________________________________________________ > Protect your PC - get McAfee.com VirusScan Online > http://clinic.mcafee.com/clinic/ibuy/campaign.asp?cid=3963 > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From scales at zko.dec.com Wed Feb 4 16:08:01 2004 From: scales at zko.dec.com (Webb Scales) Date: Wed Feb 4 16:08:16 2004 Subject: [Spambayes] Training a procmail filter for a Cyrus IMAP server Message-ID: <40215F31.3CBF8F43@zko.dec.com> Here's my sitch: I read my mail which is kept on a Cyrus IMAP server from two places; thus, I'm looking to do server-side filtering. I believe that I have a procmail hook into the mail delivery, but the mail, once delivered, is inaccessible to me except via IMAP protocol (i.e., as I understand it, this is the Cyrus way). So, I should be able to set up the procmail filtering according to the documented suggestions (although, I haven't actually tried this yet -- I'm a little intimidated ;-). My question is, how to train the filter? My answer is, configure the imapfilter and let it produce a database which I then feed to the procmail filter. This seems a really obvious approach, and I'm wondering why I've not seen mention of it anywhere. (Like, isn't it going to work??) I've got a corpus of 1.5 gigabytes or so, including 5K spam messages (I've really been looking forward to getting a Baysean filter in place! 8-). Extracting it out of IMAP would be a pain (I'm not even sure how to do that in bulk). I've actually taken the first step -- I've configured the imapfilter and started training it. It ran for 5.5 hours last night before it hit a problem with bogus date headers. I restarted it after purging the spam with the bad headers, and it's been running for a couple of hours since. One thing for people to note. This is probably obvious to the afficiandos, but it wasn't obvious to me: the trainer adds a line to the mail message headers (even though training seems like a read-only operation). The effect of this is that my mail clients discovered that their caches of the message headers were now stale. This wasn't a big deal here at work with the multi-megabit network connection, but, at home, with my soda-straw dial-up, this was a bit painful. One other question, while I'm here. What's the deal with using a database vs. a pickle? I understand that the former is supposed to be faster for a single message lookup, and the later is better for bulk training. But, I presume that what I want (once I'm done training) is a database. How do I convert the pickle into a database? (Also, I had problems using the pickle, but those might have been...well...pilot errors -- the whole having to save-and-restart in the middle of configuring the imapfilter using the web interface kind of messed me up for awhile. ;-) Thanks, Webb -- ------------------------------------------------------------------------ Webb Scales Hewlett-Packard Company scales at zko.dec.com 110 Spit Brook Rd, ZKO2-3/N30 Voice: 603.884.2196, FAX: 603.884.0120 Nashua, NH 03062-2711 When everything's coming your way, you're in the wrong lane. ------------------------------------------------------------------------ -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040204/115b39a8/attachment.html From wsy at merl.com Wed Feb 4 16:22:03 2004 From: wsy at merl.com (Bill Yerazunis) Date: Wed Feb 4 16:22:10 2004 Subject: [Spambayes] Ideas for an MSc project please... In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A01A75D9A@cliff.bai.org> (rmalayter@bai.org) References: <792DE28E91F6EA42B4663AE761C41C2A01A75D9A@cliff.bai.org> Message-ID: <200402042122.i14LM3E08569@localhost.localdomain> From: "Ryan Malayter" I have a few ideas for you: 1) using Bayesian-like statistics to evaluate code for virus-like behavior. I have no idea if it could work, but I would really like to see something that could stop a new worm before the anti-virus vendors have a chance to update their signatures. 2) Evaluate using different multi-gram strategies and sliding windows, like http://crm114.sourceforge.net/. Also, evaluate alternative parsing strategies or tricks, even coming up with new strategies (say, like evaluating what class-C subnet a message comes from). Tell us what strategies are actually best, with rigorous, *general-case* statistical evidence. Www.spamarchive.com may provide a source of material here, or you may be able to partner with your university's mail admins to get a diverse email mix from lots of users. Privacy issues be damned ;-). Thanks... I am currently finishing up the written version of the paper I gave at MIT Spam Filtering 2004; I've got a first pass on some of that data. I will post to you when it's on the web page. -Bill Yerazunis From skip at pobox.com Wed Feb 4 16:49:55 2004 From: skip at pobox.com (Skip Montanaro) Date: Wed Feb 4 16:51:07 2004 Subject: [Spambayes] Ideas for an MSc project please... In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A01A75D9A@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2A01A75D9A@cliff.bai.org> Message-ID: <16417.26883.506107.54014@montanaro.dyndns.org> Ryan> 1) using Bayesian-like statistics to evaluate code for virus-like Ryan> behavior. I have no idea if it could work, but I would really Ryan> like to see something that could stop a new worm before the Ryan> anti-virus vendors have a chance to update their signatures. SpamBayes actually already does a pretty good job of this, assuming viruses get that far within your email infrastructure. Ryan> 3) A Bayesian-style web proxy filter, for libraries, schools, Ryan> whatever, that evaluates web pages in real-time and blocks them Ryan> if they're porn, gambling, whatever. Start here perhaps: http://cvs.sourceforge.net/viewcvs.py/spambayes/spambayes/contrib/mod_spambayes.py and add a whole bunch of stuff that's missing: web page-specific tokenizing, training interface, etc... Skip From skip at pobox.com Wed Feb 4 16:53:32 2004 From: skip at pobox.com (Skip Montanaro) Date: Wed Feb 4 16:53:46 2004 Subject: [Spambayes] Training a procmail filter for a Cyrus IMAP server In-Reply-To: <40215F31.3CBF8F43@zko.dec.com> References: <40215F31.3CBF8F43@zko.dec.com> Message-ID: <16417.27100.575638.131943@montanaro.dyndns.org> Webb> So, I should be able to set up the procmail filtering according to Webb> the documented suggestions (although, I haven't actually tried Webb> this yet -- I'm a little intimidated ;-). My question is, how to Webb> train the filter? Offline. sb_filter.py uses whatever you stick there in the way of a training database. I periodically copy my training database over to my server where several other people share in the fruits of my labor via their procmailrc files. If you're running sb_filter.py, I'd just save mail into two piles: ham and spam, then run them through sb_mboxtrain.py periodically, copying the database over to your server. Skip From rmalayter at bai.org Wed Feb 4 17:28:48 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Feb 4 17:28:54 2004 Subject: [Spambayes] Ideas for an MSc project please... Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01A75DA1@cliff.bai.org> [Skip Montanaro] >>1) using Bayesian-like statistics to evaluate code >> for virus-like behavior. > > SpamBayes actually already does a pretty good job of this, > assuming viruses get that far within your email infrastructure. Spambayes evaluates the e-mail messages that viruses send, not the viral code itself. I was thinking about going further down, into the innards of the actual binary (or script) code of the virus. Parsing would be a challenge, of course, and probably language and platform dependent. But most viral code has random IP address generators, SMTP engines, backdoor programs, etc., so they must look fairly similar at some level. Even if it's at the assembler level. Scripting languages would be much easier to parse than assembler, of course. Bayesian analysis might help identify such code. Incidentally, we had about 2% of our users get tagged by MyDoom, even though we block all executable attachments at the gateway. Apparently, the rules in our AV software that apply to blocking file extensions do not apply inside ZIP files, even though the product scans for viruses inside ZIP files. We're started blocking all ZIPs as soon as we heard about MyDoom, but a few people actually opened the ZIP and the EXE inside despite all our efforts at education. We're now quarantining all ZIPs and screening them by hand while we test an update from the AV vendor. Argh. Anyway, a system which could accurately evaluate the binary code for characteristics might have caught MyDoom in this instance, seeing the characteristics of an SMTP engine, registry edits to the RUN keys, etc. The current "heuristic" scanners in most AV products suck, and never detect anything that has no virus signature in my experience. Parsing "polymorphic" and encrypted viruses might be more difficult, but presumably there's a body of knowledge about this out there somewhere (AV vendors do it already for cleaning). Regards, Ryan From contact at reynolds.org.uk Wed Feb 4 18:02:55 2004 From: contact at reynolds.org.uk (peter.r.reynolds) Date: Wed Feb 4 18:03:00 2004 Subject: [Spambayes] Deleted the folder that hold the spam Message-ID: I mistakenly deleted the folder that hold the spam by mistake. I have gone through a few loops to try and rebuild it but to no avail. Could you please provide me the info to rebuild this file/folder again. rgds Peter Reynolds From Matt.Wheeler at SupSX.com Wed Feb 4 17:43:24 2004 From: Matt.Wheeler at SupSX.com (Wheeler, Matt) Date: Wed Feb 4 18:16:46 2004 Subject: [Spambayes] various internal email addresses keep getting caught...?? Message-ID: <675CEA34ECFA414381F5B477976BAA4002B8D39D@superior-msg.superior.com> Hi, my company uses SpamBayes. I'm having a problem w/ internal emails getting caught in my "junk suspects" folder, even after i've recovered them from the folder before. These emails are usually sent to company distribution lists, which is probably why it's catching them. I looked in the options menu/tabs to see if i could place addresses in a 'do not scan' field, or something of the sort. I also checked out your site's Q&A, but didn't see anything related. Is there anything i can do to keep this from happening? Thanks, Matt Wheeler SUPERIOR ESSEX Manager, Marketing Services 770.657.6369 From tameyer at ihug.co.nz Wed Feb 4 19:44:18 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 19:44:37 2004 Subject: [Spambayes] Train reliably on "forwarded" messages? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E71652@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778DD@its-xchg4.massey.ac.nz> > I run spambayes on my mail server using procmail and it works > brilliantly. Thank you thank you thank you. Since you're using procmail, the solution that Rhesa suggested is probably the best, but FWIW: > What I'd like to do, for training, is forward "spam to be > trained" to a special address (e.g. train-spam@myserver.com) > and similarly for ham - train-ham@mailserver.com. I would > then run the mailboxtrain.py on the server on the inboxes for > those two dummy accounts. The basic functionality of what you're after here (training by forwarding mail) is done with the SMTP proxy that SpamBayes includes. You access this via sb_server - I'm not sure if you're using this already, or using sb_filter, but you can use sb_server without a POP3 proxy, like you can use it without a SMTP proxy. Basically, you send all your outgoing mail through the SMTP proxy (this assumes you're using SMTP for outgoing mail, of course). It intercepts (and does not send) any mail addresses to two special addresses and trains the database based on those. > However, I realize that the mail messages in these two > inboxes will look a little different than when they showed up > in my inbox ("Forwarded" headers, addressed to "train-spam" > rather than "berendes", >>> down the side). To avoid this, the SMTP proxy has the ability to 'look up' the original message and use that for training instead of the mangled message. To do this the original message must be on an imap server (this hasn't really been used or tested much) or in the sb_server/pop3proxy cache directories. In addition, the mail client must forward all the headers of the original message (Eudora does, I believe). If you're using sb_server, then this will work fine. If you're using sb_filter, this probably won't, although you could easily enough patch it to find the message elsewhere (for example if you saved a raw copy as in Rhesa's solution). The SMTP proxy can be set to just train on the raw mail sent to it, however. > Will this "forward junk" throw off the training process? Yes. How much and whether it will have a significant effect is uncertain and depends a lot on your mail stream itself. You would probably want to fiddle with the tokenizing settings so that fewer tokens are generated from the headers (at least in training). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 4 19:34:00 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 19:46:28 2004 Subject: [Spambayes] imap filter Query In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E715D6@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A84@its-xchg4.massey.ac.nz> > > X-Spambayes-MailId:1075888978 > > X-Spambayes-Classification: unsure > > X-Spambayes-MailId: 1075888978 > > Always wondered if the X-Spambayes-MailId duplication is a > bug or a feature for a purpose I'm not aware of. Can anyone > clear this? First I've heard of it. It's definitely a bug. I'll look into this. > > However, no header with the score is getting added, which I believe > > should. Is it possible to get the spam score with the imapfilter? I > > understand, when we use the Outlook2000 plugin, a header > > gets added to > > the mail with the spam score, which in turn can be viewed in the > > Outlook2K header list? As Mike said, you need to turn this on (the option should be there somewhere in the options exposed by the configuration pages; if not I'm certain it is in the next release). For the record, though, the Outlook plug-in doesn't add any headers at all. It adds the score to an Outlook 'field' in the message (and doesn't add anything else, like the classification, an id, or the clues). =Tony Meyer From tameyer at ihug.co.nz Wed Feb 4 19:59:17 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 19:59:34 2004 Subject: [Spambayes] Training a procmail filter for a Cyrus IMAP server In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E71729@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A86@its-xchg4.massey.ac.nz> > I've actually taken the first step -- I've configured the > imapfilter and started training it. It ran for 5.5 hours > last night before it hit a problem with bogus date headers. The next release should handle these sorts of things much more gracefully. If there are then still any issues like this they should get reported and fixed reasonably quickly. > One thing for people to note. This is probably obvious to > the afficiandos, but it wasn't obvious to me: the trainer > adds a line to the mail message headers (even though training > seems like a read-only operation). The effect of this is that > my mail clients discovered that their caches of the message > headers were now stale. This wasn't a big deal here at work > with the multi-megabit network connection, but, at home, with > my soda-straw dial-up, this was a bit painful. The reason for the header is to keep track of which messages have been processed. Otherwise, when you restarted the training, it would have started training all the ones you had already done. There are better ways to do this (and even ones that would make the code simpler), but this was the decision (mine, I admit) at the time of writing. Unfortunately, no-one's really interested in progressing the imapfilter code (various people are willing to bugfix it, but that's about it), so it's likely to stay that way for a while at least. If anyone is interested in patching it up, IIRC, the suggested solution is to base the id on the message's MessageID instead of creating a new one (imap ids aren't reliable enough). If the message doesn't have a MessageID, then it can have one added to it (so at least some messages won't be changed, even if some are). This involves a fair bit of coding, though, as well as lots of testing. I'm happy to do some testing, but I don't have time for the coding at the moment, sorry. > One other question, while I'm here. What's the deal with using > a database vs. a pickle? I understand that the former is > supposed to be faster for a single message lookup, and the later > is better for bulk training. But, I presume that what I want > (once I'm done training) is a database. How do I convert the > pickle into a database? There's a script called sb_dbexpimp.py in the scripts directory. This will convert a database, whatever form it's in, into any other form, include a flat-text format. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 4 20:00:03 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 20:00:19 2004 Subject: [Spambayes] Deleted the folder that hold the spam In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E7175B@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A87@its-xchg4.massey.ac.nz> > I mistakenly deleted the folder that hold the spam by > mistake. I have gone through a few loops to try and rebuild > it but to no avail. Could you please provide me the info to > rebuild this file/folder again. You're after FAQ 3.13: =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 4 20:05:12 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 20:05:28 2004 Subject: [Spambayes] various internal email addresses keep gettingcaught...?? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E71762@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A88@its-xchg4.massey.ac.nz> > Hi, my company uses SpamBayes. I'm having a problem w/ > internal emails getting caught in my "junk suspects" folder, > even after i've recovered them from the folder before. These > emails are usually sent to company distribution lists, which > is probably why it's catching them. Have you looked at the "spam clues" for such messages? It should be apparent from them why it's scoring what it is, and you might be able to figure out what training you need to do to stop this happening (enough training should fix it, unless they really look like something you're training as spam). It's also worth checking that you have a roughly equal number of ham and spam trained - imbalance can hurt SpamBayes in ways like this. > I looked in the options menu/tabs to see if i could place > addresses in a 'do not scan' field, or something of the sort. > I also checked out your site's Q&A, but didn't see anything related. You're after what's commonly called 'whitelisting'. However, see FAQ 6.6: The most important bit here is that you can do this with your mailer's (Outlook?) existing rules. Just move all mail from these addresses into some other folder ("Internal mail", or something), and don't have SpamBayes filter that one. (You should probably also enable 'background filtering' so that the Outlook rules are sure to run first, although if the rules are running on an Exchange server, that might not be necessary). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From Monica.R.Mulick at bankofamerica.com Wed Feb 4 20:24:06 2004 From: Monica.R.Mulick at bankofamerica.com (Mulick, Monica R) Date: Wed Feb 4 20:24:10 2004 Subject: [Spambayes] Junk folder missing Message-ID: Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 1705 bytes Desc: image001.gif Url : http://mail.python.org/pipermail/spambayes/attachments/20040204/0bf1702f/image001.gif From tameyer at ihug.co.nz Wed Feb 4 20:31:22 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 20:31:39 2004 Subject: [Spambayes] Junk folder missing In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E7179C@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778E3@its-xchg4.massey.ac.nz> > I've reinstalled the Outlook plug-in, and my Junk > Suspects folder shows up, but not the Junk folder. > We deleted the SpamBayes toolbar, to no avail! Help please! You're probably after FAQ 3.13: =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 4 20:36:14 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 20:36:29 2004 Subject: [Spambayes] imap filter Query In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E715D6@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A89@its-xchg4.massey.ac.nz> > > X-Spambayes-MailId:1075888978 > > X-Spambayes-Classification: unsure > > X-Spambayes-MailId: 1075888978 > > Always wondered if the X-Spambayes-MailId duplication is a > bug or a feature for a purpose I'm not aware of. Can anyone > clear this? So it was (well, if one of the options was set). I've fixed this, so it'll be fixed in the next release. Note that it's often worth asking this sort of thing on the list, especially if a quick google doesn't find the answer, because it might (like this time) be a problem that the developers weren't aware of. This was a relatively harmless problem, of course. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 4 20:38:27 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 20:39:10 2004 Subject: [Spambayes] T H A N K Y O U S O V E R Y M U CH!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E70C0A@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778E5@its-xchg4.massey.ac.nz> > I'm using xp\outlook 2000. The last four months I've hated to > open my e-mail because of the amount of SPAM. I watch Techtv > and they said to try SPAMBAYES and WOW!!!!!!!!!!!!!!!!!!!!! > In just three day the difference is SO INCREDIBLE. Thanks for the kind words, much appreciated by us all. We're glad that it helps. > P.S. I've been out of work so I can't afford to donate right > now, but soon. Note that there really is absolutely no pressure on anyone to donate, at all. If you feel that you should, and can, then go for it, but don't feel bad about using it without donating - that's perfectly ok. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 4 20:50:17 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 4 20:50:35 2004 Subject: [Spambayes] Ideas for an MSc project please... In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E716A6@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A8A@its-xchg4.massey.ac.nz> > Does anyone have any ideas for a project that I would like to > do at Masters level? I want to do something on Bayesian filters > because it is a very interesting idea. Is there some way of > improving it? what could someone research on in this field? What sort of background do you have? If you've got a good understanding of the statistics underneath SpamBayes (I don't! ), then you could try coming up with variants of the classifier that handle the ham/spam imbalance problem better. (In short, the SpamBayes math works best with equal numbers of ham and spam, and can fall apart with wildly imbalanced numbers. The only attempt so far to mathematically counter for this was a failure). There's probably various n-way classifying stuff that you could look at, although POPFile is probably a better place to start than SpamBayes for this (but see the n-way.py script in the contrib directory), and maybe they have it all sorted; I don't know. You could look into the effects of message/token expiry - how it effects the math, and how it effects the results, and how SpamBayes could most effectively do it. If you have a background in a non-English (especially Asian) languages, then you could look into adapting SpamBayes to work with those. (Splitting on whitespace, for example, which is at the heart of the SpamBayes tokenizer, is highly unlikely to work in that situation). If this was successful, then it'd probably be worth forking SpamBayes off to create a version for this. (If you do this, take a look at the patch that's open that looks at this, although (IIRC) it doesn't really alter the tokenizing scheme much). Another multi-language idea (not necessarily Asian) would be to look at bilingual (or trilingual, etc) corpora and seeing if there are better ways of dealing with that. (For example, although this is na?ve and unlikely to work, translating the tokens into a base language). You could look into non-email filtering. Web pages has already been suggested, but you could also look into chat/sms/newsgroup/rss spam filtering or something like that. I've heard that spam sms and instant messaging is on the rise, but haven't experience that myself (but then I hardly use IM, and NZ is probably somewhat isolated from spam sms). You could see if using natural language processing techniques could generate useful tokens, without sacrificing too much in the way of speed and database size (there's bound to be some tradeoff). There's a Python natural language processing library (), which would be a good start. I could go on and on, but won't . =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From skip at pobox.com Wed Feb 4 22:30:29 2004 From: skip at pobox.com (Skip Montanaro) Date: Wed Feb 4 22:30:33 2004 Subject: [Spambayes] various internal email addresses keep getting caught...?? In-Reply-To: <675CEA34ECFA414381F5B477976BAA4002B8D39D@superior-msg.superior.com> References: <675CEA34ECFA414381F5B477976BAA4002B8D39D@superior-msg.superior.com> Message-ID: <16417.47317.167975.111505@montanaro.dyndns.org> Matt> Hi, my company uses SpamBayes. I'm having a problem w/ internal Matt> emails getting caught in my "junk suspects" folder, even after Matt> i've recovered them from the folder before. These emails are Matt> usually sent to company distribution lists, which is probably why Matt> it's catching them. Maybe. Maybe not. Nobody can tell unless we see some clues. There are obviously some very spammy clues in there. Skip From kbraun at dgrunt.sneakernet.com Wed Feb 4 22:53:24 2004 From: kbraun at dgrunt.sneakernet.com (K. H. Braun) Date: Wed Feb 4 22:53:40 2004 Subject: [Spambayes] DB problem with sb_mboxtrain.py and Pythom-2.3.3 Message-ID: Hi Folks I am a brand new, wet behind the ears, Python newby and can not get Spambayes-1.0a7 to work. I am running Linux Slackware-9.1. I have installed Python-2.3.3 into /usr/local/Python and Spambayes-1.0a7 into /usr/local/Spambayes. I know that these locations are unusual but for admin. reasons this is where I want them to go. Anyway, when I run the following: " /usr/local/Spambayes/bin/sb_mboxtrain.py -d \${HOME}/.hammiedb -g \${HOME/mail/ham} -s \${HOME}/mail/spam" I get the following traceback: Traceback (most recent call last): File "/usr/local/Spambayes/bin/sb_mboxtrain.py", line 315, in ? main() File "/usr/local/Spambayes/bin/sb_mboxtrain.py", line 296, in main h = hammie.open(pck, usedb, "c") File "/usr/local/Spambayes/lib/python2.3/site-packages/spambayes/hammie.py", line 258, in open return Hammie(storage.open_storage(filename, useDB, mode)) File "/usr/local/Spambayes/lib/python2.3/site-packages/spambayes/storage.py", line 675, in open_storage return klass(data_source_name, mode) File "/usr/local/Spambayes/lib/python2.3/site-packages/spambayes/storage.py", line 162, in __init__ self.load() File "/usr/local/Spambayes/lib/python2.3/site-packages/spambayes/storage.py", line 187, in load self.dbm = dbmstorage.open(self.db_name, self.mode) File "/usr/local/Spambayes/lib/python2.3/site-packages/spambayes/dbmstorage.py", line 60, in open return f(db_name, mode) File "/usr/local/Spambayes/lib/python2.3/site-packages/spambayes/dbmstorage.py", line 38, in open_best return f(*args) File "/usr/local/Spambayes/lib/python2.3/site-packages/spambayes/dbmstorage.py", line 20, in open_dbhash return bsddb.hashopen(*args) File "/usr/local/Python/lib/python2.3/bsddb/__init__.py", line 192, in hashopen d.open(file, db.DB_HASH, flags, mode) bsddb._db.DBNoSuchFileError: (2, 'No such file or directory') I have attached a tarball of all the installation logs as well as the scripts that I use to semi-automate the installation(s). I would really appreciate it if someone would take a look at the stuff in the tarball and tell me what bonehead mistake I have made. ( I believe that it is a Python configuration problem but I am only guessing) Thankyou in advance for your help! -- Ken -- -------------- next part -------------- A non-text attachment was scrubbed... Name: SB-Err.tar.bz2 Type: application/octet-stream Size: 23144 bytes Desc: Url : http://mail.python.org/pipermail/spambayes/attachments/20040204/d94e98c3/SB-Err.tar-0001.obj From mamablondee at cox.net Wed Feb 4 23:49:00 2004 From: mamablondee at cox.net (J M Hall) Date: Wed Feb 4 23:49:01 2004 Subject: [Spambayes] help Message-ID: <000c01c3eba3$60296cb0$1c01a8c0@alee> I accidentally deleted my Junk Mail folder when I thought I was deleting the contents. I re-made a "Junk Mail" folder but I still get the error message "You must configure the Spam folder." I tried uninstalling and reinstalling Spambayes but it doesn't change anything. What do I do? It does move messages to the Junk Suspects folder but the obvious spam that it was removing before isn't being removed from my inbox at all. Any assistance would be helpful. Thank you. Joan -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040204/f8f7cd7d/attachment.html From tameyer at ihug.co.nz Thu Feb 5 01:10:56 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 5 01:11:12 2004 Subject: [Spambayes] help In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E717F3@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778E9@its-xchg4.massey.ac.nz> > I accidentally deleted my Junk Mail folder when > I thought I was deleting the contents. I re-made a > "Junk Mail" folder but I still get the error message > "You must configure the Spam folder." You're after FAQ 3.13: =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From canonrt at optonline.net Thu Feb 5 03:29:41 2004 From: canonrt at optonline.net (canonrt) Date: Thu Feb 5 03:29:43 2004 Subject: [Spambayes] Question Message-ID: <0HSL00193RLFZ9@mta1.srv.hcvlny.cv.net> If I Delete all files in my spam folder does that also delete the rules that put them there or is it safe to empty the spam folder? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040205/47bf7f58/attachment.html From Amir_Katz at bmc.com Thu Feb 5 04:04:39 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Thu Feb 5 04:08:04 2004 Subject: [Spambayes] Question Message-ID: Don't delete the folder, just the messages in it. In any case, the rules are saved elsewhere. Amir -----Original Message----- From: canonrt [mailto:canonrt@optonline.net] Sent: Thursday, February 05, 2004 10:30 To: spambayes@python.org Subject: [Spambayes] Question If I Delete all files in my spam folder does that also delete the rules that put them there or is it safe to empty the spam folder? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040205/f5e2df31/attachment.html From Tedles at lineone.net Thu Feb 5 06:09:47 2004 From: Tedles at lineone.net (Lesley Hayward) Date: Thu Feb 5 06:09:54 2004 Subject: [Spambayes] re which one? Message-ID: <000a01c3ebd8$94858ef0$21a0e150@xppro> Dear Spambayes, I'm impressed with your work but a bit lost as to the strange language used in the computer world. I have Outlook Express - equally I have spam coming in at a steady rate and I'm rather sick of it all [especially as I don't have a penis to enlarge!!!] I have Professional XP but e-mail uses Outlook Express. Can you tell me which, if any, programme I should download please? If you don't have one is there another somewhere out there in the ether? You were recommended by someone we met on a Business Start up course - so word is getting around! I would be soooo grateful for any help you can offer. Yours truly, Lesley Hayward [Mrs] -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040205/24352925/attachment.html From aking at cc3.com Wed Feb 4 11:33:02 2004 From: aking at cc3.com (Alice King) Date: Thu Feb 5 08:40:14 2004 Subject: [Spambayes] Messages Fail to Filter Message-ID: <54F2B44015EC2A4B84BB935F462E8C0A017D8930@exchange.ccgroupnet.com> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 7757 bytes Desc: spambayes4.log Url : http://mail.python.org/pipermail/spambayes/attachments/20040204/21f04d0e/spambayes4-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes2.log Type: application/octet-stream Size: 37 bytes Desc: spambayes2.log Url : http://mail.python.org/pipermail/spambayes/attachments/20040204/21f04d0e/spambayes2-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 264055 bytes Desc: spambayes3.log Url : http://mail.python.org/pipermail/spambayes/attachments/20040204/21f04d0e/spambayes3-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 134006 bytes Desc: spambayes1.log Url : http://mail.python.org/pipermail/spambayes/attachments/20040204/21f04d0e/spambayes1-0001.obj From chatterad at yahoo.com Thu Feb 5 09:23:16 2004 From: chatterad at yahoo.com (Anand Deshmukh) Date: Thu Feb 5 09:23:21 2004 Subject: [Spambayes] Building SpamBayes Message-ID: <20040205142316.644.qmail@web12701.mail.yahoo.com> Hello Everyone! Warm Greetings! I am building SpamBayes from the source code. I have Win2K with Outlook 2K on my machine. I have Python2.3.3 installed. I also have win32all-163 installed. I downloaded the spambayes-1.0a7 source from site. To build SpamBayes, I am using McMillan's installer installer_5b5_5. When I go to the DOS prompt and after going to Installer's location I type: Build.py \spambayes_addin.spec All goes well with some warnings (warnings file is attached) and spambayes_addin.dll is created. But the problem is when I use regsvr32 to register the dll, it never comes back and the dll is not registered. Support dir is created with only one type library entry i.e. "00062FFF-0000-0000-C000-000000000046x0x9x0". The other two dont come. The __init__.py file which is created is blank. The dicts.dat file is created with only these lines: I1 .(dp1 . Am I missing any thing here? Am I doing any thing wrong? Please help me as I really want to build SpamBayes and then use it. I have also attached a CmdLineOutput.txt file which lists all the command line messages I get. Any pointers will be really appreciated. Many thanks in advance, Anand. __________________________________ Do you Yahoo!? Yahoo! Finance: Get your refund fast by filing online. http://taxes.yahoo.com/filing.html -------------- next part -------------- W: no module named MacOS (conditional import by py_compile) W: no module named win32com.client.constants (top-level import by addin) W: no module named gdbm (top-level import by anydbm) W: no module named win32ui (delayed import by win32com.client.makepy) W: no module named dialogs.MakePropertyPage (top-level import by dialogs.wizard_processors) W: no module named psycopg (delayed import by spambayes.storage) W: no module named win32com.client.getevents (top-level import by addin) W: no module named pwd (delayed, conditional import by posixpath) W: no module named org (top-level import by pickle) W: no module named pywin (delayed import by win32com.server.dispatcher) W: no module named MySQLdb (delayed import by spambayes.storage) W: no module named pywin (delayed, conditional import by win32com.client.makepy) W: no module named dialogs.SetWaitCursor (delayed import by tester) W: no module named bsddb3 (delayed import by tester) W: no module named bsddb3 (top-level import by manager) W: no module named readline (delayed import by pdb) W: no module named dialogs.ShowDialog (top-level import by dialogs.dialog_map) W: no module named dialogs.MakePropertyPage (top-level import by dialogs.dialog_map) W: no module named pwd (delayed import by getpass) W: no module named posix (delayed, conditional import by iu) W: no module named win32com.client.CastTo (top-level import by addin) W: no module named org (top-level import by copy) W: no module named dbm (top-level import by whichdb) W: no module named win32com.client.WithEvents (top-level import by addin) W: no module named readline (delayed, conditional import by cmd) W: no module named EasyDialogs (conditional import by getpass) W: no module named SOCKS (top-level import by ftplib) W: no module named win32com.client._get_good_object_ (top-level import by win32com.client.util) W: no module named ic (delayed import by urllib) W: no module named dbm (top-level import by anydbm) W: no module named dialogs.ShowDialog (top-level import by dialogs.wizard_processors) W: no module named pywin (delayed import by win32com.client.makepy) W: no module named dialogs.SetWaitCursor (delayed import by manager) W: no module named bsddb3 (delayed import by spambayes.dbmstorage) W: no module named posix (conditional import by os) W: no module named win32com.client.DispatchWithEvents (top-level import by addin) W: no module named dialogs.ShowWizard (top-level import by dialogs.dialog_map) W: no module named win32com.client.constants (top-level import by msgstore) W: no module named win32com.client.constants (top-level import by tester) W: no module named pywin (delayed import by win32com.client.selecttlb) W: no module named rourl2path (conditional import by urllib) W: no module named gdbm (delayed import by spambayes.dbmstorage) W: no module named Carbon (conditional import by tempfile) W: delayed conditional __import__ hack detected at line 0 - anydbm (C:\Python23\lib\anydbm.pyc) W: __all__ is built strangely at line 0 - tokenize (C:\Python23\lib\tokenize.pyc) W: delayed conditional __import__ hack detected at line 0 - dialogs (C:\spambayes-1.0a7\Outlook2000\dialogs\__init__.pyc) W: delayed conditional __import__ hack detected at line 0 - dialogs (C:\spambayes-1.0a7\Outlook2000\dialogs\__init__.pyc) W: delayed exec statment detected at line 0 - bdb (C:\Python23\lib\bdb.pyc) W: delayed eval hack detected at line 0 - bdb (C:\Python23\lib\bdb.pyc) W: delayed eval hack detected at line 0 - bdb (C:\Python23\lib\bdb.pyc) W: delayed exec statment detected at line 0 - doctest (C:\Python23\lib\doctest.pyc) W: delayed conditional __import__ hack detected at line 0 - doctest (C:\Python23\lib\doctest.pyc) W: delayed __import__ hack detected at line 0 - pickle (C:\Python23\lib\pickle.pyc) W: delayed __import__ hack detected at line 0 - pickle (C:\Python23\lib\pickle.pyc) W: delayed conditional eval hack detected at line 0 - dumbdbm (C:\Python23\lib\dumbdbm.pyc) W: delayed eval hack detected at line 0 - config (C:\spambayes-1.0a7\Outlook2000\config.pyc) W: delayed conditional eval hack detected at line 0 - gopherlib (C:\Python23\lib\gopherlib.pyc) W: delayed __import__ hack detected at line 0 - encodings (C:\Python23\lib\encodings\__init__.pyc) W: delayed conditional __import__ hack detected at line 0 - encodings (C:\Python23\lib\encodings\__init__.pyc) W: __all__ is built strangely at line 0 - re (C:\Python23\lib\re.pyc) W: __all__ is built strangely at line 0 - dis (C:\Python23\lib\dis.pyc) W: delayed __import__ hack detected at line 0 - win32com.server.policy (C:\Python23\lib\site-packages\win32com\server\policy.pyc) W: delayed conditional eval hack detected at line 0 - warnings (C:\Python23\lib\warnings.pyc) W: delayed conditional __import__ hack detected at line 0 - warnings (C:\Python23\lib\warnings.pyc) W: delayed eval hack detected at line 0 - random (C:\Python23\lib\random.pyc) W: __all__ is built strangely at line 0 - fcntl (C:\Python23\lib\fcntl.pyc) W: delayed exec statment detected at line 0 - pdb (C:\Python23\lib\pdb.pyc) W: delayed conditional eval hack detected at line 0 - pdb (C:\Python23\lib\pdb.pyc) W: delayed eval hack detected at line 0 - pdb (C:\Python23\lib\pdb.pyc) W: delayed conditional eval hack detected at line 0 - pdb (C:\Python23\lib\pdb.pyc) W: delayed eval hack detected at line 0 - pdb (C:\Python23\lib\pdb.pyc) W: __all__ is built strangely at line 0 - termios (C:\Python23\lib\termios.pyc) W: delayed conditional exec statment detected at line 0 - iu (C:\Installer\iu.pyc) W: delayed conditional exec statment detected at line 0 - iu (C:\Installer\iu.pyc) W: delayed exec statment detected at line 0 - win32com.client.dynamic (C:\Python23\lib\site-packages\win32com\client\dynamic.pyc) W: delayed eval hack detected at line 0 - mhlib (C:\Python23\lib\mhlib.pyc) W: delayed conditional __import__ hack detected at line 0 - unittest (C:\Python23\lib\unittest.pyc) W: delayed conditional __import__ hack detected at line 0 - unittest (C:\Python23\lib\unittest.pyc) W: delayed exec statment detected at line 0 - cgi (C:\Python23\lib\cgi.pyc) W: delayed exec statment detected at line 0 - socket (C:\Python23\lib\socket.pyc) W: delayed conditional __import__ hack detected at line 0 - win32com.client.gencache (C:\Python23\lib\site-packages\win32com\client\gencache.pyc) W: delayed __import__ hack detected at line 0 - win32com.client.gencache (C:\Python23\lib\site-packages\win32com\client\gencache.pyc) W: delayed eval hack detected at line 0 - os (C:\Python23\lib\os.pyc) W: __all__ is built strangely at line 0 - __future__ (C:\Python23\lib\__future__.pyc) W: delayed __import__ hack detected at line 0 - win32com.client.makepy (C:\Python23\lib\site-packages\win32com\client\makepy.pyc) W: top-level conditional exec statment detected at line 0 - bsddb (C:\Python23\lib\bsddb\__init__.pyc) -------------- next part -------------- DeprecationWarning) checking Analysis building Analysis because out0.toc non existent running Analysis out0.toc Analyzing: C:\Installer/support/_mountzlib.py Analyzing: C:\Installer/support/useUnicode.py Analyzing: C:/spambayes-1.0a7/Outlook2000/installer/spambayes_addin.py W: pythoncom is changing it's name to pythoncom23 W: pywintypes is changing it's name to PyWinTypes23 Warnings written to C:\spambayes-1.0a7\Outlook2000\installer\warnspambayes_addin .txt checking PYZ rebuilding out1.toc because out1.pyz is missing building PYZ out1.toc checking PKG rebuilding out3.toc because out3.pkg is missing building PKG out3.pkg checking DLL rebuilding out2.toc because spambayes_addin.dll missing building DLL out2.toc checking COLLECT building out4.toc because out4.toc missing building COLLECT out4.toc From skip at pobox.com Thu Feb 5 10:40:01 2004 From: skip at pobox.com (Skip Montanaro) Date: Thu Feb 5 10:41:31 2004 Subject: [Spambayes] DB problem with sb_mboxtrain.py and Pythom-2.3.3 In-Reply-To: References: Message-ID: <16418.25553.715112.281853@montanaro.dyndns.org> Ken> Anyway, when I run the following: Ken> " /usr/local/Spambayes/bin/sb_mboxtrain.py -d \${HOME}/.hammiedb -g \${HOME/mail/ham} -s \${HOME}/mail/spam" Ken> I get the following traceback: ... Ken> bsddb._db.DBNoSuchFileError: (2, 'No such file or directory') Seems to me you don't want backslashes in front of your dollar signs. All you do is prevent the usual shell expansion of the HOME environment variable. Also your second one has the right curly brace misplaced. Try: .../sb_mbox_train.py -d $HOME/.hammiedb -g $HOME/mail/ham -s $HOME/mail/spam Unless you're using some unusual shell the $HOME references in the above command should expand fine. Skip From bnorman at bak.rr.com Thu Feb 5 11:44:00 2004 From: bnorman at bak.rr.com (William Norman) Date: Thu Feb 5 11:44:24 2004 Subject: [Spambayes] delete to spam failure Message-ID: <000801c3ec07$41a626c0$6400a8c0@normans> Loaded spambayes and seemed to work fine, today I wanted to delete a message to spam so I highlighted the message and clicked delete to spam, message popped up said no filterable file found. Tried this on several files same results? please help as I think this program is great.. thanks Bill Norman, Bakersfield, CA. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040205/43c57726/attachment.html From kbraun at dgrunt.sneakernet.com Thu Feb 5 12:20:58 2004 From: kbraun at dgrunt.sneakernet.com (K. H. Braun) Date: Thu Feb 5 12:46:03 2004 Subject: [Spambayes] DB problem with sb_mboxtrain.py and Pythom-2.3.3 In-Reply-To: <16418.25553.715112.281853@montanaro.dyndns.org> References: <16418.25553.715112.281853@montanaro.dyndns.org> Message-ID: Hi Skip BINGO! problem fixed. Like I said - a bonehead mistake! Thank you -- Ken -- From kennypitt at hotmail.com Thu Feb 5 13:53:10 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Thu Feb 5 13:54:03 2004 Subject: [Spambayes] re which one? In-Reply-To: <000a01c3ebd8$94858ef0$21a0e150@xppro> Message-ID: Lesley Hayward wrote: > I have Professional XP but e-mail uses Outlook Express. Can you tell > me which, if any, programme I should download please? If you don't > have one is there another somewhere out there in the ether? Yes, we have a version that will work with Outlook Express, although you may want to wait a little to go get it. The current release only supports automatic installation for Outlook 2000+, but we're hoping to have a new release out sometime next week. In addition to the Outlook plug-in, the installer for the new release will have an option to install a version that works with non-Outlook clients, including Outlook Express, as long as you are using a POP3 mail account. It will not be as tightly integrated into your mail client as the Outlook plug-in, but it has a very usable web-based interface and provides exactly the same filtering capabilities. You may want to subscribe to the spambayes-announce mailing list so that you can get notification when we put out a new release. Information for subscribing is here: http://mail.python.org/mailman/listinfo/spambayes-announce -- Kenny Pitt From deedeemurry at greatbigisland.com Thu Feb 5 14:32:35 2004 From: deedeemurry at greatbigisland.com (Dee Dee) Date: Thu Feb 5 14:32:30 2004 Subject: [Spambayes] Spambayes bug Message-ID: Well shucks, no one responded to my problem and I've since seen similar posts and it seems there isn't an answer to this bug. (Spam folder suddenly deletes and when trying to recreate it, configuration wizard doesn't work). I have deleted spaybayes and reinstalled, did the toolbar thing, nothing. I don't know what else to try so I guess it's good bye to Spambayes. I will miss it, it worked GREAT. I would be more than happy to pay for it if there was a way to make it work again. Thanks for putting it out there. By the way, and not complaining here, I know this is all free and very generous of you to even offer it, but on your website, and put in my info to go to the subscriber list, I get an error page. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040205/6f9d2a2d/attachment.html From nobody at spamcop.net Thu Feb 5 14:48:12 2004 From: nobody at spamcop.net (Seth Goodman) Date: Thu Feb 5 14:48:15 2004 Subject: [Spambayes] Outlook 2000 "backup" function blocked by Spambayes In-Reply-To: Message-ID: > [Dennis Hartmann] > since I've installed SpamBayes (awesome tool - thank you guys) > I'm unable to use the backup function. It says that parts of > the file (pst) is in use and cannot backup. any ideas? I'm not exactly sure what you mean by "backup", though Outlook has a well-developed auto-archive function that I use extensively. I also run Outlook2000 along with the SpamBayes 0.81 Outlook plug-in and auto-archiving works just as it did before installing SpamBayes. The same was true of the previous SpamBayes 0.80 plug-in. Perhaps you could give some more details of what you mean by "backup", if this is not the Outlook2000 auto-archive function? How is it set up and exactly how does it fail? Is there anything the in the SpamBayes log at the time of the failure? If, for some reason, you are _not_ using auto-archive in Outlook, I highly recommend it and would be happy to help you get it going. If you _are_ using auto-archive and SpamBayes interferes, this is unusual and may eventually turn into a bug report. -- Seth Goodman off-list replies to sethg [at] GoodmanAssociates [dot] com From kennypitt at hotmail.com Thu Feb 5 15:18:35 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Thu Feb 5 15:19:29 2004 Subject: [Spambayes] Spambayes bug In-Reply-To: Message-ID: FAQ 3.13 should tell you everything you need to know. Note that it is not possible for SpamBayes to delete the spam folder for you, but inadvertently deleting a folder is all too easy to do in Outlook. http://spambayes.sourceforge.net/faq.html#help-i-deleted-the-unsure-spam -folder There is a bug here in that configuration wizard didn't work, but this problem is much easier to correct directly through the Filtering tab. IIRC, the configuration wizard bug has been fixed for the next release which will hopefully be out sometime next week. -- Kenny Pitt _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Dee Dee Sent: Thursday, February 05, 2004 2:33 PM To: spambayes@python.org Subject: [Spambayes] Spambayes bug Well shucks, no one responded to my problem and I've since seen similar posts and it seems there isn't an answer to this bug. (Spam folder suddenly deletes and when trying to recreate it, configuration wizard doesn't work). I have deleted spaybayes and reinstalled, did the toolbar thing, nothing. I don't know what else to try so I guess it's good bye to Spambayes. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040205/f5e6f399/attachment.html From tim at fourstonesExpressions.com Thu Feb 5 15:32:00 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Thu Feb 5 15:32:08 2004 Subject: [Spambayes] Spambayes bug In-Reply-To: References: Message-ID: On Thu, 5 Feb 2004 11:32:35 -0800, Dee Dee wrote: > Well shucks, no one responded to my problem and I've since seen similar > posts and it seems there isn't an answer to this bug. (Spam folder > suddenly > deletes and when trying to recreate it, configuration wizard doesn't > work). Sorry, Dee Dee... due to the volume, sometimes a query slips through the cracks. I'm not an outlook user, so I can't offer any help directly... lemme see if I can get someone to respond. -- Vous exprimer; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From tameyer at ihug.co.nz Thu Feb 5 20:39:27 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 5 20:39:44 2004 Subject: [Spambayes] Building SpamBayes In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E718F3@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778F0@its-xchg4.massey.ac.nz> > To build SpamBayes, I am using McMillan's installer > installer_5b5_5. Unless you have a particular reason to, you're probably better off following the official build process, which now uses py2exe. The necessary files are in the windows/py2exe directory in the distribution (maybe not 1.0a7; I can't recall, but certainly in cvs). > Support dir is created with only one type library > entry i.e. > "00062FFF-0000-0000-C000-000000000046x0x9x0". The > other two dont come. The __init__.py file which is > created is blank. It's been ages since I have played about with building SpamBayes (or anything else) with Installer (and Mark did all the releases), but do you have all the appropriate files in your gen_py directory? IIRC, the installer is (unless you've changed it) looking for Outlook/Office 2000 files, and you might have XP ones... > Please help me as I really want to build > SpamBayes and then use it. I presume you want to do more than this, but in case you don't, and have missed this, if you just want to use it, there's no need to go through the build process at all. Either run with Python by doing "python addin.py" or download the prebuilt installer. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 5 20:46:22 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 5 20:46:40 2004 Subject: [Spambayes] delete to spam failure In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC34A1@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A90@its-xchg4.massey.ac.nz> > Loaded spambayes and seemed to work fine, today I wanted > to delete a message to spam so I highlighted the message > and clicked delete to spam, message popped up said no > filterable file found. Tried this on several files same > results? SpamBayes tries to protect you from training on mail that you shouldn't - in particular, messages that you have created, rather than received. Was there any unusual about that message in particular? Did you receive it like any other message? Was it some sort of Exchange generated message or something? If all was normal, then you should probably add some sort of comment to this bug report: It deals with another case like this, although there isn't really a solution yet. The best hope is that something be found that's in common with these messages, and still distinguishes them from mail that shouldn't be filtered. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From wburroughs at kc.rr.com Thu Feb 5 23:40:22 2004 From: wburroughs at kc.rr.com (Emidge) Date: Thu Feb 5 23:42:11 2004 Subject: [Spambayes] Can't tell if it's working Message-ID: <000801c3ec6b$5601f310$6400a8c0@userk0g1ol51f3> My operating system is Windows 2000 Pro. I'm using SpamBayes version 0.81 Outlook Add-in. When I click on Tools, go to Com Add-Ins; SpamBayes is not listed. It should be there right? I re-installed and opened Outlook and it still was not there. What do I do next? Thanks, Will -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040205/bcb1aeab/attachment-0001.html From jm at jmason.org Fri Feb 6 00:23:19 2004 From: jm at jmason.org (Justin Mason) Date: Fri Feb 6 00:23:24 2004 Subject: [Spambayes] good article on "training-to-exhaustion" (fwd) Message-ID: <20040206052320.8B88416EFB@jmason.org> ...from Gary Robinson: http://www.garyrobinson.net/2004/02/spam_filtering_.html --j. From jeriksen at egi.com Fri Feb 6 00:33:32 2004 From: jeriksen at egi.com (Jeff Eriksen) Date: Fri Feb 6 00:40:35 2004 Subject: [Spambayes] deleted Junk E-Mail folder Message-ID: <017b01c3ec72$c5841360$da00000a@Loki> I did read the help files, but nothing seemed appropriate to my problem. I am using Spambayes on WinXP with Outlook XP. I decided to delete all the emails in my Junk E-Mail folder, but later realized that I also deleted the folder itself. I re-created it, but now it seems like my Junk E-Mail is still being classified as such (since I see emails appear briefly in my Inbox then disappear, but they do not show up anywhere else. My good mail comes to my Inbox, as well as some Junk suspects. How can I make sure I am getting my Junk E-Mails (weird question, huh)? Thanks, -Jeff -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040205/0f004c00/attachment.html From patrick at skex.com.au Fri Feb 6 01:43:35 2004 From: patrick at skex.com.au (Patrick Dockhorn) Date: Fri Feb 6 01:43:27 2004 Subject: [Spambayes] Bayes Outlook Plugin & POP Proxy DBs Message-ID: <008601c3ec7c$8ba9b280$f900a8c0@safran> I've been using Marc's wonderful Outlook Plugin for a while now and am currently setting up Spambayes to run as a POP3 proxy in our Intranet. I was wondering - is there an easy way how I can move my existing Spam databases across to the Linux based Spambayes pop3 proxy so that I can avoid retraining? - from what I've seen it seems that the pop3 proxy is proxying to fixed pop3 servers only - is there a reason for this? To support multiple users which use different pop servers I think it would be more flexible to have the pop3 userid in the customer's mail client to be of the form "user@pop-server:port" and the proxy can thus interface with arbitrary pop servers Your help is highly appreciated. Regards, Patrick From chatterad at yahoo.com Fri Feb 6 03:50:44 2004 From: chatterad at yahoo.com (Anand Deshmukh) Date: Fri Feb 6 03:50:48 2004 Subject: [Spambayes] Building SpamBayes In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13046778F0@its-xchg4.massey.ac.nz> Message-ID: <20040206085044.19360.qmail@web12708.mail.yahoo.com> Hi Tony! Thanks a lot for your reply. It has worked! For the benefit of others I got the latest spambayes code from CVS, got latest installer of py2Win from sourceforge and installed it. Then using the setup_all.py file in windowns/py2exe folder in spambayes, I created the binaries. It created outlook_addin.dll in dist/bin directory. I renamed it to spambayes_addin.dll and registered it with regsvr32.exe. I then mannualy added registry entries in outlooks addin keys and it worked. Really appritiate your help Tony! Anand. --- Tony Meyer wrote: > > To build SpamBayes, I am using McMillan's > installer > > installer_5b5_5. > > Unless you have a particular reason to, you're > probably better off following > the official build process, which now uses py2exe. > The necessary files are > in the windows/py2exe directory in the distribution > (maybe not 1.0a7; I > can't recall, but certainly in cvs). > > > Support dir is created with only one type library > > entry i.e. > > "00062FFF-0000-0000-C000-000000000046x0x9x0". The > > other two dont come. The __init__.py file which is > > created is blank. > > It's been ages since I have played about with > building SpamBayes (or > anything else) with Installer (and Mark did all the > releases), but do you > have all the appropriate files in your gen_py > directory? IIRC, the > installer is (unless you've changed it) looking for > Outlook/Office 2000 > files, and you might have XP ones... > > > Please help me as I really want to build > > SpamBayes and then use it. > > I presume you want to do more than this, but in case > you don't, and have > missed this, if you just want to use it, there's no > need to go through the > build process at all. Either run with Python by > doing "python addin.py" or > download the prebuilt installer. > > =Tony Meyer > > --- > Please always include the list > (spambayes@python.org) in your replies > (reply-all), and please don't send me personal mail > about SpamBayes. This > way, you get everyone's help, and avoid a lack of > replies when I'm busy. > __________________________________ Do you Yahoo!? Yahoo! Finance: Get your refund fast by filing online. http://taxes.yahoo.com/filing.html From Franklin.Davis at nokia.com Fri Feb 6 06:16:56 2004 From: Franklin.Davis at nokia.com (Franklin.Davis@nokia.com) Date: Fri Feb 6 06:17:04 2004 Subject: [Spambayes] Suggestion: post SpamBayes to download.com and shareware.com Message-ID: Hi, all-- First, thank you from the bottom of my heart for your work. My inbox was hell, and now peace reigns. I just sent my colleague from the Insurance dept. to your site. In checking the URL, I realized the page would be really daunting for a non-technical user. But SpamBayes itself is totally user-friendly! So I suggest making the home page much more inviting to the general user, with a simple description of the features & benefits, and download links with install instructions, and put the project and technical details on a second page. And, distribute it on the shareware sites. I searched download.com and someone is selling a SpamBayes-based tool! Again, thank you. --Franklin Franklin Davis Nokia Technology Platform / Strategic Architecture Browsing Service Enabler Manager * Keilalahdentie 4, 02150, Espoo, Finland * Franklin.Davis@nokia.com * +358 50 482 1263 * +358 7180 38677 (FAX) -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040206/fc70a419/attachment.html From rcoe at CambridgeMA.GOV Fri Feb 6 08:22:48 2004 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Fri Feb 6 08:22:52 2004 Subject: [Spambayes] Funny behavior of Outlook plugin Message-ID: R. K. Coe _____ I don't know whether this is a bug or a feature; but it was a bit unexpected, so I thought I'd pass it along. I use the Outlook plugin (0.85) with Office 2000 under Windows XP Pro. Last night I received a real message that got misclassified as spam. I unthinkingly answered it before moving it back to my inbox, so my answer went into the spam folder. (Outlook does that if you answer a message in a folder other than your inbox.) So I selected it and clicked the "Recover" button, whereupon I got the "No filterable mail items are selected" popup, and the message didn't move. I was able to move it without using Spambayes, and the Recover button did, of course, work on the original message. My guess is that this behavior is a side effect of an implicit decision not to assign any training value to a message that hasn't been scanned. That's not illogical, but I don't think that particular error message (whose ubiquity belies its lack of information content) quite conveys the reasoning. No big deal. It just wasn't what I would have expected. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040206/36df9089/attachment.html From eschechter at snet.net Fri Feb 6 10:10:05 2004 From: eschechter at snet.net (eschechter@snet.net) Date: Fri Feb 6 10:10:21 2004 Subject: [Spambayes] Spambayes Compatibility Message-ID: <402367FD.22169.1446659B@localhost> I am using Pegasus Win-Pmail32 and Windows XP. Can Spambayes be used with the WinPmail client? If not, with what is it compatible?-- Ed Schechter EMS Consulting 158 Four Brooks Road Stamford, CT 06903 Voice: (203) 322-1542 FAX: (203) 322-2411 EMAIL: eschechter@snet.net From tim.one at comcast.net Fri Feb 6 10:13:37 2004 From: tim.one at comcast.net (Tim Peters) Date: Fri Feb 6 10:13:36 2004 Subject: [Spambayes] RE: [spambayes-bugs] parse error - undefined boundary - insb_imapfilter.py In-Reply-To: <0AA408F4E902C84C8F6BE093CCE9D7A808D3F2@server1.artsalliance.co.uk> Message-ID: [redirected from the inappropriate spambayes-bugs list] [Bruce Leigh] > Hi > just wanted to let you know that I am using the latest version of > sb_imapfilter but getting crashes due to parse errors. It would be > nice (and possible) to make the system handle this gracefully and > just dump the message into the unsure box (or maybe user definable). > I am attaching a zip of a .PST file containing 12 messages that will > each cause this problem so that you can properly evaluate it. The > messages are Outlook messages. I assume that this the most useful way > to get you the dodgy data.... That should have been very helpful. Alas, while the zip file did contain a problem.pst file, problem.pst was empty (0 bytes). > please contact me directly if you need any more info or the messages > in a different format. I have set my prefs to NOT receive anything > that is posted to the list so please mail me directly. The spambayes-bugs list isn't a list for humans to post to, so if you don't want to see the machine-generated email that appears on that list, don't subscribe to it at all. Descriptions of the 5 available mailing lists are here: http://spambayes.sourceforge.net/contact.html It's best to enter bug reports on the SpamBayes bug tracker (also at SourceForge), but if that's too hard then mailing to the spambayes list has a decent chance of getting noticed. > Hope this helps, thanks for a great product, saving > me hours a week in any case... :-) best Bruce You're welcome! From kennypitt at hotmail.com Fri Feb 6 12:30:59 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 6 12:31:55 2004 Subject: [Spambayes] deleted Junk E-Mail folder In-Reply-To: <017b01c3ec72$c5841360$da00000a@Loki> Message-ID: Check out FAQ 3.13: http://spambayes.sourceforge.net/faq.html#help-i-deleted-the-unsure-spam -folder -- Kenny Pitt _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Jeff Eriksen Sent: Friday, February 06, 2004 12:34 AM To: spambayes@python.org Subject: [Spambayes] deleted Junk E-Mail folder I decided to delete all the emails in my Junk E-Mail folder, but later realized that I also deleted the folder itself. I re-created it, but now it seems like my Junk E-Mail is still being classified as such (since I see emails appear briefly in my Inbox then disappear, but they do not show up anywhere else. My good mail comes to my Inbox, as well as some Junk suspects. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040206/7b35fa43/attachment.html From kennypitt at hotmail.com Fri Feb 6 12:49:01 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 6 12:50:03 2004 Subject: [Spambayes] Funny behavior of Outlook plugin In-Reply-To: Message-ID: SpamBayes only trains on received messages, and it has a check to prevent you from training on a message that you sent yourself. This is the check that caused the "No filterable items" popup, but the warning would certainly be more helpful if it could be a little more detailed about why the message was not "filterable". I'll look into whether or not we can do that, and we should be able to at least include some possible reasons why it would occur even if we don't know the exact one. As far as the need to "Recover" it, I doubt that SpamBayes trained on it at all, so there shouldn't be any mis-training issues caused by the fact that you can't train it manually. -- Kenny Pitt _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Coe, Bob Sent: Friday, February 06, 2004 8:23 AM To: spambayes@Python.org Subject: [Spambayes] Funny behavior of Outlook plugin R. K. Coe _____ I don't know whether this is a bug or a feature; but it was a bit unexpected, so I thought I'd pass it along. I use the Outlook plugin (0.85) with Office 2000 under Windows XP Pro. Last night I received a real message that got misclassified as spam. I unthinkingly answered it before moving it back to my inbox, so my answer went into the spam folder. (Outlook does that if you answer a message in a folder other than your inbox.) So I selected it and clicked the "Recover" button, whereupon I got the "No filterable mail items are selected" popup, and the message didn't move. I was able to move it without using Spambayes, and the Recover button did, of course, work on the original message. My guess is that this behavior is a side effect of an implicit decision not to assign any training value to a message that hasn't been scanned. That's not illogical, but I don't think that particular error message (whose ubiquity belies its lack of information content) quite conveys the reasoning. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040206/d7102ea4/attachment.html From kennypitt at hotmail.com Fri Feb 6 13:09:07 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 6 13:10:19 2004 Subject: [Spambayes] Suggestion: post SpamBayes to download.com andshareware.com In-Reply-To: Message-ID: Franklin.Davis@nokia.com wrote: > First, thank you from the bottom of my heart for your work. My inbox was > hell, and now peace reigns. Great! > So I suggest making the home page much more inviting to the general user, > with a simple description of the features & benefits, and download links > with install instructions, and put the project and technical details on > a second page. Sounds like a good idea, but we're all developers so we tend to write for developers as well. The technical mumbo-jumbo is what we live for! However, we're always willing to accept submissions, so feel free to create a mock-up of your ideas and either send it to the spambayes-dev@python.org list or (preferably) submit a patch via SourceForge. > And, distribute it on the shareware sites. I searched download.com and > someone is selling a SpamBayes-based tool! Creating a commercial product based on SpamBayes is explicitly allowed by our license, and anyone is welcome to do so. SpamBayes was created by some people that had a need for spam filtering, and decided to share the fruits of their labor with others. The rest of us joined up along the way because the software filled our need as well, and we wanted to contribute to making it an even better tool for what we want it to do. It gives us all a warm, fuzzy feeling when people like the software enough to think it would be worth paying for, but that's not why we do it. We develop for ourselves, and the fact that other people are benefiting is a nice side-effect. Besides, as a volunteer effort, we don't want to make more work for ourselves by becoming *too* popular! -- Kenny Pitt From curania at yahoo.com Fri Feb 6 14:59:50 2004 From: curania at yahoo.com (T.McNut) Date: Fri Feb 6 14:59:56 2004 Subject: [Spambayes] Spam block Message-ID: <20040206195950.67627.qmail@web41510.mail.yahoo.com> Aloha Honey Does this program work with blocking spam in Yahoo? I have their free web based email service. I get 200 spam a day. My email address is a good seller for someone. Mahalo nui loa for you time and assistance with this matter. Ciao T.McNut --------------------------------- Do you Yahoo!? Yahoo! Finance: Get your refund fast by filing online -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040206/dc6e543b/attachment.html From skip at pobox.com Fri Feb 6 15:14:23 2004 From: skip at pobox.com (Skip Montanaro) Date: Fri Feb 6 15:14:33 2004 Subject: [Spambayes] Spam block In-Reply-To: <20040206195950.67627.qmail@web41510.mail.yahoo.com> References: <20040206195950.67627.qmail@web41510.mail.yahoo.com> Message-ID: <16419.62879.21085.532822@montanaro.dyndns.org> TM> Aloha Honey Does this program work with blocking spam in Yahoo? I TM> have their free web based email service. I get 200 spam a day. My TM> email address is a good seller for someone. Mahalo nui loa for you TM> time and assistance with this matter. Ciao T.McNut No, though it wouldn't be impossible to do with a web proxy of some sort. If Yahoo! offers some way for you to fetch your mail using POP3 or IMAP you could use the existing sb_server.py or imap_filter.py applications. If you're stuck with a Web mail service you've got some programming to do. Skip From rmalayter at bai.org Fri Feb 6 15:23:13 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Fri Feb 6 15:23:17 2004 Subject: [Spambayes] Spam block Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01A75DD3@cliff.bai.org> No, SpamBayes does not work with the free web-based email systems like Yahoo Mail and Hotmail. At least, not unless you pay extra and get POP3 or IMAP access to your Yahoo or Hotmail account. If you do that, you can use SpamBayes with the Outlook plug-in or the POP3/IMAP proxies to filter mail, and you would read your mail with whatever mail program you wanted. _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of T.McNut Sent: Friday, February 06, 2004 2:00 PM To: spambayes@python.org Subject: [Spambayes] Spam block Aloha Honey Does this program work with blocking spam in Yahoo? I have their free web based email service. I get 200 spam a day. My email address is a good seller for someone. Mahalo nui loa for you time and assistance with this matter. Ciao T.McNut _____ Do you Yahoo!? Yahoo! Finance: Get your refund fast by filing online -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040206/8928286b/attachment.html From kennypitt at hotmail.com Fri Feb 6 16:34:54 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 6 16:35:56 2004 Subject: [Spambayes] Spam block In-Reply-To: <16419.62879.21085.532822@montanaro.dyndns.org> Message-ID: Skip Montanaro wrote: >> Aloha Honey Does this program work with blocking spam in Yahoo? I >> have their free web based email service. I get 200 spam a day. My >> email address is a good seller for someone. Mahalo nui loa for you >> time and assistance with this matter. Ciao T.McNut > > No, though it wouldn't be impossible to do with a web proxy of some > sort. If Yahoo! offers some way for you to fetch your mail using > POP3 or IMAP you could use the existing sb_server.py or > imap_filter.py applications. If you're stuck with a Web mail service > you've got some programming to do. The open-source "YahooPOPs!" program will allow you to read your Yahoo mail via a POP3 interface, and you can then point the SpamBayes sb_server POP3 proxy filter at the YahooPOPs POP3 interface to filter your mail. http://yahoopops.sourceforge.net If you want to try this, you should probably wait until the next release of SpamBayes (hopefully next week), which will include a Windows installer for the sb_server program. I've used this configuration myself, so I may be able to help you out with the configuration settings. -- Kenny Pitt From skip at pobox.com Fri Feb 6 17:04:47 2004 From: skip at pobox.com (Skip Montanaro) Date: Fri Feb 6 17:06:35 2004 Subject: [Spambayes] Spam block In-Reply-To: References: <16419.62879.21085.532822@montanaro.dyndns.org> Message-ID: <16420.3967.74757.764544@montanaro.dyndns.org> Kenny> The open-source "YahooPOPs!" program will allow you to read your Kenny> Yahoo mail via a POP3 interface, and you can then point the Kenny> SpamBayes sb_server POP3 proxy filter at the YahooPOPs POP3 Kenny> interface to filter your mail. Kenny> http://yahoopops.sourceforge.net Cool... Perhaps this belongs in the FAQ? Kenny> If you want to try this, you should probably wait until the next Kenny> release of SpamBayes (hopefully next week), which will include a Kenny> Windows installer for the sb_server program. Agreed. Skip From marvallen1 at comcast.net Fri Feb 6 18:11:02 2004 From: marvallen1 at comcast.net (marvallen) Date: Fri Feb 6 18:11:45 2004 Subject: [Spambayes] Where did it go? Message-ID: <000801c3ed06$8c743ac0$6500a8c0@oempnck9i2acp1> Hello, I installed your program and can't seem to find or get new email. I've searched for a holding file but without success. Is there a way to configure the program by getting my email and then letting Spambayes know what to do with it? Thank you, Marv 248 491-2000 PS: I will have to get your response from another connected computer. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040206/70837c93/attachment.html From mhammond at skippinet.com.au Fri Feb 6 18:36:47 2004 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri Feb 6 18:37:05 2004 Subject: [Spambayes] Funny behavior of Outlook plugin In-Reply-To: Message-ID: <079b01c3ed0a$17805c90$0200a8c0@eden> This is a "feature" :) Spambayes (or is it SpamBayes? goes to efforts to exclude mails you yourself composed for this exact reason - that Outlook often stores your replies in the same folder. You should find that original messages you compose are not able to be filtered or scored, regardless of the folder they appear in. Mails you composed, but then received *are* still filterable - it is just the original that isn't. Mark. -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Coe, Bob Sent: Saturday, 7 February 2004 12:23 AM To: spambayes@Python.org Subject: [Spambayes] Funny behavior of Outlook plugin R. K. Coe I don't know whether this is a bug or a feature; but it was a bit unexpected, so I thought I'd pass it along. I use the Outlook plugin (0.85) with Office 2000 under Windows XP Pro. Last night I received a real message that got misclassified as spam. I unthinkingly answered it before moving it back to my inbox, so my answer went into the spam folder. (Outlook does that if you answer a message in a folder other than your inbox.) So I selected it and clicked the "Recover" button, whereupon I got the "No filterable mail items are selected" popup, and the message didn't move. I was able to move it without using Spambayes, and the Recover button did, of course, work on the original message. My guess is that this behavior is a side effect of an implicit decision not to assign any training value to a message that hasn't been scanned. That's not illogical, but I don't think that particular error message (whose ubiquity belies its lack of information content) quite conveys the reasoning. No big deal. It just wasn't what I would have expected. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2768 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040207/d9dfb224/winmail.bin From dreas at emailaccount.nl Fri Feb 6 19:13:42 2004 From: dreas at emailaccount.nl (Dreas van Donselaar) Date: Fri Feb 6 19:13:39 2004 Subject: [Spambayes] Norton antivirus Message-ID: Hi all, I try to get all virus warnings filtered out as SPAM as well. Norton Antivirus for example replaces a virus attachment with "Norton AntiVirus Deleted1.txt". SpamBayes does not notice this file name however. Furthermore I receive a lot of delivery failures for virus emails (send using one of my domainnames as sender email address). These virus emails are attached to the email but do not seem to be scanned by SpamBayes and are therefore not recognised. Wouldn't it be an improvement to check the attached file names and content of attached files as well? Regards, Dreas van Donselaar -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040207/31d7b7c4/attachment.html From tim.one at comcast.net Fri Feb 6 22:06:12 2004 From: tim.one at comcast.net (Tim Peters) Date: Fri Feb 6 22:06:12 2004 Subject: [Spambayes] Norton antivirus In-Reply-To: Message-ID: [Dreas van Donselaar] > I try to get all virus warnings filtered out as SPAM as well. Norton > Antivirus for example replaces a virus attachment with "Norton > AntiVirus Deleted1.txt". SpamBayes does not notice this file name > however. > > Furthermore I receive a lot of delivery failures for virus emails > (send using one of my domainnames as sender email address). These > virus emails are attached to the email but do not seem to be scanned > by SpamBayes and are therefore not recognised. > > Wouldn't it be an improvement to check the attached file names and > content of attached files as well? You must be using the Outlook addin, yes? Any other way of using the SpamBayes codebase does know the names of attachments, and does pick apart all MIME attachments with type text/*. The Outlook addin does neither, because Outlook destroys the original MIME structure of incoming email, and so our project's MIME parser has nothing left to work with. This won't change until someone contributes code to do a better job of trying to guess (and recreate) the original MIME structure, from all the places Outlook scattered the pieces. That said, I get lots of these kinds of virus bounces too, and after patient training they never show up in my Outlook Inbox anymore. I get a few new ones in Unsure each day now, but usually in a language I don't recognize . From Rich at RBarger.com Fri Feb 6 22:09:20 2004 From: Rich at RBarger.com (Richard B Barger ABC APR) Date: Fri Feb 6 22:09:26 2004 Subject: [Spambayes] Please help with installation for use with Netscape Mail Message-ID: <402456E0.DDB50E0A@RBarger.com> How do I install SpamBayes for use with Netscape Mail? Unless I've just missed it, the only information I can find on the SpamBayes project site is an unhelpful: "If you use any other mail client on Windows, you currently need to install a recent version of Python and the Spambayes source1, then setup the proxy server (POP3 or IMAP) which corresponds to the way in which you transfer mail from your mail server." I'm not a programmer, just an experienced computer end-user. But I need a set of baby-steps instructions to get this sucker working. Every time I get started, I end up quitting in frustration. Windows XP Pro - Netscape Communicator 4.79 - lotsa lotsa memory and hard drive space - DSL connection - Norton AntiVirus - ZoneAlarm Pro BTW, I'm the poster boy for spam. I receive 600-700 messages each day on three accounts; more than 2/3 are spam. Please help. Baby steps. Thank you. Rich Barger Kansas City From rural_cdn at fastmail.fm Sat Feb 7 10:07:49 2004 From: rural_cdn at fastmail.fm (rural_cdn@fastmail.fm) Date: Sat Feb 7 10:07:53 2004 Subject: [Spambayes] Long load times. Message-ID: <20040207150749.5322E157F2C@mail.messagingengine.com> What determines the size of this file default_bayes_database.db ? I'm trying to cut down on the long load times I have. Will deleting my spam messages help? Will that decrease the size of the above file? WinXP Outlook2k3 TIA -- rural_cdn@fastmail.fm Who puts the thin ice signs out there anyways? -- http://www.fastmail.fm - mmm... Fastmail... From gcoon at carolina.rr.com Sat Feb 7 12:21:59 2004 From: gcoon at carolina.rr.com (Gilbert Coon) Date: Sat Feb 7 12:22:19 2004 Subject: [Spambayes] Advice Message-ID: Whenever I close Microsoft's Outlook 2002, I get the error reporting box indicating that a problem has developed and asking if I want to send the report to Microsoft-in spite of the fact that I have "unclicked" error reporting. Any ideas about how to solve this problem? It's a pain in the butt to have to uncheck the "reopen MS Outlook" box and click the "Don't Send." Gil Coon -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040207/8000ca40/attachment.html From momsheart at charter.net Sat Feb 7 14:55:51 2004 From: momsheart at charter.net (Maria Gagnon) Date: Sat Feb 7 14:58:57 2004 Subject: [Spambayes] Incredamail Message-ID: <402542C7.000003.01596@gods-army-host> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 7822 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040207/a58358e9/attachment.jpe From skip at pobox.com Sat Feb 7 15:22:12 2004 From: skip at pobox.com (Skip Montanaro) Date: Sat Feb 7 15:22:16 2004 Subject: [Spambayes] Incredamail In-Reply-To: <402542C7.000003.01596@gods-army-host> References: <402542C7.000003.01596@gods-army-host> Message-ID: <16421.18676.144053.385407@montanaro.dyndns.org> Maria> Will SB work with Incredamail? It's impossible to tell from their website, but if it supports POP3 or IMAP, then yes, Spambayes will work with it. Skip From ahutton at ahutton.com Sat Feb 7 15:47:45 2004 From: ahutton at ahutton.com (Allen Hutton) Date: Sat Feb 7 15:47:57 2004 Subject: [Spambayes] user name Message-ID: An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040207/c2ab079f/attachment.html From dbstanley at quiet-storm.org Sat Feb 7 17:32:11 2004 From: dbstanley at quiet-storm.org (David & Felica Stanley) Date: Sat Feb 7 17:32:06 2004 Subject: [Spambayes] Scoring and filtering in multiple PST files Message-ID: Outlook 2003 (11.5608.5606) Windows XP Pro, all updates installed SpamBayes Outlook addin v0.81 The SpamBayes addin seems to score and automatically filter only messages in one PST file, if multiple PST files are used with Outlook. My main PST files ("Personal Folders") has only the standard folders (Inbox, Outbox, Sent Items, Junk E-Mail, Junk Suspects, Drafts, Deleted Items). I have another PST file (okay, actually I have 3 more) with several folders. I have Outlook rules set up to move messages into a few of these folders "after they arrive." I have set up SpamBayes to filter 3 of these folders (in the 2nd PST), as well as the Inbox folder in the main PST file. Although messages that end up in my Inbox folder are being scored and automatically filtered, messages in the second PST files are not. The "Spam" field shows nothing (blank) for messages in filtered folders in the second PST file. However, the "Show Spam clues for current message" option shows high scores (100% in some cases) for Spam messages in these folders. If I force filter-these messages using the "Filter Messages..." option, the Spam messages are filtered and moved to the Junk E-mail folder. Please help. Let me know if you need any other information. David Stanley -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040207/e3a95d31/attachment-0001.html From KYamamoto at ielinc.com Sat Feb 7 20:10:40 2004 From: KYamamoto at ielinc.com (Ken Yamamoto) Date: Sat Feb 7 20:11:03 2004 Subject: [Spambayes] Spambayes won't completely remove from Outlook Message-ID: Hello, I received an email that disabled Spambayes so that it no longer works. I tried reinstalling, completely removing spambayes and reinstalling it, and even reinstalling Outlook, with no success. My question is how do I completely remove all traces of the spambayes outlook plug-in so that I can then reinstall spambayes and have it work again. Thank you. - Ken Yamamoto -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040207/538195f0/attachment.html From tameyer at ihug.co.nz Sat Feb 7 20:38:09 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 7 20:38:33 2004 Subject: [Spambayes] Can't tell if it's working In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3647@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778F8@its-xchg4.massey.ac.nz> > When I click on Tools, go to Com Add-Ins; SpamBayes is not listed. > It should be there right? I re-installed and opened Outlook and > it still was not there. This is a known bug with the 008.1 release, and will be fixed in the next release. If SpamBayes is working, you'll have a fully functional SpamBayes toolbar. If it's never run successfully, then you won't have a SpamBayes toolbar at all, and if it once worked but now doesn't, you'll have a SpamBayes toolbar that doesn't work. In any of these cases, you should have log files (in your temp directory, as described in the troubleshooting guide (a copy is on the website)), which would help in diagnosing any problem. If it isn't working (i.e. if there was more of a problem than the plug-in not being listed), please let us know what's not right (and include a copy of the log) and we'll try and advise you on how to fix it. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 7 20:45:33 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 7 20:45:51 2004 Subject: [Spambayes] Bayes Outlook Plugin & POP Proxy DBs In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC367B@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A93@its-xchg4.massey.ac.nz> > - is there an easy way how I can move my existing Spam > databases across to the Linux based Spambayes pop3 proxy so > that I can avoid retraining? Yes. You *might* be able to simply copy the default_bayes_database.db file over and have it work. Certainly if you're using a pickle, this should work. If you're using bsddb (probably the case), then this might not work. In the source release (i.e. the one you'll have on the Linux system) there's a script called sb_dbexpimp.py. This can convert to & from the various database formats. Probably the most trouble-free solution would be to run it on the Windows system (you'll need Python installed), converting the database to a pickle (or text, but a pickle is probably easier). Copy this to the Linux system (calling it .hammiedb if you want), and either convert it back to some sort of dbm, or just use it as a pickle, depending on your needs. > - from what I've seen it seems that the pop3 proxy is > proxying to fixed pop3 servers only - is there a reason for > this? To support multiple users which use different pop > servers I think it would be more flexible to have the pop3 > userid in the customer's mail client to be of the form > "user@pop-server:port" and the proxy can thus interface with > arbitrary pop servers This is how the (rarely used; mostly example) scripts in the pspam directory work. You could modify sb_server to work this way if you wanted to. I can't recall the reason that pop3proxy.py chose to do it this way (I wasn't a developer at the time, although I imagine I must have read about it on the list), but there are advantages either way. Certainly the way that sb_server does it makes things much simpler to setup in the mail client (and simpler to change back and forth). Unless there are a huge number of servers that you need to proxy, it doesn't seem that it's that difficult to list them all in the one option that needs to have them. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 7 20:51:22 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 7 20:51:38 2004 Subject: [Spambayes] Spambayes won't completely remove from Outlook In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3ABE@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778FA@its-xchg4.massey.ac.nz> > I received an email that disabled Spambayes so > that it no longer works. I tried reinstalling, > completely removing spambayes and reinstalling it, and > even reinstalling Outlook, with no success. My question > is how do I completely remove all traces of the spambayes > outlook plug-in so that I can then reinstall spambayes > and have it work again. I think what you're actually after is the list of disabled plug-ins in Outlook. From the Help menu, choose About Microsoft Outlook, then in the dialog box, click the Disabled Items button. SpamBayes is probably listed there, and so you can fix it from there. Otherwise, the uninstaller should remove all non-user data. The 008.1 release does have some troubles with this, IIRC, but the next release should fix that. For the moment, you'd have to run the uninstaller, then manually delete any files and registry entries that are left. This is unlikely to be the case of your problem, though. One other issue is that user data is *not* removed on uninstall (this is deliberate, and common among software). This includes your configuration file, so if the problem is actually with your setup, then this might be the problem. If the disabled items fix above doesn't work, you could try deleting/renaming your configuration file and seeing if that helps (this still won't lose any of your training data). The FAQ explains where these files are found. One last thing is that if none of this works, then attaching your log files to an error report would help track things down - the troubleshooting guide (a copy is on the website) explains where these are. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 7 20:55:33 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 7 20:55:51 2004 Subject: [Spambayes] user name In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3A60@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778FB@its-xchg4.massey.ac.nz> > my ISP requires name to be in format of nek@snet.net setup > program will not take name in this format > isp will not accept name of "nek" without the snet.net What setup program? It doesn't look like you're using Outlook, so is this with sb_server and do you mean the configuration page? (Which still doesn't make sense, because sb_server (and Outlook!) doesn't need to know what your username is - you still just have this in your mail client. As far as I can recall, only sb_imapfilter (and maybe notesfilter?) needs your username to work. But imapfilter accepts any valid IMAP 'astring', which certainly includes '@'. Could you give us some more information about: 1. What SpamBayes application you are trying to setup. 2. How you are doing this (via the web interface? Manually in a config file?) 3. Where you are trying to put your username. That'll help us figure out what you're trying to do, and help you get it done. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 7 20:57:41 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 7 20:57:58 2004 Subject: [Spambayes] Advice In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3A12@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778FC@its-xchg4.massey.ac.nz> > Whenever I close Microsoft's Outlook 2002, I get > the error reporting box indicating that a problem has > developed and asking if I want to send the report to > Microsoft-in spite of the fact that I have "unclicked" > error reporting. Any ideas about how to solve this problem? Is this definitely related to SpamBayes? Try disabling SpamBayes and seeing if the problem goes away, or uninstalling it and seeing if that changes it (if you uninstall and then reinstall later you don't lose any of your configuration or training information). You might try Outlook's "Detect and Repair" function (from the Help menu), if you have the CD that you installed Outlook from. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 7 21:05:16 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 7 21:06:42 2004 Subject: [Spambayes] Long load times. In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC39E5@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A94@its-xchg4.massey.ac.nz> > What determines the size of this file default_bayes_database.db ? The number of unique tokens that are in all the messages that you have trained. In practice, given that messages tend to have the same tokens (or this whole thing wouldn't work!), the number of messages that you have trained. > I'm trying to cut down on the long load times I have. Will > deleting my spam messages help? Will that decrease the size > of the above file? Having a smaller database file could help, yes. (The way to check this (and see how much you gain) would be to rename the file, see how fast it loads then (it'll create a new, empty, database), and then delete the new database and put the old one back). How many messages have you trained? You can get quite good results with just a couple of hundred of each - if you have several thousand of each, then an easy way to fix this would be to retrain with a smaller sample. For example, I currently have 89 ham and 195 spam trained (note that it would be better to have roughly equal numbers of ham and spam), and get good results *. 'Optimum' size of the ham & spam corpora is something that isn't really known at this point in time. If you are more concerned about load times, then it would certainly be worth giving the 'minimal db' scheme a try. For example, only train on mistakes (false positives, false negatives, and unsures), and see how that goes. =Tony Meyer * My database file is 5Mb, but I have two experimental options (x-use_bigrams and x_slurp_urls with x-web_prefix) enabled that make the database many times bigger than it otherwise would be. If you only had ~400 messages trained I would expect that the file would be less than half that size. --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 7 21:09:13 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 7 21:09:30 2004 Subject: [Spambayes] Where did it go? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3865@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778FE@its-xchg4.massey.ac.nz> > I installed your program and can't seem to find or get > new email. I've searched for a holding file but without > success. Is there a way to configure the program by > getting my email and then letting Spambayes know what to > do with it? If you haven't done any training at all, then all your filtered mail will score 0.5, right in the middle of the 'unsure' range. This means all the mail will end up in the folder you've chosen for 'unsure' messages - maybe called "Possible Junk". Try using Outlook's Advanced Find feature to find a message - they should be all together (if you don't know what to search for, just send yourself a message, and then search for that one). Once you're in the unsure folder, there will be two buttons on the SpamBayes toolbar that you can use to do training - "Delete as Spam" and "Recover from Spam". Click the appropriate one for each message, and the appropriate training will be done. Once you have some training data, future messages will be classified more correctly, although you should continue to periodically review both the unsure/Possible Junk and spam/Junk folders. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 7 21:12:48 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 7 21:13:06 2004 Subject: [Spambayes] Spambayes Compatibility In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3758@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046778FF@its-xchg4.massey.ac.nz> > I am using Pegasus Win-Pmail32 and Windows XP. Can Spambayes > be used with the WinPmail client? If not, with what is it > compatible?-- Yes it can, although you don't get a plug-in like with Outlook, you need to use the sb_server POP3 proxy. You'll get the same classification results, but it's not an integrated solution (but does include a very nice web-based interface). The information on the website about "sb_server" is what you're after, although you might want to wait until the end of this week, by which time a binary installer should be out that includes sb_server, which makes the install process somewhat simpler. The spambayes-announce list (and probably this one) will have a notice announcing the release, once it's done. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 7 21:15:50 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 7 21:16:07 2004 Subject: [Spambayes] Please help with installation for use with Netscape Mail In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC38DC@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677900@its-xchg4.massey.ac.nz> > How do I install SpamBayes for use with Netscape Mail? > Unless I've just missed it, the only information I can find > on the SpamBayes project site is an unhelpful: [...] > I'm not a programmer, just an experienced computer end-user. > But I need a set of baby-steps instructions to get this > sucker working. Every time I get started, I end up quitting > in frustration. Have you tried going through the steps in the readme? If you have, and anything in there is confusing or unclear, it would really help if you could tell us what is unclear, and help us figure out a way to write it so that it is clearer. We're developers, not technical writers, and so need all the help we can get . One other option is to wait a few more days and try the new binary installer, which makes the process much more simple. It should be out at some point this coming week - it'll be announced on the spambayes-announce list, and probably here, too (and definitely on the website). There's still a little bit of configuration, but much less than using the current source release. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 7 21:23:38 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 7 21:23:57 2004 Subject: [Spambayes] RE: [spambayes-bugs] uncaptured python exception ... object has noattribute 'isClosed' In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3AC8@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A95@its-xchg4.massey.ac.nz> [redirected from the inappropriate spambayes-bugs list] Please note that the spambayes-bugs list isn't a list for humans to post to, so if you don't want to see the machine-generated email that appears on that list, don't subscribe to it at all. Descriptions of the 5 available mailing lists are here: http://spambayes.sourceforge.net/contact.html It's best to enter bug reports on the SpamBayes bug tracker (also at SourceForge), but if that's too hard then mailing to the spambayes list has a decent chance of getting noticed. [...] > This is working very well, except about once every 24 hours > spambayes quits with this message: > > error: uncaptured python exception, closing channel > <__main__.BayesProxyListener listening :1110 at 0x83620d4> > (exceptions.IOError:[Errno 5] Input/output error This is probably some problem with reaching the pop server. > error: uncaptured python exception, closing channel > <__main__.BayesProxy connected 127.0.0.1:42864 at 0x841acac> > (exceptions.AttributeError:'_socket.socket' object has no > attribute 'isClosed' [/usr/lib/python2.2/asyncore.py|poll|113] I assume this is a consequence of the previous error, because it's very odd (the object from which a 'isClosed' attribute is being requested is not of type _socket.socket). > Apart from the once-a-day-or-so failure rate, I have not seen > any pattern. I have wondered whether the failures might be > related to one of the following reasons: > > 1) The remote pop3 server is periodically unreachable, often for > several minutes. Could this cause spambayes to fail? Yes, I suspect that this could be the problem. There is an open bug report about SpamBayes failing to recover after a timeout, which sounds (I think) fairly similar: If so, once this problem is solved, yours should be as well. I'm not sure what to suggest until then :( > 2) It is not clear to me what happens if fetchmail has a polling > interval which is too short. Could it be that fetchmail is > kicking off a new session *before* the previous session has > completed? That shouldn't matter, AFAIK. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 7 21:29:34 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 7 21:29:51 2004 Subject: [Spambayes] RE: [spambayes-bugs] parse error - undefined boundary - insb_imapfilter.py In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC36F0@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677902@its-xchg4.massey.ac.nz> > just wanted to let you know that I am using the > latest version of sb_imapfilter but getting crashes > due to parse errors. > It would be nice (and possible) to make the system > handle this gracefully and just dump the message > into the unsure box (or maybe user definable). The next version handles this better; it simply ignores any messages that it can't parse (and gradually, both the SpamBayes code and the Python email package are improving so that fewer messages will be unparsable). This way they'll just be left untouched in whatever folder they are in. Trying to move them about could be problematic since they're malformed, and I think it's wiser to leave them as is. That said, if you really would like them moved somewhere, open a feature request for it on sourceforge (http://sf.net/projects/spambayes) and it'll be considered at some point. The next version should hopefully be out at some point in this coming week. The spambayes-announce list (and the website) will note when that is the case - the announce list is extremely low volume (release announcements only), so you might want to subscribe to that. > I am attaching a zip of a .PST file containing 12 messages > that will each cause this problem so that you can properly > evaluate it. The messages are Outlook messages. Out of curiosity - if you're attaching messages as Outlook messages, that hints that you're using Outlook to read your mail. Any reason why you're not using the Outlook plug-in? In general, it's a much better tool to access the SpamBayes engine than sb_imapfilter. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From dontbotherworld at yahoo.com Sat Feb 7 21:53:42 2004 From: dontbotherworld at yahoo.com (dont bother) Date: Sat Feb 7 21:53:44 2004 Subject: [Spambayes] Whitelist and Blacklists Message-ID: <20040208025342.15951.qmail@web60810.mail.yahoo.com> Hi, Can someone point me to some links/papers, which just talk about ways like whitelists, blacklists to deal with spammers. And with what protocols, tricks these whitelists and blacklists can be improved? Thanks Dont __________________________________ Do you Yahoo!? Yahoo! Finance: Get your refund fast by filing online. http://taxes.yahoo.com/filing.html From rh at bozamar.com Sun Feb 8 10:30:10 2004 From: rh at bozamar.com (R. Harbauer) Date: Sun Feb 8 10:30:16 2004 Subject: [Spambayes] spambayes backup folders-which ones Message-ID: <000501c3ee58$701b35b0$0401a8c0@Roadwarrior> 1. I need to reformat by computer. Which files do I save/backup so that I retain the effectiveness of spambayes when I reinstall it? I could not find any backup info on the site or help files. Sincerely Robert Harbauer -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040208/33f25659/attachment.html From gcoon at carolina.rr.com Sun Feb 8 12:58:18 2004 From: gcoon at carolina.rr.com (Gilbert Coon) Date: Sun Feb 8 12:58:36 2004 Subject: [Spambayes] Advice In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13046778FC@its-xchg4.massey.ac.nz> Message-ID: Thanks for the response, Tony. Yes, I deactivated SpamBayes and the problem went away. Then I uninstalled SpamBayes and reinstalled it; old problem returned. I also read all the "Known Bugs Reports" and the Forum Q&As on the SpamBayes Website and could not find this problem discussed or any resolution ideas. I did try Outlook's "Detect and Repair" function and that did not solve the problem. When I shut down Outlook, re-opened it, checked for new messages, and then shut Outlook down again, I got the same dialogue box about restarting Outlook and sending an error report to Microsoft. Any suggestions? Gil Coon -----Original Message----- From: Tony Meyer [mailto:tameyer@ihug.co.nz] Sent: Saturday, February 07, 2004 8:58 PM To: 'Gilbert Coon'; spambayes@python.org Subject: RE: [Spambayes] Advice > Whenever I close Microsoft's Outlook 2002, I get > the error reporting box indicating that a problem has > developed and asking if I want to send the report to > Microsoft-in spite of the fact that I have "unclicked" > error reporting. Any ideas about how to solve this problem? Is this definitely related to SpamBayes? Try disabling SpamBayes and seeing if the problem goes away, or uninstalling it and seeing if that changes it (if you uninstall and then reinstall later you don't lose any of your configuration or training information). You might try Outlook's "Detect and Repair" function (from the Help menu), if you have the CD that you installed Outlook from. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From gisli.ottarsson at mscsoftware.com Sun Feb 8 15:44:47 2004 From: gisli.ottarsson at mscsoftware.com (Gisli Ottarsson) Date: Sun Feb 8 15:45:38 2004 Subject: [Spambayes] RE: [spambayes-bugs] uncaptured python exception ... object has noattribute 'isClosed' In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A95@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A95@its-xchg4.massey.ac.nz> Message-ID: <1076273086.5498.17.camel@mjolnir> Thanks for the response and the clarifications about spambayes-bugs. I am pretty sure that the problem was that when my VPN was going down I was loosing DNS service and the pop server name wasn't being resolved. I have added the pop server to my /etc/hosts and the problem has not reoccurred. Gisli On Sun, 2004-02-08 at 02:23, Tony Meyer wrote: > [redirected from the inappropriate spambayes-bugs list] > > Please note that the spambayes-bugs list isn't a list for humans to post to, > so if you don't want to see the machine-generated email that appears on that > list, don't subscribe to it at all. Descriptions of the 5 available mailing > lists are here: > > http://spambayes.sourceforge.net/contact.html > > It's best to enter bug reports on the SpamBayes bug tracker (also at > SourceForge), but if that's too hard then mailing to the spambayes list has > a decent chance of getting noticed. > > [...] > > This is working very well, except about once every 24 hours > > spambayes quits with this message: > > > > error: uncaptured python exception, closing channel > > <__main__.BayesProxyListener listening :1110 at 0x83620d4> > > (exceptions.IOError:[Errno 5] Input/output error > > This is probably some problem with reaching the pop server. > > > error: uncaptured python exception, closing channel > > <__main__.BayesProxy connected 127.0.0.1:42864 at 0x841acac> > > (exceptions.AttributeError:'_socket.socket' object has no > > attribute 'isClosed' [/usr/lib/python2.2/asyncore.py|poll|113] > > I assume this is a consequence of the previous error, because it's very odd > (the object from which a 'isClosed' attribute is being requested is not of > type _socket.socket). > > > Apart from the once-a-day-or-so failure rate, I have not seen > > any pattern. I have wondered whether the failures might be > > related to one of the following reasons: > > > > 1) The remote pop3 server is periodically unreachable, often for > > several minutes. Could this cause spambayes to fail? > > Yes, I suspect that this could be the problem. There is an open bug report > about SpamBayes failing to recover after a timeout, which sounds (I think) > fairly similar: > > d=61702> > > If so, once this problem is solved, yours should be as well. I'm not sure > what to suggest until then :( > > > 2) It is not clear to me what happens if fetchmail has a polling > > interval which is too short. Could it be that fetchmail is > > kicking off a new session *before* the previous session has > > completed? > > That shouldn't matter, AFAIK. > > =Tony Meyer > > --- > Please always include the list (spambayes@python.org) in your replies > (reply-all), and please don't send me personal mail about SpamBayes. This > way, you get everyone's help, and avoid a lack of replies when I'm busy. > From dontbotherworld at yahoo.com Sun Feb 8 19:26:53 2004 From: dontbotherworld at yahoo.com (dont bother) Date: Sun Feb 8 19:26:56 2004 Subject: [Spambayes] Ideas for an MSc project please... In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A01A75D9A@cliff.bai.org> Message-ID: <20040209002653.2583.qmail@web60804.mail.yahoo.com> > 4) Improving Bayesian spam filtering at the SMTP > gateway level. Why is > it less effective, what can be done to improve it, Hey can you elaborate on that? I am a newbie so if you could explain me step by step on this, it would be great Thanks dont __________________________________ Do you Yahoo!? Yahoo! Finance: Get your refund fast by filing online. http://taxes.yahoo.com/filing.html From dontbotherworld at yahoo.com Sun Feb 8 19:34:20 2004 From: dontbotherworld at yahoo.com (dont bother) Date: Sun Feb 8 19:34:23 2004 Subject: [Spambayes] Why Improve Bayesian In-Reply-To: <20040209002653.2583.qmail@web60804.mail.yahoo.com> Message-ID: <20040209003420.21218.qmail@web60810.mail.yahoo.com> Hey heres is quick question for you guys, I always wonder whats the need for inventing new techniques when most of the bayesian filters are giving accuracy >99.7%? Is that in time these bayesian techniques or method will become less effective? Thanks Dont __________________________________ Do you Yahoo!? Yahoo! Finance: Get your refund fast by filing online. http://taxes.yahoo.com/filing.html From bgtees at hotmail.com Sun Feb 8 19:36:49 2004 From: bgtees at hotmail.com (Brian Tees) Date: Sun Feb 8 19:36:53 2004 Subject: [Spambayes] Outlook 2002 - how can I delete ONLY spam from server automatically? Message-ID: <002c01c3eea4$cdd2fe30$6500a8c0@griffy> I've been using Outlook 2002 and Spambayes for a while now and I've been very happy with it. What I'm trying to figure out is a way to get Outlook to delete the mail that Spambayes has classified as "definitely spam" from the server while leaving everything else there. I've been banging my head against the wall and can't figure out a way to do this. Any help greatly appreciated. I found some posts where people had been able to accomplish this in Outlook 2000 by creating a new .pst file, but I tried that with Outlook 2002 and it doesn't work. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040208/f840e3a5/attachment.html From tim.one at comcast.net Sun Feb 8 19:52:25 2004 From: tim.one at comcast.net (Tim Peters) Date: Sun Feb 8 19:52:29 2004 Subject: [Spambayes] Why Improve Bayesian In-Reply-To: <20040209003420.21218.qmail@web60810.mail.yahoo.com> Message-ID: [dont bother] > Hey heres is quick question for you guys, > I always wonder whats the need for inventing new > techniques when most of the bayesian filters are > giving accuracy >99.7%? Well, why not? For some people it's an interesting problem. For personal classifiers it probably doesn't matter, but for people with truly high-volume email loads, an error rate of 0.3% is still massive; e.g., late last year AOL reported blocking 2 billion spam per day, and MSN 2.4 billion spam per day. If 0.3% of them leak, that would be about 8 million uncaught spam per day for MSN. From gollners at telus.net Sun Feb 8 19:27:08 2004 From: gollners at telus.net (The Gollners) Date: Sun Feb 8 19:52:52 2004 Subject: [Spambayes] installation error message Message-ID: <000001c3eea3$747d1420$b97ba8c0@CINDY> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 109385 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040208/ae60ef31/attachment-0001.png -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 47143 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040208/ae60ef31/attachment-0001.jpe From wsy at merl.com Sun Feb 8 19:56:04 2004 From: wsy at merl.com (Bill Yerazunis) Date: Sun Feb 8 19:56:08 2004 Subject: [Spambayes] Why Improve Bayesian In-Reply-To: <20040209003420.21218.qmail@web60810.mail.yahoo.com> (message from dont bother on Sun, 8 Feb 2004 16:34:20 -0800 (PST)) References: <20040209003420.21218.qmail@web60810.mail.yahoo.com> Message-ID: <200402090056.i190u4504763@localhost.localdomain> From: dont bother Hey heres is quick question for you guys, I always wonder whats the need for inventing new techniques when most of the bayesian filters are giving accuracy >99.7%? Is that in time these bayesian techniques or method will become less effective? Thanks Dont The reason is that humans are 99.84% accurate, so you want to be MUCH more accurate than that, so you never have to read the spam folder. If you can make it 99.999% accurate, then you can save a lot of time and storage. -Bill Yerazunis From tim at fourstonesExpressions.com Sun Feb 8 20:02:33 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Sun Feb 8 20:02:41 2004 Subject: [Spambayes] Why Improve Bayesian In-Reply-To: <20040209003420.21218.qmail@web60810.mail.yahoo.com> References: <20040209003420.21218.qmail@web60810.mail.yahoo.com> Message-ID: On Sun, 8 Feb 2004 16:34:20 -0800 (PST), dont bother wrote: > Hey heres is quick question for you guys, > I always wonder whats the need for inventing new > techniques when most of the bayesian filters are > giving accuracy >99.7%? > Is that in time these bayesian techniques or method > will become less effective? > Thanks > Dont I hope the drivers that pass me on the road going the opposite direction are a LOT better than 99.7% accurate... if you think about it, 99.7% ain't all that good... -- Exprimez vous!; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From rcoe at CambridgeMA.GOV Sun Feb 8 20:50:15 2004 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Sun Feb 8 20:50:18 2004 Subject: [Spambayes] RE: Bayes Outlook Plugin & POP Proxy DBs Message-ID: <42393C9DA7930245AB540667607F4F5022C26E@SPIKE.city> > From: spambayes-bounces+rcoe=cambridgema.gov@python.org > [mailto:spambayes-bounces+rcoe=cambridgema.gov@python.org]On Behalf Of > Tony Meyer > Sent: Saturday, February 07, 2004 8:46 PM > To: 'Patrick Dockhorn'; spambayes@python.org > Subject: RE: [Spambayes] Bayes Outlook Plugin & POP Proxy DBs > > > > - is there an easy way how I can move my existing Spam > > databases across to the Linux based Spambayes pop3 proxy so > > that I can avoid retraining? > > Yes. You *might* be able to simply copy the default_bayes_database.db file > over and have it work. Certainly if you're using a pickle, this should > work. If you're using bsddb (probably the case), then this might not work. > ... I hadn't realized that the Outlook plugin offered a choice of database formats. Is this option described anywhere? Is it crazy to even think about deviating from the default? Bob From cej at intech.com Mon Feb 9 01:00:17 2004 From: cej at intech.com (Christopher Jastram) Date: Sun Feb 8 21:44:14 2004 Subject: [Spambayes] Ideas for an MSc project please... In-Reply-To: <20040209002653.2583.qmail@web60804.mail.yahoo.com> References: <20040209002653.2583.qmail@web60804.mail.yahoo.com> Message-ID: <402721F1.1000606@intech.com> dont bother wrote: >>4) Improving Bayesian spam filtering at the SMTP >>gateway level. Why is >>it less effective, what can be done to improve it, >> >> > >Hey can you elaborate on that? I am a newbie so if you >could explain me step by step on this, it would be >great >Thanks >dont > > Sure. Providing a point-and-click installer that makes "Delete as Spam" and "Recover from Spam" buttons magically appear on the Outlook toolbar is cool. Asking users to forward spam to "spam@company.com" and an equal amount of ham to "ham@company.com" is a PITA for all involved. (Never mind trying to explain what "ham" is...) Also, server-side filtering is a total f**k to set up (pardon the profanity), especially in a user-specific manner (since Bayesian filtering really doesn't work using the same database for multiple users). It also takes up a snotload of resources, which is Not A Good Thing(tm) on a busy mail server. For example, before the MyDoom virus, we were processing 10 to 11 thousand emails every day. When MyDoom hit, we started processing 350 thousand emails. Filled up the SYN_RECV queue, and took the machine (and our network) to its knees. The first thing I did was strip the bayesian filtering out, and promptly watched the mail thoroughput quadruple. Server-side bayesian filtering (or any content filtering, for that matter) is *expensive*. We are currently purchasing two 64-bit AMD 3GHz machines with mirrored hard drives to handle this kind of load, because we CAN NOT let valuable mail bounce. (We were running a 667 MHz Celeron w/ 128 mb ram.) Hope this hard-edged voice of experience helps a little. :) Christopher Jastram From dontbotherworld at yahoo.com Sun Feb 8 22:32:32 2004 From: dontbotherworld at yahoo.com (dont bother) Date: Sun Feb 8 22:32:35 2004 Subject: [Spambayes] : Stopping spam at SMTP Level In-Reply-To: <402721F1.1000606@intech.com> Message-ID: <20040209033232.7984.qmail@web60802.mail.yahoo.com> Hey Chris, Thanks for that. I have a couple of questions on that. > Also, server-side filtering is a total f**k to set > up (pardon the > profanity), especially in a user-specific manner > (since Bayesian > filtering really doesn't work using the same > database for multiple > users). It also takes up a snotload of resources, What makes it so difficult for Bayesian Filters to filter the spam using the same database for multiple users? Also, when you mentioned about SMTP, I have this question, why is there no solution of stopping SPAM at the SMTP Level? Comeon, are we impotent to stop the spammers from using this bandwidth in the first place? Thanks Dont __________________________________ Do you Yahoo!? Yahoo! Finance: Get your refund fast by filing online. http://taxes.yahoo.com/filing.html From cej at intech.com Mon Feb 9 02:10:27 2004 From: cej at intech.com (Christopher Jastram) Date: Sun Feb 8 22:54:24 2004 Subject: [Spambayes] : Stopping spam at SMTP Level In-Reply-To: <20040209033232.7984.qmail@web60802.mail.yahoo.com> References: <20040209033232.7984.qmail@web60802.mail.yahoo.com> Message-ID: <40273263.9090202@intech.com> dont bother wrote: >Hey Chris, >Thanks for that. >I have a couple of questions on that. > > > >>Also, server-side filtering is a total f**k to set >>up (pardon the >>profanity), especially in a user-specific manner >>(since Bayesian >>filtering really doesn't work using the same >>database for multiple >>users). It also takes up a snotload of resources, >> >> > >What makes it so difficult for Bayesian Filters to >filter the spam using the same database for multiple >users? > > Not sure. I think it's because different people have different ham/spam profiles. You'll get a database that allows ham, but doesn't really catch spam very efficiently. Everybody's idea of "ham" will poison the database to a certain extent. Nice idea, but doesn't really work well. >Also, when you mentioned about SMTP, I have this >question, why is there no solution of stopping SPAM at >the SMTP Level? >Comeon, are we impotent to stop the spammers from >using this bandwidth in the first place? > >Thanks >Dont > > Yeah, basically, we're a bunch of neutered impotents. :) The problem is not bandwidth so much as processing time. Somebody has to take the mail apart, analyize it, and classify it. The further up the chain that processing is done, the more processing must be done, and the more expensive it is in terms of CPU time (and yes, CPU time can still be expensive). Ideally, it should be done on the spammer's ISP side, but that seems to be rare (judging by the amount of email that comes into our network). My solution works like this: 1) Postfix accepts the mail, checks to see if it's sent to a valid user 2) If it is, run it through spambayes via content_filter, which re-injects the mail into the system. That "run it through spambayes" script looks at the "to: " mail header and uses the appropriate user-specific database accordingly. 3) Postfix hands it off to Cyrus, which delivers via POP3 or IMAP. Could it get any higher-level? I don't think so. A lot of the spam we get is bounces from remote mail servers. Spammers spoof our domain, and we get the "invalid-user" bounces. Sick. I've been just discarding everything that's from mailer-daemon and not to a valid local user. Chris Jastram From dontbotherworld at yahoo.com Sun Feb 8 23:20:53 2004 From: dontbotherworld at yahoo.com (dont bother) Date: Sun Feb 8 23:20:56 2004 Subject: [Spambayes] : Stopping spam at SMTP Level In-Reply-To: <40273263.9090202@intech.com> Message-ID: <20040209042053.65778.qmail@web60807.mail.yahoo.com> Hey, Here is a couple of another questions. They may sound vague. People have been implementing filters such as bayesian etc etc on the mailservers to stop spam which has come all the way and used up all the bandwidth. I am so surprised that no one has ever thought of finding out a solution that does not allow the f***ing spammers to send so many mass emails, porn etc. I have come across a few like some payment solutions but none of that has yet been implemented. Why not to modify the exisiting SMTP Applications which incorporate prevention of the abuse of email? Is int the http protocol has been modified to https where security is needed and SSL 1.0, SSL 2.0 has all been replaced by SSL 3.0 and TLS? Another vague question is that if you are implementing the Spam Filtering Business at the Mailserver of the Receiver why not to implement that at the Mailserver of the Sender? How about implemeting them at Routers? too expensive? Routers may be busy doing other than their usual business :-) Thanks, Dont > Yeah, basically, we're a bunch of neutered > impotents. :) The problem > is not bandwidth so much as processing time. > Somebody has to take the > mail apart, analyize it, and classify it. The > further up the chain that > processing is done, the more processing must be > done, and the more > expensive it is in terms of CPU time (and yes, CPU > time can still be > expensive). Ideally, it should be done on the > spammer's ISP side, but > that seems to be rare (judging by the amount of > email that comes into > our network). > > My solution works like this: > 1) Postfix accepts the mail, checks to see if it's > sent to a valid user > 2) If it is, run it through spambayes via > content_filter, which > re-injects the mail into the system. That "run it > through spambayes" > script looks at the "to: " mail header and uses the > appropriate > user-specific database accordingly. > 3) Postfix hands it off to Cyrus, which delivers via > POP3 or IMAP. > > Could it get any higher-level? I don't think so. > > A lot of the spam we get is bounces from remote mail > servers. Spammers > spoof our domain, and we get the "invalid-user" > bounces. Sick. I've > been just discarding everything that's from > mailer-daemon and not to a > valid local user. > > Chris Jastram > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html __________________________________ Do you Yahoo!? Yahoo! Finance: Get your refund fast by filing online. http://taxes.yahoo.com/filing.html From cej at intech.com Mon Feb 9 03:05:52 2004 From: cej at intech.com (Christopher Jastram) Date: Sun Feb 8 23:49:47 2004 Subject: [Spambayes] : Stopping spam at SMTP Level In-Reply-To: <40273263.9090202@intech.com> References: <20040209033232.7984.qmail@web60802.mail.yahoo.com> <40273263.9090202@intech.com> Message-ID: <40273F60.7050807@intech.com> Yo, I'm a bit of a pessimist, but that's because I'm getting things ready for a class I'm teaching tomorrow, and it's 11:50 PM my time, and I'm waiting for a Solaris install to finish. So, pardon the grimness. I also just finished a marathon run with a failing mail server, but that's another story... dont bother wrote: > Hey, > > Here is a couple of another questions. They may sound > vague. > > People have been implementing filters such as bayesian > etc etc on the mailservers to stop spam which has come > all the way and used up all the bandwidth. > I am so surprised that no one has ever thought of > finding out a solution that does not allow the f***ing > spammers to send so many mass emails, porn etc. I > have come across a few like some payment solutions but > none of that has yet been implemented. > > Well, I've seen one solution that I really really like. It works like this: mail is handled by a third party. You sign up for an email address from that party, and they give you one for $20/year or so. Everyone who sends an email gets a bounce saying "Please follow this link and answer the question to send mail to this person." At the link you will find a simple question like: "Choose the red square" or "one plus one equals ?". Answering the question adds the sender to the database of "humans," and mail will be allowed from that address. Kinda neat, and it will be what I set up eventually. > Why not to modify the exisiting SMTP Applications > which incorporate prevention of the abuse of email? > Is int the http protocol has been modified to https > where security is needed and SSL 1.0, SSL 2.0 has all > been replaced by SSL 3.0 and TLS? > > > Mail doesn't really fit that model. With mail, you have a small packet of information that gets moved across several computers (say, 15 to 20). HTTP is an immediate connection, whereas SMTP is not. HTTP can be secured, but every mail server must be able to see the contents of every mail message in order to forward it. Thus, SSL doesn't really work. Although you could set up a filter that would only accept PGP or GPG signed messages. Unfortunately, the number of people using email encryption is appallingly small. > Another vague question is that if you are implementing > the Spam Filtering Business at the Mailserver of the > Receiver why not to implement that at the Mailserver > of the Sender? How about implemeting them at Routers? > too expensive? Routers may be busy doing other than > their usual business > > Good idea. Excellent idea. Except that there are too many damnable ISPs around the world with no interest in closing down open mail relays on their networks. ISPs should adopt a zero-tolerance policy for open mail relays. Unfortunately, that won't happen because their userbase would evaporate overnight. Why? Because most ISPs have a clueless userbase with no concept of IE and Outlook worms that set up mail servers. If ISPs start terminating accounts based on spam sending, they'd go out of business. So, it sucks. Any other ideas I can shoot down? Maybe I should just shoot myself and get it all over with... Believe me, spending two solid days coaxing a mail server through obscene spam-loading is a taxing experience, both mentally and physically. Also hard on the people around me, since I tend to resort to cursing, crying, pleading, and other melodramic displays of overwrought emotional distress. (I tend to take these things a little too seriously) The best idea I've seen is RBL. (Realtime Blackhole List) An RBL is a list of known spam-sending networks. Administrators subscribed to an RBL agree to completely drop all traffic originating from or going to said spam-sending networks. Nice system, and it works quite well because ISPs realize that it hurts their business to allow spam on their networks. Unfortunately, one must have a very flexible and understanding boss to pull this one off, and not many IT administrators have that luxury. Hope this helps, Christopher Jastram > > Thanks, > Dont > > > > >> Yeah, basically, we're a bunch of neutered >> impotents. The problem is not bandwidth so much as processing >> time. Somebody has to take the mail apart, analyize it, and classify >> it. The >> further up the chain that processing is done, the more processing >> must be >> done, and the more expensive it is in terms of CPU time (and yes, CPU >> time can still be expensive). Ideally, it should be done on the >> spammer's ISP side, but that seems to be rare (judging by the amount of >> email that comes into our network). >> >> My solution works like this: >> 1) Postfix accepts the mail, checks to see if it's >> sent to a valid user >> 2) If it is, run it through spambayes via >> content_filter, which re-injects the mail into the system. That "run it >> through spambayes" script looks at the "to: " mail header and uses the >> appropriate user-specific database accordingly. >> 3) Postfix hands it off to Cyrus, which delivers via >> POP3 or IMAP. >> >> Could it get any higher-level? I don't think so. >> >> A lot of the spam we get is bounces from remote mail >> servers. Spammers spoof our domain, and we get the "invalid-user" >> bounces. Sick. I've been just discarding everything that's from >> mailer-daemon and not to a valid local user. >> >> Chris Jastram >> >> _______________________________________________ >> Spambayes@python.org >> http://mail.python.org/mailman/listinfo/spambayes >> Check the FAQ before asking: >> > > http://spambayes.sf.net/faq.html > > > __________________________________ > Do you Yahoo!? > Yahoo! Finance: Get your refund fast by filing online. > http://taxes.yahoo.com/filing.html > > > From cej at intech.com Mon Feb 9 03:12:07 2004 From: cej at intech.com (Christopher Jastram) Date: Sun Feb 8 23:56:00 2004 Subject: [Spambayes] Why Improve Bayesian In-Reply-To: References: <20040209003420.21218.qmail@web60810.mail.yahoo.com> Message-ID: <402740D7.8000907@intech.com> Tim Stone wrote: > On Sun, 8 Feb 2004 16:34:20 -0800 (PST), dont bother > wrote: > >> Hey heres is quick question for you guys, >> I always wonder whats the need for inventing new >> techniques when most of the bayesian filters are >> giving accuracy >99.7%? >> Is that in time these bayesian techniques or method >> will become less effective? >> Thanks >> Dont > > > I hope the drivers that pass me on the road going the opposite > direction are a LOT better than 99.7% accurate... if you think about > it, 99.7% ain't all that good... > Har har. Fortunately, the driver pool isn't made of random data (!). Although, sometimes I'm not sure, driving around Providence, RI. From tameyer at ihug.co.nz Mon Feb 9 00:08:29 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 9 00:10:33 2004 Subject: [Spambayes] spambayes backup folders-which ones In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3C0B@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467790A@its-xchg4.massey.ac.nz> > I need to reformat by computer. Which files do I > save/backup so that I retain the effectiveness of > spambayes when I reinstall it? I could not find any > backup info on the site or help files. You're after FAQ 3.5: =Tony Meyer From tameyer at ihug.co.nz Mon Feb 9 00:09:57 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 9 00:10:38 2004 Subject: [Spambayes] installation error message In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3D18@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467790B@its-xchg4.massey.ac.nz> > Please Help. I get this message each time I > retry the installation. I have disabled the Norton > antivirus plugin for word and can't install the program. > What do I need to do? There is a new release due out this week that should fix this problem. The easiest solution would be to wait for that, and that should fix it. The release will be announced on the spambayes-announce mailing list, the website, and probably this list. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From marc at shadowriders.org Mon Feb 9 08:21:57 2004 From: marc at shadowriders.org (Marc 'Merlin' Mauss) Date: Mon Feb 9 08:22:37 2004 Subject: [Spambayes] Help displaying spam score field Message-ID: I'm using the Outlook plug-in with Outlook 2000, SP-3. I've followed the instructions for "Viewing and Using the Spam Score Field", and it works properly in my "inbox", but I can't get it to work correctly in my "spam" and "unsure" folders. I've looked through all the documentation, but can't find a fix for this. When I create the field "spam" in my "spam" and "unsure" folders (as per about.html#field), the fields SEEM to be created properly, but when I go back to look at them, the "type field has changed to "Number" and the "format" field has changed to "All digits: 1,234.567 -1,234.567". I've removed the field and re-created it with the correct values ("Percent" and "Rounded: 65% -65%"), but every time I do this, the values change. How can I display the "spam" field in my "spam" and "unsure" folders as weel as my "inbox"? Marc "Merlin" Mauss RRR01 Honorary CSA Consul to Cyberspace MSF Instructor Coram (Long Island), NY '83 Sabre V45 "Just for Fun" gone but not forgotten '85 Shadow VT1100C - "Barney" (it's PURPLE) Just a memory '86 Shadow VT1100C - (hers) '86 Shadow VT1100C - "Little Wing" '84 Voyager 1300 - "The Beast" Cordura IS a fashion statement. From bmccann at verizon.net Mon Feb 9 09:26:58 2004 From: bmccann at verizon.net (Barbara McCann) Date: Mon Feb 9 09:27:07 2004 Subject: [Spambayes] installing spambayes Message-ID: <000001c3ef18$c7fff5c0$b00afea9@D7FPT231> Okay, maybe I'm dim, but I downloaded the zipped spambayes 1.0a7.zip to my desktop, (from the website https://sourceforge.net/project/showfiles.php?group_id=61702) but could not figure out how to install it. The read-me.txt file said to start "setup.py install" but searching the various directories I could only see a file called "setup.py" which my computer did not recognize. A search for the term "install" did not turn it up, neither did "exe." I want to use the Outlook plug-in; I tried downloading and installing that (which did begin an installation wizard), but it installation failed, I'm assuming because I haven't been able to install spambayes. Any help would be appreciated; it sounds like a great program. I use Windows XP. Barbara McCann McCann Consulting 1439 Monroe St. NW Washington, DC 20010 (202) 641-1163 barbara@bmccann.net -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040209/7f15b77b/attachment.html From Amir_Katz at bmc.com Mon Feb 9 09:37:40 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Mon Feb 9 09:39:06 2004 Subject: [Spambayes] installing spambayes Message-ID: You need to download only the Outlook add-in (file SpamBayes-Outlook-Setup-0081.exe, size about 3 MB). Once downloaded, just run it and it will install the Outlook add-in. I suggest that you consult the FAQ before you install. Enjoy, Amir -----Original Message----- From: Barbara McCann [mailto:bmccann@verizon.net] Sent: Monday, February 09, 2004 16:27 To: spambayes@python.org Subject: [Spambayes] installing spambayes Okay, maybe I'm dim, but I downloaded the zipped spambayes 1.0a7.zip to my desktop, (from the website https://sourceforge.net/project/showfiles.php?group_id=61702 ) but could not figure out how to install it. The read-me.txt file said to start "setup.py install" but searching the various directories I could only see a file called "setup.py" which my computer did not recognize. A search for the term "install" did not turn it up, neither did "exe." I want to use the Outlook plug-in; I tried downloading and installing that (which did begin an installation wizard), but it installation failed, I'm assuming because I haven't been able to install spambayes. Any help would be appreciated; it sounds like a great program. I use Windows XP. Barbara McCann McCann Consulting 1439 Monroe St. NW Washington, DC 20010 (202) 641-1163 barbara@bmccann.net -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040209/e997eac1/attachment.html From skip at pobox.com Mon Feb 9 10:04:35 2004 From: skip at pobox.com (Skip Montanaro) Date: Mon Feb 9 10:04:45 2004 Subject: [Spambayes] Why Improve Bayesian In-Reply-To: References: <20040209003420.21218.qmail@web60810.mail.yahoo.com> Message-ID: <16423.41347.603472.427644@montanaro.dyndns.org> Tim> I hope the drivers that pass me on the road going the opposite Tim> direction are a LOT better than 99.7% accurate... if you think Tim> about it, 99.7% ain't all that good... Not exactly six-sigma... Skip From kennypitt at hotmail.com Mon Feb 9 10:49:07 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Feb 9 10:50:05 2004 Subject: [Spambayes] installing spambayes In-Reply-To: <000001c3ef18$c7fff5c0$b00afea9@D7FPT231> Message-ID: Barbara McCann wrote: > I want to use the Outlook plug-in; I tried downloading and installing > that (which did begin an installation wizard), but it installation > failed, I'm assuming because I haven't been able to install > spambayes. Did you get an error message box with something like "Unable to register DLL/OCX" and maybe an error code 0x00000000? If so then you have encountered a known problem with the 0.81 Outlook plugin installation. Many people have used the installer successfully, but a handful have experienced this registration error. As noted in an earlier response, you only need the Windows installer if you are trying to run the Outlook plugin. The problem is not related to trying to install the 1.0a7 zip file, which is the source code version of SpamBayes. There is a new release of SpamBayes due out this week that solves the registration problem, so you may want to wait for that. It will be announced on the spambayes-announce mailing list, and probably on this list as well. -- Kenny Pitt From rmalayter at bai.org Mon Feb 9 11:37:13 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Feb 9 11:37:27 2004 Subject: [Spambayes] Ideas for an MSc project please... Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01A75E3D@cliff.bai.org> [Christopher Jastram] > (We were running a 667 MHz Celeron w/ 128 mb ram.) How can you complain about the unreliability of your mail system, when you're running it on 1999-era, desktop-class hardware in 2004? Especially when you're a *technology* company? Hardware is CHEAP. People's time and lost sales are expensive. Yours was a failure in planning, not in hardware or software. Viruses and spam have been known quantities for many years now, part of the internet landscape. Certainly you figured they might impact your infrastructure in some way. Why did you fail to plan accordingly, and allocate budget for more robust MTAs? Setting up a reliable mail infrastructure is not that hard. Read the RFCs. Set up a reliable server on your site, with MX precedence =10. Have your ISP's set up their SMTP servers set up to relay to your domain, and put them in your DNS as MX=20. No mail bounces, it just gets queued at your ISP when you're down. Same hardware costs on your end. Besides, even if you have only one MX server, well-behaved sending MTA's should queue your mail at the sending site for a few days, retrying every few hours, before reporting failure to the originating user. Only spammers and viruses refuse to retry gracefully. -Ryan- From brent at bjohnson.net Mon Feb 9 12:18:22 2004 From: brent at bjohnson.net (Brent L Johnson) Date: Mon Feb 9 12:18:37 2004 Subject: [Spambayes] Database Format Message-ID: <021801c3ef30$bbb4c1c0$c801a8c0@torpedo> First off.. I've been using SpamBayes for a while. Since CloudMark SpamNet went to being a pay service I switched and used CloudMark's spam folder to teach SpamBayes. It's working GREAT! Now to my question.. I found in the FAQ where I can locate the classification database. Is there a way I can extract data from this DB? I'm working on a Bayesian email classifier in Java (not to compete with SpamBayes of course.. hey why mess with perfection.. hehe). It uses Classifier4J which learns by passing in strings of text as spam or not spam. Im using it on the server-side to pre-scan messages before they hit Outlook. Is there a way I can convert my SpamBayes database to extract out the words considered spam? I still have most of my spam sitting in my spam folder so I could theoretically find a way to export this from outlook.. but this is painfully slow (Ive currently got almost 12,000 spam emails saved) Thanks, - Brent From kennypitt at hotmail.com Mon Feb 9 13:24:08 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Feb 9 13:25:04 2004 Subject: [Spambayes] Long load times. In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A94@its-xchg4.massey.ac.nz> Message-ID: [rural_cdn@fastmail.fm] >> I'm trying to cut down on the long load times I have. Will >> deleting my spam messages help? Will that decrease the size >> of the above file? Let me throw in that deleting the original spam messages from your spam folder will make no difference. SpamBayes collects and stores the statistics from messages when you train on them, but otherwise does not use the original messages at all. [Tony] > Having a smaller database file could help, yes. (The way to check > this (and see how much you gain) would be to rename the file, see how > fast it loads then (it'll create a new, empty, database), and then > delete the new database and put the old one back). Definitely test this, but in theory it shouldn't make much difference. The database file is opened at load time, but the contents are not read in. -- Kenny Pitt From tshumway at jdiworks.net Mon Feb 9 14:13:44 2004 From: tshumway at jdiworks.net (Terrel Shumway) Date: Mon Feb 9 14:13:57 2004 Subject: [Spambayes] : Stopping spam at SMTP Level In-Reply-To: <40273F60.7050807@intech.com> References: <20040209033232.7984.qmail@web60802.mail.yahoo.com> <40273263.9090202@intech.com> <40273F60.7050807@intech.com> Message-ID: <4027DBE8.9050809@jdiworks.net> Christopher Jastram wrote: > Well, I've seen one solution that I really really like. It works like > this: mail is handled by a third party. You sign up for an email > address from that party, and they give you one for $20/year or so. Why is this necessary? 5000 users*$20/year = $100,000/year. Nice pocket change. Would you like to make a donation to my favorite charity? 8-) > Everyone who sends an email gets a bounce saying "Please follow this > link and answer the question to send mail to this person." At the > link you will find a simple question like: "Choose the red square" or > "one plus one equals ?". Answering the question adds the sender to > the database of "humans," and mail will be allowed from that address. > Kinda neat, and it will be what I set up eventually. This addresses a very small part of the problem with a very expensive (usability-wise) solution. > The best idea I've seen is RBL. (Realtime Blackhole List) An RBL is > a list of known spam-sending networks. Administrators subscribed to > an RBL agree to completely drop all traffic originating from or going > to said spam-sending networks. Nice system, and it works quite well > because ISPs realize that it hurts their business to allow spam on > their networks. Unfortunately, one must have a very flexible and > understanding boss to pull this one off, and not many IT > administrators have that luxury. RBL, of course, also has its drawbacks, which have been thoroughly discussed elsewhere. The two-camp approach is a good evolution of RBLs, but won't help us today. >>> My solution works like this: >>> 1) Postfix accepts the mail, checks to see if it's >>> sent to a valid user >>> 2) If it is, run it through spambayes via >>> content_filter, which re-injects the mail into the system. That >>> "run it >>> through spambayes" script looks at the "to: " mail header and uses the >>> appropriate user-specific database accordingly. >>> 3) Postfix hands it off to Cyrus, which delivers via >>> POP3 or IMAP. >>> Using spambayes (step 2) on the wire (i.e. instead of step 1) may not save bandwidth, but can save disk space and give priority to non-spam. 1) a message looks like spam: 553 it and you're done. Include a URL in the response text so a human can get whitelisted and resend a false positive. 2) If a message is "unsure", 553 it but store it for 7 days so the human user can redeem it from quarantine without resending it. 3) tar-pit the spam-sending IP/network so it will take them three hours to send a single message. Now you have a good 80% solution that will save your CPU and push ham to the front of the queue. >>> A lot of the spam we get is bounces from remote mail >>> servers. Spammers spoof our domain, and we get the "invalid-user" >>> bounces. Sick. I've been just discarding everything that's from >>> mailer-daemon and not to a valid local user. >> not a bad idea. From rcoe at CambridgeMA.GOV Mon Feb 9 14:31:53 2004 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Mon Feb 9 14:32:03 2004 Subject: [Spambayes] Help displaying spam score field Message-ID: <42393C9DA7930245AB540667607F4F5022CA76@SPIKE.city> I don't know whether this is your problem, but I've found that it's difficult or impossible to display the spam score in a folder that has not been scanned. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: spambayes-bounces+rcoe=cambridgema.gov@python.org > [mailto:spambayes-bounces+rcoe=cambridgema.gov@python.org]On Behalf Of > Marc 'Merlin' Mauss > Sent: Monday, February 09, 2004 8:22 AM > To: spambayes@python.org > Subject: [Spambayes] Help displaying spam score field > > > I'm using the Outlook plug-in with Outlook 2000, SP-3. I've followed the > instructions for "Viewing and Using the Spam Score Field", and it works > properly in my "inbox", but I can't get it to work correctly in my "spam" > and "unsure" folders. I've looked through all the documentation, but can't > find a fix for this. > > When I create the field "spam" in my "spam" and "unsure" folders (as per > about.html#field), the fields SEEM to be created properly, but when I go > back to look at them, the "type field has changed to "Number" and the > "format" field has changed to "All digits: 1,234.567 -1,234.567". > > I've removed the field and re-created it with the correct values ("Percent" > and "Rounded: 65% -65%"), but every time I do this, the values change. > > How can I display the "spam" field in my "spam" and "unsure" folders as weel > as my "inbox"? From erik.janssen at inno.com Mon Feb 9 14:30:03 2004 From: erik.janssen at inno.com (Erik Janssen) Date: Mon Feb 9 14:32:32 2004 Subject: [Spambayes] could not install SpamBayes Outlook Addin 0.0081 Message-ID: <000001c3ef43$21109cd0$6401a8c0@rkjn> Hi, I had spambayes outlook plugin running for several months and it worked just fine Just today, i installed an IMAPS connection to an exchnage server and now outlook has trouble with SpamBayes I unstalled it, installed a newer version(0.0081), everything seems to go ok (log file says: Registered: SpamBayes.OutlookAddin Registration complete.) but the icons do not appear !! currently i'm using outlook 2002 (10.2627.2625) can you help me out ? thanks a lot regards Erik Janssen -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040209/6fd2b15d/attachment.html From rcoe at CambridgeMA.GOV Mon Feb 9 14:50:34 2004 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Mon Feb 9 14:50:38 2004 Subject: [Spambayes] RE: Ideas for an MSc project please... Message-ID: <42393C9DA7930245AB540667607F4F5022C271@SPIKE.city> Actually, Chris wasn't complaining about the unreliability of his mail system; he was complaining about the impact on his throughput of server-side filtering. And I think he has a point. Email clients usually have more than enough capacity to take on the filtering load for a single user; email servers have to be pretty beefy to handle the load for an entire user community. Yeah, he could (and apparently will) throw more hardware at the problem. But given the well-known difficulty of collecting training data for a server-side Bayesian filter, why not just admit that this particular solution works better on the client? Bob > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Ryan Malayter > Sent: Monday, February 09, 2004 11:37 AM > To: spambayes@python.org > Subject: RE: [Spambayes] Ideas for an MSc project please... > > > [Christopher Jastram] > > (We were running a 667 MHz Celeron w/ 128 mb ram.) > > How can you complain about the unreliability of your mail system, when > you're running it on 1999-era, desktop-class hardware in 2004? > Especially when you're a *technology* company? Hardware is CHEAP. > People's time and lost sales are expensive. > > Yours was a failure in planning, not in hardware or software. Viruses > and spam have been known quantities for many years now, part of the > internet landscape. Certainly you figured they might impact your > infrastructure in some way. Why did you fail to plan accordingly, and > allocate budget for more robust MTAs? > > Setting up a reliable mail infrastructure is not that hard. Read the > RFCs. Set up a reliable server on your site, with MX precedence =10. > Have your ISP's set up their SMTP servers set up to relay to your > domain, and put them in your DNS as MX=20. No mail bounces, it just gets > queued at your ISP when you're down. Same hardware costs on your end. > > Besides, even if you have only one MX server, well-behaved sending MTA's > should queue your mail at the sending site for a few days, retrying > every few hours, before reporting failure to the originating user. Only > spammers and viruses refuse to retry gracefully. > > -Ryan- From rmalayter at bai.org Mon Feb 9 15:11:51 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Feb 9 15:11:56 2004 Subject: [Spambayes] RE: Ideas for an MSc project please... Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01A75E44@cliff.bai.org> [Bob Coe] > Actually, Chris wasn't complaining about the unreliability of > his mail system; he was complaining about the impact on his > throughput of server-side filtering. And I think he has a > point. So Chris knows that server-side Bayesian filtering is resource intensive, but still chooses to implement it on an old desktop from 1999. Rather than bemoan the resource intensive nature of filtering he chose to use, why not try some reasonably modern hardware for a critical business function? Peak-demand planning is part of a systems administrators job. I don't try to run my database servers on five-year-old old desktops, because I *know* SQL servers require more horsepower when more than a few connections are in place. My point is this: spam is not going away soon, nor are viruses. Systems administrators have known this for several years. Planning and maintaining the infrastructure to deal with that is our job. Buying new hardware doesn't solve the majority of IT problems, but it could have solved this one for Chris had he planned well. From tameyer at ihug.co.nz Mon Feb 9 15:52:32 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 9 15:52:51 2004 Subject: [Spambayes] Long load times. In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3EF4@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677911@its-xchg4.massey.ac.nz> [Tony] > Having a smaller database file could help, yes. (The way to check > this (and see how much you gain) would be to rename the file, see how > fast it loads then (it'll create a new, empty, database), and then > delete the new database and put the old one back). [Kenny] > Definitely test this, but in theory it shouldn't make much > difference. The database file is opened at load time, but the > contents are not read in. Thanks Kenny; for some reason I was thinking pickle... =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 9 15:56:00 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 9 16:02:52 2004 Subject: [Spambayes] Database Format In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3ED3@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677912@its-xchg4.massey.ac.nz> > Now to my question.. I found in the FAQ where I can > locate the classification database. Is there a way > I can extract data from this DB? [...] > Is there a way I can convert my SpamBayes database > to extract out the words considered spam? In the source distribution (you'll need Python installed as well) there is a script called sb_dbexpimp.py. It'll convert the database to a flat-text '`'-separated file, which you can use. Note that this doesn't include any probabilities, or scores, just counts - i.e. how many times each token has been seen in ham/spam. So if you want probabilities, you'll have to do some calculation yourself. Alternatively, there's another script in the source distribution called spamcounts.py, which can output certain sections (including the whole thing, IIRC) of the database, including scores as they currently stand. You could capture the output of this to a file, and it might do more of what you're looking for. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From fuerte at sci.fi Mon Feb 9 16:25:35 2004 From: fuerte at sci.fi (Harri Pesonen) Date: Mon Feb 9 16:25:39 2004 Subject: [Spambayes] Suggestion to POP3proxy web interface In-Reply-To: References: Message-ID: <4027FACF.3080003@sci.fi> In Review Messages pane, the Train button should be on top, and all messages (Unsure, Ham, Spam) should have Discard by default. This way you need only to set Unsure messages to Spam/Ham and click Train. No more scrolling to the bottom. Harri From viatorg at musc.edu Mon Feb 9 16:30:21 2004 From: viatorg at musc.edu (Gerry Viator (work)) Date: Mon Feb 9 16:30:38 2004 Subject: [Spambayes] No toolbar Message-ID: <200402092130.i19LUNha005662@flopsy.musc.edu> Hi all, Using Outlook 2003 on Windows XP pro. I loaded spambayes but, toolbar doesn't show up? I have Closed and re-opened Outlook. I also have re-booted my machine. The install seamed to load fine but,nothing in Outlook? Thanks Gerry From tameyer at ihug.co.nz Mon Feb 9 16:55:17 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 9 17:12:05 2004 Subject: [Spambayes] Suggestion to POP3proxy web interface In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3F55@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A9C@its-xchg4.massey.ac.nz> > In Review Messages pane, the Train button should be on top, Just on top wouldn't be good for those that review all the messages, but I suppose a copy of the button at the top, as well as the one at the bottom, would be ok. Could you open a feature request on sourceforge for this? Otherwise it'll probably be lost in list traffic. > and all > messages (Unsure, Ham, Spam) should have Discard by default. This way > you need only to set Unsure messages to Spam/Ham and click Train. No > more scrolling to the bottom. With the about-to-be-released 1.0a9, or current CVS, you can set Unsure/Ham/Spam to default to any of Discard/Ham/Spam/Defer, so this is already done. You can find the appropriate options in the Advanced page of the web interface configuration. There's also a couple of options designed to help with 'non-edge' training, if that's what you're after. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 9 16:56:37 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 9 17:17:03 2004 Subject: [Spambayes] No toolbar In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3F5E@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677916@its-xchg4.massey.ac.nz> > I loaded spambayes but, toolbar doesn't show up? I have > Closed and re-opened Outlook. I also have re-booted my machine. > > The install seamed to load fine but,nothing in Outlook? Could you send us a copy of your log files? (The troubleshooting guide explains where to find these; a copy of the guide was installed, plus one is on the website on the Documentation page). The logs will tell us whether the install was done or not, and should indicate what is going wrong. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 9 17:25:38 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 9 17:26:01 2004 Subject: [Spambayes] could not install SpamBayes Outlook Addin 0.0081 In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3F1D@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467791A@its-xchg4.massey.ac.nz> > I had spambayes outlook plugin running for several > months and it worked just fine > Just today, i installed an IMAPS connection to an > exchnage server and now outlook has trouble with SpamBayes > I unstalled it, installed a newer version(0.0081), > everything seems to go ok (log file says: > Registered: SpamBayes.OutlookAddin > Registration complete.) but the icons do not appear !! > can you help me out ? Do the log files have anything else? They should, if the plug-in is being started up. If they do, then please send us a copy of them so we can try to use them to figure out what is going wrong. If they don't have anything else, then the most likely problem is that Outlook has disabled SpamBayes (I vaguely recall reading something about non-certified plug-ins being automatically disabled on updates). Do Help->About Microsoft Outlook->Disabled Items, and see if SpamBayes is there (and enable it if it is). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 9 17:33:52 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 9 17:34:21 2004 Subject: [Spambayes] RE: Bayes Outlook Plugin & POP Proxy DBs In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3D2D@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2A9E@its-xchg4.massey.ac.nz> > I hadn't realized that the Outlook plugin offered a choice of > database formats. I spoke too soon. Checking, I see that the plug-in will always use bsddb unless it can't find a bsddb module to use, in which case it will use a pickle. If you're running from source, then it's easy to change this (around line 97 of manager.py put "use_db = False"), but if you're running the binary, you can't (and you're guaranteed to have the appropriate bsddb module installed). > Is this option described anywhere? Is it > crazy to even think about deviating from the default? Well, almost. You'd find that you'd use a lot more memory with a pickle, and the time to load Outlook would be much longer, as would the time taken to train (since training involves saving the db, which is slower). Classifying could be faster, though, and you'd avoid all possibility of the RUNRECOVERY errors (which are pretty rare with the plug-in, however). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From scales at zko.dec.com Mon Feb 9 18:44:20 2004 From: scales at zko.dec.com (Webb Scales) Date: Mon Feb 9 18:44:26 2004 Subject: [Spambayes] Hand tuning the database? Message-ID: <40281B54.710D1BE3@zko.dec.com> A friend of mine, looking at the "evidence" header in one of my mail messages, asked a "simple" question: what about the idea of hand-tuning the database? I shouldn't be criticizing, as SpamBayes has been doing a very good job so far (on just the initial training, it's had no misclassified ham, only two spam rated as ham, and only a dozen messages rated as unsure all of which were spam), but I thought I'd ask anyway. ;-) The evidence header has entries like "'received:ztxmail01.ztx.compaq.com': 0.62". (I told it to mine the headers.) Now, I believe that ztxmail01.ztx.compaq.com handles all my mail. (OK, it doesn't hand *all* of my mail -- it's got a couple of brothers and a dozen cousins who share in the load, but you get the point.) So, the presence of this token in my mail message is not indicative of anything (other than the fact that the thing being looked at is a "mail message"! ;-). What do you guys think of the idea of being able to mark certain terms in the database as being "not interesting"? (Of course, we would need a tool or tool-set to be able to do this, but....) The reason I ask is that, if the classifier is only going to consider 100 terms, I'd like it to be considering good ones, as opposed to things that are in every mail message that I get, spam and ham alike. Thanks, Webb -- ------------------------------------------------------------------------ Webb Scales Hewlett-Packard Company scales@zko.dec.com 110 Spit Brook Rd, ZKO2-3/N30 Voice: 603.884.2196, FAX: 603.884.0120 Nashua, NH 03062-2711 Someone who thinks logically provides a nice contrast to the real world. ------------------------------------------------------------------------ -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040209/308b965f/attachment.html From cej at intech.com Mon Feb 9 22:01:39 2004 From: cej at intech.com (Christopher Jastram) Date: Mon Feb 9 18:45:30 2004 Subject: [Spambayes] Ideas for an MSc project please... In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A01A75E3D@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2A01A75E3D@cliff.bai.org> Message-ID: <40284993.9060003@intech.com> Ryan Malayter wrote: >[Christopher Jastram] > > >>(We were running a 667 MHz Celeron w/ 128 mb ram.) >> >> > >How can you complain about the unreliability of your mail system, when >you're running it on 1999-era, desktop-class hardware in 2004? >Especially when you're a *technology* company? Hardware is CHEAP. >People's time and lost sales are expensive. > > Thank you. I did hear that a few times already. >Yours was a failure in planning, not in hardware or software. Viruses >and spam have been known quantities for many years now, part of the >internet landscape. Certainly you figured they might impact your >infrastructure in some way. Why did you fail to plan accordingly, and >allocate budget for more robust MTAs? > > True, but we have not been hit with any mail virii in the past. The reason I am here is to handle unplanned situations like this one. I firmly believe that no amount of planning would have stopped this particular 48-hour marathon. Certainly no preplanned system can handle every contingency. The volume of mail was enormous, and the machines I had at my disposal performed very well. >Setting up a reliable mail infrastructure is not that hard. Read the >RFCs. Set up a reliable server on your site, with MX precedence =10. >Have your ISP's set up their SMTP servers set up to relay to your >domain, and put them in your DNS as MX=20. No mail bounces, it just gets >queued at your ISP when you're down. Same hardware costs on your end. > > Our ISP is braindead. Not a hell of a lot I can do about it. >Besides, even if you have only one MX server, well-behaved sending MTA's >should queue your mail at the sending site for a few days, retrying >every few hours, before reporting failure to the originating user. Only >spammers and viruses refuse to retry gracefully. > > -Ryan- > > This became part of the problem. The mailserver was so busy handling the incoming mail that the spam backlogged on everybody else's servers. Also, there is something strange about the whole deal, because 90 percent of the incoming mail is all mailer-daemon bounces. It's as if some spammer decided to spoof our domain (and no, we're not running an open relay. Already checked and double-checked). While he was at it, he spoofed from his list of @intech.com addresses. How can I tell you this? Because the addresses that receive spam are pretty much the same, month after month. All of a sudden, they went from receiving 50 messages/day to 1000+, all of them mailer-daemon bounces from mailservers all around the world. What is also strange is that the whole thing started last Sunday, well after the MyDoom virus made the rounds. The mailserver had been happily sucking down MyDoom into the bit bucket for days, and all of a sudden *boom*, the shit hits the fan, and it's ALL SPAM! It's not MyDoom (MyDoom made up somewhere around 0.0001 percent of incoming mail). The spambayes filter system I set up is somewhat to blame, since it greatly limited the mail thoroughput. There was also no cut-out mechanism to drop the scanning/classifying system when the load started climbing. Believe me, I will be much more cautious about implementing these things in the future. Christopher Jastram From tameyer at ihug.co.nz Mon Feb 9 18:54:08 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 9 18:55:45 2004 Subject: [Spambayes] Hand tuning the database? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3FB2@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AA0@its-xchg4.massey.ac.nz> > What do you guys think of the idea of being able > to mark certain terms in the database as being > "not interesting"? (Of course, we would need a tool > or tool-set to be able to do this, but....) > The reason I ask is that, if the classifier is only > going to consider 100 terms, I'd like it to be considering > good ones, as opposed to things that are in every mail > message that I get, spam and ham alike. Two things: 1. SpamBayes doesn't use any tokens that have a current spamprob between 0.4 and 0.6 (you can change these values if you like). So 0.62 is just outside that range, and so it does appear to have a little bit of value (indicating that mail is just a wee bit more likely to be spam). IOW, it's basically doing what you've asked, but automatically, rather than via some manual tool. 2. The 150 'strongest' (furtherest from 0.5) tokens are used, by default. Early testing showed that this was a good number, but if you like, you can change this, too - if you set it high enough, then every token will be used, no matter what score it has. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From scales at zko.dec.com Mon Feb 9 19:05:32 2004 From: scales at zko.dec.com (Webb Scales) Date: Mon Feb 9 19:05:37 2004 Subject: [Spambayes] Hand tuning the database? References: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AA0@its-xchg4.massey.ac.nz> Message-ID: <4028204C.A7AEA1EA@zko.dec.com> Tony Meyer wrote: > 1. SpamBayes doesn't use any tokens that have a current spamprob between > 0.4 and 0.6 (you can change these values if you like). So 0.62 is just > outside that range, and so it does appear to have a little bit of value > (indicating that mail is just a wee bit more likely to be spam). OK, that makes sense. So, (other than ignoring the problem ;-) I could either move the "goalposts", or find some ham that came through that mail gateway and do some more training. > it's > basically doing what you've asked, but automatically, rather than via some > manual tool. That's cool. I wasn't clear on how the classifier selected its evidence (nor how the individual terms are weighted). > 2. The 150 'strongest' (furtherest from 0.5) tokens are used, by default. > Early testing showed that this was a good number, but if you like, you can > change this, too - if you set it high enough, then every token will be used, > no matter what score it has. Well, I only counted about 80 in the mail header, but, uh, I wasn't exactly counting carefully. Perhaps my training corpus was too small to complete cover this piece of spam? Anyway, I was wondering why there's no hand-tuning option, and I think you answered the question. So, I'm just going to ignore the evidence (which is how any good logician proceeds ;-). Thanks, Webb -- ------------------------------------------------------------------------ Webb Scales Hewlett-Packard Company scales@zko.dec.com 110 Spit Brook Rd, ZKO2-3/N30 Voice: 603.884.2196, FAX: 603.884.0120 Nashua, NH 03062-2711 Someone who thinks logically provides a nice contrast to the real world. ------------------------------------------------------------------------ -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040209/cd65355a/attachment-0001.html From cej at intech.com Mon Feb 9 22:23:46 2004 From: cej at intech.com (Christopher Jastram) Date: Mon Feb 9 19:07:36 2004 Subject: [Spambayes] RE: Ideas for an MSc project please... In-Reply-To: <40283FB3.1090800@intech.com> References: <792DE28E91F6EA42B4663AE761C41C2A01A75E44@cliff.bai.org> <40283FB3.1090800@intech.com> Message-ID: <40284EC2.6030901@intech.com> Ryan Malayter wrote: > [Bob Coe] > > >> Actually, Chris wasn't complaining about the unreliability of his >> mail system; he was complaining about the impact on his throughput of >> server-side filtering. And I think he has a point. > > > So Chris knows that server-side Bayesian filtering is resource > intensive, but still chooses to implement it on an old desktop from > 1999. Rather than bemoan the resource intensive nature of filtering he > chose to use, why not try some reasonably modern hardware for a critical What an attitude! Frankly, we didn't really have the budget for purchasing new hardware, especially when the existing hardware was well-suited for the job. I've spent a considerable portion of time slimming the whole system down, and it ran very well with a large margin for periods of heavy stress. > business function? Peak-demand planning is part of a systems > administrators job. True. See above note about budget. Dropping Exchange was like doubling megahertz. Big improvement. I don't consider it a matter of vital importance to be running the fastest turf-pounding, testosterone-pumped machines available when they aren't needed. > I don't try to run my database servers on > five-year-old old desktops, because I *know* SQL servers require more > horsepower when more than a few connections are in place. Actually, SQL servers are *designed* to run on desktop machines. And unless you're running MS SQL server or Oracle w/ a large database, SQL servers run *fine* on a desktop. Done it, doing it, will do it in the future. Look at what runs Slashdot -- their web server is a PIII 600 MHz box, and the database is a quad xeon 550. We hardly run a fraction of their traffic. I think it is incompetent of an administrator to demand truckloads of CPU when something smaller will be more cost-effective, stabler, and do the job with plenty of room to spare. > My point is this: spam is not going away soon, nor are viruses. Systems > administrators have known this for several years. Planning and > maintaining the infrastructure to deal with that is our job. Buying new > hardware doesn't solve the majority of IT problems, but it could have > solved this one for Chris had he planned well. The mail server was handling the load fine with about 10-15 thousand emails/day. It ran into trouble with 350 thousand. Proper resource planning might have eased the problem, but would not have solved it. Could I really justify spending $2,500 for a couple powerhouse mail servers that would sit mostly idle when the existing mail server was perfectly capable of handling 5 times the load (especially after we dropped Exchange)? The spam load had been steadily increasing by about 200 messages/day since spring 2002 with surprising steadiness, and we probably had 18 months to go before things might get tight wrt resources. So it goes. Spam filtering was the first thing to go when the load got heavy. And it will continue to be the first thing to go, since it takes a lion's share of resources. However, I think it could be done a little differently, along these lines: 1) Primary mail server accepts email, does basic validity check (sender checks, etc) 2) Postfix passes the mail through a content_filter script 3) The content_filter script passes the mail through a *different* machine, running spambayes on RPC (sorta, maybe. Still figuring this out). If the filter server gets heavy, it'll start passing a percentage of mail through without filtering. If the mail server machine doesn't get a reply back quickly, it moves the mail without waiting for classification. 4) The filter machine passes the classified email back to the content_filter script 5) Postfix hands classified email to Cyrus for delivery. Please don't cast stones quite so quickly. What we had was plenty good enough for the job. I call that virus (we were not hit badly with previous email virii) a "disaster," which is why we have "disaster preparedness," i.e., SSH and the ability to strip things down to lean mean fighting machine. And we are now buying a couple 64 bit AMD 3 GHz machines w/ RAID1 hard disks for just this sort of problem. . Cool, huh? (Don't think I didn't ask for these, because I did. Quite a while ago.) Christopher Jastram From rmalayter at bai.org Mon Feb 9 19:30:15 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Feb 9 19:30:20 2004 Subject: [Spambayes] RE: Ideas for an MSc project please... Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01A75E5B@cliff.bai.org> [Christopher Jastram] > Money money money. We haven't bought anything > for 2 1/2 years. No routers, no switches, no > desktop machines, no monitors. (We have replaced > hard drives and fans) New mail servers were on top > of the list. I too, would have chosen to have my salary paid rather than buy new equipment, if it came down to that. But if my employer was in that sort of financial shape - where spending a few grand for a critical, revenue-generating application was too much - I would have started looking real hard at monster.com. The tech job market being what it was (and still is), though... I read you loud and clear. > Spam filtering was the first thing to go when > the load got heavy. And it will continue to > be the first thing to go, since it takes a lion's > share of resources. However, I think it could > be done a little differently, along these lines: How about bandwidth rate limiting? This might be a low-cost solution, since it is available (at least in rudimentary form) in most firewalls and operating systems. By watching CPU and disk statistics, you can figure out that the mail server can reliably handle N kb/s worth of mail, based on an average message size. So set that up as a limit. Spammers and virii don't (in general) properly queue and retry at the sender, so those senders will contend for the (now limited) bandwidth, see you're site is busy or down, and give up. Legitimate messages will wait in their sending queues and arrive once the mail storm has passed, probably not too much later than normal. An sledgehammer-style solution perhaps, but it might have saved a lot of trouble in this case. > Please don't cast stones quite so quickly. Sorry about that. I guess this is just a pet peeve of mine. I've seen any number of sites that wouldn't spend a few hundred bucks that I or a colleague recommend. Then they are bitten badly and end up spending much more money later for disaster clean-up, performance tuning, *and* the improved infrastructure pieces that were initially recommended. One client could have spent $1900 on an adequate RAID setup and backup solution, and instead spent many, many times more on consulting hours for disaster recovery. They had the money to spend, or could have found it by cutting a bit out of their $20K Christmas party budget or something. They knew the application was critical to their business. They just didn't *want* to spend the money at the necessary time, even though it was absolutely needed. They were "getting by" with what they had, so why worry, right? I think "penny-wise and pound-foolish" is the phrase... Regards, Ryan From cej at intech.com Mon Feb 9 23:02:02 2004 From: cej at intech.com (Christopher Jastram) Date: Mon Feb 9 19:45:54 2004 Subject: [Spambayes] : Stopping spam at SMTP Level In-Reply-To: <4027DBE8.9050809@jdiworks.net> References: <20040209033232.7984.qmail@web60802.mail.yahoo.com> <40273263.9090202@intech.com> <40273F60.7050807@intech.com> <4027DBE8.9050809@jdiworks.net> Message-ID: <402857BA.8080004@intech.com> Terrel Shumway wrote: > Christopher Jastram wrote: > >> Well, I've seen one solution that I really really like. It works >> like this: mail is handled by a third party. You sign up for an >> email address from that party, and they give you one for $20/year or >> so. > > > Why is this necessary? 5000 users*$20/year = $100,000/year. Nice > pocket change. Would you like to make a donation to my favorite > charity? 8-) Nah... I've just seen the commercial offerings, and they range from $5/year to $20/year. I've considered signing up for my own use, but then I say -- "why not start my own??" As if I didn't have enough to do already! :) >> Everyone who sends an email gets a bounce saying "Please follow this >> link and answer the question to send mail to this person." At the >> link you will find a simple question like: "Choose the red square" or >> "one plus one equals ?". Answering the question adds the sender to >> the database of "humans," and mail will be allowed from that >> address. Kinda neat, and it will be what I set up eventually. > > > > This addresses a very small part of the problem with a very expensive > (usability-wise) solution. True. It's also easy to foil. The only reason it works is because the mail-proxy sites remain fairly small. >> The best idea I've seen is RBL. (Realtime Blackhole List) An RBL is >> a list of known spam-sending networks. Administrators subscribed to >> an RBL agree to completely drop all traffic originating from or going >> to said spam-sending networks. Nice system, and it works quite well >> because ISPs realize that it hurts their business to allow spam on >> their networks. Unfortunately, one must have a very flexible and >> understanding boss to pull this one off, and not many IT >> administrators have that luxury. > > > RBL, of course, also has its drawbacks, which have been thoroughly > discussed elsewhere. The two-camp approach is a good evolution of > RBLs, but won't help us today. > >>>> My solution works like this: >>>> 1) Postfix accepts the mail, checks to see if it's >>>> sent to a valid user >>>> 2) If it is, run it through spambayes via >>>> content_filter, which re-injects the mail into the system. That >>>> "run it >>>> through spambayes" script looks at the "to: " mail header and uses the >>>> appropriate user-specific database accordingly. >>>> 3) Postfix hands it off to Cyrus, which delivers via >>>> POP3 or IMAP. >>>> > Using spambayes (step 2) on the wire (i.e. instead of step 1) may not > save bandwidth, but can save disk space and give priority to non-spam. > 1) a message looks like spam: 553 it and you're done. Include a URL > in the response text so a human can get whitelisted and resend a false > positive. 2) If a message is "unsure", 553 it but store it for 7 > days so the human user can redeem it from quarantine without resending > it. > 3) tar-pit the spam-sending IP/network so it will take them three > hours to send a single message. > Now you have a good 80% solution that will save your CPU and push ham > to the front of the queue. Good ideas. Thank you very much -- I've been racking my brain for ideas, and input is much appreciated. >>>> A lot of the spam we get is bounces from remote mail >>>> servers. Spammers spoof our domain, and we get the "invalid-user" >>>> bounces. Sick. I've been just discarding everything that's from >>>> mailer-daemon and not to a valid local user. >>> > not a bad idea. Thanks! It works quite well. Delivery works out to about 400 messages / day with this system, and I'm dropping 30,000-35,000 messages without processing. Nobody has complained, and everybody gets their daily quota of spam. I'm thinking of re-integrating the spambayes filtering, but that will have to wait (busy teaching Intro to VBA this week). Is there some web site that has tips for battling spam? Tried-and-true practices gleaned from bitter mouths of hard-pressed sysadmins? Chris From tameyer at ihug.co.nz Mon Feb 9 20:00:27 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 9 20:00:46 2004 Subject: [Spambayes] Hand tuning the database? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3FC2@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AA1@its-xchg4.massey.ac.nz> > OK, that makes sense. So, (other than ignoring the problem ;-) > I could either move the "goalposts", or find some ham that came > through that mail gateway and do some more training. Yes. The latter is more likely to be successful and would certainly be easier. Note, too, (if you haven't heard this already) that SpamBayes works better with a roughly equal amount of ham & spam trained, so that's good anyway. > I wasn't clear on how the classifier selected its evidence > (nor how the individual terms are weighted). The weighting is fairly complicated - if you want to know the gory details, check out the classifier.py file in the source distribution. > Well, I only counted about 80 in the mail header, but, > uh, I wasn't exactly counting carefully. Perhaps my training > corpus was too small to complete cover this piece of spam? It probably means that there weren't more than 80 tokens (ignoring ones in the 0.4-0.6 range) in that message. Short messages can quite easily have fewer than 150 tokens, as can longer ones that contain a lot of words that you haven't trained on (since they'll score 0.5). This can cause problems - for example messages that just contain a URL don't have a lot of tokens. Usually the tokens from the headers (and maybe the URL itself) are enough to make a difference, but not always (one thing that can be done - SpamBayes has this as an experimental option, partly because it's a bit controversial - is to get more tokens from whatever's at the end of the URL). If you do find that you get a reasonable number of misclassified messages, and that there relatively few tokens, then the solution might be generating more - either by turning on some options that are off by default, or by some new tokenizing tricks. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From doc-rat at comcast.net Mon Feb 9 20:59:46 2004 From: doc-rat at comcast.net (Ron H) Date: Mon Feb 9 20:59:38 2004 Subject: [Spambayes] Help Help Message-ID: <001601c3ef79$8f1d4fd0$b90aa8c0@RadioShack> Please help me get rid of spambayes Please Ron -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040209/fc9493d2/attachment.html From tameyer at ihug.co.nz Mon Feb 9 21:04:19 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 9 21:04:41 2004 Subject: [Spambayes] Help Help In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3FFC@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677926@its-xchg4.massey.ac.nz> > Please help me get rid of spambayes You're after FAQ 3.14: =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From dliu at mindspring.com Tue Feb 10 04:53:13 2004 From: dliu at mindspring.com (David Liu) Date: Tue Feb 10 04:53:15 2004 Subject: [Spambayes] regsvr32.exe spambayes_addin.dll not regsvr32.exe outlook_addin.dll Message-ID: <002201c3efbb$b46beac0$0100a8c0@2800plus> > Troubleshooting the SpamBayes Outlook plugin: > Check that Outlook shows the addin as enabled > If the SpamBayes addin is not listed, then SpamBayes should be reinstalled (Note that running regsvr32.exe outlook_addin.dll from the SpamBayes directory may also solve this problem) regsvr32.exe spambayes_addin.dll is correct. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040210/a73e1b66/attachment.html From kennypitt at hotmail.com Tue Feb 10 09:35:47 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Feb 10 09:36:51 2004 Subject: [Spambayes] regsvr32.exe spambayes_addin.dll not regsvr32.exeoutlook_addin.dll In-Reply-To: <002201c3efbb$b46beac0$0100a8c0@2800plus> Message-ID: David Liu wrote: >> Troubleshooting the SpamBayes Outlook plugin: >> Check that Outlook shows the addin as enabled >> If the SpamBayes addin is not listed, then SpamBayes >> should be reinstalled (Note that running regsvr32.exe >> outlook_addin.dll from the SpamBayes directory may also >> solve this problem) > > regsvr32.exe spambayes_addin.dll is correct. That was true for the 0.81 plugin release. However, the online Troubleshooting Guide has been updated for the 0.9 release that will be announced as soon as the files on SourceForge have been verified. In the 0.9 release, "outlook_addin.dll" is the correct name. -- Kenny Pitt From brent at bjohnson.net Tue Feb 10 10:22:05 2004 From: brent at bjohnson.net (Brent L Johnson) Date: Tue Feb 10 10:22:35 2004 Subject: [Spambayes] Database Format In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304677912@its-xchg4.massey.ac.nz> Message-ID: <007901c3efe9$a6de22a0$c801a8c0@torpedo> > In the source distribution (you'll need Python installed as > well) there is a script called sb_dbexpimp.py. It'll convert > the database to a flat-text '`'-separated file, which you can use. I installed SB on my Linux box from the source distribution (1.0a9). I then copied my database from my WinXP machine (I'm using the Outlook plugin) and ran: sb_dbexpimp.py -e -p default_bayes_database.db -f bayes.out Not sure if these are the correct parameters, but I based it off the usage help for sb_dbexpimp. But when I run that I get the following: Traceback (most recent call last): File "/usr/bin/sb_dbexpimp.py", line 266, in ? runExport(dbFN, useDBM, flatFN) File "/usr/bin/sb_dbexpimp.py", line 119, in runExport bayes = spambayes.storage.open_storage(dbFN, useDBM) File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 669, in open_storage return klass(data_source_name) File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 90, in __init__ self.load() File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 113, in load tempbayes = pickle.load(fp) cPickle.UnpicklingError: invalid load key, ''. Is the db version for the Outlook plugin a picked db or DBM? Thanks, - Brent From kennypitt at hotmail.com Tue Feb 10 13:23:50 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Feb 10 13:24:48 2004 Subject: [Spambayes] Database Format In-Reply-To: <007901c3efe9$a6de22a0$c801a8c0@torpedo> Message-ID: Brent L Johnson wrote: > I installed SB on my Linux box from the source distribution > (1.0a9). I then copied my database from my WinXP machine (I'm > using the Outlook plugin) and ran: > > sb_dbexpimp.py -e -p default_bayes_database.db -f bayes.out > > Not sure if these are the correct parameters, but I based > it off the usage help for sb_dbexpimp. > [snip traceback] > > Is the db version for the Outlook plugin a picked db or DBM? "default_bayes_database.db" is a BerkeleyDB file. You'll want the -d option instead of the -p option. -- Kenny Pitt From brent at bjohnson.net Tue Feb 10 13:35:31 2004 From: brent at bjohnson.net (Brent L Johnson) Date: Tue Feb 10 13:35:42 2004 Subject: [Spambayes] Database Format In-Reply-To: Message-ID: <00a501c3f004$ab7e1c50$c801a8c0@torpedo> Ahhh.. the default help (other than down in the examples) doesnt mention the -d option: Usage: sb_dbexpimp [options] options: -e : export -i : import -v : verbose mode (some additional diagnostic messages) -f: FN : flat file to export to or import from -p: FN : name of pickled database file to use -f: FN : name of dbm database file to use -m : merge import into an existing database file. This is meaningful only for import. If omitted, a new database file will be created. If specified, the imported wordinfo will be merged into an existing database. Run dbExpImp -h for more information. -o: section:option:value : set [section, option] in the options database to value -h : help But it does list -f twice.. Im guessing that second one should be a -d. Thanks, - Brent > -----Original Message----- > From: Kenny Pitt [mailto:kennypitt@hotmail.com] > Sent: Tuesday, February 10, 2004 1:24 PM > To: 'Brent L Johnson'; spambayes@python.org > Subject: RE: [Spambayes] Database Format > > > Brent L Johnson wrote: > > I installed SB on my Linux box from the source distribution > (1.0a9). > > I then copied my database from my WinXP machine (I'm using > the Outlook > > plugin) and ran: > > > > sb_dbexpimp.py -e -p default_bayes_database.db -f bayes.out > > > > Not sure if these are the correct parameters, but I based > > it off the usage help for sb_dbexpimp. > > [snip traceback] > > > > Is the db version for the Outlook plugin a picked db or DBM? > > "default_bayes_database.db" is a BerkeleyDB file. You'll > want the -d option instead of the -p option. > > -- > Kenny Pitt > > > > From brent at bjohnson.net Tue Feb 10 13:38:27 2004 From: brent at bjohnson.net (Brent L Johnson) Date: Tue Feb 10 13:38:37 2004 Subject: [Spambayes] Database Format In-Reply-To: Message-ID: <00a601c3f005$14469aa0$c801a8c0@torpedo> OK I tried it with the -d option and I got the following error: $ sb_dbexpimp.py -e -d default_bayes_database.db -f bayes.export Traceback (most recent call last): File "/usr/bin/sb_dbexpimp.py", line 266, in ? runExport(dbFN, useDBM, flatFN) File "/usr/bin/sb_dbexpimp.py", line 119, in runExport bayes = spambayes.storage.open_storage(dbFN, useDBM) File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 669, in open_storage return klass(data_source_name) File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 164, in __init__ self.load() File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line 189, in load self.dbm = dbmstorage.open(self.db_name, self.mode) File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", line 65, in open return f(db_name, mode) File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", line 20, in open_dbhash return bsddb.hashopen(*args) bsddb.error: (22, 'Invalid argument') - Brent > -----Original Message----- > From: Kenny Pitt [mailto:kennypitt@hotmail.com] > Sent: Tuesday, February 10, 2004 1:24 PM > To: 'Brent L Johnson'; spambayes@python.org > Subject: RE: [Spambayes] Database Format > > > Brent L Johnson wrote: > > I installed SB on my Linux box from the source distribution > (1.0a9). > > I then copied my database from my WinXP machine (I'm using > the Outlook > > plugin) and ran: > > > > sb_dbexpimp.py -e -p default_bayes_database.db -f bayes.out > > > > Not sure if these are the correct parameters, but I based > > it off the usage help for sb_dbexpimp. > > [snip traceback] > > > > Is the db version for the Outlook plugin a picked db or DBM? > > "default_bayes_database.db" is a BerkeleyDB file. You'll > want the -d option instead of the -p option. > > -- > Kenny Pitt > > > > From kennypitt at hotmail.com Tue Feb 10 14:01:10 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Feb 10 14:02:04 2004 Subject: [Spambayes] Database Format In-Reply-To: <00a501c3f004$ab7e1c50$c801a8c0@torpedo> Message-ID: Brent L Johnson wrote: > Ahhh.. the default help (other than down in the examples) > doesnt mention the -d option: > >... > -f: FN : flat file to export to or import from > -p: FN : name of pickled database file to use > -f: FN : name of dbm database file to use >... > > But it does list -f twice.. Im guessing that second one > should be a -d. Oops, you're absolutely right. I just fixed it in the source, so it should be correct next time around. -- Kenny Pitt From kennypitt at hotmail.com Tue Feb 10 14:09:12 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Feb 10 14:10:31 2004 Subject: [Spambayes] Database Format In-Reply-To: <00a601c3f005$14469aa0$c801a8c0@torpedo> Message-ID: Brent L Johnson wrote: > OK I tried it with the -d option and I got the following error: > > $ sb_dbexpimp.py -e -d default_bayes_database.db -f bayes.export > Traceback (most recent call last): > File "/usr/bin/sb_dbexpimp.py", line 266, in ? > runExport(dbFN, useDBM, flatFN) > File "/usr/bin/sb_dbexpimp.py", line 119, in runExport > bayes = spambayes.storage.open_storage(dbFN, useDBM) > File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line > 669, in open_storage > return klass(data_source_name) > File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line > 164, in __init__ > self.load() > File "/usr/lib/python2.2/site-packages/spambayes/storage.py", line > 189, in load > self.dbm = dbmstorage.open(self.db_name, self.mode) > File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", > line 65, in open > return f(db_name, mode) > File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", > line 20, in open_dbhash > return bsddb.hashopen(*args) > bsddb.error: (22, 'Invalid argument') I see that you're using Python 2.2. Have you installed the pybsddb (aka bsddb3) package? IIRC, this package is required if you are using BerkeleyDB with Python 2.2. http://pybsddb.sourceforge.net/ -- Kenny Pitt From brent at bjohnson.net Tue Feb 10 15:16:36 2004 From: brent at bjohnson.net (Brent L Johnson) Date: Tue Feb 10 15:16:47 2004 Subject: [Spambayes] Database Format In-Reply-To: Message-ID: <002201c3f012$cab82850$c801a8c0@torpedo> OK - I downloaded BerkeleyDB 4.2 and installed it under /usr/local/ (under Linux). I then downloaded pybsddb and followed the instructions for building it. I didnt get any errors until I ran the test .. and it hung on: test02_threaded (bsddb3.tests.test_lock.LockingTestCase) ... (and hung bad.. I had to do a control-\ to break out) I ran the "python setup.py install" anyway, and it seems to install (I suppose.. I dont know much about python). And I still get the same errors. Hrm... - Brent > -----Original Message----- > From: Kenny Pitt [mailto:kennypitt@hotmail.com] > Sent: Tuesday, February 10, 2004 2:09 PM > To: 'Brent L Johnson'; spambayes@python.org > Subject: RE: [Spambayes] Database Format > > > Brent L Johnson wrote: > > OK I tried it with the -d option and I got the following error: > > > > $ sb_dbexpimp.py -e -d default_bayes_database.db -f bayes.export > > Traceback (most recent call last): > > File "/usr/bin/sb_dbexpimp.py", line 266, in ? > > runExport(dbFN, useDBM, flatFN) > > File "/usr/bin/sb_dbexpimp.py", line 119, in runExport > > bayes = spambayes.storage.open_storage(dbFN, useDBM) > > File > "/usr/lib/python2.2/site-packages/spambayes/storage.py", line > > 669, in open_storage > > return klass(data_source_name) > > File > "/usr/lib/python2.2/site-packages/spambayes/storage.py", line > > 164, in __init__ > > self.load() > > File > "/usr/lib/python2.2/site-packages/spambayes/storage.py", line > > 189, in load > > self.dbm = dbmstorage.open(self.db_name, self.mode) > > File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", > > line 65, in open > > return f(db_name, mode) > > File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", > > line 20, in open_dbhash > > return bsddb.hashopen(*args) > > bsddb.error: (22, 'Invalid argument') > > I see that you're using Python 2.2. Have you installed the > pybsddb (aka > bsddb3) package? IIRC, this package is required if you are > using BerkeleyDB with Python 2.2. > http://pybsddb.sourceforge.net/ -- Kenny Pitt From nowhere at cox.net Wed Feb 11 00:23:05 2004 From: nowhere at cox.net (Nowhere) Date: Wed Feb 11 00:23:05 2004 Subject: [Spambayes] Spam to Ham ratio Message-ID: <03e901c3f05f$20bc5a40$0100a8c0@gandalf> Hi everyone, I currently have 139 Good and 286 Spam trained. I get about 10x more spam than ham. I find that my ham is solidly classified at 0-1% while spambayes still misses some spam at numbers like 83% (and some at 0%). These are the spam messages with lots of random words thrown in to try to defeat the statistical filters. Anyway it seems to me that with my HAM being recognized so perfectly while the spam is less than perfect that I would need to classify more spam, further deviating from the recommended 1:1 ratio. Or do you think the recognition would work better if I increased my HAM messages (even tho they are all coming in with 0%)? In anycase of 690 spam I got in the last two days I only have to delete as spam 18 of them. Not bad. Thanks in advance for any comments! Eric -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040210/7e00b6fc/attachment.html From perrin at lausanne-tourisme.ch Wed Feb 11 04:35:11 2004 From: perrin at lausanne-tourisme.ch (Perrin Jean-Marc) Date: Wed Feb 11 04:35:13 2004 Subject: [Spambayes] Installation problem of SpamBayes-Outlook-Setup-0081 Message-ID: Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 786 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040211/aeee68aa/spambayes3-0001.obj From Markus.Manck at xcc.de Wed Feb 11 06:24:00 2004 From: Markus.Manck at xcc.de (Markus Manck) Date: Wed Feb 11 06:16:51 2004 Subject: [Spambayes] bugreport Message-ID: Hi, the spambayes plugin for outlook does not initialize successfully during start with the message "Could not watch the specified folders". Version Outlook: Outlook 200 SP-3(9.0.0.6627) Version Spam Bayes: Binary Version 0.81 Version Windows: Windows 2000 Professional 5.0.2195 Service Pack 4 Build 2195 Log file is attached. Greetings Markus Markus Manck - Consultant Xcc Software AG Bahnhofplatz 8, 76137 Karlsruhe, Germany Telefon +49 721 93276-123 Telefax +49 721 93276-76 E-Mail Markus.Manck@xcc.de Web http://www.xcc.de <> -------------- next part -------------- Loaded bayes database from 'C:\Dokumente und Einstellungen\markus\Anwendungsdaten\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Dokumente und Einstellungen\markus\Anwendungsdaten\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 4806 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.0.2195 (Service Pack 4) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] SpamBayes: Watching for new messages in folder Posteingang ERROR: 'There was an error initializing the Spam plugin.\r\n\r\nSpam filtering has been disabled. Please re-configure\r\nand re-enable this plugin\r\n\r\nError details:\r\nCould not watch the specified folders' Traceback (most recent call last): File "out1.pyz/addin", line 1237, in FiltersChanged File "out1.pyz/addin", line 1248, in UpdateFolderHooks File "out1.pyz/addin", line 1273, in _HookFolderEvents File "out1.pyz/msgstore", line 539, in GetOutlookItem File "win32com/gen_py\00062FFF-0000-0000-C000-000000000046x0x9x0\_NameSpace.py", line 50, in GetFolderFromID com_error: (-2147352567, 'Ausnahmefehler aufgetreten.', (4096, 'Microsoft Outlook', 'Ein Client-Vorgang ist fehlgeschlagen.', None, 0, -2147221241), None) Error processing missed messages! Traceback (most recent call last): File "out1.pyz/addin", line 1193, in OnConnection File "out1.pyz/addin", line 1219, in ProcessMissedMessages File "out1.pyz/addin", line 1263, in _GetHookForFolder KeyError: ('\x00\x00\x00\x008\xa1\xbb\x10\x05\xe5\x10\x1a\xa1\xbb\x08\x00+*V\xc2\x00\x00EMSMDB.DLL\x00\x00\x00\x00\x00\x00\x00\x00\x1bU\xfa \xaaf\x11\xcd\x9b\xc8\x00\xaa\x00/\xc4Z\x0c\x00\x00\x00SLOWFOX\x00/o=Xcc Software GmbH/ou=XCC/cn=Recipients/cn=markus\x00', '\x00\x00\x00\x00\xb5\xfb5\x83\xe1\xce\xd1\x11\xa0\x0f\x00`\x08\xbf\xfb\xc8\x01\x00\x91\xe0\xe4\x89\xfc\xca\xd1\x11\xa0\x08\x00`\x08\xbf\xfb\xc8\x00\x00\x00\x00HA\x00\x00') From kennypitt at hotmail.com Wed Feb 11 09:03:18 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Feb 11 09:04:15 2004 Subject: [Spambayes] bugreport In-Reply-To: Message-ID: Markus Manck wrote: > the spambayes plugin for outlook does not initialize successfully > during start with the message "Could not watch the specified folders". As an initial guess, this error often indicates that one of the folders that you initially configured to be filtered is no longer available. You can check this on the Filtering tab in SpamBayes Manager. Under "Filter the following folders" heading you may see one of your folders listed as something like "". If so, click the browse button and reselect the correct folders, then re-enable SpamBayes on the General tab. -- Kenny Pitt From kennypitt at hotmail.com Wed Feb 11 09:08:50 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Feb 11 09:09:46 2004 Subject: [Spambayes] Installation problem of SpamBayes-Outlook-Setup-0081 In-Reply-To: Message-ID: Perrin Jean-Marc wrote: > I have a known problem, but I cannot fin the solution, Can you > help me ? > > After installation, i got the following message : > C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll > Unable to register the DLL/OCX: DllRegisterServer failed; code > 0x80040201. You may want to try the new 1.0a9 (0.9) version that was just released. We believe that it solves this problem. It is not linked directly from the SpamBayes Windows page yet, but you can find it on the SourceForge Files page. Look for file "spambayes-1.0a9.exe" under the "spambayes" section, not the "Outlook Addin" section. http://sourceforge.net/project/showfiles.php?group_id=61702 -- Kenny Pitt From mcounts at iwon.com Wed Feb 11 11:44:32 2004 From: mcounts at iwon.com (Michael Counts) Date: Wed Feb 11 11:45:43 2004 Subject: [Spambayes] error Message-ID: <000801c3f0be$56b7bb80$c2e3030a@mc94546> When i try to configure, it tells me failure to connect to web server? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040211/1fe28b3b/attachment.html From kennypitt at hotmail.com Wed Feb 11 13:44:02 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Feb 11 13:45:35 2004 Subject: [Spambayes] error In-Reply-To: <000801c3f0be$56b7bb80$c2e3030a@mc94546> Message-ID: Michael Counts wrote: > When i try to configure, it tells me failure to connect to web > server? Could you be a little more specific? What SpamBayes application are you trying to configure? What version number? What operating system are you using? What have you done so far to install the application? How are you starting it (e.g. from a command line)? Did you get any error messages or exceptions when you ran it? -- Kenny Pitt From dkonkel at utmb.edu Wed Feb 11 14:01:45 2004 From: dkonkel at utmb.edu (Dave Konkel) Date: Wed Feb 11 14:01:51 2004 Subject: [Spambayes] Possible problem with SpamBayes Message-ID: <402A7C19.9030505@utmb.edu> I'm running the latest version (0.81) of the binary code with Outlook 2002 (upgraded with SP-2) on a Windows XP Pro machine. I was having a problem that our tech support people say is "unique to your computer;" when I do a search on my inbox, either simple or advanced, I can't move a message from the results list to a personal folder -- even on a single highlighted message I get the error message "messages can't be moved" when I try to drag and drop. However, the following MSKB entry provided a work-around, essentially using the "move message" icon: (or command under "edit" if the target folder hasn;t been used recently: http://support.microsoft.com/default.aspx?scid=kb;en-us;301415 Since this problem apparently is due to SPAM Bayes, you might want to include it in the FAQ. -- David A. Konkel, Ph.D. Associate Professor and Research Coordinator Dept. of Human Biological Chemistry & Genetics University of Texas Medical Branch, Galveston (409)772-4074 From dontbotherworld at yahoo.com Wed Feb 11 15:30:07 2004 From: dontbotherworld at yahoo.com (dont bother) Date: Wed Feb 11 15:30:19 2004 Subject: [Spambayes] Lists:Whitelists and Blacklists Message-ID: <20040211203007.47450.qmail@web60806.mail.yahoo.com> Does anyone knows of any good research papers on whitlists and blacklists? Google did not help me much Thanks Dont __________________________________ Do you Yahoo!? Yahoo! Finance: Get your refund fast by filing online. http://taxes.yahoo.com/filing.html From perrin at lausanne-tourisme.ch Wed Feb 11 04:26:59 2004 From: perrin at lausanne-tourisme.ch (Perrin Jean-Marc) Date: Wed Feb 11 15:53:28 2004 Subject: [Spambayes] Installation problem of SpamBayes-Outlook-Setup-0081 Message-ID: Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 786 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040211/a3d9452d/spambayes3-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: Outlook.bmp Type: image/bmp Size: 182790 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040211/a3d9452d/Outlook-0001.bin From sgoodrich at satx.rr.com Wed Feb 11 17:44:00 2004 From: sgoodrich at satx.rr.com (Sandy Gazda) Date: Wed Feb 11 17:44:08 2004 Subject: [Spambayes] (no subject) Message-ID: <008d01c3f0f0$8adccf90$0200a8c0@DH2WP121> I have this product on my computer and am frankly very confused. Most attachments sent to me are now being deleted, and I cannot read them. However, they are legitimate, important attachments that I need to open. What can I do to correct this problem, and is there a way to recover those that have been sent to me and deleted by this program -- that I want to open? Also, is there a way to remove this software from my computer? Thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040211/2f4a3ff6/attachment.html From TGreve at mdy.com Wed Feb 11 19:43:13 2004 From: TGreve at mdy.com (Todd Greve) Date: Wed Feb 11 19:43:23 2004 Subject: [Spambayes] Suggestions for SpamBayes Message-ID: <8558EED8D18AAF479EDB2D7D023299680427A55F@mdymail.mdy.com> I am using the latest version of SpamBayes (binary 0.81) and find it extremely useful. I just have two items I would like to see 1. When you choose 'Delete as Spam', that it actually moves the message to the Deleted Items after it learns what it can from this. 2. The tray icon be optionally removed when an item is moved to the Spam folder (not possible spam) both manually and automatically The product is great, keep up the good work Thanks Todd Greve -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040211/5ffe481c/attachment.html From tameyer at ihug.co.nz Wed Feb 11 20:04:47 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 11 20:06:48 2004 Subject: [Spambayes] Suggestions for SpamBayes In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DCF94@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677941@its-xchg4.massey.ac.nz> > I am using the latest version of SpamBayes > (binary 0.81) and find it extremely useful. I just > have two items I would like to see > 1. When you choose 'Delete as Spam', that it actually > moves the message to the Deleted Items after it > learns what it can from this. FAQ 3.11 should explain why this is not the case: > 2. The tray icon be optionally removed when an item > is moved to the Spam folder (not possible spam) both > manually and automatically And FAQ 3.8 should answer this: =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 11 20:13:33 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 11 20:13:54 2004 Subject: [Spambayes] (no subject) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DCF5C@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677942@its-xchg4.massey.ac.nz> > I have this product on my computer and am frankly > very confused. Most attachments sent to me are now > being deleted, and I cannot read them. However, they are > legitimate, important attachments that I need to open. > What can I do to correct this problem, and is there a > way to recover those that have been sent to me and deleted > by this program -- that I want to open? You appear to be using Outlook Express - is that correct? If so, does that mean you're using sb_server? (All the following assumes an answer of "yes" to both questions). sb_server should be just letting your mail through, only adding one or more headers, and possibly changing the to or subject headers (depending on how you have set it up). If something is going wrong with attachments, then something is going very wrong with the proxy. You should certainly be getting error reports printed out, which would be useful to have, in order to figure out what is happening. To be clear: SpamBayes does not try to remove any attachments from messages. If this is happening, then it is a bug, but it seems much more likely (given that this hasn't been reported before) that something *else* is doing this. Maybe your ISP has started doing it, or some virus protection software on your system? The way to check is to temporarily disable spambayes and see if that changes anything. To do this, just set your mail client to connect directly to the mail server rather than to localhost. If the problem goes away, then spambayes is at fault (and we would really like to have as much information as possible, in order to fix the problem), and if the problem doesn't go away, then something else is doing this. > Also, is there a way to remove this software from my computer? Sure. Just change your mail client back to connect directly to the pop server, rather than to localhost - that will stop SpamBayes from doing anything with your mail. To remove it, just remove all the files that start with sb_ in the Scripts directory in your Python installation, and the spambayes directory in the Lib/site-packages directory in your Python installation. If you've set it to start automatically, then just undo whatever you did to do that*. * Yes, this is a complicated procedure - this is as a result of running from source. Users of the new binary version for Windows (to be released this week) will have a simple GUI uninstalling option. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tim.one at comcast.net Wed Feb 11 20:20:06 2004 From: tim.one at comcast.net (Tim Peters) Date: Wed Feb 11 20:20:10 2004 Subject: [Spambayes] (no subject) In-Reply-To: <008d01c3f0f0$8adccf90$0200a8c0@DH2WP121> Message-ID: [Sandy Gazda] > I have this product on my computer Which program? The SpamBayes code comes in many flavors, and I can't guess which one you're using. > and am frankly very confused. Most attachments sent to me are now > being deleted, and I cannot read them. However, they are legitimate, > important attachments that I need to open. What can I do to correct > this problem, and is there a way to recover those that have been sent > to me and deleted by this program -- that I want to open? There have been no other reports of SpamBayes causing such behavior, so I expect you're seeing the effects of something else. > Also, is there a way to remove this software from my computer? Certainly , but until you tell us what you downloaded, what you installed, and which email client you're using, we can't guess. I *would* guess you're using the Outlook addin, except your email headers said you composed your message with Outlook Express, and the Outlook addin can't be used with OE. From tameyer at ihug.co.nz Wed Feb 11 20:26:33 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 11 20:26:53 2004 Subject: [Spambayes] Spam to Ham ratio In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC4272@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677943@its-xchg4.massey.ac.nz> > I currently have 139 Good and 286 Spam trained. > I get about 10x more spam than ham. I find that my > ham is solidly classified at 0-1% while spambayes > still misses some spam at numbers like 83% (and some > at 0%). Are the ones getting 0% a result of the Outlook plug-in bug that does that? IOW, if you look at the clues for one of the 0% messages, is it actually scoring 0%? (If it is, then that's quite strange). It that's the case, then it seems that a simple solution would be to simply move your spam threshold down to 80%, rather than the default 90%. (This assumes that you don't ever see any unsures that score above 80%). > These are the spam messages with lots of random words > thrown in to try to defeat the statistical filters. Have you looked at the clues for any of these? It seems likely (and many people have found) that the random words won't do anything to help move it towards ham. A random word is most likely to be unknown to your filter, so won't be used, and if it is known, has about as much chance of being a spam clue as a ham one. (Unless the words aren't random, and are tailored to you personally). Looking at the spam clues would tell you if it is actually the random words that are making the difference. > Anyway it seems to me that with my ham being recognized > so perfectly while the spam is less than perfect that > I would need to classify more spam, further deviating > from the recommended 1:1 ratio. > Or do you think the recognition would work better if I > increased my ham messages (even tho they are all coming > in with 0%)? Try both, and see what happens. Most (but not all) of our testing has shown that an imbalance hurts, although that usually means a big imbalance, not a 2::1 sort of thing (which might even help). Your mail mix is unique to you, though, so the only way to know for sure is to try it out. > In anycase of 690 spam I got in the last two days I > only have to delete as spam 18 of them. Not bad. [I presume that these have all been unsures, rather than false-negatives.] An unsure rate of 2.6% is pretty good - this isn't all that different from the rate gained in lots of the testing. If you can cut even half of these by lowering the threshold to 80%, then that's probably as good as it's going to get, without changing the code itself. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From rmalayter at bai.org Wed Feb 11 17:56:22 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Feb 11 21:47:46 2004 Subject: [Spambayes] (no subject) Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01E19BC2@cliff.bai.org> First off, spambayes does not delete file attachments. That is probably being done "upstream" by your company or internet service provider's mail server, to prevent the spread of viruses. In my opinion, there is basically *no* legiitmate reason to send an executable file via email, especially in a corporate environment. It's simply too dangerous. We block all executable attachments at my company, to protect against new viruses that antivirus software cannot detect. Newer versions of Outlook also block all executable file attachments, whether or not you like it. Secondly, if you didn't install Spambayes, who did? If you don't want it, go to add/remove programs in the Windows control panel, find spamBayes, and remove it. Regards, Ryan ________________________________ From: Sandy Gazda Sent: Wednesday, February 11, 2004 4:44 PM To: spambayes@python.org Subject: [Spambayes] (no subject) I have this product on my computer and am frankly very confused. Most attachments sent to me are now being deleted, and I cannot read them. However, they are legitimate, important attachments that I need to open. What can I do to correct this problem, and is there a way to recover those that have been sent to me and deleted by this program -- that I want to open? Also, is there a way to remove this software from my computer? Thank you. From mdaniels at new.rr.com Wed Feb 11 22:46:16 2004 From: mdaniels at new.rr.com (Michael E. Daniels) Date: Wed Feb 11 22:46:28 2004 Subject: [Spambayes] double listing of all inbox emails Message-ID: <002201c3f11a$c476a760$2b02a8c0@lightning> Hello, I have windows xp pro operating system and use the Outlook package. Since installing Spambayes 8.1, I have 2 of everything in my inbox. This seems to happen after I delete the first email message. I have looked at the trouble shooting guide but did not see anything relating to this problem. There are no log files on my system for Spambayes or should I say I can not find them. This is the file I used to install Spambayes, Any help would be appreciated. Thanks From DWalker at Zianet.com Thu Feb 12 01:13:33 2004 From: DWalker at Zianet.com (David Walker) Date: Thu Feb 12 01:12:46 2004 Subject: [Spambayes] SpamBayes -- initial problem Message-ID: SpamBayes, Outlook 2000 plugin version 0081, on Windows 2000 SP4. The "Welcome" text opens in a browser window. I actually read it -- there's a line that says this: "Outlook does not allow us to automatically add the spam score to your Outlook folder views - but you can do it manually by following these instructions." This is from a file called Welcome.html from the install directory. So I clicked on "following these instructions". Unfortunately, the "following these instructions" link is a hyperlink to a file on the E drive. I don't have an E-drive, so of course this link is broken. Second: On Outlook's SpamBayes toolbar, the Help/About page should certainly give the version number somewhere. That's the law in Windows programs! Thanks. I look forward to working with SpamBayes. David Walker From tameyer at ihug.co.nz Thu Feb 12 01:37:34 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 01:37:56 2004 Subject: [Spambayes] SpamBayes -- initial problem In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD018@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677948@its-xchg4.massey.ac.nz> > The "Welcome" text opens in a browser window. I actually > read it -- there's a line that says this: > > "Outlook does not allow us to automatically add the spam > score to your Outlook folder views - but you can do it > manually by following these instructions." > > This is from a file called Welcome.html from the install > directory. So I clicked on "following these instructions". > Unfortunately, the "following these instructions" link is a > hyperlink to a file on the E drive. I don't have an E-drive, > so of course this link is broken. Thanks. This has been fixed in the 1.0a9 (0.9) release, which is being released at the moment. > Second: On Outlook's SpamBayes toolbar, the Help/About page > should certainly give the version number somewhere. That's > the law in Windows programs! Well, the version number is on the Manager dialog's front page, which is somewhere that people are more likely to find it. If there was an "About" item that opened up a dialog box, like many Windows applications have, then sure, it should have the version number, but I'm not so sure about a html page. The purpose is really different too - standard "About" dialog boxes have version numbers, authors, copyright and stuff like this, whereas the "About SpamBayes" link is really just a help page, like you'd find in a manual (which is why it's under the Help submenu, rather than as an item on it's own). That said, if you really feel that it should be there, please open a feature request on sourceforge - feature requsts via the mailing list tend to get lost. The main reason it's not there now is because it's yet another place to have to update the version number - the html files aren't generated automatically, whereas the dialogs are. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From nowhere at cox.net Thu Feb 12 01:42:22 2004 From: nowhere at cox.net (Nowhere) Date: Thu Feb 12 01:42:21 2004 Subject: [Spambayes] Spam to Ham ratio In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304677943@its-xchg4.massey.ac.nz> Message-ID: <003d01c3f133$5ecb1720$0100a8c0@gandalf> Thanks for the input. I will check the next 0% scored spam I get to see what the clues are in reality. I was unaware of a bug. Will it manifest as showing the outlook plugin score as 0% but inside the clues it is greater? AS for the threshold I had already moved it down to 85% and it has caught almost 100% of the spam. Only an occasional group of unsures will come through and the occasional 0% spam (which is almost always an identical spam I have already classified but with randomly placed punctuation or words. And yes of the 18 "missed spam" all but one were unsure. The one last one was that 0% "bug" we are talking about. I will check a little closer of the next couple days but like I said I am VERY pleased with the results. Having to look at only 2% of the spam is fine by me! Thanks for helping! Eric -----Original Message----- From: Tony Meyer [mailto:tameyer@ihug.co.nz] Sent: Wednesday, February 11, 2004 6:27 PM To: 'Nowhere'; spambayes@python.org Subject: RE: [Spambayes] Spam to Ham ratio > I currently have 139 Good and 286 Spam trained. > I get about 10x more spam than ham. I find that my > ham is solidly classified at 0-1% while spambayes > still misses some spam at numbers like 83% (and some > at 0%). Are the ones getting 0% a result of the Outlook plug-in bug that does that? IOW, if you look at the clues for one of the 0% messages, is it actually scoring 0%? (If it is, then that's quite strange). It that's the case, then it seems that a simple solution would be to simply move your spam threshold down to 80%, rather than the default 90%. (This assumes that you don't ever see any unsures that score above 80%). > These are the spam messages with lots of random words > thrown in to try to defeat the statistical filters. Have you looked at the clues for any of these? It seems likely (and many people have found) that the random words won't do anything to help move it towards ham. A random word is most likely to be unknown to your filter, so won't be used, and if it is known, has about as much chance of being a spam clue as a ham one. (Unless the words aren't random, and are tailored to you personally). Looking at the spam clues would tell you if it is actually the random words that are making the difference. > Anyway it seems to me that with my ham being recognized > so perfectly while the spam is less than perfect that > I would need to classify more spam, further deviating > from the recommended 1:1 ratio. > Or do you think the recognition would work better if I > increased my ham messages (even tho they are all coming > in with 0%)? Try both, and see what happens. Most (but not all) of our testing has shown that an imbalance hurts, although that usually means a big imbalance, not a 2::1 sort of thing (which might even help). Your mail mix is unique to you, though, so the only way to know for sure is to try it out. > In anycase of 690 spam I got in the last two days I > only have to delete as spam 18 of them. Not bad. [I presume that these have all been unsures, rather than false-negatives.] An unsure rate of 2.6% is pretty good - this isn't all that different from the rate gained in lots of the testing. If you can cut even half of these by lowering the threshold to 80%, then that's probably as good as it's going to get, without changing the code itself. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 12 01:50:40 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 01:51:01 2004 Subject: [Spambayes] Spam to Ham ratio In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD01C@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677949@its-xchg4.massey.ac.nz> > Thanks for the input. I will check the next 0% scored spam I > get to see what the clues are in reality. I was unaware of a > bug. Will it manifest as showing the outlook plugin score as > 0% but inside the clues it is greater? Yes. Usually (always?) it'll be the first message to be scored when Outlook is opened. Those four links are to bug reports that are very similar to this (and to each other, although it's not certain if they're all the result of a single bug or not). If you like, you can read through those, and add any comments you have to whichever one seems most like yours. (Assuming of course that it's not actually scoring 0%). > I will check a little closer of the next couple days but like > I said I am VERY pleased with the results. Having to look at > only 2% of the spam is fine by me! Great :) > Thanks for helping! No worries - be sure to let us know if anything goes wrong. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From kennypitt at hotmail.com Wed Feb 11 08:57:36 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Thu Feb 12 02:12:24 2004 Subject: [Spambayes] Spam to Ham ratio In-Reply-To: <03e901c3f05f$20bc5a40$0100a8c0@gandalf> Message-ID: Nowhere wrote: > I currently have 139 Good and 286 Spam trained. I get about 10x more > spam than ham. I find that my ham is solidly classified at 0-1% > while spambayes still misses some spam at numbers like 83% (and some > at 0%). These are the spam messages with lots of random words > thrown in to try to defeat the statistical filters. > > Anyway it seems to me that with my HAM being recognized so perfectly > while the spam is less than perfect that I would need to classify > more spam, further deviating from the recommended 1:1 ratio. You're right that your imbalance will tend to increase if you train only on mistakes and unsures in this case. Your current ratio is only about 2:1, though, which isn't bad compared to many reports we've seen. Anecdotal evidence seems to indicate you're probably OK up until about 5:1 or so, and some have reported perfectly acceptable results with much higher ratios. It's not a bad idea to train on some extra ham now and then to improve your balance, but try to look for the ones that score furthest from a perfect 0.00 (even if it isn't by much). If your ham is classifying consistently near 0% and you are missing some spams around 80%, then you might want to try reducing your Certain Spam threshold on the Filtering tab. I have mine set to 60 currently, although I wouldn't recommend going quite that low in general. Around 70-75 should be fine, though. -- Kenny Pitt From lists at musall.de Thu Feb 12 03:16:54 2004 From: lists at musall.de (Maik Musall) Date: Thu Feb 12 03:17:10 2004 Subject: [Spambayes] Language distribution Message-ID: <20040212081654.GA30377@musall.de> Hi there, I get about 60 spams a day, and installed spambayes a week ago. I'm very happy with it, as it filters out most of it and didn't have any false positives yet. However I discovered a certain weakness. Most of the incoming spam is in English, while a large portion of my ham is German. So when I get English ham, it's often classified not near 0 but at about 0.20, while German spam (which is currently evolving) is often not recognized as such. I also have a similar ratio with HTML mails and non-HTML mails. Spambayes also has some problems distinguishing real MDA error messages from those MyDoom stuff with the typical attachments. I'm currently trying to connect enough of that to make up a procmail rule that catches that stuff by spambayes classification combined with content length and attachment configuration. I'd like to share the experiences with especially that type of problems. Some information about my spambayes configs: I trained spambayes with about 9000 spams and 6000 hams, accepting that the spam was from a few months while ham was from a few years - no other chance to get near the ideal 1:1 ratio. I plan to make experiments with just a few 100 of the newer spams and hams, but I want to run it at least a few weeks with this configuration to collect more information. My hammiedb is 23 MBytes now. I use spambayes with a procmail script that does the following: 1. Sort out mailing lists by List-Id, Sender or From headers 2. Collect all the rest as copy in an unread reference folder 3. collect some spam with certain subject words, or from big@boss.com etc. 4. Filter the rest through spambayes, and put it into spam and unsure folders if marked so 5. The rest goes into my inbox. Regards Maik From Amir_Katz at bmc.com Thu Feb 12 03:38:12 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Thu Feb 12 03:39:38 2004 Subject: [Spambayes] Minor inconvenience with SpamBayes new version Message-ID: I've just installed the new version of the Outlook add-in. No problem during install. Then I ran the 'Filter messages' action on the inbox (which was previously scanned by the previous version) and about 50% of the message were moved to the 'Spam Suspects' folders and a very small number were moved to the 'spam' folder. Is this a one-time fluke or what? Thanks. Amir "The C Programming Language - A language which combines the flexibility of assembly language with the power of assembly language." -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040212/763d0414/attachment.html From Markus.Manck at xcc.de Thu Feb 12 04:24:23 2004 From: Markus.Manck at xcc.de (Markus Manck) Date: Thu Feb 12 04:17:16 2004 Subject: AW: [Spambayes] bugreport Message-ID: It seems that the error was resulting in checking "include subfolders" on a folder without subfolders. Unchecking this corrected the problem. Greetings Markus > -----Urspr?ngliche Nachricht----- > Von: Kenny Pitt [mailto:kennypitt@hotmail.com] > Gesendet: Mittwoch, 11. Februar 2004 15:03 > An: Markus Manck; spambayes@python.org > Betreff: RE: [Spambayes] bugreport > > > Markus Manck wrote: > > the spambayes plugin for outlook does not initialize successfully > > during start with the message "Could not watch the > specified folders". > > As an initial guess, this error often indicates that one of > the folders > that you initially configured to be filtered is no longer available. > You can check this on the Filtering tab in SpamBayes Manager. Under > "Filter the following folders" heading you may see one of your folders > listed as something like "". If so, click the browse > button and reselect the correct folders, then re-enable > SpamBayes on the > General tab. > > -- > Kenny Pitt > From alorenz at wfyi.org Thu Feb 12 08:12:56 2004 From: alorenz at wfyi.org (Anthony Lorenz) Date: Thu Feb 12 08:29:14 2004 Subject: [Spambayes] Curious User Message-ID: <310254130F424545BE34C2D38B672E7F01341F6E@mail.wfyi.org> That I am. Also the unwashed and unknowing. But please allow a question from a non IT type. When spam bayes does its scoring thing, I am assuming it is "opening" the unread email to score it and then [using outlook] tagging it as unread before placing it in the junk mail folder, from which I delete it. Here's my question. When it scores the email to filter it, does opening it send a hidden "receipt" to the spammer telling him or her that the email address is good? The reason I ask is that spam bayes is almost 100% accurate, but my spam has doubled since I started using it. Before, I would use outlook filters or just delete from the inbox without opening because I am smart enough to recognize a spam email without opening it. back then I got 13 a day average [I kept track]. Now, I have an average of 30 a day, of which spam bayes catches them all. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040212/d1d14d96/attachment.html From anthony at interlink.com.au Thu Feb 12 08:53:59 2004 From: anthony at interlink.com.au (Anthony Baxter) Date: Thu Feb 12 08:54:11 2004 Subject: [Spambayes] Curious User In-Reply-To: <310254130F424545BE34C2D38B672E7F01341F6E@mail.wfyi.org> References: <310254130F424545BE34C2D38B672E7F01341F6E@mail.wfyi.org> Message-ID: <402B8577.7070600@interlink.com.au> Anthony Lorenz wrote: > That I am. Also the unwashed and unknowing. But please allow a question > from a non IT type. > > When spam bayes does its scoring thing, I am assuming it is ?opening? > the unread email to score it and then [using outlook] tagging it as > unread before placing it in the junk mail folder, from which I delete it. > > Here?s my question. > > When it scores the email to filter it, does opening it send a hidden > ?receipt? to the spammer telling him or her that the email address is good? > > The reason I ask is that spam bayes is almost 100% accurate, but my spam > has doubled since I started using it. > > Before, I would use outlook filters or just delete from the inbox > without opening because I am smart enough to recognize a spam email > without opening it. back then I got 13 a day average [I kept track]. > > Now, I have an average of 30 a day, of which spam bayes catches them all. I'm not entirely sure about the Outlook innards, but I'd be suprised if this was happening. FWIW, my spam has gone through the roof in the last few weeks - a bad day is well in excess of 500 spams. :-( -- Anthony Baxter It's never too late to have a happy childhood. From spambayes.10.quihodiria at spamgourmet.com Thu Feb 12 09:30:50 2004 From: spambayes.10.quihodiria at spamgourmet.com (Toni) Date: Thu Feb 12 09:31:12 2004 Subject: [Spambayes] SpamBayes and Hotmail Message-ID: <005601c3f174$d387d260$2623b40f@scvcz2f9e> Hi, I have been using the SpamBayes Outlook plugin to filter several Hotmail accounts for about a month. I have found several problems, most of minor importance but I have found a big one that keeps me from using it anymore. I don't know if these are already known, so I will expose them. The "important" one: For some reason spam attachments end up in IE's cache. This would not be a problem if it wouldn't sound alarms and whistles about viruses. I don't mind having a virus in IE's cache if it will never gain control, but the corporative antivirus thinks otherwise. After a couple of incidents with IT I have decided I can not keep SpamBayes :-( (at least in this configuration, I will try to find other ways). Other minor inconveniences: Whenever Outlook starts SpamBayes tries to do something with the Hotmail Inboxes, but those are not yet available (need some time) and SpamBayes stops. It requires restarting SpamBayes every time. It does not appear to filter messages from Hotmail by itself. Tried background filtering, but does not work either. If you explicitly ask for filter through toolbar it works fine. Is there a way to use SpamBayes on Hotmail without using Outlook? I will tray to experiment with diferent POP and IMAP to Hotmail gateways... Toni From dmuller at cyberlogic.com Thu Feb 12 08:36:23 2004 From: dmuller at cyberlogic.com (dmuller@cyberlogic.com) Date: Thu Feb 12 09:57:31 2004 Subject: [Spambayes] Identified as ham, but sent to spam folder Message-ID: Skipped content of type multipart/alternative-------------- next part -------------- An embedded message was scrubbed... From: newsletter@evangelsociety.org Subject: Evangel Society Newsletter - February 2004 Date: Thu, 12 Feb 2004 00:59:38 -0500 Size: 19369 Url: http://mail.python.org/pipermail/spambayes/attachments/20040212/0ea424ef/attachment-0001.mht From fdupasquier at fxfundltd.com Thu Feb 12 11:19:05 2004 From: fdupasquier at fxfundltd.com (fdupasquier@fxfundltd.com) Date: Thu Feb 12 11:18:31 2004 Subject: [Spambayes] Outlook 2000 fails to recognize new mail Message-ID: First, great software, thank you. My outlook is very inconsistent with recognizing new emails and alerting me when they arrive. 80% of the time, Outlook wouldn't play the sound nor display the mail icon when I receive a new regular mail and from time to time it would play the sound and display the mail icon when the email I receive is filtered spam by your software. What is the solution? Thank you. Francois Du Pasquier From dmuller at cyberlogic.com Thu Feb 12 14:57:37 2004 From: dmuller at cyberlogic.com (dmuller@cyberlogic.com) Date: Thu Feb 12 14:57:46 2004 Subject: [Spambayes] Identified as ham, but sent to spam folder Message-ID: (My first attempt to post this problem was seriously hosed somewhere along the way. I had tried to include the offending e-mail and perhaps that caused the problem. Sorry. Anyway, here is my question without the attachment.) I'm using version 0.81 with Outlook 2002 SP-2 under Windows XP. My training ratio is about 1:1 and my filtering limits are at the default 15% and 90%. I am getting excellent results overall. However -- rarely -- Spambayes will identify an item as ham with a score of 0%, then move it to the spam folder anyway. This has happened only a couple of times, and each time it occurred when Outlook was starting up. That may be coincidental, of course. My particular setup does have the known problem of the last new spam item not getting moved when Outlook starts. Perhaps they are related. Any thoughts? Dan Muller From tameyer at ihug.co.nz Thu Feb 12 16:46:57 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 16:47:23 2004 Subject: [Spambayes] Minor inconvenience with SpamBayes new version In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD052@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467794F@its-xchg4.massey.ac.nz> > I've just installed the new version of the Outlook > add-in. No problem during install. Then I ran the > 'Filter messages' action on the inbox (which was > previously scanned by the previous version) and about > 50% of the message were moved to the 'Spam Suspects' > folders and a very small number were moved to the 'spam' > folder. Is this a one-time fluke or what? How long ago did you filter the inbox with 008.1? If it was some time ago, then this is probably just because you've done training since those messages were filtered, which effected their score. There are a few quite minor changes in the tokenizer with 0.9, but it seems highly unlikely that those would cause this sort of effect. If you look at the clues for the newly filtered mail, do they make sense? An example set of clues would really help here. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 12 16:53:12 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 16:53:33 2004 Subject: [Spambayes] Identified as ham, but sent to spam folder In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD107@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677950@its-xchg4.massey.ac.nz> > I'm using version 0.81 with Outlook 2002 SP-2 under > Windows XP. My training ratio is about 1:1 and my filtering > limits are at the default 15% and 90%. I am getting > excellent results overall. However -- rarely -- Spambayes > will identify an item as ham with a score of 0%, then move > it to the spam folder anyway. The latest example is below. > This has happened only a couple of times, and each time > it occurred when Outlook was starting up. That may be > coincidental, of course. My particular setup does have the > known problem of the last new spam item not getting moved > when Outlook starts. Perhaps they are related. This is also a known problem, and, unfortunately, one without a known solution. There's an open bug report about it: If you would, it would be great if you added details about your particular setup (Outlook, Windows & SpamBayes version, for example) to the tracker (if you monitor it, you'll also see when we manage to fix this). I suspect that you're right in that it's related to the other no score/0 score errors - at least they all seem very similar. (For reference, the bug reports about those are linked below). We'll hopefully sort this one out soon, although, AFAIK, it's not fixed in 1.0a9 (0.9). Sorry I don't have better news! =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 12 16:54:26 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 16:54:47 2004 Subject: [Spambayes] Outlook 2000 fails to recognize new mail In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD130@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677951@its-xchg4.massey.ac.nz> > My outlook is very inconsistent with recognizing new emails > and alerting me when they arrive. 80% of the time, Outlook > wouldn't play the sound nor display the mail icon when I > receive a new regular mail and from time to time it would > play the sound and display the mail icon when the email I > receive is filtered spam by your software. What is the solution? SpamBayes doesn't effect the sound being played, or the mail icon at all, so any odd behaviour is totally caused by Outlook (or some other program). For reasons that we don't, see FAQ 3.8: =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From fdupasquier at fxfundltd.com Thu Feb 12 17:08:35 2004 From: fdupasquier at fxfundltd.com (fdupasquier@fxfundltd.com) Date: Thu Feb 12 17:08:04 2004 Subject: [Spambayes] Outlook 2000 fails to recognize new mail In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304677951@its-xchg4.massey.ac.nz> Message-ID: Thank you for the quick answer. Outlook was functioning fine before I installed SpamBayes, hence my belief that SpamBayes is responsible for the fact that the email icon and the sound don't work much anymore. What would you recommend me to do? -----Original Message----- From: Tony Meyer [mailto:tameyer@ihug.co.nz] Sent: Thursday, February 12, 2004 4:54 PM To: fdupasquier@fxfundltd.com; spambayes@python.org Subject: RE: [Spambayes] Outlook 2000 fails to recognize new mail > My outlook is very inconsistent with recognizing new emails > and alerting me when they arrive. 80% of the time, Outlook > wouldn't play the sound nor display the mail icon when I > receive a new regular mail and from time to time it would > play the sound and display the mail icon when the email I > receive is filtered spam by your software. What is the solution? SpamBayes doesn't effect the sound being played, or the mail icon at all, so any odd behaviour is totally caused by Outlook (or some other program). For reasons that we don't, see FAQ 3.8: =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 12 17:13:15 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 17:13:38 2004 Subject: [Spambayes] Outlook 2000 fails to recognize new mail In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD1CE@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AA7@its-xchg4.massey.ac.nz> > Outlook was functioning fine before I installed SpamBayes, > hence my belief that SpamBayes is responsible for the fact > that the email icon and the sound don't work much anymore. Well, you can always test that by disabling spambayes (in the Manager dialog), or uninstalling it, and seeing if the problem goes away. (If you uninstall then reinstall you won't lose any training data or configuration data). If it does go away, then I suppose spambayes could be causing some sort of problem (but it certainly isn't deliberately altering the sound/icon) - maybe a conflict of some kind. In that case the only thing I can really suggest is to open a bug report with as many details as possible, and someone will try and figure it out at some point. (You could also see whether enabling/disabling background filtering makes any difference, and whether the 1.0a9 (0.9) release makes any difference). > What would you recommend me to do? If it doesn't go away, then it's not SpamBayes, and I don't know what could be causing it. The times where the sound is played and icon shown when mail is filtered as spam are correct - this should be happening. The times when it's not done - maybe checking the Outlook settings? Or doing a "Detect and Repair"? Or googling for similar cases? =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 12 19:48:57 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 19:49:21 2004 Subject: [Spambayes] SpamBayes and Hotmail In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD0F7@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677953@its-xchg4.massey.ac.nz> > For some reason spam attachments end up in IE's cache. This > would not be a problem if it wouldn't sound alarms and > whistles about viruses. Is this definitely spambayes? i.e. if you disable spambayes does it still happen? Does it happen if you just score messages, or only if you score & move them? When the messages are filtered, are they moved between Hotmail folders, or from a Hotmail folder to a local folder? (I'm not all that familiar with Hotmail, so I'm trying to figure out what it is that spambayes is doing that causes the caching). > Whenever Outlook starts SpamBayes tries to do something with > the Hotmail Inboxes, but those are not yet available (need > some time) and SpamBayes stops. It requires restarting > SpamBayes every time. This might be one of the bugs that is fixed in the latest 1.0a9 (0.9) release (the announcement will go out at some point today, but the installer is there on sourceforge already). It'd be worth trying this out (assuming you can get around problem #1), and if it doesn't fix it, then open a bug report, so that it will at some point (because you can't use Hotmail with Outlook in some situations (like if you're behind a proxy), I think Mark is the only one of the add-in developers that has a Hotmail account to test with). > It does not appear to filter messages from Hotmail by itself. > Tried background filtering, but does not work either. If you > explicitly ask for filter through toolbar it works fine. The log files should have clues about what is going wrong here. This might also be something that the latest version fixes, although I'm less sure about this one. > Is there a way to use SpamBayes on Hotmail without using > Outlook? I will tray to experiment with diferent POP and IMAP > to Hotmail gateways... If you can get the Hotmail mail via POP3 or IMAP, then you can certainly use sb_server or sb_imapfilter instead of the plug-in (with Outlook or any other mail client). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 12 19:52:22 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 19:52:43 2004 Subject: [Spambayes] double listing of all inbox emails In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130482FB6E@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677954@its-xchg4.massey.ac.nz> > I have windows xp pro operating system and use the > Outlook package. Since installing Spambayes 8.1, I have 2 of > everything in my inbox. This seems to happen after I delete > the first email message. I have looked at the trouble > shooting guide but did not see anything relating to this problem. So you only have one copy, then you delete ("delete" or "delete as spam"?) a message and there are suddenly two copies? Are you using Outlook with Exchange, IMAP, POP3, Hotmail, or some mixture? What happens if you delete again? Do you get a third copy? Do you have spambayes set to move or copy messages? (In the Filtering tab of the Manager dialog). > There are no log files on my system for Spambayes or should I > say I can not find them. They are in your temp directory, and if spambayes is running, there will be log files. The troubleshooting guide explains where to find them, but it's probably in: "c:\documents and settings\[username]\local settings\temp". =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 12 19:58:48 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 19:59:11 2004 Subject: [Spambayes] Advice In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304FC3C60@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677955@its-xchg4.massey.ac.nz> > Thanks for the response, Tony. Yes, I deactivated SpamBayes > and the problem went away. Then I uninstalled SpamBayes and > reinstalled it; old problem returned. I also read all the > "Known Bugs Reports" and the Forum Q&As on the SpamBayes > Website and could not find this problem discussed or any > resolution ideas. > > I did try Outlook's "Detect and Repair" function and that did > not solve the problem. When I shut down Outlook, re-opened > it, checked for new messages, and then shut Outlook down > again, I got the same dialogue box about restarting Outlook > and sending an error report to Microsoft. > > Any suggestions? You could try the new 1.0a9 (0.9) release - it's on the download page, and the release announcement will go out at some point today. Other than that, I'm pretty much out of ideas. The plug-in does very little when Outlook shuts down, and there shouldn't be anything that causes this sort of problem. You could open a bug report on sf, and maybe Mark or someone will be able to figure something else out. Sorry I can't be of more use... =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 12 20:01:05 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 20:01:31 2004 Subject: [Spambayes] Messages Fail to Filter In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304E718DC@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AA8@its-xchg4.massey.ac.nz> > I checked the troubleshooting guide. I have Filtering > checked. I have tried manually rerunning filtering and > it still does not work. Attached is the log files that > show an 'AssertionError'. There were 4 files with the > name 'spambayes?.txt' so I attached all 4. It looks like you're using a really old version of the plug-in. It would be well worth upgrading to the latest, 1.0a9 (0.9). I expect that that would fix the problem. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 12 20:07:11 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 20:07:33 2004 Subject: [Spambayes] Language distribution In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD04B@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677957@its-xchg4.massey.ac.nz> > However I discovered a certain weakness. Most of the incoming > spam is in English, while a large portion of my ham is > German. So when I get English ham, it's often classified not > near 0 but at about 0.20, while German spam (which is > currently evolving) is often not recognized as such. You could try tailoring your training. Try roughly the same amount of German ham as German spam, and roughly the same amount of English ham as English spam. Maybe even also the same amount of German mail as English mail. (Just a couple of hundred trained messages of ham & spam is often enough to get excellent results, so you don't need vast amounts of each). > I also have a similar ratio with HTML mails and non-HTML mails. This shouldn't be making any difference - spambayes trims the vast majority of HTML stuff. > Spambayes also has some problems distinguishing real MDA > error messages from those MyDoom stuff with the typical > attachments. Enough training should solve this - take a look at the clues for the messages and see if you can pick where it's going wrong - that often gives a clue as to how you can fix it. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From ta-meyer at ihug.co.nz Thu Feb 12 23:12:47 2004 From: ta-meyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 12 23:13:08 2004 Subject: [Spambayes] SpamBayes 1.0a9 (0.9) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AA9@its-xchg4.massey.ac.nz> The SpamBayes team is pleased to announce the latest release of SpamBayes - 1.0a9 (0.9). This is both a release of the source code *and* of binaries for Microsoft Windows users, including, for the first time, an installation program for all Windows users, not only those that use Microsoft Outlook. As such, we recommend that all SpamBayes users, including those currently running from source, and those using the Outlook plug-in, upgrade to this release. The Windows installation program will install either the Outlook add-in (for Microsoft Outlook users), or the SpamBayes server program (for all other mail client users, including Microsoft Outlook Express). All Windows users (including existing users of the Outlook add-in) are encouraged to use the installation program. If you wish to use the source-code version, you will also need to install Python - see README.txt in the source tree (or view it online) for more information. This release fixes a large number of bugs in the last release, and includes a few new features, too. For a detailed description of everything (well, everything we remember) that has changed since the last release, you can view our WHAT_IS_NEW.txt file, either online, or in the source distribution. Get it from the 'Download' page at http://www.spambayes.org/download.html Enjoy the new release and your spam-free mailbox :-) Tony (on behalf of the SpamBayes team) --- What is SpamBayes? --- The SpamBayes project is working on developing a Bayesian (of sorts) anti-spam filter (in Python), initially based on the work of Paul Graham. The major difference between this and other, similar projects is the emphasis on testing newer approaches to scoring messages. The project includes a number of different applications, all using the same core code, ranging from a plug-in for Microsoft Outlook, to a POP3 proxy, to various command-line tools. From 5octxmmve at health.com Thu Feb 12 21:30:12 2004 From: 5octxmmve at health.com (Yvonne Maddox) Date: Thu Feb 12 23:37:59 2004 Subject: [Spambayes] Just start by choosing a drug **** o wmv imxux cv Message-ID: <2q-0q6352$$ehs$a6fw1@kncz.aq> Hi there, It is time the world becomes aware that there is an easy and secure way to order medication. If you are well and not in need of anything, forgive me for intruding. However, if there is someone among your family or friends who is in need of medicines: please tell them about this e-mail. Take care from the Healing guru. For more information click below: http://www.allthemeds.biz/ada/index.htm Goodbye forever: click below: http://www.allthemeds.biz/cleanaway nvx asfc puu feqpikrqpasjk bdpgibalwgje qqqurxilr l t From brown at terralign.com Fri Feb 13 00:07:14 2004 From: brown at terralign.com (Brown, Jim) Date: Fri Feb 13 00:10:15 2004 Subject: [Spambayes] Most Recently Received Email Not Filtered Message-ID: The discussion taking place under the subject "Identified as ham, but sent to spam folder" referred to the problem where the most recently received email is not filtered. I did some fairly extensive (although amateurish) checking on this issue, and added the following to bug report #793830: Date: 2004-02-09 06:31Sender: brown2611Logged In: YES user_id=963745 I believe this is the same as bug #876281. As far as I can tell, having only started looking at Python and the SpamBayes Windows code yesterday, here is why this happens: 1. When SB starts, it (eventually) invokes BayesManager.EnsureOutlookFieldsForFolder() for each folder. This method finds the first MalItem in the specified folder and looks for the "Spam" UserProperty attached to that MailItem. If the MailItem does not have the "Spam" UserProperty, SB adds the UserProperty to the MailItem, causing the MailItem to have a Spam score of 0. 2. In trying to process missed messages, SB invokes MAPIMsgStoreFolder.GetNewUnscoredMessageGenerator(), which excludes any MailItem that already has the "Spam" UserProperty from processing. Voila. The item most recently received in a folder while SB was not running is automatically assigned a score of 0 and not filtered. Apparently, there isn't a clean way to detect if a folder has a particular user defined field. So, the only way to check for the presence of the user defined field in the folder is to checkthe items in the folder. SB (correctly) assumes that adding the UserProperty to the first MailItem will force the creation of the user defined field in the folder. However, this is not a benign act. Possible solutions: * Don't force the creation of the user defined field until SB has an actual score to store. However, I fear there may be a great number of places in the code that assume the Spam field already exists. For example, GetNewUnscoredMessageGenerator() certainly makes this assumption. The error resulting from the lack of the Spam field could be trapped, but I don't know the code well enough to find all the places where the absence of the field might be a problem. Nonetheless, this seems to be the correct solution to me. * EnsureOutlookFieldsForFolder() could check more than just the first MailItem in the folder. However, this doesn't avoid the problem if every message in the folder is a missed message. * If SB is going to force the creation of the Spam field, go ahead and filter the message. Aside from not being terribly clean, it's not clear to me that enough of the code has been initialized by this point to filter a message. * If SB is going to force the creation of the Spam field, initialize it to a value that is easily detected as unscored, for example -1. However, I'm not confident that the code doesn't depend on 0 <= Spam <= 100. In addition, I don't know how many places in the code would have to be changed to recognize this value. * Change the minimum score to 0.0001, or the like, and detect a score of 0 as an unscored message. Kludgey and one would still have to find all of the places where something with a ham-like score (or any score) is excluded from further filtering (e.g., addin.ProcessMessage()). * Since OutlookAddin.ProcessMissedMessages() only _seems_ to be invoked at startup, it could be modified to always process the first MailItem if the item is unread and has a score of exactly zero. Again, kludgey, but at least the kludge is confined to a single place. On the other hand, it looks to me like SB has already hooked into the folders it is watching at this point. if SB is filtering in the background, I suspect that any mail received before the "processing start delay" expires would bump the improperly flagged message out of the first position. Hmmm. * One could treat anything with a score of 0 as an unscored message, but it isn't really desirable to rescore all of those messages, since the majority of them have presumably already been filtered correctly. Any feedback from someone who actually knows the code? ---Jim -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040213/2d833710/attachment.html From c483qqyk at hotmail.com Thu Feb 12 20:27:53 2004 From: c483qqyk at hotmail.com (Sheila Comer) Date: Fri Feb 13 00:31:41 2004 Subject: [Spambayes] =?iso-8859-1?q?Paxil_=28GSK=29_30mg_90_tabs_SPECIAL_?= =?iso-8859-1?q?!!!_Xanax=AE_=26_Valium=AE_sult_k?= Message-ID: <4j1i6213p86-4p@nk5.f.u4j> Dear sir/madam Just to inform you that this company distributes its goods worldwide and has a fast and secure way of doing business. I hope that you will become our client. However, if you have no health problems; Good for you! Thank you for reading this e-mail. Take care from the Healing guru-more below: http://www.allthemeds.biz/ada/index.htm Farewell to you my dear: http://www.allthemeds.biz/cleanaway/ lv kzhhbd cvm ep rim From kennypitt at hotmail.com Fri Feb 13 09:19:54 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 13 09:21:03 2004 Subject: [Spambayes] SpamBayes and Hotmail In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304677953@its-xchg4.massey.ac.nz> Message-ID: Tony Meyer wrote: >> Is there a way to use SpamBayes on Hotmail without using >> Outlook? I will tray to experiment with diferent POP and IMAP >> to Hotmail gateways... > > If you can get the Hotmail mail via POP3 or IMAP, then you can > certainly use sb_server or sb_imapfilter instead of the plug-in (with > Outlook or any other mail client). The Outlook plugin works great with Hotmail accessed through a POP3 gateway. I use Hotmail Popper to provide POP3 access to Hotmail, and then set it up as a normal POP3 account in Outlook. Unfortunately, the latest version of Hotmail Popper is no longer freeware. -- Kenny Pitt From FCorley at cox.net Fri Feb 13 09:39:41 2004 From: FCorley at cox.net (Frank Corley) Date: Fri Feb 13 09:39:50 2004 Subject: [Spambayes] Outlook has become really slow since installing SB Message-ID: <005b01c3f23f$39cf8a70$6401a8c0@desktop> My Outlook has become very sluggish since installing Spambayes. Can anyone tell me if this might be caused by SB, or should I look elsewhere? I haven't changed anything else on the computer, and all my other apps runs just like they used to, that's why I'm suspicious. Frank -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040213/eefdc86c/attachment.html From brent at bjohnson.net Fri Feb 13 10:04:14 2004 From: brent at bjohnson.net (Brent L Johnson) Date: Fri Feb 13 10:04:30 2004 Subject: [Spambayes] Just start by choosing a drug **** o wmv imxux cv In-Reply-To: <2q-0q6352$$ehs$a6fw1@kncz.aq> Message-ID: <008b01c3f242$a7f94380$c801a8c0@torpedo> Is it me - or does anyone else find it idiotic that someone is sending spam to the SpamBayes mailing list? Luckily SpamBayes caught it :) - Brent (and yes Im deleting the email body so other's SpamBayes doesnt block this) From brown at terralign.com Fri Feb 13 11:12:56 2004 From: brown at terralign.com (Brown, Jim) Date: Fri Feb 13 11:15:55 2004 Subject: [Spambayes] RE: Most Recently Received Email Not Filtered Message-ID: Hmmm. OBE. After looking at the source code for the release announced today, I believe this problem is probably fixed. ---Jim -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040213/9f039e5c/attachment.html From ihlavats at tarantulaconsulting.com Fri Feb 13 12:17:24 2004 From: ihlavats at tarantulaconsulting.com (Ian Hlavats) Date: Fri Feb 13 12:16:28 2004 Subject: [Spambayes] Problem with POP3 Proxy Message-ID: <001101c3f255$42854cc0$9400a8c0@tarantula> I am using SpamBayes POP3 Proxy Version 0.4 (February 2004) (binary), with version 2.3.2+ (#49, Oct 23 2003, 15:50:06) [MSC v.1200 32 bit (Intel)] of Python; my operating system is Windows 5.0.2195.2 (Service Pack 4). I have trained 3 ham and 661 spam. The problem I am having is that I am unable to remove a previous hammie.db database because I can't find it and this is causing inconsistencies in the Web user interface and possibly in Spambayes' operations. Hi, the above technical details (re: training) are not entirely accurate and this is part of the problem I'm having. When I first installed Spambayes I tried training it on some spam (661 messages as indicated above) and then I tried to feed it a .dbx store containing more than ham 3000 messages (I had not yet read the docs regarding the 1:1 spam/ham training ratio). After more than one hour of waiting for it to process the ham, I gave up and shut down the server. I tried to read how to delete and rebuild the database but I could not find "hammie.db" anywhere on my computer so I could not delete the database. I reinstalled Spambayes numerous times and whenever I opened the Web interface it still showed my previous spam training stats (661 spam, 3 ham). Next I configured Spambayes to use a specific .db file (I gave it the full path this time) and it created the database file and showed my training stats at 0/0. This was fine, the slate was clean and I was ready to retrain the system. This time I fed Spambayes roughly 600 spam and 600 ham. It has been working great since then, but now whenever I change something else in the configuration settings and go back to the home page, it still shows my old spam/ham training numbers (661 spam, 3 ham)! After a short while, if I restart the proxy server it comes up with my current stats (the real ones that I've created since retraining Spambayes), but as you can see from above, it occasionally reverts to the initial training status. Is there any way for me to permamently remove any remnants from my first training database? Since I can't find the database I have no way to delete it. Aside from this I am quite impressed with your product. I would appreciate your help, Ian H. ihlavats@tarantulaconsulting.com -- 613-265-8251 Tarantula Consulting www.tarantulaconsulting.com -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040213/e018013f/attachment.html From kennypitt at hotmail.com Fri Feb 13 12:42:16 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 13 12:43:32 2004 Subject: [Spambayes] Problem with POP3 Proxy In-Reply-To: <001101c3f255$42854cc0$9400a8c0@tarantula> Message-ID: Since your are using the Windows binary, you should find all the data files for the proxy under "C:\Documents and Settings\username\Application Data\SpamBayes\Proxy". The training database will probably be named "statistics_database.db" instead of "hammie.db". You should certainly try deleting the old database and see what happens, but it's hard to guess why SpamBayes would be picking it up if you've specifically configured a different file so there could be a bug here somewhere. Before you try anything further, could you try to reproduce the problem and then collect your logfiles and send them to the list? You should find them in your temp directory (usually "\Documents and Settings\username\Local Settings\Temp"), and they will be named "SpamBayesServer?.log" where ? is a numeric digit. -- Kenny Pitt _____ From: spambayes-bounces+kennypitt=hotmail.com@python.org [mailto:spambayes-bounces+kennypitt=hotmail.com@python.org] On Behalf Of Ian Hlavats Sent: Friday, February 13, 2004 12:17 PM To: spambayes@python.org Subject: [Spambayes] Problem with POP3 Proxy I am using SpamBayes POP3 Proxy Version 0.4 (February 2004) (binary), with version 2.3.2+ (#49, Oct 23 2003, 15:50:06) [MSC v.1200 32 bit (Intel)] of Python; my operating system is Windows 5.0.2195.2 (Service Pack 4). I have trained 3 ham and 661 spam. The problem I am having is that I am unable to remove a previous hammie.db database because I can't find it and this is causing inconsistencies in the Web user interface and possibly in Spambayes' operations. Hi, the above technical details (re: training) are not entirely accurate and this is part of the problem I'm having. When I first installed Spambayes I tried training it on some spam (661 messages as indicated above) and then I tried to feed it a .dbx store containing more than ham 3000 messages (I had not yet read the docs regarding the 1:1 spam/ham training ratio). After more than one hour of waiting for it to process the ham, I gave up and shut down the server. I tried to read how to delete and rebuild the database but I could not find "hammie.db" anywhere on my computer so I could not delete the database. I reinstalled Spambayes numerous times and whenever I opened the Web interface it still showed my previous spam training stats (661 spam, 3 ham). Next I configured Spambayes to use a specific .db file (I gave it the full path this time) and it created the database file and showed my training stats at 0/0. This was fine, the slate was clean and I was ready to retrain the system. This time I fed Spambayes roughly 600 spam and 600 ham. It has been working great since then, but now whenever I change something else in the configuration settings and go back to the home page, it still shows my old spam/ham training numbers (661 spam, 3 ham)! After a short while, if I restart the proxy server it comes up with my current stats (the real ones that I've created since retraining Spambayes), but as you can see from above, it occasionally reverts to the initial training status. Is there any way for me to permamently remove any remnants from my first training database? Since I can't find the database I have no way to delete it. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040213/f57703e9/attachment.html From tim.one at comcast.net Fri Feb 13 13:02:08 2004 From: tim.one at comcast.net (Tim Peters) Date: Fri Feb 13 13:02:10 2004 Subject: [Spambayes] Outlook has become really slow since installing SB In-Reply-To: <005b01c3f23f$39cf8a70$6401a8c0@desktop> Message-ID: [Frank Corley] > My Outlook has become very sluggish since installing Spambayes. Can > anyone tell me if this might be caused by SB, or should I look > elsewhere? I haven't changed anything else on the computer, and all > my other apps runs just like they used to, that's why I'm suspicious. Outlook may certainly become sluggish while new email is arriving and SpamBayes is scoring it -- SpamBayes is doing a lot of work then to analyze the email. There should be no difference in Outlook speed when SpamBayes isn't scoring new messages (SpamBayes doesn't do anything then -- it just sits there, waiting for Outlook to tell it when new messages arrive). From scales at zko.dec.com Fri Feb 13 13:37:20 2004 From: scales at zko.dec.com (Webb Scales) Date: Fri Feb 13 13:37:26 2004 Subject: [Spambayes] Just start by choosing a drug **** o wmv imxux cv References: <008b01c3f242$a7f94380$c801a8c0@torpedo> Message-ID: <402D1960.87333FE@zko.dec.com> Brent L Johnson wrote: > Is it me - or does anyone else find it idiotic that > someone is sending spam to the SpamBayes mailing list? "Idiotic" isn't the term I would use. "Poetic"? "Humorous"? I don't know. :-) > Luckily SpamBayes caught it :) Not mine. :-( Time for more training, I guess. Webb -- ------------------------------------------------------------------------ Webb Scales Hewlett-Packard Company scales@zko.dec.com 110 Spit Brook Rd, ZKO2-3/N30 Voice: 603.884.2196, FAX: 603.884.0120 Nashua, NH 03062-2711 Someone who thinks logically provides a nice contrast to the real world. ------------------------------------------------------------------------ -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040213/124ad43e/attachment-0001.html From IHolt at SESC.com Fri Feb 13 14:17:08 2004 From: IHolt at SESC.com (Holt, Ian) Date: Fri Feb 13 14:06:08 2004 Subject: [Spambayes] Spambayes bug? Message-ID: Software: Server-side Exchange 5.5sp4 Windows NT 4sp6a Client-side Outlook XP(2002) Windows 2000sp4 Spambayes Outlook plugin 1.0a9 When tested on e-mail account of one of our domain administrators we found that Spambayes not only cleans his account, but then goes through the every e-mail account on the entire exchange server looking for spam. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040213/f1e61151/attachment.html From rpadilla at stichler.com Fri Feb 13 14:06:35 2004 From: rpadilla at stichler.com (Ricardo Padilla) Date: Fri Feb 13 14:07:44 2004 Subject: [Spambayes] Read the Troubleshooter and nothing.. Message-ID: <06E574DFC2EED511B2E900508BE7477F053A0D45@sdexchange.stichler.com> Nothing loads including the toolbars. I have Windows 2000 w/ Outlook 2000 in an Exchange 5.5 environment. I downloaded and installed version 1.0a9 of SpamBayes. Hope you guys know what to do. Ricardo Padilla Systems Administrator The Stichler Group, Inc. 9655 Granite Ridge Drive #400 San Diego CA 92123 858-565-4440 x400 | fax: 858-569-3433 http://www.stichler.com/ -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040213/6ef9a9ed/attachment.html From rpadilla at stichler.com Fri Feb 13 14:09:12 2004 From: rpadilla at stichler.com (Ricardo Padilla) Date: Fri Feb 13 14:10:10 2004 Subject: [Spambayes] FW: Read the Troubleshooter and nothing.. Message-ID: <06E574DFC2EED511B2E900508BE7477F053A0D46@sdexchange.stichler.com> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes2.log Type: application/octet-stream Size: 59 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040213/d4d05d44/spambayes2.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 1322 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040213/d4d05d44/spambayes1.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 1479 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040213/d4d05d44/spambayes3.obj From qub at qub.com Fri Feb 13 14:11:44 2004 From: qub at qub.com (Mikhail Yakoubov) Date: Fri Feb 13 14:11:48 2004 Subject: [Spambayes] SpamBayes 1.0a9 (0.9) References: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AA9@its-xchg4.massey.ac.nz> Message-ID: <402D2170.B9F15E2F@qub.com> Tony Meyer wrote: > The SpamBayes team is pleased to announce the latest release of SpamBayes - > 1.0a9 (0.9). First, thank you for the release. It really wrappes the problem we've discuss recently -- those of malformed multipart messages. Output reads Classifying Skipping unparseable message: multipart message with no defined boundary **************************. Classified 4 ham, 20 spam, and 3 unsure. So long, so good. But where the unparseable did gone? It's not in the Inbox, so probably to Unsure. Indeed, there is a message there, consisting of three lines in View Source: X-Spambayes-Classification: unsure X-Spambayes-Spam-Probability: 0.89 X-Spambayes-MailId: 1076698445 and nothing more, no subject, no multipart, nothing else. And it's like this in the raw mailfolder file also (Netscape 4.8). Guess it's not a big deal, as far as the malformed was a spam anyway, but probably that's (eating up the whole message and replacing it with custom Spambayes headers) not what you've intended. Best, Mike Yakoubov. From arv at vr-inc.com Fri Feb 13 15:09:43 2004 From: arv at vr-inc.com (Arv Weaver) Date: Fri Feb 13 15:10:26 2004 Subject: [Spambayes] Outlook XP Problem Message-ID: <000001c3f26d$53c21050$6500a8c0@DELLNOTEBOOK> I resently upgraded to Office XP. Since that time SpamBayes has not worked. The addin in not shown in Outlook when I check it. My emails are not being scored. I have followed the instructions in the Troubleshooting area. I have reinstalled SpamBayes and it still does not show. Can you help? Arv Weaver Senior Consultant Viable Resources, Inc. Phone: 206.383.1939 Fax: 419.844.8973 E-mail: HYPERLINK "mailto:arv@vr-inc.com"arv@vr-inc.com HYPERLINK "outbind://1-00000000C548ED131230DE44B13FDDC4E7E09D43047D2300/cid:image0 01.jpg@01C3F07F.B31F9490" --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.588 / Virus Database: 372 - Release Date: 2/13/2004 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040213/e4d980be/attachment.html From epeterso2 at yahoo.com Fri Feb 13 16:48:28 2004 From: epeterso2 at yahoo.com (Eric Peterson) Date: Fri Feb 13 16:49:05 2004 Subject: [Spambayes] Any way to watch for the header "X-Spam: True"? Message-ID: <20040213214828.36042.qmail@web60307.mail.yahoo.com> [I'm not a subscriber to the list ... if you response, please do so directly to epeterso2@yahoo.com. Thanks!] My company uses Brightmail to tag suspected spam coming through its firewall by adding the header "X-Spam: True" to the message (giving the user the freedom to parse on it). Spambayes does a better job than our Brightmail configuration -- I've observed that BM only catches about 70% of my incoming spam. However, I've noticed that when I ask SpamBayes to show the spam clues for a message that the "X-Spam: True" header never comes up. Many times false negatives that slip into my inbox do have the "X-Spam: True" header set, meaning that Brightmail caught it but SpamBayes didn't. I started to dig into the code to figure out some way to get SB to recognize that header as another token. However, I don't speak Pythonese (just Perl) and I'm not familiar with the SB architecture. Needless to say, I got lost quickly. Can anybody give me a pointer as to how to get SB to recognize "X-Spam: True" as a header line and add it to its database? Thanks ... Eric __________________________________ Do you Yahoo!? Yahoo! Finance: Get your refund fast by filing online. http://taxes.yahoo.com/filing.html From skip at pobox.com Fri Feb 13 17:06:55 2004 From: skip at pobox.com (Skip Montanaro) Date: Fri Feb 13 17:07:04 2004 Subject: [Spambayes] Any way to watch for the header "X-Spam: True"? In-Reply-To: <20040213214828.36042.qmail@web60307.mail.yahoo.com> References: <20040213214828.36042.qmail@web60307.mail.yahoo.com> Message-ID: <16429.19071.961927.48935@montanaro.dyndns.org> Eric> Can anybody give me a pointer as to how to get SB to recognize Eric> "X-Spam: True" as a header line and add it to its database? Try setting the basic_header_tokenize option (in the Tokenizer category) to True. Then take a look at basic_header_skip to make sure headers you don't want tokenized match one of the regular expressions there and that X-Spam doesn't match (perhaps a bit easier said than done given that your probably want to skip many/most x-.* headers). Skip From kennypitt at hotmail.com Fri Feb 13 17:34:05 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 13 17:35:04 2004 Subject: [Spambayes] Outlook XP Problem In-Reply-To: <000001c3f26d$53c21050$6500a8c0@DELLNOTEBOOK> Message-ID: Try going to Help -> About Microsoft Outlook and clicking on the Disabled Items button. SpamBayes may have been disabled during the upgrade (I think any "non-certified" plug-ins get disabled automatically when upgrading). If so, you should be able to reenable it from there. -- Kenny Pitt _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Arv Weaver Sent: Friday, February 13, 2004 3:10 PM To: spambayes@python.org Subject: [Spambayes] Outlook XP Problem I resently upgraded to Office XP. Since that time SpamBayes has not worked. The addin in not shown in Outlook when I check it. My emails are not being scored. I have followed the instructions in the Troubleshooting area. I have reinstalled SpamBayes and it still does not show. Can you help? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040213/acc446f6/attachment.html From mhammond at skippinet.com.au Fri Feb 13 23:23:15 2004 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri Feb 13 23:23:34 2004 Subject: [Spambayes] FW: Read the Troubleshooter and nothing.. In-Reply-To: <06E574DFC2EED511B2E900508BE7477F053A0D46@sdexchange.stichler.com> Message-ID: <188801c3f2b2$448a7150$0200a8c0@eden> *sigh* - this is a bug in SpamBayes. I will make a fix, but in the meantime, try setting up SpamBayes manually, rather than via the Wizard. Cancel the wizard, then use the "Training" and "Filtering" tabs to define the configuration, then enable SpamBayes from the 'General' tab. Hopefully that will avoid the error. I'm not sure what is special about your setup that triggers this bug. Do you have a strange .pst or unconfigured information store setup? If so, you could try removing that, then using the Wizard. Mark. -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Ricardo Padilla Sent: Saturday, 14 February 2004 6:09 AM To: 'spambayes@python.org' Subject: [Spambayes] FW: Read the Troubleshooter and nothing.. I forgot to attach the logs. Here ya go. Ricardo Padilla Systems Administrator The Stichler Group, Inc. 9655 Granite Ridge Drive #400 San Diego CA 92123 858-565-4440 x400 | fax: 858-569-3433 http://www.stichler.com/ -----Original Message----- From: Ricardo Padilla Sent: Friday, February 13, 2004 11:07 AM To: 'spambayes@python.org' Subject: Read the Troubleshooter and nothing.. Nothing loads including the toolbars. I have Windows 2000 w/ Outlook 2000 in an Exchange 5.5 environment. I downloaded and installed version 1.0a9 of SpamBayes. Hope you guys know what to do. Ricardo Padilla Systems Administrator The Stichler Group, Inc. 9655 Granite Ridge Drive #400 San Diego CA 92123 858-565-4440 x400 | fax: 858-569-3433 http://www.stichler.com/ -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040214/e23de32a/attachment-0001.html From mhammond at skippinet.com.au Fri Feb 13 23:25:37 2004 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri Feb 13 23:25:56 2004 Subject: [Spambayes] RE: Most Recently Received Email Not Filtered In-Reply-To: Message-ID: <189301c3f2b2$9a4c0680$0200a8c0@eden> I'm a bit behind on looking at the bugs, but I added similar comments to some other bug. I believe your diagnosis of the problem was correct, and this has indeed been fixed, simply by using a better/faster way of checking the field. Mark. -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Brown, Jim Sent: Saturday, 14 February 2004 3:13 AM To: spambayes@python.org Subject: [Spambayes] RE: Most Recently Received Email Not Filtered Hmmm. OBE. After looking at the source code for the release announced today, I believe this problem is probably fixed. ---Jim -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 3580 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040214/06558d02/winmail.bin From nowhere at cox.net Fri Feb 13 23:52:23 2004 From: nowhere at cox.net (Nowhere) Date: Fri Feb 13 23:52:18 2004 Subject: [Spambayes] Installing new Spambayes release Message-ID: <003401c3f2b6$55da37c0$0100a8c0@gandalf> Can I simply install the new release over my older one (one rev old)? Should I wait for the bug fix that was just mentioned? Thanks, Eric -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040213/1966b42e/attachment.html From mhammond at skippinet.com.au Sat Feb 14 08:42:17 2004 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sat Feb 14 08:42:38 2004 Subject: [Spambayes] Installing new Spambayes release In-Reply-To: <003401c3f2b6$55da37c0$0100a8c0@gandalf> Message-ID: <208801c3f300$5de76060$0200a8c0@eden> Go for it! This bug also existed in 0.8 (along with at least one other similar which I did fix). I doubt many people will hit it. Mark. -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Nowhere Sent: Saturday, 14 February 2004 3:52 PM To: spambayes@python.org Subject: [Spambayes] Installing new Spambayes release Can I simply install the new release over my older one (one rev old)? Should I wait for the bug fix that was just mentioned? Thanks, Eric -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040215/0376883c/attachment.html From brown at terralign.com Sat Feb 14 10:11:03 2004 From: brown at terralign.com (Brown, Jim) Date: Sat Feb 14 10:14:06 2004 Subject: [Spambayes] RE: Most Recently Received Email Not Filtered Message-ID: Mark, Thanks for the note. I can confirm that the bug is fixed, at least for my installation. I was unaware that you could check the existence of the field by looking at something other than an item in the folder. I was further unaware that creating the user property using a temporary item wouldn't simply delete the user property from the folder when the temporary item was deleted. That's my something new learned for the day, although it does make me wonder under what conditions a user property does get removed from a folder. :) The solution the SB team implemented is a good one, certainly better than the ones I (humbly) suggested. I'm still puzzled though as to why (other than effort) you wouldn't want to avoid having to ensure the presence of the field by trapping the errors the absence of the field might cause and dealing with the issue more contemporaneously. At a minimum this would avoid the theoretical problem of having the user property ripped out from underneath the code after SB had been initialized. OTOH, if it ain't broke... SpamBayes is absolutely the most freakin' awesome piece of open software available! I think the FCC should give you guys a commendation. ---Jim -----Original Message----- From: Mark Hammond [mailto:mhammond@skippinet.com.au] Sent: Friday, February 13, 2004 11:26 PM To: Brown, Jim; spambayes@python.org Subject: RE: [Spambayes] RE: Most Recently Received Email Not Filtered I'm a bit behind on looking at the bugs, but I added similar comments to some other bug. I believe your diagnosis of the problem was correct, and this has indeed been fixed, simply by using a better/faster way of checking the field. Mark. -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Brown, Jim Sent: Saturday, 14 February 2004 3:13 AM To: spambayes@python.org Subject: [Spambayes] RE: Most Recently Received Email Not Filtered Hmmm. OBE. After looking at the source code for the release announced today, I believe this problem is probably fixed. ---Jim -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040214/10409358/attachment-0001.html From bkc at murkworks.com Sat Feb 14 15:01:28 2004 From: bkc at murkworks.com (Brad Clements) Date: Sat Feb 14 14:59:23 2004 Subject: [Spambayes] Supporting new database type in classifier Message-ID: <402E37B3.21792.141F2702@localhost> I'm working on a new type of storage that requires closer integration with classifier _getclues and _add_msg, _remove_msg. For example, this code fragment in classifier._getclues: # The all-unigram scheme just scores the tokens as-is. A Set() # is used to weed out duplicates at high speed. clues = [] push = clues.append for word in Set(wordstream): tup = self._worddistanceget(word) if tup[0] >= mindist: push(tup) clues.sort() Would essentially be pushed into the database module. For efficiency, the database module must have the entire wordstream to work with. _worddistanceget could be passed into the database as a callback, or the code could be replicated at the database level. That is, _worddistanceget calls _wordinfoget AND performs calculations. I'd prefer a function that accepts the token info (nham, nspam) and does the calculations w/o being coupled to _wordinfoget. Overiding _wordinfoget in a subclass doesn't work for me, because that function only gets called with one word at a time. I could override _getclues, but then I'd have to recreate the bigram stuff which is quite a lot. So, my first question is, could the bigram stuff be structured as a 'filter' before _getclues (modifying the wordstream) and before _add and _remove_msg? Second, what's the best way to restructure classifier so that a storage subclass can deal with entire wordstreams in one lump if it so chooses? -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From bkc at murkworks.com Sat Feb 14 15:24:27 2004 From: bkc at murkworks.com (Brad Clements) Date: Sat Feb 14 15:22:16 2004 Subject: [Spambayes] Frequency distribution for wordinfo counts? Message-ID: <402E3D16.2902.14343380@localhost> I'd like to get feedback from folks on the distribution of nham and nspam counts in their wordinfo databases. For example, I used sb_dbexpimp to dump my dbm based storage, then loaded it into excel and did a histogram on nham and nspam. Here's my nspam distribution BINS Frequency Percent Total Cumulative % BINS Frequency Cumulative % 0 13272 39.36% 39.36% 1 14834 43.99% 1 14834 43.99% 83.36% 0 13272 83.36% 2 2534 7.52% 90.87% 2 2534 90.87% 3 957 2.84% 93.71% 3 957 93.71% 4 535 1.59% 95.30% 10 655 95.65% 5 310 0.92% 96.22% 4 535 97.24% 10 655 1.94% 98.16% 20 323 98.20% 20 323 0.96% 99.12% 5 310 99.12% 40 166 0.49% 99.61% 40 166 99.61% 80 79 0.23% 99.84% 80 79 99.84% 160 23 0.07% 99.91% 160 23 99.91% 320 23 0.07% 99.98% 320 23 99.98% 640 7 0.02% 100.00% 640 7 100.00% More 0 0.00% 100.00% More 0 100.00% So, 44% of the spam tokens are hapaxes, for example. Anyway, what I'm interested in is the number of tokens whose nspam or nham count is greater than 255 vs the total number of tokens and ham and spam count. In my case, only about 30 tokens (out of 33718) have either an nham or nspam count > 255. I've trained on 410 spam and 133 ham. Can anyone else provide some numbers for me? I'm also interested in the total byte size and type of storage. In my case, the DB storage of 33718 tokens takes 1,318,912 bytes. -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From mhammond at skippinet.com.au Sat Feb 14 18:37:10 2004 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sat Feb 14 18:37:29 2004 Subject: [Spambayes] RE: Most Recently Received Email Not Filtered In-Reply-To: Message-ID: <277601c3f353$77e41fa0$0200a8c0@eden> > I'm still puzzled though as to why (other than effort) > you wouldn't want to avoid having to ensure the presence > of the field by trapping the errors the absence of the > field might cause and dealing with the issue more contemporaneously. Well, I learnt something new today too - what 'contemporaneously' means :) We don't use the field at all. We have no need for the field. The field is created purely to make it easier for the user to display the 'Spam' column in their Outlook view. Without this code, each user would be forced to jump through hoops in the 'Field Chooser' to get this to work. > SpamBayes is absolutely the most freakin' awesome piece of open software > available! I think the FCC should give you guys a commendation. FCC? Isn't that some American thing? Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2104 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040215/07b9703e/winmail.bin From brown at terralign.com Sat Feb 14 20:07:32 2004 From: brown at terralign.com (Brown, Jim) Date: Sat Feb 14 20:10:28 2004 Subject: [Spambayes] RE: Most Recently Received Email Not Filtered Message-ID: Mark Hammond wrote: > We don't use the field at all. Ah, but SpamBayes does use the Spam field. :) It's used in GetNewUnscoredMessageGenerator() to try to avoid processing a message twice. I thought the reason SB did the Ensure was to avoid the error that might be caused by doing a Restrict with a user property that didn't exist. But, the only place I knew for sure that this was done was in the new unscored message generator. If that's the only place it's done, why Ensure at all? [Apologies offered. I'm insatiably curious about these things from an academic standpoint.] > > > SpamBayes is absolutely the most freakin' awesome piece of open software > > available! I think the FCC should give you guys a commendation. > > FCC? Isn't that some American thing? > Well, I guess I _could_ try to put in a good word at Ofcom instead... ;) ---Jim From tameyer at ihug.co.nz Sat Feb 14 23:19:16 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 14 23:20:38 2004 Subject: [Spambayes] Spambayes bug? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD3C8@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AAC@its-xchg4.massey.ac.nz> > Spambayes Outlook plugin 1.0a9 > When tested on e-mail account of one of our domain > administrators we found that Spambayes not only cleans > his account, but then goes through the every e-mail > account on the entire exchange server looking for spam. What folders does he have SpamBayes set up to filter? Check that only those that are meant to be there are listed, and be careful about the "include subfolders" checkbox. (This is on the Filtering tab of the Manager dialog). Otherwise, the log file should have useful information - at least it should say whether it is deliberating filtering all those folders. SpamBayes->Help->Troubleshooting Guide explains where to find the log files. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 14 20:54:47 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 14 23:23:42 2004 Subject: [Spambayes] Frequency distribution for wordinfo counts? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD5D3@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677960@its-xchg4.massey.ac.nz> > I'd like to get feedback from folks on the distribution of > nham and nspam counts in their wordinfo databases. > > For example, I used sb_dbexpimp to dump my dbm based storage, > then loaded it into excel and did a histogram on nham and nspam. > > Here's my nspam distribution > > BINS Frequency Percent Total Cumulative % BINS > Frequency Cumulative % [...] I'm happy to give you data for my setup, but I'm lazy . Would you mind sending me/the list a copy of the Excel formulae for these? =Tony Meyer From tameyer at ihug.co.nz Sat Feb 14 20:51:49 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 14 23:23:52 2004 Subject: [Spambayes] Installing new Spambayes release In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD545@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467795F@its-xchg4.massey.ac.nz> But note that you should probably uninstall 0.8 first, then install 0.9 (you won't lose any configuration or training data). See the release notes for more: =Tony Meyer -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Mark Hammond Sent: Sunday, 15 February 2004 2:42 a.m. To: 'Nowhere'; spambayes@python.org Subject: RE: [Spambayes] Installing new Spambayes release Go for it! This bug also existed in 0.8 (along with at least one other similar which I did fix). I doubt many people will hit it. Mark. -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Nowhere Sent: Saturday, 14 February 2004 3:52 PM To: spambayes@python.org Subject: [Spambayes] Installing new Spambayes release Can I simply install the new release over my older one (one rev old)? Should I wait for the bug fix that was just mentioned? Thanks, Eric -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040215/1df57a0b/attachment.html From tim.one at comcast.net Sat Feb 14 23:25:28 2004 From: tim.one at comcast.net (Tim Peters) Date: Sat Feb 14 23:25:28 2004 Subject: [Spambayes] Supporting new database type in classifier In-Reply-To: <402E37B3.21792.141F2702@localhost> Message-ID: [Brad Clements] > I'm working on a new type of storage that requires closer > integration with classifier _getclues and _add_msg, _remove_msg. You'll probably get better responses on the spambayes-dev list. > For example, this code fragment in classifier._getclues: > > # The all-unigram scheme just scores the tokens as-is. A > Set() # is used to weed out duplicates at high speed. > clues = [] > push = clues.append > for word in Set(wordstream): > tup = self._worddistanceget(word) > if tup[0] >= mindist: > push(tup) > clues.sort() > > Would essentially be pushed into the database module. For > efficiency, the database module must have the entire wordstream > to work with. I encourage you to work on a branch for now -- since most people drop most ideas after a few weeks at most, I'm opposed to warping this part of the code to cater to something as unlikely to be seen again as a non-random-access database model. If you work on a branch and demonstrate astonishing results, great, then we'll junk all other storages and adopt yours . > _worddistanceget could be passed into the database as a callback, > or the code could be replicated at the database level. That is, > _worddistanceget calls _wordinfoget AND performs calculations. I'd > prefer a function that accepts the token info (nham, nspam) > and does the calculations w/o being coupled to _wordinfoget. > > Overiding _wordinfoget in a subclass doesn't work for me, because > that function only gets called with one word at a time. > > I could override _getclues, but then I'd have to recreate the > bigram stuff which is quite a lot. It's less than 30 lines of code (half of it is comments). > So, my first question is, could the bigram stuff be structured as a > 'filter' before _getclues (modifying the wordstream) and before > _add and _remove_msg? The bigram stuff is already a filter before _add and _remove. It could also be done as a filter before _getclues, but not pleasantly. > Second, what's the best way to restructure classifier so that a > storage subclass can deal with entire wordstreams in one lump if > it so chooses? On a branch -- prove this is worth doing first, and don't worry about doing it cleanly before that succeeds. From tameyer at ihug.co.nz Sat Feb 14 23:48:17 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 14 23:48:44 2004 Subject: [Spambayes] SpamBayes 1.0a9 (0.9) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD3CC@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AAD@its-xchg4.massey.ac.nz> > It really wrappes the problem we've discuss recently -- those of > malformed multipart messages. [...] > But where the unparseable did gone? It's not in the > Inbox, so probably to Unsure. Indeed, there is a message there, > consisting of three lines in View Source: [...] > and nothing more, no subject, no multipart, nothing else. And > it's like this in the raw mailfolder file also (Netscape 4.8). > > Guess it's not a big deal, as far as the malformed was a spam anyway, > but probably that's (eating up the whole message and replacing it with > custom Spambayes headers) not what you've intended. Yikes! Sorry - I should have paid more attention when that fix was checked in, and tested it to see if it would work. It handles the exception and keeps going, printing out the nice error message, but then it returns where it shouldn't, and keeps on going, using a blank message. I'm pretty sure that this will inflate the ham/spam count in the db and will result in the spambayes-header only messages which are almost certain to be unsure. Note that unless something is doing an expunge on your imap folders the original message is still there, it's just marked for deletion, so you can get it back. I'll put a warning about this in the "known issues" section of the FAQ, too. I'm working up a proper solution to the problem, and will check it into CVS shortly. If all goes to plan, the break between this release and the next will be much shorter than the break between 1.0a7 and 1.0a9, so it should be out soon. Otherwise, once it's in anonymous CVS, you can grab sb_imapfilter.py v1.26, drop it in place of your existing sb_imapfilter.py script, and all will be well. Apologies for this. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tim.one at comcast.net Sun Feb 15 00:12:08 2004 From: tim.one at comcast.net (Tim Peters) Date: Sun Feb 15 00:12:12 2004 Subject: [Spambayes] Frequency distribution for wordinfo counts? In-Reply-To: <402E3D16.2902.14343380@localhost> Message-ID: [Brad Clements] > I'd like to get feedback from folks on the distribution of nham and > nspam counts in their wordinfo databases. > > For example, I used sb_dbexpimp to dump my dbm based storage, then > loaded it into excel and did a histogram on nham and nspam. > > ... > Anyway, what I'm interested in is the number of tokens whose nspam > or nham count is greater than 255 vs the total number of tokens and > ham and spam count. Here's mine today; I'm using bigrams: nham 674 nspam 621 ham counts value #times cumm % cumm% 0 107422 107422 47.24 47.24 1 103431 210853 45.49 92.73 2 8304 219157 3.65 96.38 3 2797 221954 1.23 97.61 4 1508 223462 0.66 98.28 5 852 224314 0.37 98.65 6 513 224827 0.23 98.88 7 359 225186 0.16 99.03 8 259 225445 0.11 99.15 9 203 225648 0.09 99.24 10 192 225840 0.08 99.32 11 117 225957 0.05 99.37 12 139 226096 0.06 99.43 13 90 226186 0.04 99.47 14 90 226276 0.04 99.51 15 91 226367 0.04 99.55 16 77 226444 0.03 99.59 17 66 226510 0.03 99.62 18 63 226573 0.03 99.64 19 52 226625 0.02 99.67 20 30 226655 0.01 99.68 21 34 226689 0.01 99.70 22 39 226728 0.02 99.71 23 31 226759 0.01 99.73 24 30 226789 0.01 99.74 25 25 226814 0.01 99.75 26 16 226830 0.01 99.76 27 20 226850 0.01 99.77 28 20 226870 0.01 99.77 29 22 226892 0.01 99.78 30 18 226910 0.01 99.79 31 14 226924 0.01 99.80 32 16 226940 0.01 99.81 33 16 226956 0.01 99.81 34 13 226969 0.01 99.82 35 12 226981 0.01 99.82 36 9 226990 0.00 99.83 37 9 226999 0.00 99.83 38 9 227008 0.00 99.84 39 12 227020 0.01 99.84 40 6 227026 0.00 99.84 41 9 227035 0.00 99.85 42 7 227042 0.00 99.85 43 2 227044 0.00 99.85 44 5 227049 0.00 99.85 45 4 227053 0.00 99.86 46 8 227061 0.00 99.86 47 5 227066 0.00 99.86 48 8 227074 0.00 99.86 49 5 227079 0.00 99.87 50 4 227083 0.00 99.87 51 5 227088 0.00 99.87 52 9 227097 0.00 99.87 53 6 227103 0.00 99.88 54 5 227108 0.00 99.88 55 8 227116 0.00 99.88 56 4 227120 0.00 99.88 58 1 227121 0.00 99.89 59 5 227126 0.00 99.89 61 4 227130 0.00 99.89 62 2 227132 0.00 99.89 63 5 227137 0.00 99.89 64 3 227140 0.00 99.89 65 2 227142 0.00 99.89 66 1 227143 0.00 99.89 67 1 227144 0.00 99.90 68 3 227147 0.00 99.90 69 2 227149 0.00 99.90 70 3 227152 0.00 99.90 71 3 227155 0.00 99.90 72 2 227157 0.00 99.90 73 3 227160 0.00 99.90 74 4 227164 0.00 99.90 75 1 227165 0.00 99.90 76 3 227168 0.00 99.91 77 3 227171 0.00 99.91 78 2 227173 0.00 99.91 79 1 227174 0.00 99.91 80 3 227177 0.00 99.91 81 2 227179 0.00 99.91 82 4 227183 0.00 99.91 83 5 227188 0.00 99.91 84 2 227190 0.00 99.92 85 4 227194 0.00 99.92 87 2 227196 0.00 99.92 88 4 227200 0.00 99.92 89 2 227202 0.00 99.92 90 3 227205 0.00 99.92 91 3 227208 0.00 99.92 92 2 227210 0.00 99.92 93 1 227211 0.00 99.92 94 2 227213 0.00 99.93 95 3 227216 0.00 99.93 96 1 227217 0.00 99.93 98 1 227218 0.00 99.93 100 8 227226 0.00 99.93 101 2 227228 0.00 99.93 102 1 227229 0.00 99.93 103 3 227232 0.00 99.93 104 1 227233 0.00 99.93 105 3 227236 0.00 99.94 107 1 227237 0.00 99.94 108 2 227239 0.00 99.94 109 1 227240 0.00 99.94 111 1 227241 0.00 99.94 114 2 227243 0.00 99.94 115 5 227248 0.00 99.94 118 2 227250 0.00 99.94 120 3 227253 0.00 99.94 123 2 227255 0.00 99.94 125 1 227256 0.00 99.94 126 1 227257 0.00 99.95 127 1 227258 0.00 99.95 129 1 227259 0.00 99.95 132 2 227261 0.00 99.95 133 1 227262 0.00 99.95 135 1 227263 0.00 99.95 138 1 227264 0.00 99.95 139 1 227265 0.00 99.95 140 1 227266 0.00 99.95 142 3 227269 0.00 99.95 144 1 227270 0.00 99.95 149 4 227274 0.00 99.95 153 1 227275 0.00 99.95 155 1 227276 0.00 99.95 156 2 227278 0.00 99.95 157 1 227279 0.00 99.95 158 2 227281 0.00 99.96 160 1 227282 0.00 99.96 163 2 227284 0.00 99.96 165 1 227285 0.00 99.96 166 1 227286 0.00 99.96 179 1 227287 0.00 99.96 181 1 227288 0.00 99.96 185 2 227290 0.00 99.96 192 2 227292 0.00 99.96 195 3 227295 0.00 99.96 202 3 227298 0.00 99.96 203 1 227299 0.00 99.96 204 1 227300 0.00 99.96 211 1 227301 0.00 99.96 213 1 227302 0.00 99.96 219 1 227303 0.00 99.97 225 1 227304 0.00 99.97 227 1 227305 0.00 99.97 232 1 227306 0.00 99.97 237 1 227307 0.00 99.97 238 1 227308 0.00 99.97 246 1 227309 0.00 99.97 252 1 227310 0.00 99.97 257 1 227311 0.00 99.97 260 1 227312 0.00 99.97 269 1 227313 0.00 99.97 270 2 227315 0.00 99.97 272 1 227316 0.00 99.97 273 1 227317 0.00 99.97 274 1 227318 0.00 99.97 275 2 227320 0.00 99.97 279 1 227321 0.00 99.97 286 1 227322 0.00 99.97 308 1 227323 0.00 99.97 314 1 227324 0.00 99.97 320 2 227326 0.00 99.98 321 1 227327 0.00 99.98 322 1 227328 0.00 99.98 329 1 227329 0.00 99.98 345 1 227330 0.00 99.98 350 1 227331 0.00 99.98 352 1 227332 0.00 99.98 353 1 227333 0.00 99.98 360 4 227337 0.00 99.98 366 1 227338 0.00 99.98 375 2 227340 0.00 99.98 380 1 227341 0.00 99.98 382 1 227342 0.00 99.98 389 1 227343 0.00 99.98 398 1 227344 0.00 99.98 401 7 227351 0.00 99.99 409 1 227352 0.00 99.99 424 1 227353 0.00 99.99 446 9 227362 0.00 99.99 450 2 227364 0.00 99.99 456 1 227365 0.00 99.99 465 1 227366 0.00 99.99 466 1 227367 0.00 99.99 493 2 227369 0.00 99.99 515 1 227370 0.00 99.99 519 1 227371 0.00 100.00 542 1 227372 0.00 100.00 545 1 227373 0.00 100.00 562 1 227374 0.00 100.00 573 1 227375 0.00 100.00 583 1 227376 0.00 100.00 621 1 227377 0.00 100.00 673 2 227379 0.00 100.00 674 3 227382 0.00 100.00 spam counts value #times cumm % cumm% 0 104332 104332 45.88 45.88 1 108911 213243 47.90 93.78 2 7225 220468 3.18 96.96 3 2368 222836 1.04 98.00 4 1190 224026 0.52 98.52 5 692 224718 0.30 98.83 6 486 225204 0.21 99.04 7 305 225509 0.13 99.18 8 280 225789 0.12 99.30 9 183 225972 0.08 99.38 10 152 226124 0.07 99.45 11 127 226251 0.06 99.50 12 76 226327 0.03 99.54 13 73 226400 0.03 99.57 14 76 226476 0.03 99.60 15 74 226550 0.03 99.63 16 45 226595 0.02 99.65 17 58 226653 0.03 99.68 18 52 226705 0.02 99.70 19 38 226743 0.02 99.72 20 42 226785 0.02 99.74 21 26 226811 0.01 99.75 22 20 226831 0.01 99.76 23 35 226866 0.02 99.77 24 23 226889 0.01 99.78 25 20 226909 0.01 99.79 26 20 226929 0.01 99.80 27 24 226953 0.01 99.81 28 13 226966 0.01 99.82 29 14 226980 0.01 99.82 30 6 226986 0.00 99.83 31 11 226997 0.00 99.83 32 19 227016 0.01 99.84 33 9 227025 0.00 99.84 34 6 227031 0.00 99.85 35 11 227042 0.00 99.85 36 9 227051 0.00 99.85 37 10 227061 0.00 99.86 38 5 227066 0.00 99.86 39 5 227071 0.00 99.86 40 3 227074 0.00 99.86 41 7 227081 0.00 99.87 42 10 227091 0.00 99.87 43 5 227096 0.00 99.87 44 7 227103 0.00 99.88 45 7 227110 0.00 99.88 46 1 227111 0.00 99.88 47 4 227115 0.00 99.88 48 6 227121 0.00 99.89 49 4 227125 0.00 99.89 50 5 227130 0.00 99.89 51 1 227131 0.00 99.89 52 2 227133 0.00 99.89 53 1 227134 0.00 99.89 54 2 227136 0.00 99.89 55 4 227140 0.00 99.89 56 4 227144 0.00 99.90 57 5 227149 0.00 99.90 58 5 227154 0.00 99.90 59 8 227162 0.00 99.90 60 3 227165 0.00 99.90 61 2 227167 0.00 99.91 62 4 227171 0.00 99.91 63 5 227176 0.00 99.91 64 2 227178 0.00 99.91 65 2 227180 0.00 99.91 66 2 227182 0.00 99.91 68 5 227187 0.00 99.91 69 1 227188 0.00 99.91 70 1 227189 0.00 99.92 71 1 227190 0.00 99.92 72 4 227194 0.00 99.92 74 2 227196 0.00 99.92 75 4 227200 0.00 99.92 76 1 227201 0.00 99.92 77 3 227204 0.00 99.92 78 2 227206 0.00 99.92 79 5 227211 0.00 99.92 80 4 227215 0.00 99.93 81 4 227219 0.00 99.93 84 1 227220 0.00 99.93 85 3 227223 0.00 99.93 86 1 227224 0.00 99.93 87 3 227227 0.00 99.93 88 3 227230 0.00 99.93 89 3 227233 0.00 99.93 90 3 227236 0.00 99.94 91 1 227237 0.00 99.94 92 1 227238 0.00 99.94 93 2 227240 0.00 99.94 94 1 227241 0.00 99.94 95 2 227243 0.00 99.94 97 2 227245 0.00 99.94 98 1 227246 0.00 99.94 100 1 227247 0.00 99.94 101 3 227250 0.00 99.94 102 2 227252 0.00 99.94 103 1 227253 0.00 99.94 104 2 227255 0.00 99.94 105 3 227258 0.00 99.95 107 4 227262 0.00 99.95 109 2 227264 0.00 99.95 111 1 227265 0.00 99.95 113 2 227267 0.00 99.95 115 1 227268 0.00 99.95 120 1 227269 0.00 99.95 121 1 227270 0.00 99.95 123 2 227272 0.00 99.95 126 1 227273 0.00 99.95 128 1 227274 0.00 99.95 130 1 227275 0.00 99.95 131 1 227276 0.00 99.95 132 1 227277 0.00 99.95 138 3 227280 0.00 99.96 140 2 227282 0.00 99.96 141 2 227284 0.00 99.96 145 1 227285 0.00 99.96 147 1 227286 0.00 99.96 148 3 227289 0.00 99.96 151 1 227290 0.00 99.96 152 4 227294 0.00 99.96 154 1 227295 0.00 99.96 157 1 227296 0.00 99.96 158 1 227297 0.00 99.96 159 1 227298 0.00 99.96 163 2 227300 0.00 99.96 164 1 227301 0.00 99.96 167 7 227308 0.00 99.97 176 11 227319 0.00 99.97 178 1 227320 0.00 99.97 181 1 227321 0.00 99.97 183 1 227322 0.00 99.97 185 1 227323 0.00 99.97 187 1 227324 0.00 99.97 191 2 227326 0.00 99.98 194 2 227328 0.00 99.98 198 1 227329 0.00 99.98 205 1 227330 0.00 99.98 216 1 227331 0.00 99.98 218 1 227332 0.00 99.98 234 1 227333 0.00 99.98 236 1 227334 0.00 99.98 246 1 227335 0.00 99.98 259 11 227346 0.00 99.98 261 1 227347 0.00 99.98 263 1 227348 0.00 99.99 268 1 227349 0.00 99.99 288 2 227351 0.00 99.99 291 1 227352 0.00 99.99 296 1 227353 0.00 99.99 298 2 227355 0.00 99.99 301 1 227356 0.00 99.99 308 1 227357 0.00 99.99 322 1 227358 0.00 99.99 328 1 227359 0.00 99.99 331 1 227360 0.00 99.99 333 1 227361 0.00 99.99 355 1 227362 0.00 99.99 364 1 227363 0.00 99.99 381 1 227364 0.00 99.99 394 2 227366 0.00 99.99 398 2 227368 0.00 99.99 418 1 227369 0.00 99.99 420 1 227370 0.00 99.99 434 1 227371 0.00 100.00 474 1 227372 0.00 100.00 488 1 227373 0.00 100.00 517 1 227374 0.00 100.00 562 1 227375 0.00 100.00 566 1 227376 0.00 100.00 571 1 227377 0.00 100.00 606 1 227378 0.00 100.00 615 1 227379 0.00 100.00 618 1 227380 0.00 100.00 620 1 227381 0.00 100.00 621 1 227382 0.00 100.00 Database is Berkeley, disk size 20,062,208 bytes. Ironically , the plain-text db export file is a third that size. [Tony Meyer] > I'm happy to give you data for my setup, but I'm lazy . Would > you mind sending me/the list a copy of the Excel formulae for these? Here's what I used: f = file('/code/spambayes/db.txt') # change to your export file nham, nspam = map(int, f.readline().split(',')[:-1]) print 'nham', nham, 'nspam', nspam hamcounts = [] spamcounts = [] for line in f: h, s = map(int, line.split('`')[1:3]) hamcounts.append(h) spamcounts.append(s) def hist(tag, data): count = {} for x in data: count[x] = count.get(x, 0) + 1 totalcount = sum(count.itervalues()) sofar = 0 counts = count.items() counts.sort() print tag, "counts" print "value #times cumm % cumm%" for value, count in counts: sofar += count print "%6d %6d %6d %6.2f %6.2f" % ( value, count, sofar, count * 1e2 / totalcount, sofar * 1e2 / totalcount) hist("ham", hamcounts) hist("spam", spamcounts) From jlockwood at houston.rr.com Sun Feb 15 01:28:00 2004 From: jlockwood at houston.rr.com (Jim Lopckwood) Date: Sun Feb 15 01:28:04 2004 Subject: [Spambayes] Problem with Spambayes installation Message-ID: I cannot install spambayes. When I try to install, I get the following message during installation: C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DIRegisterServer failed code 0x00000000 My choices are to abort, retry, or ignore. Retry does not work and neither does ignore. Any help is appreciated. Jim Lockwood Cell: 281-381-1081 Home: 281-338-2975 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040215/619c6bcd/attachment.html From kurtbuff at spro.net Sun Feb 15 01:29:13 2004 From: kurtbuff at spro.net (Kurt) Date: Sun Feb 15 01:29:19 2004 Subject: [Spambayes] SpamBayes 1.0a9 (0.9) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AA9@its-xchg4.massey.ac.nz> Message-ID: <001e01c3f38d$07f45fe0$6e57a8c0@bfgapollo1> Thanks! I've dropped the Outlook plugin, and am now using the POP3/SMTP proxy. Took me a few minutes to figure out (mostly without reading the docs) but I'm liking this a lot! Now I really have something I can put in place for all of my friends who don't have Outlook. Kurt -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Tony Meyer Sent: Thursday, February 12, 2004 20:13 To: spambayes-announce@python.org; python-announce@python.org Cc: spambayes@python.org Subject: [Spambayes] SpamBayes 1.0a9 (0.9) The SpamBayes team is pleased to announce the latest release of SpamBayes - 1.0a9 (0.9). This is both a release of the source code *and* of binaries for Microsoft Windows users, including, for the first time, an installation program for all Windows users, not only those that use Microsoft Outlook. As such, we recommend that all SpamBayes users, including those currently running from source, and those using the Outlook plug-in, upgrade to this release. The Windows installation program will install either the Outlook add-in (for Microsoft Outlook users), or the SpamBayes server program (for all other mail client users, including Microsoft Outlook Express). All Windows users (including existing users of the Outlook add-in) are encouraged to use the installation program. If you wish to use the source-code version, you will also need to install Python - see README.txt in the source tree (or view it online) for more information. This release fixes a large number of bugs in the last release, and includes a few new features, too. For a detailed description of everything (well, everything we remember) that has changed since the last release, you can view our WHAT_IS_NEW.txt file, either online, or in the source distribution. Get it from the 'Download' page at http://www.spambayes.org/download.html Enjoy the new release and your spam-free mailbox :-) Tony (on behalf of the SpamBayes team) --- What is SpamBayes? --- The SpamBayes project is working on developing a Bayesian (of sorts) anti-spam filter (in Python), initially based on the work of Paul Graham. The major difference between this and other, similar projects is the emphasis on testing newer approaches to scoring messages. The project includes a number of different applications, all using the same core code, ranging from a plug-in for Microsoft Outlook, to a POP3 proxy, to various command-line tools. _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From robert at interimhc.com Sun Feb 15 12:43:10 2004 From: robert at interimhc.com (Robert Seawright) Date: Sun Feb 15 12:43:18 2004 Subject: [Spambayes] Question Message-ID: <000801c3f3eb$2de607e0$6501a8c0@Robert> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 145 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040215/e80a0a64/attachment-0001.gif From brent at bjohnson.net Sun Feb 15 14:17:02 2004 From: brent at bjohnson.net (Brent L Johnson) Date: Sun Feb 15 14:17:42 2004 Subject: [Spambayes] Question In-Reply-To: <000801c3f3eb$2de607e0$6501a8c0@Robert> Message-ID: <026401c3f3f8$4de247e0$c801a8c0@torpedo> I've noticed similar behavior - but only occasionally. I selected the messages, clicked "Delete as Spam" and nothing happens. I restarted outlook and it was fine. - Brent -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Robert Seawright Sent: Sunday, February 15, 2004 12:43 PM To: spambayes@python.org Subject: [Spambayes] Question I am using SpamBayes 9.0 on Windows XP, when I highlight a file and "click" on Delete As Spam" nothing happens. Should anything then happen? I can be contacted at robert@interimhc.com or Robert Seawright Interim HealthCare 530.221.1212 V 530.221.7836 F From admin at smallworldwebhosting.com Sun Feb 15 14:32:01 2004 From: admin at smallworldwebhosting.com (SmallWorld Web Hosting) Date: Sun Feb 15 14:29:19 2004 Subject: [Spambayes] user question Message-ID: Hi there firstly fantastic software!!! what id like to know is, is it possible to delete all the spam email from my designated spam folder, i have over 5000 emails there i would rather delete, or do they have to stay there so the program can operate. I was hoping the program just took a log of the email address sent from? Obviously this would free up quite a few megabytes of outlook space best regards and thanks in advance for your feedback Richard Holt www.SmallWorldWebHosting.com Admin@SmallWorldWebHosting.com Tele: 0870 111 81 49 Moby: 07876 752 514 "Affordable Hosting & Design Solutions because . . . it's a Small World" This message and any attachments are confidential and may be legally privileged. SmallWorldWebHosting will not take responsibility for damage caused through this email and/or any attachments. If you are not the intended recipient of this message, please do not disclose the contents to any other person. Please inform Admin@SmallWorldWebHosting.com if this happens ! In the interest of Virus Infection/Protection ALL emails sent and received are checked with 'Norton Anti-Virus 2003' which is updated daily! If you wish to send attachments, please send a 'text only' email first with details of content. From tim at fourstonesExpressions.com Sun Feb 15 15:35:59 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Sun Feb 15 15:36:07 2004 Subject: [Spambayes] user question In-Reply-To: References: Message-ID: On Sun, 15 Feb 2004 19:32:01 -0000, SmallWorld Web Hosting wrote: > Hi there > > firstly fantastic software!!! Thanks! what id like to know is, is it > possible to delete all the spam email from my designated spam folder, > i have over 5000 emails there i would rather delete, or do they have to > stay there so the program can operate. Spambayes doesn't need these emails in order to operate. The only reason you might wish to keep them around is just in case something happens to Spambayes' database, and you need to retrain it. Other than that, delete away, my friend. -- Exprimez vous!; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From atn at webfind-101.com Sun Feb 15 16:33:29 2004 From: atn at webfind-101.com (ATN Enterprises, LLC) Date: Sun Feb 15 16:38:33 2004 Subject: [Spambayes] Stock Traders Daily OTC Alert: CMKK Message-ID: <200402152133.i1FLXRAT062433@mxzilla6.xs4all.nl> An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040215/65e769e2/attachment.html From tameyer at ihug.co.nz Sun Feb 15 17:39:42 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Feb 15 17:40:05 2004 Subject: [Spambayes] Problem with Spambayes installation In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD6A0@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467796C@its-xchg4.massey.ac.nz> > I cannot install spambayes. When I try to install, > I get the following message during installation: > C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll > Unable to register the DLL/OCX: DIRegisterServer > failed code 0x00000000 This should be fixed with the latest version of the installer - 1.0a9 (0.9). It's available either from the "windows" page on our website, or the sourceforge download page. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sun Feb 15 18:49:09 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Feb 15 18:49:38 2004 Subject: [Spambayes] Question In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD7A6@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467796F@its-xchg4.massey.ac.nz> [Robert Seawright] > I am using SpamBayes 0.9 on Windows XP, when I highlight > a file and "click" on Delete As Spam" nothing happens. > Should anything then happen? Yes - the message should be moved to your spam folder (assuming you have it set up), and trained as spam. [Brent Johnson] > I've noticed similar behavior - but only occasionally. I > selected the messages, clicked "Delete as Spam" and nothing > happens. I restarted outlook and it was fine. When this happens, are you able to use *any* of the buttons on the SpamBayes toolbar? I suspect that SpamBayes isn't loading for some reason (so the toolbar will be 'dead'). Was this with 1.0a9 (0.9), or an earlier version? If it's not that the plug-in is disabled by Outlook (but I've not heard of it being disabled and then automatically reenabled), then the log files should have details about what is going wrong. We'd need to see those to be able to figure out what's happening. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sun Feb 15 18:57:49 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Feb 15 18:58:12 2004 Subject: [Spambayes] Frequency distribution for wordinfo counts? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD687@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AB0@its-xchg4.massey.ac.nz> [Thanks for the code Tim!] Here's mine today; I'm using bigrams and 'url slurping' with a prefix. nham 99 nspam 220 ham counts value #times cumm % cumm% 0 61854 61854 48.58 48.58 1 54466 116320 42.78 91.36 2 6172 122492 4.85 96.20 3 1702 124194 1.34 97.54 4 1143 125337 0.90 98.44 5 485 125822 0.38 98.82 6 310 126132 0.24 99.06 7 239 126371 0.19 99.25 8 185 126556 0.15 99.40 9 127 126683 0.10 99.50 10 111 126794 0.09 99.58 11 77 126871 0.06 99.64 12 59 126930 0.05 99.69 13 48 126978 0.04 99.73 14 40 127018 0.03 99.76 15 35 127053 0.03 99.79 16 26 127079 0.02 99.81 17 14 127093 0.01 99.82 18 28 127121 0.02 99.84 19 23 127144 0.02 99.86 20 17 127161 0.01 99.87 21 13 127174 0.01 99.88 22 13 127187 0.01 99.89 23 7 127194 0.01 99.90 24 11 127205 0.01 99.91 25 8 127213 0.01 99.91 26 11 127224 0.01 99.92 27 9 127233 0.01 99.93 28 9 127242 0.01 99.93 29 7 127249 0.01 99.94 30 6 127255 0.00 99.95 31 6 127261 0.00 99.95 32 5 127266 0.00 99.95 33 4 127270 0.00 99.96 34 3 127273 0.00 99.96 35 2 127275 0.00 99.96 36 2 127277 0.00 99.96 37 2 127279 0.00 99.96 38 5 127284 0.00 99.97 40 1 127285 0.00 99.97 42 1 127286 0.00 99.97 43 4 127290 0.00 99.97 44 1 127291 0.00 99.97 45 2 127293 0.00 99.97 46 1 127294 0.00 99.98 48 1 127295 0.00 99.98 50 1 127296 0.00 99.98 51 1 127297 0.00 99.98 52 1 127298 0.00 99.98 55 1 127299 0.00 99.98 60 2 127301 0.00 99.98 61 1 127302 0.00 99.98 63 1 127303 0.00 99.98 64 4 127307 0.00 99.99 65 1 127308 0.00 99.99 66 1 127309 0.00 99.99 71 2 127311 0.00 99.99 74 1 127312 0.00 99.99 76 1 127313 0.00 99.99 77 1 127314 0.00 99.99 78 1 127315 0.00 99.99 87 1 127316 0.00 99.99 89 3 127319 0.00 100.00 97 3 127322 0.00 100.00 99 3 127325 0.00 100.00 spam counts value #times cumm % cumm% 0 54754 54754 43.00 43.00 1 61662 116416 48.43 91.43 2 5367 121783 4.22 95.65 3 2012 123795 1.58 97.23 4 950 124745 0.75 97.97 5 635 125380 0.50 98.47 6 323 125703 0.25 98.73 7 199 125902 0.16 98.88 8 160 126062 0.13 99.01 9 109 126171 0.09 99.09 10 81 126252 0.06 99.16 11 282 126534 0.22 99.38 12 107 126641 0.08 99.46 13 71 126712 0.06 99.52 14 51 126763 0.04 99.56 15 49 126812 0.04 99.60 16 26 126838 0.02 99.62 17 34 126872 0.03 99.64 18 37 126909 0.03 99.67 19 26 126935 0.02 99.69 20 30 126965 0.02 99.72 21 22 126987 0.02 99.73 22 20 127007 0.02 99.75 23 17 127024 0.01 99.76 24 17 127041 0.01 99.78 25 13 127054 0.01 99.79 26 15 127069 0.01 99.80 27 14 127083 0.01 99.81 28 14 127097 0.01 99.82 29 14 127111 0.01 99.83 30 10 127121 0.01 99.84 31 8 127129 0.01 99.85 32 8 127137 0.01 99.85 33 5 127142 0.00 99.86 34 10 127152 0.01 99.86 35 6 127158 0.00 99.87 36 13 127171 0.01 99.88 37 4 127175 0.00 99.88 38 8 127183 0.01 99.89 39 7 127190 0.01 99.89 40 6 127196 0.00 99.90 41 3 127199 0.00 99.90 42 12 127211 0.01 99.91 43 1 127212 0.00 99.91 44 7 127219 0.01 99.92 45 2 127221 0.00 99.92 46 7 127228 0.01 99.92 48 3 127231 0.00 99.93 49 4 127235 0.00 99.93 50 1 127236 0.00 99.93 51 2 127238 0.00 99.93 52 4 127242 0.00 99.93 53 2 127244 0.00 99.94 54 3 127247 0.00 99.94 55 1 127248 0.00 99.94 56 3 127251 0.00 99.94 57 3 127254 0.00 99.94 58 1 127255 0.00 99.95 59 2 127257 0.00 99.95 60 1 127258 0.00 99.95 62 2 127260 0.00 99.95 67 2 127262 0.00 99.95 68 2 127264 0.00 99.95 70 1 127265 0.00 99.95 72 1 127266 0.00 99.95 73 2 127268 0.00 99.96 76 1 127269 0.00 99.96 77 1 127270 0.00 99.96 78 2 127272 0.00 99.96 81 4 127276 0.00 99.96 88 2 127278 0.00 99.96 89 4 127282 0.00 99.97 90 1 127283 0.00 99.97 91 1 127284 0.00 99.97 92 1 127285 0.00 99.97 93 1 127286 0.00 99.97 96 1 127287 0.00 99.97 100 2 127289 0.00 99.97 101 2 127291 0.00 99.97 102 1 127292 0.00 99.97 104 1 127293 0.00 99.97 112 1 127294 0.00 99.98 113 1 127295 0.00 99.98 114 1 127296 0.00 99.98 115 1 127297 0.00 99.98 118 1 127298 0.00 99.98 122 1 127299 0.00 99.98 125 1 127300 0.00 99.98 129 1 127301 0.00 99.98 132 1 127302 0.00 99.98 146 1 127303 0.00 99.98 152 1 127304 0.00 99.98 156 1 127305 0.00 99.98 161 1 127306 0.00 99.99 168 2 127308 0.00 99.99 169 2 127310 0.00 99.99 175 1 127311 0.00 99.99 177 1 127312 0.00 99.99 178 1 127313 0.00 99.99 181 1 127314 0.00 99.99 192 1 127315 0.00 99.99 201 1 127316 0.00 99.99 203 1 127317 0.00 99.99 204 1 127318 0.00 99.99 207 2 127320 0.00 100.00 215 2 127322 0.00 100.00 217 1 127323 0.00 100.00 218 1 127324 0.00 100.00 220 1 127325 0.00 100.00 Database is Berkeley, disk size 10,338,304 bytes. Like Tim, the plain-text db export file is a third that size (3,462,853 bytes). =Tony Meyer From tameyer at ihug.co.nz Sun Feb 15 19:06:12 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Feb 15 19:06:35 2004 Subject: [Spambayes] double listing of all inbox emails In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD26F@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AB1@its-xchg4.massey.ac.nz> > I have windows xp pro operating system and use the > Outlook package. Since installing Spambayes 8.1, I have 2 of > everything in my inbox. This seems to happen after I delete > the first email message. I have looked at the trouble > shooting guide but did not see anything relating to this problem. [...] > The deleted file has 2 and sometimes 3 copies of the messages. I'm still not clear on what is happening here. By "delete", do you mean that you are moving the message to the "Deleted Items" folder, or that you are clicking on "Delete as spam"? If it's the former, I can't see how spambayes would be doing this, since it doesn't do anything when you delete a message. What happens if you delete again? Do you get another copy? If you disable spambayes (Manager dialog), does the problem still happen? Does it happen if you uninstall spambayes? (uninstalling and then reinstalling will not effect your configuration or training). Does this happen with the latest release? 1.0a9 (0.9) > Log file attached. The log file has nothing unusual in it. As an aside, though, you've trained on only 11 spam and 164 ham - it would be much better to have a more equal balance, and definitely to have trained on more spam than that. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From mhammond at skippinet.com.au Sun Feb 15 19:16:38 2004 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun Feb 15 19:17:02 2004 Subject: [Spambayes] RE: Most Recently Received Email Not Filtered In-Reply-To: Message-ID: <389a01c3f422$25e8e030$0200a8c0@eden> > Ah, but SpamBayes does use the Spam field. :) It's used in > GetNewUnscoredMessageGenerator() to try to avoid processing a > message twice. I should have made the distinction clearer (and indeed, used the correct terminology!) MAPI has 'properties', and these can appear on almost any item. While folders themselves have properties, they generally don't hold 'metadata' for the properties in the messages they contain. MAPI itself doesn't need that metadata - it makes no restriction on property types - different items could have completely different data types for the same property. Outlook presents a new 'UserProperties' concept. These are a layer above MAPI properties, but do use them. Each 'UserProperty' value on an item is stored as a MAPI property, but metadata about the UserProperty is also stored in the folder - eg, the type of the property. This allows Outlook to present a sane UI, for things like comparing field values. The field chooser, and formatting rules etc which allow you to take actions based on 'properties') use Outlook 'UserProperties' - for a field to appear in Outlook itself, it is not enough it have the MAPI property, it must be correctly defined as an Outlook 'UserProperty'. > I thought the reason SB did the Ensure was to > avoid the error that might be caused by doing a Restrict with > a user property that didn't exist. But, the only place I knew > for sure that this was done was in the new unscored message > generator. If that's the only place it's done, why Ensure at > all? [Apologies offered. I'm insatiably curious about these > things from an academic standpoint.] The names of the functions try to make it clearer, especially given the above context: EnsureOutlookFieldsForFolder() tries to ensure the UserProperty (ie, the 'Outlook Field') is defined on the *folder*, not any particular item. It calls msgstore.DoesFolderHaveOutlookField() - again, the 'Outlook Field' part of the name tries to imply other than a normal MAPI property. If we determine the folder does not have the field, then we must create it - and the only way to do that is via the Outlook object model, and by creating the UserProperty on an item, with a special flag saying 'also add this to the folder'. To do this, we create a temp item, create the property, then nuke it. In previous versions, we simply used an existing item in the folder, and this process nuked the spam score - leading to the bug. Hope that makes some sense... Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2944 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040216/19906df8/winmail-0001.bin From kurtbuff at spro.net Sun Feb 15 21:01:09 2004 From: kurtbuff at spro.net (Kurt) Date: Sun Feb 15 21:01:17 2004 Subject: FW: [Spambayes] SpamBayes 1.0a9 (0.9) Message-ID: <009201c3f430$bf8974d0$6e57a8c0@bfgapollo1> I couldn't reply directly to the sender, so I'm posting it to the list, without his name, so that he can see the reply, and so that others can benefit as well. I'm not using Exchange as my backend. If you're doing that, I'm guessing that you'll have to use the plugin. -----Original Message----- From: Sent: Saturday, February 14, 2004 23:46 To: kurtbuff@spro.net Subject: RE: [Spambayes] SpamBayes 1.0a9 (0.9) Kurt, How can one do that when using Outlook/Exchange? Thanks, -----Original Message----- From: Kurt [mailto:kurtbuff@spro.net] Sent: Sunday, February 15, 2004 08:29 To: spambayes@python.org Subject: RE: [Spambayes] SpamBayes 1.0a9 (0.9) Thanks! I've dropped the Outlook plugin, and am now using the POP3/SMTP proxy. Took me a few minutes to figure out (mostly without reading the docs) but I'm liking this a lot! Now I really have something I can put in place for all of my friends who don't have Outlook. Kurt -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Tony Meyer Sent: Thursday, February 12, 2004 20:13 To: spambayes-announce@python.org; python-announce@python.org Cc: spambayes@python.org Subject: [Spambayes] SpamBayes 1.0a9 (0.9) The SpamBayes team is pleased to announce the latest release of SpamBayes - 1.0a9 (0.9). This is both a release of the source code *and* of binaries for Microsoft Windows users, including, for the first time, an installation program for all Windows users, not only those that use Microsoft Outlook. As such, we recommend that all SpamBayes users, including those currently running from source, and those using the Outlook plug-in, upgrade to this release. The Windows installation program will install either the Outlook add-in (for Microsoft Outlook users), or the SpamBayes server program (for all other mail client users, including Microsoft Outlook Express). All Windows users (including existing users of the Outlook add-in) are encouraged to use the installation program. If you wish to use the source-code version, you will also need to install Python - see README.txt in the source tree (or view it online) for more information. This release fixes a large number of bugs in the last release, and includes a few new features, too. For a detailed description of everything (well, everything we remember) that has changed since the last release, you can view our WHAT_IS_NEW.txt file, either online, or in the source distribution. Get it from the 'Download' page at http://www.spambayes.org/download.html Enjoy the new release and your spam-free mailbox :-) Tony (on behalf of the SpamBayes team) --- What is SpamBayes? --- The SpamBayes project is working on developing a Bayesian (of sorts) anti-spam filter (in Python), initially based on the work of Paul Graham. The major difference between this and other, similar projects is the emphasis on testing newer approaches to scoring messages. The project includes a number of different applications, all using the same core code, ranging from a plug-in for Microsoft Outlook, to a POP3 proxy, to various command-line tools. _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From tameyer at ihug.co.nz Sun Feb 15 21:05:45 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Feb 15 21:06:09 2004 Subject: [Spambayes] SpamBayes 1.0a9 (0.9) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD839@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677973@its-xchg4.massey.ac.nz> >> How can one do that when using Outlook/Exchange? > I'm not using Exchange as my backend. > > If you're doing that, I'm guessing that you'll have to use the plugin. That's correct. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From george.barnes at tafe.nsw.edu.au Sun Feb 15 21:08:10 2004 From: george.barnes at tafe.nsw.edu.au (Barnes, George) Date: Sun Feb 15 21:08:21 2004 Subject: [Spambayes] Lost attachments Message-ID: <3D896FF38B414A4CB02638DAE0FFA1561B6FCB@nefexch1.newcastle.tafensw.edu.au> Sandy Gazda wrote > I have this product on my computer and am frankly > very confused. Most attachments sent to me are now > being deleted, and I cannot read them. However, they are > legitimate, important attachments that I need to open. > What can I do to correct this problem, and is there a > way to recover those that have been sent to me and deleted > by this program -- that I want to open? No idea if this is related, but I got a bug with Outlook 2000, where my sent attachments were not received , until I sent e-mails only in "Plain Text" format.... HTH, regards, George Barnes ********************************************************************** This message is intended for the addressee named and may contain privileged information or confidential information or both. If you are not the intended recipient please delete it and notify the sender. ********************************************************************** From mitch at houseofpain.org Mon Feb 16 03:05:39 2004 From: mitch at houseofpain.org (mitch@houseofpain.org) Date: Mon Feb 16 03:05:42 2004 Subject: [Spambayes] Problem with POP3 Proxy Message-ID: A non-text attachment was scrubbed... Name: SpamBayesServer1.log Type: application/octet-stream Size: 1765 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040216/ed216765/SpamBayesServer1.obj -------------- next part -------------- I am using SpamBayes POP3 Proxy Version 0.4 (February 2004) (binary), with version 2.3.2+ (#49, Oct 23 2003, 15:50:06) [MSC v.1200 32 bit (Intel)] of Python; my operating system is Windows 5.0.2195.2 (Service Pack 2). I have trained 887 ham and 1549 spam. The problem I am having is [DESCRIBE YOUR PROBLEM HERE] Forwarding spam/ham to the local training addresses gives the following exception (sorry for shitty formatting). I also get a SMTP error back from my mail client: "Enter data ending with a . on a line by itself". I've actually never been able to successfully forward a mail for training - with versions 1.06a, 1.07a, 1.09a. mitch error: uncaptured python exception, closing channel (exceptions.NameError:global name 'message_from_string' is not defined [asyncore.pyc|read|69] [asyncore.pyc|handle_read_event|387] [asynchat.pyc|handle_read|136] [spambayes\smtpproxy.pyc|found_terminator|214] [spambayes\smtpproxy.pyc|onProcessData|314] [spambayes\smtpproxy.pyc|train|406] [spambayes\smtpproxy.pyc|extractSpambayesID|361]) error: uncaptured python exception, closing channel (exceptions.NameError:global name 'message_from_string' is not defined [asyncore.pyc|read|69] [asyncore.pyc|handle_read_event|387] [asynchat.pyc|handle_read|136] [spambayes\smtpproxy.pyc|found_terminator|214] [spambayes\smtpproxy.pyc|onProcessData|314] [spambayes\smtpproxy.pyc|train|406] [spambayes\smtpproxy.pyc|extractSpambayesID|361]) From mitch at houseofpain.org Mon Feb 16 03:09:48 2004 From: mitch at houseofpain.org (Mitch Deoudes) Date: Mon Feb 16 03:10:22 2004 Subject: [Spambayes] Re: Problem with POP3 Proxy References: Message-ID: <40307ACC.DBA0E180@houseofpain.org> Wow - that submit help form really has some pretty output. Was I supposed to be typing in HTML? Here it is again in readable-ish form: SpamBayes POP3 Proxy Version 0.4 (February 2004) (binary), with version 2.3.2+ (#49, Oct 23 2003, 15:50:06) [MSC v.1200 32 bit (Intel)] of Python; my operating system is Windows 5.0.2195.2 (ServicePack 2). I have trained 887 ham and 1549 spam. Forwarding spam/ham to the local training addresses gives the following exception (sorry for shitty formatting). I also get a SMTP error back from my mail client: "Enter data ending with a . on a line by itself". I've actually never been able to successfully forward a mail for training - with versions 1.06a, 1.07a, 1.09a. error: uncaptured python exception, closing channel (exceptions.NameError:global name 'message_from_string' is not defined [asyncore.pyc|read|69] [asyncore.pyc|handle_read_event|387] [asynchat.pyc|handle_read|136] [spambayes\smtpproxy.pyc|found_terminator|214] [spambayes\smtpproxy.pyc|onProcessData|314] [spambayes\smtpproxy.pyc|train|406] [spambayes\smtpproxy.pyc|extractSpambayesID|361]) error: uncaptured python exception, closing channel (exceptions.NameError:global name 'message_from_string' is not defined [asyncore.pyc|read|69] [asyncore.pyc|handle_read_event|387] [asynchat.pyc|handle_read|136] [spambayes\smtpproxy.pyc|found_terminator|214] [spambayes\smtpproxy.pyc|onProcessData|314] [spambayes\smtpproxy.pyc|train|406] [spambayes\smtpproxy.pyc|extractSpambayesID|361]) mitch From tameyer at ihug.co.nz Mon Feb 16 03:21:43 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 16 03:22:07 2004 Subject: [Spambayes] Problem with POP3 Proxy In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD8B4@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677977@its-xchg4.massey.ac.nz> [Aside: ooh! Someone using the 'report a bug' form! Yay!] > Forwarding spam/ham to the > local training addresses gives the following exception [edited] > exceptions.NameError:global > name 'message_from_string' is not defined > [spambayes\smtpproxy.pyc|extractSpambayesID|361]) Damn. Sorry - this was reported with the previous release, and I thought that I had fixed it, but I only fixed it in one place, and it needed to be in two. I've fixed it now, so it should be done for the next release (which should hopefully be quicker than 1.0a7-1.0a9). Sorry about that. > I also get a SMTP error back from my mail client: > "Enter data ending with a . on a line by itself". I'm pretty sure this is a result of that bug. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 16 03:27:01 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 16 03:27:22 2004 Subject: [Spambayes] Re: Problem with POP3 Proxy In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD8B6@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677978@its-xchg4.massey.ac.nz> > Wow - that submit help form really has some pretty output. :) I'll take another look at that (it's been a while since it was added, but it was never in the code that was being released). Still, I'd rather have ugly looking messages with all the information necessary to help people out than easy-to-read messages that have a "please tell me x" answer.... > Was I supposed to be typing in HTML? No. It was really just line breaks that got mangled up. > Here it is again in readable-ish form: Thanks :) =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From mitch at houseofpain.org Mon Feb 16 03:48:31 2004 From: mitch at houseofpain.org (Mitch Deoudes) Date: Mon Feb 16 03:49:08 2004 Subject: [Spambayes] Problem with POP3 Proxy References: <1ED4ECF91CDED24C8D012BCF2B034F1304677977@its-xchg4.massey.ac.nz> Message-ID: <403083DF.44686EFD@houseofpain.org> Thanks for the quick response! Look forward to seeing 1.0a10, or 1.0aPI^e, or whatever the next one is. sb is a great utility overall, and the web interface is pretty slick. In related news - despite numbers like these: [Total emails trained: Spam: 1549 Ham: 888], I've been pretty consistently getting 5-10 "unsures" a day... and recently, a couple of false negatives, too. (Though only one or two false positives over the life of the installation, which is great.) I wonder if this is due to smarter / more pathological spam in recent days. In any case, it's a buttload easier to deal with than the alternative. mitch Tony Meyer wrote: > [Aside: ooh! Someone using the 'report a bug' form! Yay!] > > > Forwarding spam/ham to the > > local training addresses gives the following exception > [edited] > > exceptions.NameError:global > > name 'message_from_string' is not defined > > [spambayes\smtpproxy.pyc|extractSpambayesID|361]) > > Damn. Sorry - this was reported with the previous release, and I thought > that I had fixed it, but I only fixed it in one place, and it needed to be > in two. I've fixed it now, so it should be done for the next release (which > should hopefully be quicker than 1.0a7-1.0a9). Sorry about that. > > > I also get a SMTP error back from my mail client: > > "Enter data ending with a . on a line by itself". > > I'm pretty sure this is a result of that bug. > > =Tony Meyer > > --- > Please always include the list (spambayes@python.org) in your replies > (reply-all), and please don't send me personal mail about SpamBayes. This > way, you get everyone's help, and avoid a lack of replies when I'm busy. From bkc at murkworks.com Mon Feb 16 09:52:10 2004 From: bkc at murkworks.com (Brad Clements) Date: Mon Feb 16 09:49:42 2004 Subject: [Spambayes] Supporting new database type in classifier In-Reply-To: References: <402E37B3.21792.141F2702@localhost> Message-ID: <4030922C.2740.1D50BAAE@localhost> On 14 Feb 2004 at 23:25, Tim Peters wrote: > You'll probably get better responses on the spambayes-dev list. Ah, I must have missed the announcement of that list when it was created. > I encourage you to work on a branch for now -- since most people drop most > ideas after a few weeks at most, I'm opposed to warping this part of the > code to cater to something as unlikely to be seen again as a > non-random-access database model. If you work on a branch and demonstrate > astonishing results, great, then we'll junk all other storages and adopt > yours . Well ok, except I wasn't asking about the mechanics of putting my code into the tree, but rather, what's the best way to refactor Classifier so this would be easier to do. > > I could override _getclues, but then I'd have to recreate the > > bigram stuff which is quite a lot. > > It's less than 30 lines of code (half of it is comments). But then that code would be duplicated. So at some point (assuming I don't fade away), we'll only want one copy of the bigram synthesis code. That's the basis of my question, what's the best way to re-arrange the existing code? > > Second, what's the best way to restructure classifier so that a > > storage subclass can deal with entire wordstreams in one lump if > > it so chooses? > > On a branch -- prove this is worth doing first, and don't worry about doing > it cleanly before that succeeds. > heh heh. You're not answering my question.. ;-) I'll be back in touch with my dirty proof of concept. -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From dave at optonline.net Mon Feb 16 14:18:01 2004 From: dave at optonline.net (Dave Levinter) Date: Mon Feb 16 11:21:26 2004 Subject: [Spambayes] Spambayes Plugin for Evolution Message-ID: <1076959080.12511.3.camel@linux.local> Hey Guys, I absolutely LOVE your Spambayes plugin for Outlook. Works great for when I'm using my Windows machine. I was just curious if there's been any thought about creating a similar plugin for Evolution? I've seen alot of posts in numerous Linux forums about people looking for this kind of functionality in order to break completely from Windows and thought it wouldn't be too much of a stretch for you. Dave dave@optonline.net SuSE 9.0 From Crandall at svg-law.com Mon Feb 16 14:37:21 2004 From: Crandall at svg-law.com (Deb Crandall) Date: Mon Feb 16 11:37:52 2004 Subject: [Spambayes] "You must configure the Spam folder" Message-ID: <00d201c3f4c4$4ebcf6c0$1802a8c0@toshibauser> Help! I have set up Spambayes. It was working fine. Then this morning when I went to delete messages from the "maybe spam" folder as "spam' I keep getting the message: "You must configure the Spam folder" Tried the troubleshooting guide. No answer. Tried FAQ - no answer Tried going into Spambayes manager and "reset config." And "config mgr." but nothing happens. "Config mgr" doesn't even come up! Help! I am overrun with &*%#$ spam. Thanks! -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040216/5e06fd46/attachment.html From mikec at mikecauser.org Mon Feb 16 14:08:09 2004 From: mikec at mikecauser.org (Mike Causer) Date: Mon Feb 16 14:09:26 2004 Subject: [Spambayes] Problem with SpamBayes 1.0a9 & procmail Message-ID: <20040216190809.4ed920f0.mikec@mikecauser.org> Problem in using SpamBayes 1.0a9 (Python 2.3 / Mandrake Linux 9.2) The installation is a fresh one, with a hammiedb created today from 407 spams and 494 hams. .spambayesrc looks like this: [Storage] persistent_use_database=True persistent_storage_file=~/.hammiedb [Headers] include_evidence=True include_score=True Although the original problem showed up when running through procmail, filtering by command line yields the same results, so that's what I'll quote. Running sb_filter on an mbox file gets: [mikec@lugh mikec]$ /usr/bin/sb_filter.py < /var/spool/mail/mikec Traceback (most recent call last): File "/usr/bin/sb_filter.py", line 239, in ? main() File "/usr/bin/sb_filter.py", line 231, in main action(msg) File "/usr/bin/sb_filter.py", line 163, in filter return h.filter(msg) File "/usr/lib/python2.3/site-packages/spambayes/hammie.py", line 109, in filter prob, clues = self._scoremsg(msg, True) File "/usr/lib/python2.3/site-packages/spambayes/hammie.py", line 38, in _scoremsg return self.bayes.spamprob(tokenize(msg), evidence) File "/usr/lib/python2.3/site-packages/spambayes/classifier.py", line 190, in chi2_spamprob clues = self._getclues(wordstream) File "/usr/lib/python2.3/site-packages/spambayes/classifier.py", line 493, in _getclues tup = self._worddistanceget(word) File "/usr/lib/python2.3/site-packages/spambayes/classifier.py", line 508, in _worddistanceget prob = self.probability(record) File "/usr/lib/python2.3/site-packages/spambayes/classifier.py", line 308, in probability assert hamcount <= nham AssertionError [mikec@lugh mikec]$ The AssertionError occurs on the last message in the input (found after inserting a few print statements in classifier.py). This is the same result for any input file whether single ham, single spam or a mixture of both, except that zero length input passes OK: [mikec@lugh mikec]$ /usr/bin/sb_filter.py ^D X-Spambayes-Classification: unsure; 0.49 X-Spambayes-Evidence: '*H*': 0.45; '*S*': 0.42; 'reply-to:none': 0.29; 'content-type:text/plain': 0.37; 'sender:none': 0.79 [mikec@lugh mikec]$ I do have a suspicion that the .spambayesrc might not be complete, but nothing leaps off the screen while reading Options.py This would be a good opportunity to get myself back up to speed on Python after a gap of a few years perhaps, but it would be nice to get rid of the spam first ;-) Mike -- Mike Causer Email - mailto:mikec@mikecauser.org GPG KeyID 1C2DDA07 WWW - http://www.mikecauser.org Flood the fen again! - Wicken Fen enlargement - http://www.wicken.org.uk From papaDoc at videotron.ca Mon Feb 16 14:23:40 2004 From: papaDoc at videotron.ca (papaDoc) Date: Mon Feb 16 14:23:20 2004 Subject: [Spambayes] Problem with SpamBayes 1.0a9 & procmail In-Reply-To: <20040216190809.4ed920f0.mikec@mikecauser.org> References: <20040216190809.4ed920f0.mikec@mikecauser.org> Message-ID: <403118BC.4000904@videotron.ca> Hi Mike, >Problem in using SpamBayes 1.0a9 (Python 2.3 / Mandrake Linux 9.2) > >The installation is a fresh one, with a hammiedb created today from 407 >spams and 494 hams. .spambayesrc looks like this: > > Did you trained with new mails or with mail you have trained your earlier version of spambayes? Did you use the "-f" in the command line for training. ? Spambayes add a header saying an email was already trained so don't trained on this one again. the flag "-f" force to traine on those message anyway. This can be a cause that your nham and nspam are still at 0 If this is not the case export your database to a flat file and check the number of ham and spam. > File "/usr/lib/python2.3/site-packages/spambayes/classifier.py", line 308, in probability > assert hamcount <= nham > AssertionError > [mikec@lugh mikec]$ > > >This is the same result for any input file whether single ham, single >spam or a mixture of both, except that zero length input passes OK: > > For this I can't help ??? Remi -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From rhaskett at golden.net Mon Feb 16 14:45:29 2004 From: rhaskett at golden.net (Robert Haskett) Date: Mon Feb 16 14:45:52 2004 Subject: [Spambayes] 1.09a Installer missing pop3proxy_service.exe ?? Message-ID: <041d01c3f4c5$79e0b4d0$6401a8c0@weasel> I just downloaded the 1.09a package and opened the readme_proxy.html. Under the Using the Service paragraph is mentions a /proxy directory under the main SpamBayes folder. This doesn't exist and isn't installed by the 1.09a installer. Is the installer broken? .r. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040216/8c7a484d/attachment.html From kennypitt at hotmail.com Mon Feb 16 14:59:52 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Feb 16 15:00:53 2004 Subject: [Spambayes] Frequency distribution for wordinfo counts? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AB0@its-xchg4.massey.ac.nz> Message-ID: Tony Meyer wrote: > Here's mine today; I'm using bigrams and 'url slurping' with a prefix. And mine, with mine_received_headers and x-pick_apart_urls enabled. All other options are Outlook plugin defaults. Db file size is 671,744 bytes nham 79 nspam 92 ham counts value #times cumm % cumm% 0 7247 7247 42.95 42.95 1 6694 13941 39.68 82.63 2 1414 15355 8.38 91.01 3 523 15878 3.10 94.11 4 300 16178 1.78 95.89 5 164 16342 0.97 96.86 6 97 16439 0.57 97.43 7 77 16516 0.46 97.89 8 91 16607 0.54 98.43 9 44 16651 0.26 98.69 10 35 16686 0.21 98.90 11 28 16714 0.17 99.06 12 17 16731 0.10 99.16 13 16 16747 0.09 99.26 14 14 16761 0.08 99.34 15 25 16786 0.15 99.49 16 3 16789 0.02 99.51 17 3 16792 0.02 99.53 18 24 16816 0.14 99.67 19 6 16822 0.04 99.70 20 6 16828 0.04 99.74 21 2 16830 0.01 99.75 22 2 16832 0.01 99.76 23 2 16834 0.01 99.77 24 2 16836 0.01 99.79 25 1 16837 0.01 99.79 26 2 16839 0.01 99.80 27 1 16840 0.01 99.81 28 2 16842 0.01 99.82 29 2 16844 0.01 99.83 30 1 16845 0.01 99.84 31 1 16846 0.01 99.85 32 3 16849 0.02 99.86 35 2 16851 0.01 99.88 37 2 16853 0.01 99.89 39 1 16854 0.01 99.89 41 1 16855 0.01 99.90 42 1 16856 0.01 99.91 45 1 16857 0.01 99.91 46 1 16858 0.01 99.92 54 1 16859 0.01 99.92 58 1 16860 0.01 99.93 60 1 16861 0.01 99.93 63 1 16862 0.01 99.94 64 1 16863 0.01 99.95 67 1 16864 0.01 99.95 75 2 16866 0.01 99.96 78 2 16868 0.01 99.98 79 4 16872 0.02 100.00 spam counts value #times cumm % cumm% 0 7086 7086 42.00 42.00 1 7612 14698 45.12 87.11 2 961 15659 5.70 92.81 3 523 16182 3.10 95.91 4 231 16413 1.37 97.28 5 109 16522 0.65 97.93 6 61 16583 0.36 98.29 7 47 16630 0.28 98.57 8 38 16668 0.23 98.79 9 28 16696 0.17 98.96 10 17 16713 0.10 99.06 11 26 16739 0.15 99.21 12 12 16751 0.07 99.28 13 17 16768 0.10 99.38 14 10 16778 0.06 99.44 15 8 16786 0.05 99.49 16 6 16792 0.04 99.53 17 9 16801 0.05 99.58 18 6 16807 0.04 99.61 19 7 16814 0.04 99.66 20 7 16821 0.04 99.70 23 5 16826 0.03 99.73 24 2 16828 0.01 99.74 25 2 16830 0.01 99.75 26 1 16831 0.01 99.76 27 1 16832 0.01 99.76 28 1 16833 0.01 99.77 29 2 16835 0.01 99.78 30 1 16836 0.01 99.79 31 1 16837 0.01 99.79 32 1 16838 0.01 99.80 34 2 16840 0.01 99.81 35 1 16841 0.01 99.82 40 3 16844 0.02 99.83 42 2 16846 0.01 99.85 47 1 16847 0.01 99.85 48 2 16849 0.01 99.86 49 2 16851 0.01 99.88 53 1 16852 0.01 99.88 54 1 16853 0.01 99.89 61 1 16854 0.01 99.89 62 1 16855 0.01 99.90 63 1 16856 0.01 99.91 64 1 16857 0.01 99.91 67 1 16858 0.01 99.92 68 1 16859 0.01 99.92 69 2 16861 0.01 99.93 73 1 16862 0.01 99.94 75 1 16863 0.01 99.95 82 1 16864 0.01 99.95 85 1 16865 0.01 99.96 86 1 16866 0.01 99.96 88 2 16868 0.01 99.98 89 1 16869 0.01 99.98 90 1 16870 0.01 99.99 91 1 16871 0.01 99.99 92 1 16872 0.01 100.00 -- Kenny Pitt From rbell at hcdemocrat.com Mon Feb 16 15:08:34 2004 From: rbell at hcdemocrat.com (Bell, Robert) Date: Mon Feb 16 15:08:38 2004 Subject: [Spambayes] OT: SpamBayes ideas => Mozilla mail (aka Thunderbird) Message-ID: <7DE2DEA4406E2F4CA32EAD4B9C6C9882016C282E@et-prod5.express-times.net> Thunderbird, the open-source e-mail client that is growing out of the Mozilla project, has had a pretty lame spam filter. But recently the developer has been going over the SpamBayes code and reimplementing some of it in C++. http://bugzilla.mozilla.org/show_bug.cgi?id=230093 Things are looking up. From tpeters at mixcom.com Mon Feb 16 15:08:46 2004 From: tpeters at mixcom.com (Tom Peters) Date: Mon Feb 16 15:09:03 2004 Subject: [Spambayes] 1.09a Installer missing pop3proxy_service.exe ?? In-Reply-To: <041d01c3f4c5$79e0b4d0$6401a8c0@weasel> Message-ID: <5.1.0.14.2.20040216140706.0aa35950@localhost> Isn't it called sb_server.py since about version 1.0.a7? This is from memory. Try the pop3 setup quick start wiki: http://entrian.com/sbwiki/POP3ServiceQuickStartGuide At 02:45 PM 2/16/2004 -0500, Robert Haskett wrote: >I just downloaded the 1.09a package and opened the >readme_proxy.html. Under the Using the Service paragraph is mentions a >/proxy directory under the main SpamBayes folder. This doesn't exist and >isn't installed by the 1.09a installer. Is the installer broken? > >.r. >_______________________________________________ >Spambayes@python.org >http://mail.python.org/mailman/listinfo/spambayes >Check the FAQ before asking: http://spambayes.sf.net/faq.html [Love] Charm is the measure of attraction's power / To chain the fleeting fancy of the hour. --Louisa Thomas (b. 1865) "What is Charm?" --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040216/9e2fa3fe/attachment-0001.html From rmalayter at bai.org Mon Feb 16 15:12:48 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Feb 16 15:12:49 2004 Subject: [Spambayes] Lost attachments Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01E19CB7@cliff.bai.org> [George Barnes] > No idea if this is related, but I got a bug with Outlook > 2000, where my sent attachments were not received , until I > sent e-mails only in "Plain Text" format.... That is probably because Outlook is set up to send messages in Rich Text format, instead of HTML. Outlook versions later than 98 can send messages in one of three formats: plain text, HTML, and Microsoft's proprietary Rich Text Format (RTF). RTF is a subset of Microsoft Word's formatting, and is the native message format for older versions of Outlook and Exchange Server, and the old Microsoft Mail. Non-Microsoft clients and gateways often have trouble with RTF-formatted messages, showing strange attachments, or nothing at all. There is an option in all of these versions of Outlook later than 98 to send HTML-formatted messages instead of RTF to Internet recipients. It's under Tools->Options->Mail format tab in my version of Outlook (2003). Regards, Ryan From kennypitt at hotmail.com Mon Feb 16 15:13:31 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Feb 16 15:14:31 2004 Subject: [Spambayes] 1.09a Installer missing pop3proxy_service.exe ?? In-Reply-To: <041d01c3f4c5$79e0b4d0$6401a8c0@weasel> Message-ID: Have you started the proxy? The directory you refer to is a data directory that is not created by the installer. It should be created the first time you start the proxy. -- Kenny Pitt _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Robert Haskett Sent: Monday, February 16, 2004 2:45 PM To: spambayes@python.org Subject: [Spambayes] 1.09a Installer missing pop3proxy_service.exe ?? I just downloaded the 1.09a package and opened the readme_proxy.html. Under the Using the Service paragraph is mentions a /proxy directory under the main SpamBayes folder. This doesn't exist and isn't installed by the 1.09a installer. Is the installer broken? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040216/f2f75d05/attachment.html From kennypitt at hotmail.com Mon Feb 16 15:17:48 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Feb 16 15:18:45 2004 Subject: [Spambayes] "You must configure the Spam folder" In-Reply-To: <00d201c3f4c4$4ebcf6c0$1802a8c0@toshibauser> Message-ID: Sounds like you may be suffering from FAQ 3.13, "Help! I deleted the Unsure/Spam folder." We've found that it can be all too easy to delete a folder accidentally in Outlook, sometimes without even realizing it. Even if this isn't the case, following these instructions should solve almost any "you must configure the Spam folder" error. http://spambayes.sourceforge.net/faq.html#help-i-deleted-the-unsure-spam -folder -- Kenny Pitt _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Deb Crandall Sent: Monday, February 16, 2004 2:37 PM To: spambayes@python.org Subject: [Spambayes] "You must configure the Spam folder" Help! I have set up Spambayes. It was working fine. Then this morning when I went to delete messages from the "maybe spam" folder as "spam' I keep getting the message: "You must configure the Spam folder" -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040216/c1bf1e5a/attachment.html From mikec at mikecauser.org Mon Feb 16 15:19:35 2004 From: mikec at mikecauser.org (Mike Causer) Date: Mon Feb 16 15:21:13 2004 Subject: [Spambayes] Problem with SpamBayes 1.0a9 & procmail In-Reply-To: <403118BC.4000904@videotron.ca> References: <20040216190809.4ed920f0.mikec@mikecauser.org> <403118BC.4000904@videotron.ca> Message-ID: <20040216201935.3efa68bd.mikec@mikecauser.org> On Mon, 16 Feb 2004 14:23:40 -0500 papaDoc wrote: > Did you trained with new mails or with mail you have trained your > earlier version of spambayes? All new. Indeed the ham all arrived in the last 24 hours, and the spam in the last month. SpamBayes was never installed on this machine, although there was a hammiedb dating from about a year ago in my home directory (copied over when upgrading from an older machine.) The hammiedb was deleted before training the new installation. Mike -- Mike Causer Email - mailto:mikec@mikecauser.org GPG KeyID 1C2DDA07 WWW - http://www.mikecauser.org Flood the fen again! - Wicken Fen enlargement - http://www.wicken.org.uk From mikec at mikecauser.org Mon Feb 16 15:36:07 2004 From: mikec at mikecauser.org (Mike Causer) Date: Mon Feb 16 15:36:44 2004 Subject: [Spambayes] Problem with SpamBayes 1.0a9 & procmail In-Reply-To: <403118BC.4000904@videotron.ca> References: <20040216190809.4ed920f0.mikec@mikecauser.org> <403118BC.4000904@videotron.ca> Message-ID: <20040216203607.5211b011.mikec@mikecauser.org> On Mon, 16 Feb 2004 14:23:40 -0500 papaDoc wrote: > This can be a cause that your nham and nspam are still at 0 No, they're correctly (if I read the code right) 494.0 and 407.0. If it helps, spamcount is 116 and hamcount is 351, but I haven't tracked down how they got that way. Mike -- Mike Causer Email - mailto:mikec@mikecauser.org GPG KeyID 1C2DDA07 WWW - http://www.mikecauser.org Flood the fen again! - Wicken Fen enlargement - http://www.wicken.org.uk From bradley at kri.ch Mon Feb 16 16:27:56 2004 From: bradley at kri.ch (Bradley Richards) Date: Mon Feb 16 16:38:46 2004 Subject: [Spambayes] Pop3proxy - Web interface crashes when reviewing messages Message-ID: Hi folks, I didn't find anything listed amongst the currently known bugs, so here's a brief report: I'm using SpamBayes as a pop3proxy on the firewall of our tiny company (3 people using e-mail). We receive a couple hundred spams per day, and the firewall is an old, unkillable machine: Pentium-133 running NT4 (sp6a). I'm running SpamBayes 1.0a9, and start the server as Administrator in a DOS-Box using "sb_server.exe -b" I review messages across the network from a different machine. When I click on "review messages", Opera tells me is has received 321 bytes, and then I wait. And wait. And eventually nothing further happens and I'm left with a blank page. Sometimes a refresh works. Otherwise, the only solution seems to be to go to the subdirectories, delete all the spam/ham/unsure messages, and restart the pop3proxy. I suspect this problem is related to the amount of work being done, on a relatively slow machine. Timeout on the http connection? By the time this error has occured, the DOS Box contains several error messages saying error: uncaptured python exception. closing channel (socker.error:(9, 'Bad file descriptor') [asynchat.pyc|initiate_send|218] ]asyncor.pyc|send|334]). If it's important, I can attempt to see if these actually appear at the time of the interace problem. Or if there is any other useful information I could provide, please let me know... Cheers, Brad -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040216/a09347b9/attachment.html From papaDoc at videotron.ca Mon Feb 16 17:10:34 2004 From: papaDoc at videotron.ca (papaDoc) Date: Mon Feb 16 17:10:18 2004 Subject: [Spambayes] Problem with SpamBayes 1.0a9 & procmail In-Reply-To: <20040216201935.3efa68bd.mikec@mikecauser.org> References: <20040216190809.4ed920f0.mikec@mikecauser.org> <403118BC.4000904@videotron.ca> <20040216201935.3efa68bd.mikec@mikecauser.org> Message-ID: <40313FDA.8080908@videotron.ca> Mike Causer wrote: >On Mon, 16 Feb 2004 14:23:40 -0500 papaDoc wrote: > > > >>Did you trained with new mails or with mail you have trained your >>earlier version of spambayes? >> >> > >All new. Indeed the ham all arrived in the last 24 hours, and the >spam in the last month. > So you received 494 hams in the last 24hrs ??? Wouah you have a lot of friends ;-) What did you use for training ? (sb_mboxtraine I guess) Try to train again but add the "-f" flag to the command line. >SpamBayes was never installed on this machine, although there was a >hammiedb dating from about a year ago in my home directory (copied over >when upgrading from an older machine.) The hammiedb was deleted >before training the new installation. > > The mails wer never use as training set ? Remi -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From papaDoc at videotron.ca Mon Feb 16 17:17:23 2004 From: papaDoc at videotron.ca (papaDoc) Date: Mon Feb 16 17:16:56 2004 Subject: [Spambayes] To many option displayed for "Storage options" Message-ID: <40314173.2020304@videotron.ca> Hi, I'm using the latest spambayes packages (cvs of the 12/02/2004) In the web UI in the Advance option in the section "Storage Options" You have: Use database for storage: * mysql * pgsql * dbm * pickle * True * False * True * False It looks to me that the True or False don't belong there. And the comment on the side is also strange. Remi -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From papaDoc at videotron.ca Mon Feb 16 17:20:13 2004 From: papaDoc at videotron.ca (papaDoc) Date: Mon Feb 16 17:19:46 2004 Subject: [Spambayes] Trained email in sb_server Message-ID: <4031421D.7090106@videotron.ca> Hi, When I train an unsure email as ham in the rewiew page of the sb_server. The mail is moved from unknow_cache to ham_cache but there is not header in the mail indicating I have trained with this one. So my question is: This is normal behavior ? Remi -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From tameyer at ihug.co.nz Mon Feb 16 18:17:37 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 16 18:18:03 2004 Subject: [Spambayes] Trained email in sb_server In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD9C9@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677979@its-xchg4.massey.ac.nz> > When I train an unsure email as ham in the review page of the > sb_server. The mail is moved from unknown_cache to ham_cache > but there is not header in the mail indicating I have trained > with this one. > > So my question is: This is normal behavior ? Yes. sb_server knows that it's trained already because: 1. It's in the ham_cache (or spam_cache) directory - the only place that mail that hasn't been trained is looked for is in the unknown_cache directory. 2. The message_info database keeps track of the classification and training status of the messages. There really isn't a need for a X-Spambayes-Trained header. =Tony Meyer From tameyer at ihug.co.nz Mon Feb 16 18:21:17 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 16 18:21:41 2004 Subject: [Spambayes] Problem with SpamBayes 1.0a9 & procmail In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD951@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467797A@its-xchg4.massey.ac.nz> > The installation is a fresh one, with a hammiedb created > today from 407 spams and 494 hams. [...] > assert hamcount <= nham > AssertionError This is definitely a database error (basically your database is corrupt - it contains one or more tokens that have been seen in more ham than you have trained, which is impossible). The solution is to either manually fix the database or (preferably) retrain from scratch. As to *how* it got corrupted, I don't know :) Remi would have more of an idea here, since he uses sb_filter/sb_mboxtrain and I don't. (If do you figure out a way to corrupt the db, then we've love to know so that we can protect against it). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 16 18:27:56 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 16 18:28:20 2004 Subject: [Spambayes] 1.09a Installer missing pop3proxy_service.exe ?? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD98E@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467797B@its-xchg4.massey.ac.nz> [Robert Haskett] > I just downloaded the 1.09a package and opened the > readme_proxy.html. Under the Using the Service > paragraph is mentions a /proxy directory under the > main SpamBayes folder. This doesn't exist and isn't > installed by the 1.09a installer. Is the installer broken? [Kenny] > Have you started the proxy? The directory you refer > to is a data directory that is not created by the > installer. It should be created the first time you start the proxy. Ah, no - Robert is right, and the installer is 'broken' (well, not really, but it doesn't install the binary for the Windows service). There are two problems here - the readme should refer to SpamBayes/bin, not SpamBayes/proxy, and the installer doesn't actually include the pop3proxy_service.exe file. So the answer is that the service isn't available with (the binary version of) 1.0a9 (0.9). It'll still work fine from source, of course, and running it as a simple application rather than a service should still be fine. Apologies for this! I'll check in fixes in a moment. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From rhaskett at golden.net Mon Feb 16 18:42:57 2004 From: rhaskett at golden.net (Robert Haskett) Date: Mon Feb 16 18:43:01 2004 Subject: [Spambayes] 1.09a Installer missing pop3proxy_service.exe ?? References: <1ED4ECF91CDED24C8D012BCF2B034F130467797B@its-xchg4.massey.ac.nz> Message-ID: <049501c3f4e6$9b1d9b10$6401a8c0@weasel> Hey, no probs. Yeah, I should have put 'broken' in quotes. No apologies necessary. =) Thanks for the quick replies. .r. ----- Original Message ----- From: "Tony Meyer" To: "'Kenny Pitt'" ; "'Robert Haskett'" ; Sent: Monday, February 16, 2004 6:27 PM Subject: RE: [Spambayes] 1.09a Installer missing pop3proxy_service.exe ?? > [Robert Haskett] > > I just downloaded the 1.09a package and opened the > > readme_proxy.html. Under the Using the Service > > paragraph is mentions a /proxy directory under the > > main SpamBayes folder. This doesn't exist and isn't > > installed by the 1.09a installer. Is the installer broken? > > [Kenny] > > Have you started the proxy? The directory you refer > > to is a data directory that is not created by the > > installer. It should be created the first time you start the proxy. > > Ah, no - Robert is right, and the installer is 'broken' (well, not really, > but it doesn't install the binary for the Windows service). > > There are two problems here - the readme should refer to SpamBayes/bin, not > SpamBayes/proxy, and the installer doesn't actually include the > pop3proxy_service.exe file. > > So the answer is that the service isn't available with (the binary version > of) 1.0a9 (0.9). It'll still work fine from source, of course, and running > it as a simple application rather than a service should still be fine. > > Apologies for this! I'll check in fixes in a moment. > > =Tony Meyer > > --- > Please always include the list (spambayes@python.org) in your replies > (reply-all), and please don't send me personal mail about SpamBayes. This > way, you get everyone's help, and avoid a lack of replies when I'm busy. > From tameyer at ihug.co.nz Mon Feb 16 18:47:08 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 16 18:47:30 2004 Subject: [Spambayes] Spambayes Plugin for Evolution In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD94F@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467797C@its-xchg4.massey.ac.nz> > I absolutely LOVE your Spambayes plugin for Outlook. Works > great for when I'm using my Windows machine. I was just > curious if there's been any thought about creating a similar > plugin for Evolution? It's been requested a few times, but AFAIK no-one has actually started work on one. > I've seen alot of posts in numerous Linux forums about people > looking for this kind of functionality in order to break > completely from Windows and thought it wouldn't be too much > of a stretch for you. First off, please feel free to write one yourself! The source is all there, waiting for someone to do it.... I think that there are two main reasons why there isn't one at the moment, and why the current developers are unlikely to create one: 1. AFAIK, none of the developers use Evolution. No itch, no scratching. (I would maybe use it, if there was a zero-cost way for me to use it with Exchange, otherwise my zero-cost* Outlook solution suits me better). 2. Using sb_server works pretty well. You get the same results, and all you miss out on is a few buttons on a toolbar, plus you get a nice web interface. If you want drag-and-drop training, you could try the sb_pop3dnd.py script, which offers this (it's still quite alpha, though). =Tony Meyer * Not because I'm ripping Microsoft off, but because other people are paying for me. --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 16 19:51:34 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 16 19:51:58 2004 Subject: [Spambayes] Problem with POP3 Proxy In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD8D3@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AB4@its-xchg4.massey.ac.nz> > Thanks for the quick response! Look forward to seeing > 1.0a10, or 1.0aPI^e, or whatever the next one is. :) It should be 1.0b1, but it's looking like maybe it'll be 1.0a10. > In related news - despite numbers like these: [Total emails > trained: Spam: 1549 Ham: 888], I've been pretty consistently > getting 5-10 "unsures" a day... and recently, a couple of > false negatives, too. (Though only one or two false > positives over the life of the installation, which is great.) > I wonder if this is due to smarter / more pathological spam > in recent days. The only real way to figure out why they're scoring what they are is to look at the clues for the message. If you can't figure out why it's scored what it has, then feel free to post an example set of clues here and we can try and figure it out for you. 5-10 "unsures" per day could be ok, BTW, depending on how much mail you get. Most of the testing, IIRC, tends to result in 2-5% unsure, so if you're getting 250-500 messages per day (easy enough with a few high volume lists), then this is a pretty reasonable result. Are the false negatives a couple per day, or a couple every now and then? These should be much less common, although if they're something quite different to what you've trained on before, hard to avoid. One thing that can make a difference is the training 'regime' that you use. The wiki (http://entrian.com/sbwiki) has lots of details, but the three most common are 'train on everything', 'mistake based training' (train on unsures, false positives, and false negatives) and 'nonedge training' (train on anything within given edges, say 0.05 and 0.95). The latter two are usually the most successful, with nonedge probably slightly in the lead. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From sharris2 at cableone.net Mon Feb 16 21:01:51 2004 From: sharris2 at cableone.net (S Harris) Date: Mon Feb 16 21:01:52 2004 Subject: [Spambayes] trouble here Message-ID: <000501c3f4fa$05b25340$1a897718@DFWX4L11> I am unable to receive email through my INBOX .all incoming mail goes directly into my spam folder. Please HELP! sharris2@cableone.net >From the Desk of: Suzanne Harris -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040216/39ae8ca8/attachment.html From tameyer at ihug.co.nz Mon Feb 16 21:39:04 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 16 21:39:29 2004 Subject: [Spambayes] trouble here In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DDA29@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467797E@its-xchg4.massey.ac.nz> > I am unable to receive email through my INBOX; all > incoming mail goes directly into my spam folder. Use the "recover from spam" button to move the messages back to where they came from. Once you have done this with a few messages, SpamBayes will be able to use that training information to correctly classifying newly arriving messages. Note that you should endeavour to train roughly the same amount of ham (good mail) and spam. If, for example, you only train spam, everything will look like spam. If you haven't trained anything, then everything will look unsure, and end up in the "possible junk"/unsure folder. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 16 21:53:08 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 16 21:53:31 2004 Subject: [Spambayes] To many option displayed for "Storage options" In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DD9C8@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467797F@its-xchg4.massey.ac.nz> > In the web UI in the Advance option in the section "Storage Options" > You have: > Use database for storage: * mysql > * pgsql > * dbm > * pickle > * True > * False > * True > * False > > It looks to me that the True or False don't belong there. The reason for this is that the option was changed from a boolean to multi-choice, to allow more than two database types (the sql ones at the moment, plus maybe a zeo/zope one someday). I didn't want to break existing configurations, though (although this will happen at some point), so the option allows True as a synonym for "dbm" and False as a synonym for "pickle". Then it's *more* complicated, because when read from a config file True and False end up as "True" and "False" (a quirk of the options code). So "True" is the same as True and "False" is the same as False. Yes, it's ugly :/ > And the comment on the side is also strange. The comment was meant to explain why there are the different options. I had hoped that it would be clear enough (especially given that the option is hidden away in the Advanced page), but I guess it isn't. In your opinion, could be comment be made clear enough to leave the True/False/"True"/"False" options there (for one more revision), or should I add in code to alter this one specific option? (I'd rather not add option-specific code to the UI, since it's nice and general at the moment, and will need removing at some point). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From nemeth at sophiawireless.com Mon Feb 16 22:53:31 2004 From: nemeth at sophiawireless.com (David Nemeth) Date: Mon Feb 16 22:55:59 2004 Subject: [Spambayes] Weird Spam Message-ID: I've got SpamBayes trained up pretty well (872 good and 478 spam in my database). I just received the email you see attached, which slipped by the filter with a 1% score. The form of the spam seemed interesting, as if it was specificly designed to elude this type of filter. Anyway, I thought someone working on SpamBayes might find it interesting. David Nemeth Senior Engineer Sophia Wireless, Inc. 14225-C Sullyfield Circle Chantilly, VA Ph: (703) 961-9573 x206 Fax:(703) 961-9576 -------------- next part -------------- An embedded message was scrubbed... From: "Gordon Roan" Subject: Fwd: I need your help... Date: Mon, 16 Feb 2004 22:26:58 -0500 Size: 3294 Url: http://mail.python.org/pipermail/spambayes/attachments/20040216/34f773f9/attachment.mht From tameyer at ihug.co.nz Mon Feb 16 23:08:10 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 16 23:09:14 2004 Subject: [Spambayes] Weird Spam In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DDA72@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677980@its-xchg4.massey.ac.nz> > I've got SpamBayes trained up pretty well (872 good and 478 > spam in my database). I just received the email you see > attached, which slipped by the filter with a 1% score. With my current db, it scores 99%. (Together, we're a perfect 100 ). It's really the clues that are interesting - I'll attach mine at the end (note the large number of spam hapaxes, which carry the day). > The form of the spam seemed interesting, as if it was specifically > designed to elude this type of filter. It seems to be a combination of two avoidance techniques - 'word salad' and 'mini spam'. The first accounts for the 'random' words, almost all of which don't exist in my db (the theory is that 'word salad' doesn't help against statistical content filters because they are just as likely to hit words in the spam db or in neither db, as ones in the ham db). If the words were specifically harvested to target me, though (results of a "I'm feeling lucky" google for "Tony Meyer", for example) that would be another story. (You'll see in my clues, that I don't talk much about Elena, the camel who is heir to Sacramento, and her admission of bigotry ). "mini spam" could be a problem, if enough tokens aren't generated (ironically, for me, the harm from the mini-spam was countered by the use of word salad). One way around this is to tokenize whatever's at the end of the URL (there is an experimental option with SpamBayes to do this). I've got this turned on, but it's only used when needed, and wasn't here (for me). Quite often the URL itself, and the headers, are enough to counter this. =Tony Meyer Combined Score: 99% (0.993322) Internal ham score (*H*): 0.00658669 Internal spam score (*S*): 0.99323 # ham trained on: 100 # spam trained on: 223 51 Significant Tokens token spamprob #ham #spam 'analyze' 0.0918367 2 0 'issue' 0.14016 12 4 'angled' 0.155172 1 0 'grassland' 0.155172 1 0 'idiomatic' 0.155172 1 0 'ireland' 0.155172 1 0 'linus' 0.155172 1 0 'nixon' 0.155172 1 0 'rat' 0.155172 1 0 'proper' 0.167451 3 1 'between' 0.292455 10 9 'bi:the message' 0.328852 2 2 'wonderful' 0.328852 2 2 'subject:: ' 0.339228 15 17 'loading' 0.344569 1 1 'sugar' 0.344569 1 1 'wing' 0.344569 1 1 'skip:p 10' 0.34958 26 31 'white' 0.355755 5 6 'went' 0.3659 4 5 'bi:header:Subject:1 proto:http' 0.600175 37 124 'drive' 0.721907 2 12 'bi:header:Reply-To:1 header:Message-ID:1' 0.747164 11 73 'admission' 0.844828 0 1 'adrenal' 0.844828 0 1 'appian' 0.844828 0 1 'becloud' 0.844828 0 1 'beep' 0.844828 0 1 'bigotry' 0.844828 0 1 'camel' 0.844828 0 1 'elena' 0.844828 0 1 'epoch' 0.844828 0 1 'flair' 0.844828 0 1 'gustavus' 0.844828 0 1 'heir' 0.844828 0 1 'overdue' 0.844828 0 1 'prestige' 0.844828 0 1 'sacramento' 0.844828 0 1 'singleton' 0.844828 0 1 'sloth' 0.844828 0 1 'subject:need' 0.844828 0 1 'toefl' 0.844828 0 1 'url:es' 0.844828 0 1 'bi:try this' 0.908163 0 2 'reception' 0.908163 0 2 'subject:Fwd' 0.908163 0 2 'bi:url:1 url:gif' 0.934783 0 3 'walls' 0.934783 0 3 'bi:header:From:1 header:MIME-Version:1' 0.958716 0 5 'subject:...' 0.965116 0 6 'subject:your' 0.98951 0 21 --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From mikec at mikecauser.org Mon Feb 16 19:59:46 2004 From: mikec at mikecauser.org (Mike Causer) Date: Tue Feb 17 05:06:52 2004 Subject: [Spambayes] Problem with SpamBayes 1.0a9 & procmail In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130467797A@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13050DD951@its-xchg4.massey.ac.nz> <1ED4ECF91CDED24C8D012BCF2B034F130467797A@its-xchg4.massey.ac.nz> Message-ID: <20040217005946.077d700a.mikec@mikecauser.org> On Tue, 17 Feb 2004 12:21:17 +1300 Tony Meyer wrote: > This is definitely a database error (basically your database is > corrupt - it contains one or more tokens that have been seen in more > ham than you have trained, which is impossible). The solution is to > either manually fix the database or (preferably) retrain from scratch. Yes, a retrained from scratch db is OK. So far ;-) > As to *how* it got corrupted, I don't know :) Remi would have more of > an idea here, since he uses sb_filter/sb_mboxtrain and I don't. (If > do you figure out a way to corrupt the db, then we've love to know so > that we can protect against it). So now I have to dig through the .bash.histories & figure out how I corrupted the original within 10 minutes of creating it. Actually I think I do know, but it would be nice to come up with a sequence that anyone can repeat. As it's approaching two bells of the middle watch I think I'll resume in the morning. Mike -- Mike Causer Email - mailto:mikec@mikecauser.org GPG KeyID 1C2DDA07 WWW - http://www.mikecauser.org Flood the fen again! - Wicken Fen enlargement - http://www.wicken.org.uk From Amir_Katz at bmc.com Tue Feb 17 07:58:06 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Tue Feb 17 07:59:30 2004 Subject: [Spambayes] Two issues with the Outlook plug-in (1.09a) Message-ID: I'm using the latest version (1.0a9). Here are a two issues: 1. IMO, there should be a 'help->version' button that just displays the version. The current 'help->about' button goes to the locally installed web page (*/docs/outlook/about.html). I had to look for the exe name in order to remember the exact version. 2. If I click on the button 'show spam clues for current message', a new mail message is created, with the clues, etc. However, it seems that the original message is now marked as read. My question is - does it also trigger sending a 'read receipt' if the original message had this flag enabled? I know that any self-respecting spam (:-) message will not have this flag, but I am curious about this, as I sometimes use this feature to peep into a message without actually opening it (exactly so it won't send a 'read receipt'). Other than that, it's a great product and a great team behind it! Amir "The C Programming Language - A language which combines the flexibility of assembly language with the power of assembly language." -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040217/22cd610a/attachment.html From Amir_Katz at bmc.com Tue Feb 17 08:11:33 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Tue Feb 17 08:12:59 2004 Subject: [Spambayes] Outlook plug-in vs. other methods Message-ID: If I use Outlook 2000 and wish to use SpamBayes, is the plug-in my only option or is there another way? If yes, what are the pros and cons of each? If there is an FAQ for this, please direct me to it (RTFFAQ? :-) Thanks, Amir "The C Programming Language - A language which combines the flexibility of assembly language with the power of assembly language." -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040217/e4fd7c00/attachment.html From qub at qub.com Tue Feb 17 09:09:28 2004 From: qub at qub.com (Mikhail Yakoubov) Date: Tue Feb 17 09:09:34 2004 Subject: [Spambayes] SpamBayes 1.0a9 (0.9) References: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AAD@its-xchg4.massey.ac.nz> Message-ID: <40322098.CA3AE805@qub.com> Tony Meyer wrote: > > It really wrappes the problem we've discuss recently -- those of > > malformed multipart messages. > [...] > > But where the unparseable did gone? It's not in the > > Inbox, so probably to Unsure. Indeed, there is a message there, > > consisting of three lines in View Source: > [...] > > and nothing more, no subject, no multipart, nothing else. And > > it's like this in the raw mailfolder file also (Netscape 4.8). > > > > Guess it's not a big deal, as far as the malformed was a spam anyway, > > but probably that's (eating up the whole message and replacing it with > > custom Spambayes headers) not what you've intended. [...] > I'm working up a proper solution to the problem, and will check it into CVS > shortly. If all goes to plan, the break between this release and the next > will be much shorter than the break between 1.0a7 and 1.0a9, so it should be > out soon. Otherwise, once it's in anonymous CVS, you can grab > sb_imapfilter.py v1.26, drop it in place of your existing sb_imapfilter.py > script, and all will be well. No, CVS sb_imapfilter.py 1.26 (and 1.27 too) won't work this way -- statistics_database.db is an existing database, with 3807 spam and 4645 ham Done. Classifying *Traceback (most recent call last): File "C:\Qub\Soft\Python\Scripts\sb_imapfilter.py", line 904, in ? run() File "C:\Qub\Soft\Python\Scripts\sb_imapfilter.py", line 894, in run imap_filter.Filter() File "C:\Qub\Soft\Python\Scripts\sb_imapfilter.py", line 766, in Filter self.unsure_folder) File "C:\Qub\Soft\Python\Scripts\sb_imapfilter.py", line 680, in Filter for msg in self: File "C:\Qub\Soft\Python\Scripts\sb_imapfilter.py", line 575, in __iter__ yield self[key] File "C:\Qub\Soft\Python\Scripts\sb_imapfilter.py", line 624, in __getitem__ msg.Save() File "C:\Qub\Soft\Python\Scripts\sb_imapfilter.py", line 490, in Save self.as_string()) File "C:\Qub\Soft\Python\Scripts\sb_imapfilter.py", line 458, in as_string return message.SBHeaderMessage.as_string(self, unixfrom) TypeError: as_string() takes exactly 1 argument (2 given) Regards, Mike Yakoubov. From kevin at foodogsoftware.com Tue Feb 17 10:19:15 2004 From: kevin at foodogsoftware.com (kevin@foodogsoftware.com) Date: Tue Feb 17 10:19:26 2004 Subject: [Spambayes] Problem with POP3 Proxy Message-ID: <200402171519.i1HFJFxW018193@ms-smtp-02-eri0.socal.rr.com> I am using SpamBayes POP3 Proxy Version 0.4 (February 2004) (source), with version 2.3.2 (#49, Oct 2 2003, 20:02:00) [MSC v.1200 32 bit (Intel)] of Python; my operating system is Windows 5.1.2600.2 (Service Pack 1). I have trained 0 ham and 11 spam. The problem I am having is: I am unable to train using redirect through my email client (The Bat). The smtp proxy doesn't return any clues. This function was working previous to v0.4. Here is the log from my email client: 2/17/2004, 05:11:57: SEND - sending mail messages - 3 messages in queue 2/17/2004, 05:11:57: SEND - connected to SMTP server 2/17/2004, 05:11:57: SEND - sending message to spambayes_spam@localhost !2/17/2004, 05:11:57: SEND - Message has not been sent. Server reply - !2/17/2004, 05:11:57: SEND - Server reports error. The response is: !2/17/2004, 05:11:57: SEND - Server reports error. The response is: 2/17/2004, 05:11:57: SEND - connection finished - 0 messages sent From mikec at mikecauser.org Tue Feb 17 11:49:25 2004 From: mikec at mikecauser.org (Mike Causer) Date: Tue Feb 17 13:15:35 2004 Subject: [Spambayes] Problem with SpamBayes 1.0a9 & procmail In-Reply-To: <20040217005946.077d700a.mikec@mikecauser.org> References: <1ED4ECF91CDED24C8D012BCF2B034F13050DD951@its-xchg4.massey.ac.nz> <1ED4ECF91CDED24C8D012BCF2B034F130467797A@its-xchg4.massey.ac.nz> <20040217005946.077d700a.mikec@mikecauser.org> Message-ID: <20040217164925.364e7a80.mikec@mikecauser.org> On Tue, 17 Feb 2004 00:59:46 +0000 Mike Causer wrote: > So now I have to dig through the .bash.histories & figure out how I > corrupted the original within 10 minutes of creating it. Actually I > think I do know, but it would be nice to come up with a sequence that > anyone can repeat. Hrumph. Failed to repeat the corruption. What I've been doing is to use the command line filter and have sb_server running at the same time, which is what I thought caused the first problem. Not today though :-( The worst I've done so far is to add [Hammie]\ntrain_on_filter=True to .spambayesrc which upsets sb_filter if it's run without the -t option. [mikec@lugh mikec]$ /usr/bin/sb_filter.py ./cleanmail Traceback (most recent call last): File "/usr/bin/sb_filter.py", line 239, in ? main() File "/usr/bin/sb_filter.py", line 231, in main action(msg) File "/usr/bin/sb_filter.py", line 163, in filter return h.filter(msg) File "/usr/lib/python2.3/site-packages/spambayes/hammie.py", line 120, in filter self.train(msg, is_spam, True) File "/usr/lib/python2.3/site-packages/spambayes/hammie.py", line 160, in train self.bayes.learn(tokenize(msg), is_spam) File "/usr/lib/python2.3/site-packages/spambayes/classifier.py", line 273, in learn self._add_msg(wordstream, is_spam) File "/usr/lib/python2.3/site-packages/spambayes/classifier.py", line 386, in _add_msg self._post_training() File "/usr/lib/python2.3/site-packages/spambayes/storage.py", line 251, in _post_training self._write_state_key() File "/usr/lib/python2.3/site-packages/spambayes/storage.py", line 244, in _write_state_key self.db[self.statekey] = (classifier.PICKLE_VERSION, File "/usr/lib/python2.3/shelve.py", line 130, in __setitem__ self.dict[key] = f.getvalue() File "/usr/lib/python2.3/bsddb/__init__.py", line 90, in __setitem__ self.db[key] = value bsddb._db.DBAccessError: (13, 'Permission denied -- put: attempt to modify a read-only tree') Time to apply Cardinal Biggles' most severe instrument of torture, methinks. Mike -- Mike Causer Email - mailto:mikec@mikecauser.org GPG KeyID 1C2DDA07 WWW - http://www.mikecauser.org Flood the fen again! - Wicken Fen enlargement - http://www.wicken.org.uk From gjewell at cnnxn.com Tue Feb 17 12:53:48 2004 From: gjewell at cnnxn.com (Greg Jewell) Date: Tue Feb 17 13:21:14 2004 Subject: [Spambayes] Trouble installing 0.9 addin for Outlook Message-ID: <741C3B0BAF7B1F4F8FCC50AC4766CBF0248B50@phoenix.ossconnexn.com> Hi All, I've been using the SpamBayes Outlook plugin 0.81 without any issues. I attempted to upgrade to the recent 0.9 release, but am getting the following error: There was an error initializing the SpamBayes addin Please re-start Outlook and try again. Are there any known causes of this? When I go back to the 0.81 addin, everything works as usual. I'm using Outlook XP on a Windows XP Professional SP1 machine. Thanks, Greg Jewell -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040217/8c2f711a/attachment.html From spomerenk at sancarloschamber.org Tue Feb 17 14:08:20 2004 From: spomerenk at sancarloschamber.org (Sheryl Pomerenk) Date: Tue Feb 17 14:04:45 2004 Subject: [Spambayes] folder Message-ID: <001f01c3f589$711a75b0$18010a0a@chamber.sancarloschamber.local> Hi - I accidentally closed the spam folder and can't figure out how to reopen it. Sheryl Pomerenk Sheryl Pomerenk, CEO San Carlos Chamber of Commerce www.sancarloschamber.org (650) 593-1068 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040217/844c602a/attachment.html From rhuib6 at xmsg.com Tue Feb 17 14:18:55 2004 From: rhuib6 at xmsg.com (Terrance Lane) Date: Tue Feb 17 15:26:50 2004 Subject: [Spambayes] Would you like something from the drug trolley ?! hw Message-ID: Hi there, It is time the world becomes aware that there is an easy and secure way to order medication. If you are well and not in need of anything, forgive me for intruding. However, if there is someone among your family or friends who is in need of medicines: please tell them about this e-mail. Take care from the Healing guru. For more information click below: http://www.allthemeds.biz/ada/index.htm Goodbye forever: click below: http://www.allthemeds.biz/r/r.htm abh xthzc u carls g nl tajprytwkzqfxiduwwaq From mfaulhaber at comcast.net Tue Feb 17 17:08:18 2004 From: mfaulhaber at comcast.net (Mark. E. Faulhaber) Date: Tue Feb 17 17:07:39 2004 Subject: [Spambayes] Where are the folders? Message-ID: I am running Windows 98SE and the latest version of spambayes. I used Outlook 2000 to be updated with my Outlook Express info and downloaded spambayes. As soon as I opened Outlook,I saw that all the new unread messages started to disappear. I can't find the "junk mail" or "unsure mail" folders to classify the messages. Where are they? I did a "FIND" for these folders and got no results. From mfaulhaber at comcast.net Tue Feb 17 17:23:21 2004 From: mfaulhaber at comcast.net (Mark. E. Faulhaber) Date: Tue Feb 17 17:22:40 2004 Subject: [Spambayes] FW: Where are the folders? Message-ID: Sorry, I found them in the Outlook "View" menu. -----Original Message----- From: Mark. E. Faulhaber [mailto:mfaulhaber@comcast.net] Sent: Tuesday, February 17, 2004 5:08 PM To: spambayes@python.org Subject: Where are the folders? I am running Windows 98SE and the latest version of spambayes. I used Outlook 2000 to be updated with my Outlook Express info and downloaded spambayes. As soon as I opened Outlook,I saw that all the new unread messages started to disappear. I can't find the "junk mail" or "unsure mail" folders to classify the messages. Where are they? I did a "FIND" for these folders and got no results. From tameyer at ihug.co.nz Tue Feb 17 17:30:03 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Feb 17 17:30:32 2004 Subject: [Spambayes] SpamBayes 1.0a9 (0.9) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DDB66@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AB5@its-xchg4.massey.ac.nz> > No, CVS sb_imapfilter.py 1.26 (and 1.27 too) won't work this way -- [...] You need message.py from CVS as well. Sorry - I wrote the message to you before I had finished checking in the changes and testing them. The FAQ about this problem does say that you need both. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Tue Feb 17 18:03:09 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Feb 17 18:03:35 2004 Subject: [Spambayes] Outlook plug-in vs. other methods In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DDB56@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AB8@its-xchg4.massey.ac.nz> > If I use Outlook 2000 and wish to use SpamBayes, > is the plug-in my only option or is there another way? If you get your mail from POP3, you can use sb_server instead. If you get your mail from IMAP4, you can use sb_imapfilter instead (this involves running from source). If you use Exchange, you have no option; if you use Hotmail, then you'd have to get the Hotmail mail via some sort of POP/IMAP gateway. > If yes, what are the pros and cons of each? Most importantly, you should get the same filtering results with any of the applications. The plug-in is an integrated solution. In almost all cases you'd want to use this rather than the less integrated solutions. You get training via drag-and-drop, training within the mail application (rather than via the web interface), simpler configuration, a setup wizard, the ability to (easily) filter/rescore mail already in the client, and the plug-in takes care of moving messages for you. I can't think of any reason that you'd want to use sb_imapfilter instead of the plug-in. You could use sb_server if you wanted to be able to do training remotely (via the web interface). The only other reason I can think of is that the plug-in just didn't work on your system - sb_server has a simpler job to do, and so is likely to. However, it'd still be better to report the problem and use the plug-in when it's fixed. sb_server does get slightly 'cleaner' mail than the plug-in (Outlook mucks about with it before the plug-in gets a chance), but I don't believe that anyone's found that to actually have any effect. Can anyone think of any others? (Note that none of this should be taken as criticism of sb_server or sb_imapfilter or the web interface - I believe they do a good job; the advantages all come of the tight integration with the mail client). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Tue Feb 17 18:08:15 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Feb 17 18:08:41 2004 Subject: [Spambayes] Two issues with the Outlook plug-in (1.09a) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DDB4D@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AB9@its-xchg4.massey.ac.nz> > 1. IMO, there should be a 'help->version' button > that just displays the version. If you do SpamBayes->SpamBayes Manager, the version is there. Or you can do SpamBayes->Check for new version and see if it's the latest (which is all most people care about anyway). If you need it to report a bug, then just including the log file will tell us all we need to know (and there is an integrated bug report system, like the new sb_server one, in the works). That said, David Walker asked for something along these lines just a few days back, so I'll open a feature request and see what Mark thinks. > If I click on the button 'show spam clues for current > message', a new mail message is created, with the > clues, etc. However, it seems that the original message > is now marked as read. My question is - does it also > trigger sending a 'read receipt' if the original message > had this flag enabled? There's various stuff about this here: [ 743384 ] Show Spam Clues Marks Message as Read > Other than that, it's a great product and a great team behind it! Thanks from us all :) =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Tue Feb 17 19:00:14 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Feb 17 19:00:42 2004 Subject: [Spambayes] Problem with POP3 Proxy In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DDB85@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2ABA@its-xchg4.massey.ac.nz> > I have trained 0 ham and 11 spam. Note that this will give you terrible results - everything will look like spam. You should try and keep the ham and spam numbers approximately even. > The problem I am having is: I am unable to train using redirect > through my email client (The Bat). The smtp proxy doesn't return > any clues. This function was working previous to v0.4. Sorry - there is a bug with the SMTP proxy in the latest release. It's already been fixed in CVS, so will make it into the next release. If you like, you should be able to get smtpproxy.py from CVS (version 1.6), drop it into your Lib/site-packages/spambayes folder, and that should fix it. Other than that, all I can recommend is to use the web interface for training, for the moment. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From mhammond at keypoint.com.au Tue Feb 17 19:02:14 2004 From: mhammond at keypoint.com.au (Mark Hammond) Date: Tue Feb 17 19:02:38 2004 Subject: [Spambayes] Two issues with the Outlook plug-in (1.09a) In-Reply-To: Message-ID: <103801c3f5b2$7790ada0$0200a8c0@eden> > 1. IMO, there should be a 'help->version' button that > just displays the version. The 'SpamBayes Manager' is supposed to show the version, on the 'General' tab. Is that not being displayed for you? > 2. If I click on the button 'show spam clues for current message', a new mail > message is created, with the clues, etc. However, it seems that the > original message is now marked as read. My question is - does it also trigger > sending a 'read receipt' if the original message had this flag enabled? http://spambayes.sourceforge.net/faq.html, question 3.10 Regarding your other question regarding the pros and cons of the addin versus the server program: If you use Outlook exclusively, then I know of no good reason to use the server program, as the addin does everything it does, and a little more. If Outlook is not your exclusive mailer, and advantage of the server program is that you can share the system amongst them all. Mark. From tameyer at ihug.co.nz Tue Feb 17 19:17:18 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Feb 17 19:17:43 2004 Subject: [Spambayes] Weird Spam In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13050DDB8D@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2ABC@its-xchg4.massey.ac.nz> > I don't know if this was preserved in the attachment, but the > actual displayed message was a .gif file downloaded from the > web when the message was viewed. That's what I meant by a "mini spam". Just a link (whether to a webpage or loading an image) and nothing else in the message. These can sometimes work, but often the URL itself (to the image, or to the webpage) or the headers of the message give it away. If it's a link to a webpage, rather than an image, then you can also process the text of the webpage (although there are issues with this). A link to an image is more difficult, of course. > Also, the small amount of > actual text displayed was obscured by nonsense tags: > > If the message > is not > loading href="http://www.terra.es/personal5/554664/r1/">t >r RPHIC>y > this > > comes out to be: > [click here] if the message is not loading. SpamBayes strips out the html tags, so what gets tokenized is "[click here] if the message is not loading". This is a really ineffective technique, since it's so easy to avoid. The similar 'almost white on white' text between text type tricks are much more effective, since you need to start considering how the text looks when rendered, instead of just skipping everything. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From Amir_Katz at bmc.com Wed Feb 18 02:35:03 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Wed Feb 18 02:37:41 2004 Subject: [Spambayes] Outlook plug-in vs. other methods Message-ID: I use Outlook with Exchange, so the answer is very clear. Thanks for the detailed explanation. And the plug-in is excellent, I just wanted to make sure that I'm not missing anything. Amir -----Original Message----- From: Tony Meyer [mailto:tameyer@ihug.co.nz] Sent: Wednesday, February 18, 2004 01:03 To: 'Katz, Amir'; 'Spambayes mailing list (E-mail)' Subject: RE: [Spambayes] Outlook plug-in vs. other methods > If I use Outlook 2000 and wish to use SpamBayes, > is the plug-in my only option or is there another way? If you get your mail from POP3, you can use sb_server instead. If you get your mail from IMAP4, you can use sb_imapfilter instead (this involves running from source). If you use Exchange, you have no option; if you use Hotmail, then you'd have to get the Hotmail mail via some sort of POP/IMAP gateway. > If yes, what are the pros and cons of each? Most importantly, you should get the same filtering results with any of the applications. The plug-in is an integrated solution. In almost all cases you'd want to use this rather than the less integrated solutions. You get training via drag-and-drop, training within the mail application (rather than via the web interface), simpler configuration, a setup wizard, the ability to (easily) filter/rescore mail already in the client, and the plug-in takes care of moving messages for you. I can't think of any reason that you'd want to use sb_imapfilter instead of the plug-in. You could use sb_server if you wanted to be able to do training remotely (via the web interface). The only other reason I can think of is that the plug-in just didn't work on your system - sb_server has a simpler job to do, and so is likely to. However, it'd still be better to report the problem and use the plug-in when it's fixed. sb_server does get slightly 'cleaner' mail than the plug-in (Outlook mucks about with it before the plug-in gets a chance), but I don't believe that anyone's found that to actually have any effect. Can anyone think of any others? (Note that none of this should be taken as criticism of sb_server or sb_imapfilter or the web interface - I believe they do a good job; the advantages all come of the tight integration with the mail client). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From phess at phess.best.vwh.net Wed Feb 18 06:39:15 2004 From: phess at phess.best.vwh.net (Patrick Hess) Date: Wed Feb 18 06:39:22 2004 Subject: [Spambayes] Problem with Outlook Plugin Message-ID: <10312672343.20040218033915@phess.best.vwh.net> Hello, I've been using (and loving) SpamBayes at home for a while now. I decided to install the Outlook Plugin (0.7? 0.8?) at work and, for some reason, it seized up (doesn't sort anything and won't take any hints). I've tried uninstalling and reinstalling a number of times and nothing fixes it. I suspect there's a problem in the training or configuration. I would like to start clean by uninstalling the plugin and then I will remove the training and configuration. Can someone point me at which files to remove once the plugin has been uninstalled? Please reply to this email address as I am not subscribed to any of your distributions. Thank you for a wonderful product and keep up the good work! Best regards, Patrick mailto:phess@phess.best.vwh.net From mlacroi at videotron.ca Wed Feb 18 09:25:52 2004 From: mlacroi at videotron.ca (Michel Lacroix) Date: Wed Feb 18 08:18:08 2004 Subject: [Spambayes] New version slow Message-ID: <000801c3f62b$20a42200$e44b8342@0sjbd> I have ver 0.9 add-in to Outlook 2000, running W2K Pro and I have read the troubleshooting and FAQ pages. Problem: new ver is very slow since I installed it. Prior ver was fast and could hardly see incoming mail as it was checked. With new ver all incoming mail is put in in-box, it pauses for a while and then starts slowly to delete spam. M Lavroix -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040218/cf32c8dc/attachment.html From rmalayter at bai.org Wed Feb 18 09:17:53 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Feb 18 09:18:02 2004 Subject: [Spambayes] train on blank spam messages Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01E19D4A@cliff.bai.org> I frequently get entirely blank messages (no subject or body, sometimes even without FROM or TO address). These are obviously abortive spam attempts, generated by buggy spam ware. Should I train on these? I have been, figuring that SpamBayes could at least generate subject:None tokens, and perhaps something from the Received headers. Although, I notice that SpamBayes doesn't mine the class-B or class-C network from the Received header. Has this been tried? Or is it useless in this day of spam-spewing, compromised home machines? That a message came directly from a machine on PacBell's DSL network, rather than a well-known PacBell SMTP relay, would seem to be a fairly strong spam clue to me. Here's an example: ----------------------------- Combined Score: 26% (0.258597) Internal ham score (*H*): 0.829696 Internal spam score (*S*): 0.346891 # ham trained on: 3634 # spam trained on: 3146 7 Significant Tokens token spamprob #ham #spam 'from:none' 0.013024 367 4 'message-id:invalid' 0.0680032 367 23 'reply-to:none' 0.335768 3028 1325 'cc:none' 0.623476 2025 2903 'header:Received:2' 0.745428 360 913 'to:none' 0.755966 14 38 'sender:none' 0.767726 1057 3025 Message Stream X-MS-Mail-Gibberish: Microsoft Mail Internet Headers Version 2.0 Received: from adsl-67-125-217-122.dsl.lsan03.pacbell.net ([67.125.217.122]) by smtp.bai.org with Microsoft SMTPSVC(5.0.2195.6713); Wed, 18 Feb 2004 00:35:23 -0600 Received: from 44.54.208.104 by 67.125.217.122; Wed, 18 Feb 2004 07:37:38 +0100 Message-ID: All Message Tokens 10 unique tokens 'cc:none' 'content-type:text/plain' 'from:none' 'header:Message-ID:1' 'header:Received:2' 'message-id:invalid' 'reply-to:none' 'sender:none' 'to:none' 'x-mailer:none' From Peter.Houppermans at paconsulting.com Wed Feb 18 11:02:40 2004 From: Peter.Houppermans at paconsulting.com (Peter Houppermans) Date: Wed Feb 18 11:02:45 2004 Subject: [Spambayes] RE: Spambayes-announce Digest, Vol 7, Issue 1 Message-ID: <3BB26977D1FB7E4F9E21092BE51D1D5C459928@lonmbx102.paconsulting.com> [private email] Hi guys, thanks for the latest release, I'll have a look at the release notes to see if it catches some of the smaller bugs I found. Question: are you talking to the people of the DSPAM project (http://www.nuclearelephant.com/projects/dspam/)? They appear to have dreamt up a couple of improvements on the Bayesian filtering - not sure if they're on to something or whther they haven't quite gone through the same process as you guys ;-). Kind regards, /// Peter /// -----Original Message----- From: spambayes-announce-request@python.org [mailto:spambayes-announce-request@python.org] Sent: Wednesday, February 18, 2004 3:02 PM To: spambayes-announce@python.org Subject: Spambayes-announce Digest, Vol 7, Issue 1 Send Spambayes-announce mailing list submissions to spambayes-announce@python.org To subscribe or unsubscribe via the World Wide Web, visit http://mail.python.org/mailman/listinfo/spambayes-announce or, via email, send a message with subject or body 'help' to spambayes-announce-request@python.org You can reach the person managing the list at spambayes-announce-owner@python.org When replying, please edit your Subject line so it is more specific than "Re: Contents of Spambayes-announce digest..." Today's Topics: 1. SpamBayes 1.0a9 (0.9) (Tony Meyer) ---------------------------------------------------------------------- Message: 1 Date: Fri, 13 Feb 2004 17:12:47 +1300 From: "Tony Meyer" Subject: [Spambayes-announce] SpamBayes 1.0a9 (0.9) To: , Cc: spambayes@python.org Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AA9@its-xchg4.massey.ac.nz> Content-Type: text/plain; charset="US-ASCII" The SpamBayes team is pleased to announce the latest release of SpamBayes - 1.0a9 (0.9). This is both a release of the source code *and* of binaries for Microsoft Windows users, including, for the first time, an installation program for all Windows users, not only those that use Microsoft Outlook. As such, we recommend that all SpamBayes users, including those currently running from source, and those using the Outlook plug-in, upgrade to this release. The Windows installation program will install either the Outlook add-in (for Microsoft Outlook users), or the SpamBayes server program (for all other mail client users, including Microsoft Outlook Express). All Windows users (including existing users of the Outlook add-in) are encouraged to use the installation program. If you wish to use the source-code version, you will also need to install Python - see README.txt in the source tree (or view it online) for more information. This release fixes a large number of bugs in the last release, and includes a few new features, too. For a detailed description of everything (well, everything we remember) that has changed since the last release, you can view our WHAT_IS_NEW.txt file, either online, or in the source distribution. Get it from the 'Download' page at http://www.spambayes.org/download.html Enjoy the new release and your spam-free mailbox :-) Tony (on behalf of the SpamBayes team) --- What is SpamBayes? --- The SpamBayes project is working on developing a Bayesian (of sorts) anti-spam filter (in Python), initially based on the work of Paul Graham. The major difference between this and other, similar projects is the emphasis on testing newer approaches to scoring messages. The project includes a number of different applications, all using the same core code, ranging from a plug-in for Microsoft Outlook, to a POP3 proxy, to various command-line tools. ------------------------------ _______________________________________________ Spambayes-announce mailing list Spambayes-announce@python.org http://mail.python.org/mailman/listinfo/spambayes-announce End of Spambayes-announce Digest, Vol 7, Issue 1 ************************************************ From kennypitt at hotmail.com Wed Feb 18 11:45:53 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Feb 18 11:47:00 2004 Subject: [Spambayes] folder In-Reply-To: <001f01c3f589$711a75b0$18010a0a@chamber.sancarloschamber.local> Message-ID: I'm assuming that you probably mean deleted when you say closed. If so, you can probably find the answer you need in FAQ 3.13: http://spambayes.sourceforge.net/faq.html#help-i-deleted-the-unsure-spam -folder -- Kenny Pitt _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Sheryl Pomerenk Sent: Tuesday, February 17, 2004 2:08 PM To: spambayes@python.org Subject: [Spambayes] folder Hi - I accidentally closed the spam folder and can't figure out how to reopen it. Sheryl Pomerenk Sheryl Pomerenk, CEO San Carlos Chamber of Commerce www.sancarloschamber.org (650) 593-1068 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040218/d61e8338/attachment-0001.html From russ_foster at comcast.net Wed Feb 18 13:16:05 2004 From: russ_foster at comcast.net (Russ Foster) Date: Wed Feb 18 13:08:08 2004 Subject: [Spambayes] *nix version In-Reply-To: Message-ID: I've got the Outlook plugin running on my work account with excellent results. I've since Spambayes working at home on my Linux (RedHat 7.3) machine via procmail. Is there a way to get the ham/spam ratio from the *nix version? The documentation is a bit sparse. Thanks. -r From brian.gatfield at ewa-australia.com Wed Feb 18 13:59:18 2004 From: brian.gatfield at ewa-australia.com (Brian Gatfield) Date: Wed Feb 18 13:58:01 2004 Subject: [Spambayes] Configuring Spam Folder Message-ID: I have enabled Spambayes and have trained Spambayes - but I cannot set the location of the folders where the certain and unsure spam is to be moved to. I can define the filters such that the moved to folders are defined but when I back out of the Define Filters window, the location of the folders for the spam change to . I cannot set the folders and everytime I try to use Spambayes, I get a message stating I must configure the spam folders. How? Brian Gatfield From kennypitt at hotmail.com Wed Feb 18 14:49:24 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Feb 18 14:51:12 2004 Subject: [Spambayes] New version slow In-Reply-To: <000801c3f62b$20a42200$e44b8342@0sjbd> Message-ID: This is probably a result of the background filtering option. This option allows any Outlook rules to be processed before SpamBayes tries to move the messages, but it does so at the expense of slower processing. Older versions had this option disabled by default, but many of the problems reported to this list involved conflicts with Outlook rules so it was decided to make it enabled by default in the new version. If you don't use Outlook rules to move any of your messages, you can disable the option on the Advanced tab in SpamBayes Manager. -- Kenny Pitt _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Michel Lacroix Sent: Wednesday, February 18, 2004 9:26 AM To: spambayes@python.org Subject: [Spambayes] New version slow I have ver 0.9 add-in to Outlook 2000, running W2K Pro and I have read the troubleshooting and FAQ pages. Problem: new ver is very slow since I installed it. Prior ver was fast and could hardly see incoming mail as it was checked. With new ver all incoming mail is put in in-box, it pauses for a while and then starts slowly to delete spam. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040218/8d04f3d2/attachment.html From russ_foster at comcast.net Wed Feb 18 15:19:37 2004 From: russ_foster at comcast.net (Russ Foster) Date: Wed Feb 18 15:11:40 2004 Subject: [Spambayes] New version slow In-Reply-To: <000801c3f62b$20a42200$e44b8342@0sjbd> Message-ID: Have you changed the "Processing start delay" and "Delay between processing items"? (See the Manager, click on the "Advanced" tab.) You may want to (re)set these options for the new install. -r On Wed, 18 Feb 2004, Michel Lacroix wrote: > I have ver 0.9 add-in to Outlook 2000, running W2K Pro and I have read > the troubleshooting and FAQ pages. > > Problem: new ver is very slow since I installed it. Prior ver was fast > and could hardly see incoming mail as it was checked. With new ver all > incoming mail is put in in-box, it pauses for a while and then starts > slowly to delete spam. > > M Lavroix > From scott.job at tetratech.com Wed Feb 18 15:16:51 2004 From: scott.job at tetratech.com (Scott Job) Date: Wed Feb 18 15:13:09 2004 Subject: [Spambayes] Possible bug? Message-ID: Hello, I started running SpamBayes yesterday. Everything went fine with the installation. I am having a problem when I use the Recover From Spam function in my Unsure folder - SpamBayes puts the message back into the wrong Inbox. Details follow: I have two accounts, a POP3 and an IMAP, both configured in Outlook 2000 SP-3. I am not filtering the POP3 account since I never receive spam at that email address. I do filter the IMAP account, which gets some spam (5-10x or so a day). I leave recent messages on the server for my IMAP account. Older messages get moved into locally stored Outlook folders. I set up my Unsure folder within my IMAP account (since I need access to recent messages there). The SPAM folder is stored locally, not in the IMAP account. So far, ham stays in my IMAP Inbox, possible spam goes into the IMAP Unsure folder, and spam goes into the locally stored SPAM folder. However, when I have ham in the Unsure folder and I click the Recover From Spam button, the message gets moved to the local Inbox, not the IMAP Inbox. Note that POP3 email also goes into the local Inbox (this detail may or may not be important). Also, both inboxes are named "Inbox" within Outlook, in case it's important. Let me know if you need further information, happy to help. If there is a known fix, let me know that as well. Thanks, Scott Job From kennypitt at hotmail.com Wed Feb 18 16:15:48 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Feb 18 16:16:53 2004 Subject: [Spambayes] train on blank spam messages In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A01E19D4A@cliff.bai.org> Message-ID: Ryan Malayter wrote: > I frequently get entirely blank messages (no subject or body, > sometimes even without FROM or TO address). These are obviously > abortive spam attempts, generated by buggy spam ware. > > Should I train on these? I have been, figuring that SpamBayes could at > least generate subject:None tokens, and perhaps something from the > Received headers. If SpamBayes is not correctly classifying them, I personally don't see a reason not to train on them. > Although, I notice that SpamBayes doesn't mine the > class-B or class-C network from the Received header. Has this been > tried? Or is it useless in this day of spam-spewing, compromised home > machines? That a message came directly from a machine on PacBell's DSL > network, rather than a well-known PacBell SMTP relay, would seem to > be a fairly strong spam clue to me. This is an option that is disabled by default, but you can enable it easily enough. If you're using the Outlook plugin, you'll need to create a file name "default_bayes_customize.ini" in your "Documents and Settings\username\Application Data\SpamBayes" directory. In the file, insert the following 2 lines: [Tokenizer] mine_received_headers:True -- Kenny Pitt From tameyer at ihug.co.nz Wed Feb 18 19:16:41 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 18 19:17:07 2004 Subject: [Spambayes] RE: [spambayes-dev] Suggestion In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305255CF3@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677996@its-xchg4.massey.ac.nz> [Moving to spambayes@python.org] > I love how SpamBays works, but I have a suggestion.............. > > When I find SPAM that was not caught by Spam Bays, > it would be great if there was an option that I could > set so that when I click on the "Delete as Spam" button, > it actually moved the item to the Deleted Items folder > instead of the Spam folder (where I have to go and manual > delete the items anyway - I suspect that this is a MS Outlook issue). FAQ 3.11 has answers to this, as well as some tips about alternatives. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From rmalayter at bai.org Wed Feb 18 19:19:55 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Feb 18 19:19:59 2004 Subject: [Spambayes] train on blank spam messages Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01E19D6E@cliff.bai.org> [Kenny Pitt] > This is an option that is disabled by default, but you can enable it > easily enough. If you're using the Outlook plugin, you'll need to > create a file name "default_bayes_customize.ini" in your > "Documents and > Settings\username\Application Data\SpamBayes" directory. In the file, > insert the following 2 lines: > > [Tokenizer] > mine_received_headers:True Is this option off by default because it did not provide a net gain for most corpora? Or is it still experimental? From tameyer at ihug.co.nz Wed Feb 18 19:23:04 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 18 19:23:40 2004 Subject: [Spambayes] *nix version In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305255E2E@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AC0@its-xchg4.massey.ac.nz> > Is there a way to get the ham/spam ratio from the *nix version? The > documentation is a bit sparse. I don't know if there's a better way, but you can do "sb_server.py -o globals:verbose:True" and look at what's printed to stdout. (Starting sb_server if you don't have any proxies configured will just start the web interface, and you can stop it without any harm, although it might create the cache directories). Alternatively, you could patch contrib/spamcounts.py to do this (and maybe submit a patch back). "db.nspam" and "db.nham" are the attributes that you need. The sb_dbexpimp.py script can convert the db to text, but that's overkill for what you need, really. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 18 19:24:48 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 18 19:25:14 2004 Subject: [Spambayes] Configuring Spam Folder In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305255E20@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677998@its-xchg4.massey.ac.nz> > I have enabled Spambayes and have trained Spambayes - but I > cannot set the > location of the folders where the certain and unsure spam is > to be moved to. > I can define the filters such that the moved to folders are > defined but when > I back out of the Define Filters window, the location of the > folders for the > spam change to . I cannot set the folders and > everytime I try to > use Spambayes, I get a message stating I must configure the > spam folders. Sounds like something's not happy with your folder choices. Could you send us your log file? It should help us figure out what is going wrong. (Also - what version are you using? If it's not 1.0a9 (0.9), then you should try that). The Help->Troubleshooting Guide explains where to get your log files. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 18 19:27:52 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 18 19:28:21 2004 Subject: [Spambayes] Problem with Outlook Plugin In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305255D53@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677999@its-xchg4.massey.ac.nz> > I've been using (and loving) SpamBayes at home for a while now. > I decided to install the Outlook Plugin (0.7? 0.8?) I'd recommend using 1.0a9 (0.9) and not 0.7 or 0.81 - there are lots of bugfixes in the later release, and you're more likely to have success. > I would like to start clean by uninstalling the plugin and then > I will remove the training and configuration. Can someone point > me at which files to remove once the plugin has been uninstalled? The configuration is stored in a file called [profile name].ini (possibly "Outlook.ini"). The training is stored in a file called default_bayes_database.db. Both of these are a "SpamBayes" folder in your "Application Data" directory. The FAQ explains where to find this, but if you're using Windows NT/2k/XP, it's probably C:\Documents and Settings\[username]\Application Data, and if you're using Windows 95/98/ME, I think it's probably C:\Windows\Application Data. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 18 20:13:29 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 18 20:13:57 2004 Subject: [Spambayes] Trouble installing 0.9 addin for Outlook In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305255C89@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130467799C@its-xchg4.massey.ac.nz> > I've been using the SpamBayes Outlook plugin 0.81 > without any issues. I attempted to upgrade to the > recent 0.9 release, but am getting the following error: > There was an error initializing the SpamBayes addin > Please re-start Outlook and try again. > Are there any known causes of this? When I go back to > the 0.81 addin, everything works as usual. Could you send us your log file from when you try to do this? It should have more information about what's going wrong. Thanks! =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From hera at optonline.net Wed Feb 18 21:32:13 2004 From: hera at optonline.net (Erin Lazzaro) Date: Wed Feb 18 21:32:40 2004 Subject: [Spambayes] Maintain training with Outlook? Message-ID: <001101c3f690$9e16be40$6801a8c0@Peacemaker> The simple version first: What's the best way to maintain training with the Outlook plugin? The FAQ advises training on "a few ham and a few spam" on a regular basis, but it's not obvious how to train on ham that hasn't been misclassified. The "Recover from Spam" button is only present in the Junk Mail and Junk Mail Suspects folders, and the wizard completely rebuilds the database; is there something less drastic that I'm missing? My spam/ham ratio before filtering is probably close to 10:1. How do I choose which and how many messages to train on? Now, for extra credit: I don't actually use Outlook much -- I use Pocket Outlook on my PDA, and do as much as I can on the train. The plugin buttons obviously aren't available, and moving messages into the Junk folders doesn't work either. Apparently ActiveSync moves the messages behind the scenes and SpamBayes doesn't see them move. I made a couple of special folders, TrainJunk and TrainGood, to sort messages into when using Pocket Outlook; when I'm back on the desktop I "Delete as Spam" the entire TrainJunk folder. I haven't figured out what to do with the TrainGood folder yet. Does anyone see a less cumbersome way of handling this? Thanks, Erin From info at bloemendael.nl Thu Feb 19 03:59:47 2004 From: info at bloemendael.nl (info@bloemendael.nl) Date: Thu Feb 19 03:59:49 2004 Subject: [Spambayes] stolen Message-ID: is that from you? -------------- next part -------------- A non-text attachment was scrubbed... Name: information.zip Type: application/x-zip-compressed Size: 0 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040219/e0a96c75/information.bin From sdecorte at vub.ac.be Thu Feb 19 04:42:21 2004 From: sdecorte at vub.ac.be (Stefan De Corte) Date: Thu Feb 19 04:42:39 2004 Subject: [Spambayes] outlook causuing unknown error when closed Message-ID: <5.2.1.1.0.20040219104026.00b035e0@pop.vub.ac.be> Hello, has there ever been an answer to the question: using outlook 2002 whenever i close outlook instead of it going thru its normal closing routine it causes an "error in " so effectively outlook crashes each time i close it - any suggestions plse? posted by Glyn Sloman gsloman at ntlworld.com Sun Oct 19 13:37:22 EDT 2003 I have a similar problem I can't solve.. Thanks in advance, Stefan De Corte -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040219/3b189aba/attachment.html From stefan-mueller at siemens.com Thu Feb 19 05:28:13 2004 From: stefan-mueller at siemens.com (Mueller Stefan) Date: Thu Feb 19 05:28:24 2004 Subject: [Spambayes] Spam Bayes - moving DB to a different Folder Message-ID: <648018B8B54EFE418F3E543306755AAC0149E5A0@mchh2c6e.mchh.siemens.de> Hi, I'm using Spam bayes V1.09a. binary Installed for W2k. Spam Bayes created it's Dbs beneath the Application data/... folder. My Problem: My Profile can't exeed 16MB. Therefor I'd like to change the Location. I didn't find anything in registry, environment (>set), oder the installationFolder, google, and FAQs how to switch the location. Can anyone help me? Kind regards Stefan M?ller From stefan-mueller at siemens.com Thu Feb 19 05:40:19 2004 From: stefan-mueller at siemens.com (Mueller Stefan) Date: Thu Feb 19 05:40:27 2004 Subject: [Spambayes] Spam Bayes - moving DB to a different Folder Message-ID: <648018B8B54EFE418F3E543306755AAC0149E5A1@mchh2c6e.mchh.siemens.de> Hi, I'm using Spam bayes V1.09a. binary Installed for W2k. Spam Bayes created it's Dbs beneath the Application data/... folder. My Problem: My Profile can't exeed 16MB. Therefor I'd like to change the Location. I didn't find anything in registry, environment (>set), oder the installationFolder, google, and FAQs how to switch the location. Can anyone help me? Kind regards Stefan M?ller From sheimburger at softwareworks-inc.com Thu Feb 19 08:32:02 2004 From: sheimburger at softwareworks-inc.com (Steve Heimburger) Date: Thu Feb 19 08:32:52 2004 Subject: [Spambayes] (no subject) Message-ID: <004601c3f6ec$c75950e0$7a2ef344@demo> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 2615 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040219/3f17b273/attachment.jpe From rcoe at CambridgeMA.GOV Thu Feb 19 09:18:51 2004 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Thu Feb 19 09:19:10 2004 Subject: [Spambayes] RE: outlook causuing unknown error when closed Message-ID: <42393C9DA7930245AB540667607F4F5022CA86@SPIKE.city> For any of these unexplained crashes, try Outlook's "Detect & Repair" function. One of the Developers (Tony?) suggested doing that when I had a similar problem a few weeks ago, and it worked. Bob -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Stefan De Corte Sent: Thursday, February 19, 2004 4:42 AM To: spambayes@python.org Subject: [Spambayes] outlook causuing unknown error when closed Hello, has there ever been an answer to the question: using outlook 2002 whenever i close outlook instead of it going thru its normal closing routine it causes an "error in " so effectively outlook crashes each time i close it - any suggestions plse? posted by Glyn Sloman gsloman at ntlworld.com Sun Oct 19 13:37:22 EDT 2003 I have a similar problem I can't solve.. Thanks in advance, Stefan De Corte -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040219/7b68991b/attachment-0001.html From dickk at paragonconstructioninc.com Thu Feb 19 09:22:46 2004 From: dickk at paragonconstructioninc.com (dickk@paragonconstructioninc.com) Date: Thu Feb 19 09:23:10 2004 Subject: [Spambayes] Ham:Spam ratio Message-ID: I've seen a lot of posts regarding this ratio that indicate 1:1 or 1:2 is an appropriate ratio. My ratio is currently 446:5554 or about 1:121. My real email statistics are about 98% spam, 2% ham. When I started using SpamBayes two months ago, I didn't do any training, but rather started training on all new email. Now about 60% of my mail goes to Junk and 39% goes to Junk Suspects. Of the 39% unclassified, maybe 1% is actually ham. I'm not particularly unhappy with the way that it's working, I just don't know if I'm using it properly. My questions are: Should I expect that 39% of my mail is unclassified, or should it be less (or more)? If I retrain to get a ratio of 1:2 or better, will that decrease the amount of unclassified mail? If I retrain, how often should I retrain to keep the ratio proper? Thanks Dick Kusleika Paragon Construction, Inc. From dickk at paragonconstructioninc.com Thu Feb 19 09:43:55 2004 From: dickk at paragonconstructioninc.com (dickk@paragonconstructioninc.com) Date: Thu Feb 19 09:44:18 2004 Subject: [Spambayes] Outlook envelope icon Message-ID: Using Outlook2k I've read the faq on this and I wanted to share what I was doing. I'm hoping that someone has come up with a better way which hasn't made it to the faq. I use the Application_NewMail event to create an AppointmentItem with a start time of Now and ReminderMinutesBeforeStart = 0. Then I use the Application_Reminder event to determine the amount of unread mail in my Inbox. If none, I delete the envelope icon using Neo's code. And it deletes the AppointmentItem whether the envelope is deleted or not. Here are the problems: Even though the AppointmentItem is set to remind immediately, it takes between 10 and 30 seconds for it to actually run. This means that if I see an envelope, it may mean I have new mail or it may mean that the reminder event hasn't run yet. The reminder is displayed and steals the focus for a split second before it's deleted. Normally that's not a problem, but it can be an annoyance if I'm typing something during that split second. My deleted items folder has a lot of deleted reminders in it. I've manually set up a reminder for every Sunday to delete these reminders from the deleted items folder, but I won't know if this works well until this Sunday. Here's the code I'm using: Private Sub Application_NewMail() Dim ai As AppointmentItem Set ai = Me.CreateItem(olAppointmentItem) With ai .Subject = "ClearEnvelope" .Start = Now + TimeValue("00:00:02") .Duration = 5 .ReminderSet = True .ReminderMinutesBeforeStart = 0 .ReminderOverrideDefault = True .ReminderPlaySound = False .Save End With End Sub Private Sub Application_Reminder(ByVal Item As Object) Dim i As Long Dim TotItms As Long Dim Fldr As MAPIFolder Dim ItmCnt As Long Set Fldr = Me.GetNamespace("MAPI").GetDefaultFolder(olFolderDeletedItems) TotItms = Fldr.Items.Count If Item.Subject = "ClearEnvelope" Then MyNewMail Item ElseIf Item.Subject = "DeleteClears" Then 'this is every Sunday For i = TotItms To 1 Step -1 If Fldr.Items(i).Subject = "ClearEnvelope" Then Fldr.Items(i).Delete ItmCnt = ItmCnt + 1 End If Next i Debug.Print ItmCnt & " Items were deleted." End If Set Fldr = Nothing End Sub And in a standard module: Sub MyNewMail(Itm As Object) If Application.GetNamespace("MAPI").GetDefaultFolder(olFolderInbox) _ .UnReadItemCount = 0 Then RemoveNewMailIcon End If Itm.Delete End Sub If anyone has any comments or suggestions, I'd be happy to hear them. Thanks Dick Kusleika From geoff at controlg.com Thu Feb 19 11:02:59 2004 From: geoff at controlg.com (Geoff Campbell) Date: Thu Feb 19 11:03:13 2004 Subject: [Spambayes] Delete As Spam - Doesn't always work Message-ID: <000001c3f701$db4620a0$2e00a8c0@Dell2400> Hi - Sometimes (very rarely) when I select an obvious spam message and "click" on "Delete As Spam", nothing happens. It seems that the message is "SpamBayes proof"! This happens once every several days, but I can't see a pattern. Using Outlook 2002 (SP-1). Would be happy to send you a representative email. Thanks - Geoff Campbell PS - Otherwise, everything else is great. I used to have to visually scan approximately 150 emails per day - that is now down to less than 10! From qub at qub.com Thu Feb 19 11:03:58 2004 From: qub at qub.com (Mikhail Yakoubov) Date: Thu Feb 19 11:04:16 2004 Subject: [Spambayes] SpamBayes 1.0a9 (0.9) References: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AB5@its-xchg4.massey.ac.nz> Message-ID: <4034DE6E.109E650@qub.com> Tony Meyer wrote: > > No, CVS sb_imapfilter.py 1.26 (and 1.27 too) won't work this way -- > [...] > > You need message.py from CVS as well. Sorry - I wrote the message to you > before I had finished checking in the changes and testing them. The FAQ > about this problem does say that you need both. Yes, it works this way, but has _very_ undesired side effect. Since I've substituded both files with newer versions yesterday, ham get stuck in my Inbox. That is, my client refuses to consider it newly arrived, as it was before, and thus don't apply filtering rules to it. So, ham don't get sorted, and I have to sort it manualy, as Messenger's filtering mechanic works only on the messages in Inbox it considers newly arrived. Not sure what flags are responsible for the message to be considered fresh by client, but the fact is that before the change Spambayes treated them client-friendy (was completely trasparent, avoided interfering with future client processing), and has stopped doing so now. At least, now the ham messages get marked as Read, and before they were Unread. The same Read flags are on now on all the Spam and Unsure messages sorted by Spambayes itself, not the way it was before. FAQ at http://spambayes.sourceforge.net/faq.html#sb-imapfilter-prints-out-skipping-unparseable-message-but-the-message-vanishes states that one need message.py (revision 1.46) to fix the multipart problem, which is obviously an error (1.46 is the one included in the release_1_0_a9 itself). 1.48 seems to be the right choice, in view of its timestamp. I use http://cvs.sourceforge.net/viewcvs.py/*checkout*/spambayes/spambayes/scripts/sb_imapfilter.py?rev=1.27 http://cvs.sourceforge.net/viewcvs.py/*checkout*/spambayes/spambayes/spambayes/message.py?rev=1.48 not the 1.26 and 1.46 as suggested by the FAQ. Regards, Mike Yakoubov. From russ_foster at comcast.net Thu Feb 19 11:25:06 2004 From: russ_foster at comcast.net (Russ Foster) Date: Thu Feb 19 11:17:06 2004 Subject: [Spambayes] Ham:Spam ratio In-Reply-To: Message-ID: I have the spam problem (large imbalance of spam:ham). It's probably not the most elegant solution, but I just manually move some of my 'ham' (from the inbox, even though it was classified properly as ham) into the 'unsure' folder. Then highlight these good messages and click "Recover from Spam". The reason you get so much unclassified is you don't have enough trained ham. If you keep the ratio in check (I would say anywhere between 1:1 and 1:2 would suffice) you will start to see much better results. -r -- russ_foster@comcast.net On Thu, 19 Feb 2004 dickk@paragonconstructioninc.com wrote: > I've seen a lot of posts regarding this ratio that indicate 1:1 or 1:2 is an > appropriate ratio. My ratio is currently 446:5554 or about 1:121. > > My real email statistics are about 98% spam, 2% ham. When I started using > SpamBayes two months ago, I didn't do any training, but rather started > training on all new email. Now about 60% of my mail goes to Junk and 39% > goes to Junk Suspects. Of the 39% unclassified, maybe 1% is actually ham. > I'm not particularly unhappy with the way that it's working, I just don't > know if I'm using it properly. > > My questions are: Should I expect that 39% of my mail is unclassified, or > should it be less (or more)? If I retrain to get a ratio of 1:2 or better, > will that decrease the amount of unclassified mail? If I retrain, how often > should I retrain to keep the ratio proper? > > Thanks > Dick Kusleika > Paragon Construction, Inc. From papaDoc at videotron.ca Thu Feb 19 11:26:23 2004 From: papaDoc at videotron.ca (papaDoc) Date: Thu Feb 19 11:29:54 2004 Subject: [Spambayes] Re: [spambayes-dev] A Question about SpamBayes In-Reply-To: <000001c3f6ff$86fc2230$6501a8c0@JOACHIM> References: <000001c3f6ff$86fc2230$6501a8c0@JOACHIM> Message-ID: <4034E3AF.7060900@videotron.ca> Hi Joachim, >first of all, SpamBayes is GREAT!!! > > Thanks >However, I am missing one tiny little feature, which you are addressing >under 4.3 and 6.5 in your FAQ section: Return/bounce/forward spam back to >the sender. > >In FAQ 4.3 it sounds like that this is somehow possible. But, I have no clue >what are you talking about in there... > > The FAQ 4.3 and 6.5 don't talk about the same things. FAQ 4.3 try to say that: You can set up spambayes such that when you send message to the proxy "yourself" with a given email address, it will train this mail as ham or spam depending on the given address. >By the way, I am running the Outlook (2000) plugin under Windows 200 Pro - >no Exchange server. So is it possible now or not? > > For now no and I don't think it will be implemented (See FAQ 6.5) >In FAQ 6.5 it is stated that I can't bounce spam back to the sender since >most sender addresses are fake anyway. I am in full agreement with you on >that and go along with this statement totally. > Good ...... ;-) >However, there are always >messages that get filtered out by SpamBayes that are not spam and are >legitimate. Therefore, I am always browsing over the Junk mail folder before >deleting all the spam - making sure I am not deleting an important message. >Now, if I could autoreply to all messages in the Junk E-Mail folder and >attach a short message to it, at least the non-spam senders would be >notified automatically that their email was considered spam and has not been >read but deleted. This way they could try to resend or rephrase. The return >address from non-spammers should be a good one > I think you can do this in your email program ? >and the autoreplies to >spammers with fake return addresses end up in limbo. > > This is not always true. If the spammer used my email address as the return address then I will receive and email from you and if (a bif IF) spambayes becomes very popular I could received lots of email because of that. Just talk to someone who had their email address as the "From" in a spam messages. They will have good stories...... >Do you think this makes any sense > This was interesting but ..... >and are you considering implementing >something like that into SpamBayes? > > I won't do it but I did not code much of Spambayes ;-( Remi -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From bhecht at sbcglobal.net Thu Feb 19 12:28:35 2004 From: bhecht at sbcglobal.net (bhecht@sbcglobal.net) Date: Thu Feb 19 12:46:22 2004 Subject: [Spambayes] Problem with POP3 Proxy Message-ID: <009101c3f70d$ce319fa0$0100a8c0@bhecht> I am using SpamBayes POP3 Proxy Version 0.4 (February 2004) (binary), with version 2.3.2+ (#49, Oct 23 2003, 15:50:06) [MSC v.1200 32 bit (Intel)] of Python; my operating system is Windows 5.1.2600.2 (Service Pack 1). I have trained 0 ham and 0 spam. The problem: I don't think it's configured correctly. I set the following two values: POP3 proxy running on 110, proxying to pop.sbcglobal.yahoo.com:110. I got pop.sbcglobal.yahoo.com from sbcglobal phone support, it's the incoming mail server address I got 110 from outlook express properties, incoming mail port. It doesn't seem to be intercepting mail. Thanks Bill Hecht -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040219/7005b1e3/attachment.html From johng at atser.com Thu Feb 19 13:47:37 2004 From: johng at atser.com (John Gagon) Date: Thu Feb 19 13:49:45 2004 Subject: [Spambayes] Feature idea. Message-ID: I really love SpamBayes and have sold my whole company on it. It works great even for those ditzy blondes in reception who tend to try to unsubscribe to every spam they get in their box. I have a suggestion for Spam Bayes. This is regards to the threshold feature. ie: you can raise or lower the score filtering criterias. (ie: for Spam/Unsure and Inboxes etc) Over time, I would suspect the messages, statistically would create a "camel" two hump curve. ie: two sets of distributions (I know there is a more technical term for that in statistics but it slips my mind atm). Over time, the humps would grow and the minima shift left a little as more and more clever spams are eliminated to the right side of the distribution) I would suspect the best place to set your thresholds would be between the ham and spam distribution humps. Or have your unsure zone be so many points away from that minima. It would be nice then to have a checkbox to enable automatic adjustment of the filtering criteria. (ie: over time, mine has gone down from 75% spam scores and above to 15% and above since I have a large hump after 15% and a smaller ham hump before the 15% mark. (IOW, the filter is getting very good and goes lower as it goes. but I'm having to manually do statistics and adjust the filter so as to get very good accuracy out of spambayes. Just an idea. BTW, I am a developer with some statistical/math background. I might consider contributing after I familiarize myself more with the group in operation here. but there is a high probability that I may just leave this as a good comment until frustration over not seeing this feature implemented goes over my limits. From rcoe at CambridgeMA.GOV Thu Feb 19 14:25:04 2004 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Thu Feb 19 14:25:07 2004 Subject: [Spambayes] RE: Problem with Outlook Plugin Message-ID: <42393C9DA7930245AB540667607F4F5022C276@SPIKE.city> > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Tony Meyer > Sent: Wednesday, February 18, 2004 7:28 PM > To: 'Patrick Hess'; spambayes@python.org > Subject: RE: [Spambayes] Problem with Outlook Plugin > > > > I would like to start clean by uninstalling the plugin and then > > I will remove the training and configuration. Can someone point > > me at which files to remove once the plugin has been uninstalled? > > The configuration is stored in a file called [profile name].ini > (possibly "Outlook.ini"). The training is stored in a file called > default_bayes_database.db. Both of these are a "SpamBayes" folder > in your "Application Data" directory. The FAQ explains where to find > this, but if you're using Windows NT/2k/XP, it's probably C:\Documents > and Settings\[username]\Application Data, and if you're using Windows > 95/98/ME, I think it's probably C:\Windows\Application Data. In NT, I believe it's "C:\WinNT\profiles\[username]\Application Data". The "Documents and Settings" folder came in with Windows 2000. Bob From russ_foster at comcast.net Thu Feb 19 14:58:28 2004 From: russ_foster at comcast.net (Russ Foster) Date: Thu Feb 19 14:50:29 2004 Subject: [Spambayes] Maintain training with Outlook? In-Reply-To: <001101c3f690$9e16be40$6801a8c0@Peacemaker> Message-ID: I have the spam problem (large imbalance of spam:ham). It's probably not the most elegant solution, but I just manually move some of my 'ham' (from the inbox, even though it was classified properly as ham) into the 'unsure' folder. Then highlight these good messages and click "Recover from Spam". NOTE: Drastic measure ahead... Here's something that saves a step or two: Create a new toolbar that includes both "Delete..." and "Recover..." buttons (you can drap and drop the buttons from the SB toolbar to the new one). Dock this next to the original SB toolbar. Now these buttons are always available. WARNING: I'm not sure if this affects anything else in Spambayes, but so far, it seems to be working. -r -- russ_foster@comcast.net 630-629-0843 home 630-885-0147 cell On Wed, 18 Feb 2004, Erin Lazzaro wrote: > The simple version first: What's the best way to maintain training with > the Outlook plugin? The FAQ advises training on "a few ham and a few > spam" on a regular basis, but it's not obvious how to train on ham that > hasn't been misclassified. The "Recover from Spam" button is only > present in the Junk Mail and Junk Mail Suspects folders, and the wizard > completely rebuilds the database; is there something less drastic that > I'm missing? My spam/ham ratio before filtering is probably close to > 10:1. How do I choose which and how many messages to train on? > > Now, for extra credit: I don't actually use Outlook much -- I use > Pocket Outlook on my PDA, and do as much as I can on the train. The > plugin buttons obviously aren't available, and moving messages into the > Junk folders doesn't work either. Apparently ActiveSync moves the > messages behind the scenes and SpamBayes doesn't see them move. I made > a couple of special folders, TrainJunk and TrainGood, to sort messages > into when using Pocket Outlook; when I'm back on the desktop I "Delete > as Spam" the entire TrainJunk folder. I haven't figured out what to do > with the TrainGood folder yet. Does anyone see a less cumbersome way of > handling this? > > Thanks, > Erin > > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From tim.one at comcast.net Thu Feb 19 15:59:25 2004 From: tim.one at comcast.net (Tim Peters) Date: Thu Feb 19 15:59:28 2004 Subject: [Spambayes] Delete As Spam - Doesn't always work In-Reply-To: <000001c3f701$db4620a0$2e00a8c0@Dell2400> Message-ID: [Geoff Campbell] > Sometimes (very rarely) when I select an obvious spam message and > "click" on "Delete As Spam", nothing happens. It seems that the > message is "SpamBayes proof"! This happens once every several days, > but I can't see a pattern. Using Outlook 2002 (SP-1). Would be > happy to send you a representative email. That would be good. There's probably a helpful (to us ) message in your SpamBayes log file whenever this happens. A real problem is that Outlook destroys the exact structure of incoming email, so you may not actually be able to give anyone else an email that reproduces the problem. When "nothing happens" in an otherwise-working SpamBayes, the usual cause is that the email is so badly formed (violates so many standards about how email *should* be constructed) that the SpamBayes email parser gives up trying to make any sense of it. If that's what's happening, you will find helpful (to us) information in your SpamBayes log file. From powerej at auburn.edu Thu Feb 19 16:11:21 2004 From: powerej at auburn.edu (Edward Powers) Date: Thu Feb 19 16:57:13 2004 Subject: [Spambayes] i need some help Message-ID: can i use any of your programs to get rid of my spam, i am unsure because i am using a email account through auburn university which is on there website. i have windows me and really really hate all of this spam. thank you for your time jim From tameyer at ihug.co.nz Thu Feb 19 17:06:34 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 19 17:15:43 2004 Subject: [Spambayes] Spam Bayes - moving DB to a different Folder In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305255FFA@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779A8@its-xchg4.massey.ac.nz> > My Problem: My Profile can't exeed 16MB. > Therefor I'd like to change the Location. I didn't find > anything in registry, environment (>set), oder the > installationFolder, google, and FAQs how to switch the location. Try the documentation . If you go to Help->About SpamBayes, then to the Configuration link, you want the section called "Multiple Configuration Files". This explains how to setup the plug-in so that another directory is used as the data directory. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From swhite at infostatsystems.com Thu Feb 19 17:33:28 2004 From: swhite at infostatsystems.com (Steve White) Date: Thu Feb 19 17:33:33 2004 Subject: [Spambayes] Upgrade to Spam Bayes stopped filtering Message-ID: I upgraded this morning to the latest SpamBayes 0.9 Feb 2004, since that time all Spam Filtering has stopped. What did I do wrong? I checked the configuration and all appears to be well. I noticed that the install went into a different folder than the earlier version C;\Program Files\SpamBayes as opposed to the previous C:\Program Files\Spambayes Outlook Addin. Did all my learning history on Spam get lost? Am I now recalibrating the system? I am working with Outlook 2003 on Win XP Pro. Thanks for any suggestions. Steve W. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040219/590248b9/attachment.html From tameyer at ihug.co.nz Thu Feb 19 17:21:15 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 19 17:39:38 2004 Subject: [Spambayes] RE: Problem with Outlook Plugin In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13052560A4@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779AB@its-xchg4.massey.ac.nz> [...] >> The FAQ explains where to find >> this, but if you're using Windows NT/2k/XP, it's probably >> C:\Documents >> and Settings\[username]\Application Data, and if you're >> using Windows 95/98/ME, I think it's probably > C:\Windows\Application Data. > > In NT, I believe it's > "C:\WinNT\profiles\[username]\Application Data". The > "Documents and Settings" folder came in with Windows 2000. Thanks! I didn't realise this. I'll start saying the right thing to people, and update the FAQ if necessary. =Tony Meyer From tameyer at ihug.co.nz Thu Feb 19 17:43:59 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 19 18:18:36 2004 Subject: [Spambayes] Upgrade to Spam Bayes stopped filtering In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305256156@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779AD@its-xchg4.massey.ac.nz> > I upgraded this morning to the latest SpamBayes 0.9 > Feb 2004, since that time all Spam Filtering has stopped. > What did I do wrong? I checked the configuration > and all appears to be well. I noticed that the install > went into a different folder than the earlier version > C;\Program Files\SpamBayes as opposed to the previous > C:\Program Files\Spambayes Outlook Addin. Did you uninstall your previous version first? Because there were major changes in the way the installer is built, we recommend doing that, although it isn't usually necessary. > Did all my learning history on Spam get lost? It's unlikely. In your data directory, there will be a "default_bayes_customize.db" file, which holds the training data. Neither the [un]installer for the previous versions, nor the one for 0.9 touches this file, and both 0.8 and 0.9 look for the files in the same place*. If you could send us your log file(s), that would greatly help to figure out what is happening. =Tony Meyer * Unless you have setup SpamBayes to use multiple configuration files, and have put one in the directory that SpamBayes is installed into; this is very uncommon. --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 19 17:56:44 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 19 18:36:07 2004 Subject: [Spambayes] Ham:Spam ratio In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130525608D@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AC9@its-xchg4.massey.ac.nz> [Dick] > My questions are: Should I expect that 39% of my mail is > unclassified, or should it be less (or more)? If by "unclassified" you mean "classified as unsure", then no. 2-5% is probably typical. > If I retrain to get a ratio of 1:2 or better, will that > decrease the amount of unclassified mail? Having a ratio close to 1:1 (even 5:1 or 1:5 is probably ok - 1:121 is way too much) will help classification, yes. > If I retrain, how often should I retrain to keep the ratio proper? There isn't really an accepted training methodology as yet. The wiki (http://entrian.com/sbwiki) has a *lot* of stuff about training, if you want to read it. If you don't , then I'd suggest: Training on mistakes only - i.e. get rid of your existing training data, so that everything is unsure (or if you're using the plug-in and there's still the 5+5 minimum, then go down to that). Then train on all mail classified as unsure, all good mail classified as spam, and all spam classified as good mail. Your database will stay small, and you'll end up with pretty good results, fairly quickly. For the most part, the ratio tends to stay roughly even here, too, for various reasons. [Russ] > It's probably not the most elegant solution, but I just > manually move some of my 'ham' (from the inbox, even though > it was classified properly as ham) into the 'unsure' folder. > > Then highlight these good messages and click "Recover from Spam". You'll probably find it's better to train *less spam* than *more ham*. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 19 17:59:21 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 19 18:40:55 2004 Subject: [Spambayes] i need some help In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130525613E@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779AF@its-xchg4.massey.ac.nz> > can i use any of your programs to get rid of my spam, i am > unsure because i am using a email account through auburn > university which is on there website. i have windows me and > really really hate all of this spam. So you read your email in a web browser (Internet Explorer, Mozilla, Safari, or something like that), rather than in a mail client (Outlook, Outlook Express, Mail.app, Eudora, or something like that)? If that's the case, and this is the only way to do it, then unfortunately none of the SpamBayes applications will work for you. With a webmail type system like this, I think your best hope (other than moving away from webmail) would be to have the university install some server-side filtering for everyone. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From rmalayter at bai.org Thu Feb 19 19:00:50 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Thu Feb 19 19:00:53 2004 Subject: [Spambayes] a useful pre-filter for auto-training bayesian systems? Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01E19DCE@cliff.bai.org> I found this interesting. Using the social network as a first step, a bunch of "definite ham" and "definite spam" messages are listed. These can be used to train a Bayesian filter which then filters the rest of the unsures automatically. Very little user intervention would therefore be required for training, and it cuts in half the number of messages that must be filtered by the much-more-expensive statistical filter. http://www.arxiv.org/abs/cond-mat/0402143 One could even imagine users securely posting their email addressee's "white lists" by posting SHA-1 hashes instead of actually email addresses to some public forum. (This would have to be salted, of course). This could create a meta-social-network. They don't seem to address the issue of a spam that has a forged address from your own social network, though, which might trip up this whole social network process. Ryan Malayter Sr. Network & Database Administrator Bank Administration Institute Chicago, Illinois, USA PGP Key: http://www.malayter.com/pgp-public.txt ::::::::::::::::::::::::::::::: I am prepared to meet my Maker. Whether my Maker is prepared for the great ordeal of meeting me is another matter. -Sir Winston S. Churchill From jm at jmason.org Thu Feb 19 20:21:44 2004 From: jm at jmason.org (Justin Mason) Date: Thu Feb 19 20:21:56 2004 Subject: [Spambayes] Re: [spambayes-dev] a useful pre-filter for auto-training bayesian systems? In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A01E19DCE@cliff.bai.org> Message-ID: <20040220012146.848D117003@jmason.org> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Ryan Malayter writes: > I found this interesting. Using the social network as a first step, a > bunch of "definite ham" and "definite spam" messages are listed. These > can be used to train a Bayesian filter which then filters the rest of > the unsures automatically. Very little user intervention would therefore > be required for training, and it cuts in half the number of messages > that must be filtered by the much-more-expensive statistical filter. > > http://www.arxiv.org/abs/cond-mat/0402143 > > One could even imagine users securely posting their email addressee's > "white lists" by posting SHA-1 hashes instead of actually email > addresses to some public forum. (This would have to be salted, of > course). This could create a meta-social-network. > > They don't seem to address the issue of a spam that has a forged address > from your own social network, though, which might trip up this whole > social network process. Yeah -- spam with forged From of your address. That has historically been how spammers get around address-book-based whitelisting, because everyone usually has 1 or more of their own addrs in the address book. - --j. -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.3 (GNU/Linux) Comment: Exmh CVS iD8DBQFANWEoQTcbUG5Y7woRAr2uAKDC4sevuJ87uYk6zPlb6aWOik7xXgCfdn7n /TFi3tpsMnGxI38K4cpTmUA= =v88V -----END PGP SIGNATURE----- From pcrayne at comcast.net Thu Feb 19 20:25:05 2004 From: pcrayne at comcast.net (Peter Crayne) Date: Thu Feb 19 20:26:22 2004 Subject: [Spambayes] Outlook envelope icon In-Reply-To: Message-ID: There's a standalone app that does something similar: http://sourceforge.net/projects/cloutmon "Windows (9x/Me/NT/2k/XP) tray utility that monitors the Outlook Inbox for unread email. An empty or full envelope is displayed depending on the status of unread email in the Inbox. It also removes Outlook's envelope icon from the tray." It's no longer being developed, but it is open-source. -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of dickk@paragonconstructioninc.com Sent: Thursday, February 19, 2004 6:44 AM To: spambayes@python.org Subject: [Spambayes] Outlook envelope icon Using Outlook2k I've read the faq on this and I wanted to share what I was doing. I'm hoping that someone has come up with a better way which hasn't made it to the faq. I use the Application_NewMail event to create an AppointmentItem with a start time of Now and ReminderMinutesBeforeStart = 0. Then I use the Application_Reminder event to determine the amount of unread mail in my Inbox. If none, I delete the envelope icon using Neo's code. And it deletes the AppointmentItem whether the envelope is deleted or not. Here are the problems: Even though the AppointmentItem is set to remind immediately, it takes between 10 and 30 seconds for it to actually run. This means that if I see an envelope, it may mean I have new mail or it may mean that the reminder event hasn't run yet. The reminder is displayed and steals the focus for a split second before it's deleted. Normally that's not a problem, but it can be an annoyance if I'm typing something during that split second. My deleted items folder has a lot of deleted reminders in it. I've manually set up a reminder for every Sunday to delete these reminders from the deleted items folder, but I won't know if this works well until this Sunday. Here's the code I'm using: [truncated] From gnish at tbaytel.net Thu Feb 19 20:47:39 2004 From: gnish at tbaytel.net (gnish@tbaytel.net) Date: Thu Feb 19 20:47:36 2004 Subject: [Spambayes] Problem with POP3 Proxy Message-ID: A non-text attachment was scrubbed... Name: SpamBayesServer1.log Type: application/octet-stream Size: 646 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040219/e63fde86/SpamBayesServer1.obj -------------- next part -------------- I am using SpamBayes POP3 Proxy Version 0.4 (February 2004) (binary), with version 2.3.2+ (#49, Oct 23 2003, 15:50:06) [MSC v.1200 32 bit (Intel)] of Python; my operating system is Windows 5.1.2600.2 (Service Pack 1). I have trained 5 ham and 13 spam. The problem I am having is I can no longer receive anymore emails on this computer From papaDoc at videotron.ca Thu Feb 19 20:55:04 2004 From: papaDoc at videotron.ca (Remi Ricard) Date: Thu Feb 19 20:49:24 2004 Subject: [Spambayes] Problem with POP3 Proxy In-Reply-To: <009101c3f70d$ce319fa0$0100a8c0@bhecht> References: <009101c3f70d$ce319fa0$0100a8c0@bhecht> Message-ID: <1077242101.3463.6.camel@porsche.hq.simlog.com> Hi, > I am using SpamBayes POP3 Proxy Version 0.4 (February 2004) (binary), > with version 2.3.2+ (#49, Oct 23 2003, 15:50:06) [MSC v.1200 32 bit > (Intel)] of Python; my operating system is Windows 5.1.2600.2 (Service > Pack 1). I have trained 0 ham and 0 spam. > ... > POP3 proxy running on 110, proxying to pop.sbcglobal.yahoo.com:110. The pop.sbcglobal.yahoo.com:110 should be OK. > > I got pop.sbcglobal.yahoo.com from sbcglobal phone support, it's the > incoming mail server address > > I got 110 from outlook express properties, incoming mail port. In OE did you enter localhost:110 or pop.sbcglobal.yahoo.com:110 It should be localhost:110. Remi From reesj01 at hotmail.com Thu Feb 19 21:11:08 2004 From: reesj01 at hotmail.com (Jason) Date: Thu Feb 19 21:11:15 2004 Subject: [Spambayes] outlook causing unknown error when closed In-Reply-To: <5.2.1.1.0.20040219104026.00b035e0@pop.vub.ac.be> Message-ID: <834DA428EFB83D4584FF4DD63A03F3CBAFFA@btrnt2> Hi all, I was having this same problem and I seem to have fixed it. When you get the error message asking if you want to restart Outlook, click on the link after the words, "To see what data this error report contains, click here". Up the top of this next screen, there is a Module Name. For me it was gpgexch.dll which is a GPG encryption plug-in. After removing that plug-in, it all seemed to work perfectly. I'm not sure how this all works but I have had that plug-in installed for months before I had SpamBayes and I have only had the error since SpamBayes was installed. Needless to say, I need SpamBayes more than I need GPG. Is it possible that the SpamBayes plug-in has somehow messed with the other plugins? For other people it may be a different plug-in than GPG that is causing the problem but uninstalling the GPG plug-in definitely worked for me. I'm happy to email off list if anyone wants further info about this. Regards, Jason -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Stefan De Corte Sent: Thursday, 19 February 2004 7:42 PM To: spambayes@python.org Subject: [Spambayes] outlook causuing unknown error when closed Hello, has there ever been an answer to the question: using outlook 2002 whenever i close outlook instead of it going thru its normal closing routine it causes an "error in " so effectively outlook crashes each time i close it - any suggestions plse? posted by Glyn Sloman gsloman at ntlworld.com Sun Oct 19 13:37:22 EDT 2003 I have a similar problem I can't solve.. Thanks in advance, Stefan De Corte -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040220/55cabacc/attachment.html From russ_foster at comcast.net Thu Feb 19 21:55:32 2004 From: russ_foster at comcast.net (Russ Foster) Date: Thu Feb 19 21:47:34 2004 Subject: [Spambayes] Ham:Spam ratio In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AC9@its-xchg4.massey.ac.nz> Message-ID: On Fri, 20 Feb 2004, Tony Meyer wrote: > [Russ] > > It's probably not the most elegant solution, but I just > > manually move some of my 'ham' (from the inbox, even though > > it was classified properly as ham) into the 'unsure' folder. > > Then highlight these good messages and click "Recover from Spam". > > You'll probably find it's better to train *less spam* than *more ham*. I only train spam that is either misclassified to 'unsure'. Are you recommending just deleting some of those without training? I wish I had less spam to train! -r From tim.one at comcast.net Thu Feb 19 21:51:37 2004 From: tim.one at comcast.net (Tim Peters) Date: Thu Feb 19 21:51:39 2004 Subject: [Spambayes] train on blank spam messages In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A01E19D6E@cliff.bai.org> Message-ID: [Ryan Malayter, on mine_received_headers] > Is this option off by default because it did not provide a net gain > for most corpora? Or is it still experimental? The most recent reports were mixed: helped some, hurt some, didn't matter to others. From tameyer at ihug.co.nz Thu Feb 19 22:40:22 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 19 22:41:02 2004 Subject: [Spambayes] Problem with POP3 Proxy In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13052561C6@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2ACA@its-xchg4.massey.ac.nz> > The problem I am having > is I can no longer receive anymore emails on this computer [I presume that you still have some way to read incoming mail...] It appears from your log that SpamBayes is running correctly, so the most likely problem is that your mail client isn't connecting to SpamBayes. In your mail client, you should have it setup to connect to "localhost" rather than "mail.tbaytel.net" (the port should stay the same, i.e. "110"). Do you get an error in your mail client? It would help to know what it is (and which client you are using). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From kbulgrien at worldnet.att.net Thu Feb 19 23:41:01 2004 From: kbulgrien at worldnet.att.net (Kevin R. Bulgrien) Date: Thu Feb 19 23:40:40 2004 Subject: [Spambayes] 1.0a9 SMTP proxy hosed on Linux Message-ID: <200402192241.01816.kbulgrien@worldnet.att.net> spambayes-1.0a9.tar.gz smtpproxy is hosed. My mail client (KMail) now hangs when I send mail to train. As long as I do not try to train through the proxy, I can send mail through it to my ISP. As soon as I try to send to the "fake" mail address configured in bayescustomize.ini, the send process hangs, and I have to manually clear out the outbox. This happened when I updated from 1.0a7. The setup.py install is broken too. It did not replace all the /usr/bin/sb_* scripts properly. It said they were already installed, but it was obvious they weren't correct, because the scripts cratered all over the place, and after the install, the files in /usr/bin were not the same files that were in the installation directory. For all I know, there are some other files out there that did not get updated correctly. Where all does spambayes copy things when you do the setup.py install thing? And, why did the old 1.0a7 start two sb_server.py processes but this one only starts one? I'm not sure where to start on troubleshooting this one. Anyone got any ideas? Kevin R. Bulgrien From kbulgrien at worldnet.att.net Fri Feb 20 00:27:15 2004 From: kbulgrien at worldnet.att.net (Kevin R. Bulgrien) Date: Fri Feb 20 00:26:52 2004 Subject: [Spambayes] 1.0a9 500 Server error (More statistics... link) Message-ID: <200402192327.15367.kbulgrien@worldnet.att.net> Clicking for more statistics (http://localhost:8880/stats) gives: --- 500 Server error Traceback (most recent call last): File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 461, in found_terminator getattr(plugin, name)(**params) File "/usr/lib/python2.2/site-packages/spambayes/UserInterface.py", line 1016, in onStats s = Stats.Stats() File "/usr/lib/python2.2/site-packages/spambayes/Stats.py", line 42, in __init__ self.CalculateStats() File "/usr/lib/python2.2/site-packages/spambayes/Stats.py", line 58, in CalculateStats for msg in msginfoDB.db: File "/usr/lib/python2.2/shelve.py", line 70, in __getitem__ f = StringIO(self.dict[key]) TypeError: argument must be string or read-only buffer, not int --- Shrug. What are .pyc files? Stats.py matches the file in the install directory, but Stats.pyc is older than Stats.py whereas most other .pyc files are newer than their corresponding .py files... Is this a broken install? Maybe. Deleted all /usr/lib/python2.2/site-packages/spambayes/* and re-ran python setup.py install, but I still get the same thing, so maybe not related. (smtpproxy training still broken too). From papaDoc at videotron.ca Fri Feb 20 09:00:18 2004 From: papaDoc at videotron.ca (papaDoc) Date: Fri Feb 20 09:00:07 2004 Subject: [Spambayes] Problem with POP3 Proxy In-Reply-To: <00ad01c3f75f$48683ad0$0100a8c0@bhecht> References: <009101c3f70d$ce319fa0$0100a8c0@bhecht> <1077242101.3463.6.camel@porsche.hq.simlog.com> <00ad01c3f75f$48683ad0$0100a8c0@bhecht> Message-ID: <403612F2.9080808@videotron.ca> Hi, For your advance setting everything look OK to me. For your Servers setting everything look normal except 1- The account name looks strange to me "your_account@sbcglobal.net/pop.scbcglobal.y" the "/pop.sc....." but I don't know this ISP so this can still be normal 2- There is no password but you could have deleted it ! What you can do is test the proxy. 1- Type in the command prompt: telnet localhost 110 and after telnet pop.sbcglobal.yahoo.com 110 You must see the same "welcome" message since the "localhost 110" is only proxying and don't display anything. Remi P.S. What do you use to read/get this email ? -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From office at rtech.ro Fri Feb 20 09:58:37 2004 From: office at rtech.ro (Revlis SRL Software Technologies) Date: Fri Feb 20 10:13:13 2004 Subject: [Spambayes] (no subject) Message-ID: <007401c3f7c2$05410850$1ae0a4d5@thorasine> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 6908 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040220/b69b0e3c/attachment-0002.jpe -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 23895 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040220/b69b0e3c/attachment-0003.jpe From dickk at paragonconstructioninc.com Fri Feb 20 11:11:17 2004 From: dickk at paragonconstructioninc.com (dickk@paragonconstructioninc.com) Date: Fri Feb 20 11:11:44 2004 Subject: [Spambayes] Ham:Spam ratio In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AC9@its-xchg4.massey.ac.nz> Message-ID: Tony Thanks for the good info. > > Training on mistakes only - i.e. get rid of your existing > training data, > so that everything is unsure (or if you're using the plug-in and there's > still the 5+5 minimum, then go down to that). > > Then train on all mail classified as unsure, all good mail > classified as > spam, and all spam classified as good mail. > > Your database will stay small, and you'll end up with pretty > good results, > fairly quickly. For the most part, the ratio tends to stay roughly even > here, too, for various reasons. > Ok, using OL2k, I've rebuilt the database with 5+5. Now I want to be clear on what I should do with the Suspects. I have a suspect now with a score of 47% (limits are 80 and 15). If I understand, I should move this message to deleted items and NOT use the Delete As Spam button. Only use the buttons when ham ends up in Junk or when Spam ends up in Inbox. > You'll probably find it's better to train *less spam* than *more ham*. Based on that, maybe I should also "Recover From Spam" and ham that ends up in Suspects because I'm likely to get way more spam in my Inbox than ham in my Junk (due to the big difference in my actual ham:spam ratio). I'm sorry I require so much hand-holding, but I appreciate your time. Dick From tim.one at comcast.net Fri Feb 20 11:12:54 2004 From: tim.one at comcast.net (Tim Peters) Date: Fri Feb 20 11:17:39 2004 Subject: [Spambayes] (no subject) In-Reply-To: <007401c3f7c2$05410850$1ae0a4d5@thorasine> Message-ID: [thorasine@rtech.ro] > We need a commercial version of SpamBayes, supporting Outlook >2000 > and Outlook Express > 5. Can you help? The SpamBayes project doesn't sell anything. At least one commercial product builds on the SpamBayes code base, but this isn't the place to ask about it (communicate with them directly): http://www.inboxer.com/ From bhecht at sbcglobal.net Fri Feb 20 11:51:09 2004 From: bhecht at sbcglobal.net (bhecht@sbcglobal.net) Date: Fri Feb 20 11:47:17 2004 Subject: [Spambayes] Problem with POP3 Proxy References: <009101c3f70d$ce319fa0$0100a8c0@bhecht> <1077242101.3463.6.camel@porsche.hq.simlog.com> <00ad01c3f75f$48683ad0$0100a8c0@bhecht> <403612F2.9080808@videotron.ca> Message-ID: <000701c3f7d1$bde9af60$0100a8c0@bhecht> I am using Outlook Express, which is functioning fine, and Windows XP. It's just that SpamBayes doesn't seem to be doing anything - not seeing or doing anything with the mail. Per your suggestion, I opened a command prompt window and typed in a command, and got a response, typed in the second command, and then got no response: telnet localhost 110 +OK POP3 server ready telnet pop.sbcglobal.yahoo.com 110 then there is no response. Nothing echos back after this. However if I open a new command window and start with your second command, then I get a different response: telnet pop.sbcglobal.yahoo.com 110 +OK hello from popgate<2.23.15> if I type additional words after this, "commands" after this, it gives an error message: telnet localhost 110 -ERR popgate unknown command help -ERR popgate unknown command In the dialog box I attached, the account name was not shown in full due to the window being smaller than the name. Here is the whole string: bhecht@sbcglobal.net/pop.sbcglobal.yahoo.com I don't know why it doesn't show a password, or at least some dots indicating that it is hiding the password. The mail is certainly working - I'm using it. I believe that the my password is in place and working. I suppose it's conceivable that my mail is not password protected. All I know is that I have to enter my password when I start Outlook Express, in a dialog box that says: Logon - bhecht@sbcglobal.net Server: localhost User Address: bhecht@sbcglobal.net/pop.sbcglobal.yahoo.com Password: Thanks for your help - much appreciated! - Bill ----- Original Message ----- From: "papaDoc" To: ; Sent: Friday, February 20, 2004 6:00 AM Subject: Re: [Spambayes] Problem with POP3 Proxy > Hi, > > For your advance setting everything look OK to me. > > For your Servers setting everything look normal except > 1- The account name looks strange to me > "your_account@sbcglobal.net/pop.scbcglobal.y" > the "/pop.sc....." but I don't know this ISP so this can still be normal > > 2- There is no password but you could have deleted it ! > > > What you can do is test the proxy. > 1- Type in the command prompt: > telnet localhost 110 > and after > telnet pop.sbcglobal.yahoo.com 110 > > You must see the same "welcome" message since the "localhost 110" is > only proxying and don't display anything. > > Remi > > P.S. What do you use to read/get this email ? > > -- > /"\ > \ / > X ASCII Ribbon Campaign > / \ Against HTML Email From jcsvaren at prtel.com Fri Feb 20 12:30:56 2004 From: jcsvaren at prtel.com (J CORNELL SVAREN) Date: Fri Feb 20 12:31:13 2004 Subject: [Spambayes] Hi Message-ID: <000101c3f7d7$4fa5a170$21ece442@DH62VG11> I hilite a spam message and click on Delete as Spam and I get a message saying: You need to reconfigure you spambayes folder. How do I do this? Thank you for your quick response. Cornell -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040220/aa9db35c/attachment.html From papaDoc at videotron.ca Fri Feb 20 12:49:17 2004 From: papaDoc at videotron.ca (papaDoc) Date: Fri Feb 20 12:49:02 2004 Subject: [Spambayes] Problem with POP3 Proxy In-Reply-To: <000701c3f7d1$bde9af60$0100a8c0@bhecht> References: <009101c3f70d$ce319fa0$0100a8c0@bhecht> <1077242101.3463.6.camel@porsche.hq.simlog.com> <00ad01c3f75f$48683ad0$0100a8c0@bhecht> <403612F2.9080808@videotron.ca> <000701c3f7d1$bde9af60$0100a8c0@bhecht> Message-ID: <4036489D.4000305@videotron.ca> Hi, Bill, >I am using Outlook Express, which is functioning fine, and Windows XP. It's >just that SpamBayes doesn't seem to be doing anything - not seeing or doing >anything with the mail. Per your suggestion, I opened a command prompt >window and typed in a command, and got a response, typed in the second >command, and then got no response: > >telnet localhost 110 >+OK POP3 server ready >telnet pop.sbcglobal.yahoo.com 110 > >then there is no response. > >Nothing echos back after this. > I'm confused. ;-( I'm not confused anymore after reading the mail 50 times ..... hehe... (I'm a little bit slow today) If I understand correctly You open a command prompt window and typed : "telnet localhost 110" then a message appeared on the screen (+OK POP3 server ready) This indicate that Spambayes is responding (or forwarding the request to the real POP server) Then you typed "telnet pop.sbcglobal.yahoo.com 110" Since this is not a POP command maybe this is a correct behavior to have no answer. AS far as I can see Spambayes is working In the web interface of the sb_server.py Did you configure to add "clues" in the header. Section "Header Options" of the configuration page This must be done since OE can't see the Header Spambayes is adding. The clue that tells you if the mail is spam or ham should/must be in the subject line. So you can try this or Install another email program (Ex. mozilla, thunderbird) and retrieve one email and look at the complete header (View->Header->All) in mozilla. If you can see the header X-Spambayes-Classification: xxxx (xxx is ham or spam) Then your mail is filtered by Spambayes. Scrap OE and use something else. If you still want to use OE I can't help you more than what I did since I'm not using it and I don't know how it works. >However if I open >a new command window and start with your second command, then I get a >different response: >telnet pop.sbcglobal.yahoo.com 110 > >+OK hello from popgate<2.23.15> > >if I type additional words after this, "commands" after this, it gives an >error message: > >telnet localhost 110 >-ERR popgate unknown command >help >-ERR popgate unknown command > > This is good and expected since your talking directly to the pop server on port 110. Remi -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From cwoodhall at montereywines.org Fri Feb 20 13:32:44 2004 From: cwoodhall at montereywines.org (Carolyn Woodhall) Date: Fri Feb 20 13:27:41 2004 Subject: [Spambayes] HELP Message-ID: Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 2743 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040220/014ec819/attachment.jpe From dickk at paragonconstructioninc.com Fri Feb 20 13:33:00 2004 From: dickk at paragonconstructioninc.com (dickk@paragonconstructioninc.com) Date: Fri Feb 20 13:33:23 2004 Subject: [Spambayes] Outlook envelope icon In-Reply-To: <20040220012615.MKYV23132.lakecmmtai01.coxmail.com@rwcrmhc13.comcast.net> Message-ID: Thanks, Peter. After about 4 1/2 hours of use, I'm generally pleased with it. It doesn't remove the Outlook icon, but includes it's own which, so far, is reliable. I can easily ignore the Outlook icon. Dick > -----Original Message----- > From: Peter Crayne [mailto:pcrayne@comcast.net] > Sent: Thursday, February 19, 2004 7:25 PM > To: spambayes@python.org > Cc: dickk@paragonconstructioninc.com > Subject: RE: [Spambayes] Outlook envelope icon > > > There's a standalone app that does something similar: > http://sourceforge.net/projects/cloutmon > > "Windows (9x/Me/NT/2k/XP) tray utility that monitors the Outlook > Inbox for > unread email. An empty or full envelope is displayed depending > on the status > of unread email in the Inbox. It also removes Outlook's envelope > icon from > the tray." > > It's no longer being developed, but it is open-source. > > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org] On > Behalf Of dickk@paragonconstructioninc.com > Sent: Thursday, February 19, 2004 6:44 AM > To: spambayes@python.org > Subject: [Spambayes] Outlook envelope icon > > Using Outlook2k > > I've read the faq on this and I wanted to share what I was doing. I'm > hoping that someone has come up with a better way which hasn't made it to > the faq. > > I use the Application_NewMail event to create an AppointmentItem with a > start time of Now and ReminderMinutesBeforeStart = 0. Then I use the > Application_Reminder event to determine the amount of unread mail in my > Inbox. If none, I delete the envelope icon using Neo's code. And it > deletes the AppointmentItem whether the envelope is deleted or not. > > Here are the problems: > Even though the AppointmentItem is set to remind immediately, it takes > between 10 and 30 seconds for it to actually run. This means > that if I see > an envelope, it may mean I have new mail or it may mean that the reminder > event hasn't run yet. > > The reminder is displayed and steals the focus for a split second before > it's deleted. Normally that's not a problem, but it can be an > annoyance if > I'm typing something during that split second. > > My deleted items folder has a lot of deleted reminders in it. > I've manually > set up a reminder for every Sunday to delete these reminders from the > deleted items folder, but I won't know if this works well until > this Sunday. > > Here's the code I'm using: > > [truncated] > From Amir_Katz at bmc.com Fri Feb 20 13:35:58 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Fri Feb 20 13:38:59 2004 Subject: [Spambayes] HELP Message-ID: The are probably in the 'Spam Suspects' folder. You need to go there and for each mail, decide whether it's spam or ham and use the appropriate button - 'recover from spam' for good mails (ham), and 'delete as spam' for the bad ones (spam) In any case, the FAQ would be a good place to find answers: http://spambayes.sourceforge.net/faq.html -----Original Message----- From: Carolyn Woodhall [mailto:cwoodhall@montereywines.org] Sent: Friday, February 20, 2004 20:33 To: spambayes@python.org Subject: [Spambayes] HELP -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040220/3f375e1c/attachment.html From kennypitt at hotmail.com Fri Feb 20 15:51:36 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 20 15:58:31 2004 Subject: [Spambayes] Problem with POP3 Proxy In-Reply-To: <000701c3f7d1$bde9af60$0100a8c0@bhecht> Message-ID: bhecht@sbcglobal.net wrote: > I am using Outlook Express, which is functioning fine, and Windows > XP. It's just that SpamBayes doesn't seem to be doing anything - not > seeing or doing anything with the mail. First, make sure that you have Outlook Express configured to use "localhost" as the server instead of going directly to "pop.sbcglobal.yahoo.com". Then, after receiving some e-mails in Outlook Express that don't appear to be classified, right-click on the SpamBayes tray icon and select "View information...". In the "Status and Configuration" box at the top of the page you should see the label "Emails classified this session:". If all of these values are zero then SpamBayes is not seeing the e-mails. In that case you should also see "POP3 conversations this section: 0". This would indicate that you have not properly configured Outlook Express to use the SpamBayes proxy. If the status message indicates that SpamBayes has classified some of your e-mails (probably all unsure), then the problem is more likely to be that you haven't configured Outlook Express to filter the messages that SpamBayes has processed. The SpamBayes proxy can't actually move any messages for you, it can only add information to the message that your mail client can filter on. OE's filter support is very limited, so you first need to change some options in SpamBayes. Go to the configuration page and scroll down to the 3rd box, "Header Options". Next to "Classify in subject header:", check "unsure" and "spam". This will cause SpamBayes to add the classification to the beginning of the message subject. After you save the configuration change, you should immediately start seeing these classifications appear on your received messages. Now you need to set up your Outlook Express filtering rules to move messages that SpamBayes classifies. You'll need to create two folders in OE, one for spam messages and one for unsure messages. Then go to Tools->Message Rules->Mail. Create a new rule and select "Where the Subject line contains specific words" and "Move it to the specified folder". In the Rule Description box, click "contains specific words" and enter "spam". Then click "specified folder" and choose the folder that you created for spam messages. Name this rule something like "SpamBayes spam messages" and make sure it is at the top of the message rule list. Then create a similar rule to look for "unsure" in the subect and move those to your unsure folder. After you've done all that, you should start seeing all of your messages moved to the Unsure folder. You'll then need to use the SpamBayes web interface to review the messages and select the proper classification. Then click the Train button at the bottom to update your training database. SpamBayes should soon start classifying most of your messages correctly. You'll then need to review and train only when it gets an unsure or makes a mistake. -- Kenny Pitt From finearts at sonic.net Fri Feb 20 16:00:46 2004 From: finearts at sonic.net (Keith Halonen) Date: Fri Feb 20 16:00:58 2004 Subject: [Spambayes] finearts@sonic.net Message-ID: <001501c3f7f4$9d920820$cb7294d1@Keith> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 857 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040220/36f7bddc/attachment.gif From support at hospicebg.com Fri Feb 20 17:33:55 2004 From: support at hospicebg.com (Support Hospice) Date: Fri Feb 20 17:34:07 2004 Subject: [Spambayes] Multiple Inboxes Message-ID: <82748B8F1E095B46831D27EC51DAF4508E70EF@mail.hospicebg.com> We have a user that has access to multiple exchange mailboxes in outlook. Is there a way in Spambayes to only have it check a particular inbox? If it is possible, how does one do that. Greg Hospice Support 859-275-2043 Ext. 421 mailto:support@hospicebg.com This transmission may contain information that is privileged, confidential and/or exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any disclosure, copying, distribution, or use of the information contained herein (including any reliance thereon) is STRICTLY PROHIBITED. If you received this transmission in error, please immediately contact the sender and destroy the material in its entirety, whether in electronic or hard copy format. Thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040220/f3068328/attachment.html From kennypitt at hotmail.com Fri Feb 20 17:37:20 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 20 17:38:23 2004 Subject: [Spambayes] finearts@sonic.net In-Reply-To: <001501c3f7f4$9d920820$cb7294d1@Keith> Message-ID: This message that I just sent to the list may help you since it addresses a similar issue. Here is a link to it in the Spambayes list archive: http://mail.python.org/pipermail/spambayes/2004-February/011374.html Port 8880 is the port number that SpamBayes is using to provide it's Web-based user interface. This port is not used at all for actual proxying of mail. "POP3 proxy running on" indicates the port that SpamBayes is listening on for local connections to the POP3 interface, and "proxying to" indicates the mail server that SpamBayes will talk to to retrieve the e-mail. Since these are on different machines, it is fine for the port number to be the same. The key is to configure Outlook Express to use "localhost" as the mail server instead of "pop.sonic.net". See the referenced message for additional details on how to set up Outlook Express and determine if SpamBayes is processing the messages. -- Kenny Pitt _____ From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Keith Halonen Sent: Friday, February 20, 2004 4:01 PM To: spambayes@python.org Subject: [Spambayes] finearts@sonic.net My problem has to do with port assignments. Before installation, I determined that my Outlook Express POP3 account uses Port 110. I therefore assigned Port 110 to SpamBayes. But I do not know if SpamBayes is intercepting mail downloaded to my machine via that port. The address field on my Internet Explorer window reads http://localhost:8880 when I am viewing SpamBayes screens. The SpamBayes HOME page displays the following message at the top of the screen: POP3 proxy running on 110, proxying to pop.sonic.net:110. So my questions are... 1. Should these port numbers be different? 2. Am I supposed to put the number 8880 in the field used to assign a port to SpamBayes? 3. How can I tell if SpamBayes is intercepting my email? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040220/3357f89d/attachment.html From brent at bjohnson.net Fri Feb 20 19:32:12 2004 From: brent at bjohnson.net (Brent L Johnson) Date: Fri Feb 20 19:32:14 2004 Subject: [Spambayes] Database Format In-Reply-To: Message-ID: <03ce01c3f812$27e90bf0$c801a8c0@torpedo> OK I'm still struggling with exporting the database. This is driving me nuts. I installed Python2.3 in hopes that this would fix the problem. It installed the binary in /usr/bin/python2.3 and I changed the sb_dbexpimp.py script to use the 2.3 executable. Im now getting the following error: # sb_dbexpimp.py -e -d default_bayes_database.db -f bayes.export Traceback (most recent call last): File "/usr/bin/sb_dbexpimp.py", line 104, in ? import spambayes.storage ImportError: No module named spambayes.storage Anyone have any other ideas as to how I can export my spambayes DB? Thanks, - Brent > Brent L Johnson wrote: > > OK I tried it with the -d option and I got the following error: > > > > $ sb_dbexpimp.py -e -d default_bayes_database.db -f bayes.export > > Traceback (most recent call last): > > File "/usr/bin/sb_dbexpimp.py", line 266, in ? > > runExport(dbFN, useDBM, flatFN) > > File "/usr/bin/sb_dbexpimp.py", line 119, in runExport > > bayes = spambayes.storage.open_storage(dbFN, useDBM) > > File > "/usr/lib/python2.2/site-packages/spambayes/storage.py", line > > 669, in open_storage > > return klass(data_source_name) > > File > "/usr/lib/python2.2/site-packages/spambayes/storage.py", line > > 164, in __init__ > > self.load() > > File > "/usr/lib/python2.2/site-packages/spambayes/storage.py", line > > 189, in load > > self.dbm = dbmstorage.open(self.db_name, self.mode) > > File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", > > line 65, in open > > return f(db_name, mode) > > File "/usr/lib/python2.2/site-packages/spambayes/dbmstorage.py", > > line 20, in open_dbhash > > return bsddb.hashopen(*args) > > bsddb.error: (22, 'Invalid argument') > > I see that you're using Python 2.2. Have you installed the > pybsddb (aka > bsddb3) package? IIRC, this package is required if you are > using BerkeleyDB with Python 2.2. > http://pybsddb.sourceforge.net/ -- Kenny Pitt From thetasig at comcast.net Fri Feb 20 21:28:33 2004 From: thetasig at comcast.net (thetasig) Date: Fri Feb 20 21:28:46 2004 Subject: [Spambayes] 500 server error Message-ID: <4036C251.5020700@comcast.net> Please let me know what steps I should take due to this error. Restarted service, etc., but same result. Not sure how I will receive a response to this since I can't receive e-mail at the moment... Will try a complete reinstall next... Thanks! 500 Server error Traceback (most recent call last): File "F:\E Drive Copy\Software\SpamBayes\Version 1.0a7\spambayes-1.0a7\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "F:\E Drive Copy\Software\SpamBayes\Version 1.0a7\spambayes-1.0a7\spambayes\UserInterface.py", line 523, in onConfig html = self._buildConfigPage(self.parm_ini_map) File "F:\E Drive Copy\Software\SpamBayes\Version 1.0a7\spambayes-1.0a7\spambayes\UserInterface.py", line 537, in _buildConfigPage html.mainContent.optionsPathname = optionsPathname File "F:\E Drive Copy\Software\SpamBayes\Version 1.0a7\spambayes-1.0a7\spambayes\PyMeldLite.py", line 738, in __setattr__ self._replaceNodeContent(node, value) File "F:\E Drive Copy\Software\SpamBayes\Version 1.0a7\spambayes-1.0a7\spambayes\PyMeldLite.py", line 654, in _replaceNodeContent node.children = self._nodeListFromSource(value) File "F:\E Drive Copy\Software\SpamBayes\Version 1.0a7\spambayes-1.0a7\spambayes\PyMeldLite.py", line 640, in _nodeListFromSource tree = _generateTree(""+value+"") File "F:\E Drive Copy\Software\SpamBayes\Version 1.0a7\spambayes-1.0a7\spambayes\PyMeldLite.py", line 574, in _generateTree g.feed(source) File "F:\E Drive Copy\Software\SpamBayes\Version 1.0a7\spambayes-1.0a7\spambayes\PyMeldLite.py", line 499, in feed self._parser.Parse(data) ExpatError: not well-formed (invalid token): line 1, column 43 From altlst at sonic.net Fri Feb 20 23:56:48 2004 From: altlst at sonic.net (Albert Ting) Date: Fri Feb 20 23:56:52 2004 Subject: [Spambayes] munging the headers Message-ID: <16438.58640.455222.225640@l-sft-2237.artisan.com> I just started to try out spambayes and it appears the latest version munges all the headers when I use sb_mboxtrain.py. But I'm using Emacs/VM and the problem is that it re-justifies the headers. In particular, spambayes will change this header: X-VM-v5-Data: (... "long subject line text" ...) To: X-VM-v5-Data: (... "long subject line text" ...) This confuses VM when it tries to display the summary section. Is there a way to prevent sb_mboxtrain.py from re-justifying the headers? One other thing. This is minor, but sp_mboxtrain.py adds a "X-Spambayes-Trained: spam" after the "Date:" column. VM assumes "Date:" is the last field and gets confused how to display this extra header. Not sure if that's a VM problem or a spambayes problem. Thanks, Albert From wolf at telusplanet.net Sat Feb 21 14:59:05 2004 From: wolf at telusplanet.net (Dan) Date: Sat Feb 21 14:59:07 2004 Subject: [Spambayes] Not sure?? Message-ID: <000801c3f8b5$29757b40$6401a8c0@Wolfen> I'm using outlook express 6.0, will Spam bayes work with it or not? If not what do you suggest I use? thanks for your time, Dan -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040221/f96f2cd2/attachment.html From keith_m_anderson at earthlink.net Sat Feb 21 15:49:11 2004 From: keith_m_anderson at earthlink.net (Keith Anderson) Date: Sat Feb 21 15:49:19 2004 Subject: [Spambayes] Recent windows update, now spambayes not working Message-ID: After a recent "auto update" spambayes is not working. Filters are not working and the "delete as spam" and "spambayes" buttons unresponsive. Is this a conflict with an outlook security update or should I look into other issues? Thanks in advance, Keith From spambayesspambayes at HOTMAIL.COM Sat Feb 21 16:52:03 2004 From: spambayesspambayes at HOTMAIL.COM (Jesus Marucut) Date: Sat Feb 21 15:53:46 2004 Subject: [Spambayes] spambayes: GV-Pr0ma.x 1s Ten|cent Vig.ra, $1o0% money back guarantee. Message-ID: An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040221/9cdd8660/attachment.html From DPaul at ebuilt.com Sat Feb 21 17:52:46 2004 From: DPaul at ebuilt.com (Paul, Doug) Date: Sat Feb 21 17:53:02 2004 Subject: [Spambayes] toolbar buttons not working Message-ID: SPam is filtering but tool bar buttons will not relocate mail to inbox or spam? tool bar drop down items work???? Doug From tim at fourstonesExpressions.com Sat Feb 21 20:45:54 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Sat Feb 21 20:46:00 2004 Subject: [Spambayes] Not sure?? In-Reply-To: <000801c3f8b5$29757b40$6401a8c0@Wolfen> References: <000801c3f8b5$29757b40$6401a8c0@Wolfen> Message-ID: On Sat, 21 Feb 2004 12:59:05 -0700, Dan wrote: > I'm using outlook express 6.0, will Spam bayes work with it or not? If > not what do you suggest I use? Spambayes works fine with OE. You'll configure and use sb_server.py. Instructions are included > > thanks for your time, Dan No prob, dude. -- Exprimez vous!; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From tameyer at ihug.co.nz Sat Feb 21 20:55:20 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 21 20:56:00 2004 Subject: [Spambayes] Multiple Inboxes In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13052564AD@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2ACC@its-xchg4.massey.ac.nz> > We have a user that has access to multiple exchange > mailboxes in outlook. Is there a way in Spambayes > to only have it check a particular inbox? Yes. > If it is possible, how does one do that. 1. Click the SpamBayes button on the SpamBayes toolbar. 2. In the drop-down menu that appears, click SpamBayes Manager. 3. In the Manager dialog that appears, click the Filtering tab. 4. Click the top Browse button (next to "Filter the following folders..."). 5. Untick whichever folders you don't want filtered (and tick those that you do). Note that there is a "include subfolders" checkbox at the bottom of the dialog, which does what you would expect. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 21 20:57:18 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 21 20:57:48 2004 Subject: [Spambayes] Hi In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130525636C@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779B7@its-xchg4.massey.ac.nz> > I hilite a spam message and click on Delete as Spam > and I get a message saying: You need to reconfigure > you spambayes folder. "spam" folder, not "spambayes" folder, yes? > How do I do this? Thank you for your quick response. This is probably a case of FAQ 3.13: In any case, the instructions there should fix the problem. =Tony Meyer From tameyer at ihug.co.nz Sat Feb 21 20:58:42 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 21 20:59:12 2004 Subject: [Spambayes] Recent windows update, now spambayes not working In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13052564BB@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779B8@its-xchg4.massey.ac.nz> > After a recent "auto update" spambayes is not working. > Filters are not working and the "delete as spam" and > "spambayes" buttons unresponsive. Is this a conflict with an > outlook security update or should I look into other issues? IIRC, Outlook disables any plug-ins after updates, which is probably the case here. Try selecting "About Microsoft Outlook" from the "Help" menu, and clicking on the "Disabled Items" button. If SpamBayes is there, then just re-enable it. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 21 21:01:38 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 21 21:02:09 2004 Subject: [Spambayes] Database Format In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13052564B0@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779B9@its-xchg4.massey.ac.nz> > OK I'm still struggling with exporting the database. > This is driving me nuts. I installed Python2.3 in hopes > that this would fix the problem. It installed the binary > in /usr/bin/python2.3 and I changed the sb_dbexpimp.py > script to use the 2.3 executable. Im now getting the following > error: > > # sb_dbexpimp.py -e -d default_bayes_database.db -f bayes.export > Traceback (most recent call last): > File "/usr/bin/sb_dbexpimp.py", line 104, in ? > import spambayes.storage > ImportError: No module named spambayes.storage You need to install spambayes for python2.3. Run "/usr/bin/python2.3 setup.py install" in the directory you expanded the spambayes archive into to install the spambayes package. Alternatively, you can instruct python to look for the spambayes package elsewhere with the PYTHONPATH environment variable - set it to include the spambayes directory in the directory you expanded the spambayes archive into (the one with the "__init__.py" file in it). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tim at fourstonesExpressions.com Sat Feb 21 21:09:15 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Sat Feb 21 21:09:20 2004 Subject: [Spambayes] Recent windows update, now spambayes not working In-Reply-To: References: Message-ID: On Sat, 21 Feb 2004 12:49:11 -0800, Keith Anderson wrote: > After a recent "auto update" spambayes is not working. Filters are not > working and the "delete as spam" and "spambayes" buttons unresponsive. > Is > this a conflict with an outlook security update or should I look into > other > issues? The chances are that the plugin has become disabled. Open up the Plugin Manager dialog and see if that's the case. If so, re-enable it and all will be well... if this isn't the case, then let us know and we'll ask you for some more specific information so we can help diagnose the problem. > > Thanks in advance, > Keith > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > -- Exprimez vous!; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From tameyer at ihug.co.nz Sat Feb 21 21:10:59 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 21 21:11:27 2004 Subject: [Spambayes] 1.0a9 SMTP proxy hosed on Linux In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130525621D@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2ACD@its-xchg4.massey.ac.nz> > spambayes-1.0a9.tar.gz smtpproxy is hosed. My mail client > (KMail) now hangs when I send mail to train. This is probably the known NameError. It's fixed in CVS, so will be fixed in the next release. You can replace your smtpproxy.py file (in the spambayes directory) with v1.6 from CVS if you like, which should fix the problem. > The setup.py install is broken too. It did not replace all > the /usr/bin/sb_* scripts properly. It said they were already > installed, but it was obvious they weren't correct, because > the scripts cratered all over the place, and after the install, > the files in /usr/bin were not the same files that were in > the installation directory. Did you just run setup.py regularly? (i.e. no flags passed other than "install"?). It silently replaces all the files here (apart from asking about deleting sb_smtpproxy.py). This could be a Windows/Linux difference, but I'm pretty sure that distutils is meant to operate in a platform agnostic way. > Where all does spambayes copy things when you do the setup.py install > thing? Into a "spambayes" folder in your Python Lib/site-packages directory and the Python Scripts directory. > And, why did the old 1.0a7 start two sb_server.py processes > but this one only starts one? AFAIK, there should only be one (although the POP3 proxy and SMTP proxy run in separate threads; same process, though). Perhaps the result of a bug fix? =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 21 21:13:36 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 21 21:14:02 2004 Subject: [Spambayes] 1.0a9 500 Server error (More statistics... link) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305256231@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779BB@its-xchg4.massey.ac.nz> > Clicking for more statistics (http://localhost:8880/stats) gives: [...] > File "/usr/lib/python2.2/shelve.py", line 70, in __getitem__ > f = StringIO(self.dict[key]) > > TypeError: argument must be string or read-only buffer, not int Sorry - this is a known error with 1.0a9 and Python 2.2 (it works with Python 2.3). It's fixed in CVS, so will be fixed in the next release. > What are .pyc files? .py files compiled to Python bytecode. This happens the first time a .py file is used, so that using them is faster next time. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 21 21:15:23 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 21 21:16:20 2004 Subject: [Spambayes] toolbar buttons not working In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13052564BE@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779BC@its-xchg4.massey.ac.nz> > SPam is filtering but tool bar buttons will not relocate mail > to inbox or spam? tool bar drop down items work???? Doug Do you get an error message, or does it just silently fail? Could you send us a copy of your log file(s)? (The Help->Troubleshooting Guide explains where to find them). Does this happen on any mail, or just a particular message? =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 21 21:58:16 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 21 21:58:49 2004 Subject: [Spambayes] 500 server error In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13052564B1@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779BE@its-xchg4.massey.ac.nz> > Please let me know what steps I should take due to this error. > Restarted service, etc., but same result. [...] > 500 Server error > > Traceback (most recent call last): [...] > ExpatError: not well-formed (invalid token): line 1, column 43 I *think* that what's happening is that there's something in your config file that the web interface can't display (like a <, > or &), although I can't think of any options where there would be one of those characters, unless it's the path to the config file itself; I'm not sure if these are correctly turned into entities or not (I suspect not) - there was a recent bug with the imapfilter interface as a result of this. (I'm not sure why this error would suddenly appear, though). Would you be able to send me (offlist, if you like) a copy of your configuration file, and the path to it? (like "C:\My Folder\bayescustomize.ini"). Feel free to edit any private data out of it, as long as you leave any non-alphanumeric characters alone (change all letters to X, or something). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 21 22:04:21 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 21 22:04:51 2004 Subject: [Spambayes] Ham:Spam ratio In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305256369@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2ACE@its-xchg4.massey.ac.nz> > Ok, using OL2k, I've rebuilt the database with 5+5. Now I > want to be clear on what I should do with the Suspects. I > have a suspect now with a score of 47% (limits are 80 and > 15). If I understand, I should move this message to deleted > items and NOT use the Delete As Spam button. Only use the > buttons when ham ends up in Junk or when Spam ends up in Inbox. > > > You'll probably find it's better to train *less spam* than *more > > ham*. > > Based on that, maybe I should also "Recover From Spam" and > ham that ends up in Suspects because I'm likely to get way > more spam in my Inbox than ham in my Junk (due to the big > difference in my actual ham:spam ratio). For the moment, use "Delete As Spam"/"Recover From Spam" for *any* mail that appears in your 'Suspects' folder, and any spam that doesn't get identified, and any ham that ends up in your 'Junk' folder (this last one should be rare). You can probably just continue doing that, and your results should be fine - even though you don't get a balanced amount of mail, the percentage of each ending up in your 'Suspects' folder will probably be much closer to even. If you do find that the imbalance is starting to get large (say > 5::1 or 1::5), then you could move some of the (spam/ham, whichever you have most of) messages that end up in 'Suspects' straight to the 'Deleted Items' folder/your inbox rather than training on them. I suspect that this won't be necessary, though, and the imbalance won't end up getting this high. I hope this is starting to make sense! (There is a lot of ongoing discussion about the best way to train, and how to make that easiest for users, so this should hopefully get simpler as time goes on). > I'm sorry I require so much hand-holding, but I appreciate your time. No worries, glad to help. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 21 22:09:09 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 21 22:09:35 2004 Subject: [Spambayes] Ham:Spam ratio In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130525627C@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779C0@its-xchg4.massey.ac.nz> [Tony Meyer] > You'll probably find it's better to train *less spam* than > *more ham*. [Russ Foster] > I only train spam that is either misclassified to 'unsure'. Are you > recommending just deleting some of those without training? I think it would be worth trying, if it would keep your database balanced, yes. Do you tend to get a lot of similar spam ending up in the 'unsure' folder at the same time? It might be that training on one or two or those would be enough to classify all the rest (and ones arriving in the future) correctly. You can test this by training on a couple and then doing a "Filter Now" on the 'unsure' folder, although that's a rather cumbersome process. Do most of the spam that end up in the 'unsure' folder score about a certain level? If they were mostly > 80%, for example, you could also move the spam threshold down a bit (it's still *very* unlikely that a ham could score over 80%). > I wish I had less spam to train! :) =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From bhecht at sbcglobal.net Sat Feb 21 22:38:41 2004 From: bhecht at sbcglobal.net (bhecht@sbcglobal.net) Date: Sat Feb 21 22:34:03 2004 Subject: [Spambayes] Problem with POP3 Proxy References: Message-ID: <003c01c3f8f5$5d6964d0$0100a8c0@bhecht> When I right click on the SpamBayes tray icon, and select "View Information," in the "Status and Configuration" box at the top of the page I see all zeros, no emails classified. So SpamBayes is not seeing the e-mails. I'm trying to figure out how to configure Outlook Express and SpamBayes to get it to see the emails. In Outlook Express, I select tools | accounts, then select bhecht@sbcglobal.net and click 'properties' then select the 'servers' tab, and it shows me the following configuration items in a dialog box: Incomming mail server is a : POP3 server incoming mail (POP3) localhost outgoing mail (SMTP): smtp.sbcglobal.yahoo.com Incoming mail server: account name: bhecht@sbcglobal.net/pop.sbcglobal.yahoo.com Then if I press the 'Advanced' tab, I see the port configurations: Server Port number: outgoing mail (SMTP): 25 incoming mail: (POP3) 110 In SpamBayes, I only entered two values to configure it, everything else was left blank or left with default values. I right clicked on the SpamBayes tray icon, selected "configure..." and then entered these two values: Remote Servers: pop.sbcglobal.yahoo.com SpamBayes Ports: 110 Thanks again for your help. Bill hecht ----- Original Message ----- From: "Kenny Pitt" To: ; "'papaDoc'" Cc: Sent: Friday, February 20, 2004 12:51 PM Subject: RE: [Spambayes] Problem with POP3 Proxy > bhecht@sbcglobal.net wrote: > > I am using Outlook Express, which is functioning fine, and Windows > > XP. It's just that SpamBayes doesn't seem to be doing anything - not > > seeing or doing anything with the mail. > > First, make sure that you have Outlook Express configured to use > "localhost" as the server instead of going directly to > "pop.sbcglobal.yahoo.com". > > Then, after receiving some e-mails in Outlook Express that don't appear > to be classified, right-click on the SpamBayes tray icon and select > "View information...". In the "Status and Configuration" box at the top > of the page you should see the label "Emails classified this session:". > If all of these values are zero then SpamBayes is not seeing the > e-mails. In that case you should also see "POP3 conversations this > section: 0". This would indicate that you have not properly configured > Outlook Express to use the SpamBayes proxy. > > If the status message indicates that SpamBayes has classified some of > your e-mails (probably all unsure), then the problem is more likely to > be that you haven't configured Outlook Express to filter the messages > that SpamBayes has processed. The SpamBayes proxy can't actually move > any messages for you, it can only add information to the message that > your mail client can filter on. > > OE's filter support is very limited, so you first need to change some > options in SpamBayes. Go to the configuration page and scroll down to > the 3rd box, "Header Options". Next to "Classify in subject header:", > check "unsure" and "spam". This will cause SpamBayes to add the > classification to the beginning of the message subject. After you save > the configuration change, you should immediately start seeing these > classifications appear on your received messages. > > Now you need to set up your Outlook Express filtering rules to move > messages that SpamBayes classifies. You'll need to create two folders > in OE, one for spam messages and one for unsure messages. Then go to > Tools->Message Rules->Mail. Create a new rule and select "Where the > Subject line contains specific words" and "Move it to the specified > folder". In the Rule Description box, click "contains specific words" > and enter "spam". Then click "specified folder" and choose the folder > that you created for spam messages. Name this rule something like > "SpamBayes spam messages" and make sure it is at the top of the message > rule list. Then create a similar rule to look for "unsure" in the > subect and move those to your unsure folder. > > After you've done all that, you should start seeing all of your messages > moved to the Unsure folder. You'll then need to use the SpamBayes web > interface to review the messages and select the proper classification. > Then click the Train button at the bottom to update your training > database. SpamBayes should soon start classifying most of your messages > correctly. You'll then need to review and train only when it gets an > unsure or makes a mistake. > > -- > Kenny Pitt From fred at lucy.com Sun Feb 22 09:41:48 2004 From: fred at lucy.com (Fred Mertz) Date: Sun Feb 22 09:41:57 2004 Subject: [Spambayes] Retraining Message-ID: <200402220941480562.009FE24C@mail.mindspring.com> Is there a way to re-classify messages already used for training? I noticed the messages themselves seem to be stored in the cache folders, but I couldn't figure out if there was a way to get them displayed again in the Review Messages web page for reclassification or some other way to do this. Now that I think I understand a little more about how SpamBayes works, I would rather have discarded a few of the messages I trained on. Thanks. From viatorg at knology.net Sun Feb 22 10:59:17 2004 From: viatorg at knology.net (Gerry Viator) Date: Sun Feb 22 10:59:21 2004 Subject: [Spambayes] Outlook hangs on receiving emails Message-ID: Hello all, My Outlook 2003(home machine using pop3\smtp) hangs on receiving emails with spambayers version "spambayes-1.0a9.exe", I also have MacAfee installed. Is there a trick to get these to work together? Thanks Gerry -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040222/a6e3f084/attachment.html From srajaie at TechDisposal.com Sun Feb 22 13:09:33 2004 From: srajaie at TechDisposal.com (Sepehr Rajaie) Date: Sun Feb 22 13:13:19 2004 Subject: [Spambayes] Help!!! Message-ID: <1D6DC2943661CF4E982B8DCA5EABDCA812526F@tech-ad.techdisposal.com> Dear Sir; I have used your software and am extremely pleased and thankful. The problem I am experiencing is that just recently when I open my outlook 2002 which runs on 2000 platform, does not allow me to open spambayes!! Even though the icon is on my outlook but nothing happens!!?? What am I doing wrong, or is there a setting that's changed in the spambayes? Thanks for your help. Best; Sepp Rajaie -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040222/7b25b9ea/attachment.html From tameyer at ihug.co.nz Sun Feb 22 18:06:51 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Feb 22 18:07:22 2004 Subject: [Spambayes] Retraining In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13052569FF@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AD0@its-xchg4.massey.ac.nz> > Is there a way to re-classify messages already used for > training? I noticed the messages themselves seem to be > stored in the cache folders, but I couldn't figure out if > there was a way to get them displayed again in the Review > Messages web page for reclassification or some other way to > do this. Yes - use the "Find message" query on the front page of the web interface. With 1.0a9 you can search via the SpamBayes ID (the file name), subject, body, or headers. It'll bring up the messages in the standard review page, and if you train any of them, they will automatically be untrained as necessary. > Now that I think I understand a little more about > how SpamBayes works, I would rather have discarded a few of > the messages I trained on. Ah - this is a bit different, though. This is 'untraining', rather than 'retraining'. There isn't any facility exposed by the ui for this at the moment, sorry. Unless you write some Python code and do it that way, the only option is to delete the databases and start from scratch. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sun Feb 22 19:38:58 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Feb 22 19:39:29 2004 Subject: [Spambayes] Help!!! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305256A61@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779C7@its-xchg4.massey.ac.nz> > The problem I am experiencing is that just recently > when I open my outlook 2002 which runs on 2000 platform, > does not allow me to open spambayes!! Even though the > icon is on my outlook but nothing happens!!?? SpamBayes has probably been disabled. In the "Help" menu, choose "About Microsoft Outlook", then click on "Disabled Items", and see if SpamBayes is there (and if it is, re-enable it). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From thetasig at comcast.net Sun Feb 22 22:10:29 2004 From: thetasig at comcast.net (thetasig) Date: Sun Feb 22 22:10:34 2004 Subject: [Spambayes] 500 server error In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13046779BE@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13046779BE@its-xchg4.massey.ac.nz> Message-ID: <40396F25.1010907@comcast.net> Tony - here is the .ini file you asked for. I don't think this is the problem. BTW, I found there were two of these - one in Local Services (user) and one in my user (mlc). Both are in "documents and settings" folder I've attached both with a slight name change. The standard one is probably the one being used as it shows the proxy settings that I use. Nothing personal in either file. Here is what I tried: I deleted all of spambayes software from its directory (not phython23). I deleted all of the application data files (including the .ini files above and the proxy databases, etc. Reinstalled Spambayes 1.0a9. Installed the pop3proxy Windows service after removing the original one. Went to the configuration page and got the same error as originally reported. Tony Meyer wrote: >>Please let me know what steps I should take due to this error. >>Restarted service, etc., but same result. > > [...] > >>500 Server error >> >>Traceback (most recent call last): > > [...] > >>ExpatError: not well-formed (invalid token): line 1, column 43 > > > I *think* that what's happening is that there's something in your config > file that the web interface can't display (like a <, > or &), although I > can't think of any options where there would be one of those characters, > unless it's the path to the config file itself; I'm not sure if these are > correctly turned into entities or not (I suspect not) - there was a recent > bug with the imapfilter interface as a result of this. > > (I'm not sure why this error would suddenly appear, though). > > Would you be able to send me (offlist, if you like) a copy of your > configuration file, and the path to it? (like "C:\My > Folder\bayescustomize.ini"). Feel free to edit any private data out of it, > as long as you leave any non-alphanumeric characters alone (change all > letters to X, or something). > > =Tony Meyer > > --- > Please always include the list (spambayes@python.org) in your replies > (reply-all), and please don't send me personal mail about SpamBayes. This > way, you get everyone's help, and avoid a lack of replies when I'm busy. > > -------------- next part -------------- [Storage] ham_cache:C:\Documents and Settings\LocalService\Application Data\SpamBayes\Proxy\ham_cache messageinfo_storage_file:C:\Documents and Settings\LocalService\Application Data\SpamBayes\Proxy\message_info_database.db persistent_storage_file:C:\Documents and Settings\LocalService\Application Data\SpamBayes\Proxy\statistics_database.db spam_cache:C:\Documents and Settings\LocalService\Application Data\SpamBayes\Proxy\spam_cache unknown_cache:C:\Documents and Settings\LocalService\Application Data\SpamBayes\Proxy\unknown_cache [Headers] notate_subject:unsure spam [pop3proxy] listen_ports:1110,1120,1130,1140,1150 remote_servers:mail.comcast.net,incoming.verizon.net,incoming.verizon.net,mail.chezdudley.com,mail.chezdudley.com -------------- next part -------------- [Storage] ham_cache: C:\Documents and Settings\mlc\Application Data\SpamBayes\Proxy\ham_cache messageinfo_storage_file: C:\Documents and Settings\mlc\Application Data\SpamBayes\Proxy\message_info_database.db persistent_storage_file: C:\Documents and Settings\mlc\Application Data\SpamBayes\Proxy\statistics_database.db spam_cache: C:\Documents and Settings\mlc\Application Data\SpamBayes\Proxy\spam_cache unknown_cache: C:\Documents and Settings\mlc\Application Data\SpamBayes\Proxy\unknown_cache From fred at lucy.com Sun Feb 22 22:14:21 2004 From: fred at lucy.com (Fred Mertz) Date: Sun Feb 22 22:14:31 2004 Subject: [Spambayes] Retraining Message-ID: <200402222214210968.000F9292@mail.mindspring.com> >> Now that I think I understand a little more about >> how SpamBayes works, I would rather have discarded a few of >> the messages I trained on. > >Ah - this is a bit different, though. This is 'untraining', rather than >'retraining'. There isn't any facility exposed by the ui for this at the >moment, sorry. Unless you write some Python code and do it that way, the >only option is to delete the databases and start from scratch. Oh, ok. Before I start from scratch, then, maybe I should ask about spams with hundreds of random or gibberish words. Do these muck up the databases? They seem to be classified ok, and the clues aren't any of these gibberish tokens, but I know I trained on a few of these and it seems like they would skew the statistics. What's the recommendation on this type of spam? From tim.one at comcast.net Sun Feb 22 22:49:57 2004 From: tim.one at comcast.net (Tim Peters) Date: Sun Feb 22 22:50:03 2004 Subject: [Spambayes] Feature idea. In-Reply-To: Message-ID: [John Gagon] > I really love SpamBayes and have sold my whole company on it. It > works great even for those ditzy blondes in reception who tend to > try to unsubscribe to every spam they get in their box. > > I have a suggestion for Spam Bayes. This is regards to the > threshold feature. ie: you can raise or lower the score filtering > criterias. (ie: for Spam/Unsure and Inboxes etc) > > Over time, I would suspect the messages, statistically would create > a "camel" two hump curve. ie: two sets of distributions (I know > there is a more technical term for that in statistics but it slips my > mind atm). The distribution is bimodal, but not really like camel humps. See the third graph at: http://spambayes.sourceforge.net/background.html > Over time, the humps would grow and the minima shift left a little as > more and more clever spams are eliminated to the right side of the > distribution) > > I would suspect the best place to set your thresholds would be > between the ham and spam distribution humps. As the graph shows, "the humps" (modes) are typically at 0.00 (rounded) and 1.00 (rounded), so "between them" is certainly good advice, but also advice impossible not to follow . > Or have your unsure zone be so many points away from that minima. It > would be nice then to have a checkbox to enable automatic adjustment > of the filtering criteria. (ie: over time, mine has gone down from 75% > spam scores and above to 15% and above since I have a large hump > after 15% and a smaller ham hump before the 15% mark. (IOW, the > filter is getting very good and goes lower as it goes. but I'm > having to manually do statistics and adjust the filter so as to get > very good accuracy out of spambayes. I don't see much hope for auto-adjustment: email mixes vary wildly across people; personal tolerances for FP, FN, and Unsure rates vary similarly; and training strategies vary almost as much. The SpamBayes scoring algorithm also systematically scores perfectly ambiguous messages at 0.50 on the nose (which accounts for the third-highest hump, near 0.50 in the graph). I prefer to call those Unsure. Sounds like you prefer to call them Spam. I expect *most* people who don't like Unsure want to call them Ham, to avoid turning an ambiguous msg into a false positive. From gonor at adelphia.net Sun Feb 22 23:15:35 2004 From: gonor at adelphia.net (Gary Norris) Date: Sun Feb 22 23:15:47 2004 Subject: [Spambayes] SpamBayes a wonderful product Message-ID: <000001c3f9c3$afed9db0$26234344@s1099982951> I have a question which I cannot find the answer to on your web site. I installed SpamBayes about a week ago. It seems to be working and learning with less and less errors. What I cannot find on your web site is this question: At what point does the program recognize an e-mail as one which I consider Spam and process it as such? 1. When it is moved from the inbox to the Junk Mail box? 2. When it is moved from the Junk Mail box to the Deleted Files box and then deleted from the system? 3. I think this applies to the suspect mail box as well. I assume if it is moved to a wanted box it is processed as non Junk mail. If it is moved to the Junk Mail it is moved to the Junk Mail box. Gary Norris Yuma, AZ -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040222/08a295ea/attachment.html From mfbonello at centurytel.net Sun Feb 22 23:35:14 2004 From: mfbonello at centurytel.net (Michael Bonello) Date: Sun Feb 22 23:35:22 2004 Subject: [Spambayes] Spambayes with Choicemail Message-ID: Python, At work I have Choicemail. I would like to gravitate to Spambayes eventually. After I installed Spambayes, it does not work. Does Spambayes have to go through the localhost that Choicemail uses? Thanks Please respond to mbonello@morrowhome.org. Michael Bonello -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1496 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040222/2a4e2244/winmail-0001.bin From jneustein at adelphia.net Mon Feb 23 02:41:05 2004 From: jneustein at adelphia.net (jneustein@adelphia.net) Date: Mon Feb 23 02:41:26 2004 Subject: [Spambayes] Problem with POP3 Proxy Message-ID: <20040223074105.KKAL25917.mta9.adelphia.net@ca-vannys-bluewave1d-c-207.vnnyca.adelphia.net> A non-text attachment was scrubbed... Name: SpamBayesServer1.log Type: application/octet-stream Size: 257 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040223/3506d6ef/SpamBayesServer1.obj -------------- next part -------------- I am using SpamBayes POP3 Proxy Version 0.4 (February 2004) (binary), with version 2.3.2+ (#49, Oct 23 2003, 15:50:06) [MSC v.1200 32 bit (Intel)] of Python; my operating system is Windows 4.10.67766446.1 ( A ). I have trained 0 ham and 0 spam. The problem I am having is that the software tells me there are NO untrained messages, even though there are ten or twenty. From tameyer at ihug.co.nz Mon Feb 23 02:45:51 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 23 02:46:22 2004 Subject: [Spambayes] Problem with POP3 Proxy In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305256BC3@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779CE@its-xchg4.massey.ac.nz> > The problem I am having is > that the software tells me there are NO untrained messages, > even though there are ten or twenty. How do you know that there are ten or twenty? Are there that many files in the unknown cache directory? Or is that just how much mail you have received? Are you certain that your mail client is connecting to SpamBayes (localhost) rather than directly to your mail server? If it is using SpamBayes then incoming messages will have some X-Spambayes-* headers added to them. Have you tried forcing the browser to do a refresh of the page (hold down shift)? What browser are you using? What mail client are you using? (Outlook Express? Eudora? Mail.app?) =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From alastair.sharp at dsl.pipex.com Mon Feb 23 03:24:33 2004 From: alastair.sharp at dsl.pipex.com (Alastair Sharp) Date: Mon Feb 23 03:24:38 2004 Subject: [Spambayes] Problem Message-ID: After a major system crash, I have recovered my Outlook which is now working correctly. I have the SpamBayes toolbar, but SpamBayes Manager is not correctly configured - the folders show as "unknown" When I try to start the Configuration Wizard to fix this, nothing at all happens! Everything else seems OK, but, of course, it isn't working properly! I have read the FAQs, but nothing appears to be relevant. I have also picked up the latest version and installed it. Can you help? Best regards, Alastair Sharp E-mail : alastair.sharp@dsl.pipex.com Web sites : www.macpherson.dsl.pipex.com & www.systegration.ltd.uk --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.593 / Virus Database: 376 - Release Date: 20/02/2004 From tameyer at ihug.co.nz Mon Feb 23 03:32:42 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 23 03:33:17 2004 Subject: [Spambayes] Problem In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305256BDB@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779CF@its-xchg4.massey.ac.nz> > I have the SpamBayes toolbar, but SpamBayes Manager is not > correctly configured - the folders show as "unknown" When I > try to start the Configuration Wizard to fix this, > nothing at all happens! Everything else seems OK, but, of > course, it isn't working properly! > I have also picked up the latest version and installed it. Your log file(s) would really help here. The troubleshooting guide (a copy is installed with the software, or there's an electronic copy on our website) explains where to find them. Did you uninstall the old version before installing the new one? 1.0a9 fixes a couple of bugs with situations like this, so that the config wizard should still work (although it's hard to tell if this is the same without the log). To completely reset your configuration, simply rename/remove the file "[profile name].ini" in your data directory (it might be called "Outlook.ini"). The FAQ explains where the data directory is. This won't lose any training that you have done. =Tony Meyer From spambayesspambayes at MSN.COM Mon Feb 23 03:39:27 2004 From: spambayesspambayes at MSN.COM (Linette Keller) Date: Mon Feb 23 03:39:02 2004 Subject: [Spambayes] spambayes: GV-Pr0ma#x 1s Ten^cent Vigr#a, the chem1cal equ1valent of V|lgra Message-ID: An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040223/ac6fec0a/attachment.html From motif1 at aol.com Mon Feb 23 09:59:20 2004 From: motif1 at aol.com (motif1@aol.com) Date: Mon Feb 23 09:59:18 2004 Subject: [Spambayes] fake Message-ID: that is bad -------------- next part -------------- A non-text attachment was scrubbed... Name: misc.zip Type: application/x-zip-compressed Size: 0 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040223/bc5a7d92/misc.bin From Darrell.Cyphers at tetratech.com Mon Feb 23 11:25:43 2004 From: Darrell.Cyphers at tetratech.com (Cyphers, Darrell -- Tt, Inc.) Date: Mon Feb 23 11:42:07 2004 Subject: [Spambayes] Unable to delete as spam Message-ID: <5A6EB43BF868D5118B51000347713BC7046B8B2B@TTMSX1> I have been using the program for several months, and it has been working beautifully with 2 exceptions. 1) Whenever a Windows upgrade is installed, the addin fails to load with the next startup of Outlook; 2) This morning, for the first time, selecting a message and clicking "Delete as Spam" did absolutely nothing. I re-installed the addin twice to no avail. I read the troubleshooting section, but it is technically overy my head. I don't know what "the log file for this session" is, and I don't know what an "installation type" is. So, I'm e-mailing you instead. Do you have any solutions? Darrell Cyphers Tetra Tech, Inc. 348 W. Hospitality Lane, Suite 100 San Bernardino, CA 92408 Office Phone 909-381-1674 (218) Office Fax 909-889-1391 E-mail (darrell.cyphers@tetratech.com) > PLEASE NOTE: This message, including any attachments, may include > privileged, confidential and/or inside information. Any distribution or > use of this communication by anyone other than the intended recipient is > strictly prohibited and may be unlawful. If you are not the intended > recipient, please notify the sender by replying to this message and then > delete it from your system. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040223/372d04d3/attachment.html From paul at paulconsults.com Mon Feb 23 12:08:09 2004 From: paul at paulconsults.com (Paul Mansfield) Date: Mon Feb 23 12:16:32 2004 Subject: [Spambayes] Spambayes and Outlook 2003 Message-ID: <200402231708.i1NH888q086495@taka.swcp.com> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: ToSpamBayes.zip Type: application/x-zip-compressed Size: 145618 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040223/334c97dc/ToSpamBayes-0001.bin From Tracey.Snow at gov.ab.ca Mon Feb 23 12:40:14 2004 From: Tracey.Snow at gov.ab.ca (Tracey Snow) Date: Mon Feb 23 12:38:14 2004 Subject: [Spambayes] un-spamming Message-ID: <04Feb23.103821mst.328528@gateway.just.gov.ab.ca> Hi, I have deleted an e-mail address as Spam that I don't want to be Spam at all. How can I un-Spam it? Tracey Snow From rvaldez1 at pacbell.net Mon Feb 23 13:02:26 2004 From: rvaldez1 at pacbell.net (Robert) Date: Mon Feb 23 13:01:14 2004 Subject: [Spambayes] Large Files Message-ID: Hello - My SpamBayes is reading large files with attachments as Junk (+1MB). How do I reset SpamBayes so that I can receive these large files, with their large attachments? Thank you! -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040223/c00959b3/attachment.html From Jonas.Schwartz at disney.com Mon Feb 23 13:59:51 2004 From: Jonas.Schwartz at disney.com (Schwartz, Jonas) Date: Mon Feb 23 14:00:00 2004 Subject: [Spambayes] Question about training Spambayes Message-ID: Hello there, Quick question. If it train spambayes on one machine, is there a way to export that training coding and import it onto another computer utilizing spambayes? thank you jds Jonas Schwartz jonas.schwartz@disney.com 818 560 3332 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040223/63f89a62/attachment.html From JEppolito at AirLogix.com Mon Feb 23 15:03:20 2004 From: JEppolito at AirLogix.com (James Eppolito) Date: Mon Feb 23 15:03:33 2004 Subject: [Spambayes] Spambayes and Outlook 98 Message-ID: <879C12CDFDEED611B7D000065BA369A2017F11FB@ntserver.airlogix.com> Hi, We use Spambayes on three or four pc's in our organization and it is great. I have just tried to install it on and XP box running Outlook 98. The component loads but I cannot access the Spambayes Manager. The tool bar appears except it is about ? inches X ? inches. Anyway, would using the source code files have any effect on changing the way the GUI loads in Outlook? Do you have any other suggestions (I have already downloaded and went through your Outlook troubleshooting doc)? Thanks for your help. James Eppolito "Computing: an exact science with infinite variables" je '01 James Eppolito IT Support, Airlogix Ft. Worth extension 6506 Dallas extension 2275 Office 817-887-6506 Pager 972-879-0762 jeppolito@airlogix.com This message is intended for the sole use of the individual and entity to whom it is addressed, and may contain information, including any attachments, that is privileged, confidential and exempt from disclosure under applicable law. If you are not the intended addressee, nor authorized to receive for the intended addressee, you are hereby notified that you may not use, copy, disclose or distribute to anyone the message or any information contained in the message. If you have received this electronic transmission in error, please notify the sender immediately by a "reply to sender only" message and destroy all electronic and hard copies of the communication, including attachments. Thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040223/69530484/attachment.html From tim.one at comcast.net Mon Feb 23 16:05:10 2004 From: tim.one at comcast.net (Tim Peters) Date: Mon Feb 23 16:05:31 2004 Subject: [Spambayes] Spambayes and Outlook 98 In-Reply-To: <879C12CDFDEED611B7D000065BA369A2017F11FB@ntserver.airlogix.com> Message-ID: [James Eppolito] > We use Spambayes on three or four pc's in our organization and it > is great. I have just tried to install it on and XP box running > Outlook 98. The component loads but I cannot access the Spambayes > Manager. ... I expect you're talking about the Outlook addin form of SpamBayes, and then this surprises me: the addin can't even begin to work with Outlook 98 (only with Outlook 2000 and later; the object model in Outlook 98 is so different from "modern" Outlooks that it would require different code all over the place). From skip at pobox.com Mon Feb 23 16:05:24 2004 From: skip at pobox.com (Skip Montanaro) Date: Mon Feb 23 16:05:40 2004 Subject: [Spambayes] Frequency distribution for wordinfo counts? In-Reply-To: <402E3D16.2902.14343380@localhost> References: <402E3D16.2902.14343380@localhost> Message-ID: <16442.27412.37852.84106@montanaro.dyndns.org> Brad> I'd like to get feedback from folks on the distribution of nham Brad> and nspam counts in their wordinfo databases. ... Brad> Anyway, what I'm interested in is the number of tokens whose nspam Brad> or nham count is greater than 255 vs the total number of tokens Brad> and ham and spam count. I'm coming into this late, but thought I'd post my numbers. As far as I know, I'm the only person using train-to-exhaustion at the moment. That probably skews my numbers, so maybe they'll be of interest. When I run tte.py it reports 764 each of ham and spam (it always trains in pairs and ignores any extras in the larger corpus). When I ask spamcounts for the nham and nspam values from the database, it reports 301 and 237, respectively. Some messages are never trained on since they score correctly from the get-go. Other messages are trained on multiple times, currently none more than four times. My current tte.db file is 2.6MB. Appended are my counts/per cents. Skip ------------------------------------------------------------------------------ ham dist count freq % 0 22140 48.09907 1 18410 39.99566 2 2456 5.33565 3 935 2.03128 4 501 1.08842 5 295 0.64089 6 248 0.53878 7 162 0.35194 8 124 0.26939 9 89 0.19335 10 73 0.15859 11 65 0.14121 12 64 0.13904 13 38 0.08255 14 37 0.08038 15 23 0.04997 16 30 0.06517 17 24 0.05214 18 20 0.04345 19 20 0.04345 20 13 0.02824 21 12 0.02607 22 6 0.01303 23 12 0.02607 24 15 0.03259 25 14 0.03041 26 11 0.02390 27 7 0.01521 28 11 0.02390 29 10 0.02172 30 1 0.00217 31 9 0.01955 32 6 0.01303 33 2 0.00434 34 4 0.00869 35 4 0.00869 36 3 0.00652 37 2 0.00434 38 3 0.00652 39 5 0.01086 40 3 0.00652 41 3 0.00652 42 3 0.00652 43 4 0.00869 44 3 0.00652 45 8 0.01738 46 4 0.00869 47 4 0.00869 48 1 0.00217 49 1 0.00217 50 3 0.00652 51 2 0.00434 52 3 0.00652 53 2 0.00434 54 2 0.00434 55 2 0.00434 56 4 0.00869 57 5 0.01086 58 2 0.00434 59 1 0.00217 60 1 0.00217 61 1 0.00217 63 1 0.00217 64 5 0.01086 65 1 0.00217 66 1 0.00217 67 1 0.00217 69 1 0.00217 70 1 0.00217 72 2 0.00434 74 1 0.00217 75 1 0.00217 76 3 0.00652 77 1 0.00217 78 1 0.00217 79 1 0.00217 81 1 0.00217 90 1 0.00217 92 1 0.00217 94 2 0.00434 98 1 0.00217 103 2 0.00434 104 1 0.00217 105 1 0.00217 107 1 0.00217 109 1 0.00217 114 1 0.00217 122 1 0.00217 131 1 0.00217 132 1 0.00217 138 2 0.00434 139 1 0.00217 143 1 0.00217 152 2 0.00434 157 1 0.00217 168 1 0.00217 179 1 0.00217 189 1 0.00217 194 1 0.00217 212 2 0.00434 217 1 0.00217 220 1 0.00217 221 4 0.00869 224 1 0.00217 226 1 0.00217 233 1 0.00217 236 2 0.00434 237 3 0.00652 spam dist count freq % 0 17009 36.95199 1 23725 51.54247 2 2316 5.03150 3 875 1.90093 4 484 1.05149 5 315 0.68434 6 231 0.50185 7 140 0.30415 8 107 0.23246 9 116 0.25201 10 69 0.14990 11 77 0.16728 12 45 0.09776 13 37 0.08038 14 33 0.07169 15 32 0.06952 16 22 0.04779 17 34 0.07386 18 16 0.03476 19 27 0.05866 20 12 0.02607 21 17 0.03693 22 17 0.03693 23 12 0.02607 24 11 0.02390 25 9 0.01955 26 11 0.02390 27 11 0.02390 28 9 0.01955 29 6 0.01303 30 10 0.02172 31 8 0.01738 32 8 0.01738 33 2 0.00434 34 2 0.00434 35 4 0.00869 36 8 0.01738 37 4 0.00869 38 9 0.01955 39 4 0.00869 40 1 0.00217 41 4 0.00869 42 3 0.00652 43 3 0.00652 44 6 0.01303 45 3 0.00652 46 2 0.00434 47 5 0.01086 48 7 0.01521 49 4 0.00869 50 8 0.01738 51 3 0.00652 52 2 0.00434 53 3 0.00652 54 3 0.00652 55 3 0.00652 57 4 0.00869 59 1 0.00217 60 2 0.00434 61 3 0.00652 62 1 0.00217 63 2 0.00434 64 1 0.00217 66 2 0.00434 67 1 0.00217 69 1 0.00217 70 3 0.00652 71 1 0.00217 72 1 0.00217 73 1 0.00217 74 1 0.00217 75 1 0.00217 77 2 0.00434 78 1 0.00217 81 2 0.00434 82 2 0.00434 83 1 0.00217 85 1 0.00217 90 1 0.00217 105 1 0.00217 109 2 0.00434 111 1 0.00217 112 2 0.00434 113 1 0.00217 115 2 0.00434 120 1 0.00217 121 1 0.00217 125 2 0.00434 128 1 0.00217 130 1 0.00217 132 1 0.00217 135 1 0.00217 155 1 0.00217 159 1 0.00217 168 2 0.00434 171 1 0.00217 173 1 0.00217 184 1 0.00217 185 2 0.00434 190 1 0.00217 197 1 0.00217 202 2 0.00434 213 1 0.00217 226 1 0.00217 234 1 0.00217 250 1 0.00217 259 1 0.00217 260 1 0.00217 263 1 0.00217 283 1 0.00217 284 1 0.00217 286 1 0.00217 287 4 0.00869 296 1 0.00217 299 2 0.00434 301 3 0.00652 From AHalling at aol.com Mon Feb 23 16:31:00 2004 From: AHalling at aol.com (AHalling@aol.com) Date: Mon Feb 23 16:34:59 2004 Subject: [Spambayes] (no subject) Message-ID: <81.5f3ab6c.2d6bcb14@aol.com> Does it work with aol? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040223/525a4105/attachment.html From davisda at health.missouri.edu Mon Feb 23 16:51:00 2004 From: davisda at health.missouri.edu (David Davis) Date: Mon Feb 23 16:52:51 2004 Subject: [Spambayes] Addin doesn't load Message-ID: <403A75C4.9060801@health.missouri.edu> No Anti-spam item in toolbar. Running binary version.When select tools - options from outlook there is no tab labelled 'other' so can't click 'advanced' button. I have windows 98. on IBM. What do I do?In windows\temp there is spambayes1.log. From tim at fourstonesExpressions.com Mon Feb 23 17:07:51 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Mon Feb 23 17:07:58 2004 Subject: [Spambayes] Addin doesn't load In-Reply-To: <403A75C4.9060801@health.missouri.edu> References: <403A75C4.9060801@health.missouri.edu> Message-ID: Windows 98 and Outlook 98 are not supported by Spambayes. On Mon, 23 Feb 2004 15:51:00 -0600, David Davis wrote: > No Anti-spam item in toolbar. Running binary version.When select tools - > options from outlook there is no tab labelled 'other' so can't click > 'advanced' button. I have windows 98. on IBM. What do I do?In > windows\temp there is spambayes1.log. > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > -- Exprimez vous!; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From kurtbuff at spro.net Mon Feb 23 18:13:35 2004 From: kurtbuff at spro.net (Kurt) Date: Mon Feb 23 18:14:21 2004 Subject: [Spambayes] POP3 proxy blew up while trying to train Message-ID: <000101c3fa62$bf9a3800$3f05a8c0@bfgapollo1> Was trying to train a buncha messages, and it's not happy any more. I've looked at the troubleshooting doc, and the rest of the web site, but didn't find any happiness. Any thoughts? Here's the web output: ----------snip---------- 500 Server error Traceback (most recent call last): File "spambayes\Dibbler.pyc", line 461, in found_terminator File "spambayes\ProxyUI.pyc", line 386, in onReview File "spambayes\Corpus.pyc", line 187, in takeMessage File "spambayes\FileCorpus.pyc", line 140, in addMessage File "spambayes\Corpus.pyc", line 133, in addMessage File "spambayes\storage.pyc", line 581, in onAddMessage File "spambayes\storage.pyc", line 589, in train File "spambayes\classifier.pyc", line 273, in learn File "spambayes\classifier.pyc", line 384, in _add_msg File "spambayes\storage.pyc", line 281, in _wordinfoset File "shelve.pyc", line 130, in __setitem__ File "bsddb\__init__.pyc", line 120, in __setitem__ DBRunRecoveryError: (-30982, 'DB_RUNRECOVERY: Fatal error, run database recovery -- fatal region error detected; run recovery') ----------snip---------- "I heard someone tried the monkeys-on-typewriters bit trying for the plays of W. Shakespeare, but all they got was the collected works of Francis Bacon." - Bill Hirst From tameyer at ihug.co.nz Mon Feb 23 18:17:32 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 23 18:18:16 2004 Subject: [Spambayes] Frequency distribution for wordinfo counts? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305361F01@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779D0@its-xchg4.massey.ac.nz> > I'm coming into this late, but thought I'd post my numbers. > As far as I know, I'm the only person using > train-to-exhaustion at the moment. That probably skews my > numbers, so maybe they'll be of interest. How well is this working for you? Is it really slow? Do you have it set to only use a subset of mail, or is it training-to-exhaustion on the whole lot? =Tony Meyer From tameyer at ihug.co.nz Mon Feb 23 18:19:39 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 23 18:20:08 2004 Subject: [Spambayes] (no subject) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305361F02@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779D1@its-xchg4.massey.ac.nz> > Does it work with aol? Do you collect your mail via POP3 or IMAP4 (probably in some sort of regular mail client like Outlook Express, Eudora, and so on), or is it some sort of proprietory AOL thing? If the former, yes, the latter, probably no. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 23 18:21:14 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 23 18:22:02 2004 Subject: [Spambayes] Question about training Spambayes In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305361EFD@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779D2@its-xchg4.massey.ac.nz> > Quick question. If it train spambayes on one machine, > is there a way to export that training coding and > import it onto another computer utilizing spambayes? Sure. Instructions here (FAQ 3.6): (This is for Outlook; if you're using something else, then just copy the two database files, and you'll be fine). =Tony Meyer From tameyer at ihug.co.nz Mon Feb 23 18:23:53 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 23 18:24:33 2004 Subject: [Spambayes] un-spamming In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305361EB8@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779D3@its-xchg4.massey.ac.nz> > I have deleted an e-mail address as Spam that I don't want to > be Spam at all. How can I un-Spam it? If you're using the Outlook plug-in, then find the message (in your 'spam' folder, presumably), select it, and click the "Recover from spam" button. If you've already deleted the message, then you'll have to either start training from scratch, or just watch for the next time a message from that person arrives, and train on it then (which should cancel the email address tokens, if nothing else). If you're using sb_server, then use the "Find Message" query on the front page of the web interface to find the message, then train it as Ham. Again, if the message is gone (by default, this will be after 7 days), then you'll have to retrain/train on the next one. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 23 18:27:41 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 23 18:29:02 2004 Subject: [Spambayes] Spambayes and Outlook 2003 In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305361EAF@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779D4@its-xchg4.massey.ac.nz> > I've dowloaded this: spambayes-1.0a9.exe and installed. > I'm getting errors at startup and Outlook is hanging. Are you upgrading from a previous version of spambayes, or is 1.0a9 the first? Are any log files created (the troubleshooting guide - an online copy is on the documentation page of the website - explains where to find them)? If so, sending those in would really help. You could also try deleting your configuration, and configuring again (you won't lose any training that you've done). This is in a file called "[profile name].ini", or "Outlook.ini" in the data directory (in your data directory; the FAQ explains where to find that). > See attached errors from screen shots. By the way, it'd be much nicer to just have a description of the errors than screen shots in a rtf file in a zip archive; attachments are really hard to trust these days. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tim.one at comcast.net Mon Feb 23 18:32:32 2004 From: tim.one at comcast.net (Tim Peters) Date: Mon Feb 23 18:32:41 2004 Subject: [Spambayes] Addin doesn't load In-Reply-To: Message-ID: [Tim Stone] > Windows 98 and Outlook 98 are not supported by Spambayes. Windows 98 is fully supported. Outlook 98 isn't supported at all by the Outlook addin. [David Davis] >> No Anti-spam item in toolbar. Running binary version.When select >> tools - options from outlook there is no tab labelled 'other' so >> can't click 'advanced' button. I have windows 98. on IBM. What do I >> do?In windows\temp there is spambayes1.log. Send another msg with spambayes1.log. From tim at fourstonesExpressions.com Mon Feb 23 19:00:23 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Mon Feb 23 19:00:35 2004 Subject: [Spambayes] Addin doesn't load In-Reply-To: References: Message-ID: On Mon, 23 Feb 2004 18:32:32 -0500, Tim Peters wrote: > [Tim Stone] >> Windows 98 and Outlook 98 are not supported by Spambayes. > > Windows 98 is fully supported. Outlook 98 isn't supported at all by the > Outlook addin. Right. I stand corrected. I knew that it was just a test.... -- Exprimez vous!; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From kurtbuff at spro.net Mon Feb 23 20:04:21 2004 From: kurtbuff at spro.net (Kurt) Date: Mon Feb 23 20:04:27 2004 Subject: [Spambayes] RE: POP3 proxy blew up while trying to train - Uh, never mind... Message-ID: <000b01c3fa72$23ae3120$3f05a8c0@bfgapollo1> Replying to my own message: I simply exited SpamBayes, renamed hammie.db, and restarted it. It trained 668 messages just fine after that. However, I noticed while doing this that I have 5162 hams cached, and only 577 spams cached. I train on everything, which may not be the best strategy, but it's the easiest I can see. As a matter of hygiene, would it make a difference if I started cleaning out the ham cache, to bring it in to line with the size of the spam cache? If that doesn't make a difference, is there a relatively easy way to implement one of the other strategies? -----Original Message----- From: Kurt [mailto:kurtbuff@spro.net] Sent: Monday, February 23, 2004 15:14 To: 'spambayes@python.org' Subject: POP3 proxy blew up while trying to train Was trying to train a buncha messages, and it's not happy any more. I've looked at the troubleshooting doc, and the rest of the web site, but didn't find any happiness. Any thoughts? Here's the web output: ----------snip---------- 500 Server error Traceback (most recent call last): File "spambayes\Dibbler.pyc", line 461, in found_terminator File "spambayes\ProxyUI.pyc", line 386, in onReview File "spambayes\Corpus.pyc", line 187, in takeMessage File "spambayes\FileCorpus.pyc", line 140, in addMessage File "spambayes\Corpus.pyc", line 133, in addMessage File "spambayes\storage.pyc", line 581, in onAddMessage File "spambayes\storage.pyc", line 589, in train File "spambayes\classifier.pyc", line 273, in learn File "spambayes\classifier.pyc", line 384, in _add_msg File "spambayes\storage.pyc", line 281, in _wordinfoset File "shelve.pyc", line 130, in __setitem__ File "bsddb\__init__.pyc", line 120, in __setitem__ DBRunRecoveryError: (-30982, 'DB_RUNRECOVERY: Fatal error, run database recovery -- fatal region error detected; run recovery') ----------snip---------- "I heard someone tried the monkeys-on-typewriters bit trying for the plays of W. Shakespeare, but all they got was the collected works of Francis Bacon." - Bill Hirst From skip at pobox.com Mon Feb 23 20:04:40 2004 From: skip at pobox.com (Skip Montanaro) Date: Mon Feb 23 20:04:49 2004 Subject: [Spambayes] Frequency distribution for wordinfo counts? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13046779D0@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1305361F01@its-xchg4.massey.ac.nz> <1ED4ECF91CDED24C8D012BCF2B034F13046779D0@its-xchg4.massey.ac.nz> Message-ID: <16442.41768.971186.83496@montanaro.dyndns.org> >> I'm coming into this late, but thought I'd post my numbers. As far >> as I know, I'm the only person using train-to-exhaustion at the >> moment. That probably skews my numbers, so maybe they'll be of >> interest. Tony> How well is this working for you? Is it really slow? Do you have Tony> it set to only use a subset of mail, or is it Tony> training-to-exhaustion on the whole lot? Seems to work pretty well. Here's a run I did just now: % python ~/tmp/spambayes/contrib/tte.py -g newham.clean.save -s newspam.clean.save -d tte.db round: 1, msgs: 770, ham misses: 196, spam misses: 244, 67.7s round: 2, msgs: 770, ham misses: 33, spam misses: 55, 49.4s round: 3, msgs: 770, ham misses: 8, spam misses: 5, 33.1s round: 4, msgs: 770, ham misses: 0, spam misses: 0, 28.6s 1 untrained spams Adding up the last column indicates a total run time of about three minutes. I can live with that. Note that even though I fed it 770 messages, only 541 messages (some of them were duplicates) actually contributed to the final database: % spamcounts -d tte.db 'saved state' db: tte.db token,nspam,nham,spam prob saved state,304,237,0.5 The database thus winds up smaller than it would be with a more usual training approach. Skip From tim at fourstonesExpressions.com Mon Feb 23 20:14:59 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Mon Feb 23 20:15:15 2004 Subject: [Spambayes] RE: POP3 proxy blew up while trying to train - Uh, never mind... In-Reply-To: <000b01c3fa72$23ae3120$3f05a8c0@bfgapollo1> References: <000b01c3fa72$23ae3120$3f05a8c0@bfgapollo1> Message-ID: On Mon, 23 Feb 2004 17:04:21 -0800, Kurt wrote: > As a matter of hygiene, would it make a difference if I started cleaning > out the ham cache, to bring it in to line with the size of the spam > cache? Generally speaking, a more balanced ham/spam ratio gives better results. If it's working satisfactorily for you, then don't worry about it. -- Exprimez vous!; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From tameyer at ihug.co.nz Mon Feb 23 21:09:23 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 23 21:10:22 2004 Subject: [Spambayes] RE: [spambayes-dev] Urgent, please help In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305361FB2@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779D5@its-xchg4.massey.ac.nz> [Firstly, please note that the spambayes-dev list is for discussion about the development of spambayes. This sort of message is appropriate for the spambayes@python.org list]. > There's one problem that I have found out, the Spambayes > filters my inbox emails before my Outlook(2003) client rules > are applied, is there a way to check new emails with my own > outlook rules before the SpamBayes filter applies? Yes. Use the "background filtering" option on the Advanced Tab of the Manager dialog. IIRC, this is turned on by default in the latest release - 1.0a9 (0.9). =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 23 21:15:18 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 23 21:16:01 2004 Subject: [Spambayes] RE: POP3 proxy blew up while trying to train - Uh, never mind... In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305361F93@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AD2@its-xchg4.massey.ac.nz> > Replying to my own message: > > I simply exited SpamBayes, renamed hammie.db, and restarted it. Yes, your database was corrupted. If you can do this repeatedly, then we would be *very* interested to hear how (and which version you are using). We see occasional reports of this, but haven't been able to track down all the causes, as yet. > However, I noticed while doing this that I have 5162 hams > cached, and only 577 spams cached. I train on everything, > which may not be the best strategy, but it's the easiest I can see. Try training on mistakes, which typically does better than train-on-everything. IOW, just train on unsures, false positives and false negatives. With 1.0a9, you can set the default values of the radio buttons in the review pages to make this easier (Ham->Discard, Unsure->Defer, Spam->Discard, for example). > As a matter of hygiene, would it make a difference if I > started cleaning out the ham cache, to bring it in to line > with the size of the spam cache? Note that it doesn't matter how many files are in those directories - they're moved there once they are trained, and not used afterwards (unless you correct training). If you want to undo training, the only way to do this with sb_server at the moment is to rename/remove your hammie.db file. > If that doesn't make a difference, is there a relatively easy > way to implement one of the other strategies? 1.0a9 also has some new options to help with "nonedge" training, where you train everything inside certain edges (say 0.05 - 0.95). You can set the review page to display only messages within these ranges. From most reports, nonedge or mistake-based-training works best*. =Tony Meyer * Ignoring train-to-exhaustion, which sb_server isn't setup for. --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 23 21:23:22 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 23 21:24:01 2004 Subject: [Spambayes] Unable to delete as spam In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305361E98@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AD3@its-xchg4.massey.ac.nz> > I have been using the program for several months, > and it has been working beautifully with 2 exceptions. > 1) Whenever a Windows upgrade is installed, the addin > fails to load with the next startup of Outlook; This is a decision made by Outlook, that AFAIK, we have no control over. If you have other plug-ins installed, they're probably disabled, too. > 2) This morning, for the first time, selecting a > message and clicking "Delete as Spam" did absolutely > nothing. I re-installed the addin twice to no avail. Is this just with that one message, or all messages now? Does the SpamBayes drop-down menu still work, or is that broken, too? The add-in might be disabled (Help->About Microsoft Outlook->Disabled Items), or something else might be wrong. > I read the troubleshooting section, but it is > technically overy my head. It would really help us (if you have the time) if you could outline all the bits that don't make sense. We'd like to make the documentation as simple to follow as possible, but we really need users' help to do that. > I don't know what "the log file for this session" is, Each time SpamBayes runs, a log file is created. It includes various information that helps us figure out what is going wrong, when something goes wrong. In the "check the log file" section, it explains where this file would be stored (it differs depending on your Windows version). > and I don't know what an "installation type" is. So, To install SpamBayes, you downloaded a file called "spambayes-1.0a9.exe", and double-clicked it, yes? That means you are running the binary version. If you instead downloaded a file called "spambayes-1.0a9.zip", and one called "PythonX.exe", and double-clicked a file called "addin.py", then you're using the source version (this is unlikely). The guide says this: "If you are running from Python source code, and installed Python, plus SpamBayes as separate components, then you are running the source code version. If you downloaded an installer .EXE file, then you are running the binary version.". Any suggestions on how could we make this clearer? (We really are interested in making it as simple to understand as possible. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Mon Feb 23 21:28:16 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Feb 23 21:28:53 2004 Subject: [Spambayes] Frequency distribution for wordinfo counts? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305361F94@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AD4@its-xchg4.massey.ac.nz> [Training to exhaustion] > Seems to work pretty well. Here's a run I did just now: > > % python ~/tmp/spambayes/contrib/tte.py -g > newham.clean.save -s newspam.clean.save -d tte.db > round: 1, msgs: 770, ham misses: 196, spam misses: 244, 67.7s > round: 2, msgs: 770, ham misses: 33, spam misses: 55, 49.4s > round: 3, msgs: 770, ham misses: 8, spam misses: 5, 33.1s > round: 4, msgs: 770, ham misses: 0, spam misses: 0, 28.6s > 1 untrained spams How did these 770 messages get selected? Is this a batch of recently arrived mail, or some sort of pre-selected training collection? Did tte.db exist before this? > Adding up the last column indicates a total run time of about > three minutes. I can live with that. How often do you tend to run this? [...] > The database thus winds up smaller than it would be with a > more usual training approach. Although slightly larger than mistake-based-training (541 instead of 440), but presumably more accurate as well. =Tony Meyer From Amir_Katz at bmc.com Tue Feb 24 03:34:56 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Tue Feb 24 03:36:26 2004 Subject: [Spambayes] Problem with SpamBayes & Outlook Express Message-ID: After using SB successfully at work [Outlook plug-in] for a few weeks, I installed SB at my home PC to be used with Outlook Express (OE). Getting messages via the POP3 proxy works great. However, sending large messages via the SMTP proxy takes a long time and then it times out. Short messages go out fine. (large means > 1MB). My questions are: 1) Why do I need the SMTP proxy? Since I don't send spam, and assuming that I don't train SB by sending to the fake spambayes_spam@localhost , why do I need it? 2) Are there any settings either in SB or in OE to overcome this? In OE, I tried to increase the server wait time to 4 minutes and split large messages, but to no avail. Details: Windows XP, McAfee VirusScan running Thanks, Amir "The C Programming Language - A language which combines the flexibility of assembly language with the power of assembly language." -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040224/a4638dc1/attachment.html From tim at fourstonesExpressions.com Tue Feb 24 08:07:13 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Tue Feb 24 08:07:19 2004 Subject: [Spambayes] Problem with SpamBayes & Outlook Express In-Reply-To: References: Message-ID: On Tue, 24 Feb 2004 02:34:56 -0600, Katz, Amir wrote: > 1) Why do I need the SMTP proxy? Since I don't send spam, and assuming > that You don't. It's an alternate training interface, for those who wish to send their spam to an address for training rather than use the browser interface. Have your mailer use your remote smtp server directly, and all will be well. -- Exprimez vous!; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From profitfaucet at charter.net Tue Feb 24 09:20:27 2004 From: profitfaucet at charter.net (Lloyd Wilkinson) Date: Tue Feb 24 09:25:48 2004 Subject: [Spambayes] Security Risk Message-ID: <000201c3fae1$5a93d810$0201000a@lloyd> How can I be assured that spambayes is not a Trojan, worm, virus or other nasty security risk to my machine? Lloyd T. Wilkinson Y2Marketing Principle Consultant Inc. 500 Magazine Ranked Us 42nd Fastest Growing Business in 2002. 28th Fastest in 2003 "We Practice What We Teach" -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040224/1df6bcb2/attachment.html From rcoe at CambridgeMA.GOV Tue Feb 24 09:33:37 2004 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Tue Feb 24 09:33:46 2004 Subject: [Spambayes] RE: Security Risk Message-ID: <42393C9DA7930245AB540667607F4F5022C279@SPIKE.city> How can we be assured that your message isn't a Trojan, worm, virus, or other nasty security risk? After all, it scored 15% in my Outlook inbox and got thrown into my "possible spam" folder. BTW, someone who calls himself a "Principal Consultant" ought to be able to spell it. Or are you a consultant on marketing principles? Bob -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Lloyd Wilkinson Sent: Tuesday, February 24, 2004 9:20 AM To: spambayes@python.org Subject: [Spambayes] Security Risk How can I be assured that spambayes is not a Trojan, worm, virus or other nasty security risk to my machine? Lloyd T. Wilkinson Y2Marketing Principle Consultant Inc. 500 Magazine Ranked Us 42nd Fastest Growing Business in 2002. 28th Fastest in 2003 "We Practice What We Teach" -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040224/352d6ae9/attachment-0001.html From tim at fourstonesExpressions.com Tue Feb 24 09:33:39 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Tue Feb 24 09:33:50 2004 Subject: [Spambayes] Security Risk In-Reply-To: <000201c3fae1$5a93d810$0201000a@lloyd> References: <000201c3fae1$5a93d810$0201000a@lloyd> Message-ID: On Tue, 24 Feb 2004 08:20:27 -0600, Lloyd Wilkinson wrote: > How can I be assured that spambayes is not a Trojan, worm, virus or > other nasty security risk to my machine? Well, the best I can respond to that is that the real identities of the Spambayes' authors are known, we are not operating under screen names and forged mail addresses, this mailing list exists to which the authors regularly respond, there is a formal software development process in place, and the source for the program is open for anyone to inspect, including the legal authorities who would be responsible for prosecuting the well known list of contributors were the program to actually BE malware, which, of course, it is NOT. That said, if you don't believe it, don't install it. That's the philosophy you should apply to all software you install on your computer. -- Exprimez vous!; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From papaDoc at videotron.ca Tue Feb 24 09:37:27 2004 From: papaDoc at videotron.ca (papaDoc) Date: Tue Feb 24 09:37:02 2004 Subject: [Spambayes] Security Risk In-Reply-To: <000201c3fae1$5a93d810$0201000a@lloyd> References: <000201c3fae1$5a93d810$0201000a@lloyd> Message-ID: <403B61A7.9020109@videotron.ca> Hi, > How can I be assured that spambayes is not a Trojan, worm, virus or > other nasty security risk to my machine? > You can donwload the python source and look at it then compile it and use it. Remi -- /"\ \ / X ASCII Ribbon Campaign / \ Against HTML Email From Amir_Katz at bmc.com Tue Feb 24 09:54:28 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Tue Feb 24 09:56:23 2004 Subject: [Spambayes] Security Risk Message-ID: Well, if you're worried about malware (and you should) then your machine should be protected by an anti-virus software. This software will kill SpamBayes if it is indeed a malware. So delegate the worrying. -----Original Message----- From: Lloyd Wilkinson [mailto:profitfaucet@charter.net] Sent: Tuesday, February 24, 2004 16:20 To: spambayes@python.org Subject: [Spambayes] Security Risk How can I be assured that spambayes is not a Trojan, worm, virus or other nasty security risk to my machine? Lloyd T. Wilkinson Y2Marketing Principle Consultant Inc. 500 Magazine Ranked Us 42nd Fastest Growing Business in 2002. 28th Fastest in 2003 "We Practice What We Teach" -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040224/8ff25be5/attachment.html From skip at pobox.com Tue Feb 24 12:16:56 2004 From: skip at pobox.com (Skip Montanaro) Date: Tue Feb 24 12:17:10 2004 Subject: [Spambayes] Frequency distribution for wordinfo counts? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AD4@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1305361F94@its-xchg4.massey.ac.nz> <1ED4ECF91CDED24C8D012BCF2B034F13026F2AD4@its-xchg4.massey.ac.nz> Message-ID: <16443.34568.523067.182089@montanaro.dyndns.org> >>>>> "Tony" == Tony Meyer writes: Tony> [Training to exhaustion] >> Seems to work pretty well. Here's a run I did just now: >> >> % python ~/tmp/spambayes/contrib/tte.py -g >> newham.clean.save -s newspam.clean.save -d tte.db >> round: 1, msgs: 770, ham misses: 196, spam misses: 244, 67.7s >> round: 2, msgs: 770, ham misses: 33, spam misses: 55, 49.4s >> round: 3, msgs: 770, ham misses: 8, spam misses: 5, 33.1s >> round: 4, msgs: 770, ham misses: 0, spam misses: 0, 28.6s >> 1 untrained spams Tony> How did these 770 messages get selected? Is this a batch of Tony> recently arrived mail, or some sort of pre-selected training Tony> collection? Did tte.db exist before this? I have two piles of mail selected by me, one ham, one spam as indicated by the command line above. The tte.py script just iterates over them, training a message from one, then a message from the other. tte.db is written from scratch on each run, but not twiddled between rounds of a single run. >> Adding up the last column indicates a total run time of about three >> minutes. I can live with that. Tony> How often do you tend to run this? Right now a few times a day. I've been out for a week, so I have lots of unsures to train on. I select a few hams and spams, run tte.py then put the database in place. Every once in awhile I reprocess the entire unsure pile (825 messages at the moment, but it was over 2500 when I got back from vacation). I didn't have things adjusted very well before I left. >> The database thus winds up smaller than it would be with a more usual >> training approach. Tony> Although slightly larger than mistake-based-training (541 instead Tony> of 440), but presumably more accurate as well. Who knows? ;-) Skip From webmaster at lta.org Tue Feb 24 13:24:55 2004 From: webmaster at lta.org (LTA Webmaster) Date: Tue Feb 24 13:25:02 2004 Subject: [Spambayes] Can't get to additional Outlook inboxes to filter them Message-ID: <423763C199328549AACC2C934D177EF844EF5B@emerson.lta.org> Skipped content of type multipart/alternative-------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 9298 bytes Desc: spambayes4.log Url : http://mail.python.org/pipermail/spambayes/attachments/20040224/fbe59f0e/spambayes4.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 5870 bytes Desc: spambayes1.log Url : http://mail.python.org/pipermail/spambayes/attachments/20040224/fbe59f0e/spambayes1.obj From thetasig at comcast.net Tue Feb 24 18:10:50 2004 From: thetasig at comcast.net (thetasig) Date: Tue Feb 24 18:10:55 2004 Subject: [Spambayes] How to complete uninstall In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AD4@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AD4@its-xchg4.massey.ac.nz> Message-ID: <403BD9FA.4060902@comcast.net> I'm trying to get past an unusual server error. I want to completely remove all traces of SpamBayes (including the pop3proxy service) and Python including all user files (that is absolutely everything related to SpamBayes) and then reinstall from scratch. Thanks for passing along the detailed steps to accomplish this. -=mark=- From tameyer at ihug.co.nz Wed Feb 25 01:12:13 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 25 01:12:50 2004 Subject: [Spambayes] Spambayes with Choicemail In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305256B89@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779E4@its-xchg4.massey.ac.nz> > At work I have Choicemail. I would like to > gravitate to Spambayes eventually. After I > installed Spambayes, it does not work. Does Spambayes > have to go through the localhost that Choicemail uses? What is Choicemail? If it is a POP3 proxy of some sort, and you are using sb_server, then yes, you should 'chain' them together. Choose one port for Choicemail (eg 110) and one for sb_server (eg 111). Get sb_server to connect to localhost:110, rather than to your mail server, and Choicemail to connect to the mail server. Get your mail client to connect to localhost:111. If it's something else, then none of that is any good , and we need more information to be able to help. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 25 01:19:03 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 25 01:19:33 2004 Subject: [Spambayes] SpamBayes a wonderful product In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305256B6B@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2AD8@its-xchg4.massey.ac.nz> > At what point does the program recognize an e-mail > as one which I consider Spam and process it as such? Well, what happens is this (assuming this is the Outlook plug-in): 1. Mail arrives, and Outlook deals with it. 2. At some point, Outlook decides to tell SpamBayes about it. 3. SpamBayes classifies the message, stores the spam score, and does any necessary filtering. However, it sounds like you mean "when does SpamBayes train mail". 1. If you manually train a folder via the "Training" tab of the Manager dialog, or in the Comfit Wizard. 2. If you have incremental training turned on ("Training" tab, again), then whenever mail is moved into the spam folder, or a folder that is being watched for new mail. 3. If you click the "Delete as spam" or "Recover from spam" buttons. Nothing happens when/if you move mail from your spam "Junk Mail" folder to the Deleted Items folder. > I think this applies to the suspect mail box as well. > I assume if it is moved to a wanted box it is processed > as non Junk mail. If it is moved to the Junk Mail it > is moved to the Junk Mail box. It depends . If you have incremental training turned on, and you move a message from the unsure/suspect folder to a folder that is being watched (like the Inbox), then it is trained as ham (good). Likewise, if you move it from the unsure/suspect folder to the spam "Junk Mail" folder,then it is trained as spam (junk). If you don't have incremental training turned on, then it's only trained if you use the "Delete as spam"/"Recover from spam" buttons, and *not* if you move it by dragging and dropping, or with Outlook's "move" command. Note that once mail has been trained, it doesn't matter if you have it around any more (unless you want to retrain later). So all the mail in the "Junk mail" folder can be deleted, if you want, and it won't effect your training at all. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 25 01:38:03 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 25 01:38:37 2004 Subject: [Spambayes] Retraining In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305256B52@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779E8@its-xchg4.massey.ac.nz> > Oh, ok. Before I start from scratch, then, maybe I should > ask about spams with hundreds of random or gibberish words. > Do these muck up the databases? They seem to be classified ok, > and the clues aren't any of these gibberish tokens, but I know > I trained on a few of these and it seems like they would skew > the statistics. What's the recommendation on this type of spam? AFAIK, the jury is still out on this one. One school of thought is that is the words really are randomly selected (from a dictionary, for example), then the highest chance is that you'll never have seen the word before and so it'll be ignored (or if you train on it, then that you'll never see it again, and it won't matter). Then there's a chance that the word is spam (or if you train on it, that the next time it appears will be in spam). Finally there's the chance that the word is ham (appears next in ham). So it's no big deal, and may even help classification. Whether this is true or not is still open to question, I think. I can't be bothered hand-selecting the email that I use to train, so, personally, I just use whatever comes up and have no idea whether there is this 'word salad' there or not. Basically, at the moment, it's up to you. Sorry this isn't more help! =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 25 01:40:53 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 25 01:41:35 2004 Subject: [Spambayes] Possible bug? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305255E28@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779E9@its-xchg4.massey.ac.nz> > I started running SpamBayes yesterday. Everything went fine > with the installation. I am having a problem when I use the > Recover From Spam function in my Unsure folder - SpamBayes > puts the message back into the wrong Inbox. Details follow: > > I have two accounts, a POP3 and an IMAP, both configured in > Outlook 2000 SP-3. If SpamBayes fails to figure out where the appropriate place to put the mail back is, then it defaults to the 'main' Inbox, which is presumably your POP3 one. I imagine this is what is happening here. What would help us to figure out *why* SpamBayes is having this trouble is your log file(s). The Help->Troubleshooting Guide explains where to find these. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 25 01:47:39 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 25 01:48:08 2004 Subject: [Spambayes] Maintain training with Outlook? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305255F07@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2ADA@its-xchg4.massey.ac.nz> > The simple version first: What's the best way to maintain > training with the Outlook plugin? The simple answer first . There is no consensus on a 'best way'. > The FAQ advises training > on "a few ham and a few spam" on a regular basis, but it's > not obvious how to train on ham that hasn't been > misclassified. Simply training on all misclassified (including unsure, which isn't really a misclassification) mail should give pretty good results. Almost certainly better than training on everything, and probably close to as good as any other training regime. > The "Recover from Spam" button is only > present in the Junk Mail and Junk Mail Suspects folders, and > the wizard completely rebuilds the database; is there > something less drastic that I'm missing? Two options, if you do want to do more: 1. The "Training" tab of the Manager dialog has a button to train all messages in a particular folder (or set of folders) as ham/spam ("Train Now"). You can elect to rebuild the database from scratch, or just add to it. 2. You can enable "incremental training" (also on the "Training" tab). This means that mail will be trained as ham when you move mail into a folder you are watching (like the Inbox), and as spam when it's moved into the spam folder. If mail is already in the (eg) Inbox, then you have to move it out and then back in again. > My spam/ham ratio before filtering is probably close to 10:1. > How do I choose which and how many messages to train on? The wiki (http://entrian.com/sbwiki) has a lot of details, as there really isn't a consensus on the best method. Simply training on mistakes (false positives, false negatives, and unsures) should give a good result, and will probably keep the ratio reasonably balanced (if not, then just select some of the mistakes - probably those with a score closest to 0.5). > Now, for extra credit: I don't actually use Outlook much -- > I use Pocket Outlook on my PDA, and do as much as I can on > the train. The plugin buttons obviously aren't available, > and moving messages into the Junk folders doesn't work > either. Apparently ActiveSync moves the messages behind the > scenes and SpamBayes doesn't see them move. I made a couple > of special folders, TrainJunk and TrainGood, to sort messages > into when using Pocket Outlook; when I'm back on the desktop > I "Delete as Spam" the entire TrainJunk folder. I haven't > figured out what to do with the TrainGood folder yet. Does > anyone see a less cumbersome way of handling this? Not apart from getting Mark to spend time figuring out a way to notice those moves, or getting Microsoft (or the PDA maker) to make the notices appear correctly. (For the former, you could submit a feature request at ). Sorry! =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 25 01:52:31 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 25 01:53:00 2004 Subject: [Spambayes] Large Files In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305361EC6@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2ADB@its-xchg4.massey.ac.nz> > My SpamBayes is reading large files with attachments > as Junk (+1MB). IIRC, SpamBayes doesn't generate any clues at all based on the size of attachments. There will be one token (per attachment) generated that says there is an attachment there, but that's it. > How do I reset SpamBayes so that I can receive these > large files, with their large attachments? 1. You should still be *receiving* these messages, they just may be moved into your unsure/spam folder. 2. Have you tried training on some of them? 3. The clues ("Show Clues") for the message would really help in explaining why the message is scoring what it is. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 25 01:57:07 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 25 01:57:35 2004 Subject: [Spambayes] Problem with POP3 Proxy In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305361EFC@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779EC@its-xchg4.massey.ac.nz> [Tony Meyer] > How do you know that there are ten or twenty? Are there that many > files in the unknown cache directory? Or is that just how much mail > you have received? [Joseph Neustein] > Yes, that's the amount of mail I'm getting in my inbox. I > don't know what the unknown cache directory is. Ok, it sounds like what is happening is that mail isn't going through SpamBayes at all. What's meant to happen is that Outlook Express connects to your mail server *through* SpamBayes, and SpamBayes adds information to the messages on the way so that you can filter them. You need to set Outlook Express up to connect to "localhost", rather than directly to your mail server. (The FAQ has a diagram that explains the process better than I can here, and the README.txt / readme.html file has instructions about setting up your mailer). Note that with Outlook Express you'll also have to go to the Configuration page of the web interface and set the option to add the classification to the "to" or "subject" headers. This is because Outlook Express has a very limited capability for filtering, and so the information has to go here, rather than in a separate (invisible to you) header. FWIW, there is a program to automatically configure both your mail client and SpamBayes in the works, but it's not quite fully done yet. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 25 01:59:18 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 25 01:59:52 2004 Subject: [Spambayes] (no subject) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305256055@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779ED@its-xchg4.massey.ac.nz> > I couldn't find anywhere else on your webpage to > tell you how much I appreciate your software. Here's as good as anywhere, thanks (from all of us)! > I'd be glad to add a link from > www.softwareworks-inc.com as well if you can approve it. No approval necessary, so go for it, if you want to. Thanks! =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From shamus64 at lycos.com Wed Feb 25 16:40:32 2004 From: shamus64 at lycos.com (shamus64@lycos.com) Date: Wed Feb 25 16:43:45 2004 Subject: [Spambayes] fake Message-ID: yes, really? -------------- next part -------------- A non-text attachment was scrubbed... Name: release.zip Type: application/x-zip-compressed Size: 0 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040226/1fcc80f7/release.bin From greg at parkoneasc.com Wed Feb 25 20:09:13 2004 From: greg at parkoneasc.com (Greg Lee) Date: Wed Feb 25 20:08:52 2004 Subject: [Spambayes] Email header Message-ID: MRI Template 1 Park One Associates I have activated SpamBayes. Great program! Problem is that it will not allow access to information required to format our email header (this one probably has empty boxes). We have to go out over the web to grab format pieces to construct our email header. I cannot figure out how to tell SpamBayes to allow it though. Help! Thanks Greg Lee PARK ONE ASSOCIATES A Management Recruiters Network Affiliate www.parkoneasc.com greg@parkoneasc.com Phone: (414) 359-9494 Fax: (414) 359-9495 Growing Companies.....Building Careers.....Fulfilling Dreams -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040225/44fa130f/attachment.html From kurt at campbellsci.com Wed Feb 25 20:40:57 2004 From: kurt at campbellsci.com (kurt@campbellsci.com) Date: Wed Feb 25 20:41:10 2004 Subject: [Spambayes] Upgrade from Source install to Binary install for Win32 Message-ID: <403CEC39.13031.1503C81@localhost> Upgrade from Source install to Binary install for Win32. My old source: 1.0a7 My new binary: 1.0a9 My Python Version:2.3 My Win32all: 163 My Win32 O/S: Windows 2000 Pro Information written up and provided by: Kurt Payne Advanced instructions: 1. Uninstall old SpamBayes Service. To do this run: pop3proxy_service.py remove 2. Reboot 3. Uninstall Win32all if so desired 4. Uninstall Python if so desired 5. Reboot (recommended only if Python and Win32all was uninstalled) 6. Install new binary 1.0a9 7. The cache directory has changed: From: C:\Documents and Settings\Default User\Application Data\SpamBayes\Proxy To: C:\Documents and Settings\{userdir}\Application Data\SpamBayes\Proxy You are done. Detailed instructions: 1. Uninstall old SpamBayes Service. To do this Start: Pythonwin click on the run button in the toolbar This will bring up a popup window Run Script. Type in for the 'Script File': C:\Python23\Scripts\pop3proxy_service.py Type in for the 'Arguments': remove Then click OK At the bottom of the window it will display Script 'C:\Python23\Scripts\pop3proxy_service.py' returned exit code 0 This means it is done. 2. Restart your computer (as the service has been marked for removal but hasn't been done yet) 3. If you are not using Python for any other programs you can uninstall it although it will not harm your computer to leave it installed. The new Binary does not require it to run. To uninstall both Win32all and Python programs: Go to your system Control Panel click on Add/Remove Programs both those programs will appear in this list click on 'Python 2.3 combined Win32 extensions' then click Change/Remove. As for me this brought up an error 'Cannot find install.log' I reinstalled the Win32all program then uninstalled it. Then click on 'Python 2.3' then click Change/Remove. Again this brought up an error 'Cannot find install.log' I reinstalled the Python 2.3 program then uninstalled it. 4. If you have uninstall both Win32all and Python programs. Restart your computer now otherwise skip this step. 5. At this point you can install the Binary and it should work. I found that the cache directory was in a different area between the old and new versions. My old directory was: C:\Documents and Settings\Default User\Application Data\SpamBayes\Proxy And the new directory is: C:\Documents and Settings\{userdir}\Application Data\SpamBayes\Proxy I just copied in Windows Explorer the directory SpamBayes from the old area to the new area. That brought over all my old settings and configuration it had all the old trained information saved. You are done. From mhammond at keypoint.com.au Wed Feb 25 21:21:02 2004 From: mhammond at keypoint.com.au (Mark Hammond) Date: Wed Feb 25 21:21:22 2004 Subject: [Spambayes] Upgrade from Source install to Binary install for Win32 In-Reply-To: <403CEC39.13031.1503C81@localhost> Message-ID: <015601c3fc0f$2ec83e90$0200a8c0@eden> > 1. Uninstall old SpamBayes Service. > To do this run: pop3proxy_service.py remove > 2. Reboot If you ensure the service is stopped first ("pop3proxy_service.py stop"), there should be no need to reboot. > 3. Uninstall Win32all if so desired > 4. Uninstall Python if so desired > 5. Reboot (recommended only if Python and Win32all was uninstalled) Again, this should only be necessary if you tried to uninstall them while they were in use. > 6. Install new binary 1.0a9 > 7. The cache directory has changed: > From: C:\Documents and Settings\Default User\Application > Data\SpamBayes\Proxy > To: C:\Documents and Settings\{userdir}\Application > Data\SpamBayes\Proxy That suprises me, unless the service configuration has changed from the 'local system' user to a specific user. Thanks for the instructions though! Mark. From postmaster at naic.org Wed Feb 25 20:25:00 2004 From: postmaster at naic.org (postmaster@naic.org) Date: Wed Feb 25 21:27:39 2004 Subject: [Spambayes] Virus Detected by Network Associates, Inc. Webshield SMTP V4.5 MR1a P0803.349 Message-ID: The NAIC e-mail gateway detected a virus (W32/Netsky (ED)) in the attachment (stuff.htm.com) of your message. Your message has been removed from the system. If you feel you have received this message in error, please contact the NAIC Help Desk at help@naic.org or 816-783-8500. From tameyer at ihug.co.nz Wed Feb 25 23:14:23 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 25 23:15:09 2004 Subject: [Spambayes] Email header In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13053625F4@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779FD@its-xchg4.massey.ac.nz> > Problem is that it will not allow access to information > required to format our email header (this one probably > has empty boxes). We have to go out over the web to grab > format pieces to construct our email header. I cannot > figure out how to tell SpamBayes to allow it though. Sorry, I'm really not sure what you mean. Are you trying to add a header to all incoming mail? Outgoing mail? SpamBayes shouldn't interfere with this, AFAICT. Could you let us know: 1. What version of SpamBayes you are using. 2. Which SpamBayes application you are using (Outlook plug-in, sb_server, sb_filter, sb_imapfilter, ...) 3. Exactly what you're trying to do (send mail? Receive it?) 4. What's going wrong (is some sort of error occuring? Does nothing happen when something should?) Sorry I can't be of more immediate use! =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Wed Feb 25 23:23:15 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Feb 25 23:23:49 2004 Subject: [Spambayes] Can't get to additional Outlook inboxes to filter them In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305362187@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13046779FE@its-xchg4.massey.ac.nz> > A week ago I installed Spambayes version .9, at that point > I had Office 2000 on my computer, and I was very pleased > when Spambayes recognized all the additional inboxes I > had setup in my setup of Outlook (rorris, Webmaster, support). > I told it not to filter support, but I loved that it > filtered rorris and Webmaster. Then two days ago I upgraded > to Office 2003, and I noticed that SpamBayes was no longer > filtering my Webmaster inbox, but it is still filtering my > rorris mailbox. SpamBayes currently remembers which folders it is filtering by remembering the internal 'store id' (pst file, Exchange, etc) and 'entry id' (the folder itself) for each folder. I suspect that what has happened here is that as a result of the upgrade process the id for the webmaster mailbox has changed (is it maybe on a different store to the rorris one?). > I looked in the configuration and it still > lists inbox; inbox as being filtered, but when I browse and > try to reset up which folders I want to filter, I run into a > problem. I can't expand the Webmaster mailbox to get to > the inbox, nothing happens when I click on the plus sign. Something's going wrong when SpamBayes tries to access the children of one of the stores, although it's not clear what that is. I'm not familiar enough with the MAPI errors to figure it out, so if this isn't resolved (see below), then I'd suggest submitting a bug report on sourceforge , which means that Mark will notice it sooner or later. > NOTE: I did not uninstall the database files and the > config files, maybe I should try that. Removing/renaming the database files should have no effect on this at all (other than removing your training, obviously). However, removing/renaming the config files should fix this problem; at least it should fix the problem of trying to access the webmaster inbox. Whether it will fix the problem of accessing it in the Manager configuration is another question, but I suspect it will. This is the best bet for a solution; if it doesn't work then the bug report (as above) is probably the next best step. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From kennypitt at hotmail.com Thu Feb 26 09:34:40 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Thu Feb 26 09:35:40 2004 Subject: [Spambayes] Email header In-Reply-To: Message-ID: > Great program! Problem is that it will not allow access to > information required to format our email header (this one probably > has empty boxes). There were no empty boxes here, unless you count that Outlook 2003 didn't download the pictures until I told it to. Previous versions of Outlook would have downloaded and displayed the pictures immediately. I can't tell exactly where you're having a problem, but SpamBayes shouldn't have anything to do with including pictures in your mail. SpamBayes doesn't do anything at all with mail you compose, and it has no effect on how Outlook displays mail you receive. It only looks at the content to classify it as spam or not. If you are using the Outlook addin then it will move messages that it thinks are (or might be) spam to a different folder, but it doesn't change the content of the message. -- Kenny Pitt From fred at lucy.com Thu Feb 26 09:44:29 2004 From: fred at lucy.com (Fred Mertz) Date: Thu Feb 26 09:44:42 2004 Subject: [Spambayes] Train on Hashbusters? Message-ID: <200402260944290296.002AF3AA@mail.mindspring.com> I get lots of spam with sections of random words like this: optimum correct rome devout blanch apostle burnside breathy compliment cezanne curie daisy indignity elkhart snoop blow tenement baud passive linebacker peril anthem allusive primitivism adhesive witness hoydenish equestrian angus Or gibberish like this: ituk geuxj vyy yafw hba ww tqqp clm peqb lvvjf sje rjkorm oojigby immqx igd ahovey cx ihpsrf ket aodg pcpd pg bxmey utuas xxcf oux tfcnf rrmhu c lki Should I train on these messages? From tim at fourstonesExpressions.com Thu Feb 26 09:47:38 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Thu Feb 26 09:47:44 2004 Subject: [Spambayes] Train on Hashbusters? In-Reply-To: <200402260944290296.002AF3AA@mail.mindspring.com> References: <200402260944290296.002AF3AA@mail.mindspring.com> Message-ID: On Thu, 26 Feb 2004 09:44:29 -0500, Fred Mertz wrote: > > I get lots of spam with sections of random words like this: > Should I train on these messages? I think at the moment our recommendation would be to not train on those messages if they're correctly classified already. We are actively researching this technique (called "word salad"), but as of yet we've not seen that it is effective against our filter. -- Exprimez vous!; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From rmalayter at bai.org Thu Feb 26 12:49:31 2004 From: rmalayter at bai.org (Ryan Malayter) Date: Thu Feb 26 12:49:32 2004 Subject: [Spambayes] Email header Message-ID: <792DE28E91F6EA42B4663AE761C41C2A01E1A07B@cliff.bai.org> [Greg Lee] > I have activated SpamBayes. Great program! Problem is that > it will not allow access to information required to format > our email header (this one probably has empty boxes). We > have to go out over the web to grab format pieces to construct > our email header. I cannot figure out how to tell SpamBayes > to allow it though. Help! Do you really use all those images in EVERY message you send out from your company? I think you should seriously reconsider that, because: 1) Many newer mail programs (Outlook 2003, Mozilla mail, the newest Outlook Express) won't display it at all, so it looks ugly unless the user decides to manually click on their "download images" button. I bet they won't in most cases. 2) Many organizations use spam filters, and these filters look for things like image links. Image links are *very* spammy to most spam filters, because "regular people" don't send them with regular email. Your message had quite a few "Url:jpg" and "url:gif" tokens, which score >90% spam with my trained SpamBayes. So a lot of your messages are never going to be read, even if they're not promotional. 3) Many people still use modems, and dowloading 30 seconds worth of graphics will really piss them off. Plus, your website is so slow that the images just creep onto the page, even though my company has a fractional T3 from a tier-1 ISP! 4) IMHO, it actually looks unprofessional to have gobs of graphics in an email. Email is a written medium, and not every message should be a glossy brochure. It detracts from the message the email is trying to convey. Regards, Ryan Malayter Sr. Network & Database Administrator Bank Administration Institute Chicago, Illinois, USA PGP Key: http://www.malayter.com/pgp-public.txt From hrg at email.com Thu Feb 26 13:09:01 2004 From: hrg at email.com (Howard) Date: Thu Feb 26 13:07:04 2004 Subject: [Spambayes] How to cure extreme disproportion of spam to ham?? Message-ID: <000001c3fc93$9c9023f0$0100a8c0@peabody> Spambayes manager warns me that I have spam:ham disproportion, 454:3365 to be exact. I cannot figure out what to do about this. I get a ton of spam and the program puts most of it in the "Suspects" folder. Usually all the mail in the Suspects folder is spam, so I highlight it all and click the "Delete as Spam" button. The Suspects folder usually has 20-30 spams in it after checking my mail once a day, and this is how I got the imbalance in the first place. There is usually no ham in the Suspects folder, so the imbalance worsens every day. I can't figure out a way to get the program to train *only* on the ham in my In-Box. How do I fix this imbalance? -Howard -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040226/eaa6eca4/attachment.html From hrg at email.com Thu Feb 26 13:55:01 2004 From: hrg at email.com (Howard) Date: Thu Feb 26 13:53:35 2004 Subject: [Spambayes] How to cure extreme disproportion of spam to ham?? Message-ID: <000201c3fc9a$0a088700$0100a8c0@peabody> Spambayes manager warns me that I have spam:ham disproportion, 454:3365 to be exact. I cannot figure out what to do about this. I get a ton of spam and the program puts most of it in the "Suspects" folder. Usually all the mail in the Suspects folder is spam, so I highlight it all and click the "Delete as Spam" button. The Suspects folder usually has 20-30 spams in it after checking my mail once a day, and this is how I got the imbalance in the first place. There is usually no ham in the Suspects folder, so the imbalance worsens every day. I can't figure out a way to get the program to train *only* on the ham in my In-Box. How do I fix this imbalance? -Howard -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040226/3fc63bbe/attachment.html From jmcatee at mediaodyssey.com Thu Feb 26 15:01:03 2004 From: jmcatee at mediaodyssey.com (Jim McAtee) Date: Thu Feb 26 15:01:05 2004 Subject: [Spambayes] Retraining References: <1ED4ECF91CDED24C8D012BCF2B034F13046779E8@its-xchg4.massey.ac.nz> Message-ID: <024401c3fca3$43616130$1c03a8c0@jim> ----- Original Message ----- From: "Tony Meyer" To: "'Fred Mertz'" ; Sent: Tuesday, February 24, 2004 11:38 PM Subject: RE: [Spambayes] Retraining >> Oh, ok. Before I start from scratch, then, maybe I should >> ask about spams with hundreds of random or gibberish words. >> Do these muck up the databases? They seem to be classified ok, >> and the clues aren't any of these gibberish tokens, but I know >> I trained on a few of these and it seems like they would skew >> the statistics. What's the recommendation on this type of spam? > >AFAIK, the jury is still out on this one. One school of thought is that is >the words really are randomly selected (from a dictionary, for example), >then the highest chance is that you'll never have seen the word before and >so it'll be ignored (or if you train on it, then that you'll never see it >again, and it won't matter). Then there's a chance that the word is spam >(or if you train on it, that the next time it appears will be in spam). >Finally there's the chance that the word is ham (appears next in ham). So >it's no big deal, and may even help classification. Whether this is true or >not is still open to question, I think. I'm seeing a fair number of relatively targeted "random" words that are helping to get quite a few messages just under the spam threshold. If a spammer is harvesting email addresses from a mailing list, especially a technical one, this technique is particulary easy - and dare I say, particularly effective. They can even throw words back to you from one of your own postings. >Finally there's the chance that the word is ham (appears next in ham). So >it's no big deal, and may even help classification. I'm not sure I understand how classifying ham words as spam can have any possible benefit... From support at f-secure.com Thu Feb 26 16:46:47 2004 From: support at f-secure.com (support@f-secure.com) Date: Thu Feb 26 16:47:03 2004 Subject: [Spambayes] warning Message-ID: here, the serials -------------- next part -------------- A non-text attachment was scrubbed... Name: shower.zip Type: application/x-zip-compressed Size: 0 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040226/ba8dd1b7/shower.bin From tameyer at ihug.co.nz Thu Feb 26 18:52:50 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 26 18:54:32 2004 Subject: [Spambayes] How to cure extreme disproportion of spam to ham?? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305362855@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2ADE@its-xchg4.massey.ac.nz> > Spambayes manager warns me that I have spam:ham > disproportion, 454:3365 to be exact. I cannot figure out what > to do about this. Firstly, if you're still getting results that you're happy with, you don't really have to do anything. However, if your results could be better, then read on :) > I get a ton of spam and the program puts most of > it in the "Suspects" folder. This does sound like a problem. Numbers will vary from email mix to email mix, but having around 2-5% of your mail end up in the "Suspects" folder would be reasonable. "Most" doesn't sound reasonable at all. This could well be a result of the imbalance, though. > How do I fix this imbalance? Suggestions, although there is a lot of stuff about training on the wiki , and it's not at all an exact science: 1. Try running with a smaller database. Dump the one you have (perhaps just rename it so that you can go back to it if you want to), train on 5 random hams and spams to get you going, and just classify whatever ends up in the "Suspects" folder, and any ham in the spam folder and spam in the ham folder. If it starts getting imbalanced, then try just training on *some* of the messages in the "Suspects" folder, and moving/deleting the rest (or rescoring the "Suspects" folder, although this isn't as convenient as it could be, and seeing if the remaining messages are now correctly classified). 2. If you almost never see a ham in the "Suspects" folder, and never see one over (eg) 80%, then lower the spam threshold. I think a lot of people run with it at 80% anyway, although the default is still IIRC 90%. Hope this helps. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Thu Feb 26 18:58:55 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Feb 26 18:59:34 2004 Subject: [Spambayes] Retraining In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13053628B7@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F2ADF@its-xchg4.massey.ac.nz> > I'm seeing a fair number of relatively targeted "random" > words that are helping to get quite a few messages just under > the spam threshold. If a spammer is harvesting email > addresses from a mailing list, especially a technical one, > this technique is particulary easy - and dare I say, > particularly effective. They can even throw words back to > you from one of your own postings. Certainly the more work you put into it (or into writing your spamware), the more effective it will be. If you sent me a spam message and put a copy of this message at the end of it (or put the results of a "I'm feeling lucky" google for "Tony Meyer"), then I would imagine it would probably end up as an unsure (for me), if not ham. OTOH, the more work that's involved, the less economical the spam is - it's certainly much quicker and easier to grab some dictionary words than look for something specific to the target. > >Finally there's the chance that the word is ham (appears > >next in ham). > >So it's no big deal, and may even help classification. > > I'm not sure I understand how classifying ham words as spam > can have any possible benefit... That's not what I meant (I may have written it wrongly - I'd have to go back and read it; I probably wrote it poorly). What I meant is that there are three possibilities - (1) the word has never been seen, or never will be seen again, (2) the word is one that has only been seen in that type of message, or only will be seen in that type of message, and (3) the word is one that is seen in the other type of message. For example: (1) "sdhjkfdsdsf8435hjks" will probably never been seen again. (2) "fat" as a random word in a spam message, could well be a word that only appears in spam (for me). (3) "analysis" as a random word in a spam message, could easily appear in ham or spam (for me). =Tony Meyer From hera at optonline.net Thu Feb 26 22:07:25 2004 From: hera at optonline.net (Erin Lazzaro) Date: Thu Feb 26 22:07:38 2004 Subject: [Spambayes] PocketPC Message-ID: <000901c3fcde$d6971270$6a01a8c0@Peacemaker> Thank you all for your responses. I have read all the training pages in the Wiki, and I have devised a _Plan for Working with Pocket Inbox_. I've only run one daily cycle so far - I'll report after a week on how good my filters are. In the meantime, would anyone like to comment? The goal is to get an effective filter relatively quickly, while spending as little time as possible on the desktop machine (since at-home time is precious, and my husband thinks I should play with him and not with my spam filters). Training on mistakes and unsures seems the most intuitive, but since I have never yet had any ham classified as spam, I would expect the ratio to get very unbalanced. Why do people think otherwise? Do you start getting misclassified ham if the ratio gets too far out? Initial setup: - Configure Outlook not to download mail automatically. - Configure ActiveSync to synchronize all three SpamBayes folders (Inbox, Junk, Junk Suspects) - Make two more mail folders, TraningHam and TrainingSpam (they don't need to be synchronized), and add the Spam field to their views. Seed them with 5 ham and 5 spam (I just used the most recently arrived) and use SpamBayes Manger to train on them. - Add the Spam field to the Inbox view. Each morning, on the train or elsewhere: - Review the three SpamBayes folders (Inbox, Junk, Junk Suspects) and move all mistakes and unsures to either TrainingHam or TrainingSpam. (If I have time, I might also read and respond to the ham ) Each night, on the desktop: - Sort TrainingHam and TrainingSpam on the Spam field and delete all but the 5 most out-of-whack (not considering duplicate spam and fake email bounces). - If there are fewer items in TrainingHam than in TrainingSpam, go into Inbox and copy the spammiest ham from there. (If nothing in Inbox scores more than 0%, then let TrainingHam be short). - Train on TrainingHam and TrainingSpam, with the "rebuild" checkbox clear (this is the option I was missing before). Clean the folders out afterwards. - Download the mail with the new filter. Thanks, Erin Lazzaro From mhammond at keypoint.com.au Fri Feb 27 00:26:02 2004 From: mhammond at keypoint.com.au (Mark Hammond) Date: Fri Feb 27 00:26:20 2004 Subject: [Spambayes] PocketPC In-Reply-To: <000901c3fcde$d6971270$6a01a8c0@Peacemaker> Message-ID: <002601c3fcf2$30f9ac60$0200a8c0@eden> > Thank you all for your responses. I have read all the > training pages in the Wiki, and I have devised a _Plan for > Working with Pocket Inbox_. I've only run one daily cycle so > far - I'll report after a week on how good my filters are. > In the meantime, would anyone like to comment? That looks interesting to me. If people find it useful, we could add a facility to SpamBayes - optionally define 'automatic ham training' and 'automatic spam training' folders. When mail arrives in these folders, simply train and move. I even have a Pocket PC I can test with :) A few people saying they would find it useful would help though :) Mark. From rinskip at ualberta.ca Fri Feb 27 02:09:24 2004 From: rinskip at ualberta.ca (Robin Inskip) Date: Fri Feb 27 01:59:13 2004 Subject: [Spambayes] Inadvertent Deleting As Spam Two Messages Message-ID: It had to happen but today I made two mistakes by deleting good messages into "Delete as Spam" One reason is that I seem to have lost the "remove from possible junk mail and put in the inbox" button. Can I fish up these two messages from Spam Deletes? Both Senders of Messages are Welcome emails for me and I don't want them blocked out. Also any suggestions about finding the "remove from junk mail and put in inbox" button. I did look at the troubleshooting guide but didn't find a pertinent section. Thanks for any help. Robin From russ_foster at comcast.net Fri Feb 27 10:23:55 2004 From: russ_foster at comcast.net (Russ Foster) Date: Fri Feb 27 10:15:55 2004 Subject: [Spambayes] Inadvertent Deleting As Spam Two Messages In-Reply-To: Message-ID: I can't help with the missing buttons, but regarding the mis-classification: I believe the current wisdom is to move the good message (which was marked as bad) into the unsure folder, then "Recover from...", move the message back into the unsure folder, and "Recover from...", do this 2-3 times. The other option is to just rebuild your database from existing emails (making sure they are all in the proper folders). -r On Fri, 27 Feb 2004, Robin Inskip wrote: > It had to happen but today I made two mistakes by deleting good messages > into "Delete as Spam" One reason is that I seem to have lost the "remove > from possible junk mail and put in the inbox" button. > > Can I fish up these two messages from Spam Deletes? Both Senders of > Messages are Welcome emails for me and I don't want them blocked out. Also > any suggestions about finding the "remove from junk mail and put in inbox" > button. I did look at the troubleshooting guide but didn't find a pertinent > section. > > Thanks for any help. > > Robin > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From russ_foster at comcast.net Fri Feb 27 10:57:36 2004 From: russ_foster at comcast.net (Russ Foster) Date: Fri Feb 27 10:49:07 2004 Subject: [Spambayes] Train on Hashbusters? In-Reply-To: Message-ID: I'm starting to get better than 50% correct classification on world salad spam after actively training on these messages. (the other balance of the 50% show up as strong "unsures). My thinking in training on all of these messages is that, even though they attempt to use only "common" words, they still use words that are not in my everyday email. Also, there appears to be a lot of "clues" in the header that push the classification towards the spam end. Here's a sample of one I got recently that scored "100%" spam: Spam Score: 100% (0.999371) word spamprob #ham #spam 'header:Reply-To:1' 0.789475 9 60 'absolute' 0.85899 0 2 'cognate' 0.85899 0 2 'curricula' 0.85899 0 2 'implacable' 0.85899 0 2 'x-mailer:mpop web-mail 2.19' 0.896239 0 3 Since there doesn't seem to be any scientific evidence one way or the other, I tag everything. I suppose by database may grow unweildy after a while...but, maybe not....? -r On Thu, 26 Feb 2004, Tim Stone wrote: > On Thu, 26 Feb 2004 09:44:29 -0500, Fred Mertz wrote: > > > > > I get lots of spam with sections of random words like this: > > > Should I train on these messages? > > I think at the moment our recommendation would be to not train on those > messages if they're correctly classified already. We are actively > researching this technique (called "word salad"), but as of yet we've not > seen that it is effective against our filter. > > From HPircher at da.state.nm.us Fri Feb 27 12:18:52 2004 From: HPircher at da.state.nm.us (Pircher, Heidi) Date: Fri Feb 27 12:01:27 2004 Subject: [Spambayes] (no subject) Message-ID: <6AB4A4AC7B2AD411A9680050DAB2EAA505F10871@da.state.nm.us> From kennypitt at hotmail.com Fri Feb 27 13:34:33 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 27 13:35:43 2004 Subject: [Spambayes] Inadvertent Deleting As Spam Two Messages In-Reply-To: Message-ID: Robin Inskip wrote: > It had to happen but today I made two mistakes by deleting good > messages into "Delete as Spam" One reason is that I seem to have > lost the "remove from possible junk mail and put in the inbox" button. > > Can I fish up these two messages from Spam Deletes? Both Senders of > Messages are Welcome emails for me and I don't want them blocked out. > Also any suggestions about finding the "remove from junk mail and put > in inbox" button. I did look at the troubleshooting guide but didn't > find a pertinent section. Assuming that you haven't already deleted the messages from your Spam folder, you should be able to go there and click Recover (assuming you have a Recover button there, see below). If you can Recover from the Spam folder then this will reverse the incorrect training and retrain the messages as good. Were you classifying messages in the Unsure folder when you had the problem? Normal behavior is: in the Spam folder show only the Recover button, in the Unsure folder show both Delete and Recover buttons, in any other folder show only the Delete button. Is that not what you're getting? If Recover does not show up in the Unsure or Spam folders, then Outlook may be hiding part of your toolbar because of lack of space. If this happens, look for a downward-pointing triangle at the far right side of the SpamBayes toolbar. Try clicking the triangle and see if the missing button is in the dropdown there. -- Kenny Pitt From kennypitt at hotmail.com Fri Feb 27 13:47:08 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 27 13:48:10 2004 Subject: [Spambayes] PocketPC In-Reply-To: <000901c3fcde$d6971270$6a01a8c0@Peacemaker> Message-ID: Erin Lazzaro wrote: > Training on mistakes and unsures seems the most intuitive, but since > I have never yet had any ham classified as spam, I would expect the > ratio to get very unbalanced. Why do people think otherwise? Do you > start getting misclassified ham if the ratio gets too far out? Training only on mistakes and unsures can certainly cause heavy imbalance for some people. We've seen logfiles from users who have 100:1 imbalance or worse. It's possible that it will eventually cause you to start getting misclassified ham, and at that point you can start training those messages. If the imbalance doesn't cause you to get any misclassifications then the imbalance isn't an issue. The mathematics say that a perfect balance is best because otherwise additional weight is given to the clues from one side or the other. That's all theoretical, though, and what really matters is how it behaves for you in practice. We've been kicking around some additional theories about how we could automatically help you keep your training in balance, but nobody has come up with a silver bullet yet. -- Kenny Pitt From trefftz at attglobal.net Fri Feb 27 14:33:31 2004 From: trefftz at attglobal.net (Albrecht Trefftz) Date: Fri Feb 27 14:33:38 2004 Subject: [Spambayes] Does SpamBayes work under Opera? Message-ID: Hi, just to inform you, that I believe there could be something wrong with the downloadable files: I am using opera, so I go within your webpage to "Platforms" - "Windows", than, under "Non Outlook solutions" I click on the link "Installation program" and the download site offers for download the file: - spambayes-1.0a9.exe. This installation does not offer the possibility to configure the email- address settings (user accounts, password, etc.) If I go to "Outlook 2000/XP/2003" and click on "download the installation program", it offers me identically the same file for download: - spambayes-1.0a9.exe Can you give me a helping hand? Thanks very much, Albrecht Trefftz From tim at fourstonesExpressions.com Fri Feb 27 16:14:42 2004 From: tim at fourstonesExpressions.com (Tim Stone) Date: Fri Feb 27 16:14:50 2004 Subject: [Spambayes] Does SpamBayes work under Opera? In-Reply-To: References: Message-ID: On Fri, 27 Feb 2004 20:33:31 +0100, Albrecht Trefftz wrote: > > Hi, just to inform you, that I believe there could be something wrong > with the downloadable files: > > I am using opera, so I go within your webpage to "Platforms" - > "Windows", than, under "Non Outlook solutions" I click on the link > "Installation program" and the download site offers for download the > file: > > - spambayes-1.0a9.exe. This file contains all of spambayes for all supported clients. The functionality needed to support opera is in there... > > This installation does not offer the possibility to configure the email- > address settings (user accounts, password, etc.) With Opera, you'll have to install and configure sb_server.py. Look in the instructions for information on using spambayes with a pop3 server. BTW, I use Opera 7 with spambayes and it works perfectly... > > If I go to "Outlook 2000/XP/2003" and click on "download the > installation program", it offers me identically the same file for > download: > > - spambayes-1.0a9.exe Outlook support will be absolutely useless to you. -- Exprimez vous!; Expr?sese; Esprimi te stesso; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com From kennypitt at hotmail.com Fri Feb 27 17:27:00 2004 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Feb 27 17:28:07 2004 Subject: [Spambayes] Does SpamBayes work under Opera? In-Reply-To: Message-ID: Albrecht Trefftz wrote: > I am using opera, so I go within your webpage to "Platforms" - > "Windows", than, under "Non Outlook solutions" I click on the link > "Installation program" and the download site offers for download the > file: > > - spambayes-1.0a9.exe. > > This installation does not offer the possibility to configure the > email- address settings (user accounts, password, etc.) > > If I go to "Outlook 2000/XP/2003" and click on "download the > installation program", it offers me identically the same file for > download: > > - spambayes-1.0a9.exe > > Can you give me a helping hand? The installation program contains support for both types of installation, and should prompt you to choose one or the other. You want to choose the "Server/Proxy application". Configuration is done after installation completes, not by the wizard. It is largely a manual process, but the readme file (readme_proxy.html) should have enough info to get you going. -- Kenny Pitt From fxcastellano at hotmail.com Fri Feb 27 19:33:27 2004 From: fxcastellano at hotmail.com (Salvador Castellano) Date: Fri Feb 27 19:33:34 2004 Subject: [Spambayes] can outlook xp, spambayes, and trend micro's pc-cillin co-exist? Message-ID: Hi, I'm a newbie on this topic, so I hope you'll excuse me if the answer to my question is readily available or somehow otherwise stupid. I run Outlook XP and use Trend Micro's PC-cillin for antivirus. I'm really interested in using Spambayes, based on a recommendation from someone I trust and reading about it; but when I read the documentation about how it works, I saw that it works by proxying the POP3 server, as does PC-cillin's "POP3 Scan" feature. I'm reluctant to install new software just to find out that it collides with stuff that I already have and depend on, so I was hoping that somebody who understands how they both work could answer the question of whether or not they are compatible (or whether I somehow misunderstood it all). I searched the FAQ, the Wiki, and the archives for this list, but didn't find the answer. The answer or a pointer to the answer would be much appreciated as I am pretty darn sick of the spam! Thanks in advance, David Bernstein _________________________________________________________________ Say “good-bye” to spam, viruses and pop-ups with MSN Premium -- free trial offer! http://click.atdmt.com/AVE/go/onm00200359ave/direct/01/ From mhammond at keypoint.com.au Fri Feb 27 19:57:44 2004 From: mhammond at keypoint.com.au (Mark Hammond) Date: Fri Feb 27 19:58:04 2004 Subject: [Spambayes] can outlook xp, spambayes, and trend micro's pc-cillin co-exist? In-Reply-To: Message-ID: <004301c3fd95$e0e033d0$0200a8c0@eden> If you are thinking of using the Outlook addin, there should be no problem, as this tool does *not* proxy the pop3 server - it uses Outlook's builtin techniques for processing the mail. If you use a mail program other than Outlook, then you will need to use the spambayes utility that does proxy - in that case, you may or may not have a problem - but even then, SpamBayes is fairly benign in the way it operates, so trying it, then removing it if there is a problem, should be a safe thing to do. Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Salvador Castellano > Sent: Saturday, 28 February 2004 11:33 AM > To: spambayes@python.org > Subject: [Spambayes] can outlook xp, spambayes,and trend micro's > pc-cillin co-exist? > > > Hi, > > I'm a newbie on this topic, so I hope you'll excuse me if the > answer to my > question is readily available or somehow otherwise stupid. I > run Outlook XP > and use Trend Micro's PC-cillin for antivirus. I'm really > interested in > using Spambayes, based on a recommendation from someone I > trust and reading > about it; but when I read the documentation about how it > works, I saw that > it works by proxying the POP3 server, as does PC-cillin's "POP3 Scan" > feature. I'm reluctant to install new software just to find > out that it > collides with stuff that I already have and depend on, so I > was hoping that > somebody who understands how they both work could answer the > question of > whether or not they are compatible (or whether I somehow > misunderstood it > all). > > I searched the FAQ, the Wiki, and the archives for this list, > but didn't > find the answer. The answer or a pointer to the answer would be much > appreciated as I am pretty darn sick of the spam! > > Thanks in advance, > David Bernstein > > _________________________________________________________________ > Say ?good-bye? to spam, viruses and pop-ups with MSN Premium > -- free trial > offer! http://click.atdmt.com/AVE/go/onm00200359ave/direct/01/ > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From mhammond at keypoint.com.au Fri Feb 27 20:04:17 2004 From: mhammond at keypoint.com.au (Mark Hammond) Date: Fri Feb 27 20:04:36 2004 Subject: [Spambayes] Inadvertent Deleting As Spam Two Messages In-Reply-To: Message-ID: <004401c3fd96$ca9e7590$0200a8c0@eden> > It had to happen but today I made two mistakes by deleting > good messages > into "Delete as Spam" One reason is that I seem to have lost > the "remove > from possible junk mail and put in the inbox" button. > > Can I fish up these two messages from Spam Deletes? Both Senders of > Messages are Welcome emails for me and I don't want them > blocked out. Also > any suggestions about finding the "remove from junk mail and > put in inbox" > button. I did look at the troubleshooting guide but didn't > find a pertinent > section. See the other excellent responses, but something that hasn't been mentioned yet is that SpamBayes will also train when you manually move messages to the folders SpamBayes is watching. This is assuming the option is enabled in the 'Training' page of the SpamBayes manager, which it is by default. This means that simply dragging and dropping the message from the 'Spam' folder back into your inbox will undo any 'Spam' training, and effect a 'Ham' train for that message - ie, exactly what the 'Recover from Spam' button would do. Mark. From StevenAZimmerman at comcast.net Sat Feb 28 03:03:08 2004 From: StevenAZimmerman at comcast.net (Steve Zimmerman) Date: Sat Feb 28 03:03:31 2004 Subject: [Spambayes] Spam Score Field in Outlook Plugin Message-ID: <000001c3fdd1$51e1ca90$0202a8c0@z> Your documentation states that it is necessary to manually add this field in every folder where you want it. However, there is a way to make this part of a standard view (such as "Messages") so that it automatically appears in every folder with that view. First, make sure that the "Advanced" toolbar is displayed; this contains a dropdown box with the current view selected. Next, add the Spam Score Field as per your documentation, and position and size it the way you want. Finally, click in the dropdown box so that the current view is highlighted, and hit Enter. The current column layout is now the standard for the selected view. The first time certain folders are selected they may not show this column, but clicking to a different folder and then clicking back to them fixes this problem. Steve -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040228/a038132c/attachment.html From spambayesspambayes at USA.NET Sat Feb 28 06:05:21 2004 From: spambayesspambayes at USA.NET (Fassil Campbell) Date: Sat Feb 28 06:03:39 2004 Subject: [Spambayes] spambayes: Hurry n0w and buy C_IALIS this weekend on a brighter n0te! Message-ID: An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040228/7c852c57/attachment-0001.html From rcoe at CambridgeMA.GOV Sat Feb 28 10:35:46 2004 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Sat Feb 28 10:35:51 2004 Subject: [Spambayes] Spam Score Field in Outlook Plugin Message-ID: <42393C9DA7930245AB540667607F4F5022C280@SPIKE.city> I think you'll find that if you try to apply that view to a folder that has not been scanned (i.e., not all messages have the "Spam" field), Outlook will remove the Spam field from the view. I think (although I'm not certain of this) that the removal is effective in all folders thenceforth. To avoid the problem, scan each folder before applying the view to it. My experience is with Outlook 2000 (under Windows 2000 and XP), so I can't be certain that more recent Outlook versions will exhibit the same behavior. Bob -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Steve Zimmerman Sent: Saturday, February 28, 2004 3:03 AM To: spambayes@python.org Subject: [Spambayes] Spam Score Field in Outlook Plugin Your documentation states that it is necessary to manually add this field in every folder where you want it. However, there is a way to make this part of a standard view (such as "Messages") so that it automatically appears in every folder with that view. First, make sure that the "Advanced" toolbar is displayed; this contains a dropdown box with the current view selected. Next, add the Spam Score Field as per your documentation, and position and size it the way you want. Finally, click in the dropdown box so that the current view is highlighted, and hit Enter. The current column layout is now the standard for the selected view. The first time certain folders are selected they may not show this column, but clicking to a different folder and then clicking back to them fixes this problem. Steve -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040228/c69b1432/attachment.html From rcoe at CambridgeMA.GOV Sat Feb 28 11:14:54 2004 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Sat Feb 28 11:15:06 2004 Subject: [Spambayes] can outlook xp, spambayes, and trend micro's pc-cillin co-exist? Message-ID: <42393C9DA7930245AB540667607F4F5022C281@SPIKE.city> You ought to be able to "chain" two or more proxy servers together, as long as you make sure they don't operate on conflicting port numbers. Most (all?) proxy servers allow you to specify both the server-side and client-side port numbers to avoid conflicts. One thing you probably would want to do is send your outgoing mail to the "highest" SMTP server in the chain, bypassing intervening servers. (Of course you may be sending it through another SMTP server altogether, in which case the issue doesn't arise.) Bob > -----Original Message----- > From: spambayes-bounces+rcoe=cambridgema.gov@python.org > [mailto:spambayes-bounces+rcoe=cambridgema.gov@python.org]On Behalf Of > Mark Hammond > Sent: Friday, February 27, 2004 7:58 PM > To: 'Salvador Castellano'; spambayes@python.org > Subject: RE: [Spambayes] can outlook xp, spambayes,and trend micro's > pc-cillin co-exist? > > > If you are thinking of using the Outlook addin, there should be no problem, > as this tool does *not* proxy the pop3 server - it uses Outlook's builtin > techniques for processing the mail. > > If you use a mail program other than Outlook, then you will need to use the > spambayes utility that does proxy - in that case, you may or may not have a > problem - but even then, SpamBayes is fairly benign in the way it operates, > so trying it, then removing it if there is a problem, should be a safe thing > to do. > > Mark. > > > -----Original Message----- > > From: spambayes-bounces@python.org > > [mailto:spambayes-bounces@python.org]On Behalf Of Salvador Castellano > > Sent: Saturday, 28 February 2004 11:33 AM > > To: spambayes@python.org > > Subject: [Spambayes] can outlook xp, spambayes,and trend micro's > > pc-cillin co-exist? > > > > > > Hi, > > > > I'm a newbie on this topic, so I hope you'll excuse me if the answer to my > > question is readily available or somehow otherwise stupid. I run Outlook XP > > and use Trend Micro's PC-cillin for antivirus. I'm really interested in > > using Spambayes, based on a recommendation from someone I trust and reading > > about it; but when I read the documentation about how it works, I saw that > > it works by proxying the POP3 server, as does PC-cillin's "POP3 Scan" > > feature. I'm reluctant to install new software just to find out that it > > collides with stuff that I already have and depend on, so I was hoping that > > somebody who understands how they both work could answer the question of > > whether or not they are compatible (or whether I somehow misunderstood it > > all). > > > > I searched the FAQ, the Wiki, and the archives for this list, but didn't > > find the answer. The answer or a pointer to the answer would be much > > appreciated as I am pretty darn sick of the spam! > > > > Thanks in advance, > > David Bernstein From StevenAZimmerman at comcast.net Sat Feb 28 13:02:49 2004 From: StevenAZimmerman at comcast.net (Steve Zimmerman) Date: Sat Feb 28 13:03:09 2004 Subject: [Spambayes] Spam Score Field in Outlook Plugin In-Reply-To: <42393C9DA7930245AB540667607F4F5022C280@SPIKE.city> Message-ID: <000001c3fe25$180b1390$0202a8c0@z> I'm using Outlook 2002. For unscanned folders, the column is there; it's just completely empty. As I mentioned, there's definitely a startup problem where not all folders get this revised view initially, and until it's well established in all of them, it may even disappear. I've seen this problem in all versions of Outlook when adding any fields via any method to an established view. So it may take a little coaxing (at worst) to get this permanently established in your view. But it's a lot easier than creating this field manually in every single folder. Steve -----Original Message----- From: Coe, Bob [mailto:rcoe@CambridgeMA.GOV] Sent: Saturday, February 28, 2004 7:36 AM To: Steve Zimmerman Cc: spambayes@Python.org Subject: RE: [Spambayes] Spam Score Field in Outlook Plugin I think you'll find that if you try to apply that view to a folder that has not been scanned (i.e., not all messages have the "Spam" field), Outlook will remove the Spam field from the view. I think (although I'm not certain of this) that the removal is effective in all folders thenceforth. To avoid the problem, scan each folder before applying the view to it. My experience is with Outlook 2000 (under Windows 2000 and XP), so I can't be certain that more recent Outlook versions will exhibit the same behavior. Bob -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Steve Zimmerman Sent: Saturday, February 28, 2004 3:03 AM To: spambayes@python.org Subject: [Spambayes] Spam Score Field in Outlook Plugin Your documentation states that it is necessary to manually add this field in every folder where you want it. However, there is a way to make this part of a standard view (such as "Messages") so that it automatically appears in every folder with that view. First, make sure that the "Advanced" toolbar is displayed; this contains a dropdown box with the current view selected. Next, add the Spam Score Field as per your documentation, and position and size it the way you want. Finally, click in the dropdown box so that the current view is highlighted, and hit Enter. The current column layout is now the standard for the selected view. The first time certain folders are selected they may not show this column, but clicking to a different folder and then clicking back to them fixes this problem. Steve -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040228/1f72d83f/attachment.html From sparky at itctel.com Sat Feb 28 16:26:51 2004 From: sparky at itctel.com (Rick) Date: Sat Feb 28 16:42:46 2004 Subject: [Spambayes] Help Message-ID: <000801c3fe41$9517f350$6701a8c0@gamer> I don't know if threre is a fix or work around. I use spambayes and love it but I also like to you Firetrust benign with it but I have a conflict as they both can't seem to share port 110. Can you be of any help. Thanks Rick -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040228/1748c149/attachment.html From kforman at mho.com Sat Feb 28 21:49:54 2004 From: kforman at mho.com (Ken Forman) Date: Sat Feb 28 21:46:05 2004 Subject: [Spambayes] .9 slowdown Message-ID: Hi All, I uninstalled .8x and installed .9. Now, when I hit "Delete as spam" it takes anywhere from 1 to 10 seconds! Can't live with this for long... (And yes, I restarted my computer to ensure it was still happening.) Outlook 2000 SR-1 Windows 2000 no log Thanks, Ken Forman From tpeters at mixcom.com Sat Feb 28 22:15:50 2004 From: tpeters at mixcom.com (Tom Peters) Date: Sat Feb 28 22:29:37 2004 Subject: [Spambayes] Does SpamBayes work under Opera? In-Reply-To: References: Message-ID: <5.1.0.14.2.20040228211224.0b977008@localhost> Also, the wiki has a page I added with an ultra quick start for the proxy that may help slightly: http://entrian.com/sbwiki/POP3ServiceQuickStartGuide At 05:27 PM 2/27/2004 -0500, Kenny Pitt wrote: >Albrecht Trefftz wrote: > > I am using opera, so I go within your webpage to "Platforms" - > > "Windows", than, under "Non Outlook solutions" I click on the link > > "Installation program" and the download site offers for download the > > file: > > > > - spambayes-1.0a9.exe. > > > > This installation does not offer the possibility to configure the > > email- address settings (user accounts, password, etc.) > > > > If I go to "Outlook 2000/XP/2003" and click on "download the > > installation program", it offers me identically the same file for > > download: > > > > - spambayes-1.0a9.exe > > > > Can you give me a helping hand? > >The installation program contains support for both types of >installation, and should prompt you to choose one or the other. You >want to choose the "Server/Proxy application". > >Configuration is done after installation completes, not by the wizard. >It is largely a manual process, but the readme file (readme_proxy.html) >should have enough info to get you going. > >-- >Kenny Pitt > > >_______________________________________________ >Spambayes@python.org >http://mail.python.org/mailman/listinfo/spambayes >Check the FAQ before asking: http://spambayes.sf.net/faq.html [Government]There is no distinctly native American criminal class save Congress. --Mark Twain --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From gonor at adelphia.net Sat Feb 28 23:11:16 2004 From: gonor at adelphia.net (Gary Norris) Date: Sat Feb 28 23:11:21 2004 Subject: [Spambayes] How do I remove SpamBayes Message-ID: <000001c3fe7a$13bdd8e0$26234344@s1099982951> I would like to know how to remove SpamBayes from my system. I have noted that you have not included an Uninstaller. I spend more time in training SpamBayes than I do reading my e-mail (the wanted messages). Seems to me it does not work. Gary -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040228/516bb29b/attachment.html From tameyer at ihug.co.nz Sat Feb 28 22:42:55 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 28 23:15:36 2004 Subject: [Spambayes] Does SpamBayes work under Opera? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1305362B4F@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677A0C@its-xchg4.massey.ac.nz> [Albrecht Trefftz] > This installation does not offer the possibility to configure the > email- address settings (user accounts, password, etc.) [Kenny] > Configuration is done after installation completes, not by > the wizard. It is largely a manual process, but the readme > file (readme_proxy.html) should have enough info to get you going. FWIW, there is a configuration wizard of sorts being developed for the non-Outlook (Windows) applications. The installer installs this (if you choose the non-Outlook install) as "setup_server.exe" in the {pf}\SpamBayes\bin directory. If you double-click it, you should be offered configuration for M2 (Opera's mailer). This *might* work for you. I would certainly not recommend it if you aren't able to re-configure your Opera settings if necessary. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From tameyer at ihug.co.nz Sat Feb 28 23:16:44 2004 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Feb 28 23:17:16 2004 Subject: [Spambayes] How do I remove SpamBayes In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13054AF2E8@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1304677A0D@its-xchg4.massey.ac.nz> > I would like to know how to remove SpamBayes > from my system. I have noted that you have > not included an Uninstaller. If you installed with one of the installers, then, yes, an uninstaller is included. You uninstall like you uninstall almost any Windows program. 1. Open the "Add/Remove Programs" control panel. 2. Find "SpamBayes" or "SpamBayes Outlook addin", depending on which version you have. 3. Click uninstall. > I spend more time in training SpamBayes than I do > reading my e-mail (the wanted messages). > Seems to me it does not work. You are, of course, most welcome to simply uninstall SpamBayes. If you do want to use it though, we can probably help you figure out what you're doing wrong, so that it does work. An example "show clues" from a message that gets incorrectly classified would greatly help, if that is the case. =Tony Meyer --- Please always include the list (spambayes@python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. This way, you get everyone's help, and avoid a lack of replies when I'm busy. From Amir_Katz at bmc.com Sun Feb 29 07:07:02 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Sun Feb 29 07:09:07 2004 Subject: [Spambayes] SpamBayes proxy, Outlook Express & anti-virus Message-ID: I'm running SB at home with OE and MacAfee VirusScan. Every so often the AV pops up and reports that a virus was found in file .exe which is located in what seems to be SB's working directory. My assumption is that SB assembles the mail's attachments prior to examining the full message and as soon as an attachment (which is an .exe file) is created, the AV kicks in and correctly shoots it. Questions: 1) Is this scenario correct? If no, what really happens? 2) If yes, is there a way to tell SB not to assemble attachments and avoid the remote chance that the exe will remain there? Thanks, Amir Katz, CISSP "The C Programming Language - A language which combines the flexibility of assembly language with the power of assembly language." -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20040229/b4adb19e/attachment-0001.html From alyssa at cambersoft.com Sun Feb 29 10:21:52 2004 From: alyssa at cambersoft.com (Alyssa Franks) Date: Sun Feb 29 10:12:09 2004 Subject: [Spambayes] spambays/outlook/deleted box question Message-ID: Prior to installing Spambays I set up a bunch of rules in Outlook (not express) for my junk mail. I've got perhaps thousands of junk email addresses in my junk senders list. This seems (I think) to be causing spambay some problems. the reason why I think it's doing this is because for some unknown other reason unrelated to this problem I get two emails of the same spam. One correctly goes into the spambays Junk E-Mail folder the second goes into my Deleted items folder. I think spambays grabs one email & the outlook rules grabs the other. Got any suggestions? thanks in advance! alyssa "It was a good day on Mars." - MATT WALLACE, manager of the Spirit robotic rover mission. From hxamaranth at hotmail.com Sun Feb 29 13:42:25 2004 From: hxamaranth at hotmail.com (hxamaranth@hotmail.com) Date: Sun Feb 29 13:42:28 2004 Subject: [Spambayes] Problem with POP3 Proxy Message-ID: A non-text attachment was scrubbed... Name: SpamBayesServer1.log Type: application/octet-stream Size: 179 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20040229/448ba1b1/SpamBayesServer1.obj -------------- next part -------------- I am using SpamBayes POP3 Proxy Version 0.4 (February 2004) (binary), with version 2.3.2+ (#49, Oct 23 2003, 15:50:06) [MSC v.1200 32 bit (Intel)] of Python; my operating system is Windows 4.10.67766222.1 ( ). I have trained 0 ham and 0 spam. The problem I am having is with setting up SpamBytes in Outlook Express. On the configuration page I am using "mail.comcast.net" on Port 25 as shown in my Comcast account settings in Outlook Express. The outgoing smtp server is "smtp.comcast.net" on port 110. According to the configuration page: Remote Servers: The SpamBayes POP3 proxy intercepts incoming email and classifies it before sending it on to your email client. You need to specify which POP3 server(s) you wish it to intercept - a POP3 server address typically looks ike "pop3.myisp.net". If you use more than one server, simply separate their names with commas. You can get these server names from your existing email configuration, or from your ISP or system administrator. If you are using Web-based email, you can't use the SpamBayes POP3 proxy (sorry!). IN YOUR EMAIL CLIENT'S CONFIGURATION, WHERE YOU WOULD NORMALLY PUT YOUR POP3 SERVER ADDRESS, YOU SHOULD NOW PUT THE ADDRESS OF THE MACHINE RUMMING SpamBayes. The address of the local machine? What does that mean? It's a desktop computer, it doesn't have an address. I tried using "localhost" but that did not work. Do I need to use "http://localhost" or something of that sort? From peeg67 at hotmail.com Sun Feb 29 15:55:49 2004 From: peeg67 at hotmail.com (P Gard) Date: Sun Feb 29 15:55:55 2004 Subject: [Spambayes] Training Database Status Message-ID: My database has 600 "ham" and 14,000 "spam". I obviously get huge amounts of spam. Spambayes is very accurate, but the ratio is huge. How can I get more "ham" trained to decrease the ratio? I have used all folders with "ham", and have nothing more to use! I hate to turn off the spam feature and just train for good messages, as I get about 250 spam messages per day! Any help would be great. Thanks.. _________________________________________________________________ Dream of owning a home? Find out how in the First-time Home Buying Guide. http://special.msn.com/home/firsthome.armx From Amir_Katz at bmc.com Sun Feb 29 16:46:54 2004 From: Amir_Katz at bmc.com (Katz, Amir) Date: Sun Feb 29 16:48:38 2004 Subject: [Spambayes] Training Database Status Message-ID: If SB is doing a good job, why do you care? Alternatively, you can sign up to some high-volume newsletters and mailing lists. -----Original Message----- From: P Gard [mailto:peeg67@hotmail.com] Sent: Sunday, February 29, 2004 22:56 To: spambayes@python.org Subject: [Spambayes] Training Database Status My database has 600 "ham" and 14,000 "spam". I obviously get huge amounts of spam. Spambayes is very accurate, but the ratio is huge. How can I get more "ham" trained to decrease the ratio? I have used all folders with "ham", and have nothing more to use! I hate to turn off the spam feature and just train for good messages, as I get about 250 spam messages per day! Any help would be great. Thanks.. _________________________________________________________________ Dream of owning a home? Find out how in the First-time Home Buying Guide. http://special.msn.com/home/firsthome.armx _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html