From JJ708 at bellsouth.net Sat Nov 1 08:09:10 2003 From: JJ708 at bellsouth.net (John Jones) Date: Sat Nov 1 08:09:26 2003 Subject: [Spambayes] Error on install - release 0.81 on Outlook 2002 Message-ID: <000201c3a079$5a7e00c0$77a0d6d1@IBMR32> Release .7 also receives the same error as documented below Thanks John Jones JJ708@bellsouth.net -----Original Message----- From: John Jones [mailto:JJ708@bellsouth.net] Sent: Thursday, October 30, 2003 7:59 PM To: 'spambayes@python.org' Subject: Unable to install on Outlook 2002 When I attempt to install release 0.81 on Outlook 2002 (on Windows XP) I receive the following error message. Unable to register DLL/OCX: Dll Register server failed; code 0x00000000. Any suggestions would be appreciated. Thanks John Jones JJ708@bellsouth.net From weatherman55 at optonline.net Sat Nov 1 16:31:56 2003 From: weatherman55 at optonline.net (Danny) Date: Sat Nov 1 16:28:18 2003 Subject: [Spambayes] (no subject) Message-ID: V 0.81 WinXPPro When I try to recover emails from "Junk Suspects" they completely disappear....I cannot find them. Please tell me 1. where they are 2. how i can stop this from happening. Thank you From rowdanny at hotmail.com Sat Nov 1 16:51:24 2003 From: rowdanny at hotmail.com (Danny .) Date: Sat Nov 1 16:51:28 2003 Subject: [Spambayes] lost email Message-ID: When i try to recover an email from the "junk suspects" it disappears. A search comes up empty. Can anyone tell me 1. where to find these emails and 2. how to stop this behavior? Thanks Danny _________________________________________________________________ Add MSN 8 Internet Software to your current Internet access and enjoy patented spam control and more. Get two months FREE! http://join.msn.com/?page=dept/byoa From olli.marttila at kolumbus.fi Sat Nov 1 19:27:46 2003 From: olli.marttila at kolumbus.fi (Olli J. Marttila) Date: Sat Nov 1 19:27:59 2003 Subject: [Spambayes] Installation difficulties / SuSE/KMail/IMAP Message-ID: <200311020227.46998.olli.marttila@kolumbus.fi> Please, help! I am at loss with installing spambayes. I have tried to get it installed innumerable hours in two weeks. Sigh... OS: SuSE Linux, version 8.2 Personal. IMAP protocol (kolumbus.fi) Python 2.3.2 (#1, Oct 23 2003, 00:59:25) [GCC 3.3 20030226 (prerelease) (SuSE Linux)] on linux2 Spambayes-1.0a6 Mail programme: KMail 1. I downloaded spambayes-1.0a6.1.tar.gz and detarred it in directory /downloads/spambayes-1.0a6 as root. 2. In that directory I run ./setup.py install. Thereafter I deleted the directory in question, again as root. 3. I opened X window as marttila and run '/usr/local/bin/sb_imapfilter.py -b' with the following result message: ---- SpamBayes IMAP Filter Beta1, version 0.1 (September 2003), using SpamBayes IMAP Filter Web Interface Alpha2, version 0.02 and engine SpamBayes Beta2, version 0.2 (July 2003). You do not have a dbm module available to use. You need to either use a pickle (see the FAQ), use Python 2.3 (or above), or install a dbm module such as bsddb (see http://sf.net/projects/pybsddb). ---- What does this mean? I _have_ Python 2.3 installed, as seen above. I also have installed Berkeley DB, version 4.1.2.5 and bsddb3-4.1.6, just in case ... Building bsddb3 ('python setup.py build') seemed to go through without any complaints. However 'python test.py' complained about ImportError: libdb-4.1.so: cannot open shared object file: No such file or directory. However, 'locate libdb-4.1.so' tells that the library in question is in the directory /usr/local/BerkeleyDB.4.1/lib/. Whatever I do, the only result of my installation attempts is the above message complaining about a missing dbm module. Am I dumb - inexperienced I am, sure - as so many seem to have succeeded in installing Spambayes - or is there something missing from the installation directions? Olli J. Marttila From spambayes at whateley.com Sat Nov 1 20:08:16 2003 From: spambayes at whateley.com (Brendon) Date: Sat Nov 1 20:08:23 2003 Subject: [Spambayes] Installation difficulties / SuSE/KMail/IMAP In-Reply-To: <200311020227.46998.olli.marttila@kolumbus.fi> References: <200311020227.46998.olli.marttila@kolumbus.fi> Message-ID: <200311011708.16510.spambayes@whateley.com> Try typing "python -V" at a command prompt. You may have two versions of python on your machine and could be picking up an older one in the path. Brendon On Saturday 01 November 2003 04:27 pm, Olli J. Marttila wrote: > Please, help! > > I am at loss with installing spambayes. I have tried to get > it installed innumerable hours in two weeks. Sigh... > > OS: SuSE Linux, version 8.2 Personal. > IMAP protocol (kolumbus.fi) > Python 2.3.2 (#1, Oct 23 2003, 00:59:25) > [GCC 3.3 20030226 (prerelease) (SuSE Linux)] on linux2 > Spambayes-1.0a6 > Mail programme: KMail > > > 1. I downloaded spambayes-1.0a6.1.tar.gz and detarred it in > directory /downloads/spambayes-1.0a6 as root. > > 2. In that directory I run ./setup.py install. Thereafter I > deleted the directory in question, again as root. > > 3. I opened X window as marttila and run > '/usr/local/bin/sb_imapfilter.py -b' > with the following result message: > > ---- > > SpamBayes IMAP Filter Beta1, version 0.1 (September 2003), > using SpamBayes IMAP Filter Web Interface Alpha2, version > 0.02 and engine SpamBayes Beta2, version 0.2 (July 2003). > > You do not have a dbm module available to use. > You need to either use a pickle (see the FAQ), use Python > 2.3 (or above), or install a dbm module such as bsddb (see > http://sf.net/projects/pybsddb). > > ---- > > What does this mean? I _have_ Python 2.3 installed, as seen > above. > I also have installed Berkeley DB, version 4.1.2.5 and > bsddb3-4.1.6, just in case ... > > Building bsddb3 ('python setup.py build') seemed to go > through without any complaints. However 'python test.py' > complained about ImportError: libdb-4.1.so: cannot open > shared object file: No such file or directory. > > However, 'locate libdb-4.1.so' tells that the library in > question is in the directory > /usr/local/BerkeleyDB.4.1/lib/. > > Whatever I do, the only result of my installation attempts > is the above message complaining about a missing dbm > module. > > Am I dumb - inexperienced I am, sure - as so many seem to > have succeeded in installing Spambayes - or is there > something missing from the installation directions? > > Olli J. Marttila > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From jabailo at earthlink.net Sun Nov 2 02:12:42 2003 From: jabailo at earthlink.net (j. a. bailo) Date: Sun Nov 2 02:12:47 2003 Subject: [Spambayes] Sending on smtp proxy with Evolution Message-ID: <20275462.1067757162671.JavaMail.root@misspiggy.psp.pas.earthlink.net> I am a spam bayes newbie. Using: Redhat Linux 9 Evolution mail client Earthlink ISP I set up the pop3 proxy easily. It does not do any spam catching yet but I have not yet 'trained' it. The smpt proxy is the problem: 0. I set up the smtp outbound proxy for localhost:9002 then configured Evolution to use localhost:9002 as the sending mail server instead of mail.earthlink.net. The mail is not able to be sent - -I am guessing because mail.earthlink.net requires authentication in order to send mail. How do I configure the proxy to use authentication ? Oddly, the pop3 proxy, which also requires authentication seems to work fine (!?) Once I solve this I can start training spam bayes by sending mail to the spam address @ localhost. ps -- what is 'ham' ? I read about it in several places but did not see a hyperlink or definition to explain what it is. Thanks, john bailo ______________________ j. a. bailo From jchaines at qwest.net Sun Nov 2 13:36:38 2003 From: jchaines at qwest.net (jchaines) Date: Sun Nov 2 14:44:50 2003 Subject: [Spambayes] Addin does not load - Buttons Not Removed With Message-ID: Hello from Colorado, I have the almost the exact same problem that you sent in. The difference is that I somehow did manage to get rid of the Spambayes icons in my Outlook window by accidentally clicking around in the area of the icons as sown below. I thought I was changing from the inbox to the Sent box when all of the Spambayes related icons disappeared including those under the Folder List section! After removing every trace of SpamBayes that I could locate, I have subsequently tried several times to reinstall Spambayes with no success. Did you ever get a useful response to your posting? Is so, I?d love to see the answer. Thanks, Jim Haines. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 14663 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031102/d157cce0/attachment-0001.png -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 20189 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031102/d157cce0/attachment-0001.jpe From brian at jocund.net Sun Nov 2 20:31:38 2003 From: brian at jocund.net (Brian C. Kiefer) Date: Sun Nov 2 20:31:45 2003 Subject: [Spambayes] problem when starting outlook after the first time starting after installation Message-ID: <000001c3a1aa$3bdbaaf0$3200a8c0@case> Hi, When I first installed SpamBayes, it ran correctly the first time. I trained it. It worked well. The next time I started Outlook (shutdown properly), it asked if I wanted to disable the plug-in or continue with it not work. When I choose to continue, Outlook crashes. It fails in the following DLL: shlwapi.dll, ModVer: 6.0.3790.0. I am running SpamBayes 008.1 and Outlook 10.0.2627.1. Below is the copy of the SpamBaye logs. Sincerely, Brian Kiefer -=-=-=-=-= Begin Log =-=-=-=-=-=- Loaded bayes database from 'C:\Documents and Settings\kieferbc\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\kieferbc\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 195 spam and 0 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.2.3790 () using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] -=-=-=-=-= End Log =-=-=-=-=-=- From sethg at GoodmanAssociates.com Sun Nov 2 20:51:42 2003 From: sethg at GoodmanAssociates.com (Seth Goodman) Date: Sun Nov 2 20:51:43 2003 Subject: [Spambayes] using external DNSBL's with Outlook Message-ID: Even though the FAQ clearly states that *ideally*, DNSBL's are done at the server level, many of us are stuck with ISP's who do little, if any of this. Personally, I would prefer SMTP rejects at the server for MX's listed on selected DNSBL's, but since I don't run my own mail server, I don't have this option. Therefore, I have taken to using a POP3 proxy to check several external DNSBL's and filter based on the results. When there is a hit on an external list, the POP3 proxy adds an "X-" type header line to identify the source of the hit. I then have an Outlook rule that filters on that header line and moves the message to the spam folder. I would like to figure out a way to best use this added information along with SpamBayes. As I don't know how the SpamBayes Outlook plug-in fits into the hierarchy of POP3 email processing, it is hard for me to figure out what to do. Before installing SpamBayes, my POP3 email went through the following processing hierarchy (as far as I know): 1) anti-virus filters email coming in on port 110 2) POP3 proxy checks header IP addresses against external DNSBL's 3) Outlook2000 processes message rules The FAQ talks about a POP3 proxy component, but I don't think this applies to the Outlook plug-in. How does the SpamBayes Outlook plug-in fit in with the above processing hierarchy? When exactly does SpamBayes get to work on a message? There is some mention in the FAQ about "fighting" between SpamBayes and the Outlook rule processing, and suggests that the background filtering option will fix this. How does the background filtering option affect the processing order? What point in time are the processing delays in the control panel for background processing referenced to? There is also mention in the FAQ about an "incremental" training mode where all the incoming mail is included in the database. Is this the default processing mode? If not, do you recommend this and how do I enable it? Thanks for a great program! Regards, Seth Goodman Goodman Associates, LLC From tim.one at comcast.net Sun Nov 2 21:25:55 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Nov 2 21:25:59 2003 Subject: [Spambayes] using external DNSBL's with Outlook In-Reply-To: Message-ID: [Seth Goodman] > ... > Before installing SpamBayes, my POP3 email went through the following > processing hierarchy (as far as I know): > > 1) anti-virus filters email coming in on port 110 > 2) POP3 proxy checks header IP addresses against external DNSBL's The Outlook plug-in doesn't use a proxy. 2') Outlook sucks the email of your POP3 server, and stores it in its own internal format. > 3) Outlook2000 processes message rules > > The FAQ talks about a POP3 proxy component, but I don't think this > applies to the Outlook plug-in. That's right. > How does the SpamBayes Outlook plug-in fit in with the above > processing hierarchy? Somewhere around #3. In particular, your POP3 server is history by the time the plugin sees a message. Indeed, it's difficult for SpamBayes to try to reconstruct the bytestream your POP3 server originally delivered (step 2' turns it into quite a mess). > When exactly does SpamBayes get to work on a message? Heh: nobody knows. The Outlook rule system isn't strong enough to allow plugging SpamBayes in as an Outlook rule, so instead SpamBayes hooks Outlook "folder add" events. Exactly when and why Outlook triggers one of those events isn't documented by Microsoft, and appears to vary wildly depending on Outlook version and system load. Sometimes SpamBayes appears to see a message before Outlook rules run, sometimes after they run, sometimes we don't get notified about a new message at all, and sometimes we get notified about a new message more than once (e.g., if an Outlook rule moves a message and SpamBayes is set to watch both the Inbox and the destination folder, we *might* be told about a message twice). > There is some mention in the FAQ about "fighting" between SpamBayes > and the Outlook rule processing, and suggests that the background > filtering option will fix this. How does the background filtering > option affect the processing order? It slows SpamBayes down a lot, under the empirically verified theory that Outlook rules will more-than-less reliably run first then. There's no way we know of to make SpamBayes reliably run before Outlook rules run. > What point in time are the processing delays in the control panel for > background processing referenced to? Stick the cursor inside one of the "seconds" boxes and hit F1. A balloon will pop up explaining what the value in that seconds-box controls. > There is also mention in the FAQ about an "incremental" training mode > where all the incoming mail is included in the database. Is this the > default processing mode? If not, do you recommend this and how do I > enable it? That's not relevant to the Outlook plugin. From rwschultz at pacbell.net Sun Nov 2 21:50:43 2003 From: rwschultz at pacbell.net (Bob) Date: Sun Nov 2 21:50:46 2003 Subject: [Spambayes] User and Password request Message-ID: I am trying to set up SpamBays with Windows XP Pro using the Express Plus email program. Downloads are all current versions. On running the scripts/sb_server.py -b command, I was able to configure the program for localhost, but then my email wouldn't work at all. I tried to go back into the configuration program to change to my machine name, but was asked for a user name and password. What user name and password does this want. I have tried all I can think of to no avail. Thanks, Bob From sethg at GoodmanAssociates.com Sun Nov 2 23:45:51 2003 From: sethg at GoodmanAssociates.com (Seth Goodman) Date: Sun Nov 2 23:45:59 2003 Subject: [Spambayes] using external DNSBL's with Outlook In-Reply-To: Message-ID: > [Seth Goodman] > > > When exactly does SpamBayes get to work on a message? > > Heh: nobody knows. The Outlook rule system isn't strong enough to allow > plugging SpamBayes in as an Outlook rule, so instead SpamBayes > hooks Outlook > "folder add" events. Exactly when and why Outlook triggers one of those > events isn't documented by Microsoft, and appears to vary wildly depending > on Outlook version and system load. Sometimes SpamBayes appears to see a > message before Outlook rules run, sometimes after they run, sometimes we > don't get notified about a new message at all, and sometimes we > get notified > about a new message more than once (e.g., if an Outlook rule > moves a message > and SpamBayes is set to watch both the Inbox and the destination > folder, we > *might* be told about a message twice). > That sounds like it was really hard to work around. Microsoft probably didn't document the relationship to the rule processor because they don't really know themselves. I bet there were too many people involved and the code's an unholy mess. The only way I can think of to avoid this (I'm sure you folks have beat this one up and down) is to install yourself as a proxy so you get the message before Outlook mangles it. You'd have to mark up the headers with the results and then have the Outlook rules process things "normally". Of course, that would make all the nice stuff that you've added in Outlook nearly impossible. This is a wonderful package you've put together. I particularly appreciate how easy it is to simultaneously retrain the classifier and move the message to it's proper folder by just hitting a button. That makes operation nearly effortless. I'll fool around with the background processing option to see if I can get DNSBL tagged messages moved before SpamBayes classifies them. Alternatively, is there some way that I can tell SpamBayes to tokenize the particular X- header line that contains the DNSBL tag? I'm not even certain that would achieve the desired result. Regards, Seth Goodman Goodman Associates, LLC From sanjaydarisi at cox.net Mon Nov 3 04:56:14 2003 From: sanjaydarisi at cox.net (Sanjay Darisi) Date: Mon Nov 3 04:51:00 2003 Subject: [Spambayes] Problem with creating outlook binary installer! In-Reply-To: <65i4uxd8.fsf@python.net> References: <20031101101308.EFFQ24944.fed1mtao01.cox.net@smtp.west.cox.net> <65i4uxd8.fsf@python.net> Message-ID: <3FA6263E.2010407@cox.net> I am curious on how to make the spambayes outlook addin binary installer. Is py2exe used or McMillan installer? 'cos I tried with both and got errors. Like I tried using py2exe with setup_all.py file in the windows/py2exe dir and got a few errors regarding the options of py2exe like 'exclude-dll' etc. As Thomas Heller suggested, I have used the latest py2exe (0.5.0a4), win32all-161 and grabbed the cvs version of setup_all.py file. No luck :-( Now, i am getting import module - no module found regarding the typelib modules mentioned in the setup_all.py file. How to create binary installer for outlook addin? Then i went back to McMillan installer and tried. I used the Outlook2000\installer\spambayes_addin.py|spec|iss files. I was able to get a .exe file at the end. But, it won't work like I was able to install and the spambayes won't come up when I start the Outlook. Is it the same file that is used in building the spambayes outlook addin binary installer? I have few COM errors in the log file, complaining about lack of resource section for the image files and an assertion error saying 'Should not yet have a toolbar' Is there anything else that I have to do inaddition to using those files? Could anyone let me know how the binary installer of spambayes Outlook addin is built and could you share the script used in this process? I'd really appreciate your help. Thank you, Sanjay. Thomas Heller wrote: > writes: > > > >>I am curious on how to make the spambayes outlook addin binary >>installer. Is py2exe used or McMillan installer? 'cos I tried with >>both and got errors. Like I tried using py2exe with setup_all.py file >>in the windows/py2exe dir and got a few errors regarding the options >>of py2exe like 'exclude-dll' etc. I have py2exe 0.4.2 and python 2.3 >> >> > >If you try py2exe, you should use the 0.5.0a prerelease in the files >section. You need win32all build 161, however. > >Thomas > > >_______________________________________________ >spambayes-dev mailing list >spambayes-dev@python.org >http://mail.python.org/mailman/listinfo/spambayes-dev > > > From dbartling at revereelectric.com Mon Nov 3 09:35:26 2003 From: dbartling at revereelectric.com (Donna Bartling) Date: Mon Nov 3 09:36:28 2003 Subject: [Spambayes] help Message-ID: I had tried SpamBayes and loved it for two weeks. It worked as it should. Then every time I opened Outlook, it would lock up and not install properly. I finally had to delete the spambayes from my computer to be able to use my Outlook. Have you seen this before? I have tried to reload a couple of times but it locks up my system, and I have to delete again. I have Windows 2000, Pro Donna Bartling -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 862 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031103/5d3e8c50/attachment.gif From GrangMJ at LOUISVILLE.STORTEK.COM Mon Nov 3 10:05:33 2003 From: GrangMJ at LOUISVILLE.STORTEK.COM (Granger, Michael J) Date: Mon Nov 3 10:05:42 2003 Subject: [Spambayes] (no subject) Message-ID: <80E84A07223C5E48AFD64B81788349C308E4369C@hqemaila.louisville.stortek.com> Sir, I have been using this wonderful program now for a few months on my Outlook 2000. Over the weekend, for reasons I won't go into I had to get upgraded to Office XP. Now, in my new iteration of Outlook, SpamBayes does not work. The pulldown interface is still there, but when I click on Spambayes Manager, it doesn't respond. It's like it's just dead. Is this a known problem and does it have a solution? Thanks, _Mike Granger From kenfield at sprynet.com Mon Nov 3 10:08:11 2003 From: kenfield at sprynet.com (Kenfield) Date: Mon Nov 3 10:05:48 2003 Subject: [Spambayes] Purchase of Spambayes Message-ID: <001301c3a21c$70a4ac10$6501a8c0@kekbackup> I purchased your product and requested a disk. This was done several weeks ago. We have not received the disk. Please advise when we may expect to receive it Thanks Ken Kennedy 310 877 9985 370 Fordyce Road Los Angeles, CA 90049 kenfield@sprynet.com From anthony at interlink.com.au Mon Nov 3 10:06:15 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Mon Nov 3 10:09:50 2003 Subject: [Spambayes] Purchase of Spambayes In-Reply-To: <001301c3a21c$70a4ac10$6501a8c0@kekbackup> Message-ID: <200311031506.hA3F6FrO016553@localhost.localdomain> >>> "Kenfield" wrote > I purchased your product and requested a disk. > This was done several weeks ago. > We have not received the disk. > Please advise when we may expect to receive it > Thanks I think you must be confused - spambayes is non-commercial open source software, and as far as I know, no-one's selling any disks of it. So I'm not sure who you purchased it from, or what you purchased, but it almost certainly wasn't us. Anthony -- Anthony Baxter It's never too late to have a happy childhood. From sethg at GoodmanAssociates.com Mon Nov 3 11:43:15 2003 From: sethg at GoodmanAssociates.com (Seth Goodman) Date: Mon Nov 3 11:43:19 2003 Subject: [Spambayes] using external DNSBL's with Outlook In-Reply-To: Message-ID: > > [Seth Goodman] > > > > > When exactly does SpamBayes get to work on a message? > > > > Heh: nobody knows. The Outlook rule system isn't strong > enough to allow > > plugging SpamBayes in as an Outlook rule, so instead SpamBayes > > hooks Outlook > > "folder add" events. Exactly when and why Outlook triggers one of those > > events isn't documented by Microsoft, and appears to vary > wildly depending > > on Outlook version and system load. Sometimes SpamBayes > appears to see a > > message before Outlook rules run, sometimes after they run, sometimes we > > don't get notified about a new message at all, and sometimes we > > get notified > > about a new message more than once (e.g., if an Outlook rule > > moves a message > > and SpamBayes is set to watch both the Inbox and the destination > > folder, we > > *might* be told about a message twice). > > I'm sure I'm not telling you anything you don't know, but I had another thought on this and I was just curious. There is an Outlook rule action called "perform a custom action". I have no idea how to hook into that, but if it does what the name suggests, then you have a chance of being reliably triggered after the arrival of new messages. Your process would then execute as an ordinary rule in a user-specified sequence along with all other rules, making the system more flexible. Since the other rules do seem to operate reliably after new messages arrive, I assume that Outlook triggers rule processing properly, even if it triggers other actions erratically. You could still continue to trigger on "folder add events" to look for drag-and-drop reclassifications, if you wished. As an aside, I notice that you only update the database when the user reclassifies an email. I'm just curious why you don't update the database based on *all* new messages classified? Wouldn't this increase the accuracy of the classifier? This is listed in the bug tracker with no response, but it is obviously not a bug, it is part of the design. Is there a down side to having the database represent too many messages? The other part of this question is that if I successfully make a rule that shunts off the DNSBL-tagged messages before SpamBayes get to see them, does moving them with an Outlook rule into the SpamBayes spam folder trigger you to train on that message, as if it were a reclassification? Is this a no-no since it was never classified in the first place? Regards, Seth Goodman Goodman Associates, LLC From sethg at GoodmanAssociates.com Mon Nov 3 11:46:28 2003 From: sethg at GoodmanAssociates.com (Seth Goodman) Date: Mon Nov 3 11:46:30 2003 Subject: [Spambayes] optimal training ratio of spam/ham Message-ID: Is there an optimal ratio of spam to ham for a training set? Is there an optimal total corpus size for training? Regards, Seth Goodman Goodman Associates, LLC From rmalayter at bai.org Mon Nov 3 12:25:52 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Nov 3 12:25:56 2003 Subject: [Spambayes] optimal training ratio of spam/ham Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C35F1@cliff.bai.org> > From: Seth Goodman > Subject: [Spambayes] optimal training ratio of spam/ham > > Is there an optimal ratio of spam to ham for a training set? Yes, there is: 1-to-1 > Is there an optimal total corpus size for training? Not really, but evidence seems to suggest that a thousand or messages in seems to work well. However, 10,000 or more messages seems to decrease the capture rates somewhat. From don at goscooters.com Mon Nov 3 13:38:54 2003 From: don at goscooters.com (Go Scooters) Date: Mon Nov 3 13:39:07 2003 Subject: [Spambayes] Outlook 98 Message-ID: <001501c3a239$bbbd8060$02a5e8d8@dgc.bc.hsia.telus.net> DGC's Tallship Hi, Can anyone please tell me if Spambayes will work with Outlook 98? In particular will the Outlook plug-in work? Thanks, Don From clewis at iquest.net Mon Nov 3 13:53:53 2003 From: clewis at iquest.net (Chuck Lewis) Date: Mon Nov 3 13:45:58 2003 Subject: [Spambayes] Deleting Email in Definite Spam folder ? Message-ID: <01f401c3a23b$d4fbfc80$190a10ac@GR43> Hi Folks, My Outlook .pst file is getting rather large and I was wondering about deleting email in the Spambayes Definite Spam folder which currently has 10,788 emails in it. Is that a good or bad idea ? I wasn't sure of this program rolls through this to "learn" and "relearn" ? Thanks ! Chuck From stramiello at cox.net Mon Nov 3 14:41:50 2003 From: stramiello at cox.net (Al Stramiello) Date: Mon Nov 3 14:43:54 2003 Subject: [Spambayes] Please help! Message-ID: I am using Windows 2000 ...... Outlook 2000 ... SpamBayes 0.81. I was able to install SpamBayes but cannot get the SpamBayes Wizard to run. I deleted SpamBayes according to the SpamBayes website and reinstalled. The Configuration Wizard still will not run. Any help will be greatly appreciated. Please provide specific steps to help. Thanks Al From richie at entrian.com Mon Nov 3 15:39:03 2003 From: richie at entrian.com (Richie Hindle) Date: Mon Nov 3 15:39:15 2003 Subject: [Spambayes] User and Password request In-Reply-To: References: Message-ID: [Bob] > I tried to go back into the configuration program to change > to my machine name, but was asked for a user name and password. > What user name and password does this want. Do you mean that your web browser is asking you for a username and password before it will show you the configuration page? You must have checked one of the "HTTP Authentication" options, in which case it's asking for whatever username and password you entered there. If you left the defaults in place then the username and password will be "admin" / "admin". To prevent it prompting in future, set the "HTTP Authentication" option back to "None". -- Richie Hindle richie@entrian.com From jchaines at qwest.net Mon Nov 3 16:42:10 2003 From: jchaines at qwest.net (jchaines) Date: Mon Nov 3 16:42:47 2003 Subject: [Spambayes] Wizard not running on reinstallation and runningof Spabayes setup. Message-ID: In my short time on this forum, it appears that Al's problem is common to many users. Are their remnants of SpamBayes that remain in the registry after the user removes the program then tries to reinstall? If so, what are they and how can they be removed safely? I am extremely anxious to reinstall Spambayes but, like many others, have this same problem of the SpamBayes Wizard not appearing after downloading and running the setup program. Any help would be most appreciated. Thanks, Jim From erik at sky.cz Mon Nov 3 17:15:05 2003 From: erik at sky.cz (Erik Piper) Date: Mon Nov 3 17:20:35 2003 Subject: [Spambayes] Spambayes - a few more steps forward in Czech in plugin 0081 In-Reply-To: <16135.4777.574295.719680@montanaro.dyndns.org> Message-ID: Dear SpamBayes team, I'm the fellow who was working in a Czech environment, ran into SpamBayes crashing and burning on Czech characters, started asking questions on the list, and eventually stopped asking (too frustrated and too embarrassed at my ignorance within Python, though I still apologize for the unannounced disappearance). Today I downloaded version 8 of the plugin, first at work and at home, and found somewhat more promising, but still not perfect, results. Work computer (W98SE CZ, Outlook 2K CZ): ABSOLUTELY no problems at ALL. Home computer (W2K CZ, Outlook 2K CZ): Installation fine, startup fine. But: 1. clicking the Data Folder button in the Advanced tab leads to an error dialog: SpamBayes 'ascii' codec can't encode character u'\xed' in position 52: ordinal not in range(128) In the old discussions on this matter, the SpamBayes team noted that the guilty character (and this here seems to be the same one) was an accented i, which is contained in the Application Data folder in Czech Windows 2000. (I'm not sure about Win98 SE; it may have an unlocalized app. data folder name.) 2. When running Spambayes' Configuration Wizard using the "I have sorted..." option (though NOT when using the other two options), the same error message as mentioned above appears after the last step of the wizard is complete, i.e. just before jumping back into the SpamBayes manager's main dialog. 3. Attempts to use the Training tab to train SpamBayes based on designated spam/ham folders soft-hang during the "Saving: Writing to Database..." step. By "soft-hang," I mean that Outlook does not become hung, but the "Saving: Writing to database..." message remains until you switch tabs/close the dialog, and more importantly, a check of the Training database status after the training reveals no change to the sizes of the trained-ham and trained-spam counts. It sounds to me, as a layman, as if SpamBayes is having trouble getting to its own database due to problem 1 above. 4. The "Delete as Spam" and "Recover from "Spam" buttons have no effect. Interestingly, I have somehow managed to get 32 messages trained as ham, or so the Training Database Status tells me. Probably in the course of moving things about during setup, before I discovered that SpamBayes was not working as well as I had thought. I know you've solved a lot of problems related to Unicode so far; hopefully you'll be able to resolve this one as well. As usual I am available for testing. Though the mere thought of messing around with actual python scripts again makes my head spin, there's no end to what I'd do in the name of science. I guess. Cheers and thanks in advance, Erik Piper From rmalayter at bai.org Mon Nov 3 18:49:28 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Mon Nov 3 18:49:32 2003 Subject: [Spambayes] timing problems in cached mode in Outlook 2003 Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C35FA@cliff.bai.org> When running Outlook 2003 in cached mode (which only available when connecting to an Exchange server), the intermittent downloading of messages seems to confuse SpamBayes. I have the max background processing delay of 10 second turned on, but this doesn't seem to be long enough. Every time I open Outlook 2003, only about 7 to 10 of my new messages get filtered. I have to manually run the filtering. Is anyone else experiencing the same thing? My suggestions for ameliorating the problem would be 1) allow the user to make the initial processing delay longer than 10 seconds 2) have two separate delay values, one that is in effect when Outlook is first opened, the other delay that is in effect at all other times 3) have a "polling" process that checks for new items without a Spam field N seconds And of course, background processing and reasonable default delays should be automatically enabled when SB detects that Outlook 2003 is running in cached mode. Just wanted to get people's thoughts before I submitted a feature request. Regards, Ryan From adam.walker at rbwconsulting.com Mon Nov 3 21:09:09 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Mon Nov 3 21:09:26 2003 Subject: [Spambayes] Outlook 98 In-Reply-To: <001501c3a239$bbbd8060$02a5e8d8@dgc.bc.hsia.telus.net> References: <001501c3a239$bbbd8060$02a5e8d8@dgc.bc.hsia.telus.net> Message-ID: <3FA70A45.3030507@rbwconsulting.com> I believe Outlook 2000 was the first version to support the required APIs. So no, I don't think the plugin will work, however if you get your email imap or pop3 then you may use the proxies to run spambayes. Go Scooters wrote: >DGC's Tallship > >Hi, >Can anyone please tell me if Spambayes will work with Outlook 98? In >particular will the Outlook plug-in work? > >Thanks, >Don > >_______________________________________________ >Spambayes@python.org >http://mail.python.org/mailman/listinfo/spambayes >Check the FAQ before asking: http://spambayes.sf.net/faq.html > > > -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 261 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031103/34123d4b/attachment.bin From iamserg at hotmail.com Mon Nov 3 21:10:33 2003 From: iamserg at hotmail.com (Serg Kagno) Date: Mon Nov 3 21:10:40 2003 Subject: [Spambayes] I'm losing my new messages. Message-ID: <000401c3a278$d3f9e210$7243a2ac@bioscorrected21> I'm using Windows XP, and have downloaded SpamBayes within the last coupla days, and can't find my new messages. I'm looking for where there supposed to go, and I modify the folder, but they still keep disappearing. From victor at site0.com Mon Nov 3 21:12:46 2003 From: victor at site0.com (Victor Issa) Date: Mon Nov 3 21:12:08 2003 Subject: [Spambayes] Multiple Accounts - only main inbox being filtered Message-ID: I installed SpamBayes on Outlook 2003 and trained it with over 5000 messages. It appears to move items in the main inbox to the spam folders, but anything sent to another account or moved to other folders with Outlook rules is not being moved properly.. They are being scored properly. I've tried setting background filtering to several options but it doesn't seem to make a difference.. I've also tried turning background filtering off. Any suggestions? Sincerely, Victor Issa 818-894-3252 From WeMrthn at aol.com Mon Nov 3 21:22:34 2003 From: WeMrthn at aol.com (WeMrthn@aol.com) Date: Mon Nov 3 21:22:51 2003 Subject: [Spambayes] I'm losing my new messages. Message-ID: <1d6.137d6a46.2cd8676a@aol.com> I just wrote to you because SpamBayes, which I just installed, seems to be taking my messages and putting them somewhere. I try to change the folders in the settings choices, but the messages keep disappearing. I am using Windows XP. From hlyner at seznam.cz Mon Nov 3 22:25:55 2003 From: hlyner at seznam.cz (Heinz Lyner) Date: Mon Nov 3 22:31:03 2003 Subject: [Spambayes] Cannot uninstall/install Message-ID: I have the same problem as described here earlier: I have tried everything to remove any trace of Spambayes, but I have still a fragment of the Toolbar and the rating columns. And oft course I cannot install a new copy! For any help I would be grateful Regards t everybody for Prague Heinz From russ_foster at comcast.net Mon Nov 3 21:52:47 2003 From: russ_foster at comcast.net (Russ Foster) Date: Mon Nov 3 22:58:28 2003 Subject: [Spambayes] optimal training ratio of spam/ham In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A012C35F1@cliff.bai.org> Message-ID: On Mon, 3 Nov 2003, Ryan Malayter wrote: > > Is there an optimal total corpus size for training? > > Not really, but evidence seems to suggest that a thousand or messages in > seems to work well. However, 10,000 or more messages seems to decrease > the capture rates somewhat. After installed SpamBayes I deleted the database because I was curious how long it would take to become effective. I get about 100 emails a day, with about 1/3 of those bring "spam". The day after I installed SB I was got less than 5 spam messages in my INBOX (down from 33) and 3 in my REVIEW box. The remaining 25 were filtered properly. I have since tweaked my settings so that I need 93% (up from 90%) to be filtered as SPAM and 20% or less to keep in my INBOX. So anything between 20% and 93% is put in REVIEW. I just completed two weeks of SB (starting from a clean DB). Last week (start of week 2). I've gotten 1 spam message delivered to my INBOX, and probably 20 message put in REVIEW. 15 of the REVIEW messages were SPAM, the other 5 were good. I find SB to be very effective. I am currently training another product on my second email account which gets < 20 non-spam messages and 100 spam per day. I would have thought this should learn quicker, but its been a week and I still get 10-20 spam in my INBOX and 5 GOOD messages in my SPAM (this is a HUGE problem). If only I could get SB working on my Linux box, I think I'd be set, but I'm having lots of problems getting the dependancies installed... -r From wsy at merl.com Mon Nov 3 23:03:41 2003 From: wsy at merl.com (Bill Yerazunis) Date: Mon Nov 3 23:03:45 2003 Subject: [Spambayes] optimal training ratio of spam/ham In-Reply-To: (message from Russ Foster on Mon, 3 Nov 2003 20:52:47 -0600 (CST)) References: Message-ID: <200311040403.hA443fi27506@localhost.localdomain> From: Russ Foster I find SB to be very effective. I am currently training another product on my second email account which gets < 20 non-spam messages and 100 spam per day. I would have thought this should learn quicker, but its been a week and I still get 10-20 spam in my INBOX and 5 GOOD messages in my SPAM (this is a HUGE problem). If only I could get SB working on my Linux box, I think I'd be set, but I'm having lots of problems getting the dependancies installed... Russ: What product/program are you running on your Linux box? (I wrote one of the free ones; if it's mine and it isn't working for you, I'd like to see if I can fix it.) -Bill Yerazunis (spy on SpamBayes, author of CRM114) From vanhorn at whidbey.com Mon Nov 3 23:34:33 2003 From: vanhorn at whidbey.com (G. Armour Van Horn) Date: Mon Nov 3 23:34:55 2003 Subject: [Spambayes] optimal training ratio of spam/ham References: Message-ID: <3FA72C59.C42AF6C1@whidbey.com> What's the other product? Anything unusual about the mail pattern on that account? Van Russ Foster wrote: > I find SB to be very effective. I am currently training another product on > my second email account which gets < 20 non-spam messages and 100 spam per > day. I would have thought this should learn quicker, but its been a week > and I still get 10-20 spam in my INBOX and 5 GOOD messages in my SPAM > (this is a HUGE problem). -- ---------------------------------------------------------- Sign up now for Quotes of the Day, a handful of quotations on a theme delivered every morning. Enlightenment! Daily, for free! mailto:twisted@whidbey.com?subject=Subscribe_QOTD For web hosting and maintenance, visit Van's home page: http://www.domainvanhorn.com/van/ ---------------------------------------------------------- From neale at woozle.org Tue Nov 4 00:38:55 2003 From: neale at woozle.org (Neale Pickett) Date: Tue Nov 4 00:39:00 2003 Subject: [Spambayes] optimal training ratio of spam/ham In-Reply-To: (Russ Foster's message of "Mon, 3 Nov 2003 20:52:47 -0600 (CST)") References: Message-ID: Russ Foster writes: > If only I could get SB working on my Linux box, I think I'd be set, but > I'm having lots of problems getting the dependancies installed... Russ, I can only assume from this comment that you're using an RPM-based system. You may have a better luck installing Python from source. You can download the source tarball from python.org, untar it somewhere, then (as root): ./configure && make install After a few minutes, you should have a working version of Python, which should be all you need to get SpamBayes installed. Let me know how that goes :) Neale From anthony at interlink.com.au Tue Nov 4 04:51:58 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Nov 4 04:56:05 2003 Subject: [Spambayes] Spambayes - a few more steps forward in Czech in plugin 0081 In-Reply-To: Message-ID: <200311040952.hA49pwhv004482@localhost.localdomain> >>> "Erik Piper" wrote > Home computer (W2K CZ, Outlook 2K CZ): Installation fine, startup fine. But: > 1. clicking the Data Folder button in the Advanced tab leads to an error > dialog: > SpamBayes > 'ascii' codec can't encode character u'\xed' in position 52: ordinal no t in > range(128) Can you post the entire traceback here? Anthony -- Anthony Baxter It's never too late to have a happy childhood. From Frank.Sowinski at fujitsu-siemens.com Tue Nov 4 06:53:34 2003 From: Frank.Sowinski at fujitsu-siemens.com (Sowinski, Frank) Date: Tue Nov 4 06:56:00 2003 Subject: [Spambayes] Feature Request Message-ID: <1B482BD387CFF748B9D7547F4FBB722245E7A2@ABGEX01E.abg.fsc.net> Hi, we have some write only folders in our Outlook Exchange Server environment. One of the folders purpose is to store spam mails. I cannot read the contents of the folder but I can copy my spam mails to it. The Spam Bayes Outlook PlugIn does not allow the selection of that folder as spam folder. The parent folder shows a + sign, but you cannot expand it. I think the reeason is that the sub folder is write only. Please fix that in the browse dialog of SpamByes Outlook Plug In. Thank you Frank Sowinski From jeff at livdes.com Tue Nov 4 08:35:14 2003 From: jeff at livdes.com (Jeffrey Barnaby) Date: Tue Nov 4 08:34:53 2003 Subject: [Spambayes] possible bug Message-ID: Running- Windows 98se Binary version 0.81 September 9 2003 --- how do you uninstall spam bayes --- I see no way to remove this from my outlook bar Thank you for a quick response. Jeffrey Barnaby --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.534 / Virus Database: 329 - Release Date: 10/31/03 From kennypitt at hotmail.com Tue Nov 4 09:15:28 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 4 09:15:54 2003 Subject: [Spambayes] Multiple Accounts - only main inbox being filtered In-Reply-To: Message-ID: Victor Issa wrote: > I installed SpamBayes on Outlook 2003 and trained it with over 5000 > messages. > > It appears to move items in the main inbox to the spam folders, but > anything sent to another account or moved to other folders with > Outlook rules is not being moved properly.. They are being scored > properly. I've tried setting background filtering to several options > but it doesn't seem to make a difference.. I've also tried turning > background filtering off. If an Outlook rule moves a message out of your filtered folders before SpamBayes processes it then it will not be moved to your spam folder. Outlook rules should always run first if background filtering is on, but may still run first in some cases even if background filtering is off. You also need to make sure that the folder where "anything sent to another account" gets placed is included in your list of filtered folders. How are you determining that messages "are begin scored properly"? I've noticed that a lot of people don't realize that choosing "Show spam clues for current message" scores the selected message right then and there, and has nothing to do with any previous processing by SpamBayes. If you have set up your folder view in Outlook to display the Spam field added by SpamBayes (see "Viewing and Using the Spam Score Field" under SpamBayes/Help/About SpamBayes if you haven't), then a score in that field is a good indication that SpamBayes saw the message. If the field is blank then SpamBayes did not process the message automatically. Could you grab a copy of your log file after receiving several of the messages that aren't processed correctly, and then post it here? The "Check the log file" section under SpamBayes/Help/Troubleshooting Guide shows how to do this if you haven't done it before. -- Kenny Pitt From kennypitt at hotmail.com Tue Nov 4 09:44:39 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 4 09:45:02 2003 Subject: [Spambayes] possible bug In-Reply-To: Message-ID: Jeffrey Barnaby wrote: > --- how do you uninstall spam bayes --- I see no way to remove this > from my outlook bar. An unfortunate side-effect of the bizarre way that Outlook exposes toolbars to plugins. The problem is that we don't have access to the Outlook interfaces that allow removing the toolbar at the time of uninstall. If you have already run the uninstall, then the SpamBayes addin itself should be gone. You just need to remove the toolbar manually. If you right-click an unused area of the toolbar in your main Outlook window and select Customize, you should get a list of toolbars that includes SpamBayes. Just highlight SpamBayes and click Delete and the toolbar should go away. If you also have the Spam field showing in any of your views, you'll have to remove that manually as well by going to "Customize Current View", clicking "Fields...", and removing it from the "Show theses fields" list. -- Kenny Pitt From richard.somerset at ntlworld.com Tue Nov 4 10:27:35 2003 From: richard.somerset at ntlworld.com (Richard Somerset) Date: Tue Nov 4 10:28:38 2003 Subject: [Spambayes] Outlook2002 Message-ID: <000201c3a2e8$2c3180e0$0500a8c0@Somersets.local> Spambyes works very well for about 6 days but now I get a message 'You must configure the spam folder' but how do I do that? Regards, Richard Somerset From sharper at attglobal.net Tue Nov 4 10:53:53 2003 From: sharper at attglobal.net (Stephen F. Harper) Date: Tue Nov 4 11:11:34 2003 Subject: [Spambayes] DBRunRecoveryError: (-30982 possible cause Message-ID: <4.3.2.7.2.20031104104920.00b3d360@localhost> I have the error message >DBRunRecoveryError: (-30982, 'DB_RUNRECOVERY: Fatal error, run database >recovery -- fatal region error detected; run recovery') I think I can cause this reliably by clicking the train button and while the programming is training, exiting the "web page" in the browser. If you are training a "large" number of messages (~50 or so), it takes some amount of time. With my short attention span, I sometimes forget and try to open another web page, at this point I get a corrupted database. Hope this helps some.... Thanks for the great work!!! Stephen F. Harper Stephen F. Harper 919.303.2498 home 919.264.8532 cell s.f.harper@ieee.org From stephen.cholet at mci.com Tue Nov 4 12:54:59 2003 From: stephen.cholet at mci.com (Stephen Cholet) Date: Tue Nov 4 12:49:03 2003 Subject: [Spambayes] Outlook 2003 Message-ID: <002b01c3a2fc$c4a3bff0$59122aa6@wcomnet.com> Do you have any information that addresses any differences or issues associated with using the program with Outlook 2003? Thanks. Steve Stephen Cholet, PMP Planning Engineer IP Access Planning 678-259-1629 v 870-1629 From erik at sky.cz Tue Nov 4 12:51:43 2003 From: erik at sky.cz (Erik Piper) Date: Tue Nov 4 12:56:53 2003 Subject: [Spambayes] Spambayes - a few more steps forward in Czech in plugin 0081 In-Reply-To: <200311040952.hA49pwhv004482@localhost.localdomain> Message-ID: Sure, Anthony. Shame I didn't remember to do so in the first place. Exception in thread Thread-1: Traceback (most recent call last): File "out1.pyz/threading", line 436, in __bootstrap File "out1.pyz/threading", line 416, in run File "out1.pyz/dialogs.async_processor", line 226, in thread_target File "out1.pyz/train", line 169, in trainer File "out1.pyz/manager", line 311, in Adopt File "out1.pyz/shutil", line 170, in move File "out1.pyz/shutil", line 83, in copy2 File "out1.pyz/shutil", line 58, in copystat UnicodeEncodeError: 'ascii' codec can't encode character u'\xed' in position 52: ordinal not in range(128) The precise same footprint in the traceback arises when I repro the "soft hang" of training in the Training tab, mentioned in the original e-mail. At the risk of supplying useless information, here's the log from stepping through the configuration wizard: (...prelude...) Loaded bayes database from 'C:\Documents and Settings\Administrator\Data aplikaci\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\Administrator\Data aplikaci\SpamBayes\default_message_database.db' Bayes database initialized with 2 spam and 72 good messages *** - message database has 44 messages - bayes has 74 - something is screwey (...) GetNextPage with current 0 IDD_WIZARD_WELCOME GetNextPage with current 3 IDD_WIZARD_FOLDERS_TRAIN About to train with really_really_long_string_of_gobbledygook Checked 72 in folder Ham - 72 new entries found. Checked 2 in folder Junk E-Mail - 2 new entries found. GetNextPage with current 4 IDD_WIZARD_TRAIN GetNextPage with current 1 IDD_WIZARD_FOLDERS_WATCH GetNextPage with current 2 IDD_WIZARD_FOLDERS_REST Wizard Done! Saving wizard changes Saving configuration -> C:\Documents and Settings\Administrator\Data aplikaci\SpamBayes\Microsoft Outlook Internet Settings.ini In other words, even though Spambayes throws me an error dialog, no event is logged. By the way, I can no longer repro the failure of the "Recover from Spam" and "Delete as Spam" buttons; if I can confirm a set of repro steps in the future, I'll let y'all know. Erik Piper -----Original Message----- From: Anthony Baxter [mailto:anthony@interlink.com.au] Sent: Tuesday, November 04, 2003 10:52 AM To: Erik Piper Cc: spambayes@python.org Subject: Re: [Spambayes] Spambayes - a few more steps forward in Czech in plugin 0081 >>> "Erik Piper" wrote > Home computer (W2K CZ, Outlook 2K CZ): Installation fine, startup fine. But: > 1. clicking the Data Folder button in the Advanced tab leads to an error > dialog: > SpamBayes > 'ascii' codec can't encode character u'\xed' in position 52: ordinal no t in > range(128) Can you post the entire traceback here? Anthony -- Anthony Baxter It's never too late to have a happy childhood. From ljholish at speakeasy.net Tue Nov 4 13:00:34 2003 From: ljholish at speakeasy.net (Larry Holish) Date: Tue Nov 4 12:59:59 2003 Subject: [Spambayes] sb_mboxtrain.py mucks with body of email? Message-ID: <20031104180034.GA5037@localhost> Hi, I have a Debian Linux based setup using fetchmail-Exim-procmail-mutt (maildirs), and spambayes-1.0a6. Setup and did initial training of spambayes. Classifying is going well using sb_filter.py and procmail. Trained on some unsures using sb_mboxtrain.py from command line and went back into my mutt inbox to check the headers. X-Spambayes-Trained headers were added, but I also got a line like this: -Spambayes-Trained: ham added as the first line of the email *body*. Also happened with a few mails in my spam directory. I don't think the email body should be touched. Any clues? -- Larry Holish From jabailo at earthlink.net Tue Nov 4 13:34:05 2003 From: jabailo at earthlink.net (John Bailo) Date: Tue Nov 4 13:39:54 2003 Subject: [Spambayes] Is simple better? In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A012C35FA@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2A012C35FA@cliff.bai.org> Message-ID: <03Nov4.104946pst.118249@gateway.manus.com> I spent a few hours this weekend to get SpamBayes to work, with Evolution on RedHat 9. I did not succeed. But then I picked up an idea from a newsgroup about just creating a filter that only allows email with my name in the recipient list through to my inbox. Now I am filtering 99 percent of my spam with a simple inbox filter. less is more. From AndrewC at primeworld.com Tue Nov 4 13:38:40 2003 From: AndrewC at primeworld.com (Andrew J. Coutermarsh) Date: Tue Nov 4 13:42:36 2003 Subject: [Spambayes] Is simple better? Message-ID: <7F976E05D3F5AB4ABFC9334D182F7B511E44C8@mail1.primeline.com> Yes, but if you do that, you're dramatically increasing the chance of getting false positives and false negatives. You gotta be careful about doing something like that. I have an email address that goes into my current mailbox, but it is almost entirely spam now; I would say that less than one tenth of one percent of all the email that comes to it is legit. Why do I still filter it? Because there's still a small chance that I could miss something by either disabling the account. Andrew J. Coutermarsh Prime Resources Corp. I.S. Department Ph: (203) 331-9100 x3236 Fx: (203) 551-3324 andrewc@primeworld.com -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of John Bailo Sent: Tuesday, November 04, 2003 1:34 PM To: spambayes@python.org Subject: [Spambayes] Is simple better? I spent a few hours this weekend to get SpamBayes to work, with Evolution on RedHat 9. I did not succeed. But then I picked up an idea from a newsgroup about just creating a filter that only allows email with my name in the recipient list through to my inbox. Now I am filtering 99 percent of my spam with a simple inbox filter. less is more. _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From AndrewC at primeworld.com Tue Nov 4 13:43:22 2003 From: AndrewC at primeworld.com (Andrew J. Coutermarsh) Date: Tue Nov 4 13:47:27 2003 Subject: [Spambayes] Can't disable filtering on a certain folder Message-ID: <7F976E05D3F5AB4ABFC9334D182F7B511E44C9@mail1.primeline.com> I'm using SB0.81, and I've found an interesting problem: I enabled filtering on another user's inbox. I weed through his email for him (my manager) but he decided that he didn't like the automatic filtering because if I wasn't there he couldn't tell what was getting filtered out (a couple messages were going into Unsure and they were legit but he couldn't read them because they were in my Unsure folder). So I turned off filtering in Outlook on that folder. However, it still seems to be filtering on his inbox, even though I don't want it to. I've tried resetting the filtering settings several times now, and I'm worried that even a reinstall of SB isn't going to fix the problem. Any advice? Andrew J. Coutermarsh Prime Resources Corp. I.S. Department Ph: (203) 331-9100 x3236 Fx: (203) 551-3324 andrewc@primeworld.com From jabailo at earthlink.net Tue Nov 4 13:46:39 2003 From: jabailo at earthlink.net (John Bailo) Date: Tue Nov 4 13:52:34 2003 Subject: [Spambayes] Is simple better? In-Reply-To: <7F976E05D3F5AB4ABFC9334D182F7B511E44C8@mail1.primeline.com> References: <7F976E05D3F5AB4ABFC9334D182F7B511E44C8@mail1.primeline.com> Message-ID: <03Nov4.110222pst.118251@gateway.manus.com> On Tue, 2003-11-04 at 10:38, Andrew J. Coutermarsh wrote: > Yes, but if you do that, you're dramatically increasing the chance of > getting false positives and false negatives. You gotta be careful about can you be more specific? > doing something like that. > > I have an email address that goes into my current mailbox, but it is almost > entirely spam now; I would say that less than one tenth of one percent of > all the email that comes to it is legit. Why do I still filter it? Because > there's still a small chance that I could miss something by either disabling > the account. > > Andrew J. Coutermarsh > Prime Resources Corp. > I.S. Department > Ph: (203) 331-9100 x3236 > Fx: (203) 551-3324 > andrewc@primeworld.com > > > -----Original Message----- > From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On > Behalf Of John Bailo > Sent: Tuesday, November 04, 2003 1:34 PM > To: spambayes@python.org > Subject: [Spambayes] Is simple better? > > > I spent a few hours this weekend to get SpamBayes to work, with > Evolution on RedHat 9. I did not succeed. > > But then I picked up an idea from a newsgroup about just creating > a filter that only allows email with my name in the recipient list > through to my inbox. > > Now I am filtering 99 percent of my spam with a simple inbox filter. > > less is more. > > > > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From bcd at fcohenlaw.com Tue Nov 4 13:55:02 2003 From: bcd at fcohenlaw.com (Blaine C. Dickenson) Date: Tue Nov 4 13:55:02 2003 Subject: [Spambayes] How do I download the plugin for Outlook Express? Message-ID: <000801c3a305$280cb030$220a0a0a@XP101> Blaine C. Dickenson, Esq. Cohen, Norris, Scherer, Weinberger & Wolmer 561-844-3600 bcd@fcohenlaw.com From AndrewC at primeworld.com Tue Nov 4 13:53:17 2003 From: AndrewC at primeworld.com (Andrew J. Coutermarsh) Date: Tue Nov 4 13:57:19 2003 Subject: [Spambayes] Is simple better? Message-ID: <7F976E05D3F5AB4ABFC9334D182F7B511E44CA@mail1.primeline.com> > can you be more specific? Well, what I meant by that is this, for example: If somebody includes you in a CC:, it would get snatched up. Conversely, spammers often put your name in the To: line, thus making it go through to your inbox. So while simple CAN work, it's often better to use it in conjunction with a filter. Andrew J. Coutermarsh Prime Resources Corp. I.S. Department Ph: (203) 331-9100 x3236 Fx: (203) 551-3324 andrewc@primeworld.com -----Original Message----- From: John Bailo [mailto:jabailo@earthlink.net] Sent: Tuesday, November 04, 2003 1:47 PM To: Andrew J. Coutermarsh Cc: spambayes@python.org Subject: RE: [Spambayes] Is simple better? On Tue, 2003-11-04 at 10:38, Andrew J. Coutermarsh wrote: > Yes, but if you do that, you're dramatically increasing the chance of > getting false positives and false negatives. You gotta be careful about can you be more specific? > doing something like that. > > I have an email address that goes into my current mailbox, but it is almost > entirely spam now; I would say that less than one tenth of one percent of > all the email that comes to it is legit. Why do I still filter it? Because > there's still a small chance that I could miss something by either disabling > the account. > > Andrew J. Coutermarsh > Prime Resources Corp. > I.S. Department > Ph: (203) 331-9100 x3236 > Fx: (203) 551-3324 > andrewc@primeworld.com > > > -----Original Message----- > From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On > Behalf Of John Bailo > Sent: Tuesday, November 04, 2003 1:34 PM > To: spambayes@python.org > Subject: [Spambayes] Is simple better? > > > I spent a few hours this weekend to get SpamBayes to work, with > Evolution on RedHat 9. I did not succeed. > > But then I picked up an idea from a newsgroup about just creating > a filter that only allows email with my name in the recipient list > through to my inbox. > > Now I am filtering 99 percent of my spam with a simple inbox filter. > > less is more. > > > > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From jacsib at lutecium.org Tue Nov 4 14:18:26 2003 From: jacsib at lutecium.org (Jacques B. Siboni) Date: Tue Nov 4 14:18:25 2003 Subject: [Spambayes] Site level protection Message-ID: <3FA7FB82.9C1DB417@lutecium.org> Hi all, I am pretty happy since i installed spambayes on our linux machine (RedHat 7.3). I have installed it at a user based level. Now I want to install it at the site/domain level to filter all of the incoming mails whose email addresses are xxxxx@lutecium.org. I suppose this problem has already been solved but I could not find the info in the archives. Could you give me directions to solve this. mailers are sendmail, procmail, smtp Thanks in advance Cheers Jacques -- Dr. Jacques B. Siboni mailto:jacsib@Lutecium.org 8 pass. Charles Albert, F75018 Paris, France Tel. & Fax: 33 (0) 1 42 28 76 78 Home Page: http://www.lutecium.org/jacsib/ From sethg at GoodmanAssociates.com Tue Nov 4 15:10:08 2003 From: sethg at GoodmanAssociates.com (Seth Goodman) Date: Tue Nov 4 15:10:21 2003 Subject: [Spambayes] possible feature request: ham training from Unsure folder Message-ID: The following is my guess at how the Unsure and Spam folders work, and if this is correct, I have a related feature request. As I am new to SpamBayes, I welcome your corrections and explanations. 1) If the message spam score is less than the ham threshold, the message is left in the watched folder. No training is done with the message. If the user then highlights that message and hits the "Delete As Spam" button, the message is moved to the Spam folder and it is trained on as spam. 2) If the message spam score is greater than the spam threshold, the message is moved from the watched folder into the Spam folder. No training is done with the message. If the user then highlights that message and hits "Recover from spam" button, the message is moved back to its' original watched folder (not necessarily the Inbox) and it is trained on as ham. 3) If the message spam score is between the ham and spam thresholds, the message is moved from the watched folder to the Unsure folder. No training is done with the message. If the user then highlights that message and hits "Delete as spam" button, the message is moved to the Spam folder and it is trained on as spam. If the user manually moves the message to a non-Spam folder, no training is done with the message. If this is correct, I think it exposes a minor weakness. This is based on the premise that SpamBayes should only train on messages that the system cannot already classify correctly. This assumes that the reason for not training on all messages is, as other folks have pointed out, that classification accuracy suffers when the training corpus is too large (around 10K messages). If these assumptions are correct, then why not train on *all* messages in the Unsure folder when the user manually classifies them? The resulting feature request would be that when the Unsure folder is selected, SpamBayes should display *two* classification buttons: "Delete As Spam" and "Keep As Good". The "Keep As Good" button would work exactly like the "Recover from spam" button in the Spam folder, that is, move the message back to its' original watched folder and train on the message as ham. This would have two benefits, one definite and the other proposed. The definite benefit is that in the case that the message came from a watched folder other than the Inbox, it could be returned to the proper folder automatically. The proposed second benefit is that since the message was not definitively classified, training on it as ham would improve the future classification accuracy of the system and result in fewer messages in the Unsure folder. Corrections, reactions and education are welcome. Regards, Seth Goodman Goodman Associates, LLC From rmalayter at bai.org Tue Nov 4 15:12:34 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Tue Nov 4 15:12:37 2003 Subject: [Spambayes] Site level protection Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C3657@cliff.bai.org> http://assp.sourceforge.net is an oraganizational-level bayesian filter. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org] On Behalf Of Jacques B. Siboni > Sent: Tuesday, November 04, 2003 1:18 PM > To: spambayes@python.org > Subject: [Spambayes] Site level protection > > Hi all, > > I am pretty happy since i installed spambayes on our linux > machine (RedHat > 7.3). I have installed it at a user based level. > > Now I want to install it at the site/domain level to filter > all of the > incoming mails whose email addresses are xxxxx@lutecium.org. > > I suppose this problem has already been solved but I could > not find the info > in the archives. > Could you give me directions to solve this. > > mailers are sendmail, procmail, smtp > > Thanks in advance > > Cheers > > Jacques From neale at woozle.org Tue Nov 4 15:20:12 2003 From: neale at woozle.org (Neale Pickett) Date: Tue Nov 4 15:20:19 2003 Subject: [Spambayes] Is simple better? In-Reply-To: <7F976E05D3F5AB4ABFC9334D182F7B511E44CA@mail1.primeline.com> (Andrew J. Coutermarsh's message of "Tue, 4 Nov 2003 13:53:17 -0500") References: <7F976E05D3F5AB4ABFC9334D182F7B511E44CA@mail1.primeline.com> Message-ID: "Andrew J. Coutermarsh" writes: >> can you be more specific? > > Well, what I meant by that is this, for example: If somebody includes > you in a CC:, it would get snatched up. I think Andrew meant BCC here. Also, some companies send e-commerce receipts to "undisclosed recipients", so that would go into your spam folder as well. Be sure to poke through your spam folder when you order something on-line. Mailing lists suffer a similar fate, but you probably already sort those into their own folders. I used a "if it's not to me, it's spam" rule in procmail for quite a while. It did work pretty well, but spambayes does a much better job. John, since you mentioned you were using Red Hat, I'm going to guess that your problem had to do with package dependencies while trying to install Python. You may have better luck installing Python from source. Just download it, untar, and run ./configure && make install as root. You should then have a working Python installation and can install SpamBayes just fine. Hope this helps Neale From neale at woozle.org Tue Nov 4 15:22:03 2003 From: neale at woozle.org (Neale Pickett) Date: Tue Nov 4 15:22:08 2003 Subject: [Spambayes] sb_mboxtrain.py mucks with body of email? In-Reply-To: <20031104180034.GA5037@localhost> (Larry Holish's message of "Tue, 4 Nov 2003 12:00:34 -0600") References: <20031104180034.GA5037@localhost> Message-ID: Larry Holish writes: > -Spambayes-Trained: ham > > added as the first line of the email *body*. Larry, could you forward to me (or the list) one of the spam messages that got mangled? This is bizarre, I don't know why it would happen, unless maybe SpamBayes and Mutt disagree about where the message body begins. Neale From AndrewC at primeworld.com Tue Nov 4 15:18:44 2003 From: AndrewC at primeworld.com (Andrew J. Coutermarsh) Date: Tue Nov 4 15:22:39 2003 Subject: [Spambayes] Is simple better? Message-ID: <7F976E05D3F5AB4ABFC9334D182F7B511E44CB@mail1.primeline.com> > I think Andrew meant BCC here. Actually, no, I meant CC. Many spam rules in certain programs will say "If my name is in the To: line" but say nothing about the CC: line, so it WOULD get picked up as well. In addition there is, of course, the BCC: but I thought that one would go without saying. Andrew J. Coutermarsh Prime Resources Corp. I.S. Department Ph: (203) 331-9100 x3236 Fx: (203) 551-3324 andrewc@primeworld.com -----Original Message----- From: Neale Pickett [mailto:neale@woozle.org] Sent: Tuesday, November 04, 2003 3:20 PM To: Andrew J. Coutermarsh Cc: 'jabailo@earthlink.net'; spambayes@python.org Subject: Re: [Spambayes] Is simple better? "Andrew J. Coutermarsh" writes: >> can you be more specific? > > Well, what I meant by that is this, for example: If somebody includes > you in a CC:, it would get snatched up. I think Andrew meant BCC here. Also, some companies send e-commerce receipts to "undisclosed recipients", so that would go into your spam folder as well. Be sure to poke through your spam folder when you order something on-line. Mailing lists suffer a similar fate, but you probably already sort those into their own folders. I used a "if it's not to me, it's spam" rule in procmail for quite a while. It did work pretty well, but spambayes does a much better job. John, since you mentioned you were using Red Hat, I'm going to guess that your problem had to do with package dependencies while trying to install Python. You may have better luck installing Python from source. Just download it, untar, and run ./configure && make install as root. You should then have a working Python installation and can install SpamBayes just fine. Hope this helps Neale From kennypitt at hotmail.com Tue Nov 4 15:22:18 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 4 15:22:52 2003 Subject: [Spambayes] Site level protection In-Reply-To: <3FA7FB82.9C1DB417@lutecium.org> Message-ID: Jacques B. Siboni wrote: > Hi all, > > I am pretty happy since i installed spambayes on our linux machine > (RedHat > 7.3). I have installed it at a user based level. > > Now I want to install it at the site/domain level to filter all of > the incoming mails whose email addresses are xxxxx@lutecium.org. > > I suppose this problem has already been solved but I could not find > the info in the archives. > Could you give me directions to solve this. > > mailers are sendmail, procmail, smtp SpamBayes does have a filter program that you could probably get set up in this environment, but the results might not be exactly what you hoped for since SpamBayes uses a very "personalized" approach to filtering. This is covered in "6.2 Are there plans to develop a server-side SpamBayes solution?" of the FAQ at http://spambayes.sourceforge.net/faq.html. If you decide to go ahead and set this up, you can find some info on using SpamBayes with Procmail at http://entrian.com/sbwiki/SbFilter. I'm not a Unix user, so I'm not sure where to point you for more help with sendmail or smtp setup. -- Kenny Pitt From neale at woozle.org Tue Nov 4 15:29:53 2003 From: neale at woozle.org (Neale Pickett) Date: Tue Nov 4 15:29:56 2003 Subject: [Spambayes] Site level protection In-Reply-To: <3FA7FB82.9C1DB417@lutecium.org> (Jacques B. Siboni's message of "Tue, 04 Nov 2003 19:18:26 +0000") References: <3FA7FB82.9C1DB417@lutecium.org> Message-ID: "Jacques B. Siboni" writes: > Now I want to install it at the site/domain level to filter all of the > incoming mails whose email addresses are xxxxx@lutecium.org. > > mailers are sendmail, procmail, smtp This has been done, but I don't seem to find it documented anywhere offhand. Could whomever has done this before please write it up on the wiki? http://www.entrian.com/sbwiki You can probably do this as a milter, Jacques. I would suggest running sb_xmlrpcserver.py as a daemon, then piping everything through sb_client.py, but that's probably just because I wrote those programs and I haven't heard of them being used much yet :). You could also use sb_smtpproxy.py (in the spambayes directory) running on port 25 of your public IP address. You can have it proxy to port 25 of 127.0.0.1, where you tell sendmail to bind. The real challenge here is finding a way to train your database. But, as I say, I think someone else has already solved this. (I only pay attention 10% of the time around here.) Neale From kennypitt at hotmail.com Tue Nov 4 15:32:51 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 4 15:33:21 2003 Subject: [Spambayes] possible feature request: ham training from Unsurefolder In-Reply-To: Message-ID: Seth Goodman wrote: > The following is my guess at how the Unsure and Spam folders work, > and if this is correct, I have a related feature request. As I am > new to SpamBayes, I welcome your corrections and explanations. > > 1) If the message spam score is less than the ham threshold, the > message is left in the watched folder. No training is done with the > message. If the user then highlights that message and hits the > "Delete As Spam" button, the message is moved to the Spam folder and > it is trained on as spam. Yes. > 2) If the message spam score is greater than the spam threshold, the > message is moved from the watched folder into the Spam folder. No > training is done with the message. If the user then highlights that > message and hits "Recover from spam" button, the message is moved > back to its' original watched folder (not necessarily the Inbox) and > it is trained on as ham. Still good so far, although dragging the message back to a watched non-spam folder also has the same effect. > 3) If the message spam score is between the ham and spam thresholds, > the message is moved from the watched folder to the Unsure folder. > No training is done with the message. If the user then highlights > that message and hits "Delete as spam" button, the message is moved > to the Spam folder and it is trained on as spam. If the user > manually moves the message to a non-Spam folder, no training is done > with the message. Almost. If the user clicks "Delete As Spam" or drags to the spam folder then the message is trained as spam. If the user clicks "Recover From Spam" or drags to a watched non-spam folder then the message is trained as good. > If this is correct, I think it exposes a minor weakness. This is > based on the premise that SpamBayes should only train on messages > that the system cannot already classify correctly. This assumes that > the reason for not training on all messages is, as other folks have > pointed out, that classification accuracy suffers when the training > corpus is too large (around 10K messages). If these assumptions are > correct, then why not train on *all* messages in the Unsure folder > when the user manually classifies them? The resulting feature > request would be that when the Unsure folder is selected, SpamBayes > should display *two* classification buttons: "Delete As Spam" and > "Keep As Good". The "Keep As Good" button would work exactly like > the "Recover from spam" button in the Spam folder, that is, move the > message back to its' original watched folder and train on the message > as ham. This is exactly what *should* be happening when you select your Unsure folder. You should see both the "Delete As Spam" and "Recover From Spam" buttons. If this isn't what you see, could you possibly send a copy of your spambayes1.log file and maybe some screen caps of your toolbar and your filtering configuration in SpamBayes Manager to the list? -- Kenny Pitt From neale at woozle.org Tue Nov 4 15:34:32 2003 From: neale at woozle.org (Neale Pickett) Date: Tue Nov 4 15:34:36 2003 Subject: [Spambayes] Is simple better? In-Reply-To: <7F976E05D3F5AB4ABFC9334D182F7B511E44CB@mail1.primeline.com> (Andrew J. Coutermarsh's message of "Tue, 4 Nov 2003 15:18:44 -0500") References: <7F976E05D3F5AB4ABFC9334D182F7B511E44CB@mail1.primeline.com> Message-ID: "Andrew J. Coutermarsh" writes: >> I think Andrew meant BCC here. > > Actually, no, I meant CC. Many spam rules in certain programs will > say "If my name is in the To: line" but say nothing about the CC: > line, so it WOULD get picked up as well. Interesting. For the record, if you're going to do this sort of thing, you need to check the following headers for your email address: To Cc Bcc Original-To Original-Cc Original-Bcc Resent-To Resent-Cc Resent-Bcc Original-Resent-To Original-Resent-Cc Original-Resent-Bcc X-Envelope-To Apparently-To Apparently-Resent-To I guess some MTAs add a Bcc line, which sort of misses the point of a BCC. I seem to recall that Exchange does this. Good luck, Neale From sethg at GoodmanAssociates.com Tue Nov 4 16:12:13 2003 From: sethg at GoodmanAssociates.com (Seth Goodman) Date: Tue Nov 4 16:12:24 2003 Subject: [Spambayes] possible feature request: ham training from Unsurefolder In-Reply-To: <200311042133.hA4LXOuu007935@inbound-mx1.atl.registeredsite.com> Message-ID: > > If this is correct, I think it exposes a minor weakness. This is > > based on the premise that SpamBayes should only train on messages > > that the system cannot already classify correctly. This assumes that > > the reason for not training on all messages is, as other folks have > > pointed out, that classification accuracy suffers when the training > > corpus is too large (around 10K messages). If these assumptions are > > correct, then why not train on *all* messages in the Unsure folder > > when the user manually classifies them? The resulting feature > > request would be that when the Unsure folder is selected, SpamBayes > > should display *two* classification buttons: "Delete As Spam" and > > "Keep As Good". The "Keep As Good" button would work exactly like > > the "Recover from spam" button in the Spam folder, that is, move the > > message back to its' original watched folder and train on the message > > as ham. > > This is exactly what *should* be happening when you select your Unsure > folder. You should see both the "Delete As Spam" and "Recover From > Spam" buttons. If this isn't what you see, could you possibly send a > copy of your spambayes1.log file and maybe some screen caps of your > toolbar and your filtering configuration in SpamBayes Manager to the > list? > > -- > Kenny Pitt > > I reset the configuration, retrained and verified all the folder selections. I didn't make any changes, except to add the SpamBayes Spam folder to the spam training list, though it didn't contain any messages at the time. When in the Unsure folder, I now see that the SpamBayes toolbar does not fit where I placed it and there is a continuation symbol, which I probably missed before, on the right edge. Moving things around so all the toolbars fit, I now see that both buttons display when in the Unsure folder. Looks like this is a clear case of "pilot error". Thanks for your patience. Seth Goodman From rdfield at voicenet.com Tue Nov 4 17:28:38 2003 From: rdfield at voicenet.com (R. D. Field) Date: Tue Nov 4 17:29:21 2003 Subject: [Spambayes] Unable to install Message-ID: <000001c3a323$04fce010$321447d1@David> During the install process I get the following error message: Unable to register the DLL/OCX Dll register werver failed code0x00000000 David Field www.rdfield.com -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 5088 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031104/380f28ca/attachment-0002.jpe -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 5675 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031104/380f28ca/attachment-0003.jpe From doctuck at fix.net Tue Nov 4 19:10:53 2003 From: doctuck at fix.net (Greg Tucker) Date: Tue Nov 4 19:11:10 2003 Subject: [Spambayes] (no subject) Message-ID: <000001c3a331$4a75b960$6901a8c0@gregtucker> I love spam bayes, but alas, after working 99.9 % of the time, it just stopped working, for no apparent reason. I have uninstalled and reinstalled several times, all to no avail. I am not knowledgeable enough to file a bug report, so I guess it is adios for now. Damn! I should probably clean out the registry and try again in a few weeks. This is one heck of a product. I had so little spam that I almost felt unwanted, but not quite. My best to you. If and when a new version is bug free, you can count on my donation. Thanks, Greg From JSumpton at EQUAT.com Tue Nov 4 20:47:15 2003 From: JSumpton at EQUAT.com (Jerry Sumpton) Date: Tue Nov 4 20:49:00 2003 Subject: [Spambayes] spambayes modified Message-ID: Hi, I would like to talk to someone about a modified version of SpamBayes that would interrogate a server with a bit of information and then deliver sorted messages based on the response. We are confident that we would make this software available and we would facilitate and participate in ongoing donations based on the use. It looks like you have the majority of the code written already. Thanks, Jerry. From kennypitt at hotmail.com Tue Nov 4 15:11:00 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 4 21:54:38 2003 Subject: [Spambayes] Is simple better? In-Reply-To: <7F976E05D3F5AB4ABFC9334D182F7B511E44CA@mail1.primeline.com> Message-ID: Andrew J. Coutermarsh wrote: >> can you be more specific? > > Well, what I meant by that is this, for example: If somebody includes > you in a CC:, it would get snatched up. Conversely, spammers often > put your name in the To: line, thus making it go through to your > inbox. So while simple CAN work, it's often better to use it in > conjunction with a filter. You may also miss messages sent to mailing lists that you subscribe to, because mailing lists often only include the list address in the To field. Notice that the original message has simply "To: spambayes@python.org". If I were using a filter that prevented any message that did not have my e-mail address in the To field then I would never have seen it. > > -----Original Message----- > From: John Bailo [mailto:jabailo@earthlink.net] > Sent: Tuesday, November 04, 2003 1:47 PM > To: Andrew J. Coutermarsh > Cc: spambayes@python.org > Subject: RE: [Spambayes] Is simple better? > > On Tue, 2003-11-04 at 10:38, Andrew J. Coutermarsh wrote: >> Yes, but if you do that, you're dramatically increasing the chance of >> getting false positives and false negatives. You gotta be careful >> about > > can you be more specific? > >> -----Original Message----- >> From: spambayes-bounces@python.org >> [mailto:spambayes-bounces@python.org] On Behalf Of John Bailo Sent: >> Tuesday, November 04, 2003 1:34 PM >> To: spambayes@python.org >> Subject: [Spambayes] Is simple better? >> >> >> I spent a few hours this weekend to get SpamBayes to work, with >> Evolution on RedHat 9. I did not succeed. >> >> But then I picked up an idea from a newsgroup about just creating >> a filter that only allows email with my name in the recipient list >> through to my inbox. >> >> Now I am filtering 99 percent of my spam with a simple inbox filter. >> >> less is more. -- Kenny Pitt From kennypitt at hotmail.com Tue Nov 4 15:53:06 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 4 22:13:42 2003 Subject: [Spambayes] How do I download the plugin for Outlook Express? In-Reply-To: <000801c3a305$280cb030$220a0a0a@XP101> Message-ID: There isn't a binary installer that works with Outlook Express yet. You would have to set it up from source code, which is a rather involved process if you are an end user. We are currently testing an installer that will hopefully be available in the next version of SpamBayes. Also, please be aware that SpamBayes does not integrate directly with Outlook Express in the way that the Outlook plugin does with the full version of Outlook (new toolbar buttons, configuration dialogs, etc). -- Kenny Pitt From hari.nair at alopa.com Tue Nov 4 22:51:26 2003 From: hari.nair at alopa.com (Harindranath P R) Date: Tue Nov 4 22:53:03 2003 Subject: [Spambayes] Accidental 'Delete as spam' Message-ID: Hi, I have been using the outlook plug-in for some time now. Today I have accidentally used the 'Delete as spam' on a valuable mail. So the questions I have are: 1, Is there any way to recover the mail. 2, Will this cause problems to mails in future on the same subject or same sender. What can I do to fix it. I have been mostly using 'delete as spam' on mails as a training and don't have a big collection of mails to train on. So I'd rather not have to training from scratch. Regards, - Hari. -- +-------------------------------------------------+ | Harindranath P. R. | +---------------------+---------------------------+ |Alopa Networks Inc. |phone: +91-80-6618004 (O) | |Sunnyvale, USA, | +91-80-6768714 (R) | |Bangalore, India |fax: +91-80-6618057 | |http://www.alopa.com |email: hari.nair@alopa.com | +---------------------+---------------------------+ From tim.one at comcast.net Tue Nov 4 23:14:33 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Nov 4 23:14:39 2003 Subject: [Spambayes] Accidental 'Delete as spam' In-Reply-To: Message-ID: [Harindranath P R] > I have been using the outlook plug-in for some time now. > Today I have accidentally used the 'Delete as spam' on a valuable > mail. So the questions I have are: > 1, Is there any way to recover the mail. 'Delete as spam' doesn't actually delete anything -- it moves the email to the Spam folder, and trains the email as spam. I can't tell you where your Spam folder is, because *you* chose which folder to use to hold Spam as part of setting up SpamBayes. So go to your Spam folder, select each email in turn that you called spam by mistake, and click on the 'Recover from Spam' button for each. > 2, Will this cause problems to mails in future on the same > subject or same sender. Certainly -- you told the system they were spam. > What can I do to fix it. Nothing beyond the above: 'Recover from Spam' untrains the email as spam, and moves it back to the folder it originally came from. From foof at synthcode.com Wed Nov 5 00:25:45 2003 From: foof at synthcode.com (Alex Shinn) Date: Wed Nov 5 00:32:52 2003 Subject: [Spambayes] spambayes quit on me Message-ID: <87znfbl73a.wl@strelka.synthcode.com> I had been using spambayes 1.0a3 but it recently started failing with a "non-spam count would go negative" error leaving my inbox full of hundreds of spam per day. I tried installing 1.0a6 and resetting my .hammiedb, but it still fails with the same error: lain:foof$ sb_mboxtrain.py -d ~/.hammiedb -g /var/mail/foof -s ~/Mail/spam Training ham (/var/mail/foof): Reading as Unix mbox Trained 0 out of 0 messages Training spam (/home/foof/Mail/spam): Reading as MH mailbox 2580Traceback (most recent call last): File "/usr/bin/sb_mboxtrain.py", line 315, in ? main() File "/usr/bin/sb_mboxtrain.py", line 306, in main train(h, s, True, force, trainnew, removetrained) File "/usr/bin/sb_mboxtrain.py", line 236, in train mhdir_train(h, path, is_spam, force) File "/usr/bin/sb_mboxtrain.py", line 209, in mhdir_train msg_train(h, msg, is_spam, force) File "/usr/bin/sb_mboxtrain.py", line 82, in msg_train h.untrain(msg, not is_spam) File "/usr/lib/python2.3/site-packages/spambayes/hammie.py", line 169, in untrain self.bayes.unlearn(tokenize(msg), is_spam) File "/usr/lib/python2.3/site-packages/spambayes/classifier.py", line 218, in unlearn self._remove_msg(wordstream, is_spam) File "/usr/lib/python2.3/site-packages/spambayes/classifier.py", line 358, in _remove_msg raise ValueError("non-spam count would go negative!") ValueError: non-spam count would go negative! lain:foof$ uname -a Linux lain 2.4.19 #1 SMP Mon Sep 9 12:31:27 CDT 2002 i586 unknown unknown GNU/Linux lain:foof$ python -V Python 2.3+ Any ideas? -- Alex From alan at vaneyk.com Wed Nov 5 02:09:24 2003 From: alan at vaneyk.com (Alan Wong) Date: Wed Nov 5 02:20:27 2003 Subject: [Spambayes] Serverside version? Message-ID: The Suggested server side solutions are not for windows platforms. Does anyone have a Spambayes server side solution for MS Exchange on a windows server platform? Thanks and Regards Alan -----Original Message----- From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] Sent: Monday, September 29, 2003 9:19 AM To: Knut Olav Klo; spambayes@python.org Subject: RE: [Spambayes] Serverside version? > What I wonder is if you have server side > version of this wonderful tool or if I can configure my > outlook with... lets say 5 mailboxes and have the spambayes > remove spam from all accounts in one outlook? Do you mean one Outlook profile and 5 accounts that that profile checks? This works fine. If you want to use spambayes with different profiles, I believe that also works fine, although you have to configure each one separately (since the folders will be different). > If this works > then I can have one outlook up an running with all accounts > on my server removeing spam Check out , which has notes from people who have successfully setup spambayes server-side. There's also a FAQ about it (link at the end of all list mail). =Tony Meyer _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html CAUTION - This message may contain privileged and confidential information intended only for the use of the addressee named above. If you are not the intended recipient of this message you are hereby notified that any use, disclosure, dissemination, copying, distribution or any action taken or omitted to be taken in reliance on it is prohibited and may be unlawful. If you have received this message in error please notify van Eyk Research immediately via email at mail@vaneyk.com.au or by phone on 02 9247 6000. From anthony at interlink.com.au Wed Nov 5 07:17:35 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Wed Nov 5 07:20:37 2003 Subject: [Spambayes] Spambayes version 1.0a7 is released Message-ID: <200311051217.hA5CHZ2M016109@localhost.localdomain> The spambayes team is pleased to announce the seventh release of the source distribution of spambayes. Get it from the 'Download' page at http://www.spambayes.org/download.html The SpamBayes project is working on developing a Bayesian anti-spam filter, initially based on the work of Paul Graham. The major difference between this and other, similar projects is the emphasis on testing newer approaches to scoring messages. SpamBayes is not a single application. The core code is a message classifier, however there are several applications available as part of the SpamBayes project which use the classifier in specific contexts. More information is available at the Spambayes website at http://www.spambayes.org/ Detailed information on the changes in this release are at the end of this message. Enjoy the new release and your spam-free mailbox :-) Anthony (on behalf of the spambayes team) New in this release (from the file WHAT_IS_NEW.txt in the distribution): This file covers the major changes between each release. For more details, the reader is referred to the changelog (changelog.txt in the main directory of the archive), or for extreme details, to the check-ins archive (please see ) Changes are broken into sections for each application, plus one that will probably only interest developers, and one for everything else. Any actions necessary to move to this release from the previous release are noted in the "Transition" section. New in Alpha Release 7 ====================== -------------------------- ** Incompatible changes ** -------------------------- o If you are using a pickle for storage, your 'message info' database would previously still have been a dbm (where available). This is no longer the case - if you are using a pickle for the statistics database, you have a pickle for everything. Your old 'message info' database is not converted (and there is no utility provided to do so), but you should not suffer any ill effects from this, *unless* you are using sb_imapfilter.py. In that case, you will find that the filter trains and classifies all messages in the folders it examines, even if it has seen them before - this will only occur once, however. There should be no other incompatible changes (from 1.0a6) in this release. POP3 Proxy / SMTP Proxy ----------------------- o An error where a failure message would be printed by the SMTP proxy, even on success, was fixed. Web Interface ------------- o The bug which caused the "TypeError" when trying to access the database after setting a configuration option via the interface has been fixed. POP3 Proxy Service / POP3 Proxy Tray Application ------------------------------------------------ o Both the pop3proxy_service.py and pop3proxy_tray.py scripts are now installed (with "setup.py install") if the user is using Windows. IMAP Filter ----------- o Better handle displaying the available folders when server and login information is not available. o Better handle problems parsing the date. o If IMAP over SSL is available, offer it via the web interface. General ------- o Various improvements have been made to the management of the 'message info' database. As outlined above, it will now be stored as a pickle, if your statistics database uses a pickle. In addition, we attempt to close the database when we should, and make sure that we explicitly update it. This should hopefully go some way to solving the "DB_RUN_RECOVERY" errors that have been regularly reported - we would be interested to hear from you if upgrading to 1.0a7 does appear to solve this problem for you (email spambayes@python.org). o We now try to determine the type of dbm storage used from the file, if one already exists. This should make the transition between formats a little easier. o Fix sb_xmlrpcserver to work with the renamed (since 1.0a5) scripts. o Fix the sense of include_trained in sb_mboxtrain. Transition ========== If you are transitioning from a version older than 1.0a6, please also read the notes in the previous release notes (accessible from ). Other than the changes listed in "Incompatible changes" at the top of this document, there are no known transition issues. Reported Bugs Fixed =================== The following bugs tracked via the Sourceforge system were fixed: 809769, 814322, 816400, 810342, 818552 A URL containing the details of these bugs can be made by appending the bug number to this URL: http://sourceforge.net/tracker/index.php?func=detail&group_id=61702&atid=498103&aid= Feature Requests Added ====================== No feature requests tracked via the Sourceforge system were added for this release. Patches integrated =================== No patches tracked via the Sourceforge system were integrated for this release. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 5402 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031105/a1f2dec0/attachment-0001.bin From jbennett at kaac.com Wed Nov 5 08:02:12 2003 From: jbennett at kaac.com (John C. Bennett--KAAC) Date: Wed Nov 5 08:02:18 2003 Subject: [Spambayes] bug? Message-ID: When selecting multiple items in the "Junk Suspects" folder and clicking "delete as spam" or "recover from spam", Outlook 2000 often hangs and I get the "This program has experienced a problem and has to close" dialog box. OS Name Microsoft Windows XP Professional Version 5.1.2600 Service Pack 1 Build 2600 OS Manufacturer Microsoft Corporation System Manufacturer Dell Computer Corporation System Model OptiPlex GX270 System Type X86-based PC Processor x86 Family 15 Model 2 Stepping 9 GenuineIntel ~2992 Mhz BIOS Version/Date Dell Computer Corporation A02, 7/16/2003 SMBIOS Version 2.3 Windows Directory C:\WINDOWS System Directory C:\WINDOWS\System32 Boot Device \Device\HarddiskVolume2 Locale United States Hardware Abstraction Layer Version = "5.1.2600.1106 (xpsp1.020828-1920)" Total Physical Memory 512.00 MB Available Physical Memory 231.81 MB Total Virtual Memory 1.72 GB Available Virtual Memory 1.27 GB Page File Space 1.22 GB Page File C:\pagefile.sys From jeff.warren at invensys.com Wed Nov 5 06:11:47 2003 From: jeff.warren at invensys.com (jeff.warren@invensys.com) Date: Wed Nov 5 08:29:50 2003 Subject: [Spambayes] No Filterable Mail Items Selected Message-ID: Hi, I wasn't sure if this was a bug or just operator error; normally spam emails are detected and filtered correctly but I have received numerous spam emails that SpamBayes does not seem to be able to filter. The emails appear to have cells formatted and the text inside them. If one of these emails is forwarded it is recognised and filtered correctly, the cells appear to be stripped out at the receiving end. I converted the email to plain text, but this gave the same result as the HTML version. I have attached all the log files there were, plus two samples of emails that were not detected as spam. I am using XP SP1, Outlook 2002 SP2 and the SpamBayes installation file used was SpamBayes-Outlook-Setup-0081.exe. Best regards, Jeff Warren Email: jeff.warren@invensys.com -------------- next part -------------- A non-text attachment was scrubbed... Name: No Filterable Mail Items Selected.zip Type: application/x-zip-compressed Size: 37791 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031105/1856132f/NoFilterableMailItemsSelected-0001.bin From papaDoc at videotron.ca Wed Nov 5 08:58:30 2003 From: papaDoc at videotron.ca (papaDoc) Date: Wed Nov 5 08:58:33 2003 Subject: [Spambayes] spambayes quit on me In-Reply-To: <87znfbl73a.wl@strelka.synthcode.com> References: <87znfbl73a.wl@strelka.synthcode.com> Message-ID: <3FA90206.9020505@videotron.ca> Hi Alex, This can happen if you deleted an old database. The email used to train the old database contain X-Spambayes-trained: ham or spam (or an header similar to this one) When you work with a new database and change the status of an email (used to train the old database) the number of spam or ham is decreased and this can go negative if you use many email used to trained the old database. What I will do is delete our database (if you still have a bunch of spam and ham) then try the following command line (with the -f) With the -f the status of the old mail is not taken into account. it train on all the sb_mboxtrain.py -f -d ~/.hammiedb -g /var/mail/foof -s ~/Mail/spam Remi >I had been using spambayes 1.0a3 but it recently started failing with a >"non-spam count would go negative" error leaving my inbox full of >hundreds of spam per day. I tried installing 1.0a6 and resetting my >.hammiedb, but it still fails with the same error: > File "/usr/lib/python2.3/site-packages/spambayes/classifier.py", line 358, in _remove_msg > raise ValueError("non-spam count would go negative!") >ValueError: non-spam count would go negative! > >lain:foof$ uname -a >Linux lain 2.4.19 #1 SMP Mon Sep 9 12:31:27 CDT 2002 i586 unknown unknown GNU/Linux > >lain:foof$ python -V >Python 2.3+ > >Any ideas? > > > From papaDoc at videotron.ca Wed Nov 5 09:52:27 2003 From: papaDoc at videotron.ca (papaDoc) Date: Wed Nov 5 09:52:30 2003 Subject: [Spambayes] Re: [spambayes-dev] More CVS branch/tags questions Message-ID: <3FA90EAB.1040001@videotron.ca> Hi, > > Richie Hindle wrote: > > No, definitely not! Fixes made on the bugfix branch should be > > batch-merged onto the head once in a while, using "cvs up -j". > That's cool, I didn't know CVS could do that. I was simply going off > the previous description of bug fixing that you referenced in your > previous message. > You never know what you will learn from and OpenSource project....... I printed the explanation so a big thank to "CVS MASTER" Hindle.......... I won't be afraid to branch anymore..... Remi From mike at MikeAndKellyCrowe.com Wed Nov 5 10:47:12 2003 From: mike at MikeAndKellyCrowe.com (Mike Crowe) Date: Wed Nov 5 11:02:46 2003 Subject: [Spambayes] Whole domain falsly triggering HAM Message-ID: <3FA91B80.9080007@MikeAndKellyCrowe.com> Hi folks, I have a problem where lots of emails from my company are getting into possible spam. Here are the offending lines from the spam clues: Spam Score: 32% (0.316305) word spamprob #ham #spam 'elcom,' 0.397725 2771 221 'email addr:elcomrep.com' 0.398081 3668 293 My company is ELCOM, and our domain is elcomrep.com. I've probably mistrained, but need to know how to fix it. Using Outlook Plugin version 0.81 (I think, I installed recently but can't see how to pull up a version number). -- Cheers! Mike From natsysco at cox.net Wed Nov 5 11:27:34 2003 From: natsysco at cox.net (natsysco@cox.net) Date: Wed Nov 5 11:27:43 2003 Subject: [Spambayes] STOP MY SUBSCRIPTION Message-ID: <20031105162732.MESM28419.fed1mtao03.cox.net@smtp.west.cox.net> STOP MY SUBSCRIPTION, YOU ARE ADDING SPAM TO MY EMAIL...L > > From: spambayes-request@python.org > Date: 2003/10/31 Fri PM 09:28:44 EST > To: natsysco@cox.net > Subject: confirm a20473f683f3015a12504428b98d81bac3a0340b > > Mailing list subscription confirmation notice for mailing list > Spambayes > > We have received a request from 68.104.61.72 for subscription of your > email address, "natsysco@cox.net", to the spambayes@python.org mailing > list. To confirm that you want to be added to this mailing list, > simply reply to this message, keeping the Subject: header intact. Or > visit this web page: > > http://mail.python.org/mailman/confirm/spambayes/a20473f683f3015a12504428b98d81bac3a0340b > > > Or include the following line -- and only the following line -- in a > message to spambayes-request@python.org: > > confirm a20473f683f3015a12504428b98d81bac3a0340b > > Note that simply sending a `reply' to this message should work from > most mail readers, since that usually leaves the Subject: line in the > right form (additional "Re:" text in the Subject: is okay). > > If you do not wish to be subscribed from this list, please simply > disregard this message. If you think you are being maliciously > subscribed to the list, or have any other questions, send them to > spambayes-owner@python.org. > From AndrewC at primeworld.com Wed Nov 5 11:26:35 2003 From: AndrewC at primeworld.com (Andrew J. Coutermarsh) Date: Wed Nov 5 11:30:34 2003 Subject: [Spambayes] STOP MY SUBSCRIPTION Message-ID: <7F976E05D3F5AB4ABFC9334D182F7B511E44D0@mail1.primeline.com> It never ceases to amaze me that some people just can't seem to follow directions. Andrew J. Coutermarsh Prime Resources Corp. I.S. Department Ph: (203) 331-9100 x3236 Fx: (203) 551-3324 andrewc@primeworld.com -----Original Message----- From: spambayes-bounces+andrewc=primeworld.com@python.org [mailto:spambayes-bounces+andrewc=primeworld.com@python.org] On Behalf Of natsysco@cox.net Sent: Wednesday, November 05, 2003 11:28 AM To: spambayes@python.org Subject: [Spambayes] STOP MY SUBSCRIPTION STOP MY SUBSCRIPTION, YOU ARE ADDING SPAM TO MY EMAIL...L > > From: spambayes-request@python.org > Date: 2003/10/31 Fri PM 09:28:44 EST > To: natsysco@cox.net > Subject: confirm a20473f683f3015a12504428b98d81bac3a0340b > > Mailing list subscription confirmation notice for mailing list > Spambayes > > We have received a request from 68.104.61.72 for subscription of your > email address, "natsysco@cox.net", to the spambayes@python.org mailing > list. To confirm that you want to be added to this mailing list, > simply reply to this message, keeping the Subject: header intact. Or > visit this web page: > > http://mail.python.org/mailman/confirm/spambayes/a20473f683f3015a12504428b98 d81bac3a0340b > > > Or include the following line -- and only the following line -- in a > message to spambayes-request@python.org: > > confirm a20473f683f3015a12504428b98d81bac3a0340b > > Note that simply sending a `reply' to this message should work from > most mail readers, since that usually leaves the Subject: line in the > right form (additional "Re:" text in the Subject: is okay). > > If you do not wish to be subscribed from this list, please simply > disregard this message. If you think you are being maliciously > subscribed to the list, or have any other questions, send them to > spambayes-owner@python.org. > _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From dreed at srdcorp.com Wed Nov 5 11:31:35 2003 From: dreed at srdcorp.com (Reed, David) Date: Wed Nov 5 11:34:28 2003 Subject: [Spambayes] STOP MY SUBSCRIPTION Message-ID: <4C542552EF434E43B2F2572854A73CBD6F6FFA@corp-mail1.srdcorp.com> No joke... :) -----Original Message----- From: Andrew J. Coutermarsh [mailto:AndrewC@primeworld.com] Sent: Wednesday, November 05, 2003 11:27 AM To: 'natsysco@cox.net'; spambayes@python.org Subject: RE: [Spambayes] STOP MY SUBSCRIPTION It never ceases to amaze me that some people just can't seem to follow directions. Andrew J. Coutermarsh Prime Resources Corp. I.S. Department Ph: (203) 331-9100 x3236 Fx: (203) 551-3324 andrewc@primeworld.com -----Original Message----- From: spambayes-bounces+andrewc=primeworld.com@python.org [mailto:spambayes-bounces+andrewc=primeworld.com@python.org] On Behalf Of natsysco@cox.net Sent: Wednesday, November 05, 2003 11:28 AM To: spambayes@python.org Subject: [Spambayes] STOP MY SUBSCRIPTION STOP MY SUBSCRIPTION, YOU ARE ADDING SPAM TO MY EMAIL...L > > From: spambayes-request@python.org > Date: 2003/10/31 Fri PM 09:28:44 EST > To: natsysco@cox.net > Subject: confirm a20473f683f3015a12504428b98d81bac3a0340b > > Mailing list subscription confirmation notice for mailing list > Spambayes > > We have received a request from 68.104.61.72 for subscription of your > email address, "natsysco@cox.net", to the spambayes@python.org mailing > list. To confirm that you want to be added to this mailing list, > simply reply to this message, keeping the Subject: header intact. Or > visit this web page: > > http://mail.python.org/mailman/confirm/spambayes/a20473f683f3015a12504428b98 d81bac3a0340b > > > Or include the following line -- and only the following line -- in a > message to spambayes-request@python.org: > > confirm a20473f683f3015a12504428b98d81bac3a0340b > > Note that simply sending a `reply' to this message should work from > most mail readers, since that usually leaves the Subject: line in the > right form (additional "Re:" text in the Subject: is okay). > > If you do not wish to be subscribed from this list, please simply > disregard this message. If you think you are being maliciously > subscribed to the list, or have any other questions, send them to > spambayes-owner@python.org. > _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From rmalayter at bai.org Wed Nov 5 11:55:03 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Nov 5 11:55:47 2003 Subject: [Spambayes] Serverside version? Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C368F@cliff.bai.org> > From: Alan Wong > Subject: RE: [Spambayes] Serverside version? > > The Suggested server side solutions are not for windows platforms. > Does anyone have a Spambayes server side solution for MS Exchange on a > windows server platform? The open-source Anti-Spam Server Proxy (ASSP) I mentioned does work on windows servers. You simply need to install Perl, which is free. It also works with excahnge server, since it works at the SMTP level. If you want to run them on the same box, you need to change the port that the windows/exchange SMTP service uses, and run the ASSP on port 25. See http://assp.sourceforge.net/fom/cache/86.html for info on using it with Exchange. From rmalayter at bai.org Wed Nov 5 13:24:12 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Nov 5 13:24:15 2003 Subject: [Spambayes] Whole domain falsly triggering HAM Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C3692@cliff.bai.org> > From: Mike Crowe > Subject: [Spambayes] Whole domain falsly triggering HAM > > My company is ELCOM, and our domain is elcomrep.com. I've > probably mistrained, but need to know how to fix it. If you've badly mistrained spambayes, (which it looks like, since you have a lot of messages from your own domain trained as spam) you probably need to rebuild your whole training database. Hand-sort a bunch (hundreds at least) of recent emails into spam and non-spam folders. You must be sure they are all categorized correctly. I like to do this with copies of messages. Then go to the Spambayes Manager, click on the training tab. Choose your hand-sorted ham and spam folders as the training source, and make sure that "rebuild database" is checked. Then choose "start training". I would then turn off the options whereby spambayes trains when you move messages between folders. This is confusing for some people, and can be a source of mistraining. Regards, -Ryan- From mikesmithjunkmail at earthlink.net Wed Nov 5 14:11:44 2003 From: mikesmithjunkmail at earthlink.net (Mike Smith) Date: Wed Nov 5 14:11:54 2003 Subject: [Spambayes] Lotus Notes filter error KeyError: ('Hammie', 'header_spam_string') Message-ID: <2384386.1068059505079.JavaMail.root@gonzo.psp.pas.earthlink.net> Help! My company has just switched from Outlook to Lotus Notes. I receive > 300 spam daily, which was managed quite well by the Outlook plug-in, but now it's out of control until I get the notes filter working. I am runnnig Lotus Notes 5.0.11 on Windows 2000. Here is what I have done: Downloaded and installed Python version 2.3.2 Downloaded spambayes-1.0a7.zip, unzipped and ran setup.py build and setup.py install In notes, created folder spam, with subfolders Ham, Spam, Train as Ham, Train as Spam Replicated notes to local database msmith1.nsf Now, if I run sb_notesfilter.py to classify only (no training has been done yet), I get the following: 0 documents processed 0 classified as spam 0 classified as ham 0 classified as unsure The Spambayes database currently has 0 Spam and 0 Ham When I try to train, it dies. Here is the command I'm entering: C:\Program Files\Python23\Scripts>sb_notesfilter.py -t -d spambayes -l c:\apps\N otes\Data\msmith1.nsf -f spam Here is the Traceback: Traceback (most recent call last): File "C:\Program Files\Python23\Scripts\sb_notesfilter.py", line 384, in ? sbfname, doTrain, doClassify) File "C:\Program Files\Python23\Scripts\sb_notesfilter.py", line 321, in run processAndTrain(vtrainspam, vspam, bayes, True, notesindex) File "C:\Program Files\Python23\Scripts\sb_notesfilter.py", line 237, in proce ssAndTrain str = options["Hammie", "header_spam_string"] File "C:\PROGRA~1\Python23\lib\site-packages\spambayes\OptionsClass.py", line 562, in __getitem__ return self.get(key[0], key[1]) File "C:\PROGRA~1\Python23\lib\site-packages\spambayes\OptionsClass.py", line 559, in get return self.get_option(sect, opt).get() File "C:\PROGRA~1\Python23\lib\site-packages\spambayes\OptionsClass.py", line 553, in get_option return self._options[sect, opt] KeyError: ('Hammie', 'header_spam_string Am I missing something? From kennypitt at hotmail.com Wed Nov 5 14:42:35 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Nov 5 14:43:20 2003 Subject: [Spambayes] Lotus Notes filter error KeyError: ('Hammie', 'header_spam_string') In-Reply-To: <2384386.1068059505079.JavaMail.root@gonzo.psp.pas.earthlink.net> Message-ID: Mike Smith wrote: > When I try to train, it dies. Here is the command I'm entering: > C:\Program Files\Python23\Scripts>sb_notesfilter.py -t -d spambayes > -l c:\apps\N > otes\Data\msmith1.nsf -f spam > > Here is the Traceback: > Traceback (most recent call last): > File "C:\Program Files\Python23\Scripts\sb_notesfilter.py", line > 384, in ? sbfname, doTrain, doClassify) > File "C:\Program Files\Python23\Scripts\sb_notesfilter.py", line > 321, in run processAndTrain(vtrainspam, vspam, bayes, True, > notesindex) File "C:\Program > Files\Python23\Scripts\sb_notesfilter.py", line 237, in proce > ssAndTrain > str = options["Hammie", "header_spam_string"] > File > "C:\PROGRA~1\Python23\lib\site-packages\spambayes\OptionsClass.py", > line 562, in __getitem__ return self.get(key[0], key[1]) > File > "C:\PROGRA~1\Python23\lib\site-packages\spambayes\OptionsClass.py", > line 559, in get return self.get_option(sect, opt).get() > File > "C:\PROGRA~1\Python23\lib\site-packages\spambayes\OptionsClass.py", > line 553, in get_option return self._options[sect, opt] > KeyError: ('Hammie', 'header_spam_string > > Am I missing something? No, it's not you, it's a bug in the script. Some of the options were renamed or moved for 1.0a7, and it looks like a few references were missed (probably because not too many people use notesfilter). Since you have the source, you can fix this yourself for now. Go into the scripts subdirectory (under your source directory, not your Python23 directory) and open sb_notesfilter.py in any text editor. Wherever you find 'options["Hammie",' just change "Hammie" to "Headers". Then run the setup.py build/install commands again to update. As I mentioned, I don't think sb_notesfilter has been used/tested as much as some of the other scripts, so this may not be the only thing that is a bit out-of-date, but good luck. Keep us updated on how this works for you. -- Kenny Pitt From richie at entrian.com Wed Nov 5 14:56:00 2003 From: richie at entrian.com (Richie Hindle) Date: Wed Nov 5 14:56:13 2003 Subject: [Spambayes] Lotus Notes filter error KeyError: ('Hammie', 'header_spam_string') In-Reply-To: <2384386.1068059505079.JavaMail.root@gonzo.psp.pas.earthlink.net> References: <2384386.1068059505079.JavaMail.root@gonzo.psp.pas.earthlink.net> Message-ID: <50liqvg29dnuk1s4k5hdvf1q5um1r01jop@4ax.com> [Mike] > File "C:\Program Files\Python23\Scripts\sb_notesfilter.py", line 237, in processAndTrain > str = options["Hammie", "header_spam_string"] I don't know much about the Notes stuff, but that looks like a bug. That piece of code should probably be: if is_spam: str = options["Headers", "header_spam_string"] else: str = options["Headers", "header_ham_string"] You can see from spambayes/Options.py that header_spam_string goes in the Headers section, not the Hammie section. There are a few other places in sb_notesfilter.py where similar code ("options["Hammie", "header_xxx_string"]") appears - that should be changed too. Mike, does changing "Hammie" to "Headers" in those places fix your problem? I'm forwarding this to spambayes-dev to see whether anyone there knows for sure whether I'm right about this...? -- Richie Hindle richie@entrian.com From skip at pobox.com Wed Nov 5 15:04:05 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Nov 5 15:04:17 2003 Subject: [Spambayes] Whole domain falsly triggering HAM In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A012C3692@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2A012C3692@cliff.bai.org> Message-ID: <16297.22453.130170.349926@montanaro.dyndns.org> Ryan> Hand-sort a bunch (hundreds at least) of recent emails into spam Ryan> and non-spam folders. You must be sure they are all categorized Ryan> correctly. I like to do this with copies of messages. I would start with a much smaller set, 10-20 of each. I started from scratch yesterday and have so far trained on 100 spams and 64 hams. Seat-of-the-pants it's doing just as good as my old training database which had 5659 spams and 7363 hams. Aside from one or two messages used to prime the pump, all of the 164 messages were unsures or mistakes. Skip From cah11 at cornell.edu Wed Nov 5 15:14:33 2003 From: cah11 at cornell.edu (Carol Ann Halseth) Date: Wed Nov 5 15:14:40 2003 Subject: [Spambayes] Undergraduate Alumni Questionnaire - Please Help Message-ID: <5.2.1.1.2.20031105151347.01c80c30@postoffice7.mail.cornell.edu> Greetings Joshua, We received your name from the registrar at Cornell as a student who took at least 20% of their curricula in the area of East Asia. Many courses such as art, architecture, etc. might not have had the majority of the course work as East Asia related, but as with all things it's hard to make an exact science. We're getting in touch with recent graduates whose studies were related to East Asia (as explained earlier), to get some information regarding career plans. We are interested in your response even if East Asia isn't your career objective or your major area of study. We are required to report this information (in summary form) to the U.S. Department of Education, where we obtain significant grants that help us support East Asian studies at Cornell. Please take a moment to complete the form below (fill in the blanks) and return it to me (via e-mail is fine). Thank you for taking the time to help with this survey! Time is of the essence, so if you could return this by (or before, preferably), July 1 it would be very appreciated. Carol East Asia Program, 140 Uris Hall, 607/255-5685, fax 607/255-1388, cah11@cornell.edu LAST: FIRST: Degree (BA/BS)/ Major(s): SSN: MAILING ADDRESS: PHONE: PERMANENT (Parent's) ADDRESS: PERM. PHONE: AREA of INTEREST: COUNTRY of INTEREST: PRC ____ Taiwan ____ Japan ___ Korea ___ Hong Kong ___ NOW THAT YOU'VE OBTAINED YOUR BA, WHAT IS YOUR CAREER PLAN? CONTINUED STUDY: School: Address: POST-CU EMPLOYMENT: Position: Employer: Address: NATIONAL RESOURCE CENTER CAREER CATEGORIES: Please mark only one, either your first choice if you are still seeking employment, or the category that fits the job named above. __ US Postsecondary Educational Institution __ US Elementary or Secondary Educational Institution __ US Federal, State, or Local Government (including military service) __ US Private Sector, Non-Profit __ US Private Sector, For Profit __ US Museum __ International or Foreign Organization geographically located IN the US __ International or Foreign Organization geographically located OUTSIDE the US __ Out of the Job Market (not seeking employment) Would you like to receive East Asia Program newsletters (2x/year)? __ Yes __ No Carol Ann Halseth Admin Supervisor, Fellowship & Travel Grant Coordinator East Asia Program 140 Uris Hall Cornell University Ithaca, NY 14853 tel: 607-255-6222 fax: 607-255-1388 url: http://www.einaudi.cornell.edu/eastasia/ E-mail: cah11@cornell.edu From kennypitt at hotmail.com Wed Nov 5 15:34:13 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Nov 5 15:34:37 2003 Subject: [Spambayes] Undergraduate Alumni Questionnaire - Please Help In-Reply-To: <5.2.1.1.2.20031105151347.01c80c30@postoffice7.mail.cornell.edu> Message-ID: Carol Ann Halseth wrote: > Greetings Joshua, > We received your name from the registrar at Cornell as a student who > took > at least 20% of their curricula in the area of East Asia. Many > courses > such as art, architecture, etc. might not have had the majority of the > course work as East Asia related, but as with all things it's hard to > make > an exact science. OK, this is either a blatant case of mistaken identity, or the strangest spam I've ever seen. :-) -- Kenny Pitt From Remi.Ricard at simlog.com Wed Nov 5 16:17:53 2003 From: Remi.Ricard at simlog.com (Remi Ricard) Date: Wed Nov 5 16:17:56 2003 Subject: [Spambayes] Trying spambayes-1.0a7 Message-ID: <3FA96901.3080404@simlog.com> Hi, This is the error message I get when I try to save the configuration in the web ui What I did is: 1- Download spambayes-1.0a7.zip 2- unzip it 3- Go to spambayes-1.0a6 directory 4- setup.py install 5- Go to c:/my_python_script_dir_where_the_spambayes_script_are_copied 6- Start sb_server -b 7- Go to the web interface 8- Add the pop server pop1.test.ca,pop2.test.ca,pop3,test,ca (I added real name) 9- Add the port 6110,6111,6112 10- Push the "Save" button Then bang this error 500 Server error Traceback (most recent call last): File "c:\Devtools\spambayes\spambayes\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "c:\Devtools\spambayes\spambayes\spambayes\UserInterface.py", line 684, in onChangeopts self.reReadOptions() File "c:\Devtools\spambayes\spambayes\spambayes\ProxyUI.py", line 530, in reReadOptions state = self.state_recreator() File "C:\Devtools\Python\PYTHON~1.2\Scripts\SB_SER~1.PY", line 736, in _recreateState prepare(state) File "C:\Devtools\Python\PYTHON~1.2\Scripts\SB_SER~1.PY", line 752, in prepare state.createWorkers() File "C:\Devtools\Python\PYTHON~1.2\Scripts\SB_SER~1.PY", line 623, in createWorkers self.bayes = storage.open_storage(filename, self.useDB) File "c:\Devtools\spambayes\spambayes\spambayes\storage.py", line 677, in open_storage return klass(data_source_name) File "c:\Devtools\spambayes\spambayes\spambayes\storage.py", line 162, in __init__ self.load() File "c:\Devtools\spambayes\spambayes\spambayes\storage.py", line 187, in load self.dbm = dbmstorage.open(self.db_name, self.mode) File "c:\Devtools\spambayes\spambayes\spambayes\dbmstorage.py", line 60, in open return f(db_name, mode) File "c:\Devtools\spambayes\spambayes\spambayes\dbmstorage.py", line 20, in open_dbhash return bsddb.hashopen(*args) error: (22, 'Invalid argument') Remi From mikesmithjunkmail at earthlink.net Wed Nov 5 16:37:15 2003 From: mikesmithjunkmail at earthlink.net (Mike Smith) Date: Wed Nov 5 16:37:33 2003 Subject: [Spambayes] Lotus Notes filter error KeyError: ('Hammie','header_spam_string') Message-ID: <15467598.1068068235545.JavaMail.root@gonzo.psp.pas.earthlink.net> Replacing :Hammie" with "Header" worked. Thank you. Mike Mike Smith wrote: > When I try to train, it dies. Here is the command I'm entering: > C:\Program Files\Python23\Scripts>sb_notesfilter.py -t -d spambayes > -l c:\apps\N > otes\Data\msmith1.nsf -f spam > > Here is the Traceback: > Traceback (most recent call last): > File "C:\Program Files\Python23\Scripts\sb_notesfilter.py", line > 384, in ? sbfname, doTrain, doClassify) > File "C:\Program Files\Python23\Scripts\sb_notesfilter.py", line > 321, in run processAndTrain(vtrainspam, vspam, bayes, True, > notesindex) File "C:\Program > Files\Python23\Scripts\sb_notesfilter.py", line 237, in proce > ssAndTrain > str = options["Hammie", "header_spam_string"] > File > "C:\PROGRA~1\Python23\lib\site-packages\spambayes\OptionsClass.py", > line 562, in __getitem__ return self.get(key[0], key[1]) > File > "C:\PROGRA~1\Python23\lib\site-packages\spambayes\OptionsClass.py", > line 559, in get return self.get_option(sect, opt).get() > File > "C:\PROGRA~1\Python23\lib\site-packages\spambayes\OptionsClass.py", > line 553, in get_option return self._options[sect, opt] > KeyError: ('Hammie', 'header_spam_string > > Am I missing something? No, it's not you, it's a bug in the script. Some of the options were renamed or moved for 1.0a7, and it looks like a few references were missed (probably because not too many people use notesfilter). Since you have the source, you can fix this yourself for now. Go into the scripts subdirectory (under your source directory, not your Python23 directory) and open sb_notesfilter.py in any text editor. Wherever you find 'options["Hammie",' just change "Hammie" to "Headers". Then run the setup.py build/install commands again to update. As I mentioned, I don't think sb_notesfilter has been used/tested as much as some of the other scripts, so this may not be the only thing that is a bit out-of-date, but good luck. Keep us updated on how this works for you. -- Kenny Pitt -------------- From russ_foster at comcast.net Wed Nov 5 15:52:06 2003 From: russ_foster at comcast.net (Russ Foster) Date: Wed Nov 5 16:57:43 2003 Subject: [Spambayes] optimal training ratio of spam/ham In-Reply-To: <3FA72C59.C42AF6C1@whidbey.com> Message-ID: Nothing highly unusual about the mail pattern. Its less work related and more personal contacts. qsf (Quick Spam Filter), also available from SF. On Mon, 3 Nov 2003, G. Armour Van Horn wrote: > What's the other product? Anything unusual about the mail pattern on that > account? From russ_foster at comcast.net Wed Nov 5 15:56:54 2003 From: russ_foster at comcast.net (Russ Foster) Date: Wed Nov 5 17:02:37 2003 Subject: [Spambayes] optimal training ratio of spam/ham In-Reply-To: Message-ID: Yes, I'm trying to get SpamBayes installed on a RedHat 7.3 system. My problem is getting the latest version of Python. Here's the output: -- [root@orion Python-2.3.2]# ./configure checking MACHDEP... linux2 checking EXTRAPLATDIR... checking for --without-gcc... no checking for --with-cxx=... no checking for c++... c++ checking for C++ compiler default output... a.out checking whether the C++ compiler works... configure: error: cannot run C++ compiled programs. If you meant to cross compile, use `--host'. See `config.log' for more details. [root@orion Python-2.3.2]# -- Any ideas? -r On Mon, 3 Nov 2003, Neale Pickett wrote: > I can only assume from this comment that you're using an RPM-based > system. You may have a better luck installing Python from source. You > can download the source tarball from python.org, untar it somewhere, > then (as root): > > ./configure && make install > > After a few minutes, you should have a working version of Python, which > should be all you need to get SpamBayes installed. > > Let me know how that goes :) From richie at entrian.com Wed Nov 5 17:13:07 2003 From: richie at entrian.com (Richie Hindle) Date: Wed Nov 5 17:13:20 2003 Subject: [Spambayes] Trying spambayes-1.0a7 In-Reply-To: <3FA96901.3080404@simlog.com> References: <3FA96901.3080404@simlog.com> Message-ID: <5vsiqv0d97em3ain6gnlv926hjm5p10p7s@4ax.com> [Remi] > error: (22, 'Invalid argument') I just followed your instructions exactly but had no error. Two things strike me as odd about your stack trace. One is the location of your spambayes library - Python is in C:\Devtools\Python\PYTHON~1.2\, but spambayes is in c:\Devtools\spambayes\spambayes. I'd have expected spambayes to be in C:\Devtools\Python\PYTHON~1.2\Lib\site-packages - is it possible you have a PYTHONPATH, or a .pth file, that's pointing to the wrong spambayes library? If you type 'python -c "import spambayes; print spambayes.__file__"', what do you get? Is it what you're expecting, given that you've run the standard spambayes installer? The other thing which is odd is those traceback pathnames - why the DOS-style filenames, C:\Devtools\Python\PYTHON~1.2\Scripts\SB_SER~1.PY? Are you running an unusual build of Python, or running in an unusual environment? -- Richie Hindle richie@entrian.com From sethg at GoodmanAssociates.com Wed Nov 5 21:07:43 2003 From: sethg at GoodmanAssociates.com (Seth Goodman) Date: Wed Nov 5 21:07:46 2003 Subject: [Spambayes] repeatable condition where SpamBayes misses new message Message-ID: I have background processing set to wait 10 seconds before starting and 1 second between messages. I use the preview pane on the Inbox and things are set to change a message from Unread to Read after it is displayed in the preview pane for two seconds. When a new message hits the inbox, if the cursor was previously at the top of the Inbox (most recent message), the cursor will move to the new message (who told it to do that?). After 2 seconds in the preview pane, the new message will change from Unread to Read before the 10 second SpamBayes timer delay runs out. When this occurs, SpamBayes neither scores nor moves the new message: it obviously no longer sees it as new. This appears to be repeatable. This explains at least one condition where SpamBayes does not get triggered on a new message. Since there appears to be a general problem triggering SpamBayes exactly one time for every new message, and there is some conflict with the Outlook rule processor, it would be *nice* if something could be done to fix this. Here are a few admittedly naive suggestions. Perhaps one of them may be useful to you. Scenario I) When you get your "folder add event" or the start timer delay times out, check the watched folders for messages with no spam score that are not already on the list of messages to be processed. This would also be a good time to check the list for duplicate message ID's, if you still get those. Then process messages as usual. Scenario II) Do the initial scoring in a POP3 proxy and put the score in one X- header line and the classification in another X- header line. This insures that every message is scored exactly once. Use an Outlook rule to move the message into either the Unsure or Spam folders, if appropriate. Process all other Outlook rules after this one. Disable the incremental training option so folder moves do not cause training and don't trigger on "folder add events". Use only the toolbar buttons for reclassification and retraining. Delete the background processing option. Scenario III) Instead of triggering on "folder add events", run your process as a "custom action" in an Outlook rule that runs whenever a new message appears. The internal mechanism for triggering rules when new messages show up seems to be reliable. Microsoft has provided a prototype "custom action" called Launcher.dll that calls a designated external program with a new message ID. Process all other Outlook rules after this one. Disable the incremental training option so folder moves do not cause training and don't trigger on "folder add events". Use only the toolbar buttons for reclassification and retraining. Delete the background processing option. The code for Launcher.dll is at ftp://ftp.microsoft.com/services/TechNet/samples/BOES/BO/MAILEXCH/exchange/a ppfarm/ as the file "Printex.exe". This is a self-extractor that will expand to the proper directory structure when invoked with "Printex.exe -d". The Readme.txt file explains how to get Launcher.dll to call whatever external program you want and pass it the message ID of the new message. It includes a printing application (Exprint) that prints out a message for testing purposes. Seth Goodman From foof at synthcode.com Wed Nov 5 21:48:43 2003 From: foof at synthcode.com (Alex Shinn) Date: Wed Nov 5 21:54:03 2003 Subject: [Spambayes] spambayes quit on me In-Reply-To: <3FA90206.9020505@videotron.ca> References: <87znfbl73a.wl@strelka.synthcode.com> <3FA90206.9020505@videotron.ca> Message-ID: <87vfpyky9g.wl@strelka.synthcode.com> At Wed, 05 Nov 2003 08:58:30 -0500, papaDoc wrote: > > What I will do is delete our database (if you still have a bunch of > spam and ham) then try the following command line (with the -f) With > the -f the status of the old mail is not taken into account. > > sb_mboxtrain.py -f -d ~/.hammiedb -g /var/mail/foof -s ~/Mail/spam Thanks Remi, that did the trick. I'm not sure why it stopped working the first time (before I deleted the db), but it's back to normal now. -- Alex From anthony at interlink.com.au Wed Nov 5 23:32:44 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Wed Nov 5 23:33:04 2003 Subject: [Spambayes] Spambayes 1.0a7 - windows proxy_tray installation In-Reply-To: <3FA9CE74.30408@swiftdsl.com.au> Message-ID: <200311060432.hA64WiVL001903@localhost.localdomain> >>> Phil Pierotti wrote > Let me prefix this by saying that I'm sure I've got a screw loose here, > or something. > > Was happily running 1.0a6 > DOwnloaded the ZIP > UnZipped > Ran setup.py install > went to Python23 and ran Scripts\pop3proxy_service.py > > Fine and dandy > when I run Scripts\pop3proxy_tray.py it says: > > C:\Python23>Scripts\pop3proxy_tray.py > Traceback (most recent call last): > File "C:\Python23\Scripts\pop3proxy_tray.py", line 467, in ? > main() > File "C:\Python23\Scripts\pop3proxy_tray.py", line 463, in main > w = MainWindow() > File "C:\Python23\Scripts\pop3proxy_tray.py", line 164, in __init__ > 0, icon_flags) > pywintypes.error: (2, 'LoadImage', 'The system cannot find the file > specified.') > > what am I missing/misunderstanding? I have no idea, I'm afraid - I'm just the release monkey . I'm cc'ing the SB list for more clues. From vanhorn at whidbey.com Wed Nov 5 23:52:30 2003 From: vanhorn at whidbey.com (G. Armour Van Horn) Date: Wed Nov 5 23:52:51 2003 Subject: [Spambayes] optimal training ratio of spam/ham References: Message-ID: <3FA9D38E.E0C6121@whidbey.com> This sounds like a problem I ran into with another Python program, but I no longer remember exactly which program it was, or even which of my Red Hat machines it was on. However, at least through 7.3 and possibly later, Red Hat depends on having Python 1.57 (I think) available for certain internal functions. Therefore, the later versions of Python install as "Python2" instead of "Python" and programs that need the later need to be specifically pointed at that. In the earlier case, it took a "configure with python=/path/to/python2" to make the program work. Does someone more on top of these things remember the specifics, and know how Russ can use them in this case? Van Russ Foster wrote: > Yes, I'm trying to get SpamBayes installed on a RedHat 7.3 system. > > My problem is getting the latest version of Python. Here's the > output: > > -- > > [root@orion Python-2.3.2]# ./configure > checking MACHDEP... linux2 > checking EXTRAPLATDIR... > checking for --without-gcc... no > checking for --with-cxx=... no > checking for c++... c++ > checking for C++ compiler default output... a.out > checking whether the C++ compiler works... configure: error: cannot run > C++ compiled programs. > If you meant to cross compile, use `--host'. > See `config.log' for more details. > [root@orion Python-2.3.2]# > > -- > > Any ideas? > > -r > > On Mon, 3 Nov 2003, Neale Pickett wrote: > > I can only assume from this comment that you're using an RPM-based > > system. You may have a better luck installing Python from source. You > > can download the source tarball from python.org, untar it somewhere, > > then (as root): > > > > ./configure && make install > > > > After a few minutes, you should have a working version of Python, which > > should be all you need to get SpamBayes installed. > > > > Let me know how that goes :) > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -- ---------------------------------------------------------- Sign up now for Quotes of the Day, a handful of quotations on a theme delivered every morning. Enlightenment! Daily, for free! mailto:twisted@whidbey.com?subject=Subscribe_QOTD For web hosting and maintenance, visit Van's home page: http://www.domainvanhorn.com/van/ ---------------------------------------------------------- From phil.pierotti at swiftdsl.com.au Thu Nov 6 00:06:40 2003 From: phil.pierotti at swiftdsl.com.au (Phil Pierotti) Date: Thu Nov 6 00:06:50 2003 Subject: [Spambayes] Spambayes 1.0a7 - windows proxy_tray installation In-Reply-To: <200311060432.hA64WiVL001903@localhost.localdomain> References: <200311060432.hA64WiVL001903@localhost.localdomain> Message-ID: <3FA9D6E0.3000409@swiftdsl.com.au> Found the issue: paths for the icon resources etc.... in pop3proxy_tray.py (line #144 onwards) startedIconPathName = "%s\\..\\windows\\resources\\sb-started.ico" % \ (os.path.dirname(sb_server.__file__),) stoppedIconPathName = "%s\\..\\windows\\resources\\sb-stopped.ico" % \ (os.path.dirname(sb_server.__file__),) # When 1.0a6 is released, the above line will need to change to: ## iconPathName = "%s\\..\\windows\\resources\\sbicon.ico" % \ ## (os.path.dirname(sb_server.__file__),) [mebbe I didn't install something the way it was expecting] But what I see is that under the distribution, there's windows\pop3proxy_*.py ("the scripts") windows\resources\ (with all the .ico etc resources for the systray program) The scripts are installed under \Python23\Scripts by setuup.py, but there's no corresponding \Python23\windows\resources\ with all the icon/resources So: (a) did I not install something properly (b) dd the installer not install the resources properly (c) are the paths in the script wrong (d) all of the above (e) none of the above, I'm just smoking too much crack (as per usual) Thanks, Phil P Anthony Baxter wrote: > > >>>>Phil Pierotti wrote >>>> >>>> >>Let me prefix this by saying that I'm sure I've got a screw loose here, >>or something. >> >>Was happily running 1.0a6 >>DOwnloaded the ZIP >>UnZipped >>Ran setup.py install >>went to Python23 and ran Scripts\pop3proxy_service.py >> >>Fine and dandy >>when I run Scripts\pop3proxy_tray.py it says: >> >>C:\Python23>Scripts\pop3proxy_tray.py >>Traceback (most recent call last): >> File "C:\Python23\Scripts\pop3proxy_tray.py", line 467, in ? >> main() >> File "C:\Python23\Scripts\pop3proxy_tray.py", line 463, in main >> w = MainWindow() >> File "C:\Python23\Scripts\pop3proxy_tray.py", line 164, in __init__ >> 0, icon_flags) >>pywintypes.error: (2, 'LoadImage', 'The system cannot find the file >>specified.') >> >>what am I missing/misunderstanding? >> >> > >I have no idea, I'm afraid - I'm just the release monkey . I'm >cc'ing the SB list for more clues. > > > > > From jlriehm at itctel.com Thu Nov 6 08:58:15 2003 From: jlriehm at itctel.com (Lois Riehm) Date: Thu Nov 6 09:23:07 2003 Subject: [Spambayes] help Message-ID: <000801c3a46e$07fec200$5314ddcc@jlriehm> I need spam help- I thought it was free- jlriehm@itctel.com thank you From Remi.Ricard at simlog.com Thu Nov 6 09:37:36 2003 From: Remi.Ricard at simlog.com (Remi Ricard) Date: Thu Nov 6 09:37:43 2003 Subject: [Spambayes] Trying spambayes-1.0a7 In-Reply-To: <5vsiqv0d97em3ain6gnlv926hjm5p10p7s@4ax.com> References: <3FA96901.3080404@simlog.com> <5vsiqv0d97em3ain6gnlv926hjm5p10p7s@4ax.com> Message-ID: <3FAA5CB0.8000509@simlog.com> Hi Richie, >I just followed your instructions exactly but had no error. Two things >strike me as odd about your stack trace. One is the location of your >spambayes library - Python is in C:\Devtools\Python\PYTHON~1.2\, but >spambayes is in c:\Devtools\spambayes\spambayes. > UsuallyI was running spambayes from CVS in a given directory. Since I wanted to install version 1.0a7 I did the installation. >I'd have expected >spambayes to be in C:\Devtools\Python\PYTHON~1.2\Lib\site-packages - is it >possible you have a PYTHONPATH, or a .pth file, that's pointing to the >wrong spambayes library? > When I was coming back home I though "Hey my python path point to my old spambayes location but is it more than that >If you type 'python -c "import spambayes; print >spambayes.__file__"', what do you get? > This is what I get: python -c "import spambayes; print spambayes.__file__": not found >Is it what you're expecting, given that you've run the standard spambayes installer? > No >The other thing which is odd is those traceback pathnames - why the >DOS-style filenames, C:\Devtools\Python\PYTHON~1.2\Scripts\SB_SER~1.PY? >Are you running an unusual build of Python, or running in an unusual >environment? > Ok this is my setup I'm running Windows 2000 cygwin is installed and mks I'm doing my command in an MKS Korn Shell window When I was doing "which python" I was getting c:/cygwin/bin/python.exe oups.... So I added c:/somedir/Python/Python-2.2.2 in front of the cygwin directory in my path Now when I do "which python" I'm getting c:/devtools/Python/Python-2.2.2/python.exe My environment variable PYTHONPATH is set to c:/devtools/Python/Python-2.2.2 is this correct ???? After changing the environment variable and the path I cleaned the python-2.2.2/Scripts from spambayes stuff and also the Lib/site-packages Then I reinstalled everything setup.py install I got spambayes-1.0a7% setup.py install running install running build running build_py not copying spambayes\cdb.py (output up-to-date) running build_scripts not copying scripts\sb_client.py (up-to-date) running install_lib creating C:\Devtools\Python\PYTHON~1.2\Lib\site-packages\spambayes copying build\lib\spambayes\cdb.py -> C:\Devtools\Python\PYTHON~1.2\Lib\site-packages\spambayes running install_scripts copying build\scripts-2.2\pop3proxy_service.py -> C:\Devtools\Python\PYTHON~1.2\Scripts This look go to me spambayes-1.0a7% setup.py install running install running build running build_py not copying spambayes\cdb.py (output up-to-date) not copying spambayes\cdb_classifier.py (output up-to-date) not copying spambayes\chi2.py (output up-to-date) running build_scripts not copying scripts\sb_client.py (up-to-date) not copying scripts\sb_dbexpimp.py (up-to-date) running install_lib not copying build\lib\spambayes\cdb.py (output up-to-date) not copying build\lib\spambayes\cdb_classifier.py (output up-to-date) not copying build\lib\spambayes\chi2.py (output up-to-date) skipping byte-compilation of C:\Devtools\Python\PYTHON~1.2\Lib\site-packages\spambayes\cdb.py to cdb.pyc skipping byte-compilation of C:\Devtools\Python\PYTHON~1.2\Lib\site-packages\spa mbayes\cdb.py to cdb.pyc > > > From kennypitt at hotmail.com Thu Nov 6 10:22:18 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Thu Nov 6 10:22:42 2003 Subject: [Spambayes] Spambayes 1.0a7 - windows proxy_tray installation In-Reply-To: <3FA9D6E0.3000409@swiftdsl.com.au> Message-ID: Phil Pierotti wrote: > [mebbe I didn't install something the way it was expecting] > But what I see is that under the distribution, there's > windows\pop3proxy_*.py ("the scripts") > windows\resources\ (with all the .ico etc resources for the > systray program) > > The scripts are installed under > \Python23\Scripts > by setuup.py, but there's no corresponding > \Python23\windows\resources\ > with all the icon/resources > > So: > > (a) did I not install something properly > (b) dd the installer not install the resources properly > (c) are the paths in the script wrong > (d) all of the above > (e) none of the above, I'm just smoking too much crack (as per usual) Looks like you hit the nail right on the head. Glad I finished reading my inbox before replying to your previous message . At some point in the not-too-distant past, a decision was made that the Windows scripts pop3proxy_service.py and pop3proxy_tray.py should be installed to the Python Scripts directory along with the other command-line scripts. It seems this was a bit premature, as pop3proxy_tray obviously isn't designed to be run that way. When run from source, the icon resources are required to be present in a directory structure that isn't appropriate for installing into the main Python directory. For now, you should be able to get around this problem by going to the windows dir in your original source tree and running pop3proxy_tray.py from there. I've CC'd the spambayes-dev list in hopes that someone can take a look at this. At the very least, we should probably stop copying it to the Python\Scripts directory until the problem is fixed. -- Kenny Pitt From skip at pobox.com Thu Nov 6 15:21:31 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Nov 6 15:21:48 2003 Subject: [Spambayes] help In-Reply-To: <000801c3a46e$07fec200$5314ddcc@jlriehm> References: <000801c3a46e$07fec200$5314ddcc@jlriehm> Message-ID: <16298.44363.118835.725353@montanaro.dyndns.org> Lois> I need spam help- I thought it was free- Yes, SpamBayes is free. -- Skip Montanaro Got gigs? http://www.musi-cal.com/ http://www.mojam.com/ Got spam? http://spambayes.sf.net/ From olli.marttila at kolumbus.fi Thu Nov 6 16:19:10 2003 From: olli.marttila at kolumbus.fi (Olli J. Marttila) Date: Thu Nov 6 16:19:21 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / No module named zlib ? Message-ID: <200311062319.10448.olli.marttila@kolumbus.fi> Dear list, I overcome the first hurdles in installing Spambayes0.7 in SuSE Linux 8.2 by compiling python 2.3.2 instead of the default rpm-file. As my e-mail protocol is IMAP, I run sb_imapfilter.py -b with the following result: ------------------ marttila@linux:~> sb_imapfilter.py -b SpamBayes IMAP Filter Beta1, version 0.1 (September 2003), using SpamBayes IMAP Filter Web Interface Alpha2, version 0.02 and engine SpamBayes Beta2, version 0.2 (July 2003). User interface url is http://localhost:8880/ Traceback (most recent call last): File "/usr/local/bin/sb_imapfilter.py", line 825, in ? run() File "/usr/local/bin/sb_imapfilter.py", line 801, in run IMAPSession)) File "/usr/local/lib/python2.3/site-packages/spambayes/ImapUI.py", line 116, in __init__ UserInterface.UserInterface.__init__(self, cls, parm_map, adv_map) File "/usr/local/lib/python2.3/site-packages/spambayes/UserInterface.py", line 246, in __init__ BaseUserInterface.__init__(self) File "/usr/local/lib/python2.3/site-packages/spambayes/UserInterface.py", line 116, in __init__ htmlSource, self._images = self.readUIResources() File "/usr/local/lib/python2.3/site-packages/spambayes/UserInterface.py", line 232, in readUIResources from spambayes.resources import ui_html File "/usr/local/lib/python2.3/site-packages/spambayes/resources/ui_html.py", line 7, in ? import zlib ImportError: No module named zlib -------------- Yet I have both zlib-1.1.4-105 and zlib-devel-1.1.4-105 installed from SuSE home site. In addition, writing URL http://localhost:8880 leads to an advertisment site www.netbreakthrough.com - apparently because the installation did not get through. So Iam still at a loss what to do. Can anyone help me? Olli J. Marttila From richie at entrian.com Thu Nov 6 16:44:15 2003 From: richie at entrian.com (Richie Hindle) Date: Thu Nov 6 16:44:30 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / No module named zlib ? In-Reply-To: <200311062319.10448.olli.marttila@kolumbus.fi> References: <200311062319.10448.olli.marttila@kolumbus.fi> Message-ID: [Olli] > In addition, writing URL http://localhost:8880 leads to an > advertisment site www.netbreakthrough.com - apparently > because the installation did not get through. I can't help you with your build problems, but this is down to a feature of Mozilla whereby if it can't find the address you're looking for then it does an I'm-feeling-lucky Google search for whatever the address is. The clever folks behind www.netbreakthrough.com have registered www.localhost.com.au, managed to make it Google's first-ranked hit for "localhost", then made it redirect to www.netbreakthrough.com. They probably get a million hits a day from that. 8-) -- Richie Hindle richie@entrian.com From admin at eco.com.au Thu Nov 6 16:45:11 2003 From: admin at eco.com.au (Anna Tredwell) Date: Thu Nov 6 16:45:25 2003 Subject: [Spambayes] Good Job Message-ID: <000001c3a4af$44c83770$0200000a@wwwafhgkzf27j5> Hi, I just wanted to write a quick thank you, you probably only ever get people emailing to complain about problems, I thought I would reverse the trend and say thanks. I had a problem with my SpamBayes not filtering and instead of instinctively reaching for the send button to complain I sat myself down and read through your trouble shooting guide. Surprise surprise, I actually fixed the problem myself! Thanks to all the support staff who spend time writing these detailed solutions to probs, you should be congratulated. Thanks again and keep up the good work ? SpamBayes is a god send!!!! at --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.536 / Virus Database: 331 - Release Date: 3/11/2003 From Russell at roanestate.edu Thu Nov 6 16:49:16 2003 From: Russell at roanestate.edu (Russell, Gail) Date: Thu Nov 6 16:49:22 2003 Subject: [Spambayes] Enable Spambayes Message-ID: When I installed Spambayes I noticed a "Junk Mail" folder in my inbox. Now the Junk Mail folder is gone (I did not delete it) and I'm getting a message that I must enable Spambayes. How do I do that? I checked FAQ and don't see it listed. Gail Russell, Placement and Alumni Relations Roane State Community College 276 Patton Lane Harriman, TN 37748 865-882-4695 (phone) 865-882-4554 (phone) From richie at entrian.com Thu Nov 6 17:10:23 2003 From: richie at entrian.com (Richie Hindle) Date: Thu Nov 6 17:10:56 2003 Subject: [Spambayes] Good Job In-Reply-To: <000001c3a4af$44c83770$0200000a@wwwafhgkzf27j5> References: <000001c3a4af$44c83770$0200000a@wwwafhgkzf27j5> Message-ID: <3khlqvopaug0bkg8jeqplmrk5nbsdu3c3f@4ax.com> [Anna] > I just wanted to write a quick thank you [...] you should be congratulated. Thanks for the thanks - it's messages like this (and the private yachts) that keep us going! -- Richie Hindle richie@entrian.com From strav24 at hotmail.com Thu Nov 6 17:16:05 2003 From: strav24 at hotmail.com (Jimmy Stavropoulos) Date: Thu Nov 6 17:16:08 2003 Subject: [Spambayes] Enhancement Suggestion for Envelope in System Tray Message-ID: In the FAQ question 3.8 "How can I get rid of the envelope tray icon for spam?", it states that this cannot be done by SpamBayes. Rather than trying to get rid of the Outlook envelope icon, why doesn't SpamBayes optionally display it's own envelope icon. SpamBayes knows when a message arrives and it also knows whether it was moved to a Spam folder. Therefore, it can display its own icon in the system tray to represent when a non-Spam message arrives. It should also optionally show up for possible spam. I have no problem adding another icon to my system tray to avoid interuptions at work every 5 minutes to check if my mail is Spam or legitimate. - Jimmy S. _________________________________________________________________ Is your computer infected with a virus? Find out with a FREE computer virus scan from McAfee. Take the FreeScan now! http://clinic.mcafee.com/clinic/ibuy/campaign.asp?cid=3963 From mhammond at skippinet.com.au Thu Nov 6 17:24:55 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu Nov 6 17:24:37 2003 Subject: [Spambayes] repeatable condition where SpamBayes misses new message In-Reply-To: Message-ID: <021b01c3a4b4$cebae810$0500a8c0@eden> Thanks for the note. You ar 100% correct in your analysis of the problem. Wouldn't another solution simply be to set the background processing to a much lower value? Remember the intention here is to prevent conflicts with the builtin rules - I have mine set around the defaults of a 1/2 seconds. This way it will be processed before the preview-pane marks it as read. Another potential solution would be to see if I can determine the 'selected' message in the preview pane, and explicitly score it ignoring the 'read' state of just that one message. Either way, it is unlikely to happen unless someone steps up to try and do some of it. We are far more likely to spend time helping people add features than trying to keep up with them all ourselves! Regards, Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Seth Goodman > Sent: Thursday, 6 November 2003 1:08 PM > To: SpamBayes Email Forum > Subject: [Spambayes] repeatable condition where SpamBayes misses new > message > > > I have background processing set to wait 10 seconds before > starting and 1 > second between messages. I use the preview pane on the Inbox > and things are > set to change a message from Unread to Read after it is > displayed in the > preview pane for two seconds. When a new message hits the > inbox, if the > cursor was previously at the top of the Inbox (most recent > message), the > cursor will move to the new message (who told it to do > that?). After 2 > seconds in the preview pane, the new message will change from > Unread to Read > before the 10 second SpamBayes timer delay runs out. When > this occurs, > SpamBayes neither scores nor moves the new message: it > obviously no longer > sees it as new. This appears to be repeatable. > > This explains at least one condition where SpamBayes does not > get triggered > on a new message. Since there appears to be a general > problem triggering > SpamBayes exactly one time for every new message, and there > is some conflict > with the Outlook rule processor, it would be *nice* if > something could be > done to fix this. Here are a few admittedly naive > suggestions. Perhaps one > of them may be useful to you. > > Scenario I) When you get your "folder add event" or the start > timer delay > times out, check the watched folders for messages with no > spam score that > are not already on the list of messages to be processed. > This would also be > a good time to check the list for duplicate message ID's, if > you still get > those. Then process messages as usual. > > Scenario II) Do the initial scoring in a POP3 proxy and put > the score in one > X- header line and the classification in another X- header line. This > insures that every message is scored exactly once. Use an > Outlook rule to > move the message into either the Unsure or Spam folders, if > appropriate. > Process all other Outlook rules after this one. Disable the > incremental > training option so folder moves do not cause training and > don't trigger on > "folder add events". Use only the toolbar buttons for > reclassification and > retraining. Delete the background processing option. > > Scenario III) Instead of triggering on "folder add events", > run your process > as a "custom action" in an Outlook rule that runs whenever a > new message > appears. The internal mechanism for triggering rules when > new messages show > up seems to be reliable. Microsoft has provided a prototype > "custom action" > called Launcher.dll that calls a designated external program > with a new > message ID. Process all other Outlook rules after this one. > Disable the > incremental training option so folder moves do not cause > training and don't > trigger on "folder add events". Use only the toolbar buttons for > reclassification and retraining. Delete the background > processing option. > > The code for Launcher.dll is at > > ftp://ftp.microsoft.com/services/TechNet/samples/BOES/BO/MAILE XCH/exchange/a ppfarm/ as the file "Printex.exe". This is a self-extractor that will expand to the proper directory structure when invoked with "Printex.exe -d". The Readme.txt file explains how to get Launcher.dll to call whatever external program you want and pass it the message ID of the new message. It includes a printing application (Exprint) that prints out a message for testing purposes. Seth Goodman _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From mhammond at skippinet.com.au Thu Nov 6 17:29:47 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu Nov 6 17:29:26 2003 Subject: [Spambayes] Spambayes - a few more steps forward in Czech inplugin 0081 In-Reply-To: Message-ID: <021c01c3a4b5$7c673400$0500a8c0@eden> Thanks Erik, We are tracking this in https://sourceforge.net/tracker/index.php?func=detail&aid=835435&group_id=61 702&atid=498103 - see a potential work-around in that bug. I will be able to fix this for the next release. Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Erik Piper > Sent: Wednesday, 5 November 2003 4:52 AM > To: spambayes@python.org > Subject: RE: [Spambayes] Spambayes - a few more steps forward in Czech > inplugin 0081 > > > Sure, Anthony. Shame I didn't remember to do so in the first place. > > Exception in thread Thread-1: > Traceback (most recent call last): > File "out1.pyz/threading", line 436, in __bootstrap > File "out1.pyz/threading", line 416, in run > File "out1.pyz/dialogs.async_processor", line 226, in thread_target > File "out1.pyz/train", line 169, in trainer > File "out1.pyz/manager", line 311, in Adopt > File "out1.pyz/shutil", line 170, in move > File "out1.pyz/shutil", line 83, in copy2 > File "out1.pyz/shutil", line 58, in copystat > UnicodeEncodeError: 'ascii' codec can't encode character > u'\xed' in position > 52: ordinal not in range(128) > > The precise same footprint in the traceback arises when I > repro the "soft > hang" of training in the Training tab, mentioned in the > original e-mail. > > At the risk of supplying useless information, here's the log > from stepping > through the configuration wizard: > > (...prelude...) > Loaded bayes database from 'C:\Documents and > Settings\Administrator\Data > aplikaci\SpamBayes\default_bayes_database.db' > Loaded message database from 'C:\Documents and > Settings\Administrator\Data > aplikaci\SpamBayes\default_message_database.db' > Bayes database initialized with 2 spam and 72 good messages > *** - message database has 44 messages - bayes has 74 - > something is screwey > > (...) > > GetNextPage with current 0 IDD_WIZARD_WELCOME > GetNextPage with current 3 IDD_WIZARD_FOLDERS_TRAIN > About to train with really_really_long_string_of_gobbledygook > Checked 72 in folder Ham - 72 new entries found. > Checked 2 in folder Junk E-Mail - 2 new entries found. > GetNextPage with current 4 IDD_WIZARD_TRAIN > GetNextPage with current 1 IDD_WIZARD_FOLDERS_WATCH > GetNextPage with current 2 IDD_WIZARD_FOLDERS_REST > Wizard Done! > Saving wizard changes > Saving configuration -> C:\Documents and Settings\Administrator\Data > aplikaci\SpamBayes\Microsoft Outlook Internet Settings.ini > > In other words, even though Spambayes throws me an error > dialog, no event is > logged. > > By the way, I can no longer repro the failure of the "Recover > from Spam" and > "Delete as Spam" buttons; if I can confirm a set of repro steps in the > future, I'll let y'all know. > > Erik Piper > > -----Original Message----- > From: Anthony Baxter [mailto:anthony@interlink.com.au] > Sent: Tuesday, November 04, 2003 10:52 AM > To: Erik Piper > Cc: spambayes@python.org > Subject: Re: [Spambayes] Spambayes - a few more steps forward > in Czech in > plugin 0081 > > > > >>> "Erik Piper" wrote > > > Home computer (W2K CZ, Outlook 2K CZ): Installation fine, > startup fine. > But: > > 1. clicking the Data Folder button in the Advanced tab > leads to an error > > dialog: > > SpamBayes > > 'ascii' codec can't encode character u'\xed' in > position 52: ordinal no > t in > > range(128) > > Can you post the entire traceback here? > > Anthony > > -- > Anthony Baxter > It's never too late to have a happy childhood. > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From skip at pobox.com Thu Nov 6 17:47:17 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Nov 6 17:47:29 2003 Subject: [Spambayes] Good Job In-Reply-To: <3khlqvopaug0bkg8jeqplmrk5nbsdu3c3f@4ax.com> References: <000001c3a4af$44c83770$0200000a@wwwafhgkzf27j5> <3khlqvopaug0bkg8jeqplmrk5nbsdu3c3f@4ax.com> Message-ID: <16298.53109.675350.641694@montanaro.dyndns.org> Richie> [Anna] >> I just wanted to write a quick thank you [...] you should be >> congratulated. Richie> Thanks for the thanks - it's messages like this (and the private Richie> yachts) that keep us going! Actually, it's not necessary to show your thanks by actually giving us your private yacht. We're nearly as happy if we can borrow it on weekends. ;-) can-you-please-make-sure-the-bar-is-stocked-this-time?-ly, y'rs, Skip From pekka.takala at pp.inet.fi Thu Nov 6 18:10:20 2003 From: pekka.takala at pp.inet.fi (Pekka Takala) Date: Thu Nov 6 18:09:34 2003 Subject: [Spambayes] Multi-user-capabilities of Spambayes Message-ID: <3FAAD4DC.4040701@pp.inet.fi> Hi I am using Spambayes to filter my e-mail. All worked fine before my girlfriend also wanted that the mail should be filtered for she also. How can I get it working so that when I start up my e-mail client Netscape the spambayes works with my account, and when my girlfriend uses Netscape to read her mail, the spambayes works with her account. The problem is that the web interface won't know which of us is using the system and so she can read my emails by just typing in localhost:8880. I have tried to script the netscape, only getting problems that it cannot create the pop3 cache folders to /home/ptakala/ or /home/phambu, depending on which one of us started the netscape. Could this be somehow fixed so that we get our mailbox caches to our own home folders? -- Pekka "Pihti" Takala Nothing can be so bad that you cannot find something good in it! 65XXX assembler programmer/developer, linux user From richie at entrian.com Thu Nov 6 18:26:24 2003 From: richie at entrian.com (Richie Hindle) Date: Thu Nov 6 18:26:38 2003 Subject: [Spambayes] Multi-user-capabilities of Spambayes In-Reply-To: <3FAAD4DC.4040701@pp.inet.fi> References: <3FAAD4DC.4040701@pp.inet.fi> Message-ID: [Pekka] > How can I get it working so that when I start up my e-mail client > Netscape the spambayes works with my account, and when my girlfriend > uses Netscape to read her mail, the spambayes works with her account. > The problem is that the web interface won't know which of us is using > the system and so she can read my emails by just typing in localhost:8880. The only way to let two people use sb_server without being to see each other's cached emails is to run two instances of spambayes, with each one's web interface running on a different port. In /home/ptakala/spambayes run "sb_server.py -u 8880 &", and in /home/phambu/spambayes run "sb_server.py -u 8881 &". Then you'll each have your own spambayes instance - you use http://localhost:8880/ and she uses http://localhost:8881/ This means you won't be sharing a database (though you could always copy your existing database into her environment to start with), but your training requirements are probably different anyway. If you're really paranoid about your girlfriend seeing your emails 8-) you can use the web configuration page to set up a password on the web interface, so that she can't go to http://localhost:8880/ and see your web interface without knowing your password. Make sure she doesn't have read permission to /home/ptakala/spambayes! -- Richie Hindle richie@entrian.com From delete at GoodmanAssociates.com Thu Nov 6 18:36:35 2003 From: delete at GoodmanAssociates.com (Seth Goodman) Date: Thu Nov 6 18:36:36 2003 Subject: [Spambayes] repeatable condition where SpamBayes misses new message In-Reply-To: <021b01c3a4b4$cebae810$0500a8c0@eden> Message-ID: > Wouldn't another solution simply be to set the background processing to a > much lower value? Remember the intention here is to prevent > conflicts with > the builtin rules - I have mine set around the defaults of a 1/2 seconds. > This way it will be processed before the preview-pane marks it as read. Thanks, Mark. I originally had the "processing start delay" timer set to 10 seconds because I had a POP3 proxy in place to do external DNSBL lookups. These lookups could be quite slow. Now I've decided to go all-or-nothing with SpamBayes, so I've removed the external proxy and set the "processing start delay" to 2.0 seconds and the "delay between processing items" to 1.0 seconds. This works much, much better. Thanks again. I'm just curious what the "delay between processing items" timer is meant for. Doesn't the "processing start delay", which is retriggerable (sorry, I'm a hardware guy), prevent SpamBayes from looking for new folder add events when new messages are arriving? Also, how does this work when I am receiving a large message that takes longer than the "processing start delay" to receive? Seth Goodman to reply, change delete to sethg From richie at entrian.com Thu Nov 6 18:40:19 2003 From: richie at entrian.com (Richie Hindle) Date: Thu Nov 6 18:40:33 2003 Subject: [Spambayes] DBRunRecoveryError: (-30982 possible cause In-Reply-To: <4.3.2.7.2.20031104104920.00b3d360@localhost> References: <4.3.2.7.2.20031104104920.00b3d360@localhost> Message-ID: [Stephen] > I have the error message > >DBRunRecoveryError: (-30982, 'DB_RUNRECOVERY: Fatal error, run database > >recovery -- fatal region error detected; run recovery') > > I think I can cause this reliably by clicking the train button and while > the programming is training, exiting the "web page" in the browser. When I do this, sb_server hangs up and I have to kill it using Ctrl+Break or Task Manager (I'm on Windows). No database corruption should occur as a result... ...But if this happened and you *didn't* kill it, but instead restarted it so you had two running, that could maybe cause database corruption. I haven't been able to make it happen in my environment, but it's not a bad theory. Is it possible that that's what happened, or did you either not see it hang, or always kill it before starting another? (I'm in the process of fixing the hanging problem.) -- Richie Hindle richie@entrian.com From cej at intech.com Thu Nov 6 20:48:39 2003 From: cej at intech.com (Christopher Jastram) Date: Thu Nov 6 20:49:04 2003 Subject: [Spambayes] SpamBayes + Cyrus question Message-ID: <3FAAF9F7.1020500@intech.com> I'm not sure if this is the right mailing list -- probably isn't. I've set up mail filtering with cyrus (shot myself in the foot a few times -- esp. with master.cf...rgggg). It's working, and I have an X-SpamBayes-Classification header working and functional. Now, spam is cool and great and immediately trashed. Ham is kept. Unsure is kept. Buuut -- is it possible to have Cyrus run each message through spambayes whenever I move a message from one folder to another? (This is probably not a good idea, for performance reasons, but I'd like to try it). Does anyone know of a subsystem in Cyrus that'll do the job? Thanks, chris From cej at intech.com Thu Nov 6 20:54:24 2003 From: cej at intech.com (Christopher Jastram) Date: Thu Nov 6 20:54:37 2003 Subject: [Spambayes] Client/Server filtering model? Anyone have code? Message-ID: <3FAAFB50.6010301@intech.com> Currently, I have a script that runs through every user's spool directory and; Everything in "Spam" or "Junk" is read as spam Things like the main inbox, sent, drafts and trash are discarded Everything else is read as ham. This script takes a significant amount of time to run (along the order of 30-40 minutes), is run on a nightly basis, and only processes a few users (just a couple test users -- once everyone switches from POP to IMAP, there'll be hell to pay unless I can figure out a faster way to do this.) The system load also jumped from 0.1 - 0.2 to a steady 1.6 - 1.8 with the addition of spam filtering (we get a lot of spam). An idea I had -- would it be possible to have *one* (multithreaded?) constantly running python server that reads mail, evaluates it (addes a classified or trained header) and passes it back? The advantage is that the database would not be re-read on every invokation. Another advantage might be in the outsourcing of the processing to another machine. Just an idea. I'll work on something like this myself when I get a roundtoit (always short on time), but I'm wondering if anyone has prototype code...? (Is this even a good idea?) Thanks, chris P.S. on the 'lot of spam' note: most of the mail comes to 10 years worth of employees who longer work here. We used to bounce it, but the resulting mail queue and processing time took our mail server to its knees. Repeatedly. Both Exchange and Postfix/Cyrus. Now there's a dead-letter box and a 3-day hold period instead of the default 5, which brings it down to manageable (sorta) levels. Anyone have any additional ideas or guides to using Postfix to drop blatantly spammy email? To give you a sense of what I'm looking for, here's an example; I cut incoming spam by a good 10 percent, just by dropping all messages coming from outside, but claiming to be from within our domain. Surely there's more such rules that I haven't thought of? From natsysco at cox.net Thu Nov 6 22:12:54 2003 From: natsysco at cox.net (Nat Talpis) Date: Thu Nov 6 22:12:13 2003 Subject: [Spambayes] STOP MY SUBSCRIPTION References: <7F976E05D3F5AB4ABFC9334D182F7B511E44D0@mail1.primeline.com> Message-ID: <006b01c3a4dd$0c953aa0$6501a8c0@natsysievqckcu> It's arrogant perfect people like you that drive normal people like us from these proceedings. ----- Original Message ----- From: "Andrew J. Coutermarsh" To: ; Sent: Wednesday, November 05, 2003 8:26 AM Subject: RE: [Spambayes] STOP MY SUBSCRIPTION > It never ceases to amaze me that some people just can't seem to follow > directions. > > Andrew J. Coutermarsh > Prime Resources Corp. > I.S. Department > Ph: (203) 331-9100 x3236 > Fx: (203) 551-3324 > andrewc@primeworld.com > > > -----Original Message----- > From: spambayes-bounces+andrewc=primeworld.com@python.org > [mailto:spambayes-bounces+andrewc=primeworld.com@python.org] On Behalf Of > natsysco@cox.net > Sent: Wednesday, November 05, 2003 11:28 AM > To: spambayes@python.org > Subject: [Spambayes] STOP MY SUBSCRIPTION > > STOP MY SUBSCRIPTION, YOU ARE ADDING SPAM TO MY EMAIL...L > > > > From: spambayes-request@python.org > > Date: 2003/10/31 Fri PM 09:28:44 EST > > To: natsysco@cox.net > > Subject: confirm a20473f683f3015a12504428b98d81bac3a0340b > > > > Mailing list subscription confirmation notice for mailing list > > Spambayes > > > > We have received a request from 68.104.61.72 for subscription of your > > email address, "natsysco@cox.net", to the spambayes@python.org mailing > > list. To confirm that you want to be added to this mailing list, > > simply reply to this message, keeping the Subject: header intact. Or > > visit this web page: > > > > > http://mail.python.org/mailman/confirm/spambayes/a20473f683f3015a12504428b98 > d81bac3a0340b > > > > > > Or include the following line -- and only the following line -- in a > > message to spambayes-request@python.org: > > > > confirm a20473f683f3015a12504428b98d81bac3a0340b > > > > Note that simply sending a `reply' to this message should work from > > most mail readers, since that usually leaves the Subject: line in the > > right form (additional "Re:" text in the Subject: is okay). > > > > If you do not wish to be subscribed from this list, please simply > > disregard this message. If you think you are being maliciously > > subscribed to the list, or have any other questions, send them to > > spambayes-owner@python.org. > > > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From dave at boost-consulting.com Thu Nov 6 23:12:08 2003 From: dave at boost-consulting.com (David Abrahams) Date: Thu Nov 6 23:20:15 2003 Subject: [Spambayes] Should I train on this sort of thing? Message-ID: I almost never look inside spam, but it seems as though much of what I get these days is just designed to confuse bayesian spam blockers. The enclosed is an example. Should I let SpamBayes train on these or are they actually ineffective? -------------- next part -------------- An embedded message was scrubbed... From: "Dawson" Subject: Re: %RND_UC_CHAR[2-8], he imagined that Date: Thu, 06 Nov 2003 10:06:13 -0600 Size: 1562 Url: http://mail.python.org/pipermail/spambayes/attachments/20031106/8e7925db/attachment.mht -------------- next part -------------- -- Dave Abrahams Boost Consulting www.boost-consulting.com From tim.one at comcast.net Thu Nov 6 23:50:04 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Nov 6 23:50:20 2003 Subject: [Spambayes] Should I train on this sort of thing? In-Reply-To: Message-ID: [David Abrahams] > I almost never look inside spam, but it seems as though much of what I > get these days is just designed to confuse bayesian spam blockers. Na, the purpose of random dictionary crap is to frustrate fingerprinting schemes. If oddball but legit words like "bituminous" and "chalkline" are actually part of your desired email life, they'll be scored hammy enough that a random appearance or two in spam isn't going to hurt anything. OTOH, if they're not part of your ham, then SpamBayes effectively ignores them (it ignores words whose spamprobs lie inside (.4, .6) by default). In short, this isn't an effective attack against a Bayesian filter. It's an effective attack against fingerprinting schemes, though (unless the spammer misconfigures their ratware and ends up generating the same collection of dictionary words every time -- look at enough of these, and you'll eventually find clues like $RANDOM and %RAND%, where they screwed things up so badly that *no* random cruft gets generated). > The enclosed is an example. Should I let SpamBayes train on these or > are they actually ineffective? It shouldn't hurt, and might help. SpamBayes deobfuscates some of the other tricks in your example, extracting the plain text: Free! No Obligation! Quotes available here! That's spammy stuff in almost anyone's database. bituminous only scores as a strong ham word in mine . back-to-the-coal-mine-ly y'rs - tim From neale at woozle.org Fri Nov 7 01:25:26 2003 From: neale at woozle.org (Neale Pickett) Date: Fri Nov 7 01:25:33 2003 Subject: [Spambayes] Undergraduate Alumni Questionnaire - Please Help In-Reply-To: <5.2.1.1.2.20031105151347.01c80c30@postoffice7.mail.cornell.edu> (Carol Ann Halseth's message of "Wed, 05 Nov 2003 15:14:33 -0500") References: <5.2.1.1.2.20031105151347.01c80c30@postoffice7.mail.cornell.edu> Message-ID: Then, "Kenny Pitt" writes: > OK, this is either a blatant case of mistaken identity, or the > strangest spam I've ever seen. :-) It looks like identity theft to me. This sort of thing will probably make spam filtering a whole lot more relevant to a lot of people. Scary. Carol Ann Halseth writes: > LAST: > FIRST: > SSN: > MAILING ADDRESS: > PHONE: > PERMANENT (Parent's) ADDRESS: > PERM. PHONE: > Position: > Employer: > Address: From neale at woozle.org Fri Nov 7 01:28:50 2003 From: neale at woozle.org (Neale Pickett) Date: Fri Nov 7 01:28:59 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / No module named zlib ? In-Reply-To: <200311062319.10448.olli.marttila@kolumbus.fi> (Olli J. Marttila's message of "Thu, 6 Nov 2003 23:19:10 +0200") References: <200311062319.10448.olli.marttila@kolumbus.fi> Message-ID: "Olli J. Marttila" writes: > Yet I have both zlib-1.1.4-105 and zlib-devel-1.1.4-105 > installed from SuSE home site. > > So Iam still at a loss what to do. Can anyone help me? Olli, It sounds like SuSE bundles the python zlib module in a different package. It won't be called zlib or zlib-devel, but it might be called something like python-zlib or zlib-python. You need to install that package to get the error message to go away. Your other option, like I'm starting to think we should tell all users of RPM-based systems, you can install Python 2.3 from source. Download the source code, untar it, and run ./configure; make install That will get everything you need installed. Neale From tpeters at mixcom.com Fri Nov 7 01:36:04 2003 From: tpeters at mixcom.com (Tom Peters) Date: Fri Nov 7 01:36:13 2003 Subject: [Spambayes] 1.0a6 and Norton antivirus Message-ID: <5.1.0.14.0.20031107002648.02682cc8@localhost> I had oddness with 1.0a5 and a6 and Norton antivirus under win2000 and XP. Once I got it configured, Norton would process each and every message twice. Once it dawned on me that it had hooked into ports 25 and 110 on its own, was processing each message, then passing them on to "real" ports 25 and 110, I changed ports. I had Eudora Pro talk to ports 3025 and 3110 (chosen arbitrarily) and set SpamBayes to proxy them to/from 25 and 110. Behaves very nicely now. Eudora can be told from command line which ini file to look at for settings, so I can start it proxied or direct, depending which icon I pick. Since I got it working with 1.0a5, I've never used to non-proxied icon again. Any problems with this approach? [Love] The adoration of his heart had been to her only as the perfume of a wild flower, which she had carelessly crushed' with her foot in passing. --Longfellow (1819-1892) --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From JohnH at snetworking.com Fri Nov 7 01:42:20 2003 From: JohnH at snetworking.com (John Hall) Date: Fri Nov 7 01:43:10 2003 Subject: [Spambayes] Is there code to intercept email to run on Windows? Message-ID: <223286A46C32EE42BF7C2E4177EAD42605C78A@sylvester2.snetworking.com> I presume that most folks are not using Windows platforms for the Bayesian Filtering. However, I am in the situation where I am. Is there an SMTP proxy or something like one (commercial is OK and probably preferred) that I could use to intercept the incoming email before it gets to our Exchange Server to allow me to experiment with some Bayesian filtering without having to write some code to listen to ports and send to ports (for which I have no clue). Any recommendations would be appreciated. Thanks, John From vanhorn at whidbey.com Fri Nov 7 02:01:43 2003 From: vanhorn at whidbey.com (G. Armour Van Horn) Date: Fri Nov 7 02:02:01 2003 Subject: [Spambayes] Is there code to intercept email to run on Windows? References: <223286A46C32EE42BF7C2E4177EAD42605C78A@sylvester2.snetworking.com> Message-ID: <3FAB4356.316C9700@whidbey.com> John, Lots of us use Windows. I run two SpamBayes POP3 proxy on two different Windows 2000 workstations, one handles four accounts, the other three. The most common configuration for SpamBayes is the Outlook plugin, which only runs on Windows but is only useful for those running Outlook (not Outlook Express) in recent versions. You will have to install Python on your computer for this to work, I believe there are pointers to this on the spambayes.sourceforge.net site. I happened to use the Python from Active State, which works fine here. Van John Hall wrote: > I presume that most folks are not using Windows platforms for the > Bayesian Filtering. However, I am in the situation where I am. > > Is there an SMTP proxy or something like one (commercial is OK and > probably preferred) that I could use to intercept the incoming email > before it gets to our Exchange Server to allow me to experiment with > some Bayesian filtering without having to write some code to listen to > ports and send to ports (for which I have no clue). > > Any recommendations would be appreciated. > > Thanks, > > John > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -- ---------------------------------------------------------- Sign up now for Quotes of the Day, a handful of quotations on a theme delivered every morning. Enlightenment! Daily, for free! mailto:twisted@whidbey.com?subject=Subscribe_QOTD For web hosting and maintenance, visit Van's home page: http://www.domainvanhorn.com/van/ ---------------------------------------------------------- From tpeters at mixcom.com Fri Nov 7 02:02:08 2003 From: tpeters at mixcom.com (Tom Peters) Date: Fri Nov 7 02:02:21 2003 Subject: [Spambayes] Saving hammie.db when upgrading? Message-ID: <5.1.0.14.0.20031107005438.00b8c1b0@localhost> Nothing I've seen in the FAQ so far seems to address the question (else I'm just not seeing it): Was hammie.db preserved when I upgraded to 1.0a6? I have 1.0a7 downloaded and want to install it in the next day or so. Can I preserve the training I've done so far? How? Does running setup.py install find and copy/convert the db file? [Change] Nothing that I can do will change the structure of the universe. But maybe, by raising my voice I can help the greatest of all causes -- goodwill among men and peace on earth. --Albert Einstein --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From sabuck at comcast.net Fri Nov 7 08:42:36 2003 From: sabuck at comcast.net (Sherry Buckowing) Date: Fri Nov 7 08:42:42 2003 Subject: [Spambayes] help Message-ID: <000001c3a535$011fb210$6801a8c0@sherry> Hi, Ok, I did something stupid and deleted the spam folder out of my deleted items. Now that I realize my error I can't seem to correct it. I have uninstalled and reinstalled the program etc. but now I keep getting a message that says I need to configure my spam folder. Can you tell me how to fix this? SB sabuck@comcast.net From kennypitt at hotmail.com Fri Nov 7 09:10:45 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Nov 7 09:11:12 2003 Subject: [Spambayes] STOP MY SUBSCRIPTION In-Reply-To: <006b01c3a4dd$0c953aa0$6501a8c0@natsysievqckcu> Message-ID: Nat Talpis wrote: > It's arrogant perfect people like you that drive normal people like > us from these proceedings. Let's not turn this into a flame war, folks. This is typically a very civil group. > ----- Original Message ----- > From: "Andrew J. Coutermarsh" > To: ; > Sent: Wednesday, November 05, 2003 8:26 AM > Subject: RE: [Spambayes] STOP MY SUBSCRIPTION > > >> It never ceases to amaze me that some people just can't seem to >> follow directions. OK, maybe that could have been said a little more tactfully (), but the point is that all the information you need to stop your subscription is included right there in the message you replied to. >> -----Original Message----- >> From: spambayes-bounces+andrewc=primeworld.com@python.org >> [mailto:spambayes-bounces+andrewc=primeworld.com@python.org] On >> Behalf Of natsysco@cox.net Sent: Wednesday, November 05, 2003 11:28 >> AM >> To: spambayes@python.org >> Subject: [Spambayes] STOP MY SUBSCRIPTION >> >> STOP MY SUBSCRIPTION, YOU ARE ADDING SPAM TO MY EMAIL...L >>> >>> From: spambayes-request@python.org >>> Date: 2003/10/31 Fri PM 09:28:44 EST >>> To: natsysco@cox.net >>> Subject: confirm a20473f683f3015a12504428b98d81bac3a0340b >>> >>> Mailing list subscription confirmation notice for mailing list >>> Spambayes >>> >>> We have received a request from 68.104.61.72 for subscription of >>> your email address, "natsysco@cox.net", to the spambayes@python.org >>> mailing list. To confirm that you want to be added to this mailing >>> list, simply reply to this message, keeping the Subject: header >>> intact. Or visit this web page: >>> [snip] >>> >>> If you do not wish to be subscribed from this list, please simply >>> disregard this message. If you think you are being maliciously >>> subscribed to the list, or have any other questions, send them to >>> spambayes-owner@python.org. I have been participating on this list for several months now and have rarely if ever seen a spam message sent to it. As this is a subscription-based mailing list, the only way to receive mail from it is if you have been subscribed, either by your own choice or by someone subscribing you without your permission. This confirmation e-mail is intended to prevent such malicious subscriptions by requiring the person at the destination e-mail address to confirm that they did in fact want the subscription. If someone else subscribed you, or if you subscribed yourself and then changed your mind, you can simply ignore the confirmation message as stated in the last paragraph and you will not receive any further messages. If you are certain that someone else subscribed you without your permission, then you can report that to the list owner at the specified e-mail address rather than to the entire list. If you have already confirmed your subscription, then a link is included at the bottom of every message received from the list that will allow you to unsubscribe yourself. -- Kenny Pitt From kennypitt at hotmail.com Fri Nov 7 09:26:04 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Nov 7 09:26:26 2003 Subject: [Spambayes] help In-Reply-To: <000001c3a535$011fb210$6801a8c0@sherry> Message-ID: Sherry Buckowing wrote: > Ok, I did something stupid and deleted the spam folder out of my > deleted items. Now that I realize my error I can't seem to correct > it. I have uninstalled and reinstalled the program etc. but now I > keep getting a message that says I need to configure my spam folder. > Can you tell me how to fix this? Sure. SpamBayes locates your spam folder using a unique identifier assigned by Outlook. When you recreate your spam folder, it is given a different identifier even if you give it the same name, so SpamBayes can't find it. Note that uninstalling and reinstalling SpamBayes does not remove your configuration files, so that's why it is still looking for your old folder. After you recreate the folder, just open SpamBayes Manager from the SpamBayes dropdown on your toolbar. Go to the Filtering tab and look in the Certain Spam section. You'll probably see something like "Moved to folder ". Just click the Browse button next to this and select your new spam folder from the list. Now just close the SpamBayes Manager and you should be all set. -- Kenny Pitt From kennypitt at hotmail.com Fri Nov 7 09:40:15 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Nov 7 09:40:38 2003 Subject: [Spambayes] Undergraduate Alumni Questionnaire - Please Help In-Reply-To: Message-ID: Neale Pickett wrote: > Then, "Kenny Pitt" writes: > >> OK, this is either a blatant case of mistaken identity, or the >> strangest spam I've ever seen. :-) > > It looks like identity theft to me. This sort of thing will probably > make spam filtering a whole lot more relevant to a lot of people. > > Scary. > > > Carol Ann Halseth writes: > >> LAST: >> FIRST: >> SSN: >> MAILING ADDRESS: >> PHONE: >> PERMANENT (Parent's) ADDRESS: >> PERM. PHONE: >> Position: >> Employer: >> Address: Actually, in this instance it did turn out to be a legitimate case of mistaken identity. I got a personal message from her explaining that she had done a Google search for a former student in her department, and this list was the only e-mail address that popped up. You can go to the Cornell University website at www.cornell.edu and do a people search for Carol Ann Halseth, and she is a legitimate member of the faculty. However, this is exactly the sort of thing that not-so-legitimate people will take advantage of for identity theft or other scams. IMHO, not the kind of transaction that should be conducted via e-mail, and I would definitely want to confirm who I was sending to and why before I replied with all that sensitive info. -- Kenny Pitt From kennypitt at hotmail.com Fri Nov 7 09:46:47 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Nov 7 09:47:10 2003 Subject: [Spambayes] Client/Server filtering model? Anyone have code? In-Reply-To: <3FAAFB50.6010301@intech.com> Message-ID: Christopher Jastram wrote: > Currently, I have a script that runs through every user's spool > directory and; > Everything in "Spam" or "Junk" is read as spam > Things like the main inbox, sent, drafts and trash are discarded > Everything else is read as ham. > > An idea I had -- would it be possible to have *one* (multithreaded?) > constantly running python server that reads mail, evaluates it (addes > a classified or trained header) and passes it back? The advantage is > that the database would not be re-read on every invokation. Another > advantage might be in the outsourcing of the processing to another > machine. I don't use it myself so can't give you any details, but you might look into the sb_xmlrpcserver script. IIRC, this is just the sort of situation it is intended for. It listens on a particular TCP port and acts as a Web service that receives a message and classifies it. It should be possible to run it either locally or on a separate machine. -- Kenny Pitt From cdellario at whatif-productions.com Fri Nov 7 09:50:58 2003 From: cdellario at whatif-productions.com (Chris Dellario) Date: Fri Nov 7 09:50:07 2003 Subject: [Spambayes] STOP MY SUBSCRIPTION Message-ID: <113EE4C6211B1D41A34E54A089F4795C0AFCA8@mailbox.whatif-productions.com> Gee, I never realized that you had to be both arrogant and perfect to follow simple directions. ------------------------------------------------------------ Chris Dellario Lead Engineer Whatif Productions LLC http://www.whatif.info (617) 977-0115 -----Original Message----- From: Nat Talpis [mailto:natsysco@cox.net] Sent: Thursday, November 06, 2003 10:13 PM To: Andrew J. Coutermarsh; spambayes@python.org Subject: Re: [Spambayes] STOP MY SUBSCRIPTION It's arrogant perfect people like you that drive normal people like us from these proceedings. ----- Original Message ----- From: "Andrew J. Coutermarsh" To: ; Sent: Wednesday, November 05, 2003 8:26 AM Subject: RE: [Spambayes] STOP MY SUBSCRIPTION > It never ceases to amaze me that some people just can't seem to follow > directions. > > Andrew J. Coutermarsh > Prime Resources Corp. > I.S. Department > Ph: (203) 331-9100 x3236 > Fx: (203) 551-3324 > andrewc@primeworld.com > > > -----Original Message----- > From: spambayes-bounces+andrewc=primeworld.com@python.org > [mailto:spambayes-bounces+andrewc=primeworld.com@python.org] On Behalf Of > natsysco@cox.net > Sent: Wednesday, November 05, 2003 11:28 AM > To: spambayes@python.org > Subject: [Spambayes] STOP MY SUBSCRIPTION > > STOP MY SUBSCRIPTION, YOU ARE ADDING SPAM TO MY EMAIL...L > > > > From: spambayes-request@python.org > > Date: 2003/10/31 Fri PM 09:28:44 EST > > To: natsysco@cox.net > > Subject: confirm a20473f683f3015a12504428b98d81bac3a0340b > > > > Mailing list subscription confirmation notice for mailing list > > Spambayes > > > > We have received a request from 68.104.61.72 for subscription of your > > email address, "natsysco@cox.net", to the spambayes@python.org mailing > > list. To confirm that you want to be added to this mailing list, > > simply reply to this message, keeping the Subject: header intact. Or > > visit this web page: > > > > > http://mail.python.org/mailman/confirm/spambayes/a20473f683f3015a1250442 8b98 > d81bac3a0340b > > > > > > Or include the following line -- and only the following line -- in a > > message to spambayes-request@python.org: > > > > confirm a20473f683f3015a12504428b98d81bac3a0340b > > > > Note that simply sending a `reply' to this message should work from > > most mail readers, since that usually leaves the Subject: line in the > > right form (additional "Re:" text in the Subject: is okay). > > > > If you do not wish to be subscribed from this list, please simply > > disregard this message. If you think you are being maliciously > > subscribed to the list, or have any other questions, send them to > > spambayes-owner@python.org. > > > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From AndrewC at primeworld.com Fri Nov 7 10:04:14 2003 From: AndrewC at primeworld.com (Andrew J. Coutermarsh) Date: Fri Nov 7 10:08:25 2003 Subject: [Spambayes] STOP MY SUBSCRIPTION Message-ID: <7F976E05D3F5AB4ABFC9334D182F7B511E44D6@mail1.primeline.com> Nat Talpis: > It's arrogant perfect people like you that drive normal people > like us from these proceedings. And see, I don't think it's too arrogant or unreasonable to expect people to be able to read directions. If you're unwilling to read the directions on how to install your television, don't sue the TV manufacturer for the hernia you got while trying to lift it out of its box. Andrew J. Coutermarsh Prime Resources Corp. I.S. Department Ph: (203) 331-9100 x3236 Fx: (203) 551-3324 andrewc@primeworld.com > -----Original Message----- > From: Nat Talpis [mailto:natsysco@cox.net] > Sent: Thursday, November 06, 2003 10:13 PM > To: Andrew J. Coutermarsh; spambayes@python.org > Subject: Re: [Spambayes] STOP MY SUBSCRIPTION > > It's arrogant perfect people like you that drive normal people like us > from these proceedings. > > ----- Original Message ----- > From: "Andrew J. Coutermarsh" > To: ; > Sent: Wednesday, November 05, 2003 8:26 AM > Subject: RE: [Spambayes] STOP MY SUBSCRIPTION > > > It never ceases to amaze me that some people just can't seem to follow > directions. > > -----Original Message----- > From: spambayes-bounces+andrewc=primeworld.com@python.org > [mailto:spambayes-bounces+andrewc=primeworld.com@python.org] On Behalf > Of > natsysco@cox.net > Sent: Wednesday, November 05, 2003 11:28 AM > To: spambayes@python.org > Subject: [Spambayes] STOP MY SUBSCRIPTION > > STOP MY SUBSCRIPTION, YOU ARE ADDING SPAM TO MY EMAIL...L From bob at jellyvision.com Fri Nov 7 11:39:45 2003 From: bob at jellyvision.com (Bob Chojnacki) Date: Fri Nov 7 11:35:47 2003 Subject: [Spambayes] Outlook Envelope Tray Icon Message-ID: Hi, I really like SpamBayes and the Outlook plugin. It is working much better than other spam filters, considering I get 85-95% spam. I am currently using version 008.1. I read your FAQ about the problems with making the Outlook envelope tray icon go away. (I am also not sure if this is the right email address to send this comment, so please bear with me if it isn't.) Is the following link helpful? (Keep in mind that I am not a Windows programmer): http://www.slipstick.com/dev/code/clearenvicon.htm Thanks, Bob P.S.: http://www.techhit.com/autoread/ may also be of use. ------- Page text attached ------ To clear the Outlook envelope icon from the Windows system tray A common issue for people who subscribe to many mailing lists or use Net Folders to share data is controlling when Outlook puts an envelope in the system tray in the right side of the Windows task bar. Typically, they want to see the envelope icon only when they have new mail in the Inbox that is not from one of those lists and not part of a Net Folders transmission. Since the envelope goes away after a user opens the latest items, you might think that setting the Unread property on those lower priority items to False would be enough to do the job. However, it doesn't suffice. Even if you use code or a Rules Wizard rule to mark items as Read, the envelope icon persists. The icon placed in the system tray is governed by hidden window that developers don't have direct access to via the Outlook Object Model or Collaboration Data Objects. Therefore, you need to go outside Outlook for a solution. Outlook MVP Neo has come up with an Outlook 2000/2002 VBA method that uses Windows API calls to remove the icon from the system tray. You can copy and paste the code below to your project or download the clear_icon.txt file. One application of the RemoveNewMailIcon subroutine might be to run some code when the Application.NewMail event fires to see if any "interesting" new items have actually come in -- "interesting" here meaning items for which you want to see the envelope in the system tray. If not, call the RemoveNewMailIcon subroutine to clear the icon: Private Sub Application_NewMail() ' add some code to check whether the latest items are "interesting" ' if not interesting, clear the envelope icon Call RemoveNewMailIcon End Sub It turns out that integrating Neo's code into Outlook events isn't as straightforward as you might like. A discussion in the microsoft.public.outlook.program_vba newsgroup uncovered some apparent timing issues. Code | More Information Code ' Code sample by Outlook MVP "Neo" ' Removes the New Mail icon from the Windows system tray, ' and resets Outlook's new mail notification engine. ' Tested against Outlook 2000 (IMO) and Outlook 2002 (POP Account) ' Send questions and comments to neo@mvps.org ' WARNING: Due to the use of AddressOf, code must ' go into a module and not ThisOutlookSession or ' a class module ' Entry Point is RemoveNewMailIcon. Option Explicit Public Const WUM_RESETNOTIFICATION As Long = &H407 'Required Public constants, types & declares 'for the Shell_Notify API method Public Const NIM_ADD As Long = &H0 Public Const NIM_MODIFY As Long = &H1 Public Const NIM_DELETE As Long = &H2 Public Const NIF_ICON As Long = &H2 'adding an ICON Public Const NIF_TIP As Long = &H4 'adding a TIP Public Const NIF_MESSAGE As Long = &H1 'want return messages ' Structure needed for Shell_Notify API Type NOTIFYICONDATA cbSize As Long hwnd As Long uID As Long uFlags As Long uCallbackMessage As Long hIcon As Long szTip As String * 64 End Type Declare Function SendMessage Lib "user32" Alias "SendMessageA" _ (ByVal hwnd As Long, ByVal wMsg As Long, _ ByVal wParam As Integer, ByVal lParam As Any) As Long Declare Function GetClassName Lib "user32" _ Alias "GetClassNameA" _ (ByVal hwnd As Long, _ ByVal lpClassName As String, _ ByVal nMaxCount As Long) As Long Declare Function GetWindowTextLength Lib "user32" _ Alias "GetWindowTextLengthA" _ (ByVal hwnd As Long) As Long Declare Function GetWindowText Lib "user32" _ Alias "GetWindowTextA" _ (ByVal hwnd As Long, _ ByVal lpString As String, _ ByVal cch As Long) As Long Declare Function EnumWindows Lib "user32" _ (ByVal lpEnumFunc As Long, _ ByVal lParam As Long) As Long Declare Function Shell_NotifyIcon Lib "shell32.dll" _ Alias "Shell_NotifyIconA" _ (ByVal dwMessage As Long, _ lpData As NOTIFYICONDATA) As Long Declare Function FindWindow Lib "user32" Alias "FindWindowA" _ (ByVal lpClassName As String, _ ByVal lpWindowName As String) As Long ' This is the entry point that makes it happen Sub RemoveNewMailIcon() EnumWindows AddressOf EnumWindowProc, 0 End Sub Public Function EnumWindowProc(ByVal hwnd As Long, _ ByVal lParam As Long) As Long 'Do stuff here with hwnd Dim sClass As String Dim sIDType As String Dim sTitle As String Dim hResult As Long sTitle = GetWindowIdentification(hwnd, sIDType, sClass) If sTitle = "rctrl_renwnd32" Then hResult = KillNewMailIcon(hwnd) End If If hResult Then EnumWindowProc = False ' Reset the new mail notification engine Call SendMessage(hwnd, WUM_RESETNOTIFICATION, 0&, 0&) Else EnumWindowProc = True End If End Function Private Function GetWindowIdentification(ByVal hwnd As Long, _ sIDType As String, _ sClass As String) As String Dim nSize As Long Dim sTitle As String 'get the size of the string required 'to hold the window title nSize = GetWindowTextLength(hwnd) 'if the return is 0, there is no title If nSize > 0 Then sTitle = Space$(nSize + 1) Call GetWindowText(hwnd, sTitle, nSize + 1) sIDType = "title" sClass = Space$(64) Call GetClassName(hwnd, sClass, 64) Else 'no title, so get the class name instead sTitle = Space$(64) Call GetClassName(hwnd, sTitle, 64) sClass = sTitle sIDType = "class" End If GetWindowIdentification = TrimNull(sTitle) End Function Private Function TrimNull(startstr As String) As String Dim pos As Integer pos = InStr(startstr, Chr$(0)) If pos Then TrimNull = Left(startstr, pos - 1) Exit Function End If 'if this far, there was 'no Chr$(0), so return the string TrimNull = startstr End Function Private Function KillNewMailIcon(ByVal hwnd As Long) As Boolean Dim pShell_Notify As NOTIFYICONDATA Dim hResult As Long 'setup the Shell_Notify structure pShell_Notify.cbSize = Len(pShell_Notify) pShell_Notify.hwnd = hwnd pShell_Notify.uID = 0 ' Remove it from the system tray and catch result hResult = Shell_NotifyIcon(NIM_DELETE, pShell_Notify) If (hResult) Then KillNewMailIcon = True Else KillNewMailIcon = False End If End Function More Information AutoRead for Outlook -- Custom action for Rules Wizard to mark messages that meet certain criteria as read and suppress the new mail indicator in the system tray. From parz at shaw.ca Fri Nov 7 12:38:21 2003 From: parz at shaw.ca (Parzival) Date: Fri Nov 7 12:41:09 2003 Subject: [Spambayes] How does Spambayes deal with those "random" words in spam? Message-ID: <200311071138.21673.parz@shaw.ca> Lots of spam contains "random" words in the subject and visible or hidden in the message body. Since each spam contains different random strings, these words are very likely not to re-appear in subsequent spam. Does this reduce the effectiveness of the classification? A human seeing such a message with garbage words would immediatly recognize it as spam. Could the classfier be extended to assign higher spam ratings to messages containing a large amount of "words which have never been seen"? Possibly a user could pre-seed the classifier with a dictionary of words in his/her language and/or jargon. -- Parzival From rmalayter at bai.org Fri Nov 7 12:49:26 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Fri Nov 7 12:49:32 2003 Subject: [Spambayes] Is there code to intercept email to run on Windows? Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C3761@cliff.bai.org> http://assp.sourceforge.net is a decent open-source Bayesian SMTP filter. It requires Perl to be installed, and it needs to sit in front of your SMTP gateway, which makes testing on live mail a bit hard unless you can send a copy of your incoming mail to it. Setup isn't exactly easy, but most IT types can probably manage. http://Spamassassin.org also has rules-based, blacklist, and even bayesian filtering available at the SMTP level, but it is much harder to install and use, especially on Windows machines. Someone also once pointed out an SMTP plug-in that was available for http://popfile.sourceforge.net, but I haven't been able to find a recent version of that. I'm playing with assp in a test environment right now, and I like what I see so far. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org] On Behalf Of John Hall > Sent: Friday, November 07, 2003 12:42 AM > To: spambayes@python.org > Subject: [Spambayes] Is there code to intercept email to run > on Windows? > > I presume that most folks are not using Windows platforms for the > Bayesian Filtering. However, I am in the situation where I am. > > Is there an SMTP proxy or something like one (commercial is OK and > probably preferred) that I could use to intercept the incoming email > before it gets to our Exchange Server to allow me to experiment with > some Bayesian filtering without having to write some code to listen to > ports and send to ports (for which I have no clue). > > Any recommendations would be appreciated. > > Thanks, > > John > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From mikei at gwnsoftware.com Fri Nov 7 13:02:22 2003 From: mikei at gwnsoftware.com (Mike Iles) Date: Fri Nov 7 13:02:11 2003 Subject: [Spambayes] Incremental Training Message-ID: <49E95F5EE462824597FA4D82F220F5AD013E35@cinder.toronto.kinitos.com> I've been running SpamBayes for a few weeks now and find it to be extremely accurate, even with a relatively low training volume (500 good, 80 spam). Thanks for putting out such a great tool! My question is about the incremental training: it appears to me that when SpamBayes classifies a message as spam it doesn't also train against it. I would think that this would be a very nice attribute: if the spam I was receiving was evolving in a given direction SpamBayes would then track it without my intervention. I realize that I can continue to train by occasionally unchecking the 'rebuild database' box and clicking the 'train' button again. Happily, this seems to recognize the duplicates and doesn't train twice against them. In any case, am I missing a configuration option to enable this? If not, is the method I've described above the recommended way to continue to train against the spam I'm receiving? Many thanks, Mike. From kennypitt at hotmail.com Fri Nov 7 13:13:34 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Nov 7 13:13:57 2003 Subject: [Spambayes] Outlook Envelope Tray Icon In-Reply-To: Message-ID: Bob Chojnacki wrote: > Hi, > > I really like SpamBayes and the Outlook plugin. It is working much > better than other spam filters, considering I get 85-95% spam. I am > currently using version 008.1. I read your FAQ about the problems > with making the Outlook envelope tray icon go away. (I am also not > sure if this is the right email address to send this comment, so > please bear with me if it isn't.) > > Is the following link helpful? (Keep in mind that I am not a Windows > programmer): > > http://www.slipstick.com/dev/code/clearenvicon.htm Thanks for the link. I created the following code to implement this in the Outlook plugin and attached it to a menu item for testing. It was, in fact, successful in removing the new mail envelope from the taskbar. Now, the *really* tricky part is figuring out when to remove the icon. ==================== def RemoveNewMailIcon(): win32gui.EnumWindows(_removeIconCallback, None) def _removeIconCallback(hwnd, extra): # Check for Outlook window class. if win32gui.GetClassName(hwnd) == "rctrl_renwnd32": # Got the correct class, but we need to make sure window title is # empty because there may be other top-level Outlook windows. if win32gui.GetWindowText(hwnd) == "": return not _killNewMailIcon(hwnd) else: return True else: return True WUM_RESETNOTIFICATION = win32con.WM_USER + 7 def _killNewMailIcon(hwnd): nid = (hwnd, 0) if not win32gui.Shell_NotifyIcon(win32gui.NIM_DELETE, nid): return False else: win32gui.SendMessage(hwnd, WUM_RESETNOTIFICATION, 0, 0) return True ==================== -- Kenny Pitt From vanhorn at whidbey.com Fri Nov 7 13:15:00 2003 From: vanhorn at whidbey.com (G. Armour Van Horn) Date: Fri Nov 7 13:15:23 2003 Subject: [Spambayes] How does Spambayes deal with those "random" words inspam? References: <200311071138.21673.parz@shaw.ca> Message-ID: <3FABE124.9F70F8A7@whidbey.com> This was tested in the early days, and the conclusion was to ignore them. Words that appear only once in the corpus are known as "hapax legemona" (if that's not correct it is close) and casually referred to be the testers as "hapaxes". They have no effect at all, because a hapax will have a score of .5, and the default is to ignore anything between .4 and .6. The non-random words the spam includes will have scores that are meaningful. Leaving the hapaxes in the database doubtless increases the size, but there's nothing that can be done about that easily. I could speculate that a mechanism to delete hapaxes that have been in the database for some length of time would cure this, but that would mean timestamping every entry in the database. You can't just delete all hapaxes immediately because that would prevent any additional words entering the database at all. Van Parzival wrote: > Lots of spam contains "random" words in the subject and visible or hidden in > the message body. Since each spam contains different random strings, these > words are very likely not to re-appear in subsequent spam. Does this reduce > the effectiveness of the classification? > > A human seeing such a message with garbage words would immediatly recognize it > as spam. Could the classfier be extended to assign higher spam ratings to > messages containing a large amount of "words which have never been seen"? > Possibly a user could pre-seed the classifier with a dictionary of words in > his/her language and/or jargon. > > -- Parzival > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -- ---------------------------------------------------------- Sign up now for Quotes of the Day, a handful of quotations on a theme delivered every morning. Enlightenment! Daily, for free! mailto:twisted@whidbey.com?subject=Subscribe_QOTD For web hosting and maintenance, visit Van's home page: http://www.domainvanhorn.com/van/ ---------------------------------------------------------- From wsy at merl.com Fri Nov 7 13:31:12 2003 From: wsy at merl.com (Bill Yerazunis) Date: Fri Nov 7 13:31:17 2003 Subject: [Spambayes] How does Spambayes deal with those "random" words inspam? In-Reply-To: <3FABE124.9F70F8A7@whidbey.com> (vanhorn@whidbey.com) References: <200311071138.21673.parz@shaw.ca> <3FABE124.9F70F8A7@whidbey.com> Message-ID: <200311071831.hA7IVCX02341@localhost.localdomain> From: "G. Armour Van Horn" This was tested in the early days, and the conclusion was to ignore them. Words that appear only once in the corpus are known as "hapax legemona" (if that's not correct it is close) and casually referred to be the testers as "hapaxes". They have no effect at all, because a hapax will have a score of .5, and the default is to ignore anything between .4 and .6. The non-random words the spam includes will have scores that are meaningful. Good memory there! (thanks to Google and Bartleby.com:) Hapax Legomenon - from the greek hapax, once + legomenon, neuter sing. passive participle of legein, to count. NOUN: A word or form that occurs only once in the recorded corpus of a given language. Another approach is to give hapaxes very low weight, but ignore nothing, not even hapaxes. For example, the current incarnation of CRM114 would give a single-word hapax a probabalistic weight of .5 +- .03125 (that's what it works out to be). Note that this isn't quite _ignored_, but it's very close. ---------------- Now, there's another possibility- "fnord-counting". This is taking the incoming text and running it through the equivalent of a spellchecker; any word (that is, [a-zA-Z]+ ) that isn't recognized is changed to "fnord". - Texts you can easily read have near-zero fnords. - Texts with a lot of anti-filter noise and random words have high fnord counts. - Texts in another language are almost all fnords. It might be useful. ;) The code for fnord-counting is latent in CRM114 mailfilter but not turned on. -Bill Yerazunis From bob at jellyvision.com Fri Nov 7 13:40:33 2003 From: bob at jellyvision.com (Bob Chojnacki) Date: Fri Nov 7 13:37:09 2003 Subject: [Spambayes] Outlook Envelope Tray Icon In-Reply-To: <20031107121504.7ea42d3ec3ef466293ceca3f8ae215f9.in@ansel.jellyvision.com> Message-ID: > Now, the *really* tricky part is figuring out when to remove the icon. I noticed right after I sent my email (blush) the comment in their code ' add some code to check whether the latest items are "interesting" The comment is akin to the old Steve Martin comedy routine: "How to become a millionaire. First, get a million dollars..." Sorry about that. Bob > -----Original Message----- > From: Kenny Pitt [mailto:kennypitt@hotmail.com] > Sent: Friday, November 07, 2003 12:14 PM > To: 'Bob Chojnacki'; spambayes@python.org > Cc: spambayes-dev@python.org > Subject: RE: [Spambayes] Outlook Envelope Tray Icon > > > Bob Chojnacki wrote: > > Hi, > > > > I really like SpamBayes and the Outlook plugin. It is working much > > better than other spam filters, considering I get 85-95% spam. I am > > currently using version 008.1. I read your FAQ about the problems > > with making the Outlook envelope tray icon go away. (I am also not > > sure if this is the right email address to send this comment, so > > please bear with me if it isn't.) > > > > Is the following link helpful? (Keep in mind that I am not a Windows > > programmer): > > > > http://www.slipstick.com/dev/code/clearenvicon.htm > > Thanks for the link. I created the following code to implement this in > the Outlook plugin and attached it to a menu item for testing. It was, > in fact, successful in removing the new mail envelope from the taskbar. > Now, the *really* tricky part is figuring out when to remove the icon. > > ==================== > > def RemoveNewMailIcon(): > win32gui.EnumWindows(_removeIconCallback, None) > > def _removeIconCallback(hwnd, extra): > # Check for Outlook window class. > if win32gui.GetClassName(hwnd) == "rctrl_renwnd32": > # Got the correct class, but we need to make sure window title > is > # empty because there may be other top-level Outlook windows. > if win32gui.GetWindowText(hwnd) == "": > return not _killNewMailIcon(hwnd) > else: > return True > else: > return True > > WUM_RESETNOTIFICATION = win32con.WM_USER + 7 > def _killNewMailIcon(hwnd): > nid = (hwnd, 0) > if not win32gui.Shell_NotifyIcon(win32gui.NIM_DELETE, nid): > return False > else: > win32gui.SendMessage(hwnd, WUM_RESETNOTIFICATION, 0, 0) > return True > > ==================== > > -- > Kenny Pitt From rmalayter at bai.org Fri Nov 7 13:47:54 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Fri Nov 7 13:47:57 2003 Subject: [Spambayes] Outlook Envelope Tray Icon Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C3765@cliff.bai.org> > From: Bob Chojnacki > Subject: RE: [Spambayes] Outlook Envelope Tray Icon > > I noticed right after I sent my email (blush) the comment in > their code > > ' add some code to check whether the latest items are "interesting" > > The comment is akin to the old Steve Martin comedy routine: > > "How to become a millionaire. First, get a million dollars..." > > Sorry about that. Is it really that hard? Maybe I'm not thinking it through enough, but I suggest this simple approach: Check for unread messages in the SpamBayes "watched" folders. Check the spam score on each of those unread messages. If any exist where the Spam score is below the certain ham threshold, show the icon if not, everything new was spam, and you can remove the icon. This might take a second or two two but it can happen right after every SpamBayes scoring run gets triggered. So we'll see the new mail icon for at most a few seconds. Regards, Ryan From russ_foster at comcast.net Fri Nov 7 12:46:30 2003 From: russ_foster at comcast.net (Russ Foster) Date: Fri Nov 7 13:52:10 2003 Subject: [Spambayes] How does Spambayes deal with those "random" words inspam? In-Reply-To: <200311071831.hA7IVCX02341@localhost.localdomain> Message-ID: So, for example, this text would have a very high fnord count: Acocdrnig to an elgnsih unviesitry sutdy the oredr of letetrs in a wrod dosen't mttaer, the olny thnig thta's iopmrantt is that the frsit and lsat ltteer of eevry word is in the crcreot ptoision. The rset can be jmbueld and one is stlil able to raed the txet wiohtut dclftfuiiy. On Fri, 7 Nov 2003, Bill Yerazunis wrote: > > Now, there's another possibility- "fnord-counting". This is taking > the incoming text and running it through the equivalent of a spellchecker; > any word (that is, [a-zA-Z]+ ) that isn't recognized is changed to "fnord". From adam.walker at rbwconsulting.com Fri Nov 7 14:07:48 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Fri Nov 7 14:08:05 2003 Subject: [Spambayes] Outlook Envelope Tray Icon In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A012C3765@cliff.bai.org> References: <792DE28E91F6EA42B4663AE761C41C2A012C3765@cliff.bai.org> Message-ID: <3FABED84.5050607@rbwconsulting.com> What about mail delivered to unwatched folders? What about mail delivered to watched and unwatched folders in the same batch? Why do people feel they need to drop everything and read an email when it comes in? Ryan Malayter wrote: > >Is it really that hard? > >Maybe I'm not thinking it through enough, but I suggest this simple >approach: > Check for unread messages in the SpamBayes "watched" folders. > Check the spam score on each of those unread messages. > If any exist where the Spam score is below the certain ham >threshold, show the icon > if not, everything new was spam, and you can remove the icon. > >This might take a second or two two but it can happen right after every >SpamBayes scoring run gets triggered. So we'll see the new mail icon for >at most a few seconds. > >Regards, > Ryan > > > From smeza at wchra.com Fri Nov 7 13:06:11 2003 From: smeza at wchra.com (Sandy Meza) Date: Fri Nov 7 14:10:47 2003 Subject: [Spambayes] HELP Message-ID: Yesterday I downloaded Spam Bayes Outlook Plugin 2.3+ Binary and this morning I cannot open Outlook 2000. I am running Windows XP Professional. When I tried to install yesterday from the exe file I got the error message cannot register the dll files. I have done a system restore, deleted outlook and reinstalled and I still cannot open outlook. What should I do? From parz at shaw.ca Fri Nov 7 14:15:30 2003 From: parz at shaw.ca (Parzival) Date: Fri Nov 7 14:18:01 2003 Subject: [Spambayes] How does Spambayes deal with those "random" words inspam? In-Reply-To: <3FABE124.9F70F8A7@whidbey.com> References: <200311071138.21673.parz@shaw.ca> <3FABE124.9F70F8A7@whidbey.com> Message-ID: <200311071315.30607.parz@shaw.ca> On November 7, 2003 12:15, you wrote: > This was tested in the early days, and the conclusion was to ignore them. > Words that appear only once in the corpus are known as "hapax legemona" (if > that's not correct it is close) and casually referred to be the testers as > "hapaxes". They have no effect at all, because a hapax will have a score of > .5, and the default is to ignore anything between .4 and .6. The non-random > words the spam includes will have scores that are meaningful. Yes, I didn't expect that one could draw any conclusions from words seen only once, but I was suggesting that the /number/ of hapaxes (hoo boy! I lernt a knew werd taday!) in a message might be significant: In other words if the current body of ham contains, say 10% hapaxes, the body of spam contains 18% hapaxes, and an unclassified message contains say, 21%, then might this not be a case for weighting the message toward spam? Also, "hapax" is defined as a word not in the current corpus, but the words I would suggest considering are words which are "not words", and in the computer realization, this would be w.r.t. the body of ham, /and/ some dictionary that the user supplies (e.g. /usr/dict/words). I would hypothesize that in the Subject of messages, this kind of discrimination would be more pronounced, as body text often has lots of computer-generated cruft, while humans tend to actually write Subject text, and even a few garbage words in a subject increase the proportion of new words in the subject drastically. So in effect, I am suggesting that Spambayes would say, hmm, I have here a message with 10% words I have never seen in ham, nor in my dictionary, and the usual ham message contains 5% such words, while the usual spam message contains 9% such words so I think this message is closer to ham than to spam in respect of this measure. -- Parzival From taz at rolleigh.com Fri Nov 7 15:36:03 2003 From: taz at rolleigh.com (matt lennon) Date: Fri Nov 7 15:36:57 2003 Subject: [Spambayes] Spam filter problem Message-ID: I have a email that it will not let through the system, even after I "recover from spam" this email is created by our FTP server for file upload notification. How can I tell it to ignore this certain sender. Please help, Matt Lennon -- Outgoing mail is certified Virus Free. Checked by AVG Anti-Virus (http://www.grisoft.com). Version: 7.0.197 / Virus Database: 260.2.9 - Release Date: 11/6/2003 From KFredrickson at cfp-group.com Fri Nov 7 16:55:44 2003 From: KFredrickson at cfp-group.com (Kim Fredrickson) Date: Fri Nov 7 16:58:36 2003 Subject: [Spambayes] Help - it quit working! Message-ID: I downloaded Spambayes to my machine and it worked beautifully - for one day! The next day when I came in and started my computer, it generated a fatal flaw with the program and now does not work. The Spambayes button on the toolbar pulls down a blank menu. I removed it from my machine, restarted and downloaded again, but the same problem exists. Can this be fixed? Kimberly Fredrickson The CFP Group, LLC Providing Comprehensive Facility Planning and Architecture Phone * 615-846-0222 Email * kfredrickson@cfp-group.com **************************** The information transmitted is private and confidential, and is intended only for the person or entity to whom or which it is addressed, and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you receive this in error, please contact the sender and delete the material from any computer From sethg at GoodmanAssociates.com Fri Nov 7 18:08:05 2003 From: sethg at GoodmanAssociates.com (Seth Goodman) Date: Fri Nov 7 18:08:11 2003 Subject: [Spambayes] simpler way to invoke SpamBayes from Outlook rule Message-ID: I just noticed there is a standard action for Outlook rules called "start application", which runs a user-specified application. Thus, we could easily make a rule to run the SpamBayes message processor when any new message arrives. If this rule were the *last* rule, then all other rules would process first without the use of background mode. Since many message sorting rules often end with "stop processing further rules", the rule invoking the SpamBayes engine would thus only run when the message does not fit any of the other rule categories. This would get rid of the need for background processing mode and get rid of the need to watch for "folder add events". I realize this is a naive question and that the code is already written as it is, but wouldn't this make the Outlook specific code a good deal simpler and thus easier to maintain? From tim.one at comcast.net Fri Nov 7 18:26:43 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Nov 7 18:26:46 2003 Subject: [Spambayes] simpler way to invoke SpamBayes from Outlook rule In-Reply-To: Message-ID: [Seth Goodman] > I just noticed there is a standard action for Outlook rules called > "start application", which runs a user-specified application. Thus, > we could easily make a rule to run the SpamBayes message processor > when any new message arrives. I'm sorry to say that this one is simply impractical: process creation is an extremely expensive undertaking on Windows, and especially on the older Win9x/ME flavors. It would be unbearably slow for anyone getting a non-trivial amount of email. The application that got launched could try to scan for *lots* of new messages instead of just one, to amortize the gross startup cost -- but then it would be fighting the same Outlook timing mysteries as the current scheme. From sethg.delete at GoodmanAssociates.com Fri Nov 7 18:39:47 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Fri Nov 7 18:39:48 2003 Subject: [Spambayes] feature suggestion: new mail sound Message-ID: I saw some posts about the Outlook tray envelope icon and new mail sound. There were various opinions as to how they could be made to work sensibly. The tray icon is not easy, but I think there is a way to make the new mail sound useful. First, we would turn off the new mail sound globally in Outlook (that's an improvement already). Then SpamBayes could (optionally) play the sound when a message was classified as either ham or unsure (configurable, if you think it necessary). If you currently use background mode, all the rules that sort out the mailing list entries that don't require sound notification are run first, so only personally addressed messages that are not spam would generate the sound. If you really wanted sound notification for a particular mailing list, you could add "play a sound" to the rule that sorts for that mailing list. Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above From sethg.delete at GoodmanAssociates.com Fri Nov 7 18:50:56 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Fri Nov 7 18:51:00 2003 Subject: [Spambayes] simpler way to invoke SpamBayes from Outlook rule In-Reply-To: Message-ID: > [Seth Goodman] > > I just noticed there is a standard action for Outlook rules called > > "start application", which runs a user-specified application. Thus, > > we could easily make a rule to run the SpamBayes message processor > > when any new message arrives. > > I'm sorry to say that this one is simply impractical: process creation is > an extremely expensive undertaking on Windows, and especially on the older > Win9x/ME flavors. It would be unbearably slow for anyone getting a > non-trivial amount of email. The application that got launched > could try to > scan for *lots* of new messages instead of just one, to amortize the gross > startup cost -- but then it would be fighting the same Outlook timing > mysteries as the current scheme. > Thanks for the follow-up. Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above From jimp at sybersearch.com Fri Nov 7 19:18:07 2003 From: jimp at sybersearch.com (Jim Partridge) Date: Fri Nov 7 19:18:21 2003 Subject: [Spambayes] Feedback Message-ID: <001301c3a58d$cd328dc0$6401a8c0@jp751n> Hi Folks, I have been using your application now for several days and I'mvery impressed. It works very well with my Outlook. If you ever need an endorsement or want our group to sell it for you, just let us know! Thanks, Jim Partridge B.Comm, MBA President & CEO 102-145 West 15th St. North Vancouver B.C. V7M 1R9 Canada Ph: 604-990-2516 Fx: 604-987-8162 jimp@sybersearch.com http://www.sybersearch.com Visit our hiring center at http://pleaseapply.to/sybersearch -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 10863 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031107/15f44ddb/attachment-0001.jpe From gerry at ITWatchdogs.com Fri Nov 7 19:19:37 2003 From: gerry at ITWatchdogs.com (Gerry Cullen) Date: Fri Nov 7 19:19:44 2003 Subject: [Spambayes] dll won't load in W2003 Message-ID: <000001c3a58d$febb10b0$6501a8c0@itwcullen> I am running Windows 2003 and get the message "Unable to register the DLL/OCX: Dll register server failed; 0x0000000." The dll is addin/spambayes_addin.dll is what is trying to be loaded. Any ideas? Gerry From PIMMEL at mail.nwmissouri.edu Fri Nov 7 21:57:41 2003 From: PIMMEL at mail.nwmissouri.edu (Immel,Patrick) Date: Fri Nov 7 21:57:49 2003 Subject: [Spambayes] Mail not autosending in Outlook 2002 Message-ID: <147CF8DFB9C5D41187300001FA7EE39015165545@mail.nwmissouri.edu> Hello, First I am basically very happy with the performance of this program. I have a question problem though! I am running Outlook 2002 with SpamBayes .81 on Windows 2000 pro. I have Outlook set to automatically send mail as soon as I hit the *send* button. With SpamBayes installed the mail does not automatically send, yet when I uninstall SB, the mail starts to auto send. Is this a known issue and is there a fix / work around? Thanks for the help! Patrick Immel Assistant Professor Lighting & Scenic Designer Northwest Missouri State University From richie at entrian.com Sat Nov 8 04:25:36 2003 From: richie at entrian.com (Richie Hindle) Date: Sat Nov 8 04:26:06 2003 Subject: [Spambayes] simpler way to invoke SpamBayes from Outlook rule In-Reply-To: References: Message-ID: <18dpqv88kaq9kaulg666p3hv2jmoilflu8@4ax.com> [Seth] > I just noticed there is a standard action for Outlook rules called > "start application", which runs a user-specified application. Thus, > we could easily make a rule to run the SpamBayes message processor > when any new message arrives. [Tim] > I'm sorry to say that this one is simply impractical: process creation is > an extremely expensive undertaking on Windows, and especially on the older > Win9x/ME flavors. Would that still be true for a very small C program that talked to a long-running Spambayes server process? I know that process startup is incredibly bad on Windows compared with any sensible OS, but that's a relative measure. I just started Notepad for the first time in ages and it took on the order of a quarter of a second (on my fast machine, but people do run Outlook on fast machines 8-). Doing it a second time is close to instantateous. Maybe doing it a thousand times would be significant, but it's still relative to the time taken to fetch, classify, sort and store a thousand emails. -- Richie Hindle richie@entrian.com From barry.bear at lycos.co.uk Sat Nov 8 04:40:09 2003 From: barry.bear at lycos.co.uk (B.B) Date: Sat Nov 8 04:40:09 2003 Subject: [Spambayes] Outlook plug-in 0.81 continuous training. Message-ID: <001a01c3a5dc$4cd43670$2b5a6b3e@barrys> First of all thank you all for a great product, it just works. I seldom have a false positive and only a few unsure, I am impressed. I have a question about the continuous training feature. On the first page of the Spambayes manager it gives the current status of the training database of how many good and how many spam. I have noticed that the number of spam increments with every spam mail received but the number of good does not increment with every good e-mail received. It remains at the number it was when I initially trained it. Is this the correct behaviour? Thanks Barry From DBaumann at technexxus.com Sat Nov 8 13:27:11 2003 From: DBaumann at technexxus.com (David N. Baumann) Date: Sat Nov 8 13:27:15 2003 Subject: [Spambayes] Offer to Help Message-ID: I'm very impressed with your work and would be glad to help with your documentation and in evaluating features and functionality (e.g., "rules export function" or other utility for setting up multiple machines). Let me know where to begin. David Baumann Partner: Mintz Levin Cohn Ferris Glovsky and Popeo, PC CEO: TechNexxus, LLC dbaumann@mintz.com dbaumann@technexxus.com 701 Pennsylvania Avenue, N.W. Suite 900 Washington, DC 20004 Phone: 202.434.7450 Fax: 202.434.7400 Cell: 202.329.5588 www.mintz.com www.technexxus.com STATEMENT OF CONFIDENTIALITY: The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain confidential or privileged information. If you are not the intended recipient, please notify Mintz, Levin, Cohn, Ferris, Glovsky and Popeo, P.C. immediately at either (617) 542-6000 or at ISdirector@Mintz.com, and destroy all copies of this message and any attachments. From desertdummies at qwest.net Sat Nov 8 16:37:58 2003 From: desertdummies at qwest.net (Desertdummies) Date: Sat Nov 8 16:38:16 2003 Subject: [Spambayes] Spambayes Question Message-ID: <000001c3a640$976c49c0$6501a8c0@cactus1> How do I tell Spambayes to mark the messages as "read" when it moves them to the spam folder? Thanks. Kent From larrym3 at cox.net Sat Nov 8 17:11:09 2003 From: larrym3 at cox.net (Larry J. Martin) Date: Sat Nov 8 17:11:13 2003 Subject: [Spambayes] Cannot Open Spambayes Manager Message-ID: <000001c3a645$37876440$4a413841@chmokc.chm.net> When I try to open the manager I get a message to install Visio. Here is the log file. I am sending this from my business account. I am registered using my personal account larrym3@cox.net Loaded bayes database from 'C:\Documents and Settings\lmartin.CHMOKC.000\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\lmartin.CHMOKC.000\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 634 spam and 522 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.0.2195 (Service Pack 4) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] SpamBayes: Watching for new messages in folder Inbox SpamBayes: Watching for new messages in folder Spam Processing 0 missed spam in folder 'Inbox' took 0.248356ms FAILED to add the toolbar item 'SpamBayesCommand.Manager' - (-2147352567, 'Exception occurred.', (0, None, None, None, 0, -2147467259), None) Deleted the dead popup control - re-creating Message 'jza positive grand wh? keep ?p the st?ruggle, a!nnex o?t of that m?e5s tbztiz' had a Spam classification of 'Unsure' Training on message 'jza positive grand wh? keep ?p the st?ruggle, a!nnex o?t of that m?e5s tbztiz' - trained as spam SpamBayes - Disconnecting from Outlook SpamBayes processed 2 messages, finding 1 spam and 1 unsure Addin terminating: 2 COM client and 2 COM servers exist. Larry J. Martin CHM Inc. ljmartin@chm.net cell: 405-514-6556 work: 405-733-2887 From kbird at net2dish.net Sat Nov 8 23:50:56 2003 From: kbird at net2dish.net (Kathy Lynch) Date: Sat Nov 8 23:51:29 2003 Subject: [Spambayes] SpamBayes -- I've really done it now! Message-ID: <000801c3a67d$1a5e0d00$0100a8c0@delldual550> Good day! Can I recover? In my Windows XP, I just installed the add-in of SpamBayes and I have already lost approximately twenty good messages from my Outlook 2000 Inbox. Can I recover them? Thanks. Patrick From pfarley at iisfa.org Sun Nov 9 10:43:34 2003 From: pfarley at iisfa.org (Paul Farley) Date: Sun Nov 9 10:43:59 2003 Subject: [Spambayes] Install Error Message-ID: <000e01c3a6d8$40f59950$8505a8c0@aragorn> I'm trying to install SPAM Bayes on a Windows XP SP1 box running Outlook 2000. I'm getting the following error: Unable to register DLL/OCX: DllRegisterServer.....spambayes_addin.dll Ideas? From Paul.Farley at EventLevel.com Sun Nov 9 10:54:33 2003 From: Paul.Farley at EventLevel.com (Paul Farley) Date: Sun Nov 9 10:54:56 2003 Subject: [Spambayes] Install Error In-Reply-To: <000e01c3a6d8$40f59950$8505a8c0@aragorn> Message-ID: <000f01c3a6d9$c8397a70$8505a8c0@aragorn> I've tried the uninstall trick, but saw some other ideas on the bug forum just now, so I'll try that first. -----Original Message----- From: Paul Farley [mailto:pfarley@iisfa.org] Sent: Sunday, November 09, 2003 10:44 To: spambayes@python.org Subject: [Spambayes] Install Error I'm trying to install SPAM Bayes on a Windows XP SP1 box running Outlook 2000. I'm getting the following error: Unable to register DLL/OCX: DllRegisterServer.....spambayes_addin.dll Ideas? _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From bob at 1776.com Sun Nov 9 12:41:10 2003 From: bob at 1776.com (Robert K. Coe) Date: Sun Nov 9 12:45:01 2003 Subject: [Spambayes] Outlook add-in is SICK and needs help! Message-ID: <000001c3a6e8$aa329110$6501a8c0@CambridgeMA.gov> I don't mean to be facetious, but is the Outlook add-in still being supported? Over the past several weeks (at least as long as 0.81 has been out there), literally dozens of people have reported the "Unable to register the DLL/OCX ..." error and the (related?) error in which Spambayes sometimes quits working and/or prevents Outlook from working for one or more users. It would seem that these are serious problems, and they're certainly real. I've seen them both, and I'm not a clueless newbie. But through it all I can't remember seeing a single comment from any of the developers. No acknowledgement that they've confirmed (or not been able to confirm) the errors. No promise to get out a fix. Nothing. (Unless I'm getting senile, which isn't out of the question; I'm not getting any younger.) Just for the record, I've gotten both of the aforementioned errors, as have several of my users. For me personally, the most serious problem arose yesterday, when I converted my Windows 2000 laptop to Windows XP (a complete from-scratch reinstallation). Ever since, if I try to open Outlook with Spambayes running, Outlook exits and offers to send an error report to Microsoft. (I let it do so the first time.) If I remove Spambayes, Outlook works normally. I can't prove it, but I suspect that the problem dates to the attempt (around version 0.7 or 0.8) to simplify the installation process by blurring the distinction between user-specific and machine-specific setup. (Before that, the distinction was clean but poorly explained in the documentation, leading to much confusion.) And in the two cases I know the most about, the problem occurred when a Spambayes user tried to use a computer on which no one had run Outlook with Spambayes before. I fully realize that this is free software and that those who wrote it have to spend some time on their day jobs. So I don't want to make too much out of this. But this program does a good job in an area important to all of us, and it would be a shame to see it wither. We were all a little incensed a month or two ago when a columnist denounced Spambayes as not ready for prime time. But I have to say that I'm beginning to think he has a point. Bob From adam.walker at rbwconsulting.com Sun Nov 9 13:03:02 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Sun Nov 9 13:03:40 2003 Subject: [Spambayes] Outlook add-in is SICK and needs help! In-Reply-To: <000001c3a6e8$aa329110$6501a8c0@CambridgeMA.gov> References: <000001c3a6e8$aa329110$6501a8c0@CambridgeMA.gov> Message-ID: <3FAE8156.2060700@rbwconsulting.com> Robert K. Coe wrote: >I don't mean to be facetious, but is the Outlook add-in still being supported? Over the past several weeks (at least as long as 0.81 has been out there), literally dozens of people have reported the "Unable to register the DLL/OCX ..." error and the (related?) error in which Spambayes sometimes > > I believe a new installer is being tested that is based on a new (sandbox) version py2exe that should fix this problem. Every release of the plugin brings a new onslaught of problems, I can see why Mark would want to test installer very well before switching to something new. >I fully realize that this is free software and that those who wrote it have to spend some time on their day jobs. So I don't want to make too much out of this. But this program does a good job in an area important to all of us, and it would be a shame to see it wither. We were all a little incensed a month or two ago when a columnist denounced Spambayes as not ready for prime time. But I have to say that I'm beginning to think he has a point. > > > It was a guy in a weblog. As long as there is zero in the first position of the version number, then no-one is making any promises that it is ready for prime-time. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 261 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031109/e1011fa3/attachment.bin From colet at code-energy.com Sun Nov 9 14:55:51 2003 From: colet at code-energy.com (Cole Tuininga) Date: Sun Nov 9 14:59:56 2003 Subject: [Spambayes] sb_imapfilter fails with string index out of range Message-ID: <1068407751.20436.9.camel@colap> Hi all - I'm a spambayes newbie, trying to set it up for the first time. Looked through the archives but didn't see anything that about this specific problem. A little background: The mail server is running courier imap (ssl) on a debian woody system. The mail client is typically evolution on the off chance that it matters. On the client system, I've compiled and installed python2.3. I installed spambayes with (seemingly) no problem. I couldn't seem to get pybsddb to work, so I'm opting to go with pickle storage. My .spambayesrc file looks like this: [Headers] notate_subject=spam [imap] password=*snip* server=someserver:993 use_ssl=True username=*snip* When I started up the imap filter ( sb_imapfilter.py -b ) it seems to run ok. I can go in with the web browser and train it, classify mail, etc. However, when I try to start the "daemon" version ( sb_imapfilter.py -c -t -l 5 ) I get a traceback: SpamBayes IMAP Filter Beta1, version 0.1 (September 2003), using SpamBayes IMAP Filter Web Interface Alpha2, version 0.02 and engine SpamBayes Beta2, version 0.2 (July 2003). Traceback (most recent call last): File "/usr/local/bin/sb_imapfilter.py", line 825, in ? run() File "/usr/local/bin/sb_imapfilter.py", line 815, in run imap_filter.Filter() File "/usr/local/bin/sb_imapfilter.py", line 667, in Filter imap.SelectFolder(self.spam_folder.name) File "/usr/local/bin/sb_imapfilter.py", line 245, in SelectFolder response = self.select(folder, None) File "/usr/local/encap/python-2.3/lib/python2.3/imaplib.py", line 606, in select typ, dat = self._simple_command(name, mailbox) File "/usr/local/encap/python-2.3/lib/python2.3/imaplib.py", line 1000, in _simple_command return self._command_complete(name, self._command(name, *args)) File "/usr/local/encap/python-2.3/lib/python2.3/imaplib.py", line 774, in _command data = '%s %s' % (data, self._checkquote(arg)) File "/usr/local/encap/python-2.3/lib/python2.3/imaplib.py", line 983, in _checkquote if (arg[0],arg[-1]) in (('(',')'),('"','"')): IndexError: string index out of range I'm using the latest release from the downloads area (1.0a7 IIRC). Anybody have hints/ideas/suggestions? Thanks in advance... -- "... one of the main causes of the fall of the Roman Empire was that, lacking zero, they had no way to indicate successful termination of their C programs." -- Robert Firth Cole Tuininga Lead Developer Code Energy, Inc colet@code-energy.com PGP Key ID: 0x43E5755D From spambayes at whateley.com Sat Nov 8 02:13:16 2003 From: spambayes at whateley.com (Brendon) Date: Sun Nov 9 16:12:01 2003 Subject: [spambayes-dev] Re: [Spambayes] Outlook Envelope Tray Icon In-Reply-To: <3FABED84.5050607@rbwconsulting.com> References: <792DE28E91F6EA42B4663AE761C41C2A012C3765@cliff.bai.org> <3FABED84.5050607@rbwconsulting.com> Message-ID: <200311071220.06215.spambayes@whateley.com> On Friday 07 November 2003 11:07 am, Adam Walker wrote: > What about mail delivered to unwatched folders? What about mail > delivered to watched and unwatched folders in the same batch? Why do > people feel they need to drop everything and read an email when it comes > in? > Probably the same reason most people can't _not_ answer a ringing phone! That, or just lonely? Brendon. From tpeters at mixcom.com Sun Nov 9 18:07:38 2003 From: tpeters at mixcom.com (Tom Peters) Date: Sun Nov 9 18:07:53 2003 Subject: [Spambayes] Installing the next beta Message-ID: <5.1.0.14.0.20031109170030.00b85a70@localhost> I was wondering where Spambayes keeps its database of words it gets from training. I'm guessing hammie.db since that file gets updated every time I train. The last two batches of mail I trained on (47 messages and 64 messages) were perfect- Spambayes correctly identified all spam and all ham. I have 1.0a7 downloaded and want to install it in the next day or so. Can I preserve the training I've done so far? How? Does running setup.py install find and copy/convert the db file? Or should I install 1.0a7 and copy my old hammie.db file over the top of the one it creates/installs? -T --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From tim.one at comcast.net Sun Nov 9 18:09:47 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Nov 9 18:09:52 2003 Subject: [Spambayes] Outlook add-in is SICK and needs help! In-Reply-To: <000001c3a6e8$aa329110$6501a8c0@CambridgeMA.gov> Message-ID: [Robert K. Coe] > I don't mean to be facetious, but is the Outlook add-in still > being supported? Over the past several weeks (at least as long > as 0.81 has been out there), literally dozens of people have > reported the "Unable to register the DLL/OCX ..." error and the > (related?) error in which Spambayes sometimes quits working > and/or prevents Outlook from working for one or more users. It > would seem that these are serious problems, Yes. > and they're certainly real. Yes. > I've seen them both, and I'm not a clueless newbie. So what caused them <0.1 wink>? > But through it all I can't remember seeing a single comment from > any of the developers. No acknowledgement that they've confirmed > (or not been able to confirm) the errors. No promise to get out a > fix. Nothing. (Unless I'm getting senile, which isn't out of the > question; I'm not getting any younger.) Everything known about such things is recorded in the project bug tracker. There have always been reports of this kind, since the first alpha release. The developers never see them happen, so, no, can't confirm any of 'em. So long as they remain mysteries, also no, no fix is forthcoming so long as that remains true. "Literally dozens" seems hyperbolic to me, but even if it's hundreds, that's a small percentage of the people who have downloaded the Outlook addin. If you haven't noticed before , that's what Windows is like: no matter what piece of Windows software you may look at, a small percentage of users never get it to work. Sometimes an extremely persistent user figures out why (kinda -- this is the "well, I installed a new mouse driver, and suddenly it works!" kind of pseudo explanation), but often there's never a resolution. Every problem reported that Mark could reproduce has already been fixed. Likewise every problem I've ever seen has already been fixed (across 3 distinct installations). The only real hopes remaining for others are that (1) enough reports accumulate on the bug tracker so that a previously unsuspected pattern appears (like "ah, they're all running ZoneAlarm version XYZ with its mail quarantine feature enabled"; and/or, (2) someone knowledgeable about Outlook quirks who experiences a problem digs into it, and figures it out themself, or can at least relate it to the (often undocumented) details of the Outlook API; and/or, (3) Mark installs a new version of Outlook or Windows and sees the problem himself. Keep in mind that we didn't write Outlook, Microsoft keeps the Outlook source code secret, and that Mark already fixed dozens of problems stemming from undocumented, reverse-engineered differences among Outlook and OS versions. When working with a Microsoft API, you're working blind (you can't look at *their* code, you can only guess at the truth about the bits they didn't reveal, or documented incorrectly, or forgot to update the docs when some new version or service pack changed behavior). > ... > I fully realize that this is free software and that those who > wrote it have to spend some time on their day jobs. I can make about one hour per week for spambayes, and since I have no problems of any kind with the Outlook addin, usually spend it analyzing "high unsures", looking for tricks we may be missing. If I had a problem with the Outlook addin, I'd spend my hour on that instead. Someone who does have a problem is going to have to volunteer to track it down to the bitter end, or pay someone to do that for them. Another possibility is to give the Inboxer trial version a shot, and pay for it if it doesn't have the same problem (I don't know whether they see the same kinds of mystery problems, but it's possible that they don't). From pat at lyntronics.com Sun Nov 9 18:12:37 2003 From: pat at lyntronics.com (Patrick Lynch) Date: Sun Nov 9 18:12:51 2003 Subject: [Spambayes] Outlook add-in -- how to restore mail? Message-ID: In less than three minutes after installing the Outlook add-in, I managed to delete good mail from my inbox. Running Windows XP, and Outlook 2000. How do I restore the deleted Inbox messages? From richie at entrian.com Sun Nov 9 18:35:29 2003 From: richie at entrian.com (Richie Hindle) Date: Sun Nov 9 18:35:46 2003 Subject: [Spambayes] Installing the next beta In-Reply-To: <5.1.0.14.0.20031109170030.00b85a70@localhost> References: <5.1.0.14.0.20031109170030.00b85a70@localhost> Message-ID: <2kjtqv02li03j30nag2blh3re2t96smop6@4ax.com> [Tom] > I have 1.0a7 downloaded and want to install it in the next day or so. Can I > preserve the training I've done so far? How? You don't say which SpamBayes application you're running - Outlook plugin, POP3 proxy, IMAP filter, sb_filter...? You also don't say what version you're currently running. If you're running the POP3 proxy, version 1.0a6, you can just upgrade. It will automatically pick up your old settings and your old database. If you're using the 1.0a6 IMAP filter, this note from WHAT_IS_NEW.txt applies: -------------------------- ** Incompatible changes ** -------------------------- o If you are using a pickle for storage, your 'message info' database would previously still have been a dbm (where available). This is no longer the case - if you are using a pickle for the statistics database, you have a pickle for everything. Your old 'message info' database is not converted (and there is no utility provided to do so), but you should not suffer any ill effects from this, *unless* you are using sb_imapfilter.py. In that case, you will find that the filter trains and classifies all messages in the folders it examines, even if it has seen them before - this will only occur once, however. There should be no other incompatible changes (from 1.0a6) in this release. -- Richie Hindle richie@entrian.com From pat at lyntronics.com Sun Nov 9 20:45:00 2003 From: pat at lyntronics.com (Patrick Lynch) Date: Sun Nov 9 20:45:15 2003 Subject: [Spambayes] Outlook add-in -- how to restore mail? - CANCEL THIS! Message-ID: Please disregard my query earlier today. I found the "Junk E-Mail" folder and my mail was there! From tim.one at comcast.net Sun Nov 9 21:03:09 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Nov 9 21:03:06 2003 Subject: [Spambayes] Outlook add-in -- how to restore mail? - CANCEL THIS! In-Reply-To: Message-ID: [Patrick Lynch] > Please disregard my query earlier today. I found the "Junk E-Mail" > folder and my mail was there! Glad you're unstuck! Note that it may *not* be there, though: when you configured SpamBayes, you had to tell it about two folders, a spam folder and an "uncertain" folder. As you continue training the system, it becomes much less likely that a good email will wind up in the spam folder. The uncertain folder is the other place to look. I get about 700 emails a day, and look at my spam folder just once at the end of the day, or sometimes not at all. I can't remember the last time I found a ham there. I look at my uncertain folder several times per day, though. Unusual ham ends up there, sometimes so unusual I can't tell whether it's ham or spam without more effort than it's worth to figure out. Those I just throw away. The rest of the unsures I train on. From tpeters at mixcom.com Sun Nov 9 22:30:44 2003 From: tpeters at mixcom.com (Tom Peters) Date: Sun Nov 9 22:40:55 2003 Subject: [Spambayes] Installing the next beta In-Reply-To: <2kjtqv02li03j30nag2blh3re2t96smop6@4ax.com> References: <5.1.0.14.0.20031109170030.00b85a70@localhost> <5.1.0.14.0.20031109170030.00b85a70@localhost> Message-ID: <5.1.0.14.0.20031109212118.01d255a0@localhost> Sorry I didn't give all the details. I use the POP3 proxy. I downloaded the package labeled 1.0a5 then upgraded to 1.0a6 which is what I'm using now. I installed a6 in a separate directory from the a5 release which is why I think it didn't seem to retain the training from the prior release, but it was hard to tell since I hadn't done extensive training. At 11:35 PM 11/9/2003 +0000, you wrote: > > I have 1.0a7 downloaded and want to install it in the next day or so. > Can I > > preserve the training I've done so far? How? > >You don't say which SpamBayes application you're running - Outlook plugin, >POP3 proxy, IMAP filter, sb_filter...? You also don't say what version >you're currently running. > >If you're running the POP3 proxy, version 1.0a6, you can just upgrade. It >will automatically pick up your old settings and your old database. [Government]A government which robs Peter to pay Paul can always depend on the support of Paul. --George Bernard Shaw --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From pfarley at iisfa.org Sun Nov 9 23:31:52 2003 From: pfarley at iisfa.org (Paul Farley) Date: Sun Nov 9 23:32:19 2003 Subject: [Spambayes] Outlook add-in is SICK and needs help! In-Reply-To: Message-ID: <000401c3a743$9502c520$8505a8c0@aragorn> I have a machine that is consistently giving that error, I don't see a log file, is there any other information I could collect that would help in figuring out what's causing the problem? -----Original Message----- From: Tim Peters [mailto:tim.one@comcast.net] Sent: Sunday, November 09, 2003 18:10 To: Robert K. Coe Cc: spambayes@Python.org Subject: RE: [Spambayes] Outlook add-in is SICK and needs help! [Robert K. Coe] > I don't mean to be facetious, but is the Outlook add-in still > being supported? Over the past several weeks (at least as long > as 0.81 has been out there), literally dozens of people have > reported the "Unable to register the DLL/OCX ..." error and the > (related?) error in which Spambayes sometimes quits working > and/or prevents Outlook from working for one or more users. It > would seem that these are serious problems, Yes. > and they're certainly real. Yes. > I've seen them both, and I'm not a clueless newbie. So what caused them <0.1 wink>? > But through it all I can't remember seeing a single comment from > any of the developers. No acknowledgement that they've confirmed > (or not been able to confirm) the errors. No promise to get out a > fix. Nothing. (Unless I'm getting senile, which isn't out of the > question; I'm not getting any younger.) Everything known about such things is recorded in the project bug tracker. There have always been reports of this kind, since the first alpha release. The developers never see them happen, so, no, can't confirm any of 'em. So long as they remain mysteries, also no, no fix is forthcoming so long as that remains true. "Literally dozens" seems hyperbolic to me, but even if it's hundreds, that's a small percentage of the people who have downloaded the Outlook addin. If you haven't noticed before , that's what Windows is like: no matter what piece of Windows software you may look at, a small percentage of users never get it to work. Sometimes an extremely persistent user figures out why (kinda -- this is the "well, I installed a new mouse driver, and suddenly it works!" kind of pseudo explanation), but often there's never a resolution. Every problem reported that Mark could reproduce has already been fixed. Likewise every problem I've ever seen has already been fixed (across 3 distinct installations). The only real hopes remaining for others are that (1) enough reports accumulate on the bug tracker so that a previously unsuspected pattern appears (like "ah, they're all running ZoneAlarm version XYZ with its mail quarantine feature enabled"; and/or, (2) someone knowledgeable about Outlook quirks who experiences a problem digs into it, and figures it out themself, or can at least relate it to the (often undocumented) details of the Outlook API; and/or, (3) Mark installs a new version of Outlook or Windows and sees the problem himself. Keep in mind that we didn't write Outlook, Microsoft keeps the Outlook source code secret, and that Mark already fixed dozens of problems stemming from undocumented, reverse-engineered differences among Outlook and OS versions. When working with a Microsoft API, you're working blind (you can't look at *their* code, you can only guess at the truth about the bits they didn't reveal, or documented incorrectly, or forgot to update the docs when some new version or service pack changed behavior). > ... > I fully realize that this is free software and that those who > wrote it have to spend some time on their day jobs. I can make about one hour per week for spambayes, and since I have no problems of any kind with the Outlook addin, usually spend it analyzing "high unsures", looking for tricks we may be missing. If I had a problem with the Outlook addin, I'd spend my hour on that instead. Someone who does have a problem is going to have to volunteer to track it down to the bitter end, or pay someone to do that for them. Another possibility is to give the Inboxer trial version a shot, and pay for it if it doesn't have the same problem (I don't know whether they see the same kinds of mystery problems, but it's possible that they don't). _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From Mark.Howells at softoption.com Mon Nov 10 04:25:39 2003 From: Mark.Howells at softoption.com (Mark Howells) Date: Mon Nov 10 04:27:00 2003 Subject: [Spambayes] Spambayes Question Message-ID: <5846CF419D2EF5439036CC3126A3A995017B1A@SOSERVER1.softoption.local> > -----Original Message----- > From: Desertdummies [mailto:desertdummies@qwest.net] > Subject: [Spambayes] Spambayes Question > > How do I tell Spambayes to mark the messages as "read" when > it moves them to > the spam folder? Open the SpamBayes manager, select the 'Filtering' tab and check the 'Mark spam as read' and 'Mark possible spam as read' checkboxes as appropriate. Mark -- Outgoing mail is certified Virus Free. Checked by AVG Anti-Virus (http://www.grisoft.com). Version: 7.0.195 / Virus Database: 260.2.9 - Release Date: 11/6/2003 From Mark.Howells at softoption.com Mon Nov 10 04:28:11 2003 From: Mark.Howells at softoption.com (Mark Howells) Date: Mon Nov 10 04:29:04 2003 Subject: [Spambayes] SpamBayes -- I've really done it now! Message-ID: <5846CF419D2EF5439036CC3126A3A995017B1B@SOSERVER1.softoption.local> > -----Original Message----- > From: Kathy Lynch [mailto:kbird@net2dish.net] > Subject: [Spambayes] SpamBayes -- I've really done it now! > > In my Windows XP, I just installed the add-in of SpamBayes > and I have already lost approximately twenty good messages > from my Outlook 2000 Inbox. They should have been moved into the "Certain spam" or "Possible spam" folders defined in the 'Filtering' tab of the SpamBayes manager. Just select the messages there and click on the "Recover from spam" button. Mark -- Outgoing mail is certified Virus Free. Checked by AVG Anti-Virus (http://www.grisoft.com). Version: 7.0.195 / Virus Database: 260.2.9 - Release Date: 11/6/2003 From seant at iname.com Mon Nov 10 09:28:18 2003 From: seant at iname.com (Sean True) Date: Mon Nov 10 09:27:39 2003 Subject: [Spambayes] Outlook add-in is SICK and needs help! In-Reply-To: Message-ID: <005301c3a796$e3125bb0$0201a8c0@swapwizard.com> ... > > I fully realize that this is free software and that those who > > wrote it have to spend some time on their day jobs. > > I can make about one hour per week for spambayes, and since I have no > problems of any kind with the Outlook addin, usually spend it > analyzing > "high unsures", looking for tricks we may be missing. If I > had a problem > with the Outlook addin, I'd spend my hour on that instead. > Someone who does > have a problem is going to have to volunteer to track it down > to the bitter > end, or pay someone to do that for them. Another possibility > is to give the > Inboxer trial version a shot, and pay for it if it doesn't > have the same > problem (I don't know whether they see the same kinds of > mystery problems, > but it's possible that they don't). > > Oh, if only that were true. We have few reports of this. Not many, mostly from people installing (attempting to install) on systems with Outlook Express instead of Outlook. I believe we saw a few before we tightened up the "detect Netscape as a really incompatible MAPI provider" logic. And most recently, I discovered that installing a Python 2.3 based binary on top of a 2.2 based binary (without uninstalling first) provokes a lovely error of this flavor. I promptly built a mandatory uninstall into our installer. BTW, for those that are wondering ... Tim is absolutely right about the nightmare that is Outlook. 4 different versions to support, two different operating systems, and a hostile debugging environment. I finally spent a day figuring out how to use the remote debugger inside ActiveState Komodo to debug the binary addin in Outlook ... just so I could figure out which of 300 lines was incurring the 20 MB memory leak ... only in the binary. Still, if enough of your non-technical friends buy it, it might turn into a living. -- Sean From dnoble at appsolution.net Mon Nov 10 09:52:04 2003 From: dnoble at appsolution.net (DNoble) Date: Mon Nov 10 09:54:00 2003 Subject: [Spambayes] Development Participation Message-ID: <003201c3a79a$352d28a0$0c00a8c0@ASP0100> After reviewing numerous SPAM tools I have to give credit to SpamBayes for their intelligent grasp of the issue. First you are the only tool of 15 to actually work correctly on a Windows 2000 Advanced server running Terminal Services. Your Developers clearly understand the importance of using the Windows Registry correctly. Second you kept It simple. There is nothing worse than deploying a tool to 250+ users, most of which don't need the tool, that asks 25 questions during load, most of which the user doesn't know the answer to and the administrator can't default. Finally your tool bar is not inundated with ridiculous features that generally only server to create more spam or return undeliverable email warnings back to the user. I also noticed that the memory load and processor load was 85% less than the average SPAM tool I tested. Lastly you can't beat the price. In Short Don't change a thing. I am the owner of an Application Hosting Company (ASP) serving 250+ Customers worldwide on a server farm of 10+ servers running Windows 2000 Terminal Services, Citrix, and a host of applications for our customers. Most of my users don't require SPAM protection inside outlook as we stop most of it at the mail servers, however some have POP accounts with other providers and it was becoming a nightmare. Your tool gave me a way of solving the problem without managing ridiculous registry settings for software companies that assume all registry settings should go to H_Key_Local Machine. Most Vendors also don't understand the shared installation of Office where multiple users share the same instance of Office on a server. Anyway, I Have 15+ years development experience with VB, C++, VBA, SQL, Access, HTML, XML etc. You Probably don't want my help with documentation as I tend to be too technical however any development help you need don't hesitate to drop me a line. By the way one thing that would be useful would be an administrative option to turn off the initial setup questions. Don't get me wrong, 4 questions all defaulted appropriately is not bad at all. However in a network even 4 question times 250 users produces a bunch of phone calls. Let me know if I can help. Regards, Darrell J Noble Application Solution Providers, Inc. http://www.appsolution.net Main Phone: 303-932-0311 From kennypitt at hotmail.com Mon Nov 10 09:54:22 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Nov 10 09:54:51 2003 Subject: [Spambayes] Outlook plug-in 0.81 continuous training. In-Reply-To: <001a01c3a5dc$4cd43670$2b5a6b3e@barrys> Message-ID: B.B wrote: > I have a question about the continuous training feature. On the first > page of the Spambayes manager it gives the current status of the > training database of how many good and how many spam. I have noticed > that the number of spam increments with every spam mail received but > the number of good does not increment with every good e-mail > received. It remains at the number it was when I initially trained > it. Is this the correct behaviour? Are you certain that your trained spam count is going up? Under normal circumstances, the Outlook addin does not train on *any* received messages unless you tell it to with the Delete or Recover buttons. Since the counts in the SpamBayes Manager show the number of messages that you have trained on, neither should increment just by receiving a message. You might want to keep careful tabs on these numbers for a couple of days to verify that your spam count is in fact increasing with every received spam message. If so, there could be several possible explainations. The most likely one that I can think of would be if you have an Outlook rule that moves messages into the folder that you have designated for Certain Spam. SpamBayes will train on any untrained messages that are moved into the spam folder. -- Kenny Pitt From gnunn at bellsouth.net Mon Nov 10 11:41:12 2003 From: gnunn at bellsouth.net (gnunn@bellsouth.net) Date: Mon Nov 10 11:41:13 2003 Subject: [Spambayes] Deleting Messages from Junk Folder Message-ID: Do I have to keep all the spam in the junk folder for the program to work. I would like to delete it. Some of it has objectionable material in it. From luro at solari.it Mon Nov 10 11:46:51 2003 From: luro at solari.it (Luca Rodaro) Date: Mon Nov 10 11:47:15 2003 Subject: [Spambayes] Question : rules for multiple users Message-ID: As a system admin, I am planning to set up a spambayes pop3proxy for my organization. So, many users will redirect their mail client to the pop3proxy listener. There's one thing I am not understanding. Does any user need to train the proxy by himself with his own rules for spam? Or is it possible to define global rules, valid for all users? Are the rules individual, global, or is it possible to define rules of both categories? Thank you for your help. Best regards from Italy Luca Rodaro Udine - Italy From Mark.Howells at softoption.com Mon Nov 10 11:52:45 2003 From: Mark.Howells at softoption.com (Mark Howells) Date: Mon Nov 10 11:55:59 2003 Subject: [Spambayes] Deleting Messages from Junk Folder Message-ID: <5846CF419D2EF5439036CC3126A3A995017B20@SOSERVER1.softoption.local> > From: gnunn@bellsouth.net [mailto:gnunn@bellsouth.net] > > Do I have to keep all the spam in the junk folder for the > program to work. I > would like to delete it. Some of it has objectionable material in it. Go ahead and delete it, All SpamBayes information is held in external databases. Mark -- Outgoing mail is certified Virus Free. Checked by AVG Anti-Virus (http://www.grisoft.com). Version: 7.0.195 / Virus Database: 260.2.9 - Release Date: 11/6/2003 From balfson at mediasoftusa.com Mon Nov 10 12:54:39 2003 From: balfson at mediasoftusa.com (Bob Alfson) Date: Mon Nov 10 12:56:18 2003 Subject: [Spambayes] Outlook running under WinXP-SP1 Message-ID: <313AB038FC08D5118ADC0008C75DE36FEC0FAF@exchange.microageokc.com> Have tried all suggestions in troubleshooting guide, but still can't enable v.81 From thetasig at comcast.net Mon Nov 10 14:17:10 2003 From: thetasig at comcast.net (theta sigma) Date: Mon Nov 10 14:17:14 2003 Subject: [Spambayes] Mozilla and Spambayes Message-ID: <3FAFE436.90606@comcast.net> I've been successful in migrating from Eudora to Mozilla. I'm still using SpamBayes service. I'd be happy to help out if anyone needs assistance. -=mark=- From tim.one at comcast.net Mon Nov 10 14:44:33 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon Nov 10 14:44:39 2003 Subject: [Spambayes] Outlook add-in is SICK and needs help! In-Reply-To: <000401c3a743$9502c520$8505a8c0@aragorn> Message-ID: [Paul Farley] > I have a machine that is consistently giving that error, I don't see > a log file, is there any other information I could collect that would > help in figuring out what's causing the problem? Well, let's start with: + Which error, exactly (Robert Coe mentioned 3 different errors in the base message you're replying to indirectly here)? + Which version of the addin? + Which version and service pack (or "technical refresh") of Outlook? + Which OS, which service pack level of that OS, and which nationality? Whichever combo of answers you give, I probably don't have access to the same combo. The hope then is that other people on this list with the same combo as yours speak up, one way or the other. If nobody replies, there's nothing more we can do. If other people with the same combo report the same problem, then there's a basis for making progress. If other people with the same combo report that they don't have the same problem, then there's some other difference among your systems that you'll have to figure out in cooperation with those who replied. IOW, problems the developers can't reproduce will get fixed only to the extent that people experiencing the problems actively cooperate with each other. That *can* work great. A Wiki is a good way for people to share experiences in one place, and Richie Hindle set one up for SpamBayes: http://www.entrian.com/sbwiki/ From sethg.delete at GoodmanAssociates.com Mon Nov 10 14:45:29 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Mon Nov 10 14:45:37 2003 Subject: [Spambayes] ageing out database entries Message-ID: I saw mention in some other threads about methods of rebalancing spam and ham in the databases. My question concerns preventing the databases from growing without limit. At the moment, I have only about 550 ham and 750 spam, but I am still adding fifteen or twenty spam per day from the Unsure folder. Unless this slows down, after another three months or so, I will have around 2K spams and climbing. Not only will this imbalance the data set, but I've heard that empirical tests show that too large a database decreases accuracy. So I am curious what, if any, measures SpamBayes takes to control the size of the databases. I saw the post that suggested ageing out databases to control total number plus same start date and then using random deletion to balance the data set sizes. I also am uncomfortable with the idea of randomly deleting spam from the database, but have also noted that someone knowledgeable stated "intuition is a poor guide". As a hardware guy who does signal processing for a living, I am not at all surprised at this in a stochastic approach like yours. However, if empirical evidence tells you to keep the database size limited, an important step would be for the program to do this in a reasonable way, whatever that is. One other Bayesian spam program (K9) sets separate limits on the number of spam and ham, trains on every message and deletes the oldest message when a new on comes in. This achieves whatever spam/ham balance you want without regard to start date. Another obvious approach is to enforce the same start date on the two data sets and accept any spam/ham imbalance. I have no idea what works best, but I'd be willing to help with the experiments needed. Since I don't know jack about Python, Windows API's, Outlook, or VBA, I can't help with that type of programming. If you can isolate it to some C modules, I *can* code in that, as long as you don't talk to me about Windows object classes. I wouldn't mind writing code to process a database file in various ways whose structure was explained somewhere. The software I've written has been limited to DSP routines, DSP simulations (Matlab), communications and hard real-time code for embedded systems (don't need no stinkin' windows). However, I'd also be happy to run test sets to quantify the results of different methods of pruning the databases. If I can help out in any way, please let me know. Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above From tim.one at comcast.net Mon Nov 10 15:11:59 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon Nov 10 15:12:07 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: [Seth Goodman] > I saw mention in some other threads about methods of rebalancing spam > and ham in the databases. My question concerns preventing the > databases from growing without limit. At the moment, I have only > about 550 ham and 750 spam, but I am still adding fifteen or twenty > spam per day from the Unsure folder. If you're seeing spam at the high end of the Unsure range, and you're not seeing ham at the high end of the Unsure range, another idea is to reduce the spam cutoff value (I use 20 and 80 as my cutoffs now). The default values are set closer to the endpoints to help protect new users from initial classification glitches. > Unless this slows down, after another three months or so, I will have > around 2K spams and climbing. Not only will this imbalance the data > set, Unless you also add more ham, sure. But it should slow down. > but I've heard that empirical tests show that too large a database > decreases accuracy. I saw no evidence of that when I ran mass tests, where tens of thousands of each kind were trained on. To the contrary, the more training, the better the results, although a graph of performance versus training size looked more logarithmic than anything else (IOW, you hit a point of diminishing returns quickly). Also, that was on a fixed set of messages. The characteristics of live data may well be different, since spam and ham do change over time. That's one argument in favor of expiring old data. OTOH, I have correspondents I hear from no more than once or twice per year, and if I tossed the rare samples from them out of my ham training set, some of them would have a hard time getting scored as ham the next time they wrote. Lots of tradeoffs. > So I am curious what, if any, measures SpamBayes takes to control the > size of the databases. Currently none. I've been using my main addin database for about a year now, and it's grown to 639 ham and 1049 spam. I rarely train it anymore -- it's more than good enough as-is. This is against an email load of about 700 new msgs per day, so in all I've trained on about 2-3 days' worth of data over the past year. > I saw the post that suggested ageing out databases to control total > number plus same start date and then using random deletion to balance > the data set sizes. I also am uncomfortable with the idea of randomly > deleting spam from the database, but have also noted that someone > knowledgeable stated "intuition is a poor guide". I don't like random deletion of trained ham because of what I said above. That's an experiment I've tried, and I was unhappy with the results. I'm afraid we all get *some* kinds of ham that's so rare a statistical sampling isn't going to find it. > As a hardware guy who does signal processing for a living, I am not at > all surprised at this in a stochastic approach like yours. However, if > empirical evidence tells you to keep the database size limited, an > important step would be for the program to do this in a reasonable way, > whatever that is. I suggest you wait until you have a real problem before trying to solve it. Part of "intuition is a poor guide" is that lots of solutions turn out to be unnecessary . > ... > Since I don't know jack about Python, Windows API's, Outlook, or VBA, > I can't help with that type of programming. If you can isolate it to > some C modules, I *can* code in that, as long as you don't talk to me > about Windows object classes. SpamBayes is written 100% in Python. Since you know C, you'd find the Python tutorial easy going, and would be doing interesting things in the language the same day you start learning it. From kennypitt at hotmail.com Mon Nov 10 15:19:16 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Nov 10 15:19:43 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: Seth Goodman wrote: > However, if empirical evidence tells > you to keep the database size limited, an important step would be for > the program to do this in a reasonable way, whatever that is. One > other Bayesian spam program (K9) sets separate limits on the number > of spam and ham, trains on every message and deletes the oldest > message when a new on comes in. This achieves whatever spam/ham > balance you want without regard to start date. In K9, the limits you're talking about only control how many complete messages of each type are stored in cache for future *re-training*. They do not affect the contents of the actual training database. K9 does not currently do any aging of the training data, although I believe it has been discussed in that context as well. In the training database, both K9 and SpamBayes store only a list of tokens with counts of how many times each has been seen in spam and in ham. No other information is stored about the original message that the token was seen in. The most effective way of aging out tokens would seem to be to keep track of the date that each token was last seen, and set a threshold that says if a token has not been seen in n days then remove it from the training data. Unfortunately, this adds a significant amount of size to the training database as well as increasing the amount of work to be done when classifying a message (thus decreasing the performance). -- Kenny Pitt From neale at woozle.org Mon Nov 10 15:19:48 2003 From: neale at woozle.org (Neale Pickett) Date: Mon Nov 10 15:19:52 2003 Subject: [Spambayes] Saving hammie.db when upgrading? In-Reply-To: <5.1.0.14.0.20031107005438.00b8c1b0@localhost> (Tom Peters's message of "Fri, 07 Nov 2003 01:02:08 -0600") References: <5.1.0.14.0.20031107005438.00b8c1b0@localhost> Message-ID: Tom Peters writes: > I have 1.0a7 downloaded and want to install it in the next day or > so. Can I preserve the training I've done so far? How? The database file format hasn't changed, so you should be able to use 1.0a7 in place the same way you used 1.0a6. Just don't run "sb_filter.py -n" :) Neale From matt at mondoinfo.com Mon Nov 10 15:40:06 2003 From: matt at mondoinfo.com (Matthew Dixon Cowles) Date: Mon Nov 10 15:40:20 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: References: Message-ID: <1068495943.66.1099@sake.mondoinfo.com> > The most effective way of aging out tokens would seem to be to keep > track of the date that each token was last seen, and set a > threshold that says if a token has not been seen in n days then > remove it from the training data. I did some work on that sort of thing some months ago. I found that, at least with regard to my mail, the most effective strategy was to retain tokens that had not just been seen but had actually been used in scoring. With a database that was trained on all incoming mail every day, I was able to reduce the database size to something between 10% and 15% of the size it would normally be before the error rate increased significantly. At about the same time that I found that, I upgraded to a new laptop with 15 times the disk space of my old one, so my researches along those lines became somewhat less urgent. I also found that Tim Peters' strategy of just not bothering to train was also very effective in keeping the database size down. > Unfortunately, this adds a significant amount of size to the > training database as well as increasing the amount of work to be > done when classifying a message (thus decreasing the performance). Indeed, and keeping a database open r/w while scoring increases the chances that a random crash will corrupt it. Regards, Matt From richie at entrian.com Mon Nov 10 15:46:19 2003 From: richie at entrian.com (Richie Hindle) Date: Mon Nov 10 15:46:40 2003 Subject: [Spambayes] Re: Offer to Help / Development Participation In-Reply-To: References: Message-ID: <8rsvqv0vt1804s1akodgn77060evpgr83e@4ax.com> David, Darrell, [David] > I'm very impressed with your work and would be glad to help [...] [Darrell] > [...] any development help you need don't hesitate to drop me a line. Many thanks for the offers! Maybe other developers would like to make specific suggestions (hence I've forwarded this to the spambayes-dev mailing list), but there's a whole bunch of things that you could do, starting with the non-technical: o Try to reproduce bugs that we're having trouble reproducing; see the bug list at http://sourceforge.net/tracker/?group_id=61702&atid=498103 (807217 is my personal hate figure). o Help with testing; we're only able to test within our own environments, and only the developers who are around at the time of a release are able to do even that. Some "real people" who could help test in their environments would be a big help. o Help improve the website; there's a Wiki page about that at http://www.entrian.com/sbwiki/WebSiteDevelopment o Help improve the documentation, especially for the non-Outlook applications (POP3 proxy, IMAP filter, Notes filter, sb_filter). o Help out newbies on the mailing list. o Make contributions to the Wiki, http://www.entrian.com/sbwiki - any hints and tips, scripts, recipes etc. o Taking part in discussions on the developer's mailing list at spambayes-dev@python.org. You don't need to be a developer to participate, you just need to have a decent grasp of the project and have opinions about how it should be developed. For those with programming skills, there's even more you could help with, even without in-depth knowledge of the code. The code's pretty accessible, and developers are always glad to answer questions about how it all works. Here's a small list off the top of my head: o Test patches, tidying them up, making them fit the coding standard (http://www.python.org/peps/pep-0008.html) if they don't already. See http://sourceforge.net/tracker/?group_id=61702&atid=498105 o Fix bugs - turning a bug report into a patch makes it far more likely to be fixed! o Improve our unit tests, or help develop an acceptance test framework. o Once you've got a handle on how the code works, implement feature requests. o Backport bugfixes from the head onto the bugfix branch, although our branch strategy is a little up in the air at the moment, so that's one for the future. o Help with sailing our fleet of luxury yachts from the Caribbean to the Med for the Spring season... or am I dreaming again? 8-) There are probably a dozen other things that I haven't thought of. -- Richie Hindle richie@entrian.com From richie at entrian.com Mon Nov 10 15:59:45 2003 From: richie at entrian.com (Richie Hindle) Date: Mon Nov 10 16:00:03 2003 Subject: [Spambayes] Question : rules for multiple users In-Reply-To: References: Message-ID: Hi Luca, > As a system admin, I am planning to set up > a spambayes pop3proxy for my organization. > So, many users will redirect their mail client > to the pop3proxy listener. > > There's one thing I am not understanding. > Does any user need to train the proxy by himself > with his own rules for spam? > Or is it possible to define global rules, valid for > all users? > Are the rules individual, global, or is it possible > to define rules of both categories? If all your users share the same instance of the proxy, then they all share a training database. The same rules will apply to all of them. This is a mixed blessing; the database will be trained very quickly, and on the whole the system will work well for a bunch of people whose use of email tends to be quite similar (eg. if the vast majority of legitimate email is work-related, and they work in similar jobs). On the other hand, if they all subscribe to a wide variety of mailing lists, their training efforts will at best dilute each other, and at worst contradict each other. Imagine a doctor working with sexual problems, vs. a banker working with African investments. Words like Viagra, Erection, Nigeria and Fund Transfer will be trained in very different ways by the two of them. Spambayes is really intended to be used in a single-user environment (or in a multi-user environment by giving each user their own instance of the proxy, which is just another way of saying the same thing). To my knowledge, no-one has made a scientific study of how well it performs in a multi-user environment vs. the equivalent single-user environments. My gut feeling is that it will perform merely extremely well, instead of astonishingly well. 8-) The other caveat with multiple users using the same proxy is that they will be able to read each other's emails, because the training interface lets you read the emails that are awaiting training. -- Richie Hindle richie@entrian.com From tim.one at comcast.net Mon Nov 10 16:33:03 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon Nov 10 16:33:09 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: [Kenny Pitt] > ... > In the training database, both K9 and SpamBayes store only a list of > tokens with counts of how many times each has been seen in spam and in > ham. No other information is stored about the original message that > the token was seen in. The most effective way of aging out tokens > would seem to be to keep track of the date that each token was last > seen, and set a threshold that says if a token has not been seen in n > days then remove it from the training data. Unfortunately, this adds > a significant amount of size to the training database as well as > increasing the amount of work to be done when classifying a message > (thus decreasing the performance). SpamBayes originally saved a lot more info about each token, including a timestamp recording its most recent use in scoring. The effect on database size is indeed large, but the effect on processing time is minor. At that time, SpamBayes scored at least 80 messages/second on my home machine, and it's slower than that now (mostly due to I/O costs and fancier-- despite leaner --database schemes). The extra fields were deleted because nobody had figured out a compelling use for them. Part of the problem in designing an expiration scheme is that bags of words get added on a per-message basis, so should be removed on a per-message basis too. Then you have to coordinate a message database with the token database, or expand the token database to remember the bags of tokens it was trained on. So that's plenty of work, while most of the developers still have databases so small that it's hard to find them on a modern disk . From olli.marttila at kolumbus.fi Mon Nov 10 19:20:30 2003 From: olli.marttila at kolumbus.fi (Olli J. Marttila) Date: Mon Nov 10 19:20:42 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / No module named zlib ? Message-ID: <6.0.0.22.0.20031111021758.02439170@127.0.0.1> Thanks to Christopher Jastram (I had to reconfigure python with the flag --enable-zlib; why was this not mentioned in installation instructions?) I was eventually able to open http://localhost:8880 and start training of Spambayes by feeding incoming messages one by one to the "Train on a message ..." window. Stupid as I am, I don't still succeed in setting the IMAP filter to run periodically. Command marttila@linux:~> python sb_imapfilter.py -c -t -l 10 is answered by message python: can't open file 'sb_imapfilter.py'; and defining the path marttila@linux:~> python /usr/local/bin/sb_imapfilter.py -c -t -l 10 doesn't have a better success: SpamBayes IMAP Filter Beta1, version 0.1 (September 2003), using SpamBayes IMAP Filter Web Interface Alpha2, version 0.02 and engine SpamBayes Beta2, version 0.2 (July 2003). Traceback (most recent call last): File "/usr/local/bin/sb_imapfilter.py", line 825, in ? run() File "/usr/local/bin/sb_imapfilter.py", line 815, in run imap_filter.Filter() File "/usr/local/bin/sb_imapfilter.py", line 667, in Filter imap.SelectFolder(self.spam_folder.name) File "/usr/local/bin/sb_imapfilter.py", line 245, in SelectFolder response = self.select(folder, None) File "/usr/local/lib/python2.3/imaplib.py", line 606, in select typ, dat = self._simple_command(name, mailbox) File "/usr/local/lib/python2.3/imaplib.py", line 1000, in _simple_command return self._command_complete(name, self._command(name, *args)) File "/usr/local/lib/python2.3/imaplib.py", line 774, in _command data = '%s %s' % (data, self._checkquote(arg)) File "/usr/local/lib/python2.3/imaplib.py", line 983, in _checkquote if (arg[0],arg[-1]) in (('(',')'),('"','"')): IndexError: string index out of range I certainly must have missed some essential stages. - BTW, what is Spambayes Manager? I understood that SpamBayes should process the mail in three categories, but I cannot experience anything like that. How can I get this classifying started? - I created local mailboxes for spam and ham in my ~/Mail directory, and moved a little more than one thousand messages in them, arrived during last two weeks, but didn't know how to get Spambayes to use them for training - it refused to understand their format. My OS in SuSE Linux Personal edition 8.2, I am relying in KMail, and - what is apparent from what written above - using IMAP protocol. Spambayes version is 0.7. I have an external firewall (192.168.1.254). There is another, commercial spam filter (McAfee SpamKiller) for the Windows2000 mail programme (Eudora), that seems to filter most spam messages before they reach KMail; still there is need an effective filter for Linux based operations. Any good advice is welcome - and I have experienced with gratefulness that there are good advisers on this site! My instructor and superior, former chairman of the Board of the ADB services of the University, once defined that the programme manuals are written in a way that the instructions can be understood only by people who already know how to proceed without those instructions. - There is a saying in Finnish that the instructions must be modelled for the simple people wringing steel wire, and now I certainly need lots of wire to become to understand the installation instructions, be it the fact that I use Linux and KMail or that my mail protocol is IMAP. (There are good instructions in Finnish how to proceed with Mozilla Mail and POP3 protocol.) Olli J. Marttila *********************** On Friday 07 November 2003 03:36, Christopher Jastram wrote: > > > Take this with a grain of salt, because I haven't had > this particular problem. However, I have many similar > problems, both with Python and with other pieces of > software. > > Here's the deal: python is looking for a python module > called zlib, which is NOT included in the zlib / > zlib-devel RPMs. Fortunately, you've already compiled > Python (this tells me you know something about how this > stuff works). What you want is to enable 'zlib' support > when you compile python. You'll need zlib-devel to be > installed for Python to compile with zlib support > (already installed). Once Python is built and installed > with zlib support, the zlib.py module will exist in a > funcational form, and you'll be all set. > > How to enable zlib support? I dunno. Been a while since > I've had to compile python (I love FreeBSD ports!). Are > they using a configure script? If so, look in > ./configure --help for something like '--enable-zlib'. > Otherwise, look at the docs. (tip: grep -ir zlib doc/*) > > chris From flyfishermann at earthlink.net Mon Nov 10 20:29:17 2003 From: flyfishermann at earthlink.net (Richard Mann) Date: Mon Nov 10 20:29:24 2003 Subject: [Spambayes] SpamBayes Outlook addin Message-ID: <3FB03B6D.9070609@earthlink.net> I installed SpamBayes Outlook addin on my home system (Windows Xp and Outlook 2002) with no trouble and everything seems to be working properly. However, I also loaded it at work on my Windows Me/Outlook 98 machine and the addin does not load at all when I open Outlook, despite no errors during the installation. I have run regsvr and also uninstalled/reinstalled the addin with no success. The addin does not show under the com addins in Outlook 98. Any ideas? Thanks in advance for your help! From tim.one at comcast.net Mon Nov 10 21:25:31 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon Nov 10 21:25:35 2003 Subject: [Spambayes] SpamBayes Outlook addin In-Reply-To: <3FB03B6D.9070609@earthlink.net> Message-ID: [Richard Mann] > I installed SpamBayes Outlook addin on my home system (Windows Xp and > Outlook 2002) with no trouble and everything seems to be working > properly. > > However, I also loaded it at work on my Windows Me/Outlook 98 machine > and the addin does not load at all when I open Outlook, despite no > errors during the installation. I have run regsvr and also > uninstalled/reinstalled the addin with no success. The addin does not > show under the com addins in Outlook 98. > > Any ideas? Sorry, the addin can't be used with Outlook 98. Microsoft made huge changes to Outlook's object model between OL98 and OL2000, and it would take quite different code on our end to deal with OL98. I don't think any of the current developers even has access to an OL98 installation, so I don't expect this to change (while it's possible that an OL98 user will decide to rewrite our code for OL98's quirks and contribute that back to the project, I don't expect it). From desertdummies at qwest.net Mon Nov 10 22:07:58 2003 From: desertdummies at qwest.net (Desertdummies) Date: Mon Nov 10 22:08:08 2003 Subject: [Spambayes] Suggestion Message-ID: <000a01c3a801$05814f20$6501a8c0@cactus1> It would be nice if, when you select the "Delete As Spam" button, that it would be marked as "read" when placed into your spam folder. Thanks for a great program. Kent From Phil.Jollans at MESSRING.de Tue Nov 11 04:16:46 2003 From: Phil.Jollans at MESSRING.de (Phil.Jollans@MESSRING.de) Date: Tue Nov 11 04:16:55 2003 Subject: [Spambayes] FAQ 3.8 How can I get rid of the envelope tray icon for spam? Message-ID: <20D8C15B1A415F4DBDC591B80A6E567F387A08@exchange-server.messring.de> Hi, with regard to the FAQ 3.8 How can I get rid of the envelope tray icon for spam? for Outlook. I have found that if I select "Mark All as Read" from the context menu for the Spam folder, then the envlope icon disappears. It may be possible to do this via the outlook automation interface, I have no idea. If not, I think it would be worth a mention in the FAQ list, instead of just stating that "you'll have to put up with the little envelope, sorry". Thanks for a really excellent product. Regards Phil Jollans MESSRING Systembau MSG GmbH Robert-Stirling-Ring 1 82152 Krailling Tel: +49-89-898139-712 Fax: +49-89-898139-924 phil.jollans@messring.de http://www.messring.com ____________________________________________________ Bitte beachten Sie unseren Haftungsauschluss unter: Please note our exclusion of liability at: http://www.messring.de/html/e-mail_disclaimer.html From rmalayter at bai.org Tue Nov 11 08:52:03 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Tue Nov 11 08:52:09 2003 Subject: [Spambayes] Suggestion Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C37FE@cliff.bai.org> This is an option in the latest version 0.81. It's the bottom item on the Training tab. > -----Original Message----- > From: spambayes-bounces+rmalayter=bai.org@python.org > [mailto:spambayes-bounces+rmalayter=bai.org@python.org] On > Behalf Of Desertdummies > Sent: Monday, November 10, 2003 9:08 PM > To: spambayes@python.org > Subject: [Spambayes] Suggestion > > It would be nice if, when you select the "Delete As Spam" > button, that it would be marked as "read" when placed into > your spam folder. > > Thanks for a great program. > > Kent > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From kennypitt at hotmail.com Tue Nov 11 09:30:51 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 11 09:31:21 2003 Subject: [Spambayes] FAQ 3.8 How can I get rid of the envelope tray icon forspam? In-Reply-To: <20D8C15B1A415F4DBDC591B80A6E567F387A08@exchange-server.messring.de> Message-ID: Phil.Jollans@MESSRING.de wrote: > Hi, > > with regard to the FAQ > 3.8 How can I get rid of the envelope tray icon for spam? > for Outlook. > > I have found that if I select "Mark All as Read" from the context > menu for the Spam folder, then the envlope icon disappears. > > It may be possible to do this via the outlook automation interface, I > have no idea. If not, I think it would be worth a mention in the FAQ > list, instead of just stating that "you'll have to put up with the > little envelope, sorry". Thanks for the suggestion. I already have a bit of code that can remove the envelope icon from the tray, but unfortunately there is a bit more to the problem. What most people want is to avoid having to open Outlook to find out whether all the new messages are spam or not. So they want SpamBayes to remove the envelope icon automatically if there are no "interesting" new messages. But how do we determine, after we classify a new message as spam, that there are no other "interesting" messages that still haven't been read? Because the user can set up Outlook rules to move messages anywhere they want, it is impossible to know specific folders to check for unread messages and it would be very time-consuming to search every folder in the Outlook store for unread messages each time we classified a message as spam. -- Kenny Pitt From cej at intech.com Tue Nov 11 10:31:04 2003 From: cej at intech.com (Christopher Jastram) Date: Tue Nov 11 10:39:55 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / No module named zlib ? In-Reply-To: <6.0.0.22.0.20031111021758.02439170@127.0.0.1> References: <6.0.0.22.0.20031111021758.02439170@127.0.0.1> Message-ID: <3FB100B8.7050202@intech.com> Exactly what version of Python are you running? I had the exact same problem a few weeks ago, and I think I fixed it by downloading and installing the very latest version of Python from source from python.org. Now, you are installing from source -- is it from RPM source or python.org? I think the version was 2.3.2, but I'm not sure. The computer I have it on is at home -- grrrr. My platform is SuSE 8.2 Professional, connecting to a Cyrus IMAP server. I was setting sb_imapfilter to run in concert with Evolution, but I abandoned it in favor of server-side filtering (which works quite well, so far). And, you are absolutely right about many manuals being written for people who do not need them. I am only now (after 5 years' intensive experience in Linux and FreeBSD) able to sit down with a manual and work out how to use the software without using a HOWTO or tutorial. It is a very time-consuming process, and I generally print out the manual on paper and make lots of notes. Same for source code, but I can't print source code out, which sucks (10k pages is no good). Christopher Jastram Olli J. Marttila wrote: > > Thanks to Christopher Jastram (I had to reconfigure > python with the flag --enable-zlib; why was this not > mentioned in installation instructions?) I was eventually > able to open http://localhost:8880 and start training of > Spambayes by feeding incoming messages one by one to the > "Train on a message ..." window. Stupid as I am, I don't > still succeed in setting the IMAP filter to run > periodically. Command > > marttila@linux:~> python sb_imapfilter.py -c -t -l 10 > > is answered by message > > python: can't open file 'sb_imapfilter.py'; > > and defining the path > > marttila@linux:~> python /usr/local/bin/sb_imapfilter.py -c > -t -l 10 > > doesn't have a better success: > > SpamBayes IMAP Filter Beta1, version 0.1 (September 2003), > using SpamBayes IMAP Filter Web Interface Alpha2, version > 0.02 and engine SpamBayes Beta2, version 0.2 (July 2003). > > Traceback (most recent call last): > File "/usr/local/bin/sb_imapfilter.py", line 825, in ? > run() > File "/usr/local/bin/sb_imapfilter.py", line 815, in run > imap_filter.Filter() > File "/usr/local/bin/sb_imapfilter.py", line 667, in Filter > imap.SelectFolder(self.spam_folder.name) > File "/usr/local/bin/sb_imapfilter.py", line 245, in > SelectFolder > response = self.select(folder, None) > File "/usr/local/lib/python2.3/imaplib.py", line 606, in > select > typ, dat = self._simple_command(name, mailbox) > File "/usr/local/lib/python2.3/imaplib.py", line 1000, in > _simple_command > return self._command_complete(name, self._command(name, > *args)) > File "/usr/local/lib/python2.3/imaplib.py", line 774, in > _command > data = '%s %s' % (data, self._checkquote(arg)) > File "/usr/local/lib/python2.3/imaplib.py", line 983, in > _checkquote > if (arg[0],arg[-1]) in (('(',')'),('"','"')): > IndexError: string index out of range > > I certainly must have missed some essential stages. - BTW, > what is Spambayes Manager? I understood that SpamBayes > should process the mail in three categories, but I cannot > experience anything like that. How can I get this > classifying started? - I created local mailboxes for spam > and ham in my ~/Mail directory, and moved a little more > than one thousand messages in them, arrived during last two > weeks, but didn't know how to get Spambayes to use them for > training - it refused to understand their format. > > My OS in SuSE Linux Personal edition 8.2, I am relying in > KMail, and - what is apparent from what written above - > using IMAP protocol. Spambayes version is 0.7. I have an > external firewall (192.168.1.254). There is another, > commercial spam filter (McAfee SpamKiller) for the > Windows2000 mail programme (Eudora), that seems to filter > most spam messages before they reach KMail; still there is > need an effective filter for Linux based operations. > > Any good advice is welcome - and I have experienced with > gratefulness that there are good advisers on this site! My > instructor and superior, former chairman of the Board of > the ADB services of the University, once defined that the > programme manuals are written in a way that the > instructions can be understood only by people who already > know how to proceed without those instructions. > > - There is a saying in Finnish that the instructions must be > modelled for the simple people wringing steel wire, and now > I certainly need lots of wire to become to understand the > installation instructions, be it the fact that I use Linux > and KMail or that my mail protocol is IMAP. (There are good > instructions in Finnish how to proceed with Mozilla Mail > and POP3 protocol.) > > Olli J. Marttila > > *********************** > > On Friday 07 November 2003 03:36, Christopher Jastram wrote: > > > > > > Take this with a grain of salt, because I haven't had > > this particular problem. However, I have many similar > > problems, both with Python and with other pieces of > > software. > > > > Here's the deal: python is looking for a python module > > called zlib, which is NOT included in the zlib / > > zlib-devel RPMs. Fortunately, you've already compiled > > Python (this tells me you know something about how this > > stuff works). What you want is to enable 'zlib' support > > when you compile python. You'll need zlib-devel to be > > installed for Python to compile with zlib support > > (already installed). Once Python is built and installed > > with zlib support, the zlib.py module will exist in a > > funcational form, and you'll be all set. > > > > How to enable zlib support? I dunno. Been a while since > > I've had to compile python (I love FreeBSD ports!). Are > > they using a configure script? If so, look in > > ./configure --help for something like '--enable-zlib'. > > Otherwise, look at the docs. (tip: grep -ir zlib doc/*) > > > > chris > > From ricardo at talentum.com.ar Tue Nov 11 11:37:53 2003 From: ricardo at talentum.com.ar (Ricardo Basualdo) Date: Tue Nov 11 11:38:56 2003 Subject: [Spambayes] Complete Removal Message-ID: <3F9F889800381C78@mta4.fibertel.com.ar> (added by postmaster@fibertel.com.ar) My OS is Windows XP. I just bought and installed Office 2003. I tested Outlook's 2003 new Junk Mail filter and it filtered about 99% of the junk, so I decided to uninstall Spam Bayes (even though it has being very useful for me while I used it, for which I'm thankful for this). Okay, the problem is that Spam Bayes does not have an uninstall option, so I used the Add Remove Programs control panel to uninstall it and even though it doesn't show anymore in the list of installed programs, I got a message saying "some components could not be uninstalled and you should do it manually". After that I re-started my computer and Outlook, but the buttons of Spam Bayes, even though disabled, still appear there. Here is a screen shot. I also tried to disable it by going to the Com Add-Inns window, but as you can see, Spam Bayes doesn't show on the list. So, how can I remove it completely and stop the Icons of Spam Bayes to keep showing in Outlook? PD: I read all your FAQ and other notes and there's nothing specifically addressing this issue. Thanks for your answer. Ricardo Basualdo TALENTUM THINK TANK President -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 4325 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031111/5eb4d045/attachment.jpe -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 12379 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031111/5eb4d045/attachment-0001.jpe From hcross at alltrista.com Tue Nov 11 11:55:47 2003 From: hcross at alltrista.com (Cross, Harvey) Date: Tue Nov 11 11:55:54 2003 Subject: [Spambayes] HELP! Message-ID: <3D23F6A5C4D8D31197CA00508B9B0F0301D88E6A@cpc_ex1> I need your help if possible. I have used Spambayes for several months and love it! For some reason it disappeared off of the tool bar. It was not operative. I went to tools, options, other, advanced and com add-ins. The Spambayes was there and not checked. I checked it and hit ok. If I go back it is unchecked again. I have uninstalled and reinstalled the program several times. Nothing seems to work. I HATE SPAM. ANY HELP WILL BE APPRECIATED. Harvey Cross Alltrista Consumer Products ( 770-979-5896 * HCross@alltrista.com From olli.marttila at kolumbus.fi Tue Nov 11 11:59:18 2003 From: olli.marttila at kolumbus.fi (Olli J. Marttila) Date: Tue Nov 11 11:59:31 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / No module named zlib ? In-Reply-To: <3FB100B8.7050202@intech.com> References: <6.0.0.22.0.20031111021758.02439170@127.0.0.1> <3FB100B8.7050202@intech.com> Message-ID: <6.0.0.22.0.20031111182200.02471298@127.0.0.1> I have the newest version of python (the same 2.3.2), downloaded from python.org as a tarball. Somebody adviced me to throw the rpm-version of SuSe to the dustbin and install from the source; in fact, I installed the binaries, not from the source (*.src.tar.gz). I tried to do the same with zlib but both apt-get and yast2 refused to install it claiming that I already have the newest version and that uninstalling it is impossible. Then I installed also zlib-devel - without knowing whether it was necessary - but just in case. Anyway, I think that what _was_ necessary was the --enable-zlib flag in configuring python for installation, thank you. I see that we have similar installation. I too prefer to filter the messages on server side - like my Windows spam filter. What is funny is that SpamKiller keeps what it classifies as spam in the server and does not even show those messages when I am working in Linux environment. What I miss are detailed instructions for getting SpamBayes operating with KMail using IMAP protocol, without downloading the spam messages. In fact, I have been using Linux for years, because the workstations in my former employer were Linux-workstations - Linus Thorvalds comes from the University of Helsinki - but there was special personnel with degree in computer science who took care of the workstations. My own experience started 1961 with autocode, machine code and Algol and Elliot 503, but Linux is me kind of hobby. I miss OS/2 that was my chief OS for years until I had to update my computer and with it the operating system. I have never really understood the most fanatic Linux users as there are so many important Windows programmes - I mean, important to me - and I only have a wish to get rid of Windows some day, so I am testing corresponding Linux programmes, Spambayes among them. Olli J. Marttila (grey beard, retaired) ****************** At 11.11.2003 17:31, Christopher Jastram wrote: >Exactly what version of Python are you running? > >I had the exact same problem a few weeks ago, and I think I fixed it by >downloading and installing the very latest version of Python from source >from python.org. Now, you are installing from source -- is it from RPM >source or python.org? > >I think the version was 2.3.2, but I'm not sure. The computer I have it >on is at home -- grrrr. > >My platform is SuSE 8.2 Professional, connecting to a Cyrus IMAP >server. I was setting sb_imapfilter to run in concert with Evolution, but >I abandoned it in favor of server-side filtering (which works quite well, >so far). > >And, you are absolutely right about many manuals being written for people >who do not need them. I am only now (after 5 years' intensive experience >in Linux and FreeBSD) able to sit down with a manual and work out how to >use the software without using a HOWTO or tutorial. It is a very >time-consuming process, and I generally print out the manual on paper and >make lots of notes. Same for source code, but I can't print source code >out, which sucks (10k pages is no good). > >Christopher Jastram From kennypitt at hotmail.com Tue Nov 11 12:00:02 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 11 12:00:47 2003 Subject: [Spambayes] Complete Removal In-Reply-To: <3F9F889800381C78@mta4.fibertel.com.ar> (added bypostmaster@fibertel.com.ar) Message-ID: Ricardo Basualdo wrote: > So, how can I remove it completely and stop the Icons of Spam Bayes > to keep showing in Outlook? The SpamBayes uninstall program doesn't have access to Outlook to delete the toolbar, so you'll have to do so manually. Just right-click your toolbar and select Customize. Then highlight SpamBayes and click Delete. That should be all there is to it. > PD: I read all your FAQ and other notes and there's nothing specifically > addressing this issue. Yes, this is probably a good candidate for adding to the FAQ. I'll see if I can find time to write something up and submit it for consideration. -- Kenny Pitt From kennypitt at hotmail.com Tue Nov 11 12:05:13 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 11 12:05:50 2003 Subject: [Spambayes] Complete Removal In-Reply-To: <3F9F889800381C78@mta4.fibertel.com.ar> (added bypostmaster@fibertel.com.ar) Message-ID: Ricardo Basualdo wrote: > So, how can I remove it completely and stop the Icons of Spam Bayes > to keep showing in Outlook? The SpamBayes uninstall program doesn't have access to Outlook to delete the toolbar, so you'll have to do so manually. Just right-click your toolbar and select Customize. Then highlight SpamBayes and click Delete. That should be all there is to it. > PD: I read all your FAQ and other notes and there's nothing specifically > addressing this issue. Yes, this is probably a good candidate for adding to the FAQ. I'll see if I can find time to write something up and submit it for consideration. -- Kenny Pitt From sethg.delete at GoodmanAssociates.com Tue Nov 11 14:48:52 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Tue Nov 11 14:48:59 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: [Seth Goodman] > > but I've heard that empirical tests show that too large a database > > decreases accuracy. [Tim Peters] > > I saw no evidence of that when I ran mass tests, where tens of > thousands of > each kind were trained on. To the contrary, the more training, the better > the results, although a graph of performance versus training size looked > more logarithmic than anything else (IOW, you hit a point of diminishing > returns quickly). Also, that was on a fixed set of messages. The > characteristics of live data may well be different, since spam and ham do > change over time. That's one argument in favor of expiring old > data. OTOH, > I have correspondents I hear from no more than once or twice per year, and > if I tossed the rare samples from them out of my ham training set, some of > them would have a hard time getting scored as ham the next time > they wrote. > Lots of tradeoffs. > Here's the post where I got the idea that large training sets are bad. No one disputed this when it was posted, so I mistook it for fact. [Seth Goodman] > > From: Seth Goodman > > Subject: [Spambayes] optimal training ratio of spam/ham > > > > Is there an optimal ratio of spam to ham for a training set? [Ryan Malayter] >Yes, there is: 1-to-1 [Seth Goodman] > > Is there an optimal total corpus size for training? [Ryan Malayter] > Not really, but evidence seems to suggest that a thousand or messages in > seems to work well. However, 10,000 or more messages seems to decrease > the capture rates somewhat. [Tim Peters] > I suggest you wait until you have a real problem before trying to > solve it. > Part of "intuition is a poor guide" is that lots of solutions > turn out to be > unnecessary . I certainly agree and have no desire to rush in and break something that works. My suggestion was based on the above post, not from any observations of my own. If you can think of something useful for me to do to help out the SpamBayes effort, please point me in that direction. [Tim Peters] > SpamBayes is written 100% in Python. Since you know C, you'd find the > Python tutorial easy going, and would be doing interesting things in the > language the same day you start learning it. Fair enough. Could you point me to the tutorial you mentioned? Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above From richie at entrian.com Tue Nov 11 14:59:47 2003 From: richie at entrian.com (Richie Hindle) Date: Tue Nov 11 15:00:33 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: References: Message-ID: [Seth] > Could you point me to the tutorial you mentioned? Tim probably meant http://www.python.org/doc/current/tut/tut.html http://diveintopython.org/ also comes highly recommended. -- Richie Hindle richie@entrian.com From sethg.delete at GoodmanAssociates.com Tue Nov 11 15:00:31 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Tue Nov 11 15:00:40 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: > Seth Goodman wrote: > > However, if empirical evidence tells > > you to keep the database size limited, an important step would be for > > the program to do this in a reasonable way, whatever that is. One > > other Bayesian spam program (K9) sets separate limits on the number > > of spam and ham, trains on every message and deletes the oldest > > message when a new on comes in. This achieves whatever spam/ham > > balance you want without regard to start date. > > In K9, the limits you're talking about only control how many complete > messages of each type are stored in cache for future *re-training*. > They do not affect the contents of the actual training database. K9 > does not currently do any aging of the training data, although I believe > it has been discussed in that context as well. > > In the training database, both K9 and SpamBayes store only a list of > tokens with counts of how many times each has been seen in spam and in > ham. No other information is stored about the original message that the > token was seen in. The most effective way of aging out tokens would > seem to be to keep track of the date that each token was last seen, and > set a threshold that says if a token has not been seen in n days then > remove it from the training data. Unfortunately, this adds a > significant amount of size to the training database as well as > increasing the amount of work to be done when classifying a message > (thus decreasing the performance). > > -- > Kenny Pitt Well, it does appear that K9 actually stores two complete message corpuses: one for spam and one for ham. K9 also has a command called "Organize", which moves messages between corpuses after you do any reclassification. The reason that they give in their instructions for keeping both message corpuses around is to allow you to retrain from scratch, should you need to in the future. I'm not suggesting you do the same, unless you have some other purpose. This would be one way to age out entire messages instead of individual tokens, but from other responses in this thread, that might not be a good thing at all. Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above From sethg.delete at GoodmanAssociates.com Tue Nov 11 15:05:52 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Tue Nov 11 15:05:55 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: > -----Original Message----- > From: Richie Hindle [mailto:richie@entrian.com] > Sent: Tuesday, November 11, 2003 2:00 PM > To: sethg@GoodmanAssociates.com; spambayes@python.org > Subject: Re: [Spambayes] ageing out database entries > > > > [Seth] > > Could you point me to the tutorial you mentioned? > > Tim probably meant http://www.python.org/doc/current/tut/tut.html > > http://diveintopython.org/ also comes highly recommended. > > -- > Richie Hindle > richie@entrian.com > Thanks very much, Richie. Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above From Mark.Wolfgram at plexus.com Tue Nov 11 16:05:15 2003 From: Mark.Wolfgram at plexus.com (Mark Wolfgram) Date: Tue Nov 11 16:05:22 2003 Subject: [Spambayes] Retrieve a deleted/noted SPAM email Message-ID: <8779A067212B474293557E47366A353A0459E53C@neen-mail-005.na.plexus.com> I accidentally hit the "Delete as SPAM" button and I now need to retrieve the email. How can I do that? thanks, Mark Wolfgram Sales Support Manager Plexus Corp. 55 Jewelers Park Drive PO Box 156 Neenah, WI 54957-0156 phone: 920-751-3202 fax: 920-751-5395 e-mail: mark.wolfgram@plexus.com www.plexus.com From rmalayter at bai.org Tue Nov 11 16:54:18 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Tue Nov 11 16:54:23 2003 Subject: [Spambayes] ageing out database entries Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C383F@cliff.bai.org> > From: Seth Goodman > Subject: RE: [Spambayes] ageing out database entries > [Seth Goodman] > > > Is there an optimal total corpus size for training? > > [Ryan Malayter] > > Not really, but evidence seems to suggest that a thousand > or messages in > > seems to work well. However, 10,000 or more messages seems > to decrease > > the capture rates somewhat. I don't remember exactly where I read that rule-of-thumb, but I recall it came from this list a wile back. I did some searching through the list archives, and came up with a dew messages that mentioned the 10,000 number as a threshold of sorts. Perhaps there are just diminishing (or no) increases in accuracy above this point, and accuracy doesn't actually get worse. Here's the list: http://mail.python.org/pipermail/spambayes/2002-November/001705.html http://mail.python.org/pipermail/spambayes/2002-November/001752.html http://mail.python.org/pipermail/spambayes/2002-October/000997.html From brett at realestate-school.com Tue Nov 11 17:13:43 2003 From: brett at realestate-school.com (Brett Brewer) Date: Tue Nov 11 17:12:36 2003 Subject: [Spambayes] possible feature request and PHP question Message-ID: To whom it may concern, I LOVE spambayes and really appreciate the fact that this is an open-source package. Thanks very much! I also have 2 questions: 1. have you considered implementing the ideas presented in the Paul Graham article on spam filters that fight back: http://www.paulgraham.com/ffb.html 2. Do you think you python code could be translated to PHP? I don't know Python, but I know the hell out of PHP and I wonder how hard it would be to translate so that the source could be modified my the millions of PHP programmers out there. I know there's an add-on for PHP that will let you write windows apps, but apparently it's not very efficient. Any thoughts on this? Thanks, Brett Brewer From skip at pobox.com Tue Nov 11 17:28:47 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Nov 11 17:28:54 2003 Subject: [Spambayes] possible feature request and PHP question In-Reply-To: References: Message-ID: <16305.25247.271337.547221@montanaro.dyndns.org> Brett> 1. have you considered implementing the ideas presented in the Brett> Paul Graham article on spam filters that fight back: Brett> http://www.paulgraham.com/ffb.html Yes, that has been tried. Richard Jowsey's stuff is mentioned on the SB website: http://spambayes.sourceforge.net/related.html Brett> 2. Do you think you python code could be translated to PHP? I Brett> don't know Python, but I know the hell out of PHP and I wonder Brett> how hard it would be to translate so that the source could be Brett> modified my the millions of PHP programmers out there. I know Brett> there's an add-on for PHP that will let you write windows Brett> apps, but apparently it's not very efficient. Any thoughts on Brett> this? I suppose you could translate it to PHP. This is probably the wrong group to ask about that though, since those of us here who write programs mostly write in Python. Rather than effectively forking the SpamBayes code into Python and PHP versions, your time might be better spent learning Python. Looking at just the .py files in the SpamBayes CVS tree I see over 150 files with a total of about 40,000 lines (well-commented code in general, but still a *lot* of Python). That includes code for a number of different SpamBayes applications (proxy servers for POP3 and IMAP, Outlook plugin, Unix-y filter stuff, etc) and lots of code to support testing changes to the system. That doesn't even begin to scratch the surface, because SpamBayes makes heavy use of a lot of stuff in the standard Python library, including the email package, the new sets module, and the config file parser, just to name a few. Skip From sgandenberger at cincinnati-test.com Tue Nov 11 17:32:21 2003 From: sgandenberger at cincinnati-test.com (Scott J. Gandenberger) Date: Tue Nov 11 17:32:37 2003 Subject: [Spambayes] Compatibility Message-ID: I have a user that has SpamBayes Outlook Plug-in installed. He recently bought a new Axim PDA from Dell which came with a copy of Outlook 2003. Shortly after installing Outlook 2003 which replaced his Outlook 2000, SpamBayes stopped working. Is the current SpamBayes Outlook plug-in Outlook 2003 compatible? If it is, can you tell me why after uninstalling and re-installing SpamBayes, the SpamBayes toolbar buttons will not work? We have been using SpamBayes Outlook Plug-in for about 3 months on about 5 machines without a single false positive. It has been a great time saver for us. Thanks, Scott Scott Gandenberger IS Manager Cincinnati Test Systems, Inc. From hank at marinar.com Tue Nov 11 17:48:52 2003 From: hank at marinar.com (hank@marinar.com) Date: Tue Nov 11 17:48:59 2003 Subject: [Spambayes] notate_subject In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F29B6@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1303A3C939@its-xchg4.massey.ac.nz> <1ED4ECF91CDED24C8D012BCF2B034F13026F29B6@its-xchg4.massey.ac.nz> Message-ID: <20031111224852.GE18670@hank.marinar.com> On Mon, Oct 20, 2003 at 07:52:36PM +1300, Tony Meyer wrote: > """ > *** hammie.py Mon Oct 20 19:51:31 2003 > --- hammie2.py Mon Oct 20 19:47:20 2003 > *************** > *** 116,121 **** > --- 116,125 ---- > else: > is_spam = False > disp = options["Headers", "header_unsure_string"] > + if disp in options["Headers", "notate_subject"]: > + subj = msg["Subject"] > + del msg["Subject"] > + msg["Subject"] = "%s %s" % (disp, subj) > if train: > self.train(msg, is_spam, True) > disp += ("; %."+str(options["Headers", > "header_score_digits"])+"f") % Prob > """ First delve into Python, but here goes. I by no means mean to nitpick, but it would be very nice to see the Prob score in the Subject as well. I couldn't get it to work and also look to the config file for which disp to notate, so I hard coded it to make only messages classified as 'spam' appear in the subject with the score. Here is how I do it: """ --- hammie.py.orig Fri Nov 7 09:20:37 2003 +++ hammie.py Tue Nov 11 17:40:55 2003 @@ -130,6 +130,10 @@ disp += " (%d)"%x del msg[header] msg.add_header(header, disp) + if is_spam: + subj = msg["Subject"] + del msg["Subject"] + msg["Subject"] = "[%s] %s" % (disp, subj) if debug: disp = self.formatclues(clues) del msg[debugheader] """ So far, I am VERY happy with how this filter is performing. I am now rolling it out to the rest of my clients, and so far it is well received. From anthony at interlink.com.au Tue Nov 11 22:24:43 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Nov 11 22:25:33 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: <200311120324.hAC3OiTC001445@localhost.localdomain> >>> "Seth Goodman" wrote > I certainly agree and have no desire to rush in and break something that > works. My suggestion was based on the above post, not from any observations > of my own. If you can think of something useful for me to do to help out > the SpamBayes effort, please point me in that direction. Large training sets were shown to lead to a slight decrease in accuracy, but we're talking tens of thousands of messages in either direction. It'll be in the spambayes list archives, back in the dark ages. Anthony -- Anthony Baxter It's never too late to have a happy childhood. From tim.one at comcast.net Tue Nov 11 22:38:05 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Nov 11 22:38:15 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: [Seth Goodman] > Here's the post where I got the idea that large training sets are > bad. No one disputed this when it was posted, so I mistook it for > fact. I hope you don't believe everything you get in email . Seriously, there are too many misconceptions about everything on the Web for a sane person to bother disputing any but the most egregious. > ... > I certainly agree and have no desire to rush in and break something > that works. My suggestion was based on the above post, not from any > observations of my own. If you can think of something useful for me > to do to help out the SpamBayes effort, please point me in that > direction. There you go: >> I suggest you wait until you have a real problem before trying to >> solve it. What *bothers* you about SpamBayes? What doesn't work right, or what was too hard to figure out, or what's still too confusing? What's missing? Open source is overwhelmingly driven by people scratching their own itches. You find something *you* want to improve, because your own experience and judgment says there's something lacking, and then you improve it. If the itch is strong enough, nothing can stop you. Docs always need improving, there's a backlog of mysterious & rare problems reported against the Outlook addin, and there's a steady trickle of people looking for a multi-user server-side version of SpamBayes. If you want the user community to drive what you do, those are the ones I hear about most often. Since this isn't a business, most of us don't have a marketing plan, or overriding vision, beyond keeping our own inboxes clear of spam. By the miracle of open source , that gets to extend to other peoples' inboxes too, and we never have to make a change for any artificial reason (it's not like we're going to get an influx of revenue if we force people to upgrade ...). The core classification engine (tokenization & scoring) is something many newcomers wonder about, but those who persist long enough to run actual tests against their ideas are rare. The few "new idea" tests reported against over the last couple months have shown results so weak that they're hard to distinguish from statistical noise. I can't remember the last time testing a change in strategy actually produced a clear winner -- the glory days there appear to be long gone. Still, we know we're missing the "white on white" (more generally, foreground close-to background color) HTML obfuscation trick, and it would be good to catch that; OTOH, that requires parsing and semantic analysis a good deal beyond what we're doing now. >> SpamBayes is written 100% in Python. Since you know C, you'd find >> the Python tutorial easy going, and would be doing interesting >> things in the language the same day you start learning it. > Fair enough. Could you point me to the tutorial you mentioned? Somebody already did (thanks!). I recommended the Python tutorial that comes with Python specifically because you have a strong C background. The complaint we hear about the Python tutorial most often from Python newcomers is that it seems to assume you already know C . Much of the Python standard library consists of wrappers around the C library functions you already know, although often with a twist to squash gratuitous x-platform differences in behavior. From tim.one at comcast.net Tue Nov 11 22:38:06 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Nov 11 22:38:19 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: <200311120324.hAC3OiTC001445@localhost.localdomain> Message-ID: [Anthony Baxter] > Large training sets were shown to lead to a slight decrease in > accuracy, but we're talking tens of thousands of messages in either > direction. It'll be in the spambayes list archives, back in the dark > ages. Was that result unique to you, Anthony? I ran tens of thousands of messages, and didn't see that. Guido ran tens of thousands but didn't keep testing long enough to report anything. Who else ran against test sets that large? And you always got somewhat different results because you mixed ham from multiple users, right? From gautams at aztec.soft.net Tue Nov 11 23:55:52 2003 From: gautams at aztec.soft.net (Gautam Sabba) Date: Wed Nov 12 00:00:22 2003 Subject: [Spambayes] Plugin fails to load Message-ID: Hi, I've installed Spambayes for Outlook XP. I've been using it for a couple of months now and things were just fine till yesterday. When trying to start up Outlook I got an error message saying that a "serious error occurred with spambayes and would I like to disable it". I did and that's the last I saw of Spambayes :-(. I see the plugin listed in the COM Add-ins... menu but it is unchecked. Checking it and re-starting Outlook doesn't seem to help. It remains unchecked. I've tried all the tricks on the troubleshooting page but I think I've hit a wall. Can anyone help me? I also tried deleting the item from COM Add-ins... Menu, re-adding (pointed to the dll) and checking the newly added item. This hangs Outlook. Even after uninstalling, the toolbar remained. I deleted it and re-installed. Now, I don't see the toolbar anymore and it is not listed in the toolbar menu. TIA, Gautam From usr7-28hl at xemaps.com Wed Nov 12 01:54:36 2003 From: usr7-28hl at xemaps.com (Keith Russell) Date: Wed Nov 12 01:54:45 2003 Subject: [Spambayes] [POP3 Version] Training Failure Message-ID: <20031111235415.E0DC.USR7-28HL@xemaps.com> Hi, everyone. I've recently started getting the following errors when I try to train using the Web interface: Spambayes Web Interface: Home > Review 500 Server error Traceback (most recent call last): File "F:\PROGRA~1\Python\Lib\site-packages\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "F:\PROGRA~1\Python\Lib\site-packages\spambayes\ProxyUI.py", line 310, in onReview targetCorpus.takeMessage(id, sourceCorpus) File "F:\PROGRA~1\Python\Lib\site-packages\spambayes\Corpus.py", line 201, in takeMessage self.addMessage(msg) File "F:\PROGRA~1\Python\Lib\site-packages\spambayes\FileCorpus.py", line 143, in addMessage Corpus.Corpus.addMessage(self, message) File "F:\PROGRA~1\Python\Lib\site-packages\spambayes\Corpus.py", line 136, in addMessage obs.onAddMessage(message) File "F:\PROGRA~1\Python\Lib\site-packages\spambayes\storage.py", line 571, in onAddMessage self.train(message) File "F:\PROGRA~1\Python\Lib\site-packages\spambayes\storage.py", line 579, in train self.bayes.learn(message.tokenize(), self.is_spam) File "F:\PROGRA~1\Python\Lib\site-packages\spambayes\classifier.py", line 211, in learn self._add_msg(wordstream, is_spam) File "F:\PROGRA~1\Python\Lib\site-packages\spambayes\classifier.py", line 346, in _add_msg self._wordinfoset(word, record) File "F:\PROGRA~1\Python\Lib\site-packages\spambayes\storage.py", line 270, in _wordinfoset self.db[word] = record.__getstate__() File "F:\PROGRA~1\Python\lib\shelve.py", line 130, in __setitem__ self.dict[key] = f.getvalue() File "F:\PROGRA~1\Python\lib\bsddb\__init__.py", line 120, in __setitem__ self.db[key] = value DBRunRecoveryError: (-30982, 'DB_RUNRECOVERY: Fatal error, run database recovery -- fatal region error detected; run recovery') I thought at first that it was because I had switched from using "net start pop3proxy" to "python sb_server.py -b" to start SpamBayes. However, I switched back to using pop3proxy and I'm still getting the errors. I'm using 1.0a6 under Windows XP Pro and IE 6.0. Thanks for your help. From RHavermahl at ielinc.com Wed Nov 12 08:32:03 2003 From: RHavermahl at ielinc.com (Ray Havermahl) Date: Wed Nov 12 08:32:29 2003 Subject: [Spambayes] How to remove Spambayes completely Message-ID: How do I remove Spambayes from Outlook? It did not install properly, and I want to remove it so that I can perform a fresh install. Ray Havermahl Engineering/Design Independent Engineering Laboratories, inc. www.ielinc.com Phone (734) 426-9833 Fax (734) 426-9838 From tschill at earthlink.net Wed Nov 12 08:43:02 2003 From: tschill at earthlink.net (Tom Schill) Date: Wed Nov 12 08:43:00 2003 Subject: [Spambayes] Outlook 2000 Message-ID: Since installing Spambayes 0.81, I cannot configure the Outlook Today screen. Clicking on the 'Customize Outlook Today' simply does nothing. Is this a known problem? Regards, Tom Schill From Mark.Howells at softoption.com Wed Nov 12 08:59:00 2003 From: Mark.Howells at softoption.com (Mark Howells) Date: Wed Nov 12 09:00:00 2003 Subject: [Spambayes] Outlook 2000 Message-ID: <5846CF419D2EF5439036CC3126A3A995017B26@SOSERVER1.softoption.local> Not something I have ever done ... but it does appear to be broken on my system too. I'm not going to lose a lot of sleep over it tho. Regards Mark Howells > -----Original Message----- > From: Tom Schill [mailto:tschill@earthlink.net] > Sent: 12 November 2003 13:43 > To: spambayes@python.org > Subject: [Spambayes] Outlook 2000 > > > Since installing Spambayes 0.81, I cannot configure the Outlook Today > screen. Clicking on the 'Customize Outlook Today' simply does > nothing. Is > this a known problem? > > Regards, > Tom Schill > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > > -- > Incoming mail is certified Virus Free. > Checked by AVG Anti-Virus (http://www.grisoft.com). > Version: 7.0.195 / Virus Database: 260.2.10 - Release Date: 11/10/2003 > > -- Outgoing mail is certified Virus Free. Checked by AVG Anti-Virus (http://www.grisoft.com). Version: 7.0.195 / Virus Database: 260.2.10 - Release Date: 11/10/2003 From rickm at wtainc.com Wed Nov 12 09:09:00 2003 From: rickm at wtainc.com (Rick R. Monnin) Date: Wed Nov 12 09:09:24 2003 Subject: [Spambayes] SpamBayes just stopped working Message-ID: <000001c3a926$8b369f50$7101010a@rickmlap> Greetings. I have been using SpamBayes for some time now and it has saved me a great deal of time. Thanks! However, after installing Microsoft hot fix KB824145 yesterday, SpamBayes no longer functions. When I click on the SpamBayes icon to bring up the configuration page, nothing happens. Also spam is no longer filtered. I tried uninstalling/reinstalling, but it seems that Outlook doesn't even see SpamBayes any longer as it is no longer listed in the add-on manager. In fact after reinstalling, the SpamBayes icon toolbar is missing as well. Please help. I am using Windows XP Sp1, version .81 of SpamBayes, with Outlook 2002. This is all that is listed in the log files: Registered: SpamBayes.OutlookAddin Registration complete. What I am doing wrong? Thanks. Rick Monnin From papaDoc at videotron.ca Wed Nov 12 09:21:54 2003 From: papaDoc at videotron.ca (papaDoc) Date: Wed Nov 12 09:21:59 2003 Subject: [Spambayes] Follow-up on trying to use spambayes-1.0a7 In-Reply-To: <3FAA5CB0.8000509@simlog.com> References: <3FA96901.3080404@simlog.com> <5vsiqv0d97em3ain6gnlv926hjm5p10p7s@4ax.com> <3FAA5CB0.8000509@simlog.com> Message-ID: <3FB24202.7050802@videotron.ca> Hi, I was trying to install Spambayes 1.0a7 and I was getting this error message: 500 Server error Traceback (most recent call last): File "c:\Devtools\spambayes\spambayes\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "c:\Devtools\spambayes\spambayes\spambayes\UserInterface.py", line 684, in onChangeopts self.reReadOptions() File "c:\Devtools\spambayes\spambayes\spambayes\ProxyUI.py", line 530, in reReadOptions state = self.state_recreator() File "C:\Devtools\Python\PYTHON~1.2\Scripts\SB_SER~1.PY", line 736, in _recreateState prepare(state) File "C:\Devtools\Python\PYTHON~1.2\Scripts\SB_SER~1.PY", line 752, in prepare state.createWorkers() File "C:\Devtools\Python\PYTHON~1.2\Scripts\SB_SER~1.PY", line 623, in createWorkers self.bayes = storage.open_storage(filename, self.useDB) File "c:\Devtools\spambayes\spambayes\spambayes\storage.py", line 677, in open_storage return klass(data_source_name) File "c:\Devtools\spambayes\spambayes\spambayes\storage.py", line 162, in __init__ self.load() File "c:\Devtools\spambayes\spambayes\spambayes\storage.py", line 187, in load self.dbm = dbmstorage.open(self.db_name, self.mode) File "c:\Devtools\spambayes\spambayes\spambayes\dbmstorage.py", line 60, in open return f(db_name, mode) File "c:\Devtools\spambayes\spambayes\spambayes\dbmstorage.py", line 20, in open_dbhash return bsddb.hashopen(*args) error: (22, 'Invalid argument') I solved my problem with PYTHONPATH and using the python I want (moving order of the directories in my path variable). and I was still getting the error. I was looking in the dmstorage.py and I saw that def open_dbhash(*args): """Open a bsddb hash. Don't use this on Windows, unless Python 2.3 or greater is used, in which case bsddb3 is actually named bsddb.""" I was using Python-2.2.2 on windows oups I upgraded to Python-2.3.2 and everything is fine. I think we should have a nice message saying "DON'T DO THAT. UPGRADE...." instead of an exception Remi P.S, Thanks to all who helped me on this. From kennypitt at hotmail.com Wed Nov 12 09:28:27 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Nov 12 09:28:57 2003 Subject: [Spambayes] [POP3 Version] Training Failure In-Reply-To: <20031111235415.E0DC.USR7-28HL@xemaps.com> Message-ID: Keith Russell wrote: > Hi, everyone. > > I've recently started getting the following errors when I try to train > using the Web interface: > > DBRunRecoveryError: (-30982, 'DB_RUNRECOVERY: Fatal error, run > database recovery -- fatal region error detected; run recovery') > > > I thought at first that it was because I had switched from using "net > start pop3proxy" to "python sb_server.py -b" to start SpamBayes. > However, > I switched back to using pop3proxy and I'm still getting the errors. > > I'm using 1.0a6 under Windows XP Pro and IE 6.0. This is a known problem currently being tracked in bug #807217 (see this url, which will probably wrap: http://sourceforge.net/tracker/index.php?func=detail&aid=807217&group_id =61702&atid=498103) This has been a difficult problem to track down as noone seems able to come up with a consistent way to reproduce it. Do you remember doing anything like closing your browser or moving to a different page while SpamBayes was doing training? One report has suggested that this might cause the problem, but it appears that there may be other scenarios as well. It is unlikely, although possible, that the change in the way you start SpamBayes had something to do with the problem. We do know that once you get into this state, it won't go away on it's own since it has basically corrupted a portion of your training database. You may be able to get around the problem by getting rid of your "message_info_database.db" file and letting SpamBayes recreate it. This seems to be the file most often responsible for the error. I suggest just moving the file to a backup directory instead of deleting it until you know for sure if it clears things up. Unless you have changed your configuration, the file should be located in a directory like "C:\Documents and Settings\username\Application Data\SpamBayes\Proxy". -- Kenny Pitt From ASalomon at leviton.com Wed Nov 12 09:34:58 2003 From: ASalomon at leviton.com (Salomon, Allyn) Date: Wed Nov 12 09:35:08 2003 Subject: [Spambayes] (no subject) Message-ID: I am using Mac OS9. Is there a way to use the Bayesian filter on Outlook on my machine? If yes, are there directions for the setup of the solution you recommend? Allyn Salomon Leviton Manufacturing Co., Inc. Marketing Communications *718-281-6665 *718-281-6252 *asalomon@leviton.com From wsy at merl.com Wed Nov 12 09:46:43 2003 From: wsy at merl.com (Bill Yerazunis) Date: Wed Nov 12 09:47:51 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: References: Message-ID: <200311121446.hACEkhs30918@localhost.localdomain> From: "Tim Peters" [Anthony Baxter] > Large training sets were shown to lead to a slight decrease in > accuracy, but we're talking tens of thousands of messages in either > direction. It'll be in the spambayes list archives, back in the dark > ages. Was that result unique to you, Anthony? I ran tens of thousands of messages, and didn't see that. Guido ran tens of thousands but didn't keep testing long enough to report anything. Who else ran against test sets that large? And you always got somewhat different results because you mixed ham from multiple users, right? I ran some specific tests to verify or refute this, and in the case where you allow the database/hashfile/whatever to grow without bound, there's very little impact on accuracy between TOE (Train Only Errors) and TEFT (Train Every Thing). The accuracy loss happens when you limit the maximum size of your database. You then have to start throwing away stuff from the database; the problem of "how to remember what you can afford to forget" becomes a major issue. That may well be the source of the legend- it's not that learning it is bad, but it's the forgetting to make room for other things that's the bad part. -Bill Yerazunis From ljenkins at imtco.com Wed Nov 12 10:14:44 2003 From: ljenkins at imtco.com (Lyle Jenkins) Date: Wed Nov 12 10:11:34 2003 Subject: [Spambayes] New User Questions - Login Window / Slow Startup Message-ID: <001f01c3a92f$b43cdaf0$9001a8c0@Triplewood.imtco.com> I started using SpamBayes on Monday. I am running version .81 with Windows 2000 SP3 and Outlook 200 SR-1 9.0.0.3821. It is working VERY well at catching spam. Issue 1: On Tuesday, when I started up Outlook a Login Window with testuser as the User ID came up. I can click on "cancel" to continue. Is a password required? How can I set my own User Id and password? How can I eliminate the window? Issue 2: Each time I startup Outlook it seems to take longer. Does SpamBayes examine all email in my Inbox every time Outlook starts? Should I move all my mail out of the Inbox to other folders? Lyle L. Jenkins IT Coordinator Integrated Marketing Technologies, Inc. PH: 330.225.3550 x2405 FX: 330.225.4605 Email: LJenkins@imtco.com From gruetzmacher at ais-dresden.de Wed Nov 12 10:50:30 2003 From: gruetzmacher at ais-dresden.de (=?iso-8859-1?Q?=22Gr=FCtzmacher=2C_Lukas=22?=) Date: Wed Nov 12 10:50:41 2003 Subject: [Spambayes] Spam Clues: Re: index.cgi redirection Message-ID: <08C94E00610F2E43A406811D00880C9737D0D6@exchserv.ais> Hi Spambayes gurus, I'm using SpamBayes Outlook Plugin since a few weeks. I'm subscribed on the webtool list (bugzilla). Even I have trained SpamBayes with many (over 100) mails from this list as good the most of them are identified as "possible spam". I'm currently not able to understand why. 1) Can you explain me what parts of the Spam Clues are calculated to reach the 0.386739 (below) for the mail ? (I could not found any description in the documentation !?) 2) Is it a problem of SpamBayes or of the list or of my configuration ? My SpamBayes Version is 0.81. I'm using Outlook 2000. If you need more info please drop me a note. Thanks a lot in adcance, Lukas Spam Score: 39% (0.386739) word spamprob #ham #spam '*H*' 0.252683 - - '*S*' 0.0261612 - - 'thanks.' 0.0229609 116 0 'gerv' 0.051145 49 0 'users' 0.0563025 44 0 'wrote' 0.0784543 30 0 'valid' 0.129344 16 0 'reference.' 0.150224 13 0 'email addr:ripley.netscape.com...' 0.158768 12 0 'tony' 0.158768 12 0 'message-id:@ripley.netscape.com' 0.178801 124 2 'from:addr:netspace.net.au' 0.191429 9 0 'from:addr:tonytay' 0.191429 9 0 'from:name:tony tay' 0.191429 9 0 'reply-to:addr:netspace.net.au' 0.191429 9 0 'reply-to:addr:tonytay' 0.191429 9 0 'reply-to:name:tony tay' 0.191429 9 0 'somewhere' 0.205521 8 0 'page,' 0.221854 7 0 'webserver' 0.221854 7 0 'email addr:mozilla.org>' 0.241007 6 0 'tay' 0.241007 6 0 'email name:<gerv' 0.26378 5 0 '"gervase' 0.291304 4 0 'markham"' 0.291304 4 0 'index.cgi' 0.325243 3 0 'index.cgi"' 0.325243 3 0 'skills' 0.325243 3 0 'show' 0.335038 52 2 'subject:: ' 0.360905 881 40 'cgi' 0.368132 2 0 'redirection' 0.368132 2 0 'subject:index.cgi' 0.368132 2 0 'subject:redirection' 0.368132 2 0 'subject:.' 0.377811 247 12 'skip:i 10' 0.39963 281 15 'proto:http' 0.614138 989 127 'sender:no real name:2**0' 0.61633 277 36 'can' 0.61691 415 54 'skip:m 10' 0.622438 315 42 'skip:_ 40' 0.629628 247 34 'are' 0.631922 418 58 'you' 0.63923 601 86 'header:Date:1' 0.647151 912 135 'header:From:1' 0.647151 912 135 'looking' 0.648891 39 6 'header:Return-Path:1' 0.653197 888 135 'header:Message-ID:1' 0.656727 738 114 'your' 0.664352 381 61 '>' 0.67241 192 32 'to:addr:mozilla-webtools' 0.674642 202 34 'to:addr:mozilla.org' 0.674642 202 34 'list' 0.678021 246 42 'then' 0.67905 151 26 'to:no real name:2**0' 0.679622 677 116 'url:mozilla' 0.682229 195 34 'such' 0.686906 72 13 'header:Errors-To:1' 0.687754 190 34 'url:org' 0.687915 218 39 'header:Organization:1' 0.695641 183 34 'url:mail' 0.697927 181 34 'sender:addr:mozilla.org' 0.701386 178 34 'mailing' 0.701803 204 39 'sender:addr:mozilla-webtools-admin' 0.702546 177 34 'email addr:mozilla.org' 0.704878 175 34 'url:listinfo' 0.704878 175 34 'email name:mozilla-webtools' 0.70605 174 34 'url:mozilla-webtools' 0.708406 172 34 'wrote:' 0.735492 141 32 'skip:& 10' 0.748577 144 35 '"i' 0.760306 2 1 'remove' 0.779962 47 14 'make' 0.792355 128 40 'header:Received:3' 0.812182 236 83 'instead.' 0.819863 6 3 'header:Reply-To:1' 0.822864 118 45 'set' 0.834317 84 35 Message Stream: X-MS-Mail-Gibberish: Microsoft Mail Internet Headers Version 2.0 Received: from internetserver ([10.1.1.2]) by exchserv.ais with Microsoft SMTPSVC(5.0.2195.6713); Wed, 12 Nov 2003 16:19:08 +0100 Received: from rheet.mozilla.org ([207.126.111.202]) by internetserver with Microsoft SMTPSVC(5.0.2195.6713); Wed, 12 Nov 2003 16:24:17 +0100 Received: from rheet.mozilla.org (localhost.localdomain [127.0.0.1]) by rheet.mozilla.org with ESMTPoe id hACFJ3fP016539; Wed, 12 Nov 2003 07:19:03 -0800 Path: secnews.netscape.com!not-for-mail From: "Tony Tay" Newsgroups: netscape.public.mozilla.webtools Subject: Re: index.cgi redirection Date: Thu, 13 Nov 2003 02:12:13 +1100 Organization: Another Netscape Collabra Server User Lines: 17 Message-ID: References: Reply-To: "Tony Tay" NNTP-Posting-Host: dsl-203-113-202-153.VIC.netspace.net.au X-Priority: 3 X-MSMail-Priority: Normal X-Newsreader: Microsoft Outlook Express 6.00.2720.3000 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2600.0000 Xref: secnews.netscape.com netscape.public.mozilla.webtools:14876 To: mozilla-webtools@mozilla.org Sender: mozilla-webtools-admin@mozilla.org Errors-To: mozilla-webtools-admin@mozilla.org X-BeenThere: mozilla-webtools@mozilla.org X-Mailman-Version: 2.0.13 Precedence: bulk List-Help: List-Post: List-Subscribe: , List-Id: Mozilla Webtools List-Unsubscribe: , Return-Path: mozilla-webtools-admin@mozilla.org X-OriginalArrivalTime: 12 Nov 2003 15:24:18.0678 (UTC) FILETIME=[0A3D3160:01C3A931] Is there a howto somewhere i can reference. My CGI skills are a little wanting. Thanks. "Gervase Markham" wrote in message news:botgg5$gu43@ripley.netscape.com... > Tony Tay wrote: > > > What do i have to do to make sure the "i think you are looking for > > index.cgi" does not show and users are show index.html.tmpl instead. > > Set up your webserver such that index.cgi is a valid name for an index > page, and then remove index.html. > > Gerv > _______________________________________________ mozilla-webtools mailing list mozilla-webtools@mozilla.org http://mail.mozilla.org/listinfo/mozilla-webtools Re: index.cgi redirection

Is there a howto somewhere i can reference.  My CGI skills are a little
wanting.  Thanks.

"Gervase Markham" <gerv@mozilla.org> wrote in message
news:botgg5$gu43@ripley.netscape.com...
> Tony Tay wrote:
>
> > What do i have to do to make sure the "i think you are looking for
> > index.cgi" does not show and users are show index.html.tmpl instead.
>
> Set up your webserver such that index.cgi is a valid name for an index
> page, and then remove index.html.
>
> Gerv
>


_______________________________________________
mozilla-webtools mailing list
mozilla-webtools@mozilla.org
http://mail.mozilla.org/listinfo/mozilla-webtools

Message Tokens: 102 unique tokens '"gervase' '>' '"i' 'and' 'are' 'can' 'cc:none' 'cgi' 'content-type:text/plain' 'does' 'email addr:mozilla.org' 'email addr:mozilla.org>' 'email addr:ripley.netscape.com...' 'email name: Subject: Re: index.cgi redirection Date: Wed, 12 Nov 2003 16:12:13 +0100 Size: 2973 Url: http://mail.python.org/pipermail/spambayes/attachments/20031112/0d605166/attachment.mht From kennypitt at hotmail.com Wed Nov 12 11:13:25 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Nov 12 11:13:56 2003 Subject: [Spambayes] Spam Clues: Re: index.cgi redirection In-Reply-To: <08C94E00610F2E43A406811D00880C9737D0D6@exchserv.ais> Message-ID: Gr?tzmacher, Lukas wrote: > Even I have trained SpamBayes with many (over 100) mails from this > list as good the most of them are identified as "possible spam". I'm > currently not able to understand why. > > 1) Can you explain me what parts of the Spam Clues are calculated to > reach the 0.386739 (below) for the mail ? (I could not found any > description in the documentation !?) 2) Is it a problem of SpamBayes > or of the list or of my configuration ? > > Spam Score: 39% (0.386739) > > > word spamprob #ham #spam > 'proto:http' 0.614138 989 127 > 'can' 0.61691 415 54 > 'are' 0.631922 418 58 > 'you' 0.63923 601 86 > 'header:Date:1' 0.647151 912 135 > 'header:From:1' 0.647151 912 135 > 'header:Return-Path:1' 0.653197 888 135 > 'header:Message-ID:1' 0.656727 738 114 > 'to:no real name:2**0' 0.679622 677 116 > 'header:Received:3' 0.812182 236 83 How many total hams and spams have you trained on? The clues I left above particularly stood out to me because you are getting relatively high spam probabilities even though the ham counts are much higher than the spam counts. This usually indicates that you have unbalanced training data where you have a lot more messages of one type than the other. In this case, I would guess several thousand hams vs. only a couple hundred spams. Unbalanced training data can cause accuracy problems, and in particular can make it difficult for additional training to overcome the effects of words that appear in both ham and spam. All probabilities are based on ratios, not absolute numbers. For a given word, the raw ham ratio is the number of times the word has been seen in a ham message divided by the total number of ham messages that have been trained. The raw spam ratio is computed the same way, and then the two ratios are combined to form the spamprob for that word. If you have trained on 2000 ham messages, then a word that has appeared 100 times would have a raw ham score of 0.05. If you have only trained on 200 spam messages then it only takes 10 occurences of the word in spam to get the same 0.05 score. -- Kenny Pitt From gruetzmacher at ais-dresden.de Wed Nov 12 11:27:34 2003 From: gruetzmacher at ais-dresden.de (=?iso-8859-1?Q?=22Gr=FCtzmacher=2C_Lukas=22?=) Date: Wed Nov 12 11:27:41 2003 Subject: [Spambayes] Spam Clues: Re: index.cgi redirection Message-ID: <08C94E00610F2E43A406811D00880C9705A5D0@exchserv.ais> The SpamBayes Manager reports the training status as about 1600 ham and 135 spam mails, even I think I had more spam mails. Do I understand you right: Because I have more ham then spam mails my training becomes unbalanced ? Lukas > -----Original Message----- > From: Kenny Pitt [mailto:kennypitt@hotmail.com] > Sent: Wednesday, November 12, 2003 5:13 PM > To: Gr?tzmacher, Lukas; spambayes@python.org > Subject: RE: [Spambayes] Spam Clues: Re: index.cgi redirection > > > Gr?tzmacher, Lukas wrote: > > Even I have trained SpamBayes with many (over 100) mails from this > > list as good the most of them are identified as "possible spam". I'm > > currently not able to understand why. > > > > 1) Can you explain me what parts of the Spam Clues are calculated to > > reach the 0.386739 (below) for the mail ? (I could not found any > > description in the documentation !?) 2) Is it a problem of SpamBayes > > or of the list or of my configuration ? > > > > Spam Score: 39% (0.386739) > > > > > > word spamprob #ham #spam > > 'proto:http' 0.614138 989 127 > > 'can' 0.61691 415 54 > > 'are' 0.631922 418 58 > > 'you' 0.63923 601 86 > > 'header:Date:1' 0.647151 912 135 > > 'header:From:1' 0.647151 912 135 > > 'header:Return-Path:1' 0.653197 888 135 > > 'header:Message-ID:1' 0.656727 738 114 > > 'to:no real name:2**0' 0.679622 677 116 > > 'header:Received:3' 0.812182 236 83 > > How many total hams and spams have you trained on? The clues I left > above particularly stood out to me because you are getting relatively > high spam probabilities even though the ham counts are much > higher than > the spam counts. This usually indicates that you have unbalanced > training data where you have a lot more messages of one type than the > other. In this case, I would guess several thousand hams vs. only a > couple hundred spams. > > Unbalanced training data can cause accuracy problems, and in > particular > can make it difficult for additional training to overcome the > effects of > words that appear in both ham and spam. All probabilities > are based on > ratios, not absolute numbers. For a given word, the raw ham ratio is > the number of times the word has been seen in a ham message divided by > the total number of ham messages that have been trained. The raw spam > ratio is computed the same way, and then the two ratios are > combined to > form the spamprob for that word. If you have trained on 2000 ham > messages, then a word that has appeared 100 times would have a raw ham > score of 0.05. If you have only trained on 200 spam messages then it > only takes 10 occurences of the word in spam to get the same > 0.05 score. > > -- > Kenny Pitt > > From richie at entrian.com Wed Nov 12 11:33:53 2003 From: richie at entrian.com (Richie Hindle) Date: Wed Nov 12 11:34:15 2003 Subject: [Spambayes] (no subject) In-Reply-To: References: Message-ID: [Allyn] > I am using Mac OS9. Is there a way to use the Bayesian filter > on Outlook on my machine? If yes, are there directions for the > setup of the solution you recommend? Yes, you can run SpamBayes on MacOS 9. You can't use the Outlook plugin, but you can use the POP3 proxy or the IMAP filter (depending on which protocol you use to collect your email). Instructions are in the README: http://cvs.sourceforge.net/viewcvs.py/*checkout*/spambayes/spambayes/README.txt?rev=HEAD&content-type=text/plain -- Richie Hindle richie@entrian.com From kennypitt at hotmail.com Wed Nov 12 12:00:35 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Nov 12 12:01:16 2003 Subject: [Spambayes] Spam Clues: Re: index.cgi redirection In-Reply-To: <08C94E00610F2E43A406811D00880C9705A5D0@exchserv.ais> Message-ID: Gr?tzmacher, Lukas wrote: > The SpamBayes Manager reports the training status as about 1600 ham > and 135 spam mails, even I think I had more spam mails. > > Do I understand you right: Because I have more ham then spam mails my > training becomes unbalanced ? That's correct. In this case, the best solution is probably to train on more spam messages. You can do this from the Training tab in SpamBayes Manager. Select one or more directories containing spam messages that you've saved (save some for a while if you haven't already been keeping them) and don't select any folders for ham messages. Make sure you turn off the "Rebuild entire database" option and then start training. If necessary, you can repeat this several times as you collect additional spam messages. Your goal should be to get close to equal numbers of ham and spam. -- Kenny Pitt From m0davis at pacbell.net Wed Nov 12 12:32:43 2003 From: m0davis at pacbell.net (Martin Stone Davis) Date: Wed Nov 12 12:33:12 2003 Subject: [Spambayes] Can't start pop3proxy_service alpha 7 Message-ID: I'm using alpha 7 under Windows XP Pro SP1. Until recently, I was using alpha 6, and I managed to have sb_server.py working semi-okay for me. Now, I've upgraded to alpha 7, and I get the following error (see below). Note that a very similar error occurs whether I use sb_server, pop3proxy_service, or pop3proxy_tray. C:\Program Files\Python23\Scripts>pop3proxy_service.py install Traceback (most recent call last): File "C:\Program Files\Python23\Scripts\pop3proxy_service.py", line 82, in ? import sb_server File "C:\Program Files\Python23\Scripts\sb_server.py", line 100, in ? import spambayes.message File "C:\PROGRA~1\Python23\Lib\site-packages\spambayes\message.py", line 200, in ? msginfoDB = MessageInfoPickle(message_info_db_name) File "C:\PROGRA~1\Python23\Lib\site-packages\spambayes\message.py", line 134, in __init__ self.load() File "C:\PROGRA~1\Python23\Lib\site-packages\spambayes\message.py", line 146, in load self.db = pickle.load(fp) File "C:\PROGRA~1\Python23\lib\pickle.py", line 1390, in load return Unpickler(file).load() File "C:\PROGRA~1\Python23\lib\pickle.py", line 872, in load dispatch[key](self) KeyError: '\x00' Please help! -Martin From milt at necam.com Wed Nov 12 12:52:06 2003 From: milt at necam.com (Atkinson, Milt) Date: Wed Nov 12 12:52:40 2003 Subject: [Spambayes] Outlook 2000 Message-ID: Might be just a coincidence, but did you just do a IE update? ... See this ... http://support.microsoft.com/?kbid=820575 -----Original Message----- From: Tom Schill [mailto:tschill@earthlink.net] Sent: Wednesday, November 12, 2003 7:43 AM To: spambayes@python.org Subject: [Spambayes] Outlook 2000 Since installing Spambayes 0.81, I cannot configure the Outlook Today screen. Clicking on the 'Customize Outlook Today' simply does nothing. Is this a known problem? Regards, Tom Schill _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From sourceforge at rodland.no Wed Nov 12 13:15:09 2003 From: sourceforge at rodland.no (Fredrik Rodland) Date: Wed Nov 12 13:15:34 2003 Subject: [Spambayes] Outlook 2000 In-Reply-To: <5846CF419D2EF5439036CC3126A3A995017B26@SOSERVER1.softoption.local> Message-ID: this is _not_broken on my system outlook 2000, cvs version of spambayes 0.81 - updated today. F -- Fredrik R?dland Technical Architect, Stocknet, Oslo, Norway Stocknet: http://www.stocknet.com phone: +47 23 28 40 17 Private: http://rodland.no phone: +47 99 21 98 17 > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Mark Howells > Sent: 12. november 2003 14:59 > To: spambayes@python.org > Subject: RE: [Spambayes] Outlook 2000 > > > Not something I have ever done ... but it does appear to be > broken on my system too. I'm not going to lose a lot of sleep > over it tho. > > Regards > > Mark Howells > > > > -----Original Message----- > > From: Tom Schill [mailto:tschill@earthlink.net] > > Sent: 12 November 2003 13:43 > > To: spambayes@python.org > > Subject: [Spambayes] Outlook 2000 > > > > > > Since installing Spambayes 0.81, I cannot configure the Outlook Today > > screen. Clicking on the 'Customize Outlook Today' simply does > > nothing. Is > > this a known problem? > > > > Regards, > > Tom Schill > > > > > > _______________________________________________ > > Spambayes@python.org > > http://mail.python.org/mailman/listinfo/spambayes > > Check the FAQ before asking: http://spambayes.sf.net/faq.html > > > > -- > > Incoming mail is certified Virus Free. > > Checked by AVG Anti-Virus (http://www.grisoft.com). > > Version: 7.0.195 / Virus Database: 260.2.10 - Release Date: 11/10/2003 > > > > > > -- > Outgoing mail is certified Virus Free. > Checked by AVG Anti-Virus (http://www.grisoft.com). > Version: 7.0.195 / Virus Database: 260.2.10 - Release Date: 11/10/2003 > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html From milt at necam.com Wed Nov 12 08:51:10 2003 From: milt at necam.com (Atkinson, Milt) Date: Wed Nov 12 13:50:04 2003 Subject: [Spambayes] Outlook 2000 Message-ID: Might be just a coincidence, but did you just do a IE update? ... See this ... http://support.microsoft.com/?kbid=820575 -----Original Message----- From: Tom Schill [mailto:tschill@earthlink.net] Sent: Wednesday, November 12, 2003 7:43 AM To: spambayes@python.org Subject: [Spambayes] Outlook 2000 Since installing Spambayes 0.81, I cannot configure the Outlook Today screen. Clicking on the 'Customize Outlook Today' simply does nothing. Is this a known problem? Regards, Tom Schill _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From ljenkins at imtco.com Wed Nov 12 15:39:09 2003 From: ljenkins at imtco.com (Lyle Jenkins) Date: Wed Nov 12 15:35:54 2003 Subject: [Spambayes] Login Screen Not a Spambayes Problem Message-ID: <004701c3a95d$066179d0$9001a8c0@Triplewood.imtco.com> I am sorry about my Issue # 1 in my post to this list from earlier today. The password window that popped up was caused by my system admin installing an exchange4linux client on my machine. Lyle L. Jenkins IT Coordinator Integrated Marketing Technologies, Inc. PH: 330.225.3550 x2405 FX: 330.225.4605 Email: LJenkins@imtco.com From milt at necam.com Wed Nov 12 16:12:30 2003 From: milt at necam.com (Atkinson, Milt) Date: Wed Nov 12 16:12:44 2003 Subject: [Spambayes] Outlook 98 Message-ID: Some Outlook 98 users are complaining the Tool Bar for Spambayes is not available after they install and launch Outlook. Any ideas what might be causing this? From kennypitt at hotmail.com Wed Nov 12 16:34:24 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Nov 12 16:34:49 2003 Subject: [Spambayes] Outlook 98 In-Reply-To: Message-ID: Atkinson, Milt wrote: > Some Outlook 98 users are complaining the Tool Bar for Spambayes is > not available after they install and launch Outlook. Any ideas what > might be causing this? Unfortunately, Outlook 98 is not supported. You must use Outlook 2000/XP/2003 to use the Outlook plugin. If you are using POP3 or IMAP mail, then you should be able to set up sb_server or sb_imapfilter to work with Outlook 98. Currently, there is no simple installer for these like there is for the Outlook plugin, so a good bit of manual setup would be required. -- Kenny Pitt From eric at benefitservicesgroup.com Wed Nov 12 16:49:23 2003 From: eric at benefitservicesgroup.com (Eric C. Vogel) Date: Wed Nov 12 16:46:38 2003 Subject: [Spambayes] Outlook 98 Message-ID: I highly suggest upgrading to Outlook 2003, you will see a HUGH improvement. There were some nice touches in XP/2002. The minute or SBS 2003 Media Kit is avalible at our reseller, I am ordering it on the spot. I cannot wait to get Outlook 2003 (1 licience for every licence of SBS 2003 and 1 FrontPage 2003) I am rolling out Outlook 2003 and installing FrontPage 2003 on my PC. The server can wait durring the holiday when the boss is gone or in Janurary since I have to do a wipe and re-install on the server. (NT 4 had a 4 Gig limitation on the first parition. (SBS 4.x) and I upgraded to SBS 2000. So if I want a bigger partition and it is pretty full, I have to wipe. :( ) Eric Vogel -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Kenny Pitt Sent: Wednesday, November 12, 2003 4:34 PM To: 'Atkinson, Milt'; spambayes@python.org Subject: RE: [Spambayes] Outlook 98 Atkinson, Milt wrote: > Some Outlook 98 users are complaining the Tool Bar for Spambayes is > not available after they install and launch Outlook. Any ideas what > might be causing this? Unfortunately, Outlook 98 is not supported. You must use Outlook 2000/XP/2003 to use the Outlook plugin. If you are using POP3 or IMAP mail, then you should be able to set up sb_server or sb_imapfilter to work with Outlook 98. Currently, there is no simple installer for these like there is for the Outlook plugin, so a good bit of manual setup would be required. -- Kenny Pitt _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From rmalayter at bai.org Wed Nov 12 17:38:40 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Nov 12 17:38:44 2003 Subject: [Spambayes] Outlook 98 Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C3898@cliff.bai.org> [Eric Vogel] > I highly suggest upgrading to Outlook 2003, you will see a HUGH > improvement. Especially in the area of offline use. The "cached mode" is light-years ahead of any of the other offline synchronization options I've seen on any mail system. If you disconnect from the server, outlook simply doesn't care. When you reconnect, it synchronizes all of your folders in the background as CPU and network bandwidth permit, so it doesn't bug you. Very nice. Spambayes doesn't deal completely with the intermittent timing of cached mode, so I have to manually run my filters every once it a while. Hopefully that will be fixed in the near future, but even if it is not, cached mode is worth it. [Eric Vogel] > (NT 4 had a 4 Gig limitation on the first partition...it is > pretty full, I have to wipe :( This is even more off-topic, but NT4 actually didn't have such a limitation. It was just that the NT4 boot disk/CD setup program had a 4 GB, FAT-only limitation. If you formatted your hard disk on another NT machine, you could put it in a new machine and NT4 could install onto it. I had an 8 GB NTFS boot partition on one of my old NT4 computers. Of course MS didn't document this anywhere. More significantly, you don't have to wipe your boot partition. You can use Ghost or Partition Magic to move your installation to a new array or set of disks, even converting it to NTFS and a new cluster size along the way if you haven't done so already. A lot less hassle than a complete reinstall. If you want to talk about this process further, please mail me off-list. Regards, Ryan From listas at loquecreas.com Wed Nov 12 20:03:28 2003 From: listas at loquecreas.com (Pablo Vieira) Date: Wed Nov 12 20:04:01 2003 Subject: [Spambayes] Hello and Spanish version Message-ID: Hi, I'm new to the list. I just wanted to say hi and ask you guys if there is a Spanish version of Spambayes for Outlook. If not, I'd like to help in the translation. Best regards, Pablo From ks at obsideon.com Thu Nov 13 03:10:44 2003 From: ks at obsideon.com (Kirit Saelensminde) Date: Thu Nov 13 03:11:28 2003 Subject: [Spambayes] Great program Message-ID: It's a great program. Can't wait until it learns a bit more (just started this morning and didn't prep anything, but then I get 1400 spams a week so it shouldn't take long). Your Outlook integration instructions state that it isn't possible to add the Spam column to all folders at once. This kindof depends on how people use Outlook, but certainly on Outlook 2000 it's quite easy. Whilst in the Inbox (or any of the other folders that you want to show the Spam column for) go to View -> Current View -> Define Views... From here select the view that you want to add the column to (the usual view will be either Message or Messages with Autopreview) and then choose Modify and Fields. Add the Spam column and it will be added to the view definition. Any folder that uses this view will then automatically gain the Spam column. Any folders that use a customised view can be done as per your current instructions. Thanks for a great program, Kirit S?lensminde - Obsideon Limited http://www.obsideon.com/ ICQ: 10009115 MSN Messenger: ks (at) obsideon.com From stuartf at the-i-junction.com Thu Nov 13 04:00:42 2003 From: stuartf at the-i-junction.com (Stuart Forsyth) Date: Thu Nov 13 04:02:02 2003 Subject: [Spambayes] bounce? Message-ID: <002a01c3a9c4$9ecf3650$4282010a@theijunction.icelogic.co.za> Is there a feature to bounce spam in spambayes? Regards Stuart --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.538 / Virus Database: 333 - Release Date: 2003/11/10 From tim.one at comcast.net Thu Nov 13 09:15:25 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Nov 13 09:15:34 2003 Subject: [Spambayes] bounce? In-Reply-To: <002a01c3a9c4$9ecf3650$4282010a@theijunction.icelogic.co.za> Message-ID: [Stuart Forsyth] > Is there a feature to bounce spam in spambayes? No, and there won't be. The sender address in spam is almost always forged. At best, the forged address doesn't exist, so your attempt to bounce it bounces right back to you, and now the original piece of spam has clogged the net with two additional useless pieces of email. At worst, the forged address is a real one, and your bounce ends up in the inbox of an innocent third party. You really can't appreciate the pain this can cause until some spammer forges stuartf@the-i-junction.com as the sender address (want to wake up to find 10,000 bounces in your inbox, every day? along with assorted random threats because outraged victims believe *you're* the one sending farm porm spam to their kids? etc). BTW, some systems with this (mis)feature forge the sender address on the bounces it generates, so that *you* don't see the consequences of bouncing to a bogus or innocent address. That's even worse -- then you can talk yourself into thinking you're doing good, when at best you're increasing the spam-related load on the net, and at worst are harrassing innocent victims. The one thing a bounce-back gimmick can't do is bother the spammers. From stuartf at the-i-junction.com Thu Nov 13 09:19:33 2003 From: stuartf at the-i-junction.com (Stuart Forsyth) Date: Thu Nov 13 09:23:03 2003 Subject: [Spambayes] bounce? In-Reply-To: Message-ID: <004501c3a9f1$29b69570$4282010a@theijunction.icelogic.co.za> Thanks for putting it all in perspective for me ;-) >From now on I'll avoid that bounce like the plague. -----Original Message----- From: Tim Peters [mailto:tim.one@comcast.net] Sent: 13 November 2003 04:15 PM To: Stuart Forsyth; spambayes@python.org Subject: RE: [Spambayes] bounce? [Stuart Forsyth] > Is there a feature to bounce spam in spambayes? No, and there won't be. The sender address in spam is almost always forged. At best, the forged address doesn't exist, so your attempt to bounce it bounces right back to you, and now the original piece of spam has clogged the net with two additional useless pieces of email. At worst, the forged address is a real one, and your bounce ends up in the inbox of an innocent third party. You really can't appreciate the pain this can cause until some spammer forges stuartf@the-i-junction.com as the sender address (want to wake up to find 10,000 bounces in your inbox, every day? along with assorted random threats because outraged victims believe *you're* the one sending farm porm spam to their kids? etc). BTW, some systems with this (mis)feature forge the sender address on the bounces it generates, so that *you* don't see the consequences of bouncing to a bogus or innocent address. That's even worse -- then you can talk yourself into thinking you're doing good, when at best you're increasing the spam-related load on the net, and at worst are harrassing innocent victims. The one thing a bounce-back gimmick can't do is bother the spammers. --- Incoming mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.538 / Virus Database: 333 - Release Date: 2003/11/10 --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.538 / Virus Database: 333 - Release Date: 2003/11/10 From rcoe at CambridgeMA.GOV Thu Nov 13 11:07:58 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Thu Nov 13 11:08:08 2003 Subject: [Spambayes] RE: Great program Message-ID: Before you make that "quite easy" change, you'd better remember to scan every folder with Spambayes. If you don't, my experience is that Outlook will reject the Spam column, give you an error message, and turn the Spam column off in that view. To get it back, you'll have to edit the view again. You did try this procedure before you recommended it, right? ;^) Bob > -----Original Message----- > From: Kirit Saelensminde [mailto:ks@obsideon.com] > Sent: Thursday, November 13, 2003 3:11 AM > To: 'spambayes@python.org' > Subject: [Spambayes] Great program > > > > It's a great program. Can't wait until it learns a bit more > (just started > this morning and didn't prep anything, but then I get 1400 > spams a week so > it shouldn't take long). > > Your Outlook integration instructions state that it isn't > possible to add > the Spam column to all folders at once. This kindof depends > on how people > use Outlook, but certainly on Outlook 2000 it's quite easy. > > Whilst in the Inbox (or any of the other folders that you > want to show the > Spam column for) go to View -> Current View -> Define > Views... From here > select the view that you want to add the column to (the usual > view will be > either Message or Messages with Autopreview) and then choose > Modify and > Fields. Add the Spam column and it will be added to the view > definition. > > Any folder that uses this view will then automatically gain > the Spam column. > Any folders that use a customised view can be done as per your current > instructions. > > > Thanks for a great program, > > > > Kirit S?lensminde - Obsideon Limited > http://www.obsideon.com/ > > ICQ: 10009115 > MSN Messenger: ks (at) obsideon.com > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From PWalsh at Yantra.com Thu Nov 13 12:16:01 2003 From: PWalsh at Yantra.com (Pamela Walsh) Date: Thu Nov 13 12:17:47 2003 Subject: [Spambayes] I am a new spambayes user Message-ID: I have been a HUGE fan for about 3 weeks, training has gone well, and spams down to less than 10/day. Something happened over the weekend (?) Since Monday morning I still train every night but get the following message: Found 0 spam, 0 unsure, and 0 good ones (3050 errors) How do I get back to training w/good results? I am a pc novice, and have 0 technical skills/lingo. PLEASE HELP Pamela Walsh Human Resources Visa / Benefits Specialist T: 978 513 6117 F: 978 513 6016 Yantra Corporation Solutions for Fulfillment Excellence www.yantra.com -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 145 bytes Desc: Blank Bkgrd.gif Url : http://mail.python.org/pipermail/spambayes/attachments/20031113/af1d78a5/attachment.gif From arc39 at cam.ac.uk Thu Nov 13 12:47:12 2003 From: arc39 at cam.ac.uk (Adrian Cybriwsky) Date: Thu Nov 13 12:47:32 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. Message-ID: <000601c3aa0e$2aec2910$50f56f83@Cc642499b> Sorry, I didn't see this addressed anywhere else. After months of being a very happy spambayes user (when 98% good filtering was the norm) spambayes now seems to correctly filter less than 50% of my spam. The difference seems to be cleverly-written spam which SB's tokenizer doesn't seem to find any tokens for. In other words, the email comes as one big blob with the usual variation of disguising ip address and so forth using escape codes. Spambayes continues to give emails for the "Penis Patch" scores of about 7-9% despite the fact that I've nominally trained it on over 2000 of them (Yes, I get 200+ "penis patch" emails alone a day). Apologies for sending this here, but I could not find a better place to send this. I seem to have the latest ver of SB running on XP/Office. Keep up the great work! If you need samples of the spam, just email me. Cheers, Adrian From dreed at srdcorp.com Thu Nov 13 13:07:34 2003 From: dreed at srdcorp.com (Reed, David) Date: Thu Nov 13 13:11:58 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. Message-ID: <4C542552EF434E43B2F2572854A73CBD6F70BF@corp-mail1.srdcorp.com> Adrian, If they email you, will you get it, or will it get filtered as spam? ... David -----Original Message----- From: Adrian Cybriwsky [mailto:arc39@cam.ac.uk] Sent: Thursday, November 13, 2003 12:47 PM To: spambayes@python.org Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. Sorry, I didn't see this addressed anywhere else. After months of being a very happy spambayes user (when 98% good filtering was the norm) spambayes now seems to correctly filter less than 50% of my spam. The difference seems to be cleverly-written spam which SB's tokenizer doesn't seem to find any tokens for. In other words, the email comes as one big blob with the usual variation of disguising ip address and so forth using escape codes. Spambayes continues to give emails for the "Penis Patch" scores of about 7-9% despite the fact that I've nominally trained it on over 2000 of them (Yes, I get 200+ "penis patch" emails alone a day). Apologies for sending this here, but I could not find a better place to send this. I seem to have the latest ver of SB running on XP/Office. Keep up the great work! If you need samples of the spam, just email me. Cheers, Adrian _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From kennypitt at hotmail.com Thu Nov 13 13:11:02 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Thu Nov 13 13:12:10 2003 Subject: [Spambayes] I am a new spambayes user In-Reply-To: Message-ID: Pamela Walsh wrote: > Since Monday morning I still train every night but get the following > message: > Found 0 spam, 0 unsure, and 0 good ones (3050 errors) > > How do I get back to training w/good results? > I am a pc novice, and have 0 technical skills/lingo. I'll assume that you're running the Outlook plugin. I don't remember hearing about this problem before, so we'll need a little extra information in order to diagnose it. I'll try to walk you through the required steps. First, open SpamBayes Manager, go to the Advanced tab, and click the Diagnostics button. In the box labeled "Log file verbosity", change the value to 3 and then close out of the dialog and SpamBayes Manager. Now perform whatever training operation caused the error. If the error doesn't happen immediately, you'll have to wait until the next time you see the error to perform the remaining steps. Go back to the Diagnostics dialog as above. Change the "Log file verbosity" back to 0, and then click "View log". This will bring up the Notepad application with a log file containing lots of info that will be useful to us. Use the "File|Save As" menu option to save a copy of the file. Name it something easy to remember like "sb_log.txt", and put it in a directory that you will be able to find again later. You can now close Notepad, the Diagnostics dialog, and SpamBayes Manager. Finally, send another e-mail to the list and attach the log file that you just saved. In that e-mail, please also include the following information: * A description of how you went about performing the training. * A description of the exact problem as you saw it while recording the log file. * Your operating system version (Windows 98, Windows XP, etc) * Your Outlook version (Outlook 2000, etc.) * The SpamBayes version information from the General tab in SpamBayes Manager. -- Kenny Pitt From m0davis at pacbell.net Thu Nov 13 13:50:07 2003 From: m0davis at pacbell.net (Martin Stone Davis) Date: Thu Nov 13 13:49:24 2003 Subject: [Spambayes] Re: Can't start pop3proxy_service alpha 7 In-Reply-To: References: Message-ID: I'm still having this problem and no one responded, so I'm posting again. Martin Stone Davis wrote: > I'm using alpha 7 under Windows XP Pro SP1. Until recently, I was using > alpha 6, and I managed to have sb_server.py working semi-okay for me. > Now, I've upgraded to alpha 7, and I get the following error (see > below). Note that a very similar error occurs whether I use sb_server, > pop3proxy_service, or pop3proxy_tray. > > C:\Program Files\Python23\Scripts>pop3proxy_service.py install > Traceback (most recent call last): > File "C:\Program Files\Python23\Scripts\pop3proxy_service.py", line > 82, in ? > import sb_server > File "C:\Program Files\Python23\Scripts\sb_server.py", line 100, in ? > import spambayes.message > File "C:\PROGRA~1\Python23\Lib\site-packages\spambayes\message.py", > line 200, > in ? > msginfoDB = MessageInfoPickle(message_info_db_name) > File "C:\PROGRA~1\Python23\Lib\site-packages\spambayes\message.py", > line 134, > in __init__ > self.load() > File "C:\PROGRA~1\Python23\Lib\site-packages\spambayes\message.py", > line 146, > in load > self.db = pickle.load(fp) > File "C:\PROGRA~1\Python23\lib\pickle.py", line 1390, in load > return Unpickler(file).load() > File "C:\PROGRA~1\Python23\lib\pickle.py", line 872, in load > dispatch[key](self) > KeyError: '\x00' > > > Please help! > -Martin > > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From jimkent at pe.net Thu Nov 13 14:05:21 2003 From: jimkent at pe.net (Jim Kent) Date: Thu Nov 13 14:00:01 2003 Subject: [Spambayes] Delete as Spam problem Message-ID: <000601c3aa19$1a319ff0$de5b2640@JIM> Suddenly, whenever I try to "Delete As Spam" a message in my e-mail Inbox in MS Outlook, I get a message "Invalid Configuration. You must configure the Spam Folder." I don't have the slightest Idea of how to do this. From kennypitt at hotmail.com Thu Nov 13 14:31:49 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Thu Nov 13 14:32:21 2003 Subject: [Spambayes] Delete as Spam problem In-Reply-To: <000601c3aa19$1a319ff0$de5b2640@JIM> Message-ID: Jim Kent wrote: > Suddenly, whenever I try to "Delete As Spam" a message in my e-mail > Inbox in MS Outlook, I get a message "Invalid Configuration. You must > configure the Spam Folder." > > > > I don't have the slightest Idea of how to do this. >From the SpamBayes dropdown menu on the toolbar, go to SpamBayes Manager. Select the Filtering tab. In the box labeled Certain Spam, you should find a Browse button. When you click Browse, you will see a list of all of Outlook's folders. Just select the one that you want your spam moved to a then click OK. That should be all there is to it. -- Kenny Pitt From tommy at ilm.com Thu Nov 13 15:06:01 2003 From: tommy at ilm.com (Tommy Burnette) Date: Thu Nov 13 15:06:40 2003 Subject: [Spambayes] sb_mboxtrain.py reformatting email? Message-ID: <16307.58409.581150.801884@evoke.lucasdigital.com> Hi Folks, I've been very happily using spambayes with xemacs & vim for about a month now. When the latest version was announced I switched right up to it, but since then I've noticed a behaviour in sb_mboxtrain.py that is a bit of a pain. (no idea if this is new or not, I just notcied it recently). basically, when sb_mboxtrain.py rewrites the mbox to include its "X-Spambayes-Trained: ham" lines, it is re-formatting the mail headers. This wreaks havoc on the display in vm's summary mode and is a bit of a pain to fix. e.g. this line in a mail header: X-VM-v5-Data: ([nil nil nil nil nil nil nil nil nil] ["31722" "Tue" "12" "November" "2002" "15:05:56" "-0800" "Gayle Peterson" "gayle@lucasdigital.com" nil "457" "2003 Holiday Schedule" "^From:" nil nil "11" nil nil nil nil nil] nil) is being transformed into this: X-VM-v5-Data: ([nil nil nil nil nil nil nil nil nil] ["31722" "Tue" "12" "November" "2002" "15:05:56" "-0800" "Gayle Peterson" "gayle@lucasdigital.com" nil "457" "2003 Holiday Schedule" "^From:" nil nil "11" nil nil nil nil nil] nil) which, while neatly formatted, has introduced newlines into places like the From name, etc. and makes them display as: From: Gayle Peterson Subject: 2003 Holiday Schedule instead of: From: Gayle Peterson Subject: 2003 Holiday Schedule in vm. is there an option to either make sb_mboxtrain.py not rewrite the mbox file, or at least make it not re-format when it does? thanks in advance, From nanci.lewis at approval4u.com Thu Nov 13 15:20:45 2003 From: nanci.lewis at approval4u.com (Nanci.Lewis) Date: Thu Nov 13 15:21:02 2003 Subject: [Spambayes] (no subject) Message-ID: Hi. I have been seeing the incoming mail envelope icon on the lower right side of my computer but there is no mail in any of the folders. Why would this be? I didn't see this addressed on the faq's. Thank you. Nanci Lewis Vice President Coastal Financial (310) 376-4568 ext. 313 (310) 892-LOAN (5626) cell phone (310) 372-7668 fax 500 S. Sepulveda Blvd. #301 Manhattan Beach, CA 90266 www.approval4u.com From Allen.Wyatt at nokia.com Thu Nov 13 15:24:11 2003 From: Allen.Wyatt at nokia.com (Allen.Wyatt@nokia.com) Date: Thu Nov 13 15:24:23 2003 Subject: [Spambayes] Out-of-office assistant and SpamBayes Message-ID: <39AB02E248BA714CA1BD2B109F375A3BC3BEDA@daebe001.americas.nokia.com> I'll be on vacation in a couple weeks. Usually, I use Outlook's out-of-office assistant tool to automatically reply to any emails received while on vacation with a "I'm out and will be back on ..." message. How will SpamBayes interact with the out-of-office assistant settings? Will anyone who spams me get the reply email saying I'm out of the office, thus being able to verify that my email address is a valid one they should send lots more spam to? From mtiller at ford.com Thu Nov 13 15:52:25 2003 From: mtiller at ford.com (Tiller, Michael (M.M.)) Date: Thu Nov 13 15:52:44 2003 Subject: [Spambayes] Out-of-office assistant and SpamBayes Message-ID: Come to think of it, I was out all last week. Normally when I leave I get tons of messages when my out of office replies to spam are bounced. I didn't get any last week. Wow, how did SpamBayes do that?!?!? Another great feature of SpamBayes! -- Mike > -----Original Message----- > From: Allen.Wyatt@nokia.com [mailto:Allen.Wyatt@nokia.com] > Sent: Thursday, November 13, 2003 3:24 PM > To: spambayes@python.org > Subject: [Spambayes] Out-of-office assistant and SpamBayes > > > I'll be on vacation in a couple weeks. Usually, I use > Outlook's out-of-office assistant tool to automatically reply > to any emails received while on vacation with a "I'm out and > will be back on ..." message. How will SpamBayes interact > with the out-of-office assistant settings? Will anyone who > spams me get the reply email saying I'm out of the office, > thus being able to verify that my email address is a valid > one they should send lots more spam to? > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From listas at loquecreas.com Thu Nov 13 15:54:59 2003 From: listas at loquecreas.com (Pablo Vieira) Date: Thu Nov 13 15:55:39 2003 Subject: [Spambayes] I18N and L10N Message-ID: Hi again, I want to insist on the internationalisation and localisation of Spambayes. Do you guys have any plans about this? Any easy way to translate the software (such as a text file containing entries in the menus, dialog boxes, help files, etc.?). I'm mostly interested in the Outlook version, but will happily collaborate on the other variants as well. My mother tongue is Spanish. Regards, Pablo From cweisbrod at cogeco.ca Thu Nov 13 16:01:23 2003 From: cweisbrod at cogeco.ca (cweisbrod@cogeco.ca) Date: Thu Nov 13 16:01:32 2003 Subject: [Spambayes] .db files Message-ID: <63340-220031141321123308@M2W081.mail2web.com> Hi there, I've recently started using SpamBayes and have had great success in finally avoiding unsolicited email. The only problem I have with SpamBayes is the significant overhead required to get the code running. Installing the Python binaries first and then having a process that consumes 10-15 Mb of runtime memory is less than optimal. I am a software engineer with 9+ years of experience and I fully understand the need for a project like this to be written in a platform-independant manner. However, I would really like to write a scaled-down version of this for the Windows platform. I'm quite certain that I can write a self-installing/uninstalling service that implements a POP/SMTP proxy, performs the statistical processing, and exposes user interfaces for training and configuring. I have a mathematics and physics degree, so the statistics stuff shouldn't be too difficult to implement. The entire implementation would be contained in a single executable file much less than 1 Mb in size (I don't use MFC). Of course this executable would have to create its' own config files, spambayes.messageinfo.db, hammie.db, etc. That's where I need some information. The best implementation would make use of existing spambayes.messageinfo.db and hammie.db files so that any existing training is not lost. However, I need to know how these files are organized. Short of studying the Python source for answers, is there any specific documentation on the organization of these files? If not, would somebody involved with this project be willing to provide me with this information? If I can get this type of information, I would be happy to make my upcoming development efforts available to the SpamBayes project. I suspect many people would really appreciate the option I'm suggesting. Best regards, Clint Weisbrod. -------------------------------------------------------------------- mail2web - Check your email from the web at http://mail2web.com/ . From kennypitt at hotmail.com Thu Nov 13 16:27:19 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Thu Nov 13 16:28:07 2003 Subject: [Spambayes] .db files In-Reply-To: <63340-220031141321123308@M2W081.mail2web.com> Message-ID: cweisbrod@cogeco.ca wrote: > Hi there, > > I've recently started using SpamBayes and have had great success in > finally avoiding unsolicited email. The only problem I have with > SpamBayes is the significant overhead required to get the code > running. Installing the Python binaries first and then having a > process that consumes 10-15 Mb of runtime memory is less than optimal. > > I am a software engineer with 9+ years of experience and I fully > understand the need for a project like this to be written in a > platform-independant manner. However, I would really like to write a > scaled-down version of this for the Windows platform. I'm quite > certain that I can write a self-installing/uninstalling service that > implements a POP/SMTP proxy, performs the statistical processing, and > exposes user interfaces for training and configuring. I have a > mathematics and physics degree, so the statistics stuff shouldn't be > too difficult to implement. The entire implementation would be > contained in a single executable file much less than 1 Mb in size (I > don't use MFC). Of course this executable would have to create its' > own config files, spambayes.messageinfo.db, hammie.db, etc. That's > where I need some information. If this is the type of implementation you're looking for, you might be better served by the K9 filter here: http://keir.net/k9.html. It is not open source but it is freeware, and the executable is only about 70k and uses very little memory. > The best implementation would make use of existing > spambayes.messageinfo.db and hammie.db files so that any existing > training is not lost. However, I need to know how these files are > organized. Short of studying the Python source for answers, is there > any specific documentation on the organization of these files? If > not, would somebody involved with this project be willing to provide > me with this information? The .db files use the Berkeley DB engine from Sleepy Cat, and IIRC the actual data records are stored in a python-specific format. Your best bet would be to use the sb_dbexpimp script to export your training data into a flat file, from which it would be easy to parse and convert to any format you want. Be careful if you try to convert the data for a different Bayesian engine because SpamBayes adds special tokens based on the structure of the message that probably wouldn't be understood by a different engine. K9, for example, also adds special tokens to the training data but in a different format and with different meanings. -- Kenny Pitt From richie at entrian.com Thu Nov 13 17:07:49 2003 From: richie at entrian.com (Richie Hindle) Date: Thu Nov 13 17:08:13 2003 Subject: [Spambayes] Re: Can't start pop3proxy_service alpha 7 In-Reply-To: References: Message-ID: [Martin] > I'm still having this problem and no one responded, so I'm posting again. I have it on my list of things to look at, but it's a long list. 8-) In the short term, if you're not using IMAP filter, you can probably work around it by moving aside the spambayes.messageinfo.db file. Please don't delete the file - we might ask for a copy of it at some point, to try to track down the underlying cause. -- Richie Hindle richie@entrian.com From adam.walker at rbwconsulting.com Thu Nov 13 18:02:39 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Thu Nov 13 18:03:03 2003 Subject: [Spambayes] .db files In-Reply-To: References: Message-ID: <3FB40D8F.70109@rbwconsulting.com> Kenny Pitt wrote: >cweisbrod@cogeco.ca wrote: > > >>Hi there, >> >>I've recently started using SpamBayes and have had great success in >>finally avoiding unsolicited email. The only problem I have with >>SpamBayes is the significant overhead required to get the code >>running. Installing the Python binaries first and then having a >>process that consumes 10-15 Mb of runtime memory is less than optimal. >> >> >> >If this is the type of implementation you're looking for, you might be >better served by the K9 filter here: http://keir.net/k9.html. It is not >open source but it is freeware, and the executable is only about 70k and >uses very little memory. > > > I just downloaded and installed K9. The exe is small but it's memory footprint on my system is 4388K right after startup. Spambayes pop3proxy_tray footprint is 7236K after its been running all day. >>The best implementation would make use of existing >>spambayes.messageinfo.db and hammie.db files so that any existing >>training is not lost. However, I need to know how these files are >>organized. Short of studying the Python source for answers, is there >>any specific documentation on the organization of these files? If >>not, would somebody involved with this project be willing to provide >>me with this information? >> >> > >The .db files use the Berkeley DB engine from Sleepy Cat, and IIRC the >actual data records are stored in a python-specific format. Your best >bet would be to use the sb_dbexpimp script to export your training data >into a flat file, from which it would be easy to parse and convert to >any format you want. > > The format depends on which db is install on your system. But it is most likely Berkeley DB. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 261 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031113/79d3e38f/attachment.bin From m0davis at pacbell.net Thu Nov 13 18:10:01 2003 From: m0davis at pacbell.net (Martin Stone Davis) Date: Thu Nov 13 18:09:22 2003 Subject: [Spambayes] Re: Can't start pop3proxy_service alpha 7 In-Reply-To: References: Message-ID: Richie Hindle wrote: > [Martin] > >>I'm still having this problem and no one responded, so I'm posting again. > > > I have it on my list of things to look at, but it's a long list. 8-) > > In the short term, if you're not using IMAP filter, you can probably work > around it by moving aside the spambayes.messageinfo.db file. Please don't > delete the file - we might ask for a copy of it at some point, to try to > track down the underlying cause. > Okay, I'll be patient. By the way, I *did* try to do a completely clean install. I actually already deleted (sorry!) hammie.db, spambayes.messageinfo.db, along with the ini file, and maybe some other things. I then did setup.py build, setup.py install, and then tried sb_server.py install. That's where it failed. I'll try uninstalling python, deleting everything, and starting over. I'll post again with the results. -Martin From m0davis at pacbell.net Thu Nov 13 18:24:41 2003 From: m0davis at pacbell.net (Martin Stone Davis) Date: Thu Nov 13 18:39:27 2003 Subject: [Spambayes] Re: Can't start pop3proxy_service alpha 7 In-Reply-To: References: Message-ID: Martin Stone Davis wrote: > I'll try uninstalling python, deleting everything, and starting over. > I'll post again with the results. > Results: same thing :( -Martin From anumber1 at swbell.net Thu Nov 13 23:23:50 2003 From: anumber1 at swbell.net (Rob Anderson) Date: Thu Nov 13 23:23:59 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. Message-ID: I am having the same problem. I have 311 ham and 1093 spam in my training database yet SB only catches about 50% of the incoming spam. I read somewhere that people were bragging about 98% of their spam being filter by SB. I am using the defaults of 15% and 90%. I've been using the Outlook plug-in v0.81 for a couple months now with Outlook 2000 on a WinXP Home machine. Any suggestions would be appreciated. Thanks. Rob Anderson From david at rebirthing.co.nz Thu Nov 13 23:45:56 2003 From: david at rebirthing.co.nz (David McNab) Date: Thu Nov 13 23:48:10 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: References: Message-ID: <3FB45E04.4060902@rebirthing.co.nz> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Rob Anderson wrote: | I am having the same problem. I have 311 ham and 1093 spam in my training | database yet SB only catches about 50% of the incoming spam. Same with me. I knew it was only a matter of time before the spammers started installing/training some bayesian filters, and learning how to compose their messages to distort the histogram and pass as 'ham'. Didn't expect it to happen *quite* so quickly though. | Any suggestions would be appreciated. Maybe time to switch to a challenge/response system - one that when receiving messages, sends a 'challenge' email to the sender, asking them to reply. Once replies are received, senders end up on the whitelist, and their emails are delivered without further challenge. There are various commercial host-based services, such as www.spamarrest.com. Also, software such as Active Spam Killer - www.paganini.net/ask. I'd suggest switching to challenge/response based filtering. That'll keep the spammers at bay for a few months. It's an arms race: * users install pattern-based spam filters ~ * spammers learn to circumvent the monitored patterns, eg ~ with words like 'v1@grA'. * users install bayesian filters ~ * spammers learn how to manipulate the histograms * users install text-based challenge/response ~ * spammers install parsers/heuristics to auto-reply * users install image/audio-based challenge/response ~ * spammers install neural-net OCR software * users install software with more sophisticated Turing Test ~ * spammers upgrade their AI software ... Cheers David -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.1 (GNU/Linux) Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org iD8DBQE/tF4EUuEuev/PjrcRAs+jAKChNEpVuzO6hNFuYKhWFOHjpujZVACbBzhK WAKX7mTjlvZaMyrkR1OAk/s= =epkG -----END PGP SIGNATURE----- From tim.one at comcast.net Fri Nov 14 01:55:57 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Nov 14 01:55:58 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: Message-ID: [Rob Anderson] > I am having the same problem. I have 311 ham and 1093 spam Start by training on more ham. > in my training database yet SB only catches about 50% of the incoming > spam. That's indeed atrocious. > I read somewhere that people were bragging about 98% of their spam > being filter by SB. That should be achievable. > I am using the defaults of 15% and 90%. For a reason? That is, if all the spam it doesn't "catch" shows up in the Uncertain folder with a score of 89%, you don't get any sympathy for refusing to reduce your spam cutoff to 88% . > I've been using the Outlook plug-in v0.81 for a couple months now with > Outlook 2000 on a WinXP Home machine. > > Any suggestions would be appreciated. Pick a low-scoring spam you're sure any spam filter *should* have nailed. Use your mouse to select SpamBayes -> Show spam clues for current message Mail the email message that generates to the list (not to me directly, please -- I can't promise to make time to look at it; someone here should be able to, though). From tim.one at comcast.net Fri Nov 14 02:18:27 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Nov 14 02:18:26 2003 Subject: [Spambayes] .db files In-Reply-To: <63340-220031141321123308@M2W081.mail2web.com> Message-ID: [cweisbrod@cogeco.ca] > I've recently started using SpamBayes and have had great success in > finally avoiding unsolicited email. The only problem I have with > SpamBayes is the significant overhead required to get the code > running. Installing the Python binaries first and then having a > process that consumes 10-15 Mb of runtime memory is less than optimal. Are you running on a TRS-80 ? > I am a software engineer with 9+ years of experience and I fully > understand the need for a project like this to be written in a > platform-independant manner. However, I would really like to write a > scaled-down version of this for the Windows platform. I'm quite > certain that I can write a self-installing/uninstalling service that > implements a POP/SMTP proxy, performs the statistical processing, and > exposes user interfaces for training and configuring. I have a > mathematics and physics degree, so the statistics stuff shouldn't be > too difficult to implement. The entire implementation would be > contained in a single executable file much less than 1 Mb in size (I > don't use MFC). Neither do we, at least not in this project. > Of course this executable would have to create its own config files, > spambayes.messageinfo.db, hammie.db, etc. That's where I need some > information. Others have addressed that, and it's complicated. I'll warn you in advance that the memory and processing power don't go into the statistics, though: the memory mainly goes into caching a potentially huge number of token statistics records in memory to make it bearably fast. If you dig up each token from an on-disk database each time you see it, it probably won't run fast enough for high-volume users to tolerate. You might overcome that by writing your own database implementation, specialized to the particular data formats this thing needs to access, one not burdened (as is the Sleepycat DB) by needing to pay to be all things to all people. You should also know that the actual scoring is a tiny part of the code, and quite straightforward. Parsing email, tokenization, dealing with a gazillion incompatible email systems, and putting up a user-friendly UI are each harder than the mathematical part, and the code for each dwarfs the total code for all the computational parts. BTW, despite the use of a memory cache on top of the Sleepycat DB, the classifier runs several times faster still if I eliminate Sleepycat entirely and run the entire database in memory (as one large Python dictionary). For a large (but not unreasonably large) token database, that can easily suck up 50MB of RAM just for the dict. For perspective, I happened to be window-shopping for a new home computer this week, and even entry-level systems come with 128MB of RAM these days; I'll probably get at least 4x that much. 50MB is about 10% of that -- it won't even be worth thinking about idly for me anymore. Not that there's anything wrong with lean & mean. It just went out of style around the time you started your career . From rcoe at CambridgeMA.GOV Fri Nov 14 07:33:54 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Fri Nov 14 07:34:04 2003 Subject: [Spambayes] RE: .db files Message-ID: I hate to rain on your parade, but most people aren't going to care. Modern computers relegate size and speed to secondary considerations, at least in the context we're talking about here. To us, reliability and compatibility are an order of magnitude more important, and specialized development efforts like the one you're proposing go against that grain. Right now, Spambayes is suffering from too many bugs and too few people to solve them. If you contribute a new piece of optimized code, people may appreciate its efficiency, but it's unlikely that anyone will help you maintain it. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: cweisbrod@cogeco.ca [mailto:cweisbrod@cogeco.ca] > Sent: Thursday, November 13, 2003 4:01 PM > To: spambayes@python.org > Subject: [Spambayes] .db files > > > Hi there, > > I've recently started using SpamBayes and have had great > success in finally > avoiding unsolicited email. The only problem I have with > SpamBayes is the > significant overhead required to get the code running. Installing the > Python binaries first and then having a process that consumes > 10-15 Mb of > runtime memory is less than optimal. > > I am a software engineer with 9+ years of experience and I > fully understand > the need for a project like this to be written in a > platform-independant > manner. However, I would really like to write a scaled-down > version of this > for the Windows platform. I'm quite certain that I can write a > self-installing/uninstalling service that implements a POP/SMTP proxy, > performs the statistical processing, and exposes user interfaces for > training and configuring. I have a mathematics and physics > degree, so the > statistics stuff shouldn't be too difficult to implement. The entire > implementation would be contained in a single executable file > much less > than 1 Mb in size (I don't use MFC). Of course this > executable would have > to create its' own config files, spambayes.messageinfo.db, > hammie.db, etc. > That's where I need some information. > > The best implementation would make use of existing > spambayes.messageinfo.db > and hammie.db files so that any existing training is not > lost. However, I > need to know how these files are organized. Short of studying > the Python > source for answers, is there any specific documentation on > the organization > of these files? If not, would somebody involved with this project be > willing to provide me with this information? > > If I can get this type of information, I would be happy to > make my upcoming > development efforts available to the SpamBayes project. I suspect many > people would really appreciate the option I'm suggesting. > > Best regards, > > Clint Weisbrod. > > -------------------------------------------------------------------- > mail2web - Check your email from the web at > http://mail2web.com/ . > > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From papaDoc at videotron.ca Fri Nov 14 07:36:18 2003 From: papaDoc at videotron.ca (papaDoc) Date: Fri Nov 14 07:36:22 2003 Subject: [Spambayes] .db files In-Reply-To: References: Message-ID: <3FB4CC42.6060306@videotron.ca> Hi, >Not that there's anything wrong with lean & mean. > I hope ! >It just went out of style around the time you started your career . > I will come back in 20 years (like in fashions) It also depens where you work. I work in simulation and every calculation and memory you can save make your simulator more appealing since you can do more with less. (Do you prefer to pay 3000$ for a simulator you can run on a Current Standard PC or pay 250 000$ for a simulator that will do the same thing on a bigger screan) (See www.simlog.com) Remi From rmalayter at bai.org Fri Nov 14 08:36:30 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Fri Nov 14 08:36:37 2003 Subject: [Spambayes] .db files Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C392A@cliff.bai.org> > From: Tim Peters > Not that there's anything wrong with lean & mean. > It just went out of style around the time you started > your career . And there are those (like my boss) who would say that programming "lean and mean" in straight C without the use of MFC or other easy-to-use libraries is wasteful of the most expensive resource in the computer world: the hourly wages of the programmer. The reason languages like VB and Java have been so embraced is not efficiency or elegance, it's their ability to enable rapid application development. Hardware is cheap (at least since 1990 or so). People are not. Regards, Ryan From wsy at merl.com Fri Nov 14 09:00:51 2003 From: wsy at merl.com (Bill Yerazunis) Date: Fri Nov 14 09:01:05 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: <3FB45E04.4060902@rebirthing.co.nz> (message from David McNab on Fri, 14 Nov 2003 17:45:56 +1300) References: <3FB45E04.4060902@rebirthing.co.nz> Message-ID: <200311141400.hAEE0pP07342@localhost.localdomain> From: David McNab Rob Anderson wrote: | I am having the same problem. I have 311 ham and 1093 spam in my training | database yet SB only catches about 50% of the incoming spam. Same with me. I'd check to see if you're having a configuration problem. CRM114 is running essentially the same code (ok, maybe a 2x improvement according to the SB torture-test corpus) and I'm still solidly in "ivory snow" accuracy. Maybe time to switch to a challenge/response system - one that when receiving messages, sends a 'challenge' email to the sender, asking them to reply. Once replies are received, senders end up on the whitelist, and their emails are delivered without further challenge. See "CAMRAM", which does this only when a Bayesian says "I can't tell for sure". I'd suggest switching to challenge/response based filtering. That'll keep the spammers at bay for a few months. Except that there's no path back to the spammer for the challenge; they do their best to cover their tracks as their IP address is the one thing that can cost them their upstream connectivity. thus, C/R systems _will_ stop them. The problem is that C/R will annoy the rest of us. -Bill Yerazunis From dreed at srdcorp.com Fri Nov 14 09:03:49 2003 From: dreed at srdcorp.com (Reed, David) Date: Fri Nov 14 09:06:52 2003 Subject: [Spambayes] .db files Message-ID: <4C542552EF434E43B2F2572854A73CBD6F70CE@corp-mail1.srdcorp.com> OUCH!!! (Personally, I prefer lean and mean myself...but then again, I started on the TRS-80's when I was about 10!) >>Not that there's anything wrong with lean & mean. It just went out of style >>around the time you started your career . _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From dreed at srdcorp.com Fri Nov 14 09:05:55 2003 From: dreed at srdcorp.com (Reed, David) Date: Fri Nov 14 09:08:56 2003 Subject: [Spambayes] RE: .db files Message-ID: <4C542552EF434E43B2F2572854A73CBD6F70CF@corp-mail1.srdcorp.com> Much as it distresses me, I have to agree with Bob. Seems to me the major force in hardware advancement is software gluttoney, however. :) But my first OS was the TRS-80 Color DOS and TRS-DOS, so what do I know? David -----Original Message----- From: Coe, Bob [mailto:rcoe@CambridgeMA.GOV] Sent: Friday, November 14, 2003 7:34 AM To: spambayes@python.org Subject: [Spambayes] RE: .db files I hate to rain on your parade, but most people aren't going to care. Modern computers relegate size and speed to secondary considerations, at least in the context we're talking about here. To us, reliability and compatibility are an order of magnitude more important, and specialized development efforts like the one you're proposing go against that grain. Right now, Spambayes is suffering from too many bugs and too few people to solve them. If you contribute a new piece of optimized code, people may appreciate its efficiency, but it's unlikely that anyone will help you maintain it. Bob MIS Department, City of Cambridge 831 Massachusetts Ave, Cambridge MA 02139 ? 617-349-4217 ? fax 617-349-6165 > -----Original Message----- > From: cweisbrod@cogeco.ca [mailto:cweisbrod@cogeco.ca] > Sent: Thursday, November 13, 2003 4:01 PM > To: spambayes@python.org > Subject: [Spambayes] .db files > > > Hi there, > > I've recently started using SpamBayes and have had great > success in finally > avoiding unsolicited email. The only problem I have with > SpamBayes is the > significant overhead required to get the code running. Installing the > Python binaries first and then having a process that consumes > 10-15 Mb of > runtime memory is less than optimal. > > I am a software engineer with 9+ years of experience and I > fully understand > the need for a project like this to be written in a > platform-independant > manner. However, I would really like to write a scaled-down > version of this > for the Windows platform. I'm quite certain that I can write a > self-installing/uninstalling service that implements a POP/SMTP proxy, > performs the statistical processing, and exposes user interfaces for > training and configuring. I have a mathematics and physics > degree, so the > statistics stuff shouldn't be too difficult to implement. The entire > implementation would be contained in a single executable file > much less > than 1 Mb in size (I don't use MFC). Of course this > executable would have > to create its' own config files, spambayes.messageinfo.db, > hammie.db, etc. > That's where I need some information. > > The best implementation would make use of existing > spambayes.messageinfo.db > and hammie.db files so that any existing training is not > lost. However, I > need to know how these files are organized. Short of studying > the Python > source for answers, is there any specific documentation on > the organization > of these files? If not, would somebody involved with this project be > willing to provide me with this information? > > If I can get this type of information, I would be happy to > make my upcoming > development efforts available to the SpamBayes project. I suspect many > people would really appreciate the option I'm suggesting. > > Best regards, > > Clint Weisbrod. > > -------------------------------------------------------------------- > mail2web - Check your email from the web at > http://mail2web.com/ . > > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From bkc at murkworks.com Fri Nov 14 10:23:30 2003 From: bkc at murkworks.com (Brad Clements) Date: Fri Nov 14 09:31:13 2003 Subject: [Spambayes] .db files In-Reply-To: References: <63340-220031141321123308@M2W081.mail2web.com> Message-ID: <3FB4A0C9.22499.40A9831@localhost> On 14 Nov 2003 at 2:18, Tim Peters wrote: > Not that there's anything wrong with lean & mean. It just went out of style > around the time you started your career . Which is a damn shame, too. -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From dhealy at houston.rr.com Fri Nov 14 09:31:26 2003 From: dhealy at houston.rr.com (dhealy) Date: Fri Nov 14 09:31:40 2003 Subject: [Spambayes] Lost Spam folder Message-ID: I am new to SpamBayes and have created a problem I hope I can fix. I have accidentally deleted the Spam folder from MS Outlook. I recreated the folder, but SpamBayes apparently can't find it. SpamBayes is apparently working, I got 28 messages this morning, one in the folder Possible Spam, three in my inbox, but none in the Spam folder I recreated. When I look in the Possible Spam folder, select a message, and click Delete as Spam, the message leaves the Possible Spam folder, but doesn't show up in the Spam folder. Is there some way to point SpamBayes to the Spam folder without reinstalling and retraining? Before I installed SpamBayes the first time I had saved 2,000 spams for training. I have since deleted those. Thanks, Dan H. dhealy@houston.rr.com From kennypitt at hotmail.com Fri Nov 14 09:38:06 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Nov 14 09:46:18 2003 Subject: [Spambayes] .db files In-Reply-To: <3FB40D8F.70109@rbwconsulting.com> Message-ID: Adam Walker wrote: > Kenny Pitt wrote: >> If this is the type of implementation you're looking for, you might >> be better served by the K9 filter here: http://keir.net/k9.html. It >> is not open source but it is freeware, and the executable is only >> about 70k and uses very little memory. > > I just downloaded and installed K9. The exe is small but it's memory > footprint on my system is 4388K right after startup. > Spambayes pop3proxy_tray footprint is 7236K after its been running > all day. My numbers were a little different. When I first start K9, I get Mem Usage 3,136K and VM Size 2,744K. When I first start pop3proxy_tray I get Mem Usage 9,904K and VM Size 7,368K. After the first time I retrieve mail through pop3proxy_tray the memory goes up to Mem Usage 11,268K and VM Size 7,936K. I don't have any accounts set up to filter through K9 right now, so can't check that statistic. A large percentage of the memory used by K9 or SpamBayes is for the training data, so actual footprint will always depend on how many messages you have trained. My current training data has 18,453 good words and 28,013 spam words in K9. SpamBayes doesn't display the token counts, but I have 283 ham messages and 107 spam messages trained. Personally, I still prefer SpamBayes. To me, these memory differences aren't enough to be significant, and being written in a compiled language (I assume C++ for K9, but don't know for sure) doesn't always guarantee improved performance. SpamBayes has always been plenty efficient for me. -- Kenny Pitt From seant at iname.com Fri Nov 14 09:54:57 2003 From: seant at iname.com (Sean True) Date: Fri Nov 14 10:01:19 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: <3FB45E04.4060902@rebirthing.co.nz> Message-ID: <001501c3aabf$4916c3d0$0201a8c0@swapwizard.com> > Rob Anderson wrote: > | I am having the same problem. I have 311 ham and 1093 spam > in my training > | database yet SB only catches about 50% of the incoming spam. > Here's statistics from a one day run using the out of the box InBoxer stats as the starting spot. I don't consider a system fully trained until I've processed mail with it for about two weeks at these incoming mail rates. Conditions: DIAGNOSTIC mgr.bayes.nspam 1176 DIAGNOSTIC mgr.bayes.nham 943 DIAGNOSTIC mgr.config.filter.unsure_threshold 15 DIAGNOSTIC mgr.config.filter.spam_threshold 90 Decisions made initially: DIAGNOSTIC mgr.config.filter.nham_seen 1141 DIAGNOSTIC mgr.config.filter.nspam_seen 190 DIAGNOSTIC mgr.config.filter.nunsure_seen 91 Decisions made: DIAGNOSTIC mgr.config.filter.unsure2spam 75 DIAGNOSTIC mgr.config.filter.unsure2ham 16 DIAGNOSTIC mgr.config.filter.nham_retrained 0 DIAGNOSTIC mgr.config.filter.nspam_retrained 5 Very few spam made it into my InBox. Zero, actually. The 5 spam that were false positives were all one liners from an automated system. 80% of the unsures were actually spam. Same Spambayes engine as in 0.81 The differences may be accounted for by: - Rob get's worse spam than I do. (Seems unlikely, I actively seek out entertaining spam sources) - Rob's stats are unbalanced (a lot more spam than ham) (Tim already suggested that) - Having generic stats collected by someone else to start with provides a lot of protection against spam you haven't seen yet (could be, that's why I collect spam) From glenn.w.hoffman at lmco.com Fri Nov 14 10:11:39 2003 From: glenn.w.hoffman at lmco.com (Hoffman, Glenn W) Date: Fri Nov 14 10:11:46 2003 Subject: [Spambayes] Spambayes fails to install with error message (Windows XP Home Edition Message-ID: Me too. I recently updated from 0.7 to 0.81 of the Outlook plug-in and observed this warning. No ill effects observed from the failure. I am using Windows 2000 with SP 4. Outlook 2000 SP 3. Is this just an installer defect, or something more serious? Glenn -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of Mike Piff Sent: Sunday, October 19, 2003 10:24 AM To: spambayes@python.org Subject: [Spambayes] Spambayes fails to install with error message (Windows XP Home Edition When I run the installer I get a warning box saying that dllRegister service failed to register xxx.dll. Does anyone know what this means? I don't appear to have a dllregister service! Mike From cweisbrod at cogeco.ca Fri Nov 14 11:09:38 2003 From: cweisbrod at cogeco.ca (cweisbrod@cogeco.ca) Date: Fri Nov 14 11:09:47 2003 Subject: [Spambayes] .db files Message-ID: <99610-22003115141693811@M2W049.mail2web.com> Thanks very much for the input and the warnings. I realize I've been somewhat naive in my contemplation over the implementation of SpamBayes. Particularly didn't realize there is so much incompatibility with email systems. Can you give me an example of some incompatibilities that you have encountered? Thanks very much. Original Message: ----------------- From: Tim Peters tim.one@comcast.net Date: Fri, 14 Nov 2003 02:18:27 -0500 To: cweisbrod@cogeco.ca, spambayes@python.org Subject: RE: [Spambayes] .db files [cweisbrod@cogeco.ca] > I've recently started using SpamBayes and have had great success in > finally avoiding unsolicited email. The only problem I have with > SpamBayes is the significant overhead required to get the code > running. Installing the Python binaries first and then having a > process that consumes 10-15 Mb of runtime memory is less than optimal. Are you running on a TRS-80 ? > I am a software engineer with 9+ years of experience and I fully > understand the need for a project like this to be written in a > platform-independant manner. However, I would really like to write a > scaled-down version of this for the Windows platform. I'm quite > certain that I can write a self-installing/uninstalling service that > implements a POP/SMTP proxy, performs the statistical processing, and > exposes user interfaces for training and configuring. I have a > mathematics and physics degree, so the statistics stuff shouldn't be > too difficult to implement. The entire implementation would be > contained in a single executable file much less than 1 Mb in size (I > don't use MFC). Neither do we, at least not in this project. > Of course this executable would have to create its own config files, > spambayes.messageinfo.db, hammie.db, etc. That's where I need some > information. Others have addressed that, and it's complicated. I'll warn you in advance that the memory and processing power don't go into the statistics, though: the memory mainly goes into caching a potentially huge number of token statistics records in memory to make it bearably fast. If you dig up each token from an on-disk database each time you see it, it probably won't run fast enough for high-volume users to tolerate. You might overcome that by writing your own database implementation, specialized to the particular data formats this thing needs to access, one not burdened (as is the Sleepycat DB) by needing to pay to be all things to all people. You should also know that the actual scoring is a tiny part of the code, and quite straightforward. Parsing email, tokenization, dealing with a gazillion incompatible email systems, and putting up a user-friendly UI are each harder than the mathematical part, and the code for each dwarfs the total code for all the computational parts. BTW, despite the use of a memory cache on top of the Sleepycat DB, the classifier runs several times faster still if I eliminate Sleepycat entirely and run the entire database in memory (as one large Python dictionary). For a large (but not unreasonably large) token database, that can easily suck up 50MB of RAM just for the dict. For perspective, I happened to be window-shopping for a new home computer this week, and even entry-level systems come with 128MB of RAM these days; I'll probably get at least 4x that much. 50MB is about 10% of that -- it won't even be worth thinking about idly for me anymore. Not that there's anything wrong with lean & mean. It just went out of style around the time you started your career . -------------------------------------------------------------------- mail2web - Check your email from the web at http://mail2web.com/ . From seant at webreply.com Fri Nov 14 10:40:20 2003 From: seant at webreply.com (Sean True) Date: Fri Nov 14 12:06:37 2003 Subject: [Spambayes] RE: [spambayes-dev] Code locations in Spambayes Outlook plugin Message-ID: <003501c3aac5$9c9c6400$0201a8c0@swapwizard.com> > > (Why I'm doing this: the headers aren't accessible in Outlook except > > via View|Options, or programmatically. I want an Outlook form that > > automatically displays the headers, but doing it in Visual Basic > > Script is problematic because of Outlook's security policies. > > Using MAPI directly appears to sidestep most Outlook whining. > At least so > far. There are also commercial (if inexpensive) ways to use the Outlook object model without the security policy whines: http://www.dimastr.com/redemption/ which is free for single machine evaluation, and inexpensive for distribution. If I were rewriting the OL plugin from scratch, I'd consider using this. It allows you to create shadow objects which have OL object model properties and methods, and then _it_ does all the nasty MAPI work to make the security issues go away. -- Sean ============== Sean True Chief Technologist InBoxer: Intelligent spam filter for Outlook 2000+ Download from www.inboxer.com If you like it, tell your friends -- if you don't, tell me. From tim.one at comcast.net Fri Nov 14 12:09:42 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Nov 14 12:09:45 2003 Subject: [Spambayes] .db files In-Reply-To: <99610-22003115141693811@M2W049.mail2web.com> Message-ID: [cweisbrod@cogeco.ca] > Thanks very much for the input and the warnings. I realize I've been > somewhat naive in my contemplation over the implementation of > SpamBayes. Particularly didn't realize there is so much > incompatibility with email systems. Can you give me an example of > some incompatibilities that you have encountered? Well, I'm not clear on what you really want to accomplish. You said I would really like to write a scaled-down version of this for the Windows platform The spambayes Outlook addin is overwhelmingly the most popular way to use spambayes on Windows, so if you want to target that then you have to reimplement all the code we use to interface with Outlook directly (which is a much larger pile of code than the classifier proper -- and much harder to get right (the MS APIs you need to work with are massive and delicate)). OTOH, if you don't want to replace the Outlook addin, few current spambayes-on-Windows users will care (the Outlook addin is much easier to live with than any web-based interface, because the former integrates directly with normal user Outlook workflow -- it's just another set of buttons on the Outlook toolbar, and, for the most part, works "all by magic" after initial setup and training). If you want to integrate with non-Outlook clients on Windows, they each have their own ways of dealing with email, and there are several different formats (some open, some proprietary) in use for storing email. If you just want to deal with proxies, then you've got POP3 and a large variety of partially-broken IMAP servers to wrestle with. If you want the same functionality in your classifier, you're going to have to write code to decode email in its full current generality, including character set conversions, recursive MIME structure, and translating assorted encodings (spambayes routlnely decodes quoted-printable and base64, but the latter only when the MIME type of a section is text/*; we also decode numeric character entities in HTML). Python comes with libraries for doing all of that stuff, written by experts in the various fields, and debugged by thousands of users in real life over a span of years. This is harder to do than you'd first believe, because while standards exist covering these areas, writing software that adheres to those standards is useless in practice: too many programs that generate and send email violate too many of the specified rules, so software trying to make sense of email has to be extremely forgiving. You can't know how in advance, though -- you find out what you need to forgive by seeing real-life email break your code, iterating until new email stops breaking it. That's been going on for years in the Python libraries, and is still in progress (it's a kind of thing that can never end, as new bugs in email producers and servers keep appearing, and some of those become popular for non-technical reasons). It's possible to write a different kind of tokenizer, one that takes the incoming email bytes at face value, and doesn't try to do any semantic analysis. That kind of tokenizer won't break no matter how wildly a piece of email may violate "the rules" (because you're ignoring all the rules then too). Implementation shortcuts like that could save you tens of thousands of lines of code and months of effort -- but then it gets increasingly distant from what the current spambayes code does. From skip at pobox.com Fri Nov 14 12:18:43 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Nov 14 12:19:10 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: References: Message-ID: <16309.3699.302897.207278@montanaro.dyndns.org> Rob> I am having the same problem. I have 311 ham and 1093 spam in my Rob> training database yet SB only catches about 50% of the incoming Rob> spam. I would try retraining from scratch. I'm getting very good results with just over 100 ham and spam at the moment. I'm beginning to believe it's not even necessary to train on all unsures or mistakes. My mail gets transferred from my server and scored in bunches every five minutes (24x7) and I get a lot of mail, so I may come in in the morning and find a dozen unsures in my mailbox (as well as a few hundred properly classified spams). I try training on one or two unsure messages, then recheck the remaining unsures, eliminating any which now score as ham or spam. (See below for how I do that.) I've developed a few seat-of-the-pants training maxims, both from personal experience and from reading what others have done: * Don't be afraid to retrain from scratch. The system learns quickly. Retraining from scratch is often the quickest way to recover from training mistakes. * Bigger is not always better, no matter what all those enlargement messages would have you believe. A larger database is harder to examine for mistakes, and a few mistakes skewed in the same directionn may be hard to overcome with correct training. You'll also reach a point where you want to just delete all that spam. Once you do that, you've completely lost the ability to find mistakes. If you only have a few messages in your training database things will be easier to manage. * Never train on the same message twice. Using iterative reasoning it's easy see you should never train on the same 100 or 1000 times either. ;-) * Seek balance in your training database. Similar numbers of ham and spam are good. * Don't automatically train on all incoming messages. If you get swamped with spam, you will quickly wind up with a training database which is wildly out-of-balance. * Don't worry about training on every unsure message either. Some messages just aren't amenable to a strict classification. For example, a bounce message from a mail server containing an attached spam may be best left untrained. It contains both strong ham clues (all the postmaster gibberish which you would get in a bounce of an otherwise valid message) and strong spam clues (the spam message itself). Calling that message as ham or spam is likely to worsen the classification of future mail bounces or future similar spam. My environment is much different than yours, so I don't know how you'd get the Outlook plugin to score messages again, but if it can do that, a little judicious checking will probably avoid the need to over-train. For example, if there are several unsure messages related to online prescriptions, training on just one of them as spam may be sufficient to cause the rest to now score as spam. For those on Unix-y systems (I use Mac OS X) with access to the CVS repository, here's what I run to check my unsures: sb_filter.py ~/Mail/unsure | python ~/tmp/scan-unsures.py Where scan-unsures.py is #!/usr/bin/env python import sys, re sub = msgid = cls = "" for line in sys.stdin: if line.startswith("From "): sub = msgid = cls = "" elif line.lower().startswith("subject: "): sub = line.strip() elif line.lower().startswith("message-id: "): msgid = line.strip() elif line.lower().startswith("x-spambayes-classification: "): cls = line.strip() if re.search("unsure", cls) is not None: print sub print msgid print cls sub = msgid = cls = "" You need the latest version of sb_filter.py which I checked in a couple days ago. Skip From skip at pobox.com Fri Nov 14 12:26:39 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Nov 14 12:26:53 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: <3FB45E04.4060902@rebirthing.co.nz> References: <3FB45E04.4060902@rebirthing.co.nz> Message-ID: <16309.4175.734332.770721@montanaro.dyndns.org> Rob> I am having the same problem. I have 311 ham and 1093 spam in my Rob> training database yet SB only catches about 50% of the incoming Rob> spam. David> Same with me. David> I knew it was only a matter of time before the spammers started David> installing/training some bayesian filters, and learning how to David> compose their messages to distort the histogram and pass as David> 'ham'. I suspect factors other than "intelligent spammers" (is that an oxymoron?) are at work. See my earlier response to Rob's message. I don't think they are doing all that good a job of "manipulating the histograms". They can't see inside my database. If they could, they'd probably put a lot of words in there which are hammy to me. Since they don't, they're just pissing in the wind. As Tim has pointed out many times, at some point the spammers have to make a sales pitch. If they want to sell you cars, they either have to mention that in their message or refer you to a car sales website. To grab your initial attention, they have to make the subject more-or-less readable. Somewhere in there (the text of the subject, the text of the message or URLs contained in the message) they have to show their hand. SpamBayes classifies all that stuff. We may need to unravel their encoding tricks more than we do today (I have an experimental latin-1-to-ascii codec to strip accents and have also experimented with deleting all punctuation from tokens, for example), but that's not terribly hard to do. Skip From mdhpub at blueyonder.co.uk Fri Nov 14 12:31:43 2003 From: mdhpub at blueyonder.co.uk (Mathew Hendry) Date: Fri Nov 14 12:31:48 2003 Subject: [Spambayes] How to tell SpamBayes to check more headers (using MSO plugin) Message-ID: Is there a way, without hacking the source, to tell SpamBayes to examine more of the headers in mails? In particular these ones, added by SpamAssassin and SpamCop Mail: X-Spam-Level: ********* X-Spam-Status: hits=9.7 tests=ADDR_NUMS_AT_BIGSITE,BIZ_TLD, DATE_IN_FUTURE_12_24,FORGED_HOTMAIL_RCVD,FORGED_MUA_OUTLOOK, MSGID_FROM_MTA_HOTMAIL,USERPASS version=2.60 X-SpamCop-Disposition: Blocked bl.spamcop.net I had a dig around on spambayes.org and couldn't find anything. I'm using the Outlook plugin if that makes a difference. Cheers, -- Mat. From mikes at members-insurance.com Fri Nov 14 13:03:51 2003 From: mikes at members-insurance.com (Michael K. Schummers) Date: Fri Nov 14 13:04:14 2003 Subject: [Spambayes] Training/Scoring in SpamBayes Message-ID: <002b01c3aad9$a9c41150$d60000c8@PRK94> Clearly there is something in ongoing Training of SpamBayes that I do not understand. I am running 0.81 Outlook plugin on Win XP Pro with Outlook 2000. A while back (weeks, not days), I retrained to get a more balanced training of ham vs. spam. As time has progressed, I now have a five to one ratio of ham vs. spam in the statistics reported by SpamBayes manager (3653 ham vs. 639 spam). My understanding from the forum comments is that messages will NOT be trained in the "natural" assignment process of scoring and moving to Unsure, Spam or leaving in the Inbox. New messages WILL be trained if moved from Unsure to Inbox or Spam. They will also be trained if moved from Inbox to Spam or Spam to Inbox (clearly this will also occur if "Delete as Spam" or "Recover from Spam" buttons are used in the respective folders. Is there some other training that will occur (without the manual steps of invoking from SpamBayes Manager) automatically?? It doesn't appear to me (but I have not kept detailed statistics) that I am manually classifying significantly more "ham" messages than "spam" messages?and this propensity to swing the training ration strongly in the "ham" direction has me a bit confused (the effective screening of "spam" tends to get worse gradually as the ratio starts to get out of balance). Am I missing some element of the process?? ? Michael K. Schummers General Manager ? ? ? ? ? (859) 455-7317 HYPERLINK "mailto:mikes@members-insurance.com"mikes@members-insurance.com --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.538 / Virus Database: 333 - Release Date: 11/10/2003 -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 6113 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031114/6dfb0d97/attachment-0001.gif -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 7559 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031114/6dfb0d97/attachment-0001.jpe From tim.one at comcast.net Fri Nov 14 13:29:40 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Nov 14 13:29:42 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: <200311141400.hAEE0pP07342@localhost.localdomain> Message-ID: [Bill Yerazunis, on challenge/response gimmicks] > ... > Except that there's no path back to the spammer for the challenge; > they do their best to cover their tracks as their IP address is the > one thing that can cost them their upstream connectivity. > > thus, C/R systems _will_ stop them. > > The problem is that C/R will annoy the rest of us. I confess I won't respond to a C/R challenge anymore. The last several times I tried to play along, the web-based response systems took too damn long to use, and the email-based response systems bounced my reply with another challenge! The latter was almost certainly a bug in the specific C/R system(s) used, but it's all the same to me: I gave all the time I felt I could afford to give when I made the original reply, and any gimmick that asks me to pour additional time into it is just too onerous to bother with. But I'm a computer geek, and understand what C/R is trying to accomplish. My sisters aren't computer geeks, and they won't respond to C/R challenges either, for quite different reasons: they never reply to unsolicited email (in part because I advised them not to), and don't have the specialized knowledge needed to distinguish a geeky C/R challenge from random address-harvester spam. "When in doubt, delete unopened" is basic email self-defense (esp. for Windows users, which almost everyone's sisters are ), and a geek-generated challenge is certainly something a non-geek has reason to doubt. That said, a C/R system is great for people who don't want to get any email at all <0.7 wink>. From sethg.delete at GoodmanAssociates.com Fri Nov 14 14:09:13 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Fri Nov 14 14:09:37 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: [Kenny Pitt] > In K9, the limits you're talking about only control how many complete > messages of each type are stored in cache for future *re-training*. > They do not affect the contents of the actual training database. K9 > does not currently do any aging of the training data, although I believe > it has been discussed in that context as well. Today I just saw the following entry in the K9 configuration instructions: "In addition to automatically cleaning up the Recent Emails list, you can choose to clean out the Good and Spam folders of old emails when they reach a certain size." You can see the K9 configuration instructions at http://keir.net/k9_configuration.html It looks like K9 *does* age out old messages once the message counts for spam or ham are reached and a new message is available for training. It appears that they train on everything, and when the maximum number of messages in the corpus is reached, they make room for the new message by deleting and untraining the oldest one. This appears to be another reason why K9 stores the two complete training corpuses. This avoids having to timestamp every token, which is a big saving. The advantages I see to this scheme are: 1) It allows you to set a predetermined desired number of messages for your spam and ham training sets. It appears from looking at the archives that there were some "magic ratios" of spam/ham, or sweet spots, where SpamBayes performed better for reasons that no one will probably ever understand. 2) Depending on the nature of your mail traffic, it will prevent the spam/ham ratio from becoming pathologically skewed without any overt user action. 3) It allows you to keep the tokens current with you email stream. The disadvantages I see to this scheme are: 1) If you have infrequent correspondents who send messages very atypical of your other ham, you would have to set the maximum number of ham messages very high. The discussion about the effects of large training sets aside, this would mean a very large ham corpus file and a bigger than normal token database. 2) If you have a highly asymmetrical spam/ham mail stream, keeping the spam and ham training sets to a fixed size will prevent the training sets from being contemporaneous. I have no idea if this is actually a problem. 3) The ham message corpus file, in particular, could get very large depending on whether SpamBayes does anything with file attachments. If SpamBayes does not need the complete file attachment to untrain a message, this would not be an issue. Whether or not any of this is worthwhile for SpamBayes hinges on questions that I pose to those who have done experiments: 1) Is there any advantage in keeping the total training set size and spam/ham ratio of the training set fixed? 2) Is it better to train on mistakes (the current system) or train on everything (except list traffic that Outlook rules get out of the way before SpamBayes sees new messages)? --------------------------------- Why am I asking these questions? [Tim Peters] >>> I suggest you wait until you have a real problem before trying to >>> solve it. > >What *bothers* you about SpamBayes? What doesn't work right, or what was >too hard to figure out, or what's still too confusing? What's missing? Well, what bothers me, so far, is that despite training on 620 ham and 1403 spam, SpamBayes still manages to miss (score as ham) 5-10 messages per day out of around 150 scored messages. Most of these missed spams have an initial score very close to zero, so simply lowering the ham threshold would not fix it. After training as spam, their spam score often increases respectably, but sometimes, the score stays below 5%. This indicates that the same message would be missed next time, as well. I don't know if I just need to get a bigger or more balanced training set, if there are some types of tokens (such as embedded URL's in HTML spam) that are not currently parsed or if this is just as good as it gets. Anyway, that's what I would like to see improved and it is the motivation for the above discussion. Any thoughts from those who've been there already? -- Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above From sethg.delete at GoodmanAssociates.com Fri Nov 14 14:29:06 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Fri Nov 14 14:29:10 2003 Subject: [Spambayes] Training/Scoring in SpamBayes In-Reply-To: <002b01c3aad9$a9c41150$d60000c8@PRK94> Message-ID: [Michael K. Schummers] > moving to Unsure, Spam or leaving in the Inbox. New messages WILL > be trained > if moved from Unsure to Inbox or Spam. They will also be trained if moved > from Inbox to Spam or Spam to Inbox (clearly this will also occur > if "Delete > as Spam" or "Recover from Spam" buttons are used in the > respective folders. Is this true? If the incremental training checkboxes (Outlook plug-in version 0.81) are checked, does SpamBayes then trigger on the folder move it initiated when it moved a spam to the spam folder and thus train on all definite spam? I currently have those boxes unchecked, as I frequently make mistakes when I drag-and-drop messages, so I prefer to use the toolbar button to do all retraining (much harder to screw up). -- Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above From kennypitt at hotmail.com Fri Nov 14 15:06:01 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Nov 14 15:06:28 2003 Subject: [Spambayes] Training/Scoring in SpamBayes In-Reply-To: Message-ID: Seth Goodman wrote: > [Michael K. Schummers] >> moving to Unsure, Spam or leaving in the Inbox. New messages WILL >> be trained >> if moved from Unsure to Inbox or Spam. They will also be trained if >> moved from Inbox to Spam or Spam to Inbox (clearly this will also >> occur >> if "Delete >> as Spam" or "Recover from Spam" buttons are used in the >> respective folders. > > Is this true? If the incremental training checkboxes (Outlook plug-in > version 0.81) are checked, does SpamBayes then trigger on the folder > move it initiated when it moved a spam to the spam folder and thus > train on all definite spam? SpamBayes will only train on a message moved to the spam folder if it's initial classification score was below the spam threshold. -- Kenny Pitt From kennypitt at hotmail.com Fri Nov 14 15:19:35 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Nov 14 15:20:37 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: <16309.3699.302897.207278@montanaro.dyndns.org> Message-ID: Skip Montanaro wrote: > I've developed a few seat-of-the-pants training maxims, both from > personal experience and from reading what others have done: > > * Bigger is not always better, no matter what all those > enlargement messages would have you believe. That's a very good point, and there's a lot to be said for training on the minimum number of messages that you need to produce reasonable results. Because all scores are based on ratios, every additional message that you train on dilutes the effect of the prior tokens in that corpus that don't appear in the new message. For example, if I start with 50 trained hams and have a token that has been seen 10 times, it contributes a ham probability of 0.2 (10/50) to the scoring. If I later train on 50 more hams that don't contain that token, it's ham probability drops to 0.1 (10/100). So training on more ham can actually cause you to miss good messages that you were previously classifying correctly. -- Kenny Pitt From kennypitt at hotmail.com Fri Nov 14 15:48:51 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Fri Nov 14 17:52:16 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: Seth Goodman wrote: > [Kenny Pitt] >> In K9, the limits you're talking about only control how many complete >> messages of each type are stored in cache for future *re-training*. >> They do not affect the contents of the actual training database. K9 >> does not currently do any aging of the training data, although I >> believe it has been discussed in that context as well. > > Today I just saw the following entry in the K9 configuration > instructions: > > "In addition to automatically cleaning up the Recent Emails list, you > can choose to clean out the Good and Spam folders of old emails when > they reach a certain size." Yes, but that still only affects the token databases if you *rebuild* them. > It looks like K9 *does* age out old messages once the message counts > for spam or ham are reached and a new message is available for > training... Yes, it does age out old messages from the cache. > ... It appears that they train on everything, ... Yes, it does train on everything you receive (which may or may not be a good thing, opinions differ). In recent versions, it also has an "Only add reclassified emails" option in the Advanced config that trains more like the SpamBayes Outlook addin. If this option is on, tokens won't get added to the training databases unless you reclassify the message, but the actual message will still be saved in the message cache. > ... and when the > maximum number of messages in the corpus is reached, they make room > for the new message by deleting and untraining the oldest one... It deletes the oldest message from the corpus, but does not untrain it. K9 does not store the original scoring of the message, and it allows you to re-organize at any time to place messages in the correct corpus according to the current scoring. There is no guarantee that the corpus a message is currently stored in matches how it was originally trained. You can also delete the entire contents of the cache at any time without having any effect on the scoring, until you rebuild the training databases. > ... This > appears to be another reason why K9 stores the two complete training > corpuses. This avoids having to timestamp every token, which is a > big saving. The only reason K9 keeps the actual messages is so that you can rebuild the training databases at any time. Understanding the significance of this requires that you go back to the early days of K9 when it used a slightly different method of training. Originally, K9 did not have the ability to "untrain" a message. Every message was added to either the spam or good word database as soon as it was received and scored. If the user reclassified the message, it was repeatedly added to the reclassified database until the new score for the message surpassed a certain threshold (which I believe was somewhere around >80% for spam and <20% for good, give or take). Obviously, this could result in some strange training data over time, so an occasional rebuild of the databases was useful to make sure that each message was trained in only one of the spam or good databases. -- Kenny Pitt From rmalayter at bai.org Fri Nov 14 18:23:54 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Fri Nov 14 18:23:58 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C3945@cliff.bai.org> > From: Kenny Pitt > Because all scores are based on ratios, every additional message that > you train on dilutes the effect of the prior tokens in that > corpus that > don't appear in the new message. For example, if I start with 50 > trained hams and have a token that has been seen 10 times, it > contributes a ham probability of 0.2 (10/50) to the scoring. > If I later > train on 50 more hams that don't contain that token, it's ham > probability drops to 0.1 (10/100). > So training on more ham can actually cause you to miss good messages > that you were previously classifying correctly. I don't think your examples are correct, mathematically. Your 0.2 and 0.1 don't take into account how often the token is seen in the other corpus. The actual formula used by spambayes for the probabiltiy of a given token is more complex, and requires looking at the size of both corpuses and the number of occurances in each. See http://www.paulgraham.com/spam.html for a good explanation of the general method used. So, in your example, if the token never occurred in a spam, your single-token ham probabilites would actualy be something more like 0.99 and 0.99 instead of 0.2 and 0.1. The way the probabilities are actually computed, the more data you have, the more accurate your probabilities get, and the better the filter will perform. Up to a point, of course... there will always be diminsighing returns. There's not much difference in practical terms between 99.7% accuracy and 99.8% accuracy. Regards, Ryan From sadkin at wood-carvings.com Fri Nov 14 18:54:16 2003 From: sadkin at wood-carvings.com (Michael Sadkin) Date: Fri Nov 14 18:54:19 2003 Subject: [Spambayes] filtering question Message-ID: When I start Outlook for the 1st time, the incoming mail is not always filtered. If I shut down Outlook and reopen it, the mail is then filtered. Why does this happen" Thank you, Michael Sadkin www.wood-carvings.com msadkin@twcny.rr.com From sethg.delete at GoodmanAssociates.com Fri Nov 14 19:28:34 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Fri Nov 14 19:28:36 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: Kenny, Thanks for the corrections. I obviously have a lot to learn about the details of the various Bayesian classifiers. Do you have any comments on the stuff at the bottom of my previous post (reproduced below)? This is what motivated the discussion of K9 training in the first place and I would value any insight or suggestions that anyone could offer. > --------------------------------- > > Why am I asking these questions? > > [Tim Peters] > >>> I suggest you wait until you have a real problem before trying to > >>> solve it. > > > >What *bothers* you about SpamBayes? What doesn't work right, or what was > >too hard to figure out, or what's still too confusing? What's missing? > > Well, what bothers me, so far, is that despite training on 620 > ham and 1403 > spam, SpamBayes still manages to miss (score as ham) 5-10 messages per day > out of around 150 scored messages. Most of these missed spams have an > initial score very close to zero, so simply lowering the ham > threshold would > not fix it. After training as spam, their spam score often increases > respectably, but sometimes, the score stays below 5%. This indicates that > the same message would be missed next time, as well. I don't > know if I just > need to get a bigger or more balanced training set, if there are > some types > of tokens (such as embedded URL's in HTML spam) that are not currently > parsed or if this is just as good as it gets. Anyway, that's what I would > like to see improved and it is the motivation for the above discussion. > > Any thoughts from those who've been there already? -- Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above From listas at loquecreas.com Fri Nov 14 20:02:00 2003 From: listas at loquecreas.com (Pablo Vieira) Date: Fri Nov 14 20:02:34 2003 Subject: [Spambayes] RV: I18N and L10N Message-ID: All right, third try, then I'll quit :-) Since you guys state very clearly that no one should email the developers directly I'm putting this here. I just want to ask if there's any interest at all in having Spambayes available in Spanish or not. Regards, Pablo _____ De: Pablo Vieira [mailto:listas@loquecreas.com] Enviado el: jueves, 13 de noviembre de 2003 21:55 Para: 'spambayes@python.org' Asunto: I18N and L10N Hi again, I want to insist on the internationalisation and localisation of Spambayes. Do you guys have any plans about this? Any easy way to translate the software (such as a text file containing entries in the menus, dialog boxes, help files, etc.?). I'm mostly interested in the Outlook version, but will happily collaborate on the other variants as well. My mother tongue is Spanish. Regards, Pablo From tim.one at comcast.net Fri Nov 14 20:11:51 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Nov 14 20:11:58 2003 Subject: [Spambayes] filtering question In-Reply-To: Message-ID: [Michael Sadkin] > When I start Outlook for the 1st time, the incoming mail is not always > filtered. If I shut down Outlook and reopen it, the mail is then > filtered. Why does this happen" The mail is filtered when you reopen Outlook because the first thing SpamBayes does is look for messages in your watched folders that haven't been processed yet. I don't think we'll ever know why SpamBayes can miss some messages in a batch of incoming mail. SpamBayes relies on Outlook to tell it about new messages, and sometimes Outlook simply doesn't. Only the people who wrote Outlook might be able to explain why not. From experience, it appears that Outlook sometimes skips telling SpamBayes about new messages when the load on the system is high. When you first open Outlook and a whole bunch of new messages come in at once-- and especially if you have a high-speed connection, a slow machine, too little RAM, and/or are running other programs simultaneously --Outlook is seemingly too busy retrieving them to bother talking to addins about each new message it gets. Something that should help is enabling the addin's "background" processing, via SpamBayes -> SpamBayes Manager ... -> Advanced -> Enable background filtering. That makes SpamBayes "sleep" a lot, thus reducing the peak load on an overburdened system, and letting Outlook hog more of the machine's resources when Outlook is struggling to keep up with the incoming load. From stevensb at telus.net Sat Nov 15 00:01:35 2003 From: stevensb at telus.net (BCS) Date: Sat Nov 15 00:01:40 2003 Subject: [Spambayes] DB_RUNRECOVERY Message-ID: <003101c3ab35$8c2e3240$60b0b8a1@brianathlon12> I can't find any recovery program, where is it and what is it called? I am running Win XP Pro, 640MB RAM, 6.7GB free on Windows drive, trained 1656 spam and 353 Ham. 500 Server error Traceback (most recent call last): File "C:\PYTHON23\Lib\site-packages\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "C:\PYTHON23\Lib\site-packages\spambayes\ProxyUI.py", line 310, in onReview targetCorpus.takeMessage(id, sourceCorpus) File "C:\PYTHON23\Lib\site-packages\spambayes\Corpus.py", line 201, in takeMessage self.addMessage(msg) File "C:\PYTHON23\Lib\site-packages\spambayes\FileCorpus.py", line 143, in addMessage Corpus.Corpus.addMessage(self, message) File "C:\PYTHON23\Lib\site-packages\spambayes\Corpus.py", line 136, in addMessage obs.onAddMessage(message) File "C:\PYTHON23\Lib\site-packages\spambayes\storage.py", line 580, in onAddMessage self.train(message) File "C:\PYTHON23\Lib\site-packages\spambayes\storage.py", line 588, in train self.bayes.learn(message.tokenize(), self.is_spam) File "C:\PYTHON23\Lib\site-packages\spambayes\classifier.py", line 211, in learn self._add_msg(wordstream, is_spam) File "C:\PYTHON23\Lib\site-packages\spambayes\classifier.py", line 346, in _add_msg self._wordinfoset(word, record) File "C:\PYTHON23\Lib\site-packages\spambayes\storage.py", line 279, in _wordinfoset self.db[word] = record.__getstate__() File "C:\PYTHON23\lib\shelve.py", line 130, in __setitem__ self.dict[key] = f.getvalue() File "C:\PYTHON23\lib\bsddb\__init__.py", line 120, in __setitem__ self.db[key] = value DBRunRecoveryError: (-30982, 'DB_RUNRECOVERY: Fatal error, run database recovery -- fatal region error detected; run recovery') From gerrit at nl.linux.org Sat Nov 15 09:16:52 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Sat Nov 15 09:17:11 2003 Subject: [Spambayes] tactics Message-ID: <20031115141652.GA8955@nl.linux.org> Hi, currently, I only train email classified as unsure. Is this a wise tactic? Most unsure-mail I get is ham, so the ham/spam ratio is now 2 already, and grawing. Does it help to train on spam messages already classified as such with 100% certainty? yours, Gerrit. -- 188. If an artizan has undertaken to rear a child and teaches him his craft, he can not be demanded back. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ From tmj at juntunen.us Sat Nov 15 10:10:20 2003 From: tmj at juntunen.us (Thomas Juntunen) Date: Sat Nov 15 10:10:21 2003 Subject: [Spambayes] RV: I18N and L10N In-Reply-To: Message-ID: On 11/14/2003, spambayes-request@python.org imposed order on a stream of electrons to say: >All right, third try, then I'll quit :-) > >Since you guys state very clearly that no one should email the developers >directly I'm putting this here. I just want to ask if there's any interest >at all in having Spambayes available in Spanish or not. I'm not one of the developers, but are you aware that SpamBayes is an open source project with all the source code and docs freely available for downloading? If you have a working familiarity with Python, no one will stop you from making all the necessary changes to the presentation logic. As I understand the SB mechanism, it shouldn't matter much what language the content is in; your training choices will build an appropriate training db. Once you have a Spanish-version complete, I'm sure the developers woould be glad to find room for it on the web site. HTH, Thomas Juntunen From richie at entrian.com Sat Nov 15 11:25:22 2003 From: richie at entrian.com (Richie Hindle) Date: Sat Nov 15 11:25:46 2003 Subject: [Spambayes] DB_RUNRECOVERY In-Reply-To: <003101c3ab35$8c2e3240$60b0b8a1@brianathlon12> References: <003101c3ab35$8c2e3240$60b0b8a1@brianathlon12> Message-ID: [Steven] > I can't find any recovery program, where is it and what is it called? We don't beleive the recovery program (which is a part of Sleepcat BD, www.sleepycat.com) works with our databases. I'm afraid this is a problem that we haven't yet been able to get to the bottom of. -- Richie Hindle richie@entrian.com From richie at entrian.com Sat Nov 15 11:57:27 2003 From: richie at entrian.com (Richie Hindle) Date: Sat Nov 15 11:57:51 2003 Subject: [Spambayes] RV: I18N and L10N In-Reply-To: References: Message-ID: Pablo, [Moving this thread to spambayes-dev@python.org] > All right, third try, then I'll quit :-) Our apologies - we don't mean to ignore people who offer to help - far from it! > I just want to ask if there's any interest > at all in having Spambayes available in Spanish or not. We'd love to have international versions, though there are a lot of issues involved. I don't mean to put you off the idea, or to imply that we're not prepared to put effort into this, but these things need taking into account... Many (most?) of the English strings in SpamBayes are mixed in with the code. Taking the source code as it is an translating the strings into Spanish would be unmaintainable - we'd have two entirely separate versions of the code, and any edits would have to be applied to both. So the first job to do would be to pull out all those hard-coded strings into a language file. That's not a huge job, and one that any computer-literate person could probably do 95% of, even if they weren't a programmer. Still more effort than simply translating a collection of English phrases into Spanish, though. More English text appears in HTML pages. Some of these are mostly text, like the Outlook help pages, and maintaining two versions would not be too bad (though any stylistic changes might have to be applied twice). Some, however, are quite strictly defined in ways that makes them machine-readable - the web interface (as used by the POP3 proxy and IMAP filter) defines its user interface in little pieces of HTML that are joined together by the program at runtime. Translating that stuff would need more technical knowledge, and probably a significant amount of re-engineering to make it maintainable. A lot of Outlook interface (at least the dialogs) are defined in Windows resource files, which require Visual Studio to edit them (or there may be third party programs to it - any free ones that people know of?). You can edit them by hand but it's a huge pain. They also contain a lot of information that's not just language strings, meaning there's a lot of duplication between the different language versions that causes maintenance headaches. I'd be interested to hear from people out there in the world with solutions to this problem! Lastly a social issue. You could become the support department for hundred of Spanish SpamBayes users! So. Do you still want to do this? 8-) And are there SpamBayes developers (or other Python-literate SpamBayes users - Pablo, you don't say whether you're a Python programmer?) who have the time to make the necessary software changes for this to work? There may be other, cheaper, alternatives to doing the work that I haven't considered (for instance, translating all the strings in place and then maintaining the edits as a set of patches that get applied to each release - are there Open Source projects that work that way?) I may be painting an unnecessarily bleak picture here, because I'm no expert on i18n. I'd love for someone to come in and say "No, you've got it all wrong, it's easy, look!" Anyone? -- Richie Hindle richie@entrian.com From Steve at HaHaHa.com.au Sat Nov 15 19:19:19 2003 From: Steve at HaHaHa.com.au (Steve Davis) Date: Sat Nov 15 19:19:37 2003 Subject: [Spambayes] Idea for spambayes Message-ID: <002f01c3abd7$476278e0$0100a8c0@office> After using spambayes for a few months now I am convinced it is the best thing since slice bread with spam. I still get the ocassional false positive and check my spam folder every few days. I am now satisfied that anything over 90% is either spam or not something I am interested in reading as it will look like spam. How about an option that at a certain score (which can be chosen) , an email will be auto deleted. Thus I would have three settings < 50% = ham -> inbox 50% - 90% = possibly spam -> spam box > 90% = definitely spam, just delete it. All the best Steve Davis Too much time on my hands? - www.hahaha.com.au/weblog ######################################################################## ############### This email is copyright, trade marked and written in a secret code that is uncrackable by ASIO, the CIA or anyone over 7 years of age. I am tired of dealing with virusii so I use virus scanning software (PC-Cillin), a firewall (Zone Alarm) and a condom (Ansell) to protect me. My computer has no protection at all. If you have received the "Snow White" virus from hahaha@sexyfun.com and want to complain to me at hahaha.com.au, don't make yourself look silly by not knowing the difference between a username and a domain name like dozens who preceed you. ######################################################################## ############### From jmcatee at mediaodyssey.com Sat Nov 15 19:50:34 2003 From: jmcatee at mediaodyssey.com (Jim McAtee) Date: Sat Nov 15 19:50:35 2003 Subject: [Spambayes] OE Question Message-ID: <045801c3abdb$a4a57440$1c03a8c0@jim> I just set up SpamBayes on my PC running Win2k Professional. My email client is Outlook Express, so I'm running the POP3 proxy as a service. So far, it's working well. I'm trying to set up additional OE email accounts, both on the same POP server and on different servers. But I'm not sure how (or if) you can set up OE to use an alternative port for POP3. Any ideas? Thanks, Jim From jmcatee at mediaodyssey.com Sat Nov 15 21:14:05 2003 From: jmcatee at mediaodyssey.com (Jim McAtee) Date: Sat Nov 15 21:14:08 2003 Subject: [Spambayes] Training with cache on the POP proxy Message-ID: <069301c3abe7$500f2230$1c03a8c0@jim> Regarding training, I'm wondering what is the recommended method of using cached messages. I've run about 300 spam and 1100 ham messages through the interface, mostly by having the server process existing Outlook Express dbx folder files, but also using the web interface. Now when I go to the review page, probably 95% or more of the messages are shown as either 'ham' or 'spam', with the remaining 'unsure'. Is it recommended to train on either the ham or spam messages for which SpamBayes has already made a correct decision? Will this reinforce the decision making, make it less accurate, or have no effect? From tim.one at comcast.net Sat Nov 15 23:49:34 2003 From: tim.one at comcast.net (Tim Peters) Date: Sat Nov 15 23:49:42 2003 Subject: [Spambayes] RV: I18N and L10N In-Reply-To: Message-ID: [Pablo Vieira] > ... > Since you guys state very clearly that no one should email the > developers directly I'm putting this here. I just want to ask if > there's any interest at all in having Spambayes available in Spanish > or not. I expect that, like most of the rest of the developers, I think that would be great, but don't have relevant experience, or time, to give to it. If I were you , I'd announce my *intent* on the newsgroup comp.lang.python, to attract the interest of Python programmers with real-life I/L*N experience. There are people who know pretty much exactly what to do, but they don't hang out on this list, and this has much more to do with using Python's relevant features (like Unicode) correctly than with SpamBayes specifically. You might have luck asking on a Zope mailing list too (Zope is a popular web content management system coded in Python, with users all over the world, and within the last couple years has benefited by many peoples' intense help with I/L*N). A problem I know came up repeatedly in the Zope experience: a 100% commitment to Unicode can make life much easier, but old-time Python programmers have to be dragged kicking and screaming to Unicode ("it's inefficient", "it's wasteful", "it's too hard", ..., all the kinds of things old people say when they're too cranky to learn new tricks ). You'll have my support in fighting that battle, but not really much of my help -- because I'm one of the old farts who still hasn't learned anything about how to live in a Unicode world. Asians are likely to complain about Unicode too, but adapting SpamBayes to Asian languages has many deep problems that European languages shouldn't face (spambayes splits the body into tokens by whitespace, and that's it -- it deliberately didn't assume 7-bit ASCII English). I'm not sure whether the Python email package plays nicely with Unicode. That could be a real problem at the starting gate, if not. From tim.one at comcast.net Sun Nov 16 00:12:01 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Nov 16 00:12:05 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: <16309.3699.302897.207278@montanaro.dyndns.org> Message-ID: [Skip Montanaro] > ... > I've developed a few seat-of-the-pants training maxims, both from > personal experience and from reading what others have done: > > * Don't be afraid to retrain from scratch. The system learns > quickly. Retraining from scratch is often the quickest way to > recover from training mistakes. > > * Bigger is not always better, no matter what all those > enlargement messages would have you believe. A larger database > is harder to examine for mistakes, and a few mistakes skewed in > the same directionn may be hard to overcome with correct > training. You'll also reach a point where you want to just > delete all that spam. Once you do that, you've completely lost > the ability to find mistakes. If you only have a few messages > in your training database things will be easier to manage. > > * Never train on the same message twice. Using iterative > reasoning it's easy see you should never train on the same 100 > or 1000 times either. ;-) > > * Seek balance in your training database. Similar numbers of ham > and spam are good. > > * Don't automatically train on all incoming messages. If you get > swamped with spam, you will quickly wind up with a training > database which is wildly out-of-balance. > > * Don't worry about training on every unsure message either. Some > messages just aren't amenable to a strict classification. For > example, a bounce message from a mail server containing an > attached spam may be best left untrained. It contains both > strong ham clues (all the postmaster gibberish which you would > get in a bounce of an otherwise valid message) and strong spam > clues (the spam message itself). Calling that message as ham > or spam is likely to worsen the classification of future mail > bounces or future similar spam. Those are excellent suggestions, Skip! How about immortalizing them on Richie's SpamBayes Wiki? (Good info in email msgs is rarely found again.) > My environment is much different than yours, so I don't know how > you'd get the Outlook plugin to score messages again, but if it can > do that, a little judicious checking will probably avoid the need to > over-train. In current incarnations of the addin, it's easy, but has moved from where it was found in earlier versions of the addin Spambayes -> Filter messages ... That brings up a dialog box with 4 sets of choices: + A multi-folder selector to choose the folder(s) you want to filter (just the Unsure folder in this application of it). + A radio button to choose whether you *just* want msgs rescored, or also want them shuffled around to other folders based on your other SpamBayes settings. (You do want them shuffled around in this application of it.) + Whether to rescore email that's already been scored. (Yes, in this application of it.) + Whether to restrict rescoring to unread mail. (Depends on what else is in your Unsure folder, I suppose.) From tim.one at comcast.net Sun Nov 16 00:24:41 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Nov 16 00:24:45 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A012C3945@cliff.bai.org> Message-ID: [Ryan Malayter] > ... > The way the probabilities are actually computed, the more data you > have, the more accurate your probabilities get, and the better the > filter will perform. If the measured statistical properties of ham and spam never changed over time, that would be true. A possible problem in practice is that the true probabilities do change some over time, and then the more *stale* training data you're carrying forward, the worse the math estimates current probabilities. That's one reason to favor a small database in practice: it responds more quickly to new training; it doesn't have so much of the past pointing it in out-of-date directions. > Up to a point, of course... there will always be diminsighing returns. > There's not much difference in practical terms between 99.7% accuracy > and 99.8% accuracy. That depends on what you're doing, and expressing those as error rates makes the issue clearer: a .3% error rate is 50% larger than a .2% error rate. This project was originally aimed at filtering high-volume tech mailing lists, and when you're dealing with 10s or 100s of thousands of emails per day, an absolute .1% increase in the error rate can translate to hundreds of additional messages kicked out for moderator review every day. If the comp.lang.python tests at the time had been able to achieve *only* 99.9% accuracy, I probably would have given up. That said, I agree that for personal email, for most people (those who don't get thousands of emails per day) a difference of 0.1% in the error rate will be noticed only by the self-destructively obsessed . From tim.one at comcast.net Sun Nov 16 00:39:52 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Nov 16 00:39:56 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: [Seth Goodman] > ... > Well, what bothers me, so far, is that despite training on 620 ham > and 1403 spam, SpamBayes still manages to miss (score as ham) 5-10 > messages per day out of around 150 scored messages. I get about 700 emails per day, about 200 of them spam, and see one or two spam left in my inbox per week. My training data is a little better balanced than yours, and about the same total number of messages, and I'm certain I haven't trained any messages into the wrong category. My cutoffs are at 20 and 80. The vast bulk of my ham comes from technical mailing lists, which appears exceptionally easy to identify as ham. > Most of these missed spams have an initial score very close to zero, so > simply lowering the ham threshold would not fix it. There's no way to diagnose this without staring at the evidence the classifier used to reach its decision. It's not magic, it's just throwing a bunch of numbers at each other . I don't recall which SpamBayes application you're using. If it's the Outlook addin, just do Spambayes -> Show spam clues for current message > After training as spam, their spam score often increases respectably, > but sometimes, the score stays below 5%. This indicates that the same > message would be missed next time, as well. Yes. > I don't know if I just need to get a bigger or more balanced training > set, if there are some types of tokens (such as embedded URL's in HTML > spam) that are not currently parsed or if this is just as good as it > gets. It sounds unreasonably bad to me, but we're not going to *guess* the cause. If you generate a report containing the evidence the classifier used, that will tell us exactly why the message got its score. Best guess is that something is screwed up in your training data. This *sounds* like symptoms some others have had before they discovered that they trained some messages into the wrong category. From tim.one at comcast.net Sun Nov 16 01:22:21 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Nov 16 01:22:26 2003 Subject: [Spambayes] .db files In-Reply-To: <3FB4CC42.6060306@videotron.ca> Message-ID: [papaDoc] > ... > It also depens where you work. I work in simulation and every > calculation and memory you can save make your simulator more > appealing since you can do more with less. (Do you prefer > to pay 3000$ for a simulator you can run on a Current Standard > PC or pay 250 000$ for a simulator that will do the same thing > on a bigger screan) (See www.simlog.com) I'd go for the 3000 choice, but some large companies wouldn't: inexpensiveness is often viewed as correlated with risk. I worked at Cray Research for 10 years, in "supercomputer" compiler development, so am all too familiar with speed-at-any-cost applications. They have their place, and some apps have a legitimate need for that. But we put more effort into each of Cray's compilers each year than the sum total of all work put into Python, and its libraries, over the last 12 years combined, and that's the way the world works: the *human* costs of producing highly optimized code have become unaffordable for all but apps that truly need that level of optimization, and have a market willing to pay those extreme costs. SpamBayes was scoring email the same day I started to write it -- and my desktop PC has a clock rate 5x faster than the US$10,000,000+ Cray Y-MP that was the top of the line when I left Cray (in 1988). SpamBayes may have been too slow and big to use on a 10 million dollar machine 15 years ago, but it runs happily on my 3-year-old laptop today. I would be happy to see it run faster, but not happy enough to endure the pain of getting there. Since DB access time appears to be the bottleneck now, it may not even have been significantly faster coded in C. But it's all moot. It would have taken at least 30x longer for me to code the thing in C, and I didn't have that much time to give to it: if it weren't for the decision to use Python, SpamBayes simply wouldn't exist. The economic tradeoffs that drove that decision have only gotten "even more so" since then. I take solace in that it appears to require 128KB of RAM just to boot Windows XP . From tim.one at comcast.net Sun Nov 16 01:22:22 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Nov 16 01:22:32 2003 Subject: [Spambayes] .db files In-Reply-To: Message-ID: [Kenny Pitt] > ... > A large percentage of the memory used by K9 or SpamBayes is for the > training data, so actual footprint will always depend on how many > messages you have trained. I don't know about K9. Most SpamBayes users have a Sleepycat Berkeley database now, and that's disk-based. The memory burden when using a disk-based database has much more to do with the number of messages you've scored in the current session than with the number of message you've trained on. Most token information sits on disk unused: only the tokens actually seen when scoring new email get read up. For speed, spambayes keeps the info for tokens used in scoring in memory too, so memory use will increase as more messages get scored. From dalke at dalkescientific.com Sun Nov 16 04:33:33 2003 From: dalke at dalkescientific.com (Andrew Dalke) Date: Sun Nov 16 04:33:20 2003 Subject: [Spambayes] pop3 culler Message-ID: I am the proud recipient of about 170 spams a day which is 1.5MB per day. My mail is held by my ISP and has a limit of 10MB, which means that if I don't check mail for a week then it will get full. My primary machine is a laptop, which I take with me on travels. I suspend it when I'm not using it. I have an NT box in my house which I can keep running all the time. When the virus was going around a couple months ago I was getting a 110K attachment every minute or so, which meant that my mailbox was filling up overnight. I ended up running Kevin Altis's kill script (with mods) on my NT box to remove all emails from the POP3 server. I'm going to be traveling soon and don't know often I'll be able to access mail from my laptop. It may be I can only connect through a slow connection, if at all. Thus, I want to set up a cull script on my NT box which connects to my POP3 account and removes any and all high scoring spam and emails which have MS Windows executable attachements, except those which meet some simple whitelist criteria. (Being cautious, I might instead have it save all the spams to the local NT drive for review when I get home.) I would like to use spambayes to handle this case. That is, I have one server with two clients that connect to it. The first is my laptop and the second is the culler, which connects every, say, 30 minutes and deletes spams. This should greatly reduce the amount of data in my mailbox. I did not find mention of this case in the FAQ nor in my review of the SB code did I see any existing program which could handle it. Does such a beast exist or should I look into writing one myself? If the latter, what's a good starting point? Andrew dalke@dalkescientific.com From richie at entrian.com Sun Nov 16 06:29:30 2003 From: richie at entrian.com (Richie Hindle) Date: Sun Nov 16 06:30:01 2003 Subject: [Spambayes] pop3 culler In-Reply-To: References: Message-ID: Andrew, > the culler, which connects every, say, 30 minutes > and deletes spams. My approach would be to use the SpamBayes POP3 proxy (sb_server.py) as-is, and write a script that used Python's POP3 client module to connect, retrieve the emails, look at the X-Spambayes-Classification header, and delete the ones marked as spam. If I weren't storing the messages, I'd set a high spam threshold. I'd then publish the script on the SpamBayes Wiki at http://entrian.com/sbwiki 8-) -- Richie Hindle richie@entrian.com From richie at entrian.com Sun Nov 16 06:41:52 2003 From: richie at entrian.com (Richie Hindle) Date: Sun Nov 16 06:42:14 2003 Subject: [Spambayes] Training with cache on the POP proxy In-Reply-To: <069301c3abe7$500f2230$1c03a8c0@jim> References: <069301c3abe7$500f2230$1c03a8c0@jim> Message-ID: <5ioervgn339hkif5aupp2p91jlqa5rvsv9@4ax.com> [Jim] > Regarding training, I'm wondering what is the recommended method of using > cached messages. Skip recently posted a few guidelines, which are in the archives here: http://mail.python.org/pipermail/spambayes/2003-November/009267.html -- Richie Hindle richie@entrian.com From richie at entrian.com Sun Nov 16 06:44:52 2003 From: richie at entrian.com (Richie Hindle) Date: Sun Nov 16 06:45:13 2003 Subject: [Spambayes] OE Question In-Reply-To: <045801c3abdb$a4a57440$1c03a8c0@jim> References: <045801c3abdb$a4a57440$1c03a8c0@jim> Message-ID: [Jim] > I'm trying to set up additional OE email accounts, both on the same POP > server and on different servers. But I'm not sure how (or if) you can set up > OE to use an alternative port for POP3. Any ideas? A quick Google (for "outlook express" "pop3 port") found this page: http://support.aol2pop.com/AOL2Pop_Port.html which has these instructions: ----------------------------------------------------------------------- >From the main Outlook Express window, select Accounts... from the Tools menu. In the Internet Accounts listing, highlight the e-mail account [...]. In the Properties dialog, select the Advanced tab. The first section of the tab should read Server Port Numbers. ----------------------------------------------------------------------- -- Richie Hindle richie@entrian.com From webmaster at beyond-thoughts.com Sun Nov 16 08:59:18 2003 From: webmaster at beyond-thoughts.com (Christoph Becker-Freyseng) Date: Sun Nov 16 09:00:19 2003 Subject: [Spambayes] AT & T Patent on tricking Anti-Spam-Tools Message-ID: <3FB782B6.2020809@beyond-thoughts.com> AT & T got a patent (6.643.686) on tricking Anti-Spam-Tools. I'm not sure if it's good or bad ... http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=/netahtml/srchnum.htm&r=1&f=G&l=50&s1=6,643,686.WKU.&OS=PN/6,643,686&RS=PN/6,643,686 cu cbf From wsy at merl.com Sun Nov 16 09:30:46 2003 From: wsy at merl.com (Bill Yerazunis) Date: Sun Nov 16 09:30:55 2003 Subject: [Spambayes] AT & T Patent on tricking Anti-Spam-Tools In-Reply-To: <3FB782B6.2020809@beyond-thoughts.com> (message from Christoph Becker-Freyseng on Sun, 16 Nov 2003 14:59:18 +0100) References: <3FB782B6.2020809@beyond-thoughts.com> Message-ID: <200311161430.hAGEUkk28280@localhost.localdomain> From: Christoph Becker-Freyseng AT & T got a patent (6.643.686) on tricking Anti-Spam-Tools. I'm not sure if it's good or bad ... http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=/netahtml/srchnum.htm&r=1&f=G&l=50&s1=6,643,686.WKU.&OS=PN/6,643,686&RS=PN/6,643,686 Not bad. It's the idea of making all your messages different (e.g. putting in a bit of different text in each message, or in each group of messages). The idea is to foil spamfilters that detect duplicate messages. Now, there's a good reason for ATT to patent this- it means that anyone doing this can be sued by ATT. I approve. :) -Bill YerazunisC From rcoe at CambridgeMA.GOV Sun Nov 16 09:42:16 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Sun Nov 16 09:42:19 2003 Subject: [Spambayes] RE: AT & T Patent on tricking Anti-Spam-Tools Message-ID: > From: Christoph Becker-Freyseng [mailto:webmaster@beyond-thoughts.com] > Sent: Sunday, November 16, 2003 8:59 AM > To: spambayes@python.org > Subject: [Spambayes] AT & T Patent on tricking Anti-Spam-Tools > > > AT & T got a patent (6.643.686) on tricking Anti-Spam-Tools. > I'm not sure if it's good or bad ... Arguably, all software patents are bad. But that said, it's hard to see the practical effect of this one as anything but good. If ATT doesn't enforce it, we're no worse off than we are now. If they do enforce it, it stands to increase spammers' legal exposure and/or their cost of doing business. Or, if they have to stop using the patented algorithm, maybe it will decrease the effectiveness of their activity. Bob From rcoe at CambridgeMA.GOV Sun Nov 16 10:33:44 2003 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Sun Nov 16 10:33:49 2003 Subject: [Spambayes] RE: pop3 culler Message-ID: If Andrew's PC has a full-time connection to the Internet, he might be able to do it without even writing a script. The PC would be allowed to download everything, accommodating the ISP's restriction on mailbox size. Then assuming a firewall penetrable by an authenticated client, Andrew would connect his laptop to his PC, download what he wants, leave what he prefers to read later, and delete the rest. I'm assuming that the Spambayes POP3 proxy, which I haven't used, functions as both a POP3 client and a POP3 server. If it doesn't, there are other proxies (VPOP3, for example) that do. Bob > -----Original Message----- > From: Richie Hindle [mailto:richie@entrian.com] > Sent: Sunday, November 16, 2003 6:30 AM > To: Andrew Dalke; spambayes@python.org > Subject: Re: [Spambayes] pop3 culler > > > Andrew, > > > the culler, which connects every, say, 30 minutes and deletes spams. > > My approach would be to use the SpamBayes POP3 proxy (sb_server.py) as-is, > and write a script that used Python's POP3 client module to connect, > retrieve the emails, look at the X-Spambayes-Classification header, and > delete the ones marked as spam. If I weren't storing the messages, I'd > set a high spam threshold. > > I'd then publish the script on the SpamBayes Wiki at > http://entrian.com/sbwiki 8-) From richie at entrian.com Sun Nov 16 11:47:35 2003 From: richie at entrian.com (Richie Hindle) Date: Sun Nov 16 11:47:57 2003 Subject: [Spambayes] RE: pop3 culler In-Reply-To: References: Message-ID: [Bob] > I'm assuming that the Spambayes POP3 proxy, which I haven't > used, functions as both a POP3 client and a POP3 server. Sort of, but not in the sense that you mean. It doesn't store messages; it merely annotates them with extra headers as they pass through. -- Richie Hindle richie@entrian.com From tmj at juntunen.us Sun Nov 16 11:48:19 2003 From: tmj at juntunen.us (Thomas Juntunen) Date: Sun Nov 16 11:48:24 2003 Subject: [Spambayes] RV: I18N and L10N In-Reply-To: Message-ID: On 11/16/2003, "Tim Peters" imposed order on a stream of electrons to say: >I expect that, like most of the rest of the developers, I think that would >be great, but don't have relevant experience, or time, to give to it. If I >were you , I'd announce my *intent* on the newsgroup comp.lang.python, >to attract the interest of Python programmers with real-life I/L*N >experience. There are people who know pretty much exactly what to do, but >they don't hang out on this list, and this has much more to do with using >Python's relevant features (like Unicode) correctly than with SpamBayes >specifically. You might have luck asking on a Zope mailing list too (Zope >is a popular web content management system coded in Python, with users all >over the world, and within the last couple years has benefited by many >peoples' intense help with I/L*N). You folks might find this article useful: http://www.joelonsoftware.com/articles/Unicode.html Thomas Juntunen From Jeff at ModestSystems.com Sun Nov 16 13:05:46 2003 From: Jeff at ModestSystems.com (Jeff Jansen) Date: Sun Nov 16 13:05:52 2003 Subject: [Spambayes] Training using the Outlook Plug-In Message-ID: I have a question about how and when the SpamBayes Outlook plug-in "learns" (that is, trains itself) during use. Apparently, with the appropriate check boxes checked (on the "Training" tab of the Spambayes Manager dialog box), SB will learn whenever I move a message into or out of the Spam folder using the "Delete As Spam" or "Recover From Spam" buttons on the toolbar. Thus, anytime I manually identify an "unsure" message as spam or ham, SB learns. That's great. My question is about the messages that it successfully identifies as spam or ham during use based on the current status of the training database. Does SB learn from those messages as well, or does it merely move the message into the appropriate folder? Maybe the real question is, is there anything for SB to learn from such messages? I'm thinking that whatever the "pattern" that causes a message to be correctly identified as spam or ham, the database would be "strengthened" by the knowledge that there is one more message that matches that pattern, so that there IS something for SB to learn from those messages. So the question is, is that what SB does? The bottom line is, if I'm basically happy with the way SB is working, do I need to periodically forcibly retrain it (using the "Start Training" button) and make it look at all the messages that have arrived since the last forcible training, or are those messages already reflected in the training database? Jeff Jansen | Portland, Oregon | USA From papaDoc at videotron.ca Sun Nov 16 14:39:11 2003 From: papaDoc at videotron.ca (Remi Ricard) Date: Sun Nov 16 14:36:30 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: References: Message-ID: <1069011550.3384.3.camel@porsche.hq.simlog.com> Hi, > It looks like K9 *does* age out old messages once the message counts for > spam or ham are reached and a new message is available for training. You can "clean" your database from old trained message with a patch that I provided (go see the patch on sourceforge). What it does is: you provide a value ( let say 60) so every messages older than 60 days will be remove from the database (If they were used for training. Remi -- Remi Ricard From m.nanne at testtechniek.com Sun Nov 16 14:59:01 2003 From: m.nanne at testtechniek.com (Nanne Technical Services) Date: Sun Nov 16 14:59:16 2003 Subject: [Spambayes] outlook only starts in SAFEMODE Message-ID: <000001c3ac7c$14e3c720$c100000a@LAPTOPMARCEL> Hello , After I have installed SpamBayes Outlook plug-in - 0.7 ,outlook 2002 (10.2627/10.2625) only wants to start in Safemode. I deinstalled SpamBayesit but the error stayed. Then I made another user in Xp , when I am logged in as this new user outlook start normal. I have exported my database and imported in the new user and it works. (so I think the database is ok). Then I reinstalled OFFICE XP again but the same error is there again. I have searched everything but I can't find anything. Will you please help? With kind regards, Marcel Nanne Sint Annaparochie The Netherlands From tim.one at comcast.net Sun Nov 16 16:18:32 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Nov 16 16:18:24 2003 Subject: [Spambayes] RE: .db files In-Reply-To: <4C542552EF434E43B2F2572854A73CBD6F70CF@corp-mail1.srdcorp.com> Message-ID: [David Reed] > Much as it distresses me, I have to agree with Bob. > > Seems to me the major force in hardware advancement is software > gluttoney, however. :) The HW manufacturers strongly encourage this, too. For example, I worked in computer speech recognition for 6 years, and Intel was always asking us how we could make the product require more processing power, because they wanted demanding apps to drive sales of new CPUs. In speech recog, that's easy to do, so Intel liked us a lot. Seagate was a major early investor, in part because they recognized that if speech recog caught on, it would stimulate sales of new, bigger disks. IIRC, memory chip manufacturers didn't talk to us, presumably because Microsoft OS bloat and gamers were already doing more than they could handle at the time to stimulate demand for new, bigger memory chips. In short, software gluttony feeds new HW sales, so the HW folks are big fans of software bloat. SW folks like bloat too, because "new features" drive upgrade sales. A happy consequence for us (SpamBayes doesn't have anything to sell) is the modern PCs are so powerful that even a pig like SpamBayes can run on them without strain . From tim.one at comcast.net Sun Nov 16 16:37:50 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Nov 16 16:37:42 2003 Subject: [Spambayes] Spam Clues: Re: index.cgi redirection In-Reply-To: <08C94E00610F2E43A406811D00880C9705A5D0@exchserv.ais> Message-ID: [Gr?tzmacher, Lukas] > The SpamBayes Manager reports the training status as about 1600 ham > and 135 spam mails, even I think I had more spam mails. > > Do I understand you right: Because I have more ham then spam mails > my training becomes unbalanced ? It is badly unbalanced (more than a 10-to-1 ratio favoring ham), but I'm not sure that's your *only* problem. For example, this punched me in the eye:' >>> word spamprob #ham #spam >>> ... >>> 'header:Date:1' 0.647151 912 135 >>> 'header:From:1' 0.647151 912 135 Those are just recording whether an email has a Date line in the email headers, and a From header line. Virtually all email does have both such lines, in which case their spamprobs are 0.5 -- neutral. So it's very surprising to see the presence of a Date or From line show up as a significant clue. You said you trained on 135 spam, and those two lines show that every one of your spam had both header lines. That's not surprising -- it's expected. The baffling part is that only 912 of your 1600 ham had Date and From lines, less than 60% of your ham. What kind of ham do you get that lacks Date and >From headers? I don't know that it's impossible, but it is fishy and so something to look at. From ghlowe at worldnet.att.net Sun Nov 16 17:57:00 2003 From: ghlowe at worldnet.att.net (George Lowe) Date: Sun Nov 16 17:57:21 2003 Subject: [Spambayes] Installation Problems Message-ID: <000201c3ac94$f6c44120$48ac480c@CPQ20181235371> When I attempted to install, I got the message: " Unable to register the DLL/OCX; DllRegisterServer failed; code 0x00000000". Suggestions? George Lowe From sethg.delete at GoodmanAssociates.com Sun Nov 16 18:15:35 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Sun Nov 16 18:15:37 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: > [Seth Goodman] > > ... > > Well, what bothers me, so far, is that despite training on 620 ham > > and 1403 spam, SpamBayes still manages to miss (score as ham) 5-10 > > messages per day out of around 150 scored messages. [Tim Peters] > > I get about 700 emails per day, about 200 of them spam, and see one or two > spam left in my inbox per week. My training data is a little better > balanced than yours, and about the same total number of messages, and I'm > certain I haven't trained any messages into the wrong category. > My cutoffs > are at 20 and 80. The vast bulk of my ham comes from technical mailing > lists, which appears exceptionally easy to identify as ham. My cutoffs were the default 90 and 15. My mail stream is as follows: about 300 messages per day, of which 120 are spam. One significant difference in our setups is that all of my listmail traffic, which proved to be all ham, is taken care of by Outlook rules and not seen by SpamBayes. I would have even more listmail traffic if it were not for the fact that several of my lists are available as newsgroups. When I started using SpamBayes, I found, just as you report, that the listmail traffic was always correctly classified, but it also never contained spam. It seemed inappropriate to fill the ham token database with tokens typical of mail that I could reliably classify without SpamBayes. I then put SpamBayes in background mode so the Outlook rules took care of all the listmail and only non-list traffic would get scored and classified. That leaves about 140 messages per day, 120 of which are spam. The remaining 20 ham messages are personal and business communications, i.e. what I really want in my inbox. Perhaps this type of message stream is tough for SpamBayes and I should run things differently? [Seth Goodman] > > I don't know if I just need to get a bigger or more balanced training > > set, if there are some types of tokens (such as embedded URL's in HTML > > spam) that are not currently parsed or if this is just as good as it > > gets. [Tim Peters] > > It sounds unreasonably bad to me, but we're not going to *guess* > the cause. > If you generate a report containing the evidence the classifier used, that > will tell us exactly why the message got its score. Best guess is that > something is screwed up in your training data. This *sounds* > like symptoms > some others have had before they discovered that they trained > some messages > into the wrong category. OK, here's what I'll do. I'll eliminate the two possible problems you raised before I ask anyone to troubleshoot anything. These are the unbalanced training set size and to re-validate all my training messages. I've pruned my initial spam training corpus by deleting the oldest messages so that I have about 650 spam and 650 ham as an initial training set. The only exception to pruning by age is that I kept all spams that were previously misclassified as ham (scored less than 15) to seed the spam token database with the "tough to classify spam". Due to the preponderance of spam I get in the message stream I filter, my ham corpus goes back about two months and my spam corpus goes back about a week. I've gone over each message corpus one more time, but only found one questionable message in the ham corpus: it was hard to decide what it was, but I moved it to the spam corpus. I will train and rebuild the database on these new corpuses. I will leave my thresholds at the default 90/15, as I *never* got a false positive and I like it that way. In the past, virtually all of my unsures were spam. Therefore, in the unsure group, I will train on all ham (if I ever see any there) but I will only train on spam with a score of less than 50 to limit the growth of the spam token database. This is still not perfect, as the spam corpus will continue to grow but the ham corpus probably won't. I'll run the system for a while, keep statistics, and wait for things to settle down. I'll let you know what I see and if I still get too many missed spams, I'll post a few of the classifier outputs for the missed spams for evaluation. If you'd prefer that I run this experiment differently, please let me know and I will restart it. -- Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above From sethg.delete at GoodmanAssociates.com Sun Nov 16 20:20:10 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Sun Nov 16 20:20:15 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: Tim, An piece of information is evident from going over my past statistics. The system performed respectably after the initial training on around 600 spam and 600 ham. The missed spam initially went down as I trained on more of it, but then it seemed to increase until it missed over 10%. I previously trained on all missed spam and everything in the unsure folder. -- Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above From tameyer at ihug.co.nz Sun Nov 16 23:05:29 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Nov 16 23:02:57 2003 Subject: [Spambayes] pop3 culler In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130415CDF2@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29D0@its-xchg4.massey.ac.nz> > Thus, I want to set up a cull script on my > NT box which connects to my POP3 account and removes > any and all high scoring spam and emails which have > MS Windows executable attachements, except those > which meet some simple whitelist criteria. [...] > I did not find mention of this case in the FAQ nor > in my review of the SB code did I see any existing > program which could handle it. Does such a beast > exist or should I look into writing one myself? If > the latter, what's a good starting point? The only thing I would add to Richie's comments is that if your connection from the NT box is slow or expensive, then you could get it to do this based on header analysis only (it's simple enough to add an option to sb_server to get it to analyse on TOP instead of RETR). The archives for this list and the -dev one have various messages about the results from header-only analysis, but basically (and of course, YMMV) I personally found that it tended to be good enough for this sort of task, and nicely conservative (tending to leave spam, rather than remove ham). =Tony Meyer From Tom-Moore at att.net Mon Nov 17 01:04:09 2003 From: Tom-Moore at att.net (Tom Moore) Date: Mon Nov 17 01:04:17 2003 Subject: [Spambayes] Prompt to install "VBA (2720)" Message-ID: <000101c3acd0$9e81af20$314b4a0c@tmoore> I have mail set up using Outlook for two accounts under Windows XP Pro. One is working fine. Mail is being filtered and the tool bar items work fine. The second (mine) is not working properly. Mail is being filtered and the tool bar items appear but clicking on a tool bad item results in the Windows Installer trying to install "VBA (2720)". It prompts to "Please insert disk labeled "Visual Basic for Applications (2720)". Interestingly enough, while the the "Delete as SPAM" and "Recover from SPAM" items display the above dialog, they work to move the mail message from one folder to another. The SpamBayes menu on the other hand displays the installation dialog and fails to display any menu. The SpamBayes log displays the following. The one error about failing to add the toolbar item shows up on both accounts and as such appears to possibly be a red herring. Loaded bayes database from 'C:\Documents and Settings\Tom Moore\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\Tom Moore\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 254 spam and 153 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.1.2600 (Service Pack 1) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] SpamBayes: Watching for new messages in folder Inbox SpamBayes: Watching for new messages in folder Junk E-Mail Processing 0 missed spam in folder 'Inbox' took 49.644ms FAILED to add the toolbar item 'SpamBayesCommand.Manager' - (-2147352567, 'Exception occurred.', (0, None, None, None, 0, -2147467259), None) Deleted the dead popup control - re-creating Message 'SPAM test' had a Spam classification of 'No' SpamBayes - Disconnecting from Outlook SpamBayes processed 1 messages, finding 0 spam and 0 unsure Addin terminating: 2 COM client and 2 COM servers exist. I have reinstalled Office XP while logged in as both accounts. I have also removed and reinstalled SpamBayes while logged in as both accounts. Nothing seems to help. Any suggestions? Regards, Tom Moore From Tom-Moore at att.net Mon Nov 17 01:27:22 2003 From: Tom-Moore at att.net (Tom Moore) Date: Mon Nov 17 01:27:29 2003 Subject: [Spambayes] Prompt to install "VBA (2720)" In-Reply-To: <000101c3acd0$9e81af20$314b4a0c@tmoore> Message-ID: <000601c3acd3$dcbf5730$314b4a0c@tmoore> I fixed the problem. I selected the "Browse" button in the installer dialog and it was looking for a file "vba6.msi" A search of my C drive for this file name turned it up in a totally unrelated application directory. I selected that file and the installer was happy. Now Outlook works for both logins. From dalke at dalkescientific.com Mon Nov 17 01:43:07 2003 From: dalke at dalkescientific.com (Andrew Dalke) Date: Mon Nov 17 01:42:42 2003 Subject: [Spambayes] pop3 culler In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F29D0@its-xchg4.massey.ac.nz> Message-ID: <4DC43B64-18C9-11D8-AECD-000393C92466@dalkescientific.com> Richie Hindle: > My approach would be to use the SpamBayes POP3 proxy (sb_server.py) > as-is, > and write a script that used Python's POP3 client module to connect, > retrieve the emails, look at the X-Spambayes-Classification header, and > delete the ones marked as spam. That's a cute way to do it. It feels like a hack, because of the extra server->proxy->culler network step, but OTOH the machine isn't doing anything else. > If I weren't storing the messages, I'd set a high spam threshold. Yeah, and whitelist known good email sources. Bob Coe: > If Andrew's PC has a full-time connection to the Internet, he might be > able to do it without even writing a script. [...] > Then assuming a firewall penetrable by an authenticated client, Andrew > would connect his laptop to his PC, download what he wants, leave what > he prefers to read later, and delete the rest. It's a DSL connection, but I could get one of those dyndns entries. There are a few problems. I would need to open up my firewall a smidgeon, which I'm not happy about. It would also mean I can't use the web interface to my mail (eg, from a cybercafe) or ssh (my client's site doesn't allow connections to web-based email because downloads don't go through their virus detectors). All in all, too complicated for my tastes. Richie's seems much simpler. Tony Meyer: > The only thing I would add to Richie's comments is that if your > connection > from the NT box is slow or expensive, then you could get it to do this > based > on header analysis only DSL. Bandwidth isn't a problem for this, My take-home message ... wait, I am at home! .. is that no such tool already exists. Okay, I'll play around with it for a bit and post what results. Andrew dalke@dalkescientific.com From sanjaydarisi at cox.net Mon Nov 17 08:43:38 2003 From: sanjaydarisi at cox.net (Sanjay Darisi) Date: Mon Nov 17 08:43:51 2003 Subject: [Spambayes] Few quick questions! Message-ID: <3FB8D08A.5050802@cox.net> I have three quick questions regarding spambayes outlook addin. Firstly, Isn't that possible to add spam field directly to the Outlook view instead of user adding it manually from user-defined fields? Secondly, If I want to add the delete as spam/recover as ham buttons to the message view that is displayed when an email message is double-clicked in Outlook, how do I do that? Any ideas? Thirdly, If I want to add some personal signature regarding spambayes after I install it for my Outlook, How do I do that? Any suggestions? Thanks in advance, Sanjay. From kennypitt at hotmail.com Mon Nov 17 09:58:42 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Nov 17 09:59:09 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: <792DE28E91F6EA42B4663AE761C41C2A012C3945@cliff.bai.org> Message-ID: Ryan Malayter wrote: >> From: Kenny Pitt > >> Because all scores are based on ratios, every additional message that >> you train on dilutes the effect of the prior tokens in that >> corpus that >> don't appear in the new message. For example, if I start with 50 >> trained hams and have a token that has been seen 10 times, it >> contributes a ham probability of 0.2 (10/50) to the scoring. >> If I later >> train on 50 more hams that don't contain that token, it's ham >> probability drops to 0.1 (10/100). > >> So training on more ham can actually cause you to miss good messages >> that you were previously classifying correctly. > > I don't think your examples are correct, mathematically. Your 0.2 and > 0.1 don't take into account how often the token is seen in the other > corpus. The actual formula used by spambayes for the probabiltiy of a > given token is more complex, and requires looking at the size of both > corpuses and the number of occurances in each. See > http://www.paulgraham.com/spam.html for a good explanation of the > general method used. > > So, in your example, if the token never occurred in a spam, your > single-token ham probabilites would actualy be something more like > 0.99 and 0.99 instead of 0.2 and 0.1. > > The way the probabilities are actually computed, the more data you > have, the more accurate your probabilities get, and the better the > filter will perform. Up to a point, of course... there will always be > diminsighing returns. There's not much difference in practical terms > between 99.7% accuracy and 99.8% accuracy. My mistake, I should have used the word *ratio* instead of *probability* in this case. There are a number of factors that contribute to the final probability including unknown word strength, the Robinson "rare word" adjustment, the optional ham/spam imbalance adjustment, etc. I was trying to simplify somewhat, and hopefully did not over-simplify to the point of incorrectness. The intended point is this. Once you have trained on a sufficient number of messages, there are only two components that contribute significantly to the spam probability of a single word: the ratio of ham token count to total ham message count and the ratio of spam token count to total spam message count. If the spam ratio and the ham token count do not change, the computed probability for that word will get *spammier* if I increase the total *ham* message count. Note that none of this has any effect if the token has only ever been seen in one corpus or the other. Before the Robinson adjustment, the spam probability of a token that has only appeared in spam will be 1.0 regardless of whether it appeared once in two messages or once in 200,000 messages. Similiary, the spam probability of a token that has only appeared in ham will be 0.0. The total number of messages becomes a factor at the point when a word has been seen in both ham and spam and we need to decide which of the two is more likely. For those with a statistical mindset who want to follow this computation all the way through, here is the complete formula (as I understand it) that is used by SpamBayes to compute the spam probability of a single word when the ham/spam imbalance adjustment is *not* enabled: hc = ham token count nh = total number of ham messages sc = spam token count ns = total number of spam messages hr = ham ratio = hc / nh sr = spam ratio = sc / ns p = base spam probability = sr / (sr + hr) S = unknown word strength (static factor = 0.45 by default) x = unknown word probability (static factor = 0.5 by default) n = total number of messages = nh + ns sp = final spam probability = ((S * x) + (n * p)) / (S + n) -- Kenny Pitt From kennypitt at hotmail.com Mon Nov 17 10:15:11 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Nov 17 10:15:37 2003 Subject: [Spambayes] ageing out database entries In-Reply-To: Message-ID: Seth Goodman wrote: > Kenny, > > ... Do you have any > comments on the stuff at the bottom of my previous post (reproduced > below)? This is what motivated the discussion of K9 training in the > first place and I would value any insight or suggestions that anyone > could offer. > >> Well, what bothers me, so far, is that despite training on 620 >> ham and 1403 >> spam, SpamBayes still manages to miss (score as ham) 5-10 messages >> per day out of around 150 scored messages. Most of these missed >> spams have an initial score very close to zero, so simply lowering >> the ham >> threshold would >> not fix it. After training as spam, their spam score often increases >> respectably, but sometimes, the score stays below 5%. This >> indicates that the same message would be missed next time, as well. >> I don't >> know if I just >> need to get a bigger or more balanced training set, if there are >> some types >> of tokens (such as embedded URL's in HTML spam) that are not >> currently parsed or if this is just as good as it gets. Anyway, >> that's what I would like to see improved and it is the motivation >> for the above discussion. >> >> Any thoughts from those who've been there already? There is currently a discussion developing under the subject "SpamBayes now filers less than 50% of my spam" regarding the significance of imbalance and possible difficulties as you train on larger numbers of messages. Some of it may apply to your situation. In your case, my guess is that you should consider how similar are the spam messages that got missed to previous spams that you have classified. You have 1403 spam messages trained, so even training 5 new spams isn't much in relation to the total number of spams if the new spam tokens haven't been seen much before. If those spams contain tokens that also appear in your ham, then it could potentially take a significant amount of training to counteract the effects of the ham tokens. You have fewer trained hams than you do spams, so a token in the ham training will contribute a little more strongly to the overall probability than the same token in the spam training. -- Kenny Pitt From skip at pobox.com Mon Nov 17 10:23:49 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Nov 17 10:24:24 2003 Subject: [Spambayes] tactics In-Reply-To: <20031115141652.GA8955@nl.linux.org> References: <20031115141652.GA8955@nl.linux.org> Message-ID: <16312.59397.605801.227853@montanaro.dyndns.org> Gerrit> currently, I only train email classified as unsure. Gerrit> Is this a wise tactic? That is a subject of some speculation. Different training approaches have not been rigorously tested. I think one reason for that is that it would require a fair amount of manual effort. (You'd have to look at everything to identify classification mistakes.) Gerrit> Most unsure-mail I get is ham, so the ham/spam ratio is now 2 Gerrit> already, and grawing. Does it help to train on spam messages Gerrit> already classified as such with 100% certainty? What are your ham_cutoff and spam_cutoff set to? Perhaps you should just increase your ham_cutoff value to push more of the unsures into the ham section. Almost all of my unsures are spam, but they score all over the region between ham_cutoff and spam_cutoff. I am slowly moving my spam_cutoff down to compensate, but have to be careful I don't start gobbling up the occasion good email which scores as unsure. Skip From milton at spine.med.utoronto.ca Mon Nov 17 10:53:29 2003 From: milton at spine.med.utoronto.ca (Milton Charlton) Date: Mon Nov 17 10:53:16 2003 Subject: [Spambayes] Install problem XP Outlook// spambayes_addin.dll// Unable to register the DLL/OCX: DllRegisterServer Message-ID: <01d901c3ad22$f259a380$08fda8c0@kinase> When I try to install Spambayes to Outlook XP I get this error that concerns the file spambayes_addin.dll: Unable to register the DLL/OCX: DllRegisterServer;failed; code 0x00000000. I see this mentioned a few times in the archive but could find no solution. Norton anti virus is turned off and Zonealarm is turned off. The machine is P4 on an Intel motherboard using a Matrox dual monitor card. Palm Hotsync is running and Pivot software is running. Spambayes works fine on my P3 with W98. I really want to use Spambayes and have several hundred spams to train with since I was away for a week. I hope someone there has the answer. Thanks! Milton P. Charlton, From tim.one at comcast.net Mon Nov 17 11:56:57 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon Nov 17 11:56:51 2003 Subject: [Spambayes] Few quick questions! In-Reply-To: <3FB8D08A.5050802@cox.net> Message-ID: [Sanjay Darisi] > I have three quick questions regarding spambayes outlook addin. > > Firstly, Isn't that possible to add spam field directly to the > Outlook view instead of user adding it manually from user-defined > fields? We don't know of a way to accomplish this. Outlook doesn't appear to expose this part of its internals to external programs. > Secondly, If I want to add the delete as spam/recover as ham > buttons to the message view that is displayed when an email message > is double-clicked in Outlook, how do I do that? Any ideas? You would have to write a bunch of new code, calling Outlook's API. Offhand I don't have any idea exactly what all that would require in the end. It's not likely to be easy. > Thirdly, If I want to add some personal signature regarding > spambayes after I install it for my Outlook, How do I do that? Any > suggestions? Sorry, I don't know what you're asking here. Try giving more details? From kennypitt at hotmail.com Mon Nov 17 11:57:55 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Nov 17 11:58:20 2003 Subject: [Spambayes] Training using the Outlook Plug-In In-Reply-To: Message-ID: Jeff Jansen wrote: > My question is about the messages that it successfully identifies as > spam or ham during use based on the current status of the training > database. Does SB learn from those messages as well, or does it > merely move the message into the appropriate folder? No, SB does not learn automatically from messages that it classifies. It only trains when you manually identify a message as one or the other. > ... Maybe the real > question is, is there anything for SB to learn from such messages? > I'm thinking that whatever the "pattern" that causes a message to be > correctly identified as spam or ham, the database would be > "strengthened" by the knowledge that there is one more message that > matches that pattern, so that there IS something for SB to learn from > those messages. So the question is, is that what SB does? There is a reasonable argument that training on spam messages that have already been classified as spam can increase the chances of catching future spam as the messages mutate. However, training on *everything* has the potential to cause your training data to have significantly more of one type of message than the other. There are definate accuracy penalties when your training is out of balance. > The bottom line is, if I'm basically happy with the way SB is > working, do I need to periodically forcibly retrain it (using the > "Start Training" button) and make it look at all the messages that > have arrived since the last forcible training, or are those messages > already reflected in the training database? IMHO, this really isn't necessary unless your training data gets out of balance. Then you can do training with the "Rebuild entire database" option turned off to get back in balance. In this case, you'll need to choose your training folders carefully so that you only increase the type of messages that you have too few of. -- Kenny Pitt From skip at pobox.com Mon Nov 17 12:15:11 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Nov 17 12:17:51 2003 Subject: [Spambayes] pop3 culler In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F29D0@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130415CDF2@its-xchg4.massey.ac.nz> <1ED4ECF91CDED24C8D012BCF2B034F13026F29D0@its-xchg4.massey.ac.nz> Message-ID: <16313.543.675533.660001@montanaro.dyndns.org> Tony> ... header analysis only (it's simple enough to add an option to Tony> sb_server to get it to analyse on TOP instead of RETR). This is probably more spambayes-dev territory, but might you encounter potential problems using TOP if only part of the message is retrieved? I'm thinking that if you try to parse an incomplete MIME body the email parser might barf. If not, I don't see any reason to restrict yourself to just the mail header. You might as well use whatever clues exist in the fragmented body. Skip From tim.one at comcast.net Mon Nov 17 12:26:25 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon Nov 17 12:26:19 2003 Subject: [Spambayes] SpamBayes now filers less than 50% of my spam. In-Reply-To: Message-ID: [Kenny Pitt] > ... > For those with a statistical mindset who want to follow this > computation all the way through, here is the complete formula (as I > understand it) that is used by SpamBayes to compute the spam > probability of a single word when the ham/spam imbalance adjustment > is *not* enabled: > > hc = ham token count > nh = total number of ham messages > sc = spam token count > ns = total number of spam messages > hr = ham ratio = hc / nh > sr = spam ratio = sc / ns > p = base spam probability = sr / (sr + hr) > > S = unknown word strength (static factor = 0.45 by default) > x = unknown word probability (static factor = 0.5 by default) > > n = total number of messages = nh + ns > sp = final spam probability = ((S * x) + (n * p)) / (S + n) That's almost right. Note that sp is a weighted average of x and p. n is the weight given to p, and should be larger the more confidence we have in the by-counting spamprob guess. You should instead have: n = total number of messages the token appeared in = hc + sc That is, the confidence we have in p is proportional to the number of messages the token has appeared in, not to the total number of all messages. The only thing the infamous "ham/spam imbalance adjustment" did was reduce n in cases of unbalanced training data, following the inuition that the more wildly unbalanced the training data, the less confidence we have in the by-counting spamprob guesses. It reduced n to what it *would* have been if we had equal numbers of trained ham and spam, with the same hr and sr ratios. It didn't do a good job in practice, though. From dreed at srdcorp.com Mon Nov 17 13:13:31 2003 From: dreed at srdcorp.com (Reed, David) Date: Mon Nov 17 13:16:38 2003 Subject: [Spambayes] Spambayes Outlook Plugin (Latest) unresponsive Message-ID: <4C542552EF434E43B2F2572854A73CBD6F710F@corp-mail1.srdcorp.com> Hi All, I installed the SpamBayes Outlook plugin several months ago for our Company President. It's been working, mostly, since then. Oh sure, a few times it has stopped working, and I'd go fix it. Well, today it stopped working. I upgraded it to the latest Outlook Plugin (.081?), and retrained it. However, it still isn't responding. I uninstalled it and reinstalled it. Still not change. No response. I can't even select an email and "DELETE AS SPAM", because it just doesn't. It's totally unresponsive to anything. It only seems to be his computer (the Company President, naturally, who isn't a computer guy at all, so he has NO patience for computer problems), and I am seemingly powerless to do anything about it. Has anyone else got any idea what might be wrong? Like I say, I've deleted it and reinstalled it, I've upgraded it, and I've retrained it already. I can't select an email and DELETE AS SPAM either. All ideas and suggestions welcome! Thanks, David From sdarisi at imap2.asu.edu Mon Nov 17 13:31:09 2003 From: sdarisi at imap2.asu.edu (Sanjay Darisi) Date: Mon Nov 17 13:24:30 2003 Subject: [Spambayes] Few quick questions! Message-ID: <3FB913ED.9000401@imap2.asu.edu> >/ Thirdly, If I want to add some personal signature regarding />/ spambayes after I install it for my Outlook, How do I do that? Any />/ suggestions? / Tim> Sorry, I don't know what you're asking here. Try giving more details? After installing spambayes Outlook addin. For every outgoing message, new email or reply msg sent by user, I want to add some msg (say as signature) at the end. How do I do that? Sanjay. From sanjaydarisi at cox.net Mon Nov 17 13:42:12 2003 From: sanjaydarisi at cox.net (Sanjay Darisi) Date: Mon Nov 17 13:35:23 2003 Subject: [Spambayes] Few quick questions! Message-ID: <3FB91684.4030307@cox.net> /> Thirdly, If I want to add some personal signature regarding /> spambayes after I install it for my Outlook, How do I do that? Any > suggestions? > Tim> Sorry, I don't know what you're asking here. Try giving more details? After installing spambayes Outlook addin. For every outgoing message, new email or reply msg sent by user, I want to add some msg (say as signature) at the end. How do I do that? Sanjay. From kennypitt at hotmail.com Mon Nov 17 13:52:46 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Nov 17 13:53:13 2003 Subject: [Spambayes] Few quick questions! In-Reply-To: <3FB91684.4030307@cox.net> Message-ID: Sanjay Darisi wrote: > After installing spambayes Outlook addin. For every outgoing message, > new email or reply msg sent by user, I want to add some msg (say as > signature) at the end. How do I do that? SpamBayes doesn't do anything like that, but you can do it very easily in your Outlook settings. Select Tools | Options and then go to the "Mail Format" tab. Click on the "Signatures..." button to create your sig. Then just select that signature for either or both of "Signature for new messages" and "Signature for replies and forwards". -- Kenny Pitt From chayo at agmac.com Mon Nov 17 15:49:31 2003 From: chayo at agmac.com (chayo) Date: Mon Nov 17 15:49:37 2003 Subject: [Spambayes] (no subject) Message-ID: <000801c3ad4c$4f3f15c0$6501a8c0@CMcKean> My name is Jeff Foster. I amtrying to find out why Iand one of my real estate agents are having so much trouble with ouyr Outlook since installing Spambayes? I am sending this from Chayo's computer. I have read all ofthe trouble shootint information available. Here is my problem, Outlook loads very very slow or doesn't load at all. Please reply to jfoster@agmac.com From jens.rantil at telia.com Mon Nov 17 16:53:13 2003 From: jens.rantil at telia.com (Jens Rantil) Date: Mon Nov 17 16:54:27 2003 Subject: [Spambayes] tactics In-Reply-To: <20031115141652.GA8955@nl.linux.org> References: <20031115141652.GA8955@nl.linux.org> Message-ID: <20031117225313.0daa3e22.jens.rantil@telia.com> Hi Gerrit, On Sat, 15 Nov 2003 15:16:52 +0100 Gerrit Holl wrote: > Is this a wise tactic? Perhaps this might be a good answer: On Sun, 16 Nov 2003 11:41:52 +0000 Richie Hindle wrote: > Skip recently posted a few guidelines, which are in the archives here: > http://mail.python.org/pipermail/spambayes/2003-November/009267.html Regards, Jens From tameyer at ihug.co.nz Mon Nov 17 18:25:32 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Nov 17 18:23:02 2003 Subject: [Spambayes] pop3 culler In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130415D179@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29D3@its-xchg4.massey.ac.nz> [Tony] > header analysis only (it's simple enough to add an option to > sb_server to get it to analyse on TOP instead of RETR). [Skip] > This is probably more spambayes-dev territory, but might you > encounter potential problems using TOP if only part of the > message is retrieved? I should have been clearer - I was thinking more along the lines of a TOP 0 (headers only) - if that classifies as spam, then doing a RETR and storing it somewhere for later review, and if not, moving on to the next message. Doing the opposite of this - RETRing if the message classifies as ham (otherwise leaving on the server) - could be useful if the user is temporarily connecting via a slow/expensive connection (like a mobile phone, or in a hotel, or something like that). I played around with implementing something like this a while back, but other projects took over... :) =Tony Meyer From sdarisi at imap2.asu.edu Mon Nov 17 21:20:00 2003 From: sdarisi at imap2.asu.edu (Sanjay Darisi) Date: Mon Nov 17 21:13:09 2003 Subject: [Spambayes] Few quick questions! Message-ID: <3FB981D0.5070806@imap2.asu.edu> Sanjay Darisi wrote: >/ After installing spambayes Outlook addin. For every outgoing message, />/ new email or reply msg sent by user, I want to add some msg (say as />/ signature) at the end. How do I do that? / Kenny wrote: > SpamBayes doesn't do anything like that, but you can do it very easily > in your Outlook settings. Select Tools | Options and then go to the > "Mail Format" tab. Click on the "Signatures..." button to create your > sig. Then just select that signature for either or both of "Signature > for new messages" and "Signature for replies and forwards". I know how to do this manually. But I would like to do it in addin itself. Like after installing the addin it has to append a signature to every outgoing email. How can I do this? Thank you, Sanjay. From richie at entrian.com Tue Nov 18 03:46:27 2003 From: richie at entrian.com (Richie Hindle) Date: Tue Nov 18 03:46:51 2003 Subject: [Spambayes] Few quick questions! In-Reply-To: <3FB981D0.5070806@imap2.asu.edu> References: <3FB981D0.5070806@imap2.asu.edu> Message-ID: [Sanjay] > I know how to do this manually. But I would like to do it in addin > itself. Like after installing the addin it has to append a signature > to every outgoing email. How can I do this? That's a strange requirement - what do you want the signature to say, if you don't mind me asking? -- Richie Hindle richie@entrian.com From ssmith at ayrhill.net Tue Nov 18 08:26:19 2003 From: ssmith at ayrhill.net (Steve Smith) Date: Tue Nov 18 08:26:42 2003 Subject: [Spambayes] Question Message-ID: <004301c3add7$915d71a0$0a01a8c0@ayrhillwkst1> Hello, First, let me say that I love SpamBayes. No other product I have tried (and there have been many) is as effective. However, I do have a recurring issue that I can't seem to resolve. I have a program running on another machine which sends email messages to me periodically. These messages are always from the same sender ("Ernestine"), contain *no* message body, contain an attachment (a .WAV file), and have the same subject ("FW: Voice Mail"). SpamBayes always sends this message to the "Junk Suspects" folder. I click "Recover from Spam", but the next time such a message is received, it too is sent to the "Junk Suspects" folder. Is it possible to have SpamBayes IGNORE messages sent from a particular sender, or in some other way prevent these messages from being sent to "Junk Senders?" Thanks for a great product! -Steve Windows XP Pro Outlook 2000 SpamBayes binary version 0.81 9/9/03 From anthony at interlink.com.au Tue Nov 18 08:36:12 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Nov 18 08:37:24 2003 Subject: [Spambayes] Question In-Reply-To: <004301c3add7$915d71a0$0a01a8c0@ayrhillwkst1> Message-ID: <200311181336.hAIDaCRq003719@localhost.localdomain> > However, I do have a recurring issue that I can't seem to resolve. I have a > program running on another machine which sends email messages to me > periodically. These messages are always from the same sender ("Ernestine"), > contain *no* message body, contain an attachment (a .WAV file), and have the > same subject ("FW: Voice Mail"). SpamBayes always sends this message to the > "Junk Suspects" folder. I click "Recover from Spam", but the next time such > a message is received, it too is sent to the "Junk Suspects" folder. Is it > possible to have SpamBayes IGNORE messages sent from a particular sender, or > in some other way prevent these messages from being sent to "Junk Senders?" If you look at the spam clues, what does it show? Obviously this message has some clues that are resulting in it being classed as Unsure. -- Anthony Baxter It's never too late to have a happy childhood. From edrubins at andisplace.com Tue Nov 18 09:22:25 2003 From: edrubins at andisplace.com (Yehudah Rubinsky) Date: Tue Nov 18 09:22:32 2003 Subject: [Spambayes] Re: Question Message-ID: <5.1.0.14.0.20031118091742.00b1fc70@localhost> Steve Smith wrote: >[....] >Is it possible to have SpamBayes IGNORE messages sent from a particular sender, or >in some other way prevent these messages from being sent to "Junk Senders? Since you're running Outlook, have you tried filtering them with a normal Outlook rule? You may have to set the background processing delay (sorry, I don't run Outlook and the plug-in, so I can't tell you exactly how to do that) to make sure Outlook filters it first. Shalom, Yehudah Rubinsky From whitey at tramac.com Tue Nov 18 11:02:01 2003 From: whitey at tramac.com (George LaCasse) Date: Tue Nov 18 10:57:15 2003 Subject: [Spambayes] Enable SpamBayes??? Message-ID: It's installed but when I try to Delete a file as spam it is telling me to "Enable SpamBayes". How do you do that? I cannot find anything in the SpamBayes Manager. George LaCasse Roanoke, VA From deane at davenportappraisals.com Tue Nov 18 11:10:39 2003 From: deane at davenportappraisals.com (Deane Davenport) Date: Tue Nov 18 11:13:31 2003 Subject: [Spambayes] ?? Message-ID: <5C53A33B3461D049ACC118BFB874510B0C4D85@ac2k01.davenportappraisals.com> Can I tag more than one file at a time to "Delete as Spam"? Deane This message (including any attachments) is confidential and may be privileged. If you have received it by mistake please notify the sender by return e-mail and delete this message from your system. Any unauthorized use or dissemination of this message in whole or in part is strictly prohibited. Please note that e-mails are susceptible to change. Appraisers/Consultants, Inc. (including its group companies) shall not be liable for the improper or incomplete transmission of the information contained in this communication nor for any delay in its receipt or damage to your system. Appraisers/Consultants, Inc. (or its group companies) does not guarantee that the integrity of this communication has been maintained nor that this communication is free of viruses, interceptions or interference. Visit our Website at http://www.DavenportAppraisals.com From milton at spine.med.utoronto.ca Tue Nov 18 11:24:14 2003 From: milton at spine.med.utoronto.ca (Milton Charlton) Date: Tue Nov 18 11:24:21 2003 Subject: [Spambayes] Install problem Outlook 2002// spambayes_addin.dll//other addin present Message-ID: <002401c3adf0$687dff70$08fda8c0@kinase> My previous complaint referred to Outlook XP when I am actually running Outlook 2002 in XP Home with all XP and Office 2002 updates installed. I am also running a previously installed addin called Sperry Duplicate email eliminator. Could that cause a problem? Sperry says "our software was tested with a lot of third party add-ins and software. There are no known issues of installation problem between those add-ins." When I try to install Spambayes to Outlook 2002 I get this error that concerns the file spambayes_addin.dll: Unable to register the DLL/OCX: DllRegisterServer;failed; code 0x00000000. Norton anti virus is turned off and Zonealarm is turned off. The machine is P4 on an Intel motherboard using a Matrox dual monitor card. Palm Hotsync is running and Pivot software is running. Milton P. Charlton, Professor Physiology Department Medical Science Building RM.# 3308 University of Toronto 1 King's College Circle Toronto, ON, M5S1A8 Canada tel: 416-978-6355 fax: 416-978-4940 milton@spine.med.utoronto.ca From bbarnes at encompass-ts.com Tue Nov 18 11:48:07 2003 From: bbarnes at encompass-ts.com (Ben Barnes) Date: Tue Nov 18 11:48:12 2003 Subject: [Spambayes] (no subject) Message-ID: <0B66FA446CBE5D4CBE3B1812F0C4018ABAF24E@pdxex01.encompass-ts.com> When I click the SpamBayes icon in outlook, the drop down menu is a 2cm by 2cm square which is blank... this just started when I retbooted my box... any clue? From oscanuch at prolog.net Tue Nov 18 14:35:49 2003 From: oscanuch at prolog.net (oscanuch@prolog.net) Date: Tue Nov 18 14:28:41 2003 Subject: [Spambayes] Question - Deleted mail not coming of the server (Win 2000) Message-ID: Hi, I have been using your latest version of Spambayes for a few weeks now to manage the junk mail on several of my PC's that run Outlook. I have noticed that the mail I am deleting as junk mail is staying on the pop server. If I stop Spambayes and delete the messages manually I see no such delay. I am familiar with my outlook settings and have confirmed that all messages deleted from the filtered pop accounts are set to be removed from the server when the deleted items folder is emptied. Can you give me some advise to correct this delay problem? Thanks, Michael. From tommy at ilm.com Tue Nov 18 14:33:42 2003 From: tommy at ilm.com (Tommy Burnette) Date: Tue Nov 18 14:33:51 2003 Subject: [Spambayes] sb_mboxtrain.py reformatting email? In-Reply-To: <16307.58409.581150.801884@evoke.lucasdigital.com> References: <16307.58409.581150.801884@evoke.lucasdigital.com> Message-ID: <16314.29718.355927.546956@evoke.lucasdigital.com> Hi Again, it's been a few days since I sent this... if there are no options for keeping this from happening, perhaps someone can point me to where I can change it in my own copy of the source? thanks again, Tommy Burnette writes: | Hi Folks, | | I've been very happily using spambayes with xemacs & vim for about a | month now. When the latest version was announced I switched right up | to it, but since then I've noticed a behaviour in sb_mboxtrain.py that | is a bit of a pain. (no idea if this is new or not, I just notcied it | recently). | | basically, when sb_mboxtrain.py rewrites the mbox to include its | "X-Spambayes-Trained: ham" lines, it is re-formatting the mail | headers. This wreaks havoc on the display in vm's summary mode and is | a bit of a pain to fix. | | e.g. this line in a mail header: | | X-VM-v5-Data: ([nil nil nil nil nil nil nil nil nil] | ["31722" "Tue" "12" "November" "2002" "15:05:56" "-0800" "Gayle Peterson" "gayle@lucasdigital.com" nil "457" "2003 Holiday Schedule" "^From:" nil nil "11" nil nil nil nil nil] | nil) | | is being transformed into this: | | X-VM-v5-Data: ([nil nil nil nil nil nil nil nil nil] | ["31722" "Tue" "12" "November" "2002" "15:05:56" "-0800" "Gayle | Peterson" "gayle@lucasdigital.com" nil "457" "2003 Holiday | Schedule" "^From:" nil nil "11" nil nil nil nil nil] nil) | | which, while neatly formatted, has introduced newlines into places | like the From name, etc. and makes them display as: | | From: Gayle | Peterson Subject: 2003 Holiday | Schedule | | instead of: | | From: Gayle Peterson Subject: 2003 Holiday Schedule | | in vm. is there an option to either make sb_mboxtrain.py not rewrite | the mbox file, or at least make it not re-format when it does? | | thanks in advance, | | | _______________________________________________ | Spambayes@python.org | http://mail.python.org/mailman/listinfo/spambayes | Check the FAQ before asking: http://spambayes.sf.net/faq.html From skip at pobox.com Tue Nov 18 15:03:57 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Nov 18 15:04:14 2003 Subject: [Spambayes] sb_mboxtrain.py reformatting email? In-Reply-To: <16314.29718.355927.546956@evoke.lucasdigital.com> References: <16307.58409.581150.801884@evoke.lucasdigital.com> <16314.29718.355927.546956@evoke.lucasdigital.com> Message-ID: <16314.31533.853719.484196@montanaro.dyndns.org> Tommy> it's been a few days since I sent this... if there are no Tommy> options for keeping this from happening, perhaps someone can Tommy> point me to where I can change it in my own copy of the source? Tommy, I would have thought someone with stronger credentials in this area would have responded, so I didn't respond to your original post. I think the place you need to look for this is in the email package, probably starting with the Generator module. Depending what version of Python you're using, that will have either come with your Python distro or be one that you downloaded (to get a recent enough version for SpamBayes' use). I suspect it doesn't pay attention to headers which were already folded when it parses the message headers. If/when you find bugs, you probably will want to post them to (at least) the mimelib SF project, and possibly the Python bugs page as well. Skip From kennypitt at hotmail.com Tue Nov 18 15:09:37 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 18 15:10:59 2003 Subject: [Spambayes] Question - Deleted mail not coming of the server (Win2000) In-Reply-To: Message-ID: oscanuch@prolog.net wrote: > I have been using your latest version of Spambayes for a few > weeks now to manage the junk mail on several of my PC's that run > Outlook. I have noticed that the mail I am deleting as junk mail is > staying on the pop server. If I stop Spambayes and delete the > messages manually I see no such delay. I am familiar with my outlook > settings and have confirmed that all messages deleted from the > filtered pop accounts are set to be removed from the server when the > deleted items folder is emptied. Can you give me some advise to > correct this delay problem? SpamBayes does not put items that it classifies as spam, or items that you delete as spam via the toolbar, into the Deleted Items folder. They are moved to a "quarantine" folder, and you can find the name of this folder by looking on the Filtering tab in SpamBayes Manager. If you have Outlook configured to only remove messages from your POP server when Deleted Items is emptied, then you first need to delete the spam messages from the quarantine folder before you empty Deleted Items. -- Kenny Pitt From rayc at wolfagency.com Tue Nov 18 15:09:47 2003 From: rayc at wolfagency.com (Ray Caputo) Date: Tue Nov 18 15:11:25 2003 Subject: [Spambayes] Folder Problem Message-ID: I've loaded SpamBayes and processed over 2,000 e-mails. Today I decided to delete the contents of the "junk" folder, but deleted the folder itself. I've added a new "junk" folder, but when I try to move an e-mail from the "suspect" folder to it, I get a message that the Spam folder needs to be configured. How do I do this? Thanks. Ray Caputo, President Wolf Agency Inc. From rayc at wolfagency.com Tue Nov 18 15:15:41 2003 From: rayc at wolfagency.com (Ray Caputo) Date: Tue Nov 18 15:17:18 2003 Subject: [Spambayes] Folder problem Message-ID: Please ignore the previous message, I figured out how to configure the folder just after I sent the message. Thanks. Ray Caputo, President Wolf Agency Inc. From sdarisi at imap2.asu.edu Tue Nov 18 16:13:40 2003 From: sdarisi at imap2.asu.edu (Sanjay Darisi) Date: Tue Nov 18 16:06:45 2003 Subject: [Spambayes] Few quick questions! Message-ID: <3FBA8B84.1090309@imap2.asu.edu> [Sanjay] >/ I know how to do this manually. But I would like to do it in addin />/ itself. Like after installing the addin it has to append a signature />/ to every outgoing email. How can I do this? / [Richie] > That's a strange requirement - what do you want the signature to say, if > you don't mind me asking? Just like that, Like spambayes is protecting my inbox...How to do that? sanjay From tameyer at ihug.co.nz Tue Nov 18 17:21:00 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 18 17:18:41 2003 Subject: [Spambayes] sb_mboxtrain.py reformatting email? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304206559@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B141@its-xchg4.massey.ac.nz> [...] > If/when you find bugs, you probably will want to post them to > (at least) the mimelib SF project, and possibly the Python > bugs page as well. Correct me if I'm wrong, but isn't the mimelib SF project basically closed now? I think that bugs are meant to be reported via the Python bugs page, and probably discussed on the email-sig (http://www.python.org/sigs/email-sig). =Tony Meyer From skip at pobox.com Tue Nov 18 17:20:35 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Nov 18 17:20:56 2003 Subject: [Spambayes] sb_mboxtrain.py reformatting email? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130212B141@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1304206559@its-xchg4.massey.ac.nz> <1ED4ECF91CDED24C8D012BCF2B034F130212B141@its-xchg4.massey.ac.nz> Message-ID: <16314.39731.23362.800676@montanaro.dyndns.org> >>>>> "Tony" == Tony Meyer writes: Tony> [...] >> If/when you find bugs, you probably will want to post them to (at >> least) the mimelib SF project, and possibly the Python bugs page as >> well. Tony> Correct me if I'm wrong, but isn't the mimelib SF project Tony> basically closed now? I think that bugs are meant to be reported Tony> via the Python bugs page, and probably discussed on the email-sig Tony> (http://www.python.org/sigs/email-sig). Could be. Barry's the authority on that (cc'd). Skip From tameyer at ihug.co.nz Tue Nov 18 17:38:34 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 18 17:37:38 2003 Subject: [Spambayes] ?? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13042064BA@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B145@its-xchg4.massey.ac.nz> > Can I tag more than one file at a time to "Delete as Spam"? Yes. Hint: The easy way to find this out would have simply been to try it. You'd find that it works. =Tony Meyer From tameyer at ihug.co.nz Tue Nov 18 17:47:30 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 18 17:44:59 2003 Subject: [Spambayes] How to tell SpamBayes to check more headers (using MSOplugin) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130415C8EB@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29D5@its-xchg4.massey.ac.nz> > Is there a way, without hacking the source, to tell SpamBayes > to examine more of the headers in mails? In particular these > ones, added by SpamAssassin and SpamCop Mail: You can add to the 'safe_headers' option (in the 'Tokenizer' section). This means that you'll get tokens generated indicated the number of times a header line appears in the mail (so you'd get a "header:x-spam-level:1" token, for example), and possibly (depending on your settings), a token if the header is absent (like "noheader:x-spam-status"). > X-Spam-Level: ********* > X-Spam-Status: hits=9.7 tests=ADDR_NUMS_AT_BIGSITE,BIZ_TLD, > DATE_IN_FUTURE_12_24,FORGED_HOTMAIL_RCVD,FORGED_MUA_OUTLOOK, > MSGID_FROM_MTA_HOTMAIL,USERPASS version=2.60 > X-SpamCop-Disposition: Blocked bl.spamcop.net However, if these headers appear for both ham and spam, and it's just the contents of the headers that varies, then this isn't going to do you much good. In that case, you'd have to add code to tokenizer.py (in the tokenize_headers function) to add specific tokens for SpamAssassin and SpamCop. (If you do, please submit a patch). There has been discussion of this sort of thing before (googling though the archives would probably find it easily enough); IIRC, there were arguments that the classifications should stay separate, and it would be more effective to have a final rule based on a combination of the classifications. > I'm using the Outlook plugin if that makes a difference. It means that you change your (low-level) options via the default_bayes_customize.ini file in your data directory (not the one in the program files directory). =Tony Meyer From richie at entrian.com Tue Nov 18 18:13:13 2003 From: richie at entrian.com (Richie Hindle) Date: Tue Nov 18 18:13:39 2003 Subject: [Spambayes] Few quick questions! In-Reply-To: <3FBA8B84.1090309@imap2.asu.edu> References: <3FBA8B84.1090309@imap2.asu.edu> Message-ID: [Sanjay] > [...] after installing the addin it has to append a signature > to every outgoing email. How can I do this? [Richie] > That's a strange requirement - what do you want the signature to say, if > you don't mind me asking? [Sanjay] > Just like that, Like spambayes is protecting my inbox...How to do that? Yay, free advertising for us! 8-) Sadly I have no idea how you can do this. -- Richie Hindle richie@entrian.com From gflakus at sccd.ctc.edu Tue Nov 18 19:22:29 2003 From: gflakus at sccd.ctc.edu (Flakus, Gayle) Date: Tue Nov 18 19:19:38 2003 Subject: [Spambayes] Recall: Missing e-mail Message-ID: <4B73F518F0F98A4EA67DDAA3DB84941A0119F1DB@SCCDMAIL.SCCD.CTC.EDU> Flakus, Gayle would like to recall the message, "Missing e-mail". From tommy at ilm.com Tue Nov 18 20:20:38 2003 From: tommy at ilm.com (Tommy Burnette) Date: Tue Nov 18 20:20:48 2003 Subject: [Spambayes] sb_mboxtrain.py reformatting email? In-Reply-To: <16314.39731.23362.800676@montanaro.dyndns.org> References: <1ED4ECF91CDED24C8D012BCF2B034F1304206559@its-xchg4.massey.ac.nz> <1ED4ECF91CDED24C8D012BCF2B034F130212B141@its-xchg4.massey.ac.nz> <16314.39731.23362.800676@montanaro.dyndns.org> Message-ID: <16314.50534.892315.807354@evoke.lucasdigital.com> Luckily I've known him long enough that I just email him directly immediately after I heard from you, Skip :) thanks, Skip Montanaro writes: | >>>>> "Tony" == Tony Meyer writes: | | Tony> [...] | >> If/when you find bugs, you probably will want to post them to (at | >> least) the mimelib SF project, and possibly the Python bugs page as | >> well. | | Tony> Correct me if I'm wrong, but isn't the mimelib SF project | Tony> basically closed now? I think that bugs are meant to be reported | Tony> via the Python bugs page, and probably discussed on the email-sig | Tony> (http://www.python.org/sigs/email-sig). | | Could be. Barry's the authority on that (cc'd). | | Skip From tameyer at ihug.co.nz Tue Nov 18 21:03:58 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 18 21:01:26 2003 Subject: [Spambayes] Few quick questions! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13042065DB@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29D6@its-xchg4.massey.ac.nz> [Sanjay] > [...] after installing the addin it has to append a > signature to every outgoing email. How can I do this? [Richie] > That's a strange requirement - what do you want the > signature to say, if you don't mind me asking? [Sanjay] > Just like that, Like spambayes is protecting my inbox...How to do > that? [Richie] > Yay, free advertising for us! 8-) Sadly I have no idea how > you can do this. The SpamBayes Outlook plug-in doesn't touch outgoing mail at all, so this isn't a simple task at all. If you're familiar with MAPI/Outlook and Python you could probably add this to the plug-in, but it'll probably be a fair bit of work. (You *might* be able to 'watch' the Outbox and alter messages that appear in there before they actually get sent, but I don't really know if this would work, or if there would be a better way). Googling for information about doing this with Outlook would probably come up with stuff, and then you could integrate it into the plug-in refering to the existing code. All-in-all, though, it would be a hundred times (at least) simpler to just use Outlook's existing tools to add a signature. Is there any reason you can't do that? =Tony Meyer From barry at python.org Tue Nov 18 21:02:48 2003 From: barry at python.org (Barry Warsaw) Date: Tue Nov 18 21:02:56 2003 Subject: [Spambayes] sb_mboxtrain.py reformatting email? In-Reply-To: <16314.39731.23362.800676@montanaro.dyndns.org> References: <1ED4ECF91CDED24C8D012BCF2B034F1304206559@its-xchg4.massey.ac.nz> <1ED4ECF91CDED24C8D012BCF2B034F130212B141@its-xchg4.massey.ac.nz> <16314.39731.23362.800676@montanaro.dyndns.org> Message-ID: <1069207367.23962.19.camel@anthem> On Tue, 2003-11-18 at 17:20, Skip Montanaro wrote: > >>>>> "Tony" == Tony Meyer writes: > > Tony> [...] > >> If/when you find bugs, you probably will want to post them to (at > >> least) the mimelib SF project, and possibly the Python bugs page as > >> well. > > Tony> Correct me if I'm wrong, but isn't the mimelib SF project > Tony> basically closed now? I think that bugs are meant to be reported > Tony> via the Python bugs page, and probably discussed on the email-sig > Tony> (http://www.python.org/sigs/email-sig). > > Could be. Barry's the authority on that (cc'd). Tony's right on both counts, but ever since Tommy secretly embedded that tiny image of me in The Hulk's navel, I've owed him one. I'll reply to Tommy's private message, and he's free to forward it along. -Barry From rayparker at sasktel.net Tue Nov 18 22:09:39 2003 From: rayparker at sasktel.net (Raymond Parker) Date: Tue Nov 18 22:11:13 2003 Subject: [Spambayes] outlook express Message-ID: <000801c3ae4a$9b5d45e0$43e16e40@e7r8x3> Am I correct in inferring that this system will not work on Outlook Express? From tameyer at ihug.co.nz Tue Nov 18 22:17:37 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 18 22:15:07 2003 Subject: [Spambayes] outlook express In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130420665B@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B14B@its-xchg4.massey.ac.nz> > Am I correct in inferring that this system will not work on > Outlook Express? No. See: =Tony Meyer From jmcatee at mediaodyssey.com Tue Nov 18 22:53:45 2003 From: jmcatee at mediaodyssey.com (Jim McAtee) Date: Tue Nov 18 22:53:50 2003 Subject: [Spambayes] outlook express References: <1ED4ECF91CDED24C8D012BCF2B034F130212B14B@its-xchg4.massey.ac.nz> Message-ID: <057801c3ae50$bcc04600$1c03a8c0@jim> Somewhere in the documentation, probably for the POP3 proxy, it's stated that the forwarding (to the SMTP proxy) method of training the server doesn't work with Outlook Express because OE may not forward all headers. I was wondering if anyone has reviewed this lately for the most recent version(s) of Outlook Express to see if it's still a valid statement. And what are the problems caused in the training when these headers are lost? I'd like to use this method to do the occasional training of Spambayes, unless it's unwise. I get over 1000 messages a day (mostly from technical mailing lists) with about 30% spam. Using the 'Review messages' web page with cached messages is getting more and more unworkable with this volume of mail. ----- Original Message ----- From: "Tony Meyer" To: "'Raymond Parker'" ; Sent: Tuesday, November 18, 2003 8:17 PM Subject: RE: [Spambayes] outlook express > Am I correct in inferring that this system will not work on > Outlook Express? No. See: =Tony Meyer From kjirstin at earthlink.net Tue Nov 18 23:56:33 2003 From: kjirstin at earthlink.net (kjirstin@earthlink.net) Date: Tue Nov 18 23:56:40 2003 Subject: [Spambayes] Help! Message-ID: I am totally right-brained, and I HATE THIS PROGRAM!!! My darling husband installed it, then he went out of town!!! All I need is a simple way of moving my mail that isn?t spam, but that your program thinks is spam, into my email, so I can respond to it. This should be fairly simple. It is impossible! I?ve tried adding the address to my address book, which you have now changed the name to ?contacts?. This is not working, even when I type it?cut and paste is totally impossible. Many of these emails are from companies I do work for, and I must respond in 24 hours or lose the job! Help! k- P.S. A yellow smiley face showed up once that allowed me to capture two of my emails, but I can?t seem to get it back. From tameyer at ihug.co.nz Wed Nov 19 00:52:05 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Nov 19 00:49:36 2003 Subject: [Spambayes] outlook express In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130420667D@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29DA@its-xchg4.massey.ac.nz> > Somewhere in the documentation, probably for the POP3 proxy, > it's stated that the forwarding (to the SMTP proxy) method of > training the server doesn't work with Outlook Express because > OE may not forward all headers. I was wondering if anyone > has reviewed this lately for the most recent version(s) of > Outlook Express to see if it's still a valid statement. Yes, it is definitely still valid. Realistically, it probably always will be - Microsoft doesn't really have any reason to improve OE; it's only there to give people a free option - if you want something good (sic?), you need Outlook. The only way around this that I can see is once again offer to include the spambayes id in the message body (which is included in forwarded messages, obviously). This is particularly inelegant, though, and at the time it was removed (because it didn't really fit when code was refactored), not all that many people used the SMTP proxy. > And what are the problems caused in the training when these > headers are lost? The spambayes id is not included (it's one of the headers added by the proxy), so spambayes can't find the message in the cache to do the appropriate training. You could manually add write the id into the message body, but that seems like it would be even less convenient. The SMTP proxy can simply train on the raw message forwarded to it, rather than looking up the message in the cache, but this will more than likely result in poor training - for example, all messages will be 'from' you, and all the useful data in the headers will be unavailable for training. > I'd like to use this method to do the > occasional training of Spambayes, unless it's unwise. It is definitely unwise. > I get over 1000 messages a day (mostly from technical mailing > lists) with about 30% spam. Using the 'Review messages' web > page with cached messages is getting more and more unworkable > with this volume of mail. Do you still need to train? I would have thought that you'd get pretty acceptable results quite quickly with that volume of mail. On positive notes: * Versions of spambayes post-1.0 have additional options for reviewing messages via the web interface. In particular, you can limit the number of messages that appear on one screen (maybe this made it into 1.0a7?), and you can change the default training for messages classified as ham/spam/unsure. * I really will finish the sb_pop3dnd.py script soon. This lets you do drag'n'drop training in any mailer that lets you drag'n'drop mail between accounts, and works with both POP3 and IMAP (this includes OE). The main reason it's taken so long to finish is that Twisted's (a communications toolkit for Python) imap support has been pretty poor, and extremely volatile - hopefully that has changed by now. There are probably other ways you could train, too. For example, if you are only training on a small percentage of mail (which is presumably the case), then you could store copies of mail to be trained in a couple of folders specifically for the task. Once in a while (every week, say), you could use the web interface to train on the folder's dbx file, clear out the folder, and train that way. =Tony Meyer From tameyer at ihug.co.nz Wed Nov 19 00:59:40 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Nov 19 00:57:09 2003 Subject: [Spambayes] Help! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304206697@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29DB@its-xchg4.massey.ac.nz> > I am totally right-brained, and I HATE THIS PROGRAM!!! > My darling husband installed it, then he went out of town!!! I think your displeasure is aimed at the wrong thing here . > All I need is a simple way of moving my mail that isn't spam, > but that your program thinks is spam, into my email, so I > can respond to it. It sounds like you're using Outlook - (i.e. at the top of the mail program's window, it will end with "Microsoft Outlook"); if that's not the case, then much of this will be incorrect. If the mail is classified as spam, it will be moved to a different folder. Once it's there, though, you can treat it like any other mail, including replying to it (just like you would any other mail). However, to teach the program that it is in fact good mail, you should be able find the little yellow face you mentioned (with "Recover from Spam" next to it) and click on that, which will move the (hilited) message back to it's original folder. This button should be up in the toolbars somewhere (exactly where depends on your setup). If you can't see it, maybe it's hidden in a drop-down menu on the toolbar? (it would have a little down arrow at the end of the toolbar). > I've tried adding the address to my address book, which > you have now changed the name to "contacts". This is quite confusing! Adding the address book to what? SpamBayes doesn't use the address book at all, so whatever it is, that won't help with anything. We don't change the name, either, so that's as a result of something else (the address book should still be there under the 'Tools' menu). One thing that does spring to mind as a possibility is that your husband changed you from using "Outlook Express" to "Outlook" (OE lists addresses under "Address Book", or something like that, whereas Outlook does have a "Contacts" folder). In that case, then the real issue is getting used to a new mail program - it should be mostly like the old one, though, and the program help might be of use. =Tony Meyer From Mark.Howells at softoption.com Wed Nov 19 04:43:11 2003 From: Mark.Howells at softoption.com (Mark Howells) Date: Wed Nov 19 04:44:35 2003 Subject: [Spambayes] Install problem Outlook 2002// spambayes_addin.dll//otheraddin present Message-ID: <5846CF419D2EF5439036CC3126A3A995017B39@SOSERVER1.softoption.local> > -----Original Message----- > From: Milton Charlton [mailto:milton@spine.med.utoronto.ca] > Sent: 18 November 2003 16:24 > > When I try to install Spambayes to Outlook 2002 I get this error that > concerns > the file spambayes_addin.dll: > Unable to register the DLL/OCX: DllRegisterServer;failed; > code 0x00000000. I routinely get this (on XP) and each time I immediately re-install and the install works. YMMV Mark Howells -- Outgoing mail is certified Virus Free. Checked by AVG Anti-Virus (http://www.grisoft.com). Version: 7.0.195 / Virus Database: 261.2.0 - Release Date: 11/18/2003 From gerrit at nl.linux.org Wed Nov 19 06:54:24 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Wed Nov 19 06:54:43 2003 Subject: [Spambayes] AssertionError: hamcount <= nham Message-ID: <20031119115424.GA4439@nl.linux.org> Hi, whenever I untrain a spam message accidantly trained as ham, by database gets corrupted. I get the following error message: Traceback (most recent call last): File "/usr/local/bin/sb_filter.py", line 203, in ? main() File "/usr/local/bin/sb_filter.py", line 199, in main action(msg) File "/usr/local/bin/sb_filter.py", line 132, in filter return h.filter(msg) File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 109, in filter prob, clues = self._scoremsg(msg, True) File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 38, in _scoremsg return self.bayes.spamprob(tokenize(msg), evidence) File "/usr/local/lib/python2.3/site-packages/spambayes/classifier.py", line 158, in chi2_spamprob clues = self._getclues(wordstream) File "/usr/local/lib/python2.3/site-packages/spambayes/classifier.py", line 395, in _getclues prob = self.probability(record) File "/usr/local/lib/python2.3/site-packages/spambayes/classifier.py", line 242, in probability assert hamcount <= nham AssertionError It makes no difference whether I use DataBase or pickle format. Is there any solution apart from restarting the whole database? This would mean I would have to restart the whole database after each erroneous training. The latter is what I'm doing now :( yours, Gerrit. -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ From gerrit at nl.linux.org Wed Nov 19 06:59:03 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Wed Nov 19 06:59:23 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <20031119115424.GA4439@nl.linux.org> References: <20031119115424.GA4439@nl.linux.org> Message-ID: <20031119115903.GA4549@nl.linux.org> Gerrit Holl wrote: > whenever I untrain a spam message accidantly trained as ham, my > database gets corrupted. I get the following error message: Some extra information: 12:58:03:8409:gerrit@topjaklont:~$ python /home/gerrit/cvs/spambayes/utilities/which_database.py Pickle is available. Dumbdbm is available. Dbhash is available. Bsddb[3] is available. Your storage /home/gerrit/hammie.db is a: bsddb[3] yours, Gerrit. -- 101. If there are no mercantile arrangements in the place whither he went, he shall leave the entire amount of money which he received with the broker to give to the merchant. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ From dreed at srdcorp.com Wed Nov 19 08:47:16 2003 From: dreed at srdcorp.com (Reed, David) Date: Wed Nov 19 08:50:25 2003 Subject: [Spambayes] SpamBayes and Outlook 2003 Message-ID: <4C542552EF434E43B2F2572854A73CBD6F712B@corp-mail1.srdcorp.com> Hi Everyone, I attended the Microsoft Product Launch for Office 2003 yesterday, and came right back to work and installed the new version of Office on my test system at work. I'm pleased to report that the SpamBayes plug-in (current version), .81, seems to be working flawlessly with Outlook 2003, in a Corporate/Workgroup mode. Regards, David Reed From dreed at srdcorp.com Wed Nov 19 08:47:55 2003 From: dreed at srdcorp.com (Reed, David) Date: Wed Nov 19 08:51:00 2003 Subject: [Spambayes] [REPOST] Spambayes Outlook Plugin (Latest) unresponsive Message-ID: <4C542552EF434E43B2F2572854A73CBD6F712C@corp-mail1.srdcorp.com> Hi All, I installed the SpamBayes Outlook plugin several months ago for our Company President. It's been working, mostly, since then. Oh sure, a few times it has stopped working, and I'd go fix it. Well, today it stopped working. I upgraded it to the latest Outlook Plugin (.081?), and retrained it. However, it still isn't responding. I uninstalled it and reinstalled it. Still not change. No response. I can't even select an email and "DELETE AS SPAM", because it just doesn't. It's totally unresponsive to anything. It only seems to be his computer (the Company President, naturally, who isn't a computer guy at all, so he has NO patience for computer problems), and I am seemingly powerless to do anything about it. Has anyone else got any idea what might be wrong? Like I say, I've deleted it and reinstalled it, I've upgraded it, and I've retrained it already. I can't select an email and DELETE AS SPAM either. All ideas and suggestions welcome! Thanks, David From mlacrosse at nmcaa.org Wed Nov 19 09:06:39 2003 From: mlacrosse at nmcaa.org (Matthew LaCrosse) Date: Wed Nov 19 09:07:36 2003 Subject: [Spambayes] Almost everything goes to Junk Suspects even after training. In-Reply-To: Message-ID: I can't seem to figure out why it keeps sending almost everything to Junk Suspects even after doing "Recover from Spam" it does send it to my inbox but it never seems to learn what it is that always should go to the Inbox. Any ideas? Matt -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of spambayes-request@python.org Sent: Wednesday, November 19, 2003 3:45 AM To: spambayes@python.org Subject: Spambayes Digest, Vol 63, Issue 48 Send Spambayes mailing list submissions to spambayes@python.org To subscribe or unsubscribe via the World Wide Web, visit http://mail.python.org/mailman/listinfo/spambayes or, via email, send a message with subject or body 'help' to spambayes-request@python.org You can reach the person managing the list at spambayes-owner@python.org When replying, please edit your Subject line so it is more specific than "Re: Contents of Spambayes digest..." Today's Topics: 1. Recall: Missing e-mail (Flakus, Gayle) 2. RE: sb_mboxtrain.py reformatting email? (Tommy Burnette) 3. RE: Few quick questions! (Tony Meyer) 4. RE: sb_mboxtrain.py reformatting email? (Barry Warsaw) 5. outlook express (Raymond Parker) 6. RE: outlook express (Tony Meyer) 7. Re: outlook express (Jim McAtee) 8. Help! (kjirstin@earthlink.net) 9. RE: outlook express (Tony Meyer) 10. RE: Help! (Tony Meyer) 11. RE: Install problem Outlook 2002// spambayes_addin.dll//otheraddin present (Mark Howells) ---------------------------------------------------------------------- Message: 1 Date: Tue, 18 Nov 2003 16:22:29 -0800 From: "Flakus, Gayle" Subject: [Spambayes] Recall: Missing e-mail To: Message-ID: <4B73F518F0F98A4EA67DDAA3DB84941A0119F1DB@SCCDMAIL.SCCD.CTC.EDU> Content-Type: text/plain; charset="us-ascii" Flakus, Gayle would like to recall the message, "Missing e-mail". ------------------------------ Message: 2 Date: Tue, 18 Nov 2003 17:20:38 -0800 From: Tommy Burnette Subject: RE: [Spambayes] sb_mboxtrain.py reformatting email? To: skip@pobox.com Cc: barry@python.org, spambayes@python.org Message-ID: <16314.50534.892315.807354@evoke.lucasdigital.com> Content-Type: text/plain; charset=us-ascii Luckily I've known him long enough that I just email him directly immediately after I heard from you, Skip :) thanks, Skip Montanaro writes: | >>>>> "Tony" == Tony Meyer writes: | | Tony> [...] | >> If/when you find bugs, you probably will want to post them to (at | >> least) the mimelib SF project, and possibly the Python bugs page as | >> well. | | Tony> Correct me if I'm wrong, but isn't the mimelib SF project | Tony> basically closed now? I think that bugs are meant to be reported | Tony> via the Python bugs page, and probably discussed on the email-sig | Tony> (http://www.python.org/sigs/email-sig). | | Could be. Barry's the authority on that (cc'd). | | Skip ------------------------------ Message: 3 Date: Wed, 19 Nov 2003 15:03:58 +1300 From: "Tony Meyer" Subject: RE: [Spambayes] Few quick questions! To: Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29D6@its-xchg4.massey.ac.nz> Content-Type: text/plain; charset="US-ASCII" [Sanjay] > [...] after installing the addin it has to append a > signature to every outgoing email. How can I do this? [Richie] > That's a strange requirement - what do you want the > signature to say, if you don't mind me asking? [Sanjay] > Just like that, Like spambayes is protecting my inbox...How to do > that? [Richie] > Yay, free advertising for us! 8-) Sadly I have no idea how > you can do this. The SpamBayes Outlook plug-in doesn't touch outgoing mail at all, so this isn't a simple task at all. If you're familiar with MAPI/Outlook and Python you could probably add this to the plug-in, but it'll probably be a fair bit of work. (You *might* be able to 'watch' the Outbox and alter messages that appear in there before they actually get sent, but I don't really know if this would work, or if there would be a better way). Googling for information about doing this with Outlook would probably come up with stuff, and then you could integrate it into the plug-in refering to the existing code. All-in-all, though, it would be a hundred times (at least) simpler to just use Outlook's existing tools to add a signature. Is there any reason you can't do that? =Tony Meyer ------------------------------ Message: 4 Date: Tue, 18 Nov 2003 21:02:48 -0500 From: Barry Warsaw Subject: RE: [Spambayes] sb_mboxtrain.py reformatting email? To: Skip Montanaro Cc: spambayes@python.org Message-ID: <1069207367.23962.19.camel@anthem> Content-Type: text/plain On Tue, 2003-11-18 at 17:20, Skip Montanaro wrote: > >>>>> "Tony" == Tony Meyer writes: > > Tony> [...] > >> If/when you find bugs, you probably will want to post them to (at > >> least) the mimelib SF project, and possibly the Python bugs page as > >> well. > > Tony> Correct me if I'm wrong, but isn't the mimelib SF project > Tony> basically closed now? I think that bugs are meant to be reported > Tony> via the Python bugs page, and probably discussed on the email-sig > Tony> (http://www.python.org/sigs/email-sig). > > Could be. Barry's the authority on that (cc'd). Tony's right on both counts, but ever since Tommy secretly embedded that tiny image of me in The Hulk's navel, I've owed him one. I'll reply to Tommy's private message, and he's free to forward it along. -Barry ------------------------------ Message: 5 Date: Tue, 18 Nov 2003 21:09:39 -0600 From: Raymond Parker Subject: [Spambayes] outlook express To: spambayes@python.org Message-ID: <000801c3ae4a$9b5d45e0$43e16e40@e7r8x3> Content-Type: text/plain; charset=iso-8859-1 Am I correct in inferring that this system will not work on Outlook Express? ------------------------------ Message: 6 Date: Wed, 19 Nov 2003 16:17:37 +1300 From: "Tony Meyer" Subject: RE: [Spambayes] outlook express To: "'Raymond Parker'" , Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B14B@its-xchg4.massey.ac.nz> Content-Type: text/plain; charset="US-ASCII" > Am I correct in inferring that this system will not work on > Outlook Express? No. See: =Tony Meyer ------------------------------ Message: 7 Date: Tue, 18 Nov 2003 20:53:45 -0700 From: "Jim McAtee" Subject: Re: [Spambayes] outlook express To: Message-ID: <057801c3ae50$bcc04600$1c03a8c0@jim> Content-Type: text/plain; charset="iso-8859-1" Somewhere in the documentation, probably for the POP3 proxy, it's stated that the forwarding (to the SMTP proxy) method of training the server doesn't work with Outlook Express because OE may not forward all headers. I was wondering if anyone has reviewed this lately for the most recent version(s) of Outlook Express to see if it's still a valid statement. And what are the problems caused in the training when these headers are lost? I'd like to use this method to do the occasional training of Spambayes, unless it's unwise. I get over 1000 messages a day (mostly from technical mailing lists) with about 30% spam. Using the 'Review messages' web page with cached messages is getting more and more unworkable with this volume of mail. ----- Original Message ----- From: "Tony Meyer" To: "'Raymond Parker'" ; Sent: Tuesday, November 18, 2003 8:17 PM Subject: RE: [Spambayes] outlook express > Am I correct in inferring that this system will not work on > Outlook Express? No. See: =Tony Meyer ------------------------------ Message: 8 Date: Tue, 18 Nov 2003 21:56:33 -0700 From: kjirstin@earthlink.net Subject: [Spambayes] Help! To: spambayes@python.org Message-ID: Content-Type: text/plain; charset="Windows-1252" I am totally right-brained, and I HATE THIS PROGRAM!!! My darling husband installed it, then he went out of town!!! All I need is a simple way of moving my mail that isnt spam, but that your program thinks is spam, into my email, so I can respond to it. This should be fairly simple. It is impossible! Ive tried adding the address to my address book, which you have now changed the name to contacts. This is not working, even when I type itcut and paste is totally impossible. Many of these emails are from companies I do work for, and I must respond in 24 hours or lose the job! Help! k- P.S. A yellow smiley face showed up once that allowed me to capture two of my emails, but I cant seem to get it back. ------------------------------ Message: 9 Date: Wed, 19 Nov 2003 18:52:05 +1300 From: "Tony Meyer" Subject: RE: [Spambayes] outlook express To: "'Jim McAtee'" , Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29DA@its-xchg4.massey.ac.nz> Content-Type: text/plain; charset="US-ASCII" > Somewhere in the documentation, probably for the POP3 proxy, > it's stated that the forwarding (to the SMTP proxy) method of > training the server doesn't work with Outlook Express because > OE may not forward all headers. I was wondering if anyone > has reviewed this lately for the most recent version(s) of > Outlook Express to see if it's still a valid statement. Yes, it is definitely still valid. Realistically, it probably always will be - Microsoft doesn't really have any reason to improve OE; it's only there to give people a free option - if you want something good (sic?), you need Outlook. The only way around this that I can see is once again offer to include the spambayes id in the message body (which is included in forwarded messages, obviously). This is particularly inelegant, though, and at the time it was removed (because it didn't really fit when code was refactored), not all that many people used the SMTP proxy. > And what are the problems caused in the training when these > headers are lost? The spambayes id is not included (it's one of the headers added by the proxy), so spambayes can't find the message in the cache to do the appropriate training. You could manually add write the id into the message body, but that seems like it would be even less convenient. The SMTP proxy can simply train on the raw message forwarded to it, rather than looking up the message in the cache, but this will more than likely result in poor training - for example, all messages will be 'from' you, and all the useful data in the headers will be unavailable for training. > I'd like to use this method to do the > occasional training of Spambayes, unless it's unwise. It is definitely unwise. > I get over 1000 messages a day (mostly from technical mailing > lists) with about 30% spam. Using the 'Review messages' web > page with cached messages is getting more and more unworkable > with this volume of mail. Do you still need to train? I would have thought that you'd get pretty acceptable results quite quickly with that volume of mail. On positive notes: * Versions of spambayes post-1.0 have additional options for reviewing messages via the web interface. In particular, you can limit the number of messages that appear on one screen (maybe this made it into 1.0a7?), and you can change the default training for messages classified as ham/spam/unsure. * I really will finish the sb_pop3dnd.py script soon. This lets you do drag'n'drop training in any mailer that lets you drag'n'drop mail between accounts, and works with both POP3 and IMAP (this includes OE). The main reason it's taken so long to finish is that Twisted's (a communications toolkit for Python) imap support has been pretty poor, and extremely volatile - hopefully that has changed by now. There are probably other ways you could train, too. For example, if you are only training on a small percentage of mail (which is presumably the case), then you could store copies of mail to be trained in a couple of folders specifically for the task. Once in a while (every week, say), you could use the web interface to train on the folder's dbx file, clear out the folder, and train that way. =Tony Meyer ------------------------------ Message: 10 Date: Wed, 19 Nov 2003 18:59:40 +1300 From: "Tony Meyer" Subject: RE: [Spambayes] Help! To: , Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29DB@its-xchg4.massey.ac.nz> Content-Type: text/plain; charset="US-ASCII" > I am totally right-brained, and I HATE THIS PROGRAM!!! > My darling husband installed it, then he went out of town!!! I think your displeasure is aimed at the wrong thing here . > All I need is a simple way of moving my mail that isn't spam, > but that your program thinks is spam, into my email, so I > can respond to it. It sounds like you're using Outlook - (i.e. at the top of the mail program's window, it will end with "Microsoft Outlook"); if that's not the case, then much of this will be incorrect. If the mail is classified as spam, it will be moved to a different folder. Once it's there, though, you can treat it like any other mail, including replying to it (just like you would any other mail). However, to teach the program that it is in fact good mail, you should be able find the little yellow face you mentioned (with "Recover from Spam" next to it) and click on that, which will move the (hilited) message back to it's original folder. This button should be up in the toolbars somewhere (exactly where depends on your setup). If you can't see it, maybe it's hidden in a drop-down menu on the toolbar? (it would have a little down arrow at the end of the toolbar). > I've tried adding the address to my address book, which > you have now changed the name to "contacts". This is quite confusing! Adding the address book to what? SpamBayes doesn't use the address book at all, so whatever it is, that won't help with anything. We don't change the name, either, so that's as a result of something else (the address book should still be there under the 'Tools' menu). One thing that does spring to mind as a possibility is that your husband changed you from using "Outlook Express" to "Outlook" (OE lists addresses under "Address Book", or something like that, whereas Outlook does have a "Contacts" folder). In that case, then the real issue is getting used to a new mail program - it should be mostly like the old one, though, and the program help might be of use. =Tony Meyer ------------------------------ Message: 11 Date: Wed, 19 Nov 2003 09:43:11 -0000 From: "Mark Howells" Subject: RE: [Spambayes] Install problem Outlook 2002// spambayes_addin.dll//otheraddin present To: , Message-ID: <5846CF419D2EF5439036CC3126A3A995017B39@SOSERVER1.softoption.local> Content-Type: text/plain; charset="iso-8859-1" > -----Original Message----- > From: Milton Charlton [mailto:milton@spine.med.utoronto.ca] > Sent: 18 November 2003 16:24 > > When I try to install Spambayes to Outlook 2002 I get this error that > concerns > the file spambayes_addin.dll: > Unable to register the DLL/OCX: DllRegisterServer;failed; > code 0x00000000. I routinely get this (on XP) and each time I immediately re-install and the install works. YMMV Mark Howells -- Outgoing mail is certified Virus Free. Checked by AVG Anti-Virus (http://www.grisoft.com). Version: 7.0.195 / Virus Database: 261.2.0 - Release Date: 11/18/2003 ------------------------------ _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html End of Spambayes Digest, Vol 63, Issue 48 ***************************************** From papaDoc at videotron.ca Wed Nov 19 09:08:35 2003 From: papaDoc at videotron.ca (papaDoc) Date: Wed Nov 19 09:09:09 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <20031119115424.GA4439@nl.linux.org> References: <20031119115424.GA4439@nl.linux.org> Message-ID: <3FBB7963.5070308@videotron.ca> Hi, >Hi, > >whenever I untrain a spam message accidantly trained as ham, by >database gets corrupted. I get the following error message: > >Traceback (most recent call last): > File "/usr/local/bin/sb_filter.py", line 203, in ? > main() > File "/usr/local/bin/sb_filter.py", line 199, in main > action(msg) > File "/usr/local/bin/sb_filter.py", line 132, in filter > return h.filter(msg) > File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 109, in filter > prob, clues = self._scoremsg(msg, True) > File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 38, in _scoremsg > return self.bayes.spamprob(tokenize(msg), evidence) > File "/usr/local/lib/python2.3/site-packages/spambayes/classifier.py", line 158, in chi2_spamprob > clues = self._getclues(wordstream) > File "/usr/local/lib/python2.3/site-packages/spambayes/classifier.py", line 395, in _getclues > prob = self.probability(record) > File "/usr/local/lib/python2.3/site-packages/spambayes/classifier.py", line 242, in probability > assert hamcount <= nham >AssertionError > > This can occur for all db available. When Spambayes train on a email it is adding a X-Spambayes-train: spam or ham (or something similar) in the header. When you untrain it check this header and remove the email token from the "spam/ham" part of the database. You can have the problem if the email was training with a database then you delete this database. You start with a new database and try to train again with the email you already train on. Since the email still have the X-Spambayes-train header you don't train on with it except if you force the training (the -f options) If later you find a mistake and you untrain. with the above email. Since it was never really train with the later database the hamcount or spamcount can co negative. So start again and force the training on all message sb_mboxtrain -f -g you_ham_mbox -s your_spam_mbox You will be able to correct your mistake. Remi From diego at blbhydraulic.com Wed Nov 19 09:19:57 2003 From: diego at blbhydraulic.com (diego@blbhydraulic.com) Date: Wed Nov 19 09:22:51 2003 Subject: [Spambayes] error in configuration Message-ID: Hi there, I'm tryng to install spambayes on my machine. I have a XP Professional SP1 system, with python 2.3. I need to connect to an Imap Domino server. I get this error when I go to configure folders to filter. What can I do? Thanks, Diego Balbinot 500 Server error Traceback (most recent call last): File "C:\Python23\Lib\site-packages\spambayes\Dibbler.py", line 453, in found_terminator getattr(plugin, name)(**params) File "C:\Python23\Lib\site-packages\spambayes\ImapUI.py", line 163, in onFilterfolders self._login_to_imap() File "C:\Python23\Lib\site-packages\spambayes\ImapUI.py", line 208, in _login_to_imap self.imap.login(username, self.imap_pwd) File "C:\spambayes\spambayes-1.0a7\scripts\sb_imapfilter.py", line 205, in login BaseIMAP.login(self, username, pwd) # superclass login File "C:\Python23\lib\imaplib.py", line 471, in login typ, dat = self._simple_command('LOGIN', user, self._quote(password)) File "C:\Python23\lib\imaplib.py", line 992, in _quote arg = arg.replace('\\', '\\\\') AttributeError: 'tuple' object has no attribute 'replace' From gerrit at nl.linux.org Wed Nov 19 09:23:03 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Wed Nov 19 09:23:22 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <3FBB7963.5070308@videotron.ca> References: <20031119115424.GA4439@nl.linux.org> <3FBB7963.5070308@videotron.ca> Message-ID: <20031119142303.GA5569@nl.linux.org> papaDoc wrote: > >whenever I untrain a spam message accidantly trained as ham, by > >database gets corrupted. I get the following error message: > If later you find a mistake and you untrain. with the above email. Since > it was never really train with the later database the > hamcount or spamcount can co negative. > > So start again and force the training on all message > sb_mboxtrain -f -g you_ham_mbox -s your_spam_mbox > You will be able to correct your mistake. Thanks for your answer. I will try to avoid it in the future. Correcting it will force me to keep al my spam. I currently don't, but I could do it from now on. Gerrit. -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ From tony.flury at bt.com Wed Nov 19 10:59:46 2003 From: tony.flury at bt.com (tony.flury@bt.com) Date: Wed Nov 19 11:01:16 2003 Subject: [Spambayes] Problem with Spam bayes installation on Windows 2000pc Message-ID: The plug fails to initialise - I attach the Logs files from 4 attempts - all of which mention permissions problems <> <> <> <> Any assistance would be useful -- Tony Flury -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 2946 bytes Desc: spambayes1.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031119/3a649e1c/spambayes1.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 3307 bytes Desc: spambayes3.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031119/3a649e1c/spambayes3.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes2.log Type: application/octet-stream Size: 3307 bytes Desc: spambayes2.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031119/3a649e1c/spambayes2.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 2946 bytes Desc: spambayes1.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031119/3a649e1c/spambayes1-0001.obj From Charlie at NABOR.com Wed Nov 19 10:59:24 2003 From: Charlie at NABOR.com (Charlie Willits) Date: Wed Nov 19 11:01:43 2003 Subject: [Spambayes] HTML Correction Message-ID: <6A3F11D54A5CEA4B86F8B4CC26E86D1B012AD2@mail.nabor.local> Version of Windows: Windows XP Version of SpamBayes: SpamBayes-Outlook-Setup-0081.exe Just a typo on the Welcome.html -- the reference to adding Spam Score to Oulook folder view is wrong:

Outlook does not allow us to automatically add the spam score to your Outlook folder views - but you can do it manually by following these instructions.

It should probably be something more like:

Outlook does not allow us to automatically add the spam score to your Outlook folder views - but you can do it manually by following these instructions.

Looking forward to using the product! Charlie From kennypitt at hotmail.com Wed Nov 19 11:44:54 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Nov 19 11:46:47 2003 Subject: [Spambayes] Problem with Spam bayes installation on Windows 2000pc In-Reply-To: Message-ID: tony.flury@bt.com wrote: > The plug fails to initialise - I attach the Logs files from 4 > attempts - all of which mention permissions problems > > <> <> <> > <> > > Any assistance would be useful Which version of Outlook are you running? Is this the first version of Outlook that you installed, or did you upgrade it from a previous version? If you upgraded, what version did you upgrade from? Also, do you run under a user account that does not have Administrator privileges? -- Kenny Pitt From kennypitt at hotmail.com Wed Nov 19 11:50:17 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Nov 19 11:50:51 2003 Subject: [Spambayes] HTML Correction In-Reply-To: <6A3F11D54A5CEA4B86F8B4CC26E86D1B012AD2@mail.nabor.local> Message-ID: Charlie Willits wrote: > Version of Windows: Windows XP > Version of SpamBayes: SpamBayes-Outlook-Setup-0081.exe > > > Just a typo on the Welcome.html -- the reference to adding Spam Score > to Oulook folder view is wrong: > >

Outlook does not allow us to automatically add the style="font-style: italic;">spam score to your Outlook folder > views - but you can href="file:///e:/src/spambayes/Outlook2000/about.html#Field">do it > manually by following these instructions.
>

> > It should probably be something more like: > >

Outlook does not allow us to automatically add the style="font-style: italic;">spam score to your Outlook folder > views - but you can do it manually by > following these instructions.
>

Thanks for the report. That problem was fixed a couple of months ago, but there hasn't been a new release of the plugin since then. Should be fixed in the next release. -- Kenny Pitt From hildies at citlink.net Wed Nov 19 11:50:58 2003 From: hildies at citlink.net (Jack Hildebrand) Date: Wed Nov 19 11:51:02 2003 Subject: [Spambayes] anti-spam software Message-ID: <001701c3aebd$4f600290$01fea8c0@hewlittpackard> Dear Sir: Am I to understand that you do not currently have anti-spam software available for Outlook Express users? Thank you for your response. Jack Hildebrand Elk Grove, CA From dreed at srdcorp.com Wed Nov 19 11:49:45 2003 From: dreed at srdcorp.com (Reed, David) Date: Wed Nov 19 11:52:50 2003 Subject: [Spambayes] [REPOST] Spambayes Outlook Plugin (Latest) unresponsive Message-ID: <4C542552EF434E43B2F2572854A73CBD6F7134@corp-mail1.srdcorp.com> Hi All, I installed the SpamBayes Outlook plugin several months ago for our Company President. It's been working, mostly, since then. Oh sure, a few times it has stopped working, and I'd go fix it. Well, today it stopped working. I upgraded it to the latest Outlook Plugin (.081?), and retrained it. However, it still isn't responding. I uninstalled it and reinstalled it. Still not change. No response. I can't even select an email and "DELETE AS SPAM", because it just doesn't. It's totally unresponsive to anything. It only seems to be his computer (the Company President, naturally, who isn't a computer guy at all, so he has NO patience for computer problems), and I am seemingly powerless to do anything about it. Has anyone else got any idea what might be wrong? Like I say, I've deleted it and reinstalled it, I've upgraded it, and I've retrained it already. I can't select an email and DELETE AS SPAM either. All ideas and suggestions welcome! Thanks, David From kennypitt at hotmail.com Wed Nov 19 11:53:02 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Nov 19 11:53:38 2003 Subject: [Spambayes] Almost everything goes to Junk Suspects even aftertraining. In-Reply-To: Message-ID: Matthew LaCrosse wrote: > I can't seem to figure out why it keeps sending almost everything to > Junk Suspects even after doing "Recover from Spam" it does send it to > my inbox but it never seems to learn what it is that always should go > to the Inbox. Any ideas? Can you submit the output of "Show spam clues" for one of the messages? -- Kenny Pitt From skip at pobox.com Wed Nov 19 12:06:51 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Nov 19 12:07:05 2003 Subject: [Spambayes] anti-spam software In-Reply-To: <001701c3aebd$4f600290$01fea8c0@hewlittpackard> References: <001701c3aebd$4f600290$01fea8c0@hewlittpackard> Message-ID: <16315.41771.147253.763487@montanaro.dyndns.org> Jack> Am I to understand that you do not currently have anti-spam Jack> software available for Outlook Express users? Jack, You can use the sb_server.py to proxy POP3 or sb_impafilter.py to proxy IMAP between Outlook Express and your mail server. For details, start here: http://spambayes.sf.net/faq.html#does-spambayes-work-with-outlook-express Skip From gerrit at nl.linux.org Wed Nov 19 12:42:13 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Wed Nov 19 12:42:41 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <20031119115424.GA4439@nl.linux.org> References: <20031119115424.GA4439@nl.linux.org> Message-ID: <20031119174213.GA7318@nl.linux.org> Gerrit Holl wrote: > whenever I untrain a spam message accidantly trained as ham, by > database gets corrupted. I get the following error message: Unfortunately, it also happens when I don't untrain any message. It seems like the database gets corrupted whenever two different processes open it. Do I have to manually configure lockfiles or something similar? The error message still is: > Traceback (most recent call last): > File "/usr/local/bin/sb_filter.py", line 203, in ? > main() > File "/usr/local/bin/sb_filter.py", line 199, in main > action(msg) > File "/usr/local/bin/sb_filter.py", line 132, in filter > return h.filter(msg) > File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 109, in filter > prob, clues = self._scoremsg(msg, True) > File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 38, in _scoremsg > return self.bayes.spamprob(tokenize(msg), evidence) > File "/usr/local/lib/python2.3/site-packages/spambayes/classifier.py", line 158, in chi2_spamprob > clues = self._getclues(wordstream) > File "/usr/local/lib/python2.3/site-packages/spambayes/classifier.py", line 395, in _getclues > prob = self.probability(record) > File "/usr/local/lib/python2.3/site-packages/spambayes/classifier.py", line 242, in probability > assert hamcount <= nham > AssertionError yours, Gerrit. -- 62. If he do not plant the field that was given over to him as a garden, if it be arable land (for corn or sesame) the gardener shall pay the owner the produce of the field for the years that he let it lie fallow, according to the product of neighboring fields, put the field in arable condition and return it to its owner. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ From papaDoc at videotron.ca Wed Nov 19 12:50:59 2003 From: papaDoc at videotron.ca (papaDoc) Date: Wed Nov 19 12:51:04 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <20031119174213.GA7318@nl.linux.org> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> Message-ID: <3FBBAD83.1010602@videotron.ca> Gerrit Holl wrote: >Gerrit Holl wrote: > > >>whenever I untrain a spam message accidantly trained as ham, by >>database gets corrupted. I get the following error message: >> Can we have more informations: What is the command line you are using ? Remi From skip at pobox.com Wed Nov 19 12:55:18 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Nov 19 12:56:45 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <20031119174213.GA7318@nl.linux.org> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> Message-ID: <16315.44678.36753.228513@montanaro.dyndns.org> Gerrit> Unfortunately, it also happens when I don't untrain any message. Gerrit> It seems like the database gets corrupted whenever two different Gerrit> processes open it. Do I have to manually configure lockfiles or Gerrit> something similar? Yes, that's a known problem. Skip From gerrit at nl.linux.org Wed Nov 19 12:57:23 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Wed Nov 19 12:58:29 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <3FBBAD83.1010602@videotron.ca> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> <3FBBAD83.1010602@videotron.ca> Message-ID: <20031119175723.GA22642@nl.linux.org> > >Gerrit Holl wrote: [database gets corrupted] > Can we have more informations: > What is the command line you are using ? > > Remi I am using the procmail method for filtering: :0 fw:hamlock | /usr/local/bin/sb_filter.py I am using the mutt method for training: macro index S "|sb_filter.py -s | procmail\n" macro pager S "|sb_filter.py -s | procmail\n" macro index H "|sb_filter.py -g | procmail\n" macro pager H "|sb_filter.py -g | procmail\n" For the server I use: sb_server.py -d /home/gerrit/hammie.db My .spambayesrc is: [Categorization] ham_cutoff = 0.05 spam_cutoff = 0.95 [Hammie] train_on_filter = False [Headers] add_unique_id = True clue_mailheader_cutoff = 0.2 header_score_digits = 3 header_score_logarithm = True [Storage] persistent_storage_file = /home/gerrit/hammie.db persistent_use_database = True And I use this script for checking the number of hams and spams in my db: #!/usr/bin/python import spambayes.storage d = spambayes.storage.open_storage("/home/gerrit/hammie.db", True) print "Spam:", d.nspam print "Ham:", d.nham d.close() I will now try to reproduce the bug. yours, Gerrit. -- Mozilla _is_ the web: it grows faster than you can download it. 1011001 1101111 1110101 1110010 1110011 0101100 1000111 1100101 1110010 1110010 1101001 1110100 From skip at pobox.com Wed Nov 19 13:09:28 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Nov 19 13:11:50 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <20031119175723.GA22642@nl.linux.org> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> <3FBBAD83.1010602@videotron.ca> <20031119175723.GA22642@nl.linux.org> Message-ID: <16315.45528.694389.991984@montanaro.dyndns.org> Gerrit> I am using the procmail method for filtering: Gerrit> :0 fw:hamlock Gerrit> | /usr/local/bin/sb_filter.py Gerrit> I am using the mutt method for training: Gerrit> macro index S "|sb_filter.py -s | procmail\n" Gerrit> macro pager S "|sb_filter.py -s | procmail\n" Gerrit> macro index H "|sb_filter.py -g | procmail\n" Gerrit> macro pager H "|sb_filter.py -g | procmail\n" Gerrit> For the server I use: Gerrit> sb_server.py -d /home/gerrit/hammie.db Gerrit, Why do you use both sb_filter and sb_server? Both sb_server and sb_filter (when given the -s or -g flags) will open the database file for update. If you're already training from mutt, sb_server doesn't add anything. Skip From gerrit at nl.linux.org Wed Nov 19 13:13:36 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Wed Nov 19 13:14:01 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <16315.45528.694389.991984@montanaro.dyndns.org> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> <3FBBAD83.1010602@videotron.ca> <20031119175723.GA22642@nl.linux.org> <16315.45528.694389.991984@montanaro.dyndns.org> Message-ID: <20031119181336.GB7620@nl.linux.org> Skip Montanaro wrote: > Gerrit> sb_server.py -d /home/gerrit/hammie.db > > Gerrit, > > Why do you use both sb_filter and sb_server? Both sb_server and sb_filter > (when given the -s or -g flags) will open the database file for update. If > you're already training from mutt, sb_server doesn't add anything. I use it to check why a certain message has been classified as spam or ham. When I don't understand why a message is not correctly classified, I launch sb_server and ask it to give the full explanation (classify). Is there a script that simply does this but uses stdout instead of a web server? Gerrit. -- 275. If any one hire a ferryboat, he shall pay three gerahs in money per day. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ From rmalayter at bai.org Wed Nov 19 13:53:49 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Nov 19 13:53:57 2003 Subject: [Spambayes] Few quick questions! Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C398F@cliff.bai.org> Why not put the references to SpamBayes in your Outlook signature? This is likely not a feature that will ever make it into SpamBayes, since it's so easy to do with most mail clients. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org] On Behalf Of Sanjay Darisi > Sent: Tuesday, November 18, 2003 3:14 PM > To: spambayes@python.org > Subject: [Spambayes] Few quick questions! > > [Sanjay] > >/ I know how to do this manually. But I would like to do it in addin > />/ itself. Like after installing the addin it has to append > a signature > />/ to every outgoing email. How can I do this? > / > [Richie] > > That's a strange requirement - what do you want the > signature to say, if > > you don't mind me asking? > > Just like that, Like spambayes is protecting my inbox...How > to do that? > > sanjay > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > From sethg.delete at GoodmanAssociates.com Wed Nov 19 14:02:10 2003 From: sethg.delete at GoodmanAssociates.com (Seth Goodman) Date: Wed Nov 19 14:18:32 2003 Subject: [Spambayes] spam designed to defeat Bayesian filters Message-ID: Attached is an email (along with resulting spam clues) that apparently was designed specifically to get past Bayesian filters. I believe this was mentioned before as the "white on white" HTML problem. The email has a large number of legitimate words, probably randomly picked from a dictionary, in a section where the font color is almost white on a white background. There is a little snippet of HTML at the end that contains my email address. I don't know what it does but I don't like the looks of it. The message appears blank, unless you look very closely and then look at the HTML source. Not only does this message slip through the classifier as ham, but training on this message as spam would probably reduce the effectiveness of the classifier. My questions are: 1) What is this thing? Does it harvest addresses when rendered? 2) Are there any approaches that have been discussed to ignore the "almost white" text during parsing? 3) Is anything in the works for this exploit? -- Seth Goodman Humans: please remove ".delete" to reply Spambots: please disregard the above -------------- next part -------------- An embedded message was scrubbed... From: "Dion Tiff" Subject: movie's Date: Wed, 19 Nov 2003 03:12:07 -0600 Size: 1074 Url: http://mail.python.org/pipermail/spambayes/attachments/20031119/88148245/attachment-0002.mht -------------- next part -------------- An embedded message was scrubbed... From: unknown sender Subject: Spam Clues: movie's Date: no date Size: 25616 Url: http://mail.python.org/pipermail/spambayes/attachments/20031119/88148245/attachment-0003.mht From papaDoc at videotron.ca Wed Nov 19 14:27:40 2003 From: papaDoc at videotron.ca (papaDoc) Date: Wed Nov 19 14:27:46 2003 Subject: [Spambayes] spam designed to defeat Bayesian filters In-Reply-To: References: Message-ID: <3FBBC42C.9080805@videotron.ca> Hi, This classify for me as Spam Spam probability: *77.55% (0.7754698219)*. This is the token *Word* *Probability* *Times in ham* *Times in spam* *H* 0.445814 - - *S* 0.996753 - - nov 0.065217 3 0 vcr 0.065217 3 0 before 0.083754 6 1 skip:1 10 0.083754 6 1 battery 0.091837 2 0 controls 0.091837 2 0 emergency 0.091837 2 0 given 0.091837 2 0 knowing 0.091837 2 0 protection 0.091837 2 0 selection 0.091837 2 0 separately 0.091837 2 0 switches 0.091837 2 0 talking 0.091837 2 0 teaching 0.091837 2 0 telling 0.091837 2 0 window 0.091837 2 0 either 0.097789 5 1 good 0.117544 4 1 control 0.147499 3 1 network 0.147499 3 1 provides 0.147499 3 1 tell 0.147499 3 1 white 0.147499 3 1 based 0.149229 5 2 basic 0.155172 1 0 considered 0.155172 1 0 consists 0.155172 1 0 containing 0.155172 1 0 core 0.155172 1 0 corrected 0.155172 1 0 effects 0.155172 1 0 elements 0.155172 1 0 elevator 0.155172 1 0 empty 0.155172 1 0 enables 0.155172 1 0 encouraged 0.155172 1 0 from: 0.155172 1 0 graph 0.155172 1 0 graphics 0.155172 1 0 ground 0.155172 1 0 keyboard 0.155172 1 0 kind 0.155172 1 0 laboratory 0.155172 1 0 language 0.155172 1 0 necessarily 0.155172 1 0 none 0.155172 1 0 produced 0.155172 1 0 proper 0.155172 1 0 properties 0.155172 1 0 protect 0.155172 1 0 seem 0.155172 1 0 seems 0.155172 1 0 senior 0.155172 1 0 separate 0.155172 1 0 subject: 0.155172 1 0 switching 0.155172 1 0 target 0.155172 1 0 task 0.155172 1 0 telephone 0.155172 1 0 whole 0.155172 1 0 because 0.184556 7 4 got 0.195811 5 3 else 0.198686 2 1 whether 0.198686 2 1 version 0.219899 3 2 going 0.231109 4 3 skip:h 10 0.231109 4 3 content-type:text/plain 0.2363 32 27 balance 0.844828 0 1 ball 0.844828 0 1 bar 0.844828 0 1 basis 0.844828 0 1 become 0.844828 0 1 becomes 0.844828 0 1 consumption 0.844828 0 1 continually 0.844828 0 1 continued 0.844828 0 1 continues 0.844828 0 1 continuing 0.844828 0 1 convention 0.844828 0 1 conventions 0.844828 0 1 convinced 0.844828 0 1 corner 0.844828 0 1 electronic 0.844828 0 1 encourage 0.844828 0 1 ended 0.844828 0 1 government 0.844828 0 1 grant 0.844828 0 1 grants 0.844828 0 1 graphic 0.844828 0 1 greatly 0.844828 0 1 green 0.844828 0 1 gross 0.844828 0 1 groups 0.844828 0 1 growing 0.844828 0 1 lands 0.844828 0 1 message-id:invalid 0.844828 0 1 natural 0.844828 0 1 naturally 0.844828 0 1 necessary 0.844828 0 1 neither 0.844828 0 1 normal 0.844828 0 1 promise 0.844828 0 1 promised 0.844828 0 1 protected 0.844828 0 1 prove 0.844828 0 1 seeing 0.844828 0 1 seek 0.844828 0 1 sees 0.844828 0 1 selected 0.844828 0 1 self 0.844828 0 1 series 0.844828 0 1 skip:( 20 0.844828 0 1 skip:[ 10 0.844828 0 1 skip:b 40 0.844828 0 1 tape 0.844828 0 1 to:none 0.844828 0 1 willing 0.844828 0 1 wins 0.844828 0 1 woman 0.844828 0 1 women 0.844828 0 1 contain 0.908163 0 2 contained 0.908163 0 2 end 0.908163 0 2 grow 0.908163 0 2 known 0.908163 0 2 national 0.908163 0 2 production 0.908163 0 2 project 0.908163 0 2 properly 0.908163 0 2 provide. 0.908163 0 2 provided 0.908163 0 2 publication 0.908163 0 2 sending 0.908163 0 2 serious 0.908163 0 2 women's 0.908163 0 2 wondering 0.908163 0 2 copy 0.934783 0 3 giving 0.934783 0 3 growth 0.934783 0 3 nearly 0.934783 0 3 needs 0.934783 0 3 secure 0.934783 0 3 skip:x 10 0.934783 0 3 taking 0.958716 0 5 wish 0.958716 0 5 without 0.969799 0 7 mail 0.980349 0 11 url:biz 0.99236 0 29 Seth Goodman wrote: >Attached is an email (along with resulting spam clues) that apparently was >designed specifically to get past Bayesian filters. I believe this was >mentioned before as the "white on white" HTML problem. The email has a >large number of legitimate words, probably randomly picked from a >dictionary, in a section where the font color is almost white on a white >background. There is a little snippet of HTML at the end that contains my >email address. I don't know what it does but I don't like the looks of it. >The message appears blank, unless you look very closely and then look at the >HTML source. Not only does this message slip through the classifier as ham, >but training on this message as spam would probably reduce the effectiveness >of the classifier. > >My questions are: > >1) What is this thing? Does it harvest addresses when rendered? > >2) Are there any approaches that have been discussed to ignore the "almost >white" text during parsing? > >3) Is anything in the works for this exploit? > >-- >Seth Goodman > > Humans: please remove ".delete" to reply > > Spambots: please disregard the above > > > > ------------------------------------------------------------------------ > > Subject: > movie's > From: > "Dion Tiff" > Date: > Wed, 19 Nov 2003 03:12:07 -0600 > To: > > > > hriie os hy vsicwj k tnu mk k vcr uuhfw wawp ucu neqge oo cqstcpw >jrsqldm qvkm ncy fim > > > > > ------------------------------------------------------------------------ > > Subject: > Spam Clues: movie's > > >Spam Score: 0% (0) > > > >word spamprob #ham #spam >'*H*' 1 - - >'*S*' 0 - - >'provides' 0.00790861 28 0 >'project' 0.00819672 27 0 >'basic' 0.00884086 25 0 >'efforts' 0.0100223 22 0 >'electronics' 0.0100223 22 0 >'bar' 0.0110024 20 0 >'nature' 0.012894 17 0 >'switch' 0.0136778 16 0 >'wind' 0.0136778 16 0 >'neither' 0.0145631 15 0 >'groups' 0.0155709 14 0 >'necessarily' 0.0167286 13 0 >'properly' 0.0167286 13 0 >'bars' 0.0180723 12 0 >'enable' 0.0180723 12 0 >'noise' 0.0180723 12 0 >'continuing' 0.0196507 11 0 >'convinced' 0.0196507 11 0 >'projects' 0.0196507 11 0 >'technical' 0.0209429 53 1 >'correct.' 0.0215311 10 0 >'grateful' 0.0215311 10 0 >'senior' 0.0215311 10 0 >'tells' 0.0215311 10 0 >'window' 0.0215311 10 0 >'conventional' 0.0238095 9 0 >'glass' 0.0238095 9 0 >'go.' 0.0238095 9 0 >'knows' 0.0238095 9 0 >'nation' 0.0238095 9 0 >'series' 0.0238095 9 0 >'encourage' 0.0266272 8 0 >'technique' 0.0266272 8 0 >'wine' 0.0266272 8 0 >'controlling' 0.0302013 7 0 >'core' 0.0302013 7 0 >'emphasis' 0.0302013 7 0 >'grant' 0.0302013 7 0 >'granted' 0.0302013 7 0 >'nervous' 0.0302013 7 0 >'networks' 0.0302013 7 0 >'protected' 0.0302013 7 0 >'selecting' 0.0302013 7 0 >'talked' 0.0302013 7 0 >'tape' 0.0302013 7 0 >'win' 0.0302013 7 0 >'wishes' 0.0302013 7 0 >'conversation' 0.0348837 6 0 >'election' 0.0348837 6 0 >'encouraging' 0.0348837 6 0 >'governor' 0.0348837 6 0 >'grows' 0.0348837 6 0 >'suspected' 0.0348837 6 0 >'winter' 0.0348837 6 0 >'night' 0.0351794 31 1 >'bear' 0.0412844 5 0 >'copied' 0.0412844 5 0 >'corrected' 0.0412844 5 0 >'effectively' 0.0412844 5 0 >'encounter' 0.0412844 5 0 >'encourages' 0.0412844 5 0 >'ending' 0.0412844 5 0 >'kill' 0.0412844 5 0 >'label' 0.0412844 5 0 >'promising' 0.0412844 5 0 >'sends' 0.0412844 5 0 >'talks' 0.0412844 5 0 >'teach' 0.0412844 5 0 >'winning' 0.0412844 5 0 >'network' 0.0474167 41 2 >'whether' 0.0474167 41 2 >'beginning' 0.048731 22 1 >'constraints' 0.0505618 4 0 >'context' 0.0505618 4 0 >'continually' 0.0505618 4 0 >'contract.' 0.0505618 4 0 >'contribution' 0.0505618 4 0 >'convince' 0.0505618 4 0 >'convincing' 0.0505618 4 0 >'cope' 0.0505618 4 0 >'correcting' 0.0505618 4 0 >'elect' 0.0505618 4 0 >'keeps' 0.0505618 4 0 >'keys' 0.0505618 4 0 >'kinds' 0.0505618 4 0 >'knocked' 0.0505618 4 0 >'promised' 0.0505618 4 0 >'selects' 0.0505618 4 0 >'switching' 0.0505618 4 0 >'widespread' 0.0505618 4 0 >'wondered' 0.0505618 4 0 >'graphics' 0.0532931 20 1 >'key' 0.0635763 30 2 >'backs' 0.0652174 3 0 >'backwards' 0.0652174 3 0 >'basically' 0.0652174 3 0 >'consists' 0.0652174 3 0 >'elements' 0.0652174 3 0 >'elsewhere' 0.0652174 3 0 >'encouraged' 0.0652174 3 0 >"government's" 0.0652174 3 0 >"governor's" 0.0652174 3 0 >'grew' 0.0652174 3 0 >'lacks' 0.0652174 3 0 >'ladies' 0.0652174 3 0 >'names.' 0.0652174 3 0 >'nearby' 0.0652174 3 0 >'next.' 0.0652174 3 0 >'noisy' 0.0652174 3 0 >'programmers' 0.0652174 3 0 >'proportion' 0.0652174 3 0 >'protest' 0.0652174 3 0 >'sells' 0.0652174 3 0 >'separately' 0.0652174 3 0 >'serial' 0.0652174 3 0 >'teeth.' 0.0652174 3 0 >'within.' 0.0652174 3 0 >'telling' 0.0655701 16 1 >'group' 0.065624 42 3 >'systems' 0.067776 28 2 >'greatly' 0.0695772 15 1 >'none' 0.0695772 15 1 >'telephone' 0.0695772 15 1 >'sensitive' 0.0741059 14 1 >'separate' 0.0741059 14 1 >'glad' 0.0780931 24 2 >'wide' 0.0780931 24 2 >'basis' 0.0792652 13 1 >'ground' 0.0792652 13 1 >'consider' 0.0797402 34 3 >'copy' 0.0840554 52 5 >'seems' 0.0842721 32 3 >'property' 0.0845266 22 2 >'base' 0.0851967 12 1 >'becomes' 0.0851967 12 1 >'team' 0.087111 50 5 >'suspended' 0.0918367 2 0 >"system's" 0.0918367 2 0 >'table.' 0.0918367 2 0 >'tank' 0.0918367 2 0 >'tea' 0.0918367 2 0 >'teacher' 0.0918367 2 0 >'teaching' 0.0918367 2 0 >'whilst' 0.0918367 2 0 >'whoever' 0.0918367 2 0 >'wider' 0.0918367 2 0 >'wild' 0.0918367 2 0 >"window's" 0.0918367 2 0 >'x-mailer:qualcomm windows eudora version 5.1' 0.925475 2 >30 >'url:biz' 0.995187 1 277 > > From fredfredwin at yahoo.com Wed Nov 19 14:37:38 2003 From: fredfredwin at yahoo.com (fred fredwin) Date: Wed Nov 19 14:37:42 2003 Subject: [Spambayes] question about Spambayes setup... Message-ID: <20031119193738.3430.qmail@web41901.mail.yahoo.com> Greetings ! I recently came across SpamBayes and am very excited about using it (tried other spam filtering software, and it was a pretty big waste of time). I've got it set up, and it works fine, but I want to know if anyone has any setup tips? I was thinking of having a folder with known bad spam, which would not be deleted and kept in Outlook today, along with known ham, kept in another folder, each having about 500 messages to compare incoming spam to, which would periodically take new messages (ham and spam), put them in the respective folder, and then periodically delete old spam/ham messages. This would keep an updated list of spam/ham, to stay on par with the spammers (they always were one step ahead any kind of spam filtering software). any suggestions would be appreciated... Fred --------------------------------- Do you Yahoo!? Protect your identity with Yahoo! Mail AddressGuard From John.Hall at sciatl.com Wed Nov 19 14:40:30 2003 From: John.Hall at sciatl.com (Hall, John) Date: Wed Nov 19 14:41:18 2003 Subject: [Spambayes] "Duplicate" e-mails using SpamBayes Message-ID: I love SpamBayes...question...occasionally after I open and read an e-mail, then close it, it appears in my In Box as if I haven't opened it, still bold face...anyone else have that "problem?" Because of the great Spam filtering performance I can live with it, just curious if anyone else has this occur...JH - - - - - - - Appended by Scientific-Atlanta, Inc. - - - - - - - This e-mail and any attachments may contain information which is confidential, proprietary, privileged or otherwise protected by law. The information is solely intended for the named addressee (or a person responsible for delivering it to the addressee). If you are not the intended recipient of this message, you are not authorized to read, print, retain, copy or disseminate this message or any part of it. If you have received this e-mail in error, please notify the sender immediately by return e-mail and delete it from your computer. From rmalayter at bai.org Wed Nov 19 14:46:27 2003 From: rmalayter at bai.org (Ryan Malayter) Date: Wed Nov 19 14:46:34 2003 Subject: [Spambayes] spam designed to defeat Bayesian filters Message-ID: <792DE28E91F6EA42B4663AE761C41C2A012C3996@cliff.bai.org> > From: Seth Goodman > 1) What is this thing? Does it harvest addresses when rendered? Exactly. It's called a web bug. Opening this email in a program that automatically downloads pictures would tell them that your address is valid, and you viewed their spam. > 2) Are there any approaches that have been discussed to > ignore the "almost white" text during parsing? Yes, and most have been rejected, because they have not shown an increase in capture rate. For the most part, the URLs and the headers used often condemn this type of spam despite the random words. However, that doesn't seem to be the case with this one. If you can propose a method, then get someone to code it into the tokenizer (or do it yourself) for testing, let us know. The really strange thing is that most of the random words had a very high ham probability with my corpus as well as yours. The message scored as a 0% on my SpamBayes installation as well. It looks like someone is analyzing a bunch of mail with a bayesian filter, and coming up with their own list of "universal" list of innocent words. Since this message is malformed (the bad headers prevent the HTML from showing up correctly), and it doesn't really contain any sales pitch, this is probably a test a spammer is using to see how many of this style of spam he/she can get through. Of course, once sales-pitch words are added to it, with will score much higher. Regards, Ryan From tim.one at comcast.net Wed Nov 19 14:54:28 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed Nov 19 14:54:35 2003 Subject: [Spambayes] spam designed to defeat Bayesian filters In-Reply-To: Message-ID: [Seth Goodman] > Attached is an email (along with resulting spam clues) that > apparently was designed specifically to get past Bayesian filters. The purpose of random text is to defeat fingerprinting schemes. The best random crap of this kind can hope to do against our kind of filter is push it down to unsure; although for some people, some of the time, it will score as ham -- as Skip said recently, unless they can access your training database, to find out what is and isn't hammy and spammy to *you*, random text is just pissing in the wind. > I believe this was mentioned before as the "white on white" HTML > problem. The email has a large number of legitimate words, probably > randomly picked from a dictionary, in a section where the font color > is almost white on a white background. There is a little snippet of > HTML at the end that contains my email address. I don't know what it > does but I don't like the looks of it. It's probably "a beacon". If your email reader (like Outlook ) normally renders HTML, and is willing to fetch images over the web, the sender can arrange to send *back* any kind of info it wants in the URL. Like the email address the spam was sent to. When the spammer's site gets the URL request, as a side effect it gets the email address it embedded in the URL too. Then the spammer knows that address is "live", and can sell it to other spammers. > The message appears blank, unless you look very closely and then > look at the HTML source. Not only does this message slip through the > classifier as ham, It slipped thru yours today, but it won't slip through everyone's today, and it won't even slip through yours on some other day -- collections of random words get pretty much random scores, but tending toward 50 on average. > but training on this message as spam would probably reduce the > effectiveness of the classifier. I've trained on such things without apparent harm. Very few non-hapax words are purely hammy or purely spammy, and this kind of scheme is robust against that kind of ambiguity (at least when well-trained; a mistake-driven training approach leaves it brittler, but that's what I do these days, and I still haven't noticed harm from training on such things). > My questions are: > > 1) What is this thing? Does it harvest addresses when rendered? That appears to be its intent, yes. > 2) Are there any approaches that have been discussed to ignore the > "almost white" text during parsing? Sure, but it requires semantic analysis deeper than we do, and would have to start with doing real parsing of HTML. That would be much more expensive than what we do now. > 3) Is anything in the works for this exploit? Not that I know of. From tpeters at mixcom.com Wed Nov 19 14:56:23 2003 From: tpeters at mixcom.com (Tom Peters) Date: Wed Nov 19 14:57:51 2003 Subject: [Spambayes] question about Spambayes setup... In-Reply-To: <20031119193738.3430.qmail@web41901.mail.yahoo.com> Message-ID: <5.1.0.14.0.20031119135246.00baf720@localhost> I'm using it with Eudora pro 5.1 in Pop3 proxy mode. If you think any of my advice would be applicable, I'm happy to give it. Writing filters in Eudora to act on the inserted headers isn't hard but there are a few wrinkles and gotchas. Also, I've gotten it to work with Norton antivirus configured to scan all outgoing mail, and if you do that wrong with the pop3 proxy, it will scan all messages twice each. Ooops. Just saw the word "Outlook" in your message. Guess you won't be needing the POP3 proxy. At 11:37 AM 11/19/2003 -0800, fred fredwin wrote: >Greetings ! >I recently came across SpamBayes and am very excited about using it (tried >other spam filtering software, and it was a pretty big waste of >time). I've got it set up, and it works fine, but I want to know if >anyone has any setup tips? >I was thinking of having a folder with known bad spam, which would not be >deleted and kept in Outlook today, along with known ham, kept in another >folder, each having about 500 messages to compare incoming spam to, which >would periodically take new messages (ham and spam), put them in the >respective folder, and then periodically delete old spam/ham >messages. This would keep an updated list of spam/ham, to stay on par >with the spammers (they always were one step ahead any kind of spam >filtering software). any suggestions would be appreciated... > [Listening] We only consult the ear because the heart is wanting. -- Blaise Pascal --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From skip at pobox.com Wed Nov 19 14:59:38 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Nov 19 15:00:37 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <20031119181336.GB7620@nl.linux.org> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> <3FBBAD83.1010602@videotron.ca> <20031119175723.GA22642@nl.linux.org> <16315.45528.694389.991984@montanaro.dyndns.org> <20031119181336.GB7620@nl.linux.org> Message-ID: <16315.52138.672760.740584@montanaro.dyndns.org> Gerrit> I use it to check why a certain message has been classified as Gerrit> spam or ham. When I don't understand why a message is not Gerrit> correctly classified, I launch sb_server and ask it to give the Gerrit> full explanation (classify). Is there a script that simply does Gerrit> this but uses stdout instead of a web server? Sure. Just spit the message back through sb_filter.py with the Headers.include_evidence option set to True. If you're running from the CVS version of SpamBayes you run it like sb_filter.py -o Headers:include_evidence:True < msgfile Skip From tpeters at mixcom.com Wed Nov 19 14:59:27 2003 From: tpeters at mixcom.com (Tom Peters) Date: Wed Nov 19 15:00:45 2003 Subject: [Spambayes] anti-spam software In-Reply-To: <16315.41771.147253.763487@montanaro.dyndns.org> References: <001701c3aebd$4f600290$01fea8c0@hewlittpackard> <001701c3aebd$4f600290$01fea8c0@hewlittpackard> Message-ID: <5.1.0.14.0.20031119135907.00bcb898@localhost> I am using the POP3 filter. Need help? At 11:06 AM 11/19/2003 -0600, Skip Montanaro wrote: > Jack> Am I to understand that you do not currently have anti-spam > Jack> software available for Outlook Express users? > >Jack, > >You can use the sb_server.py to proxy POP3 or sb_impafilter.py to proxy IMAP >between Outlook Express and your mail server. For details, start here: > > http://spambayes.sf.net/faq.html#does-spambayes-work-with-outlook-express > >Skip > >_______________________________________________ >Spambayes@python.org >http://mail.python.org/mailman/listinfo/spambayes >Check the FAQ before asking: http://spambayes.sf.net/faq.html [Philosophy] Reality is that which, when you stop believing in it, doesn't go away. --Phillip K. Dick, 1928-1982 --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From tpeters at mixcom.com Wed Nov 19 15:00:39 2003 From: tpeters at mixcom.com (Tom Peters) Date: Wed Nov 19 15:01:49 2003 Subject: [Spambayes] anti-spam software In-Reply-To: <16315.41771.147253.763487@montanaro.dyndns.org> References: <001701c3aebd$4f600290$01fea8c0@hewlittpackard> <001701c3aebd$4f600290$01fea8c0@hewlittpackard> Message-ID: <5.1.0.14.0.20031119140006.00bba4c8@localhost> I should have said, I am using the POP3 proxy, when I offered to help. At 11:06 AM 11/19/2003 -0600, Skip Montanaro wrote: > Jack> Am I to understand that you do not currently have anti-spam > Jack> software available for Outlook Express users? > >Jack, > >You can use the sb_server.py to proxy POP3 or sb_impafilter.py to proxy IMAP >between Outlook Express and your mail server. For details, start here: > > http://spambayes.sf.net/faq.html#does-spambayes-work-with-outlook-express > >Skip > >_______________________________________________ >Spambayes@python.org >http://mail.python.org/mailman/listinfo/spambayes >Check the FAQ before asking: http://spambayes.sf.net/faq.html [Humor] There's a fine line between fishing and just standing on the shore like an idiot. --Steven Wright --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From gerrit at nl.linux.org Wed Nov 19 15:55:21 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Wed Nov 19 15:55:45 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <16315.52138.672760.740584@montanaro.dyndns.org> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> <3FBBAD83.1010602@videotron.ca> <20031119175723.GA22642@nl.linux.org> <16315.45528.694389.991984@montanaro.dyndns.org> <20031119181336.GB7620@nl.linux.org> <16315.52138.672760.740584@montanaro.dyndns.org> Message-ID: <20031119205521.GA16448@nl.linux.org> Skip Montanaro wrote: > Gerrit> Is there a script that simply does > Gerrit> this but uses stdout instead of a web server? > > Sure. Just spit the message back through sb_filter.py with the > Headers.include_evidence option set to True. If you're running from the CVS > version of SpamBayes you run it like > > sb_filter.py -o Headers:include_evidence:True < msgfile Thanks! Is this behaviour documented somewhere? sb_filter.py --help does not seem to list a -o option here... Gerrit. -- 259. If any one steal a water-wheel from the field, he shall pay five shekels in money to its owner. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ From gerrit at nl.linux.org Wed Nov 19 16:11:15 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Wed Nov 19 16:12:50 2003 Subject: [Spambayes] AssertionError: hamcount <= nham In-Reply-To: <20031119205521.GA16448@nl.linux.org> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> <3FBBAD83.1010602@videotron.ca> <20031119175723.GA22642@nl.linux.org> <16315.45528.694389.991984@montanaro.dyndns.org> <20031119181336.GB7620@nl.linux.org> <16315.52138.672760.740584@montanaro.dyndns.org> <20031119205521.GA16448@nl.linux.org> Message-ID: <20031119211115.GA17793@nl.linux.org> Gerrit Holl wrote: > > sb_filter.py -o Headers:include_evidence:True < msgfile > > Is this behaviour documented somewhere? > sb_filter.py --help does not seem to list a -o option here... Sorry, was not using latest CVS! Gerrit. -- 50. If he give a cultivated corn-field or a cultivated sesame-field, the corn or sesame in the field shall belong to the owner of the field, and he shall return the money to the merchant as rent. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ From tameyer at ihug.co.nz Wed Nov 19 17:09:05 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Nov 19 17:06:40 2003 Subject: [Spambayes] Few quick questions! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304206813@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B158@its-xchg4.massey.ac.nz> > Although I know from the message below and from looking at > the FAQ's that S-B doesn't look at outbound mail and > therefore should not impact it, since installing S-B I have > experienced continuous problems send e-mail, particularly > those messages that contain attachments. You're using the Outlook plug-in, right? The plug-in definitely doesn't touch your outgoing mail, but (if you're not using the plugin), depending on how you have things set up, the other applications might be. > It could be a > coincidence, but my intuition tells me otherwise. Intuition is often wrong <0.5 wink>. > The error messages are primarily to do with timing out, can't > connect to the server. I have checked other machines on my > network that don't have S-B installed they have the same > Outlook settings and work correctly like my machine did pre S-B. If you disable spambayes, does the problem go away? If you uninstall, does it go away? (by "go away", I mean is it immediately fixed). Are you using any other plug-ins with Outlook? I have seen reports of spambayes and other plug-ins clashing, and if the other plug-in effects outgoing mail, that could be a reason for the problem. If you remain convinced that spambayes is at fault, please open a bug report via the sourceforge system (), including as much detail as possible (spambayes version, OS version, Outlook version, exact error messages, spambayes log, ...). =Tony Meyer From tameyer at ihug.co.nz Wed Nov 19 17:15:50 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Nov 19 17:13:42 2003 Subject: [Spambayes] error in configuration In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304206805@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29DC@its-xchg4.massey.ac.nz> > I'm tryng to install spambayes on my machine. > I have a XP Professional SP1 system, with python 2.3. > I need to connect to an Imap Domino server. > I get this error when I go to configure folders to filter. [...] > File "C:\Python23\lib\imaplib.py", line 992, in _quote > arg = arg.replace('\\', '\\\\') > > AttributeError: 'tuple' object has no attribute 'replace' Could you run imapfilter with the "-i4" switch and post the results you get? (Or open a bug report via sourceforge and attach it there). At first glance, it looks like your password is somehow a tuple, instead of a string, but that shouldn't be possible, IIRC. =Tony Meyer From gerrit at nl.linux.org Wed Nov 19 17:31:08 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Wed Nov 19 17:31:35 2003 Subject: [Spambayes] Getting Evidence (Was: AssertionError: hamcount <= nham) In-Reply-To: <16315.52138.672760.740584@montanaro.dyndns.org> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> <3FBBAD83.1010602@videotron.ca> <20031119175723.GA22642@nl.linux.org> <16315.45528.694389.991984@montanaro.dyndns.org> <20031119181336.GB7620@nl.linux.org> <16315.52138.672760.740584@montanaro.dyndns.org> Message-ID: <20031119223108.GA18296@nl.linux.org> Hi, Skip Montanaro wrote: > Gerrit> I use it to check why a certain message has been classified as > Gerrit> spam or ham. When I don't understand why a message is not > Gerrit> correctly classified, I launch sb_server and ask it to give the > Gerrit> full explanation (classify). Is there a script that simply does > Gerrit> this but uses stdout instead of a web server? > > Sure. Just spit the message back through sb_filter.py with the > Headers.include_evidence option set to True. If you're running from the CVS > version of SpamBayes you run it like > > sb_filter.py -o Headers:include_evidence:True < msgfile Yes. But this is a 'short' version. Is it possible to get a long version, like the webserver is giving, without running the webserver? By long version, I mean at least two things: every clue on a separate line, and every clue contains information on how many spam messages contained this word and how many ham messages did. Of course, if there is no script producing this output, it would be possible to write one. I earlier succeeded in writing a script telling me how many spams and how many hams I have. I did this by simply looking at the source code. It is not as simple for what I want now. I have not found a simply function/method producing an object containing all clues which can be formatted to a table in a few lines of code. Is there such a function? (is this still ontopic for this list or do questions on spambayes' classes belong in the spambayes-dev mailing list?) good night to all europeans :P, Gerrit. -- 122. If any one give another silver, gold, or anything else to keep, he shall show everything to some witness, draw up a contract, and then hand it over for safe keeping. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ From cej at intech.com Wed Nov 19 17:31:57 2003 From: cej at intech.com (Christopher Jastram) Date: Wed Nov 19 17:31:44 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / No module named zlib ? In-Reply-To: <3FBA7E93.4040000@kolumbus.fi> References: <6.0.0.22.0.20031111021758.02439170@127.0.0.1> <3FB100B8.7050202@intech.com> <6.0.0.22.0.20031111182200.02471298@127.0.0.1> <3FBA64B6.5080407@intech.com> <3FBA7E93.4040000@kolumbus.fi> Message-ID: <3FBBEF5D.1070409@intech.com> Olli J. Marttila wrote: > Christopher Jastram wrote: > >> Olli, >> Do you know about this? >> http://mail.python.org/pipermail/spambayes-bugs/2003-August/000589.html >> >> Chris Jastram > > > Thanks, Chris! > No, I did not. > Having wasted several nights - and I really mean nights - trying to > get SpamBayes working I eventually gave up and installed Mozilla > Thunderbird: Easy installation, no hassle with dependencies, and working. > > Olli > > > Yeah. Same here. Ended up implementing server-side filtering for everyone else, and to handle those accounts that receive SO much mail they crash the Outlook plugin... (5 to 6 hundred). I saw a *very* interesting post about modifying SquirrelMail to have "Delete as Spam" (etc) buttons like the Outlook plugin, which merely forwarded the messages to the appropriate re-training interface. Very cool, and that's what I'll probably work on next. Sorry it didn't work out for you -- I find that these sorts of problems don't often get fixed (unless I do it). Chris From skip at pobox.com Wed Nov 19 18:07:46 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Nov 19 18:08:00 2003 Subject: [Spambayes] Getting Evidence (Was: AssertionError: hamcount <= nham) In-Reply-To: <20031119223108.GA18296@nl.linux.org> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> <3FBBAD83.1010602@videotron.ca> <20031119175723.GA22642@nl.linux.org> <16315.45528.694389.991984@montanaro.dyndns.org> <20031119181336.GB7620@nl.linux.org> <16315.52138.672760.740584@montanaro.dyndns.org> <20031119223108.GA18296@nl.linux.org> Message-ID: <16315.63426.901164.494473@montanaro.dyndns.org> Gerrit> Is it possible to get a long version, like the webserver is Gerrit> giving, without running the webserver? By long version, I mean Gerrit> at least two things: every clue on a separate line, and every Gerrit> clue contains information on how many spam messages contained Gerrit> this word and how many ham messages did. Attached is a little script I wrote (spamcounts.py) but haven't yet released. I use it typically to just view the total spam and ham counts like so: % spamcounts db: /Users/skip/.hammiedb token,nspam,nham saved state,201,145 The output is in CSV format so I can pop it into a spreadsheet if I so desire. Here are some other example uses: * display all keys matching a given regular expression: % spamcounts -r '(?i)python' db: /Users/skip/.hammiedb token,nspam,nham 1866/python,0,1 26090/python,0,1 cc:addr:python-dev,0,1 cc:addr:python-mode,0,1 cc:addr:python.org,0,4 ... * display four specific keys in the database hammie.db instead of the default: % spamcounts -d hammie.db moe larry curly guido db: hammie.db token,nspam,nham curly,0,0 guido,1,0 larry,2,4 moe,0,0 The "db:" line is sent to stderr, so you can capture just the CSV part to a file easily enough. It also generates (0,0) for non-existent keys. I suppose it could just ignore them instead. How does this help you? Here are a couple ideas: * modify spamcounts.py to tokenize a message and generate the counts for all tokens. * write a little script which just tokenizes a message and modify spamcounts to take a stream of tokens from stdin. Happy hacking... Skip -------------- next part -------------- A non-text attachment was scrubbed... Name: spamcounts.py Type: application/octet-stream Size: 2223 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031119/4ed3be43/spamcounts-0001.obj From jfoster at agmac.com Wed Nov 19 19:02:22 2003 From: jfoster at agmac.com (Jeff Foster) Date: Wed Nov 19 18:52:04 2003 Subject: [Spambayes] love your product but..... Message-ID: My name is Jeff Foster. I am running Spambayes Outlook Ad in .081 in Outlook 2000 on a Fujitsu Laptop using Windows 98 Second Edition. Here is a copy of my log file Loaded bayes database from 'C:\WINDOWS\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\WINDOWS\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 660 spam and 41 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 4.10.67766446 ( A ) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] SpamBayes: Watching for new messages in folder Inbox SpamBayes: Watching for new messages in folder Junk E-Mail Message 'RE: Code of Ethics/Core Curriculum Classes' had a Spam classification of 'No' Processing 1 missed spam in folder 'Inbox' took 4636.41ms FAILED to add the toolbar item 'SpamBayesCommand.Manager' - (-2147352567, 'Exception occurred.', (0, None, None, None, 0, -2147467259), None) Deleted the dead popup control - re-creating Message 'RE: Code of Ethics/Core Curriculum Classes' had a Spam classification of 'No' Message 'Re: Code of Ethics/Core Curriculum Classes' had a Spam classification of 'No' Message 'Cheapest Phentermine on the Internet!' had a Spam classification of 'Unsure' Message 'Re: OOOOPS, I forgot about my buddies' had a Spam classification of 'No' Saving configuration -> C:\WINDOWS\Application Data\SpamBayes\Microsoft Outlook Internet Settings.ini Spam filtering is disabled - ignoring new message Spam filtering is disabled - ignoring new message Spam filtering is disabled - ignoring new message Current version is 0.81, latest is 0.81. Outlook constantly "locks up". As you can see I have now disabled Spam filtering and since then have not experienced any problems. I have another computer in the office running Windows NT that I cannot even open Outlook since installing Spambayes. Please advise. From tameyer at ihug.co.nz Wed Nov 19 21:47:32 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Nov 19 21:45:13 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / No modulenamed zlib ? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130420697C@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B15B@its-xchg4.massey.ac.nz> > Ended up implementing server-side filtering for everyone else, and to > handle those accounts that receive SO much mail they crash > the Outlook plugin... (5 to 6 hundred). The volume of mail shouldn't cause the plug-in any problems (although it will obviously take longer to do it's thing). On occasions I have Outlook download many more than 600 messages in one go, and have never had any trouble with spambayes - as have others, I'm sure. This is not to say that something isn't going wrong with spambayes, just that the volume of mail shouldn't be the cause. By "crash the plug-in", what do you mean? Crash Outlook? Stop the plug-in working? Freeze the system? =Tony Meyer From tameyer at ihug.co.nz Wed Nov 19 21:50:28 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Nov 19 21:48:08 2003 Subject: [Spambayes] Getting Evidence (Was: AssertionError: hamcount <=nham) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130420697F@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B15C@its-xchg4.massey.ac.nz> > Attached is a little script I wrote (spamcounts.py) but > haven't yet released. I use it typically to just view the > total spam and ham counts like so: [...] In addition to Skip's script & ideas, you can probably find some of what you want in UserInterface.py (particularly the cvs version, not the 1.0a7 one). There are functions there that generate the clues list and so on, which you can probably graft onto Skip's script. Skip - your script sounds like a useful addition to the utilities directory, unless you don't think it's ready for use by others. =Tony Meyer From tim.one at comcast.net Wed Nov 19 22:00:40 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed Nov 19 22:00:42 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / No modulenamedzlib ? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130212B15B@its-xchg4.massey.ac.nz> Message-ID: [Tony Meyer] > The volume of mail shouldn't cause the plug-in any problems (although > it will obviously take longer to do it's thing). On occasions I have > Outlook download many more than 600 messages in one go, and have > never had any trouble with spambayes - as have others, I'm sure. Oh, I've seen Outlook 2000 crash under heavy load, and often. It started as soon as I got a high-speed connection at home (I never saw it when sucking 1000 emails down over a slow dialup line). That was long before SpamBayes existed, though. It's certainly not happening to me more often than it did before SpamBayes got installed, and I don't recall it happening at all since I enabled the SpamBayes "background processing" option. It bears all the signs of a thread-race bug, and Outlook does run a lot of threads (while SpamBayes does not -- it runs whatever thread Outlook decided to run it in). Depending on *everything* about a particular machine and its typical loads, running another bit of computation (like, say, SpamBayes) simultaneously can make thread-race bugs either more or less likely. I have no reason to believe SpamBayes is "at fault" here, and much evidence to suspect OL2000 is at fault; SpamBayes could be acting as a catalyst, though, by changing the relative timing of threads in such a way as to make an Outlook bug more likely (or less likely!) to bite. From denvin at adelphia.net Wed Nov 19 22:08:36 2003 From: denvin at adelphia.net (Dennis & Vinnie Sullivan) Date: Wed Nov 19 22:09:52 2003 Subject: [Spambayes] Problem with Spambays Toolbar Message-ID: <000101c3af13$97248be0$d162ab44@net.atlaga.adelphia.net> I recently installed SpamBayes and it works great. However, after a week or so, I got a message that there was an error on the program and I was asked if I wanted to uninstall it, and I said yes. I removed SpamBayes from my system completely, but the toolbar still showed, even when SpamBayes was no longer in my system. I manually removed the items on the toolbar since it was uninstalled. I downloaded SpamBayes again expecting the toolbar to reappear. It did not. I now have SpamBayes in my system, but no SpamBayes toolbar. What gives? Vinnie Sullivan Sullivan 1407 South Windward Drive St. Simons Island, GA 31522 Phone: (912) 634-0602 Fax: (912) 634-0612 Email: denvin@adelphia.net From skip at pobox.com Wed Nov 19 22:14:35 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Nov 19 22:14:50 2003 Subject: [Spambayes] spamcounts.py Message-ID: <16316.12699.989316.990539@montanaro.dyndns.org> I checked in a modified version of my spamcounts.py script to the contrib directory of the SpamBayes CVS repository. I pulled out the zero display, added a -t flag to get it to tokenize a message on standard input and added a spam prob column to the output. Output now looks like this: % spamcounts -t < spam1 db: /Users/skip/.hammiedb token,nspam,nham,spam prob content-type:text/html,132,52,0.646438294335 subject:ADV,3,0,0.934782608696 subject:: ,2,0,0.908163265306 subject: ,178,135,0.487504978915 "subject:, ",15,12,0.474589627851 to:no real name:2**0,153,124,0.470976272284 cc:none,188,134,0.503003515571 sender:none,185,101,0.569109950886 reply-to:no real name:2**0,39,34,0.453087648376 to:2**0,192,138,0.500916379669 x-mailer:none,106,85,0.473640904 ... Skip From tameyer at ihug.co.nz Wed Nov 19 23:15:09 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Nov 19 23:12:40 2003 Subject: [Spambayes] sb_imapfilter fails with string index out of range In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130407BE48@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B15E@its-xchg4.massey.ac.nz> > I'm a spambayes newbie, trying to set it up for the first > time. Looked through the archives but didn't see anything > that about this specific problem. I've seen this a couple of times now, but I'm only just getting time to figure it out, sorry. [...] > However, when I try to start the > "daemon" version ( sb_imapfilter.py -c -t -l 5 ) I get a traceback: > [...] > if (arg[0],arg[-1]) in (('(',')'),('"','"')): > IndexError: string index out of range What's happening here is that it's trying to select a folder called "", which is no good (I believe, however, that it should give an 'invalid folder name' error, rather than the traceback, and have submitted a python bug to that effect). This, of course, raises the question of *why* it's trying to use "" as a folder name. My guess is that it's because the unsure/spam folder hasn't been set. (Your configuration file seems to indicate this, too). If that's the case, could you set those, and give it another go and see if that fixes it? (It would be great to hear back, either way). =Tony Meyer From tameyer at ihug.co.nz Wed Nov 19 23:17:29 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Nov 19 23:14:58 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / Nomodule named zlib ? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130407BE7F@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B15F@its-xchg4.massey.ac.nz> [...] > marttila@linux:~> python /usr/local/bin/sb_imapfilter.py -c > -t -l 10 [...] > IndexError: string index out of range I've seen this a couple of times now, but I'm only just getting time to figure it out, sorry. What's happening here is that it's trying to select a folder called "", which is no good (I believe, however, that it should give an 'invalid folder name' error, rather than the traceback, and have submitted a python bug to that effect). This, of course, raises the question of *why* it's trying to use "" as a folder name. My guess is that it's because the unsure/spam folder hasn't been set. If that's the case, could you set those, and give it another go and see if that fixes it? (It would be great to hear back, either way). > BTW, what is Spambayes Manager? A dialog box that users of the Outlook plug-in have. =Tony Meyer From jim_pawlowski at ameritech.net Wed Nov 19 23:16:47 2003 From: jim_pawlowski at ameritech.net (Jim Pawlowski) Date: Wed Nov 19 23:17:08 2003 Subject: [Spambayes] v .81 Suggestion Message-ID: Hi spambayes v .81 is much better than rules for spam One operational enhancement would be buttons in toolbar to delete the spam folders. Click to delete all in spam folder when viewing spam folder. Thanks Jim Pawlowski Pawlowski & Associates mailto:jim_pawlowski@ameritech.net (440) 845-8540 From edrubins at andisplace.com Thu Nov 20 00:37:52 2003 From: edrubins at andisplace.com (Yehudah Rubinsky) Date: Thu Nov 20 00:38:01 2003 Subject: Eudora and Norton (was Re: [Spambayes] question about Spambayes setup...) Message-ID: <5.1.0.14.0.20031120003210.00ad9ec0@localhost> Tom Peters wrote: >[....] >Also, I've gotten it to work with Norton antivirus configured to >scan all outgoing mail, and if you do that wrong with the pop3 proxy, it >will scan all messages twice each. What version are you using? IIRC this was fixed sometime ago. If you haven't already, you should probably upgrade to 1.07a. Note that 1.0a7 requires Python 2.3. Best, Ed From tameyer at ihug.co.nz Thu Nov 20 00:43:24 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Nov 20 00:40:58 2003 Subject: Eudora and Norton (was Re: [Spambayes] question aboutSpambayes setup...) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304206A3C@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B168@its-xchg4.massey.ac.nz> > Note that 1.0a7 requires Python 2.3. It shouldn't - do you know of something specific that needs 2.3? There were some bugs in 1.0a6 that meant that some scripts didn't work pre 2.3, but they should have all be resolved for the 1.0a7 release. (The requirement should be Python 2.2). =Tony Meyer From alinkens at nuera.com Thu Nov 20 05:35:45 2003 From: alinkens at nuera.com (Linkens, Andy) Date: Thu Nov 20 05:35:58 2003 Subject: [Spambayes] Background filtering Message-ID: Hi, I've setup the Outlook Plug-In and trained on a few thousand messages. Everything is working great except for background filtering. The mail always stays in my Inbox until I select the "Filter Messages" option in the pull down menu. Then the filtering is great. Any idea how to get this to work automatically when the mail comes into my Inbox? thanks Andy From gerrit at nl.linux.org Thu Nov 20 06:09:30 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Thu Nov 20 06:09:53 2003 Subject: [Spambayes] Getting Evidence (Was: AssertionError: hamcount <= nham) In-Reply-To: <16315.63426.901164.494473@montanaro.dyndns.org> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> <3FBBAD83.1010602@videotron.ca> <20031119175723.GA22642@nl.linux.org> <16315.45528.694389.991984@montanaro.dyndns.org> <20031119181336.GB7620@nl.linux.org> <16315.52138.672760.740584@montanaro.dyndns.org> <20031119223108.GA18296@nl.linux.org> <16315.63426.901164.494473@montanaro.dyndns.org> Message-ID: <20031120110930.GA3254@nl.linux.org> Skip Montanaro wrote: > Subject: Re: [Spambayes] Getting Evidence (Was: AssertionError: hamcount <= > nham) > Cc: spambayes@python.org > X-Spambayes-Classification: ham; 0.000 (15) ~~ :) > Attached is a little script I wrote (spamcounts.py) but haven't yet > released. I use it typically to just view the total spam and ham counts > like so: Ah, thanks! > How does this help you? Here are a couple ideas: It is certainly useful, even if it's only for usage as it is now. > * modify spamcounts.py to tokenize a message and generate the counts for > all tokens. > > * write a little script which just tokenizes a message and modify > spamcounts to take a stream of tokens from stdin. > > Happy hacking... Yes. Perhaps I shall study how to tokenize or how to use the tokenizer then. Thanks for the script. yours, Gerrit. -- 121. If any one store corn in another man's house he shall pay him storage at the rate of one gur for every five ka of corn per year. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ From Chris.Rees at Charteris.com Thu Nov 20 06:48:33 2003 From: Chris.Rees at Charteris.com (Chris Rees) Date: Thu Nov 20 06:48:53 2003 Subject: [Spambayes] Outlook add-in Message-ID: <686ED8968527F940B87DE86B13FE00671CB4DA@Neptune.TheMandelbrotSet.com> I have the latest version of the add-in installed on Outlook 2003 under Windows XP on my laptop and desktop. At first it worked fine on both. Then it stopped working on the desktop, but continued to work on the laptop. I tried reinstalling Outlook and Spambayes (about three times) and Python, and following the instructions on the Troubleshooting page. I eventually found that in Help>About Microsoft Outlook there is a Disabled items button and there was Spambayes, because Outlook thought that it was interfering! I hit Enable and it works. So I no longer have a problem and thanks for the tip, but you may wish to update the Troubleshooting: the addin is NOT listed on the Com Addins list which shows when you hit the COM addins button. The list is empty. Re-installing did not work, as noted above. The remedy is actually listed in the Troubleshooting page but for Outlook 2002 not 2003. Hope this helps. It's a great piece of software. Regards, Chris Rees _________________________________________________________________ This message from Charteris plc has been checked for viruses http://www.charteris.com/ -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 145 bytes Desc: image001.gif Url : http://mail.python.org/pipermail/spambayes/attachments/20031120/f734abe4/attachment.gif From edrubins at andisplace.com Thu Nov 20 08:58:54 2003 From: edrubins at andisplace.com (Yehudah Rubinsky) Date: Thu Nov 20 08:59:03 2003 Subject: Eudora and Norton (was Re: [Spambayes] question about Spambayes setup...) Message-ID: <5.1.0.14.0.20031120085325.00ad2e30@localhost> Ed >> Note that 1.0a7 requires Python 2.3. Tony >It shouldn't - do you know of something specific that needs 2.3? There were >some bugs in 1.0a6 that meant that some scripts didn't work pre 2.3, but >they should have all be resolved for the 1.0a7 release. >(The requirement should be Python 2.2). The snippet below is from dbmstorage.py, and threw an exception on the return when I first upgraded to 1.0a7 under Python 2.2. def open_dbhash(*args): """Open a bsddb hash. Don't use this on Windows, unless Python 2.3 or greater is used, in which case bsddb3 is actually named bsddb.""" import bsddb return bsddb.hashopen(*args) Best, Ed From acira at front-barnett.com Thu Nov 20 09:28:20 2003 From: acira at front-barnett.com (acira) Date: Thu Nov 20 09:28:27 2003 Subject: [Spambayes] SpamBayes Loads with Error Message-ID: I recognize this bug is already being tracked: [831291] Loads with error - "Could not watch the specified folders". Per Bugs: Tracker Detailed View, there is no resolution as of yet. I want to include my error message in hopes of helping you resolve the problem. I am receiving the same error upon launching Outlook. I have the SpamBayes Outlook Addin, binary version 0.81 (September 9, 2003) installed. The filtering is disabled when the error is displayed. I can enable the filter afterwards and spam filter performs as normal. However, the error is encoutered and filter is disabled every time I launch Outlook 2000 anew. I also re-configured and re-enable the plugin, but this does not help. Also, the add-in is not listed in the COM add-ins. So, I ran the regsvr32.exe spambayes_addin.dll from the SpamBayes directory, this did not add to the list in the COM add-ins. Machine specifications: Dell Dimension 4100 P3 866 256MB Ram 10GB HardDrive Windows 2000 service pack 4 Outlook 2000 Log file is as follows for spambayes1.log: Loaded bayes database from 'C:\Documents and Settings\kgreiss\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\kgreiss\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 10 spam and 1 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.0.2195 (Service Pack 4) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] SpamBayes: Watching for new messages in folder Inbox ERROR: 'There was an error initializing the Spam plugin.\r\n\r\nSpam filtering has been disabled. Please re-configure\r\nand re-enable this plugin\r\n\r\nError details:\r\nCould not watch the specified folders' Traceback (most recent call last): File "out1.pyz/addin", line 1237, in FiltersChanged File "out1.pyz/addin", line 1248, in UpdateFolderHooks File "out1.pyz/addin", line 1273, in _HookFolderEvents File "out1.pyz/msgstore", line 539, in GetOutlookItem File "win32com/gen_py\00062FFF-0000-0000-C000-000000000046x0x9x0\_NameSpace.py", line 50, in GetFolderFromID com_error: (-2147352567, 'Exception occurred.', (4096, 'Microsoft Outlook', 'The operation failed.', None, 0, -2147221241), None) Error processing missed messages! Traceback (most recent call last): File "out1.pyz/addin", line 1193, in OnConnection File "out1.pyz/addin", line 1219, in ProcessMissedMessages File "out1.pyz/addin", line 1263, in _GetHookForFolder KeyError: ('\x00\x00\x00\x008\xa1\xbb\x10\x05\xe5\x10\x1a\xa1\xbb\x08\x00+*V\xc2\x00\x00EMSMDB.DLL\x00\x00\x00\x00\x00\x00\x00\x00\x1bU\xfa \xaaf\x11\xcd\x9b\xc8\x00\xaa\x00/\xc4Z\x0c\x00\x00\x00FBCHI04\x00/o=Trees Front/ou=TREES/cn=Recipients/cn=KGreis\x00', '\x00\x00\x00\x00\xfcY:\xcb)\x03\xd3\x11\xab\x0f\x00\x80_\xa9\xac\x9a\x01\x00\xa1x3\xf9\x02\x85\xd1\x11\xaa\xed\x00\x80_\xa9\xac\x9a\x00\x00\x00\x08F\xc5\x00\x00') Saving configuration -> C:\Documents and Settings\kgreiss\Application Data\SpamBayes\MS Exchange Settings.ini SpamBayes - Disconnecting from Outlook SpamBayes processed 0 messages, finding 0 spam and 0 unsure Addin terminating: 1 COM client and 2 COM servers exist. Andrew Cira Front Barnett Associates LLC 70 W. Madison Street, Suite 4920 Chicago, IL 60602 (312) 641-9589 acira@front-barnett.com From kennypitt at hotmail.com Thu Nov 20 09:43:48 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Thu Nov 20 09:44:22 2003 Subject: Eudora and Norton (was Re: [Spambayes] question aboutSpambayes setup...) In-Reply-To: <5.1.0.14.0.20031120085325.00ad2e30@localhost> Message-ID: Yehudah Rubinsky wrote: > Ed >> Note that 1.0a7 requires Python 2.3. > > Tony >It shouldn't - do you know of something specific that needs 2.3? > There were > >some bugs in 1.0a6 that meant that some scripts didn't work > pre > 2.3, but > >they should have all be resolved for the 1.0a7 release. > >(The requirement should be Python 2.2). > > The snippet below is from dbmstorage.py, and threw an exception on the > return when I first upgraded to 1.0a7 under Python 2.2. > > def open_dbhash(*args): > """Open a bsddb hash. Don't use this on Windows, unless Python 2.3 or > greater is used, in which case bsddb3 is actually named bsddb.""" > import bsddb > return bsddb.hashopen(*args) Have you installed the bsddb3 package for Python 2.2? You can find it here: http://pybsddb.sourceforge.net/. Python 2.2 includes a package named bsddb, but it is not the same as the package of the same name in Python 2.3 and it has proven to be problematic on Windows. If you install the bsddb3 package, then it should be chosen over the bsddb package on Python 2.2 systems. -- Kenny Pitt From edrubins at andisplace.com Thu Nov 20 09:55:21 2003 From: edrubins at andisplace.com (Yehudah Rubinsky) Date: Thu Nov 20 09:55:48 2003 Subject: Eudora and Norton (was Re: [Spambayes] question aboutSpambayes setup...) In-Reply-To: References: <5.1.0.14.0.20031120085325.00ad2e30@localhost> Message-ID: <5.1.0.14.0.20031120095201.03392318@localhost> > >Have you installed the bsddb3 package for Python 2.2? You can find it > >here: http://pybsddb.sourceforge.net/. Yes I had. >Python 2.2 includes a package named bsddb, but it is not the same as the >package of the same name in Python 2.3 and it has proven to be >problematic on Windows. If you install the bsddb3 package, then it >should be chosen over the bsddb package on Python 2.2 systems. It didn't seem to. And since the comment in the code seem to say "get 2.3 young man", I did even though I'm not *8=>). Best, Ed From post at andre-krause.net Thu Nov 20 11:49:08 2003 From: post at andre-krause.net (Andre Krause) Date: Thu Nov 20 11:48:46 2003 Subject: [Spambayes] need freeware tool that works like disruptor-OL ! In-Reply-To: Message-ID: <000201c3af86$37cc4c00$f84c4681@kyb192> hello, i've searched all the web and found quite a lot of commercial outlook plugins that allow me to manually whitelist / blacklist an email by a single click on a button. i found http://www.hlembke.de/prod/disruptor/ very symphatic, but it's not freeware and has many features i do not need (for example buildin blacklist server queries and such). i only need a little cute tool that allows me to build up a whitelist simply by clicking on a button or scanning a outlook directory. no filtering, no server-blacklisting. anyone here has an idea ? thanks in advance, andre From JohnH at snetworking.com Thu Nov 20 12:02:24 2003 From: JohnH at snetworking.com (John Hall) Date: Thu Nov 20 12:03:17 2003 Subject: [Spambayes] RE: [SPAM] Spambayes Digest, Vol 63, Issue 56 Message-ID: <223286A46C32EE42BF7C2E4177EAD42605C856@sylvester2.snetworking.com> -----Original Message----- From: spambayes-request@python.org [mailto:spambayes-request@python.org] Sent: Thursday, November 20, 2003 9:01 AM To: spambayes@python.org Subject: [SPAM] Spambayes Digest, Vol 63, Issue 56 Send Spambayes mailing list submissions to spambayes@python.org To subscribe or unsubscribe via the World Wide Web, visit http://mail.python.org/mailman/listinfo/spambayes or, via email, send a message with subject or body 'help' to spambayes-request@python.org You can reach the person managing the list at spambayes-owner@python.org When replying, please edit your Subject line so it is more specific than "Re: Contents of Spambayes digest..." Today's Topics: 1. Re: Getting Evidence (Was: AssertionError: hamcount <= nham) (Gerrit Holl) 2. Outlook add-in (Chris Rees) 3. RE: Eudora and Norton (was Re: [Spambayes] question about Spambayes setup...) (Yehudah Rubinsky) 4. SpamBayes Loads with Error (acira) 5. RE: Eudora and Norton (was Re: [Spambayes] question aboutSpambayes setup...) (Kenny Pitt) 6. RE: Eudora and Norton (was Re: [Spambayes] question aboutSpambayes setup...) (Yehudah Rubinsky) 7. need freeware tool that works like disruptor-OL ! (Andre Krause) ---------------------------------------------------------------------- Message: 1 Date: Thu, 20 Nov 2003 12:09:30 +0100 From: Gerrit Holl Subject: Re: [Spambayes] Getting Evidence (Was: AssertionError: hamcount <= nham) To: spambayes@python.org Message-ID: <20031120110930.GA3254@nl.linux.org> Content-Type: text/plain; charset=us-ascii Skip Montanaro wrote: > Subject: Re: [Spambayes] Getting Evidence (Was: AssertionError: hamcount <= > nham) > Cc: spambayes@python.org > X-Spambayes-Classification: ham; 0.000 (15) ~~ :) > Attached is a little script I wrote (spamcounts.py) but haven't yet > released. I use it typically to just view the total spam and ham > counts like so: Ah, thanks! > How does this help you? Here are a couple ideas: It is certainly useful, even if it's only for usage as it is now. > * modify spamcounts.py to tokenize a message and generate the counts for > all tokens. > > * write a little script which just tokenizes a message and modify > spamcounts to take a stream of tokens from stdin. > > Happy hacking... Yes. Perhaps I shall study how to tokenize or how to use the tokenizer then. Thanks for the script. yours, Gerrit. -- 121. If any one store corn in another man's house he shall pay him storage at the rate of one gur for every five ka of corn per year. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ ------------------------------ Message: 2 Date: Thu, 20 Nov 2003 11:48:33 -0000 From: "Chris Rees" Subject: [Spambayes] Outlook add-in To: Message-ID: <686ED8968527F940B87DE86B13FE00671CB4DA@Neptune.TheMandelbrotSet.com> Content-Type: text/plain; charset="us-ascii" I have the latest version of the add-in installed on Outlook 2003 under Windows XP on my laptop and desktop. At first it worked fine on both. Then it stopped working on the desktop, but continued to work on the laptop. I tried reinstalling Outlook and Spambayes (about three times) and Python, and following the instructions on the Troubleshooting page. I eventually found that in Help>About Microsoft Outlook there is a Disabled items button and there was Spambayes, because Outlook thought that it was interfering! I hit Enable and it works. So I no longer have a problem and thanks for the tip, but you may wish to update the Troubleshooting: the addin is NOT listed on the Com Addins list which shows when you hit the COM addins button. The list is empty. Re-installing did not work, as noted above. The remedy is actually listed in the Troubleshooting page but for Outlook 2002 not 2003. Hope this helps. It's a great piece of software. Regards, Chris Rees _________________________________________________________________ This message from Charteris plc has been checked for viruses http://www.charteris.com/ -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 145 bytes Desc: image001.gif Url : http://mail.python.org/pipermail/spambayes/attachments/20031120/f734abe4 /attachment-0001.gif ------------------------------ Message: 3 Date: Thu, 20 Nov 2003 08:58:54 -0500 From: Yehudah Rubinsky Subject: RE: Eudora and Norton (was Re: [Spambayes] question about Spambayes setup...) To: spambayes@python.org Message-ID: <5.1.0.14.0.20031120085325.00ad2e30@localhost> Content-Type: text/plain; charset="us-ascii"; format=flowed Ed >> Note that 1.0a7 requires Python 2.3. Tony >It shouldn't - do you know of something specific that needs 2.3? There were >some bugs in 1.0a6 that meant that some scripts didn't work pre 2.3, but >they should have all be resolved for the 1.0a7 release. >(The requirement should be Python 2.2). The snippet below is from dbmstorage.py, and threw an exception on the return when I first upgraded to 1.0a7 under Python 2.2. def open_dbhash(*args): """Open a bsddb hash. Don't use this on Windows, unless Python 2.3 or greater is used, in which case bsddb3 is actually named bsddb.""" import bsddb return bsddb.hashopen(*args) Best, Ed ------------------------------ Message: 4 Date: Thu, 20 Nov 2003 08:28:20 -0600 From: "acira" Subject: [Spambayes] SpamBayes Loads with Error To: Cc: kgreiss Message-ID: Content-Type: text/plain; charset="iso-8859-1" I recognize this bug is already being tracked: [831291] Loads with error - "Could not watch the specified folders". Per Bugs: Tracker Detailed View, there is no resolution as of yet. I want to include my error message in hopes of helping you resolve the problem. I am receiving the same error upon launching Outlook. I have the SpamBayes Outlook Addin, binary version 0.81 (September 9, 2003) installed. The filtering is disabled when the error is displayed. I can enable the filter afterwards and spam filter performs as normal. However, the error is encoutered and filter is disabled every time I launch Outlook 2000 anew. I also re-configured and re-enable the plugin, but this does not help. Also, the add-in is not listed in the COM add-ins. So, I ran the regsvr32.exe spambayes_addin.dll from the SpamBayes directory, this did not add to the list in the COM add-ins. Machine specifications: Dell Dimension 4100 P3 866 256MB Ram 10GB HardDrive Windows 2000 service pack 4 Outlook 2000 Log file is as follows for spambayes1.log: Loaded bayes database from 'C:\Documents and Settings\kgreiss\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\kgreiss\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 10 spam and 1 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.0.2195 (Service Pack 4) using Python 2.3+ (#46, Aug 6 2003, 16:39:24) [MSC v.1200 32 bit (Intel)] SpamBayes: Watching for new messages in folder Inbox ERROR: 'There was an error initializing the Spam plugin.\r\n\r\nSpam filtering has been disabled. Please re-configure\r\nand re-enable this plugin\r\n\r\nError details:\r\nCould not watch the specified folders' Traceback (most recent call last): File "out1.pyz/addin", line 1237, in FiltersChanged File "out1.pyz/addin", line 1248, in UpdateFolderHooks File "out1.pyz/addin", line 1273, in _HookFolderEvents File "out1.pyz/msgstore", line 539, in GetOutlookItem File "win32com/gen_py\00062FFF-0000-0000-C000-000000000046x0x9x0\_NameSpace.p y", line 50, in GetFolderFromID com_error: (-2147352567, 'Exception occurred.', (4096, 'Microsoft Outlook', 'The operation failed.', None, 0, -2147221241), None) Error processing missed messages! Traceback (most recent call last): File "out1.pyz/addin", line 1193, in OnConnection File "out1.pyz/addin", line 1219, in ProcessMissedMessages File "out1.pyz/addin", line 1263, in _GetHookForFolder KeyError: ('\x00\x00\x00\x008\xa1\xbb\x10\x05\xe5\x10\x1a\xa1\xbb\x08\x00+*V\xc2\x 00\x00EMSMDB.DLL\x00\x00\x00\x00\x00\x00\x00\x00\x1bU\xfa \xaaf\x11\xcd\x9b\xc8\x00\xaa\x00/\xc4Z\x0c\x00\x00\x00FBCHI04\x00/o=Tre es Front/ou=TREES/cn=Recipients/cn=KGreis\x00', '\x00\x00\x00\x00\xfcY:\xcb)\x03\xd3\x11\xab\x0f\x00\x80_\xa9\xac\x9a\x0 1\x00\xa1x3\xf9\x02\x85\xd1\x11\xaa\xed\x00\x80_\xa9\xac\x9a\x00\x00\x00 \x08F\xc5\x00\x00') Saving configuration -> C:\Documents and Settings\kgreiss\Application Data\SpamBayes\MS Exchange Settings.ini SpamBayes - Disconnecting from Outlook SpamBayes processed 0 messages, finding 0 spam and 0 unsure Addin terminating: 1 COM client and 2 COM servers exist. Andrew Cira Front Barnett Associates LLC 70 W. Madison Street, Suite 4920 Chicago, IL 60602 (312) 641-9589 acira@front-barnett.com ------------------------------ Message: 5 Date: Thu, 20 Nov 2003 09:43:48 -0500 From: "Kenny Pitt" Subject: RE: Eudora and Norton (was Re: [Spambayes] question aboutSpambayes setup...) To: "'Yehudah Rubinsky'" , Message-ID: Content-Type: text/plain; charset="us-ascii" Yehudah Rubinsky wrote: > Ed >> Note that 1.0a7 requires Python 2.3. > > Tony >It shouldn't - do you know of something specific that needs 2.3? > There were > >some bugs in 1.0a6 that meant that some scripts didn't work > pre > 2.3, but > >they should have all be resolved for the 1.0a7 release. > >(The requirement should be Python 2.2). > > The snippet below is from dbmstorage.py, and threw an exception on the > return when I first upgraded to 1.0a7 under Python 2.2. > > def open_dbhash(*args): > """Open a bsddb hash. Don't use this on Windows, unless Python 2.3 or > greater is used, in which case bsddb3 is actually named bsddb.""" > import bsddb > return bsddb.hashopen(*args) Have you installed the bsddb3 package for Python 2.2? You can find it here: http://pybsddb.sourceforge.net/. Python 2.2 includes a package named bsddb, but it is not the same as the package of the same name in Python 2.3 and it has proven to be problematic on Windows. If you install the bsddb3 package, then it should be chosen over the bsddb package on Python 2.2 systems. -- Kenny Pitt ------------------------------ Message: 6 Date: Thu, 20 Nov 2003 09:55:21 -0500 From: Yehudah Rubinsky Subject: RE: Eudora and Norton (was Re: [Spambayes] question aboutSpambayes setup...) To: spambayes@python.org Message-ID: <5.1.0.14.0.20031120095201.03392318@localhost> Content-Type: text/plain; charset="us-ascii"; format=flowed > >Have you installed the bsddb3 package for Python 2.2? You can find it > >here: http://pybsddb.sourceforge.net/. Yes I had. >Python 2.2 includes a package named bsddb, but it is not the same as the >package of the same name in Python 2.3 and it has proven to be >problematic on Windows. If you install the bsddb3 package, then it >should be chosen over the bsddb package on Python 2.2 systems. It didn't seem to. And since the comment in the code seem to say "get 2.3 young man", I did even though I'm not *8=>). Best, Ed ------------------------------ Message: 7 Date: Thu, 20 Nov 2003 17:49:08 +0100 From: "Andre Krause" Subject: [Spambayes] need freeware tool that works like disruptor-OL ! To: Message-ID: <000201c3af86$37cc4c00$f84c4681@kyb192> Content-Type: text/plain; charset="us-ascii" hello, i've searched all the web and found quite a lot of commercial outlook plugins that allow me to manually whitelist / blacklist an email by a single click on a button. i found http://www.hlembke.de/prod/disruptor/ very symphatic, but it's not freeware and has many features i do not need (for example buildin blacklist server queries and such). i only need a little cute tool that allows me to build up a whitelist simply by clicking on a button or scanning a outlook directory. no filtering, no server-blacklisting. anyone here has an idea ? thanks in advance, andre ------------------------------ _______________________________________________ Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html End of Spambayes Digest, Vol 63, Issue 56 ***************************************** From atom at suspicious.org Thu Nov 20 12:19:44 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Thu Nov 20 12:20:12 2003 Subject: [Spambayes] any *nix users on list? In-Reply-To: <000201c3af86$37cc4c00$f84c4681@kyb192> References: <000201c3af86$37cc4c00$f84c4681@kyb192> Message-ID: i'm in a 100% microsoft-free environment (and very happy about it!), and was wondering about what percentage of the list is *nix based and what percentage is M$ based. anyway, i just installed it and i'm using it with qmail and a text-based server-side MUA. procmail is installed on the server, but i'm not using it. within the next few weeks (after more testing) i should release some cool scripts (including my procmail-free .qmail-file) for this, if anyone is interested. ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "I would rather fail in a cause that will ultimately triumph than triumph in a cause that will ultimately fail." -- Woodrow Wilson From skip at pobox.com Thu Nov 20 12:41:56 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Nov 20 12:42:09 2003 Subject: [Spambayes] any *nix users on list? In-Reply-To: References: <000201c3af86$37cc4c00$f84c4681@kyb192> Message-ID: <16316.64740.939014.631685@montanaro.dyndns.org> atom> i'm in a 100% microsoft-free environment (and very happy about atom> it!), and was wondering about what percentage of the list is *nix atom> based and what percentage is M$ based. There are a number of us. I happen to use SB in a MacOSX/sendmail/procmail environment. Skip From gerrit at nl.linux.org Thu Nov 20 12:53:54 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Thu Nov 20 12:54:25 2003 Subject: [Spambayes] Getting Evidence In-Reply-To: <16315.63426.901164.494473@montanaro.dyndns.org> References: <20031119115424.GA4439@nl.linux.org> <20031119174213.GA7318@nl.linux.org> <3FBBAD83.1010602@videotron.ca> <20031119175723.GA22642@nl.linux.org> <16315.45528.694389.991984@montanaro.dyndns.org> <20031119181336.GB7620@nl.linux.org> <16315.52138.672760.740584@montanaro.dyndns.org> <20031119223108.GA18296@nl.linux.org> <16315.63426.901164.494473@montanaro.dyndns.org> Message-ID: <20031120175354.GA10130@nl.linux.org> Skip Montanaro wrote: > How does this help you? Here are a couple ideas: > > * modify spamcounts.py to tokenize a message and generate the counts for > all tokens. > > * write a little script which just tokenizes a message and modify > spamcounts to take a stream of tokens from stdin. > > Happy hacking... I have chosen for an even simpler solution: #!/usr/bin/python import sys import email import spamcounts m = email.message_from_file(sys.stdin) if not m["X-Spambayes-Evidence"]: sys.exit("No evidence found!") d = eval("{" + m["X-Spambayes-Evidence"].replace(';', ',') + "}") spamcounts.main(d.keys()) it's not all that elegant, but it works :) yours, Gerrit. -- 243. As rent of herd cattle he shall pay three gur of corn to the owner. -- 1780 BC, Hammurabi, Code of Law -- Asperger Syndroom - een persoonlijke benadering: http://people.nl.linux.org/~gerrit/ Kom in verzet tegen dit kabinet: http://www.sp.nl/ From peterl at standingwave.org Thu Nov 20 16:45:52 2003 From: peterl at standingwave.org (Peter Loron) Date: Thu Nov 20 16:46:03 2003 Subject: [Spambayes] any *nix users on list? In-Reply-To: References: <000201c3af86$37cc4c00$f84c4681@kyb192> Message-ID: <3FBD3610.6020304@standingwave.org> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Yep. I run a FreeBSD box and use procmail to filter everything server-side. Since I'm here, a question for the list... If this is answere in the Wiki or some other place, please point me at it. I was unable to find it: I access my mail through SSL tunnled IMAP from multiple clients on different machines. Can I set up the IMAPproxy on the server and connect to it that way, instead of having it local on each client? Is there any advantage to doing that over using procmail to filter things into folders that I then acces with the mail client? Thanks. - -Pete Atom 'Smasher' wrote: | i'm in a 100% microsoft-free environment (and very happy about it!), and | was wondering about what percentage of the list is *nix based and what | percentage is M$ based. | | anyway, i just installed it and i'm using it with qmail and a text-based | server-side MUA. procmail is installed on the server, but i'm not using | it. within the next few weeks (after more testing) i should release some | cool scripts (including my procmail-free .qmail-file) for this, if anyone | is interested. | | | ...atom | | _______________________________________________ | PGP key - http://smasher.suspicious.org/pgp.txt | 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 | ------------------------------------------------- | | "I would rather fail in a cause that will ultimately triumph | than triumph in a cause that will ultimately fail." | -- Woodrow Wilson | | | _______________________________________________ | Spambayes@python.org | http://mail.python.org/mailman/listinfo/spambayes | Check the FAQ before asking: http://spambayes.sf.net/faq.html -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.2 (MingW32) Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org iD8DBQE/vTYQaipdndPOPFMRAjiyAJ0ejn17yFO+w7H6opFk9OzgTXX62QCgrgHE ts32skseqBEa4L3oN8UPZx4= =8MkH -----END PGP SIGNATURE----- From kbirkett at flash.net Thu Nov 20 21:54:34 2003 From: kbirkett at flash.net (Katharine Birkett) Date: Thu Nov 20 21:36:51 2003 Subject: [Spambayes] initial training bug--but I love spambayes anyway Message-ID: Here's the bug: I have Windows 98 and Office 2000. When I installed Spambayes I decided to set up two folders to pre-sort and pre-train. I was able to set up the two folders, and I filled them with stuff, but at the point in the installation where I was asked which folders they were, I could navigate to the folders, and I could select them, but I couldn't get the program to know that I had selected them. Nothing ever appeared in those blanks where the folder names were supposed to appear, and I couldn't enter the folder names directly, either. So I couldn't proceed with that option. So I backed up the installation, told it I hadn't done any pre-sorting, and went with the train-as-you-go option instead. We've been using Spambayes for about two weeks now. I keep on lowering the upper bound. We love your program!! With the upper bound at 70 and the lower bound at 15 we get a junk folder with only bad stuff (almost all the porn lands there), a suspects folder with mostly mortgage spam, and an inbox with almost only what we want in it. You have given us our inbox back. We get 500 to 700 emails a week, I guess, and 95% is junk. Soon we will no longer even check the junk folder--I'm just checking it now to try to optimize the upper bound. THANK YOU!! Katharine Birkett P.S. We'd like to send you a contribution--where and how? From tim at fourstonesExpressions.com Thu Nov 20 21:43:55 2003 From: tim at fourstonesExpressions.com (Tim Stone) Date: Thu Nov 20 21:44:03 2003 Subject: [Spambayes] initial training bug--but I love spambayes anyway In-Reply-To: References: Message-ID: I'll leave the bug to the outlook gurus... I'm guessing, though, that Win98 is the problem. As for donations, here's the url, with an explanation of our wishes along those lines. Thanks for your support! On Thu, 20 Nov 2003 21:54:34 -0500, Katharine Birkett wrote: > > > Here's the bug: I have Windows 98 and Office 2000. When I installed > Spambayes I decided to set up two folders to pre-sort and pre-train. I > was > able to set up the two folders, and I filled them with stuff, but at the > point in the installation where I was asked which folders they were, I > could > navigate to the folders, and I could select them, but I couldn't get the > program to know that I had selected them. Nothing ever appeared in those > blanks where the folder names were supposed to appear, and I couldn't > enter > the folder names directly, either. So I couldn't proceed with that > option. > So I backed up the installation, told it I hadn't done any pre-sorting, > and > went with the train-as-you-go option instead. > > We've been using Spambayes for about two weeks now. I keep on lowering > the > upper bound. We love your program!! With the upper bound at 70 and the > lower > bound at 15 we get a junk folder with only bad stuff (almost all the porn > lands there), a suspects folder with mostly mortgage spam, and an inbox > with > almost only what we want in it. You have given us our inbox back. We get > 500 > to 700 emails a week, I guess, and 95% is junk. Soon we will no longer > even > check the junk folder--I'm just checking it now to try to optimize the > upper > bound. THANK YOU!! > > Katharine Birkett > > P.S. We'd like to send you a contribution--where and how? > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > -- Vous exprimer; Expr?sese; Te stesso esprimere; Express yourself! Tim Stone www.fourstonesExpressions.com From tameyer at ihug.co.nz Thu Nov 20 22:15:37 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Thu Nov 20 22:18:13 2003 Subject: [Spambayes] initial training bug--but I love spambayes anyway In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304206D14@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29DF@its-xchg4.massey.ac.nz> [Katharine Birkett] > I was able to set up the two folders, and > I filled them with stuff, but at the point in the > installation where I was asked which folders they were, > I could navigate to the folders, and I could select them, > but I couldn't get the program to know that I had > selected them. Nothing ever appeared in those blanks > where the folder names were supposed to appear, and I > couldn't enter the folder names directly, either. Is this with the Outlook plug-in? So the checkmarks appeared next to the names, but then it didn't use the folders? Was this with the wizard, or via the spambayes manager dialog? Was this version 008.1 of the plug-in? Did any error message appear? > P.S. We'd like to send you a contribution--where and how? [Tim Stone] > As for donations, here's the url, with an explanation of our > wishes along those lines. Thanks for your support! Tim meant to include this, I think: =Tony Meyer From atom at suspicious.org Fri Nov 21 00:40:04 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Fri Nov 21 00:40:26 2003 Subject: [Spambayes] need .spambayesrc in *nix Message-ID: is there a way generate an auto-pilot .spambayesrc? are there any other .spambayes* files? ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "When the President does it, that means that it's not illegal." -- Richard M. Nixon, 19 May 1977 From atom at suspicious.org Fri Nov 21 00:44:05 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Fri Nov 21 00:44:25 2003 Subject: [Spambayes] need .spambayesrc in *nix In-Reply-To: References: Message-ID: also, where is the documentation for the stuff that lives in that file? thanks.... ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "I don't know anything about music. In my line you don't have to." -- Elvis Presley From tameyer at ihug.co.nz Fri Nov 21 00:53:37 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Fri Nov 21 00:53:46 2003 Subject: [Spambayes] need .spambayesrc in *nix In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304206D66@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29E0@its-xchg4.massey.ac.nz> > is there a way generate an auto-pilot .spambayesrc? Do you mean generate one automatically? There is an unfinished script that will do this, for various mail clients. It's called autoconfigure.py and is in the (spambayes) windows directory. The docstring has details about how to use it. Note that it has only been tested with windows clients so far - how well it works on other platforms depends on how similar the various mail clients' configuration files are. Contributions to this (and anything else) are welcome. > are there any other .spambayes* files? SpamBayes will create two database files as well, but it takes care of managing those (although you can set where they are located and what format they are in). > also, where is the documentation for the stuff that lives in that file? Look at FAQs 4.11 and 4.12: =Tony Meyer From atom at suspicious.org Fri Nov 21 01:25:03 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Fri Nov 21 01:25:30 2003 Subject: [Spambayes] need .spambayesrc in *nix In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F29E0@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13026F29E0@its-xchg4.massey.ac.nz> Message-ID: > > are there any other .spambayes* files? > > SpamBayes will create two database files as well, but it takes care of > managing those (although you can set where they are located and what format > they are in). ========================= i seem to have these databases: .hammiedb.bak .hammiedb.dat .hammiedb.dir i thought some docs also mentioned: .spambayes_hammie.db.dat .spambayes_hammie.db.dir are those still used? or is that an older version of what i have? > > also, where is the documentation for the stuff that lives in that file? > > Look at FAQs 4.11 and 4.12: > > > like-but-what-options-do-i-need-to-set> ========================= a *LOT* of stuff! i'll read through that, but it seems to work fine with no rc file... i guess it just falls back on the defaults... thanks.... ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "Government is not the solution to our problem, government is the problem." -- Ronald Reagan Inaugural Address, 20 Jan 1981 From jccoe at sbcglobal.net Fri Nov 21 01:27:25 2003 From: jccoe at sbcglobal.net (Jack Coe) Date: Fri Nov 21 01:27:26 2003 Subject: [Spambayes] How do I uninstall spambayes from outlook 2003 Message-ID: I would like to compare outlooks spam program with spambayer . I need to uninstall add in for outlook . Thanks --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.534 / Virus Database: 329 - Release Date: 10/31/2003 From tim.one at comcast.net Fri Nov 21 01:30:48 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Nov 21 01:30:54 2003 Subject: [Spambayes] question about Spambayes setup... In-Reply-To: <20031119193738.3430.qmail@web41901.mail.yahoo.com> Message-ID: [fred fredwin] > Greetings ! > I recently came across SpamBayes and am very excited about using it > (tried other spam filtering software, and it was a pretty big waste > of time). I've got it set up, and it works fine, but I want to > know if anyone has any setup tips? Well, everyone seems to work in a different way. We haven't yet done appropriate research on effective real-life training schemes, but the good news is that just about anything works. The most important rule of thumb gained from experience is that things work best when the # of ham and # of spam trained on are approximately the same. > I was thinking of having a folder with known bad spam, which would > not be deleted and kept in Outlook today, along with known ham, > kept in another folder, each having about 500 messages to compare > incoming spam to, which would periodically take new messages (ham > and spam), put them in the respective folder, and then periodically > delete old spam/ham messages. This would keep an updated list of > spam/ham, to stay on par with the spammers (they always were one > step ahead any kind of spam filtering software). any suggestions > would be appreciated... I created a distinct .pst file, which holds (just) my Ham and Spam training-data folders. SpamBayes is configured to move spam directly into that Spam folder, and Unsures into a "Z UNSURE" folder in my main .pst file ("Z UNSURE" so it falls to the bottom of the tree display). About once a day I move all the new spam into my Deleted Items folder, which has a Spam score column and is set to sort on that. Z UNSURE is also set to sort on Spam score. That makes it easy to eyeball one end of the displays for mistakes. When I need to train on a ham, I drag a *copy* of it to the Ham training folder (hold down right mouse button while dragging, and select Copy from the context menu that pops up when you release the button). After seeding with a few hundred msgs of each kind, you're probably going to find that you don't need much training anymore. This isn't a rule-based system, so spammers can't evade it by "learning the rules" and crafting spam to get around them. On rare occasions they invent a new way of obfuscating HTML that's actually effective, and I notice that by staring at the guts of low-scoring Unsures; then I check in a change to the tokenizer to de-obfuscate it; I think I've done that about 3 times over the past year. Spammers really aren't getting any better at fooling this system. This isn't surprising, since rule-based systems dominate the commercial spam-blocking market, and spam is a mass-market game. SpamBayes isn't yet worth the bother of targeting. When you see a low-scoring spam, train on it; the system does learn. Anyway, there are a couple reasons I use a distinct .pst file for my training data. One is so that I can retrain from scratch in an eyeblink. Since I work on the guts of this system, that's important so I can judge the effect of changes. It also makes it possible to recover from database corruption in an eyeblink, but I've never seen that happen (*some* people do -- this still isn't understood, alas). But the primary reason I use a different .pst is that I need to copy my main .pst file between desktop and laptop several times per week, and it saves major "sit and wait" time to keep that file as small as possible. I don't bother trying to keep the classifiers in synch across machines; they each have their own database. Every now & again I blow away my classifier and start over from scratch. This is just because I enjoy watching it learn. There are also a number of practical advantages to keeping your training database relatively small. If you end up maintaining a database with many thousands of ham and spam, that has a way of turning into its own kind of time sink. Ah! Add a Spam score column to your Ham and Spam training folders, and sort on that column. Every now & again rescore all the messages in them (SpamBayes -> Filter messages ...). Look at "the wrong end" then (at high-scoring training ham and low-scoring training spam). If you made any mistakes in classifying (everyone eventually does!), they'll almost always show up at the wrong end. The worst thing you can do with this kind of system is train a message into the wrong category -- it has no predefined notions of what "ham" and "spam" are, and believes whatever you tell it to believe. That's mostly a great strength, but it's also a great weakness when you make a mistake. Caution: Don't get too complacent too soon. Many kinds of commercial email you want (whether ordering products online, or getting a flashy company newsletter) have a lot in common with most peoples' idea of what spam is, because they're all trying to sell you something. The language of advertising is distinctive, and commercial email you want is quite likely to score as a high Unsure, or even as Spam, the first time or two you get one from a given company. Just keep your eyes open and train accordingly. Also remember that it doesn't matter what anyone *else* insists "spam" is. For example, I get some commercial email from companies that have a good case for sending it to me (for example, I was a customer, or even signed up for a newsletter once), but that I don't want anymore. I just tell my classifier it's spam, and I'm not bothered with it anymore. There are also some kinds of dubious messages I enjoy (spam *can* be highly entertaining, in a train-wreck sort of way), and I tell my classifier those are ham. That's OK with SpamBayes -- it's happy to classify any way you tell it to. From tim.one at comcast.net Fri Nov 21 01:36:19 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Nov 21 01:36:23 2003 Subject: [Spambayes] How do I uninstall spambayes from outlook 2003 In-Reply-To: Message-ID: [Jack Coe] > I would like to compare outlooks spam program with spambayer . I > need to uninstall add in for outlook . You don't need to uninstall for that. Just uncheck the "Enable Spambayes" box on the default (General) tab of the SpamBayes -> SpamBayes Manager dialog. From tameyer at ihug.co.nz Fri Nov 21 01:42:21 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Fri Nov 21 01:42:25 2003 Subject: [Spambayes] need .spambayesrc in *nix In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304206D75@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> > i seem to have these databases: > .hammiedb.bak .hammiedb.dat .hammiedb.dir > > i thought some docs also mentioned: > .spambayes_hammie.db.dat .spambayes_hammie.db.dir > > are those still used? or is that an older version of what i have? If you have .dat and .dir files, this means you're using dumbdbm (the FAQ has stuff about this, too). This is *not* a good idea; in fact recent versions of SpamBayes refuse to let you use it. See the FAQ for details about switching over to a pickle or bsddb. > > > also, where is the documentation for the stuff that lives > in that file? [...] > a *LOT* of stuff! i'll read through that, but it seems to > work fine with no rc file... i guess it just falls back on > the defaults... Indeed. SpamBayes is *very* configurable - but the only details you actually need to give it to work just fine are server details if you're using the POP3 proxy, or server, username and folder details if the IMAP filter. (If you're using sb_filter, or something like that, then none at all should be fine, IIRC). The majority of other options will probably only be necessary if you feel like testing or developing. =Tony Meyer From atom at suspicious.org Fri Nov 21 02:32:56 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Fri Nov 21 02:33:21 2003 Subject: [Spambayes] need .spambayesrc in *nix In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> Message-ID: > > i seem to have these databases: > > .hammiedb.bak .hammiedb.dat .hammiedb.dir > > > > i thought some docs also mentioned: > > .spambayes_hammie.db.dat .spambayes_hammie.db.dir > > > > are those still used? or is that an older version of what i have? > > If you have .dat and .dir files, this means you're using dumbdbm (the FAQ > has stuff about this, too). This is *not* a good idea; in fact recent > versions of SpamBayes refuse to let you use it. See the FAQ for details > about switching over to a pickle or bsddb. ======================== cool... i just installed pybsddb and rebuilt the database (with the "-f" option of mboxtrain.py)... the file is smaller and the filtering time dropped to 25% of the time that it was before. and i'll guess it'll be ready to go when i upgrade spambayes... > > > > also, where is the documentation for the stuff that lives > > in that file? > [...] > > a *LOT* of stuff! i'll read through that, but it seems to > > work fine with no rc file... i guess it just falls back on > > the defaults... > > Indeed. SpamBayes is *very* configurable - but the only details you > actually need to give it to work just fine are server details if you're > using the POP3 proxy, or server, username and folder details if the IMAP > filter. (If you're using sb_filter, or something like that, then none at > all should be fine, IIRC). The majority of other options will probably only > be necessary if you feel like testing or developing. =================== i'm using hammiefilter.py and it it seems fine without a config... but it's good to know i can tweak it, just in case ;) ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "Computers in the future may weigh no more than 1.5 tons." -- Popular Mechanics, 1949 From atom at suspicious.org Fri Nov 21 02:37:51 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Fri Nov 21 02:38:13 2003 Subject: [Spambayes] need .spambayesrc in *nix In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F29E0@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13026F29E0@its-xchg4.massey.ac.nz> Message-ID: > SpamBayes will create two database files as well, but it takes care of > managing those (although you can set where they are located and what format > they are in). ============================ two db files? i only see the ~/.hammiedb i'm using 1.0.a.4, if it makes a difference. ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "You can't hold a man down without staying down with him." -- Booker T Washington From atom at suspicious.org Fri Nov 21 03:01:09 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Fri Nov 21 03:01:31 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> Message-ID: ok... i just tried to upgrade from a4 to a7 and i'm getting error messages like: $ sb_mboxtrain.py -f -d .hammiedb -g Maildir/ Traceback (most recent call last): File "/usr/local/bin/sb_mboxtrain.py", line 315, in ? main() File "/usr/local/bin/sb_mboxtrain.py", line 296, in main h = hammie.open(pck, usedb, "c") File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 258, in open return Hammie(storage.open_storage(filename, useDB, mode)) File "/usr/local/lib/python2.3/site-packages/spambayes/storage.py", line 675, in open_storage return klass(data_source_name, mode) File "/usr/local/lib/python2.3/site-packages/spambayes/storage.py", line 162, in __init__ self.load() File "/usr/local/lib/python2.3/site-packages/spambayes/storage.py", line 187, in load self.dbm = dbmstorage.open(self.db_name, self.mode) File "/usr/local/lib/python2.3/site-packages/spambayes/dbmstorage.py", line 60, in open return f(db_name, mode) File "/usr/local/lib/python2.3/site-packages/spambayes/dbmstorage.py", line 19, in open_dbhash import bsddb File "/usr/local/lib/python2.3/bsddb/__init__.py", line 40, in ? import _bsddb ImportError: No module named _bsddb ???????????????????????????????????????????????????? any ideas? i had pybsddb working with a4. ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "A function of free speech under our free system of government is to invite dispute. It may indeed best serve its high purposes when it induces a condition of unrest, creates dissatisfaction with conditions as they are, or even stirs people to anger." -- Supreme Court Justice William O. Douglas Terminiello v. Chicago (1949) From tameyer at ihug.co.nz Fri Nov 21 03:30:47 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Fri Nov 21 03:30:55 2003 Subject: [Spambayes] need .spambayesrc in *nix In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304206DA6@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B175@its-xchg4.massey.ac.nz> [Tony] > SpamBayes will create two database files as well, but it > takes care of managing those (although you can set where > they are located and what format they are in). [Atom] > two db files? i only see the ~/.hammiedb > > i'm using 1.0.a.4, if it makes a difference. No, my mistake. The second one is a 'messageinfo' database, but you'll only get that if you use the POP3 proxy or IMAP filter. (All my messages in this thread ignore the Outlook plug-in). =Tony Meyer From tameyer at ihug.co.nz Fri Nov 21 03:34:28 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Fri Nov 21 03:34:33 2003 Subject: [Spambayes] any *nix users on list? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304206C52@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B176@its-xchg4.massey.ac.nz> > I access my mail through SSL tunnled IMAP from multiple > clients on different machines. Can I set up the IMAPproxy on > the server and connect to it that way, instead of having it > local on each client? Firstly - the (current) SpamBayes IMAP solution is a filter, not a proxy. But, yes, you can run this wherever you like, as long as it can connect to the IMAP server. > Is there any advantage to doing that > over using procmail to filter things into folders that I then > acces with the mail client? I don't really know :) I guess it depends on how simple it is to get the procmail setup going - if it's easy enough, then maybe you don't need imapfilter. All the imapfilter script does, really, is take care of getting the contents of messages and then does the same thing as mboxtrain or sb_filter (and various housekeeping, so that messages can be identified in the future). =Tony Meyer From atom at suspicious.org Fri Nov 21 05:16:22 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Fri Nov 21 05:16:49 2003 Subject: [Spambayes] *nix error after upgrade // python version problem? In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> Message-ID: this is weird... i just tried this on 2 freeBSD boxes and on both of them spambayes can access it's db fine with python-2.2.3_2 and fails miserably with python-2.3.2_2. upgrade, test. downgrade, test. upgrade, test. repeat. any ideas? is this a known issue? should i go to sleep now? On Fri, 21 Nov 2003, Atom 'Smasher' wrote: > ok... i just tried to upgrade from a4 to a7 and i'm getting error messages > like: > > $ sb_mboxtrain.py -f -d .hammiedb -g Maildir/ > Traceback (most recent call last): > File "/usr/local/bin/sb_mboxtrain.py", line 315, in ? > main() > File "/usr/local/bin/sb_mboxtrain.py", line 296, in main > h = hammie.open(pck, usedb, "c") > File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 258, in open > return Hammie(storage.open_storage(filename, useDB, mode)) > File "/usr/local/lib/python2.3/site-packages/spambayes/storage.py", line 675, in open_storage > return klass(data_source_name, mode) > File "/usr/local/lib/python2.3/site-packages/spambayes/storage.py", line 162, in __init__ > self.load() > File "/usr/local/lib/python2.3/site-packages/spambayes/storage.py", line 187, in load > self.dbm = dbmstorage.open(self.db_name, self.mode) > File "/usr/local/lib/python2.3/site-packages/spambayes/dbmstorage.py", line 60, in open > return f(db_name, mode) > File "/usr/local/lib/python2.3/site-packages/spambayes/dbmstorage.py", line 19, in open_dbhash > import bsddb > File "/usr/local/lib/python2.3/bsddb/__init__.py", line 40, in ? > import _bsddb > ImportError: No module named _bsddb > > ???????????????????????????????????????????????????? > > any ideas? i had pybsddb working with a4. ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "The more laws, the less justice." -- Marcus Tullius Cicero De Officiis From ian at astounding.org.uk Fri Nov 21 05:38:19 2003 From: ian at astounding.org.uk (Ian Smith) Date: Fri Nov 21 05:35:54 2003 Subject: [Spambayes] need .spambayesrc in *nix In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F29E0@its-xchg4.massey.ac.nz> Message-ID: On Fri, 21 Nov 2003, Tony Meyer wrote: [of .spambayesrc] > > > also, where is the documentation for the stuff that lives in that file? > > Look at FAQs 4.11 and 4.12: > > > like-but-what-options-do-i-need-to-set> Also, I found you can read the file Options.py (can't remebre where it is when installed, but it was fairly easy to find). The format of that is such that working out what goes into teh rc file. At least, that's what I did, and it seems to have worked... regards, Ian SMith From czhou at womble.com Fri Nov 21 09:35:42 2003 From: czhou at womble.com (Chang Zhou) Date: Fri Nov 21 09:34:27 2003 Subject: [Spambayes] Suggestion: make "SpamBayes Manager->Training->Start Training" a top level button? Message-ID: <000801c3b03c$be50ab70$6f02a8c0@allen> Hello, I frequently use the command sequence above, which requires a total of 5 mouse clicks to include a few new messages in the good and spam lists: 1. click on SpamBayes to open the pulldown menu 2. click on the "SpamBayes Manager" item 3. click on the "Training" page 4. click on the "Start Training" 5. click on the "Close" button Infrequently, I will go into the SpamBayes Manager to add a few new good message folders; rarely I will touch its other controls. Thanks for your great spam filter. Best regards, --- Chang From daury.powell at unisys.com Fri Nov 21 12:09:40 2003 From: daury.powell at unisys.com (Powell, Daury L) Date: Fri Nov 21 12:42:00 2003 Subject: [Spambayes] problem Message-ID: <5AB7FF75FC8DC746A43B8E4E60673B72190FB8@usaus-exch1.na.uis.unisys.com> just had it installed a few days--when one of my co-workers send me a message, it arrives in the inbox for less than a second and dissappears--it doesn't show up in the spam file, deleted file, or anywhere I can find. Also, at times, I get a indicator that I have recieved mail, but there is none in the in box, or in the spam file. From mossesq at earthlink.net Fri Nov 21 12:54:11 2003 From: mossesq at earthlink.net (Jeff) Date: Fri Nov 21 12:54:13 2003 Subject: [Spambayes] It stopped working!! Message-ID: <000401c3b058$789f6af0$6501a8c0@jmosslpc> I do not know what happened, but Spambayes was working great, then stopped! I tried uninstall and redownloading and reinstall, but it made no difference. Even after uninstall and before the download, the "Delete as Spam" Icon and the "Spambays" drop down were there, but clicking on Delete as spam did nothing, and there was no drop down on the spambayse key. I used to get a drop down box re: the manager. What happened? Jeff Moss -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 145 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031121/5c8a6d7c/attachment.gif From papaDoc at videotron.ca Fri Nov 21 14:14:04 2003 From: papaDoc at videotron.ca (papaDoc) Date: Fri Nov 21 14:14:09 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> <3FBE15B2.4060501@videotron.ca> Message-ID: <3FBE63FC.9030003@videotron.ca> Hi, >no-go... > >i've got: >db3-3.3.11,1 The Berkeley DB package, revision 3.3 >db4-4.0.14_1,1 The Berkeley DB package, revision 4 >db41-4.1.25_1 The Berkeley DB package, revision 4.1 > > >even with the 4* DBs removed, and only db3 installed, it works fine with >old python, craps on new python. > >i don't code in python, so i don't know where to start.... > > >On Fri, 21 Nov 2003, papaDoc wrote: > > > >>Hi, >> >>I think you should try to install bsddb3 there was a email on that topic >>a few days ago. >> >>Remi >> >> >> >>>ok... i just tried to upgrade from a4 to a7 and i'm getting error messages >>>like: >>> >>> >>> File "/usr/local/lib/python2.3/bsddb/__init__.py", line 40, in ? >>> import _bsddb >>>ImportError: No module named _bsddb >>> I think ( can someone correct me if I'm wrong) that you don't get the bsddb you expect. Try to add at the beginning of your environment variable PYTHONPATH the directory of db3-3 Remi From atom at suspicious.org Fri Nov 21 14:18:09 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Fri Nov 21 14:18:32 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: <3FBE63FC.9030003@videotron.ca> References: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> <3FBE15B2.4060501@videotron.ca> <3FBE63FC.9030003@videotron.ca> Message-ID: > >db3-3.3.11,1 The Berkeley DB package, revision 3.3 > >db4-4.0.14_1,1 The Berkeley DB package, revision 4 > >db41-4.1.25_1 The Berkeley DB package, revision 4.1 > I think ( can someone correct me if I'm wrong) that you don't get the > bsddb you expect. > > Try to add at the beginning of your environment variable PYTHONPATH the > directory of db3-3 ================================== i'm getting the same error with db4* removed, so that leads me to think that it's not attempting to use it. also, the freeBSD port of: py22-bsddb3-4.1.6 Python extension module for BerkeleyDB 3 and 4 *requires* bsddb4. ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "Today every inhabitant of this planet must contemplate the day when this planet may no longer be habitable. Every man, woman and child lives under a nuclear sword of Damocles, hanging by the slenderest of threads, capable of being cut at any moment by accident or miscalculation or madness." -- John F. Kennedy From atom at suspicious.org Fri Nov 21 14:24:28 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Fri Nov 21 14:24:51 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> <3FBE15B2.4060501@videotron.ca> <3FBE63FC.9030003@videotron.ca> Message-ID: > also, the freeBSD port of: > py22-bsddb3-4.1.6 Python extension module for BerkeleyDB 3 and 4 > *requires* bsddb4. ========================== hhmmm.... could it be that "py22-bsddb3-4.1.6" only works with pyhon2.2? i've gotta run, but when i get back i'll try loading up "py23-bsddb-2.3.2" and see if it's just a typical case of user error.... ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "What sane person could live in this world and not be crazy?" -- Ursula K. LeGuin From Sorin.Dosoftei at imtcomputers.com Fri Nov 21 16:06:47 2003 From: Sorin.Dosoftei at imtcomputers.com (Sorin Dosoftei) Date: Fri Nov 21 16:06:22 2003 Subject: [Spambayes] AntiSpam software - tech support Message-ID: Hi, Downloaded your AntiSpam software. It looks pretty good. Now, I have some e-mail that, when I try to ":-( delete them as spam", it gives me an error message: "No filterable mail items are selected" Any help would be greatly appreciated, Sorin Dosoftei IT Consultant - A+, Network+, MCSA Innovative Micro Technologies 39 Maple Tree Ave. Stamford, CT 06906 Phone: (203) 969-2255 Fax: (203) 969-2266 Email: sorin.dosoftei@imtcomputers.com http://www.IMTcomputers.com From mikes at xtras.com Fri Nov 21 16:16:39 2003 From: mikes at xtras.com (Mike Schinkel) Date: Fri Nov 21 16:16:44 2003 Subject: [Spambayes] Outlook 2003 Crashing w/SpamBayes v0.81 Message-ID: <1D85A2E1383A484CB168DCD61228D36633F016@xmail.xtras.com> I've just installed Office 2003 and ran up update online, and SpamBayes now crashes Outlook. Outlook XP is still on my machine and it still runs SpamBayes fine. Whenever I click the "Delete as Spam" button of the "SpamBayes" button, it crashes with the "Can I send this info to Microsoft" message. I googled "Spambayes and Outlook 2003" but found no answers. Any ideas? Thanks in advance. -Mike Schinkel From skip at pobox.com Fri Nov 21 16:46:11 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Nov 21 16:46:26 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> <3FBE15B2.4060501@videotron.ca> <3FBE63FC.9030003@videotron.ca> Message-ID: <16318.34723.529676.91474@montanaro.dyndns.org> >> also, the freeBSD port of: >> py22-bsddb3-4.1.6 Python extension module for BerkeleyDB 3 and 4 >> *requires* bsddb4. atom> ========================== atom> hhmmm.... could it be that "py22-bsddb3-4.1.6" only works with atom> pyhon2.2? Most likely, yes. Skip From Brad.Berson at abc.com Fri Nov 21 17:37:24 2003 From: Brad.Berson at abc.com (Berson, Brad) Date: Fri Nov 21 17:37:35 2003 Subject: [Spambayes] Incoming email alerts... Message-ID: Hi, I just began trying out the SpamBayes Outlook plug-in. Great product!!! I'm very grateful for software like this. I use Vamsoft's Open Relay Filter on my Exchange box but even with a half-dozen DNSBLs it's not even 90% effective. The documented problem of not being able to squash the incoming email notification when an email is routed to the junk e-mail folder is something that interests me because of the frequency of the junk coming in, sometimes close to 100 a day. There's a product called Qurb which according to what I've read, may have gotten around this obstacle somehow. I have not evaluated it because whitelist solutions do not interest me. Just thought you folks might want to check that out. Thanks for the hard work. Sincerely, Brad Berson From atom at suspicious.org Fri Nov 21 18:43:28 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Fri Nov 21 18:43:53 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> <3FBE15B2.4060501@videotron.ca> <3FBE63FC.9030003@videotron.ca> Message-ID: > hhmmm.... could it be that "py22-bsddb3-4.1.6" only works with pyhon2.2? > > i've gotta run, but when i get back i'll try loading up "py23-bsddb-2.3.2" > and see if it's just a typical case of user error.... ======================== on one server i'm leaving in python 2.2, and all is well. on the other server, i'm fighting with it and have these relevant packages installed: db3-3.3.11,1 The Berkeley DB package, revision 3.3 db4-4.0.14_1,1 The Berkeley DB package, revision 4 py23-bsddb-2.3.2 Python bindings to the Berkeley DB library python-2.3.2_2 An interpreted object-oriented programming language here's the error i'm getting now... %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% $ sb_filter.py -n Traceback (most recent call last): File "/usr/local/bin/sb_filter.py", line 186, in ? main() File "/usr/local/bin/sb_filter.py", line 174, in main h.newdb() File "/usr/local/bin/sb_filter.py", line 114, in newdb h.store() File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 247, in store self.bayes.store() AttributeError: 'NoneType' object has no attribute 'store' %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% ???? ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "History is the version of past events that people have decided to agree upon." -- Napoleon Bonaparte From atom at suspicious.org Fri Nov 21 18:59:30 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Fri Nov 21 18:59:53 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> <3FBE15B2.4060501@videotron.ca> <3FBE63FC.9030003@videotron.ca> Message-ID: > db3-3.3.11,1 The Berkeley DB package, revision 3.3 > db4-4.0.14_1,1 The Berkeley DB package, revision 4 > py23-bsddb-2.3.2 Python bindings to the Berkeley DB library > python-2.3.2_2 An interpreted object-oriented programming language ====================== i meant this one py-bsdbd: py23-bsddb3-4.1.6 Python extension module for BerkeleyDB 3 and 4 still not working.... ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "Why of course the people don't want war. Why should some poor slob on a farm want to risk his life in a war when the best he can get out of it is to come back to his farm in one piece? Naturally, the common people don't want war: neither in Russia, nor in England, nor for that matter in Germany. That is understood. But after all it is the leaders of the country who determine the policy, and it is always a simple matter to drag the people along, whether it is a democracy, or a fascist dictatorship, or a parliament, or a communist dictatorship... Voice or no voice, the people can always be brought to the bidding of the leaders. That is easy. All you have to do is to tell them they are being attacked, and denounce the pacifists for lack of patriotism and exposing the country to danger." -- Hermann Goering, Nazi Air Force (Luftwaffe) commander, at the Nuremberg Trials after World War II. From tim.one at comcast.net Fri Nov 21 23:08:02 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Nov 21 23:08:07 2003 Subject: [Spambayes] Incoming email alerts... In-Reply-To: Message-ID: [Berson, Brad] > ... > The documented problem of not being able to squash the incoming > email notification when an email is routed to the junk e-mail > folder is something that interests me because of the frequency of > the junk coming in, sometimes close to 100 a day. > > There's a product called Qurb which according to what I've read, > may have gotten around this obstacle somehow. I have not evaluated > it because whitelist solutions do not interest me. Just thought > you folks might want to check that out. Ya, I think the real problem here is that the SpamBayes developers get so much email that none will ever really care about that systray icon. I know I don't -- it's on all day and all night for me, and would be even if spam didn't trigger it. So there's just no motivation for the current developers to spend their free time trying to endure whatever convolutions are necessary (and it must be convoluted, since Outlook doesn't expose that systray gimmick directly). That's one way in which a commercial product *can* be better: if enough customers offer to pay for a feature, management can pay a developer to implement it on work time. In the Open Source world, the theory is that a *user* who wants something enough will volunteer to do the work themself, and contribute the code back to the project for everyone to use. Since that's the only theory we have , that's the only way it will get done. So far nobody has volunteered. One note of caution: reverse-engineering commercial products is almost always a violation of the license agreement, so if someone does want to do this, do *not* purchase Qurb and crack how they do it. Any sign of stuff like that in contributed code, and it will be rejected -- that can create legal problems for the whole project. Since SpamBayes doesn't generate any revenue, it doesn't have any funds to pay lawyers either. From brad.berson at bytebrothers.org Sat Nov 22 00:22:55 2003 From: brad.berson at bytebrothers.org (Brad Berson) Date: Sat Nov 22 00:22:58 2003 Subject: [Spambayes] Outlook Plug-In with Citrix...? Message-ID: Is the latest SpamBayes Outlook plug-in written politely enough to work on a Win2K Terminal Server? Any terrible things I should be aware of in this sort of environment?? Thanks, Brad From clare at optushome.com.au Sat Nov 22 01:20:24 2003 From: clare at optushome.com.au (Clare Wagemans) Date: Sat Nov 22 01:19:25 2003 Subject: [Spambayes] Marking spam as unread Message-ID: Dear Developer I have checked the option to mark possible and definite spam as unread in the hope that I would not get the icon on my system tray indicating that I had new mail. They are marked as unread but I still get the icon. Any chance of a refinement to overcome this problem? Clare Wagemans From clare at optushome.com.au Sat Nov 22 01:22:17 2003 From: clare at optushome.com.au (Clare Wagemans) Date: Sat Nov 22 01:21:15 2003 Subject: [Spambayes] Apologies about read/unread mail Message-ID: I read a little further and see the problems you are encountering. Clare Wagemans From tim.one at comcast.net Sat Nov 22 01:23:15 2003 From: tim.one at comcast.net (Tim Peters) Date: Sat Nov 22 01:23:19 2003 Subject: [Spambayes] Marking spam as unread In-Reply-To: Message-ID: [Clare Wagemans] > I have checked the option to mark possible and definite spam as > unread in the hope that I would not get the icon on my system tray > indicating that I had new mail. They are marked as unread but I > still get the icon. Any chance of a refinement to overcome this > problem? Clare Wagemans I just replied to the same kind of thing a couple hours ago, so forgive if me I just give a reference to it: http://mail.python.org/pipermail/spambayes/2003-November/009460.html From rrowlan at cablelynx.com Sat Nov 22 03:59:33 2003 From: rrowlan at cablelynx.com (Randy H Rowlan) Date: Sat Nov 22 03:59:51 2003 Subject: [Spambayes] email eaten Message-ID: <000001c3b0d6$f31fbfe0$6400a8c0@pineygrove.local> Why are some emails reported as being in the inbox (downloading 10 for example and inbox goes from 5 to 15) but not being visible to read. They are not placed in the possible junk or spam folders. They just disappear before I can see what they are or read them. I read the troubleshooting file but saw nothing like this. Thanks for the help. Randy From mlportersr at webtv.net Sat Nov 22 11:44:59 2003 From: mlportersr at webtv.net (Michael Porter) Date: Sat Nov 22 11:45:00 2003 Subject: [Spambayes] Error on install - release 0.81 on Outlook 2002 Message-ID: <000d01c3b117$f87a88d0$61e61cd8@Michael> Your message is about a month old and there haven't been any replies. Have you resolved the trouble? Mike. "...after all He's not a tame lion..." From smstone at dsmea.com Sat Nov 22 12:50:48 2003 From: smstone at dsmea.com (steve Stone) Date: Sat Nov 22 12:51:40 2003 Subject: [Spambayes] Windows XP Message-ID: <000001c3b121$435d9730$6501a8c0@BJSDell2002Home> About three weeks ago I downloaded and installed the latest version of Spambayes Outlook Addin to my Windows 98 PC. It has been working beautifully so far so I decided to try it on my Windows XP PC. It seemed to download and install properly until I got a message that reads: DLL/OCX: DLL Register Server Failed; Code 0x00000000. I re-downloaded and installed it several times with the same result. If I ignore it and proceed anyway, the system does not go into the installation process when I start Outlook. Any suggestions? From jlowery at maxdoubt.com Sat Nov 22 14:32:37 2003 From: jlowery at maxdoubt.com (J Lowery) Date: Sat Nov 22 14:31:59 2003 Subject: [Spambayes] spambayes Oublook plugin - how to view db Message-ID: <000001c3b12f$64162ce0$6501a8c0@herbie> Can I view the database(s) used by the spambayes outlook plugin? I am not familiar with Python, but could figure it out if needed Thanks! Jami ----------------------------------------------------- Jami Lowery 423.265.5235 jami@jamilowery.com HYPERLINK "http://www.jamilowery.com"www.jamilowery.com Each language has its purpose, however humble. Each language expresses the yin and yang of software. Each language has its place within the Tao. But do not program in Cobol if you can avoid it. --The Tao of Programming by Geoffry James ----------------------------------------------------- --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.541 / Virus Database: 335 - Release Date: 11/14/2003 From Jordan_and_Rory at msn.com Sat Nov 22 16:35:55 2003 From: Jordan_and_Rory at msn.com (Jordan and Rory) Date: Sat Nov 22 16:37:45 2003 Subject: [Spambayes] SpamBayes Outlook Plug-In Not Performing Well Message-ID: <000001c3b140$a13a4640$6402a8c0@Inspiron4150> Hello, all. My apologies if this has been answered before, or is a routine question - I read through the FAQ and did several web searches without finding an answer. I have been running the SpamBayes Outlook (2002 SP-2) plug-in for many weeks. I have a training base of nearly 9000 spams and 800 or so hams. Despite this degree of training, the filter is only achieving about 60% success. Many obvious spams receive a score of 0-2%. The performance of the filter doesn't seem to be improving as the database grows, either. I have retrained it from scratch several times, each time to no avail. I recognize that my ratio of spams to hams is very high; however, a different Bayesian filter I use at work has a similar ratio in its training base, and still achieves 98%+ effectiveness. What am I doing wrong? What can I do to improve the percentage of spams that SpamBayes catches? Thanks in advance. -Jordan From rdukes at clinimetrics.com Sat Nov 22 17:50:04 2003 From: rdukes at clinimetrics.com (Richard Dukes) Date: Sat Nov 22 17:55:26 2003 Subject: [Spambayes] SpamBayes: Filter deleted Inbox? Message-ID: <000401c3b14a$fbb38b90$082ee118@clinimetrics.com> I no longer have any of the 1500+ messages in my Inbox after running the SpamBayes filter for the first time (before training the system, by mistake). I have looked to see if the messages were sent to another Inbox, but it's not there. I haven't performed any other actions. Can you tell me where my emails are? Thanks, River Dukes PS I only ran the filter and haven't clicked the "Delete as Spam" button. River Dukes Clinical Research Associate Clinimetrics Research Associates, Inc. 12616 Dervish Lane Charlotte, NC 28269 Office phone: 704.953.1395 Office fax: 408.549.9904 Voicemail: 408.452.5701 ext. 227 Email: rdukes@clinimetrics.com From tim.one at comcast.net Sat Nov 22 17:57:44 2003 From: tim.one at comcast.net (Tim Peters) Date: Sat Nov 22 17:57:48 2003 Subject: [Spambayes] SpamBayes Outlook Plug-In Not Performing Well In-Reply-To: <000001c3b140$a13a4640$6402a8c0@Inspiron4150> Message-ID: [spambayes-bounces@python.org] > ... > I have been running the SpamBayes Outlook (2002 SP-2) plug-in for many > weeks. I have a training base of nearly 9000 spams and 800 or so > hams. Balance the # of ham and spam trained on and it should work much better. > Despite this degree of training, the filter is only achieving > about 60% success. Many obvious spams receive a score of 0-2%. The > performance of the filter doesn't seem to be improving as the > database grows, either. I have retrained it from scratch several > times, each time to no avail. > > I recognize that my ratio of spams to hams is very high; So *try* balancing it. Tell us what happens when you do. > however, a different Bayesian filter I use at work has a similar > ratio in its training base, and still achieves 98%+ effectiveness. SpamBayes almost certainly uses very different algorithms than that one, whatever it is. Different strengths, different weaknesses, different behaviors. > What am I doing wrong? What can I do to improve the percentage of > spams that SpamBayes catches? Train on less spam or more ham. Also make sure you haven't misclassified any messages you've trained on. If the problem still persists, we'll need to look at the info generated by "Show spam clues for current message". But with a training ratio unbalanced by more than 11 to 1, there's no point to that exercise now -- unbalanced training data is a well-known cause for flaky results in this program. From tim.one at comcast.net Sat Nov 22 18:05:38 2003 From: tim.one at comcast.net (Tim Peters) Date: Sat Nov 22 18:05:43 2003 Subject: [Spambayes] SpamBayes: Filter deleted Inbox? In-Reply-To: <000401c3b14a$fbb38b90$082ee118@clinimetrics.com> Message-ID: [Richard Dukes] > I no longer have any of the 1500+ messages in my Inbox after running > the SpamBayes filter for the first time (before training the system, > by mistake). I have looked to see if the messages were sent to another > Inbox, but it's not there. I haven't performed any other actions. > > Can you tell me where my emails are? > > Thanks, > > River Dukes > > PS I only ran the filter and haven't clicked the "Delete as Spam" > button. I don't know what "ran the filter" means to you. Remember that we didn't watch while you were doing this, so the only info we have is what you tell us. Exactly what did you do? For example, exactly which buttons did you click on, and in what order? If SpamBayes had never been trained, then every email it sees would score exactly 50% (SpamBayes was never taught anything if it was never trained, so has no reason to believe that anything is ham or spam then). Depending on what you haven't told us, it's possible that SpamBayes would then move every email into your "Unsure" or "Possible spam" folder (whatever you may have called that -- I can't guess). It certainly would not move it to "another Inbox" (unless you had told it to move uncertain messages to some other inbox). Note: If you remember what at least one of those emails contained, you could use Outlook's "Advanced Find" system, from the root folder your of .pst file, to search for it. Note too that SpamBayes has no option you can set that would cause it to delete email (SpamBayes never deletes mail, at worst it moves it to another folder). From Jakov.Zaidman at T4G.com Sat Nov 22 18:07:55 2003 From: Jakov.Zaidman at T4G.com (Zaidman, Jakov) Date: Sat Nov 22 18:08:05 2003 Subject: [Spambayes] Certain messages always flagged as Spam Message-ID: I receive firewall event logs as email messages. Every time I receive one, it is flagged as Spam Suspect with spam rating of 75%. Every time I try to recover these messages from spam by clicking on the 'recover' button. Still, next time the message ends up in the suspects folder again. I attached a sample message, along with its SpamBayes clues. Thanks, Jakov Zaidman jzaidman@interlan.ca <> -------------- next part -------------- An embedded message was scrubbed... From: "Zaidman, Jakov" Subject: Spam Clues: NetScreen Event Alarms Reported From TonyM-C Date: Sat, 22 Nov 2003 18:04:43 -0500 Size: 6639 Url: http://mail.python.org/pipermail/spambayes/attachments/20031122/4e791970/attachment.mht From tameyer at ihug.co.nz Sat Nov 22 18:39:00 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Nov 22 18:39:04 2003 Subject: [Spambayes] Certain messages always flagged as Spam In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304314C20@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B178@its-xchg4.massey.ac.nz> > I receive firewall event logs as email messages. Every time I > receive one, it is flagged as Spam Suspect with spam rating > of 75%. Every time I try to recover these messages from spam > by clicking on the 'recover' button. Still, next time the > message ends up in the suspects folder again. > > I attached a sample message, along with its SpamBayes clues. [From attachment] > 'header:Received:4' 0.827173 278 10 It looks like you've got a huge ham/spam imbalance. SpamBayes works best when you have roughly equal numbers of ham and spam. Try adjusting your levels so that this is the case, and try again. Note that if you look at the status message in the main SpamBayes manager dialog, you should see a warning about this. =Tony Meyer From tameyer at ihug.co.nz Sat Nov 22 18:39:58 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Nov 22 18:40:00 2003 Subject: [Spambayes] spambayes Oublook plugin - how to view db In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304314C1B@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B179@its-xchg4.massey.ac.nz> > Can I view the database(s) used by the spambayes outlook > plugin? I am not familiar with Python, but could figure it > out if needed If you get hold of the source (available from the same download page as the plug-in), and have Python installed, then you can use the "db_expimp.py" script that it includes. =Tony Meyer From tameyer at ihug.co.nz Sat Nov 22 18:41:34 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Nov 22 18:41:37 2003 Subject: [Spambayes] email eaten In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13042070B8@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B17A@its-xchg4.massey.ac.nz> > Why are some emails reported as being in the inbox > (downloading 10 for example and inbox goes from 5 to 15) but > not being visible to read. They are not placed in the > possible junk or spam folders. They just disappear before I > can see what they are or read them. SpamBayes doesn't delete any messages, it only moves them. If SpamBayes is making the change here, I suspect that they are being moved into another folder - perhaps the folders you think you've set as possible junk or spam, aren't the ones that are actually set? Note that you can use Outlook's Advanced Find tool to find one of these, and, no doubt, all the rest will be with it. =Tony Meyer From tameyer at ihug.co.nz Sat Nov 22 18:42:29 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat Nov 22 18:42:31 2003 Subject: [Spambayes] problem In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304206F1A@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B17B@its-xchg4.massey.ac.nz> > just had it installed a few days--when one of my co-workers > send me a message, it arrives in the inbox for less than a > second and dissappears--it doesn't show up in the spam file, > deleted file, or anywhere I can find. Also, at times, I get > a indicator that I have recieved mail, but there is none in > the in box, or in the spam file. Are you looking in the folder that you've designated to hold mail SpamBayes is unsure about? Are you definitely looking in the right spam folder? Have you tried searching for the messages with Outlook's Advanced Find? =Tony Meyer From Jordan_and_Rory at msn.com Sat Nov 22 21:14:14 2003 From: Jordan_and_Rory at msn.com (Jordan and Rory) Date: Sat Nov 22 21:14:26 2003 Subject: [Spambayes] SpamBayes Outlook Plug-In Not Performing Well In-Reply-To: Message-ID: <000701c3b167$82a4eba0$6402a8c0@Inspiron4150> Tim, Thanks so much for the prompt response. Your advice worked! I retrained with the 900 or so hams I had, and about 850 spams. I now get MUCH better distinction between spam and ham in the re-scored notes. Most of the spams now show scores in the 90-100% range; the hams all fall in the 0-5% range. I guess with a steady 10-1 spam/ham ratio of incoming mail, I'll have to periodically toss a bunch of spam and retrain. I greatly appreciate the help... and am now a much happier camper! :) -Jordan -----Original Message----- From: Tim Peters [mailto:tim.one@comcast.net] Sent: Saturday, November 22, 2003 5:58 PM To: Jordan and Rory Cc: spambayes@python.org Subject: RE: [Spambayes] SpamBayes Outlook Plug-In Not Performing Well [spambayes-bounces@python.org] > ... > I have been running the SpamBayes Outlook (2002 SP-2) plug-in for many > weeks. I have a training base of nearly 9000 spams and 800 or so > hams. Balance the # of ham and spam trained on and it should work much better. > Despite this degree of training, the filter is only achieving > about 60% success. Many obvious spams receive a score of 0-2%. The > performance of the filter doesn't seem to be improving as the > database grows, either. I have retrained it from scratch several > times, each time to no avail. > > I recognize that my ratio of spams to hams is very high; So *try* balancing it. Tell us what happens when you do. > however, a different Bayesian filter I use at work has a similar > ratio in its training base, and still achieves 98%+ effectiveness. SpamBayes almost certainly uses very different algorithms than that one, whatever it is. Different strengths, different weaknesses, different behaviors. > What am I doing wrong? What can I do to improve the percentage of > spams that SpamBayes catches? Train on less spam or more ham. Also make sure you haven't misclassified any messages you've trained on. If the problem still persists, we'll need to look at the info generated by "Show spam clues for current message". But with a training ratio unbalanced by more than 11 to 1, there's no point to that exercise now -- unbalanced training data is a well-known cause for flaky results in this program. From dalke at dalkescientific.com Sat Nov 22 21:16:43 2003 From: dalke at dalkescientific.com (Andrew Dalke) Date: Sat Nov 22 21:16:20 2003 Subject: [Spambayes] Re: pop3 culler In-Reply-To: Message-ID: <148EF4BE-1D5B-11D8-B497-000393C92466@dalkescientific.com> Last week I asked about a 'spam culler' which would sit on my NT box (which is up 24hrs/day) and check and remove spam and viruses in my POP3 accounts. This helps reduce the amount of mail I get on my laptop, which is especially helpful when traveling and have only occasional, slow network access. I have written it and works fine for my needs. It just sits in a loop, connects to my accounts, runs some tests (from and subject whitelists, spam and virus checks) and performs the appropriate actions. It then waits a bit before doing it again. It's been doing a great job the last couple of days, and has culled about 6 spams an hour and a "virus" every two hours. (My definition of a virus is any executable attachments.) As per Richie Hindle's request, I've put it on the SB wiki, at http://entrian.com/sbwiki/SpamBayesCuller Note that it's designed for programmers; to configure it you will have to tweak code. Andrew dalke@dalkescientific.com From richie at entrian.com Sun Nov 23 02:00:41 2003 From: richie at entrian.com (Richie Hindle) Date: Sun Nov 23 02:01:11 2003 Subject: [Spambayes] Re: pop3 culler In-Reply-To: <148EF4BE-1D5B-11D8-B497-000393C92466@dalkescientific.com> References: <148EF4BE-1D5B-11D8-B497-000393C92466@dalkescientific.com> Message-ID: <6gm0svgvq9rinqjdrg2jvp03rqsepijfov@4ax.com> [Andrew] > As per Richie Hindle's request, I've put it on the SB wiki, at > http://entrian.com/sbwiki/SpamBayesCuller Many thanks! A very useful (and nicely written) script. -- Richie Hindle richie@entrian.com From Jakov.Zaidman at T4G.com Sun Nov 23 13:09:00 2003 From: Jakov.Zaidman at T4G.com (Zaidman, Jakov) Date: Sun Nov 23 13:09:18 2003 Subject: [Spambayes] Certain messages always flagged as Spam Message-ID: Tony, The status message says that the database has over 9000 good messages and 64 bad messages. The huge number of good messages is based on the training with all of the old mail stored in my mailbox. I was under the impression that it would be a good thing to give it as much as possible good mail to train on, in order to reduce the possibility for false positives. What adjustments would you suggest I should make? Thanks, Jakov -----Original Message----- From: Tony Meyer [mailto:tameyer@ihug.co.nz] Sent: Saturday, November 22, 2003 6:39 PM To: Zaidman, Jakov; spambayes@python.org Subject: RE: [Spambayes] Certain messages always flagged as Spam > I receive firewall event logs as email messages. Every time I > receive one, it is flagged as Spam Suspect with spam rating > of 75%. Every time I try to recover these messages from spam > by clicking on the 'recover' button. Still, next time the > message ends up in the suspects folder again. > > I attached a sample message, along with its SpamBayes clues. [From attachment] > 'header:Received:4' 0.827173 278 10 It looks like you've got a huge ham/spam imbalance. SpamBayes works best when you have roughly equal numbers of ham and spam. Try adjusting your levels so that this is the case, and try again. Note that if you look at the status message in the main SpamBayes manager dialog, you should see a warning about this. =Tony Meyer From tim.one at comcast.net Sun Nov 23 14:55:45 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Nov 23 14:56:00 2003 Subject: [Spambayes] Certain messages always flagged as Spam In-Reply-To: Message-ID: [Zaidman, Jakov] > The status message says that the database has over 9000 good messages > and 64 bad messages. Ouch. > The huge number of good messages is based on the training with all > of the old mail stored in my mailbox. I was under the impression that > it would be a good thing to give it as much as possible good mail > to train on, in order to reduce the possibility for false positives. That's a decent strategy provided that you also train on an approximately equal number of spam. > What adjustments would you suggest I should make? Tony gave you the best advice the first time: balance your training data, even if that means you train on only 64 ham too. Better would be to train on more spam. You should find that once you've trained on (just) a couple hundred of each, the classification performance will be very good. From early tests, training on many thousands of each only makes a difference if you're worried about the fourth decimal digit. An advantage to training on relatively few messages of each kind is that your classifier will respond much more quickly to training on mistakes. You don't have to train on everything to get excellent results. From rross at vigilantesecurity.com Sun Nov 23 16:04:59 2003 From: rross at vigilantesecurity.com (Ron Ross) Date: Sun Nov 23 16:05:04 2003 Subject: [Spambayes] Configuration Wizard Not working Message-ID: Hello, The Configuration Wizard does not open when I select it. MS Outlook 2003, Thanks RSR -- Outgoing mail is certified Virus Free. Checked by AVG Anti-Virus (http://www.grisoft.com). Version: 7.0.197 / Virus Database: 261.3.0 - Release Date: 11/21/2003 From jmcatee at mediaodyssey.com Sun Nov 23 18:14:17 2003 From: jmcatee at mediaodyssey.com (Jim McAtee) Date: Sun Nov 23 18:14:21 2003 Subject: [Spambayes] outlook express References: <1ED4ECF91CDED24C8D012BCF2B034F13026F29DA@its-xchg4.massey.ac.nz> Message-ID: <019401c3b217$857057b0$1c03a8c0@jim> From: "Tony Meyer" > Do you still need to train? I would have thought that you'd get > pretty acceptable results quite quickly with that volume of mail. Spambayes isn't doing too badly. Still, probably 5% or so of spam ends up getting past, much of it marked as ham. Being relatively new to Spambayes, my initial training probably wasn't optimal. From messages I've been reading on this list it appears the best thing to do would be to retrain from hand-picked sets of spam and ham. Which leads to this question: If I retrain from scratch, as you mention, probably the easiest way is to pick out and set aside some spam in an OE folder and do the same with a set of copies of ham in another folder. Then point Spambayes, using the web interface, at these folders to train. Will the embedded Spambayes headers in this already-marked email cause Spambayes any problems or have a detrimental effect on the training? Or would I be better served by turning off Spambayes for a couple of days while I collect training spam _without_ the Spambayes headers? From mdhpub at blueyonder.co.uk Sun Nov 23 21:07:39 2003 From: mdhpub at blueyonder.co.uk (Mathew Hendry) Date: Sun Nov 23 21:07:43 2003 Subject: [Spambayes] How to tell SpamBayes to check more headers (using MSOplugin) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13026F29D5@its-xchg4.massey.ac.nz> Message-ID: Tony Meyer wrote: > However, if these headers appear for both ham and spam, and it's just the > contents of the headers that varies, then this isn't going to do you much > good. In that case, you'd have to add code to tokenizer.py (in the > tokenize_headers function) to add specific tokens for SpamAssassin and > SpamCop. (If you do, please submit a patch). I'm still playing with it, but here's what I have so far. It seems to work quite nicely - after a full retrain, I'm seeing a lot of spammassassin: and spamcop: lines near the spammy end of evidence listings. *** WARNING *** PYTHON NEWBIE *** WARNING *** spamassassin_re = re.compile(r'tests=([A-Z0-9,_]+)') ... # X-Spam-Status: # Added by SpamAssassin (http://www.spamassassin.org) line = msg.get('x-spam-status') if line is not None: line = ''.join(line.split()) for rules in spamassassin_re.findall(line): for rule in rules.split(','): yield 'spamassassin:' + rule # X-SpamCop-Disposition: # Added by SpamCop Mail service (http://www.spamcop.net) line = msg.get('x-spamcop-disposition') if line is not None: for token in line.lower().split(): yield 'spamcop:' + token -- Mat. From atom at suspicious.org Sun Nov 23 21:35:08 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Sun Nov 23 21:35:41 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> <3FBE15B2.4060501@videotron.ca> <3FBE63FC.9030003@videotron.ca> Message-ID: does anyone know what's going on? does anyone know which app is causing the problem? does anyone have spambayes-1.0a7 working on a *nix box with python-2.3? ok... i just tore out all of the bsddb, python and spambayes packages, and started over. i now have: db3-3.3.11,1 The Berkeley DB package, revision 3.3 db4-4.0.14_1,1 The Berkeley DB package, revision 4 db41-4.1.25_1 The Berkeley DB package, revision 4.1 py23-bsddb-2.3.2_1 Python bindings to the Berkeley DB library python-2.3.2_3 An interpreted object-oriented programming language spambayes-1.0a7 also, i tried building py23-bsddb with both "WITH_DB3" and "WITH_DB4" (one at a time, and together) and it doesn't make any difference. this is the type of error i'm getting: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% $ sb_filter.py -n -d .hammiedb Traceback (most recent call last): File "/usr/local/bin/sb_filter.py", line 186, in ? main() File "/usr/local/bin/sb_filter.py", line 174, in main h.newdb() File "/usr/local/bin/sb_filter.py", line 114, in newdb h.store() File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 247, in store self.bayes.store() AttributeError: 'NoneType' object has no attribute 'store' %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% like i said earlier, i don't do any python scripting, so this is not so intuitive to me. it still works fine with python 2.2... as soon is i upgrade to python 2.3 i have problems. ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "But I don't want to go among mad people," Alice remarked. "Oh, you can't help that," said the Cat. "We're all mad here. I'm mad. You're mad." "How do you know I'm mad?" said Alice. "You must be," said the Cat, "or you wouldn't have come here." -- Lewis Carroll, Alice's Adventures in Wonderland From atom at suspicious.org Sun Nov 23 23:53:58 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Sun Nov 23 23:54:29 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F130212B174@its-xchg4.massey.ac.nz> <3FBE15B2.4060501@videotron.ca> <3FBE63FC.9030003@videotron.ca> Message-ID: ok, with the same versions of the same packages installed, it works fine on freeBSD 4.8 but fails horribly on freeBSD 4.7. if anyone has any ideas.... ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "No one shall be subjected to arbitrary interference with his privacy, family, home or correspondence, nor to attacks upon his honour and reputation. Everyone has the right to the protection of the law against such interference or attacks." -- Article 12, Universal Declaration of Human Rights, United Nations From tameyer at ihug.co.nz Sun Nov 23 23:57:53 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Nov 23 23:57:58 2003 Subject: [Spambayes] outlook express In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304314DAF@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B17D@its-xchg4.massey.ac.nz> > If I retrain from scratch, as you mention, probably the > easiest way is to pick out and set aside some spam in an OE > folder and do the same with a set of copies of ham in another > folder. Then point Spambayes, using the web interface, at > these folders to train. Yes, that's probably easiest. > Will the embedded Spambayes headers in this already-marked > email cause Spambayes any problems or have a detrimental > effect on the training? No, they'll have no effect at all (unless you've made some odd changes to the default setup). By default, SpamBayes only generates tokens from a limited set of headers, which don't include any SpamBayes ones. =Tony Meyer From anthony at interlink.com.au Mon Nov 24 01:18:57 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Mon Nov 24 01:19:21 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: Message-ID: <200311240618.hAO6IvYu017410@localhost.localdomain> >>> "Atom 'Smasher'" wrote > ok, with the same versions of the same packages installed, it works fine > on freeBSD 4.8 but fails horribly on freeBSD 4.7. > > if anyone has any ideas.... Tracebacks are essential if we're able to debug it at all. Also include the version numbers of the relevant packages. From atom at suspicious.org Mon Nov 24 01:41:38 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Mon Nov 24 01:42:07 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: <200311240618.hAO6IvYu017410@localhost.localdomain> References: <200311240618.hAO6IvYu017410@localhost.localdomain> Message-ID: here's 2 tracebacks.... %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% $ sb_filter.py -n Traceback (most recent call last): File "/usr/local/bin/sb_filter.py", line 186, in ? main() File "/usr/local/bin/sb_filter.py", line 174, in main h.newdb() File "/usr/local/bin/sb_filter.py", line 114, in newdb h.store() File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 247, in store self.bayes.store() AttributeError: 'NoneType' object has no attribute 'store' %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% and.... %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% $ sb_filter.py -d .hammiedb < text-file Traceback (most recent call last): File "/usr/local/bin/sb_filter.py", line 186, in ? main() File "/usr/local/bin/sb_filter.py", line 182, in main action(msg) File "/usr/local/bin/sb_filter.py", line 119, in filter return h.filter(msg) File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 109, in filter prob, clues = self._scoremsg(msg, True) File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 38, in _scoremsg return self.bayes.spamprob(tokenize(msg), evidence) AttributeError: 'NoneType' object has no attribute 'spamprob' %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% on the 4.7 system, the relevant packages are: db4-4.0.14_1,1 The Berkeley DB package, revision 4 db41-4.1.25_1 The Berkeley DB package, revision 4.1 py23-bsddb3-4.1.6 Python extension module for BerkeleyDB 3 and 4 python-2.3.2_3 An interpreted object-oriented programming language spambayes-1.0a7 i've also tried py23-bsddb4 and db3, but right now i'm testing it this way: * with python2.2 (symlinked to python) everything works. * as soon as i install python2.3 (as python) SB bails out the *ONLY* difference i can find here between working and not working is the version of python: 2.2 works, 2.3 doesn't. of course, on my freeBSD 4.8 box, it works either way. this problem is only on my freeBSD 4.7 boxes (which makes this even weirder). thanks.... On Mon, 24 Nov 2003, Anthony Baxter wrote: > > >>> "Atom 'Smasher'" wrote > > ok, with the same versions of the same packages installed, it works fine > > on freeBSD 4.8 but fails horribly on freeBSD 4.7. > > > > if anyone has any ideas.... > > Tracebacks are essential if we're able to debug it at all. Also include > the version numbers of the relevant packages. > > ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "Every great advance in natural knowledge has involved the absolute rejection of authority." -- Julian Huxley From colin.mac at blueyonder.co.uk Mon Nov 24 06:03:41 2003 From: colin.mac at blueyonder.co.uk (Colin) Date: Mon Nov 24 06:05:02 2003 Subject: [Spambayes] Install problem Message-ID: <000901c3b27a$a022df20$0100a8c0@mshome.net> Can someone help please. I was recommended to try Spambayes, and have been testing it for approx 3 weeks. It seems to have been great so far. However at the weekend, I had to re-install Office XO (SP2), and now I can use Spambayes. I've uninstalled it, checked I have the latest version, re-installed etc, and read the FAQ's but no luck. I can bring up the Spambayes Toolbar, but when I click on the Spambayes button, I get no response. Any suggestions please? Thanks. **************************************** Colin McDonald colin.mac@blueyonder.co.uk **************************************** From dave at pythonapocrypha.com Mon Nov 24 09:12:44 2003 From: dave at pythonapocrypha.com (Dave Brueck) Date: Mon Nov 24 09:12:50 2003 Subject: [Spambayes] Header annotations included in classification? Message-ID: <06e401c3b295$081e71b0$6400000a@YODA> Hello, I'm using Spambayes-1.0a7 on a Windows machine with Outlook Express (unfortunately) and am receiving mail through the POP3 proxy. Since it's Outlook Express, under Header Options in the configuration web page I've turned on the classification in the "subject:" header for spam (so all spam's now have a subject line that starts with "spam,"). This seems to work great, but when I look at clues for a given message, it looks like the header notation is being included in the classification process: ... her 0.95871559633 content-type:text/html 0.964536333901 free! 0.96511627907 click 0.966500345667 subject:, 0.979766182776 online 0.98951048951 subject:spam 0.997760079642 (both the "spam" and the "," tokens) When I do a Word Query on these tokens I get the following results: Statistics for 'subject:,' Number of spam messages: 79. Number of ham messages: 1. Probability that a message containing this word is spam: 0.979766182776. Statistics for 'subject:spam' Number of spam messages: 100. Number of ham messages: 0. Probability that a message containing this word is spam: 0.997760079642. Currently I have trained against 141 spam and 100 ham, and none of the original messages had subject headers containing the word "spam". I've been training Spambayes using the web interface (Home -> Review). Is there a conflicting configuration settings or something I need to adjust? FWIW the classification process seems to be working really well anyway, but I'm not sure how long that will continue if Spambayes' own notations are interpreted as a slam-dunk of a spam indicator. :) Thanks for any help, Dave From kennypitt at hotmail.com Mon Nov 24 09:33:07 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Nov 24 09:33:49 2003 Subject: [Spambayes] Incoming email alerts... In-Reply-To: Message-ID: Tim Peters wrote: > [Berson, Brad] >> ... >> The documented problem of not being able to squash the incoming >> email notification when an email is routed to the junk e-mail >> folder is something that interests me because of the frequency of >> the junk coming in, sometimes close to 100 a day. >> >> There's a product called Qurb which according to what I've read, >> may have gotten around this obstacle somehow. I have not evaluated >> it because whitelist solutions do not interest me. Just thought >> you folks might want to check that out. > > One note of caution: reverse-engineering commercial products is > almost always a violation of the license agreement, so if someone > does want to do this, do *not* purchase Qurb and crack how they do > it. Any sign of stuff like that in contributed code, and it will be > rejected -- that can create legal problems for the whole project. > Since SpamBayes doesn't generate any revenue, it doesn't have any > funds to pay lawyers either. With no reverse engineering required, I read Qurb's FAQ on their website. They state quite clearly that they are replacing the Outlook notification with their own, and that the notification only indicates messages that are left in the Inbox. If an Outlook rule moves a message to another folder, there is a good chance that Qurb will not show any notification of that message. This probably isn't the behavior we would want, anyway. -- Kenny Pitt From papaDoc at videotron.ca Mon Nov 24 09:44:08 2003 From: papaDoc at videotron.ca (papaDoc) Date: Mon Nov 24 09:44:12 2003 Subject: [Spambayes] Header annotations included in classification? In-Reply-To: <06e401c3b295$081e71b0$6400000a@YODA> References: <06e401c3b295$081e71b0$6400000a@YODA> Message-ID: <3FC21938.8060403@videotron.ca> Hi Dave, I did not look closely at the code but usually the "spam clue" are added at the end of the classification process. I you train on "fresh mail" there should be no problem. If you retrain (an email was not classified correctly, or retrain from scratch with old mail ) this can be a problem but there is options to not look at headers but I think the subject is always classified. (someone can correct me on this ?) You can fill a bug or feature request on sourceforge. Remi >I'm using Spambayes-1.0a7 on a Windows machine with Outlook Express >(unfortunately) and am receiving mail through the POP3 proxy. Since it's >Outlook Express, under Header Options in the configuration web page I've turned >on the classification in the "subject:" header for spam (so all spam's now have >a subject line that starts with "spam,"). This seems to work great, but when I >look at clues for a given message, it looks like the header notation is being >included in the classification process: > > From skip at pobox.com Mon Nov 24 09:58:48 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Nov 24 09:58:56 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: References: <200311240618.hAO6IvYu017410@localhost.localdomain> Message-ID: <16322.7336.88487.906253@montanaro.dyndns.org> atom> File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 247, in store atom> self.bayes.store() atom> AttributeError: 'NoneType' object has no attribute 'store' and atom> File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 38, in _scoremsg atom> return self.bayes.spamprob(tokenize(msg), evidence) atom> AttributeError: 'NoneType' object has no attribute 'spamprob' Both suggest that opening a database file failed. Somewhere you have a database file, right? Suppose it's named hammie.db. Try this: % python Python 2.4a0 (#5, Nov 23 2003, 20:09:38) [GCC 3.3 20030304 (Apple Computer, Inc. build 1493)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> import shelve >>> db = shelve.open("hammie.db") >>> len(db) 40569 >>> db.close() Does that succeed for Python 2.2 but fail for 2.3? If so, I'd suspect a file version change. If not, continue on to step two. % python Python 2.4a0 (#5, Nov 23 2003, 20:09:38) [GCC 3.3 20030304 (Apple Computer, Inc. build 1493)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> from spambayes.storage import open_storage >>> s = open_storage("hammie.db") >>> s >>> dir(s) ['WordInfoClass', '__doc__', '__getstate__', '__init__', '__module__', '__setstate__', '_add_msg', '_getclues', '_post_training', '_remove_msg', '_wordinfodel', '_wordinfoget', '_wordinfokeys', '_wordinfoset', '_write_state_key', 'changed_words', 'chi2_spamprob', 'close', 'db', 'db_name', 'dbm', 'learn', 'load', 'mode', 'nham', 'nspam', 'probability', 'probcache', 'spamprob', 'statekey', 'store', 'unlearn', 'wordinfo'] >>> s._wordinfoget("python") WordInfo(2, 14) Does this succeed for one version of Python but not the other? I suspect you have some sort of version-related problems, but they are related to the FreeBSD environment, not directly to Python's versioning. Skip From dave at pythonapocrypha.com Mon Nov 24 10:01:28 2003 From: dave at pythonapocrypha.com (Dave Brueck) Date: Mon Nov 24 10:01:31 2003 Subject: [Spambayes] Header annotations included in classification? References: <06e401c3b295$081e71b0$6400000a@YODA> <3FC21938.8060403@videotron.ca> Message-ID: <073101c3b29b$d6e9d6f0$6400000a@YODA> papaDoc wrote: > I did not look closely at the code but usually the "spam clue" are added > at the end of the classification process. > I you train on "fresh mail" there should be no problem. > > If you retrain (an email was not classified correctly, or retrain from > scratch with old mail ) this can be a problem but > there is options to not look at headers but I think the subject is > always classified. (someone can correct me on this ?) > > You can fill a bug or feature request on sourceforge. Aha! Thanks for your help - I feel a bit dumb but now understand my problem better. :) On the Home -> Review web page I was looking through the list of emails for which it was unsure and marking them as Ham or Spam and then clicking the Train button. But as Spambayes began to classify emails on its own they began to also show up in their own section ("Messages classified as Spam") and by default they have the "Spam" radio button selected so by clicking Train I was running them through as training material as well. I wonder if the radio button for messages that were classified as ham/spam should default to Discard? I hesitate to even file that as a feature request as I'm pretty new to SB and maybe that wouldn't make sense to more experienced users. Anyway, thanks for the help, Dave From papaDoc at videotron.ca Mon Nov 24 10:14:35 2003 From: papaDoc at videotron.ca (papaDoc) Date: Mon Nov 24 10:14:40 2003 Subject: [Spambayes] Header annotations included in classification? In-Reply-To: <073101c3b29b$d6e9d6f0$6400000a@YODA> References: <06e401c3b295$081e71b0$6400000a@YODA> <3FC21938.8060403@videotron.ca> <073101c3b29b$d6e9d6f0$6400000a@YODA> Message-ID: <3FC2205B.7050504@videotron.ca> Hi Dave, >Aha! Thanks for your help - I feel a bit dumb but now understand my problem >better. :) On the Home -> Review web page I was looking through the list of >emails for which it was unsure and marking them as Ham or Spam and then >clicking the Train button. But as Spambayes began to classify emails on its own >they began to also show up in their own section ("Messages classified as Spam") >and by default they have the "Spam" radio button selected so by clicking Train >I was running them through as training material as well. > >I wonder if the radio button for messages that were classified as ham/spam >should default to Discard? > If you get the code from CVS you can choose whatever you want. (and by default it is discard ;-) It is in the avanced options under "Interface options" >I hesitate to even file that as a feature request as >I'm pretty new to SB and maybe that wouldn't make sense to more experienced >users. > No I think you should because we should not train on stuff spambayes add. It should be removed before. You are the first one to notice that since there is not many people using Outlook Express but It is a good catch. Remi From cej at intech.com Mon Nov 24 10:23:55 2003 From: cej at intech.com (Christopher Jastram) Date: Mon Nov 24 10:27:37 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / No modulenamed zlib ? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130212B15B@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130212B15B@its-xchg4.massey.ac.nz> Message-ID: <3FC2228B.601@intech.com> Tony Meyer wrote: >>Ended up implementing server-side filtering for everyone else, and to >>handle those accounts that receive SO much mail they crash >>the Outlook plugin... (5 to 6 hundred). >> >> > >The volume of mail shouldn't cause the plug-in any problems (although it >will obviously take longer to do it's thing). On occasions I have Outlook >download many more than 600 messages in one go, and have never had any >trouble with spambayes - as have others, I'm sure. > >This is not to say that something isn't going wrong with spambayes, just >that the volume of mail shouldn't be the cause. By "crash the plug-in", >what do you mean? Crash Outlook? Stop the plug-in working? Freeze the >system? > >=Tony Meyer > > (I did not personally experience the problem, so this is second-hand from a coworker). The volume of mail was upwards of 500 pieces (person left for a week on a business trip). It would chug along for a while, and then say either "out of memory" or crash Outlook 2K. Restart, chug along some more. User in question is running Windows 98, Outlook 2000 from Office 2000 SR-1, 128 MB RAM. I don't expect a fix, but I figured I'd post something so that other people with the same problem don't feel alone. Chris Jastram From dave at pythonapocrypha.com Mon Nov 24 10:40:19 2003 From: dave at pythonapocrypha.com (Dave Brueck) Date: Mon Nov 24 10:40:23 2003 Subject: [Spambayes] Header annotations included in classification? References: <06e401c3b295$081e71b0$6400000a@YODA> <3FC21938.8060403@videotron.ca> <073101c3b29b$d6e9d6f0$6400000a@YODA> <3FC2205B.7050504@videotron.ca> Message-ID: <077501c3b2a1$44839a20$6400000a@YODA> papaDoc wrote: > >Aha! Thanks for your help - I feel a bit dumb but now understand my problem > >better. :) On the Home -> Review web page I was looking through the list of > >emails for which it was unsure and marking them as Ham or Spam and then > >clicking the Train button. But as Spambayes began to classify emails on its own > >they began to also show up in their own section ("Messages classified as Spam") > >and by default they have the "Spam" radio button selected so by clicking Train > >I was running them through as training material as well. > > > >I wonder if the radio button for messages that were classified as ham/spam > >should default to Discard? > > > If you get the code from CVS you can choose whatever you want. (and by > default it is discard ;-) It is in the avanced options under > "Interface options" > > >I hesitate to even file that as a feature request as > >I'm pretty new to SB and maybe that wouldn't make sense to more experienced > >users. > > > No I think you should because we should not train on stuff spambayes > add. It should be removed before. Ok, I filed feature request #848365. Thanks again, Dave From atom at suspicious.org Mon Nov 24 12:53:11 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Mon Nov 24 12:53:46 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: <16322.7336.88487.906253@montanaro.dyndns.org> References: <200311240618.hAO6IvYu017410@localhost.localdomain> <16322.7336.88487.906253@montanaro.dyndns.org> Message-ID: ok, using the new python (the one giving me a problem) this first test seems to have passed: $ python Python 2.3.2 (#1, Nov 24 2003, 00:10:59) [GCC 2.95.4 20020320 [FreeBSD]] on freebsd4 Type "help", "copyright", "credits" or "license" for more information. >>> import shelve >>> db = shelve.open(".hammiedb") >>> len(db) 1 >>> db.close() >>> the second test didn't: $ python Python 2.3.2 (#1, Nov 24 2003, 00:10:59) [GCC 2.95.4 20020320 [FreeBSD]] on freebsd4 Type "help", "copyright", "credits" or "license" for more information. >>> from spambayes.storage import open_storage >>> s = open_storage(".hammiedb") >>> s >>> dir(s) ['__class__', '__delattr__', '__doc__', '__getattribute__', '__hash__', '__init__', '__new__', '__reduce__', '__reduce_ex__', '__repr__', '__setattr__', '__str__'] >>> s._wordinfoget("python") Traceback (most recent call last): File "", line 1, in ? AttributeError: 'NoneType' object has no attribute '_wordinfoget' >>> now what?? thanks.... On Mon, 24 Nov 2003, Skip Montanaro wrote: > > atom> File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 247, in store > atom> self.bayes.store() > atom> AttributeError: 'NoneType' object has no attribute 'store' > > and > > atom> File "/usr/local/lib/python2.3/site-packages/spambayes/hammie.py", line 38, in _scoremsg > atom> return self.bayes.spamprob(tokenize(msg), evidence) > atom> AttributeError: 'NoneType' object has no attribute 'spamprob' > > Both suggest that opening a database file failed. Somewhere you have a > database file, right? Suppose it's named hammie.db. Try this: > > % python > Python 2.4a0 (#5, Nov 23 2003, 20:09:38) > [GCC 3.3 20030304 (Apple Computer, Inc. build 1493)] on darwin > Type "help", "copyright", "credits" or "license" for more information. > >>> import shelve > >>> db = shelve.open("hammie.db") > >>> len(db) > 40569 > >>> db.close() > > Does that succeed for Python 2.2 but fail for 2.3? If so, I'd suspect a > file version change. If not, continue on to step two. > > % python > Python 2.4a0 (#5, Nov 23 2003, 20:09:38) > [GCC 3.3 20030304 (Apple Computer, Inc. build 1493)] on darwin > Type "help", "copyright", "credits" or "license" for more information. > >>> from spambayes.storage import open_storage > >>> s = open_storage("hammie.db") > >>> s > > >>> dir(s) > ['WordInfoClass', '__doc__', '__getstate__', '__init__', '__module__', '__setstate__', '_add_msg', '_getclues', '_post_training', '_remove_msg', '_wordinfodel', '_wordinfoget', '_wordinfokeys', '_wordinfoset', '_write_state_key', 'changed_words', 'chi2_spamprob', 'close', 'db', 'db_name', 'dbm', 'learn', 'load', 'mode', 'nham', 'nspam', 'probability', 'probcache', 'spamprob', 'statekey', 'store', 'unlearn', 'wordinfo'] > >>> s._wordinfoget("python") > WordInfo(2, 14) > > Does this succeed for one version of Python but not the other? > > I suspect you have some sort of version-related problems, but they are > related to the FreeBSD environment, not directly to Python's versioning. > > Skip > ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "The real truth of the matter is, as you and I know, that a financial element in the large centers has owned the government of the U.S. since the days of Andrew Jackson." -- Franklin Delano Roosevelt, November 21st, l933 From delete at GoodmanAssociates.com Mon Nov 24 13:14:17 2003 From: delete at GoodmanAssociates.com (Seth Goodman) Date: Mon Nov 24 13:14:27 2003 Subject: [Spambayes] Incoming email alerts... In-Reply-To: Message-ID: > With no reverse engineering required, I read Qurb's FAQ on their > website. They state quite clearly that they are replacing the Outlook > notification with their own, and that the notification only indicates > messages that are left in the Inbox. If an Outlook rule moves a message > to another folder, there is a good chance that Qurb will not show any > notification of that message. This probably isn't the behavior we would > want, anyway. > > -- > Kenny Pitt I'm not sure why you say this isn't the desired behavior. At least in my environment, I use the Outlook rules to filter off all the list mail traffic, which I don't want special notification for, and then let SpamBayes filter out the spam. The only thing I want to be notified about is what's left in the Inbox. I'm not saying that's how everyone runs Outlook, but I bet that this arrangement is more typical than not. As an aside, I recently noticed that there is a built in Outlook rule action named "clear the Message Flag". I don't know anything about the Outlook internals, but is there any chance that you could grab a hook to that action? -- Seth Goodman Humans: change "delete" to "sethg" to email me Spambots: disregard the above From delete at GoodmanAssociates.com Mon Nov 24 13:42:48 2003 From: delete at GoodmanAssociates.com (Seth Goodman) Date: Mon Nov 24 13:42:58 2003 Subject: [Spambayes] Spambayes installation in SuSE Linux / No modulenamedzlib ? In-Reply-To: <3FC2228B.601@intech.com> Message-ID: > (I did not personally experience the problem, so this is second-hand > from a coworker). The volume of mail was upwards of 500 pieces (person > left for a week on a business trip). It would chug along for a while, > and then say either "out of memory" or crash Outlook 2K. Restart, chug > along some more. User in question is running Windows 98, Outlook 2000 > from Office 2000 SR-1, 128 MB RAM. I don't expect a fix, but I figured > I'd post something so that other people with the same problem don't feel > alone. > > Chris Jastram Sounds like this machine has not been updated in a while. Outlook 2000 is currently at SP-3 and I think there were a few hot fixes on the Office Update site since that service pack. Hopefully, the machine owner has also run Microsoft Update recently, since Explorer has as many holes in it as a piece of Swiss cheese, and it *is* an integral part of the operating system. Recommend using both the MS Update Site and MS Office Update site to bring everything up to the latest and, maybe not greatest, at least less worse. -- Seth Goodman Humans: change "delete" to "sethg" to email me Spambots: disregard the above From kennypitt at hotmail.com Mon Nov 24 13:43:33 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Mon Nov 24 13:44:11 2003 Subject: [Spambayes] Incoming email alerts... In-Reply-To: Message-ID: Seth Goodman wrote: >> With no reverse engineering required, I read Qurb's FAQ on their >> website. They state quite clearly that they are replacing the >> Outlook notification with their own, and that the notification only >> indicates messages that are left in the Inbox. If an Outlook rule >> moves a message to another folder, there is a good chance that Qurb >> will not show any notification of that message. This probably isn't >> the behavior we would want, anyway. > > I'm not sure why you say this isn't the desired behavior. At least > in my environment, I use the Outlook rules to filter off all the list > mail traffic, which I don't want special notification for, and then > let SpamBayes filter out the spam. The only thing I want to be > notified about is what's left in the Inbox. I'm not saying that's > how everyone runs Outlook, but I bet that this arrangement is more > typical than not. As you said, it's not how everyone runs Outlook. We've seen a number of reports from users that start with things like, "I use Outlook rules to move mail from each of my accounts into a separate Inbox folder." Most people seem to just want the icon removed when all they have is spam, and would probably expect the behavior to be the same as existing Outlook in all other cases such as mail moved by rules. A solution that is flexible enough to support different ways of using Outlook would be preferable. InBoxer, a commercially-supported product based on SpamBayes, also supports its own notification icon for indicating received mail. Not sure if it suffers from this same limitation or not because I don't know how Sean is doing his checking. > As an aside, I recently noticed that there is a built in Outlook rule > action named "clear the Message Flag". I don't know anything about > the Outlook internals, but is there any chance that you could grab a > hook to that action? AIUI, the message flag is used to flag a message for follow-up or other special attention. It doesn't have anything to do with the new mail envelope. -- Kenny Pitt From mikes at members-insurance.com Mon Nov 24 14:23:28 2003 From: mikes at members-insurance.com (Michael K. Schummers) Date: Mon Nov 24 14:23:44 2003 Subject: [Spambayes] Training Message-ID: <00b601c3b2c0$72347010$d60000c8@PRK94> I had a strange quirk that I don't know how to reproduce last week. On Wednesday, 11/19 toward the end of the day, I was training on some additional spam to help "balance" the training database. It appeared that it worked at the time from the statistics on Spambayes Manager "General" tab. I did this training with the "Rebuild Entire Database" radio button unchecked but I had left the Rescore radio button checked. I was out of the office until this morning and found that the General Tab now said I had 1617 Spam and 0 Ham trained when I came in this morning. Needless to say, everything was moved to the Spam folder. I would have concluded that I had inadvertantly left the "Rebuild" button checked except that when I went into Spambayes Manager to redo the Training to correct things, the radio button was still unchecked. Any thoughts of what might have caused this kind of reset in the database statistics? I do recall checking the training statistics after I was done last Wednesday to see how close to "even" I had gotten the spam vs. ham training ratio so my guess is that something occured when/while I exited the Spambayes Manager but I can't think of anyway to "reconstruct" the problem to see what might have happened. Do any of the log files record anything that might give me a clue?? It wouldn't have been that big a problem but my workstation stays on 24x7 and 4 + days of ham had to be manualled extracted from the Spam folder. I have since retrained with "Rebuild" checked and things seem pretty much back to normal. ? Thanks for any thoughts. I am running the Outlook Plugin v 0.81 and had been running fine for quite a while except (per an earlier post) for the fact that "normal" operation seems to build up the number of ham scored so that the ham/spam ration seems to quickly (weeks) become skewed to 3 - 4 x the number of ham trained vs. spam. I am assuming this imbalance is occuring because of the number of Unsure that get retrained into Ham but I have kept detailed enough statistics to tell. ? Michael K. Schummers General Manager ? ? ? ? ? (859) 455-7317 HYPERLINK "mailto:mikes@members-insurance.com"mikes@members-insurance.com --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.543 / Virus Database: 337 - Release Date: 11/21/2003 -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 6113 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031124/bb66d69d/attachment-0001.gif -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 7559 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031124/bb66d69d/attachment-0001.jpe From skip at pobox.com Mon Nov 24 14:37:13 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon Nov 24 14:40:56 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: References: <200311240618.hAO6IvYu017410@localhost.localdomain> <16322.7336.88487.906253@montanaro.dyndns.org> Message-ID: <16322.24041.79925.320723@montanaro.dyndns.org> $ python Python 2.3.2 (#1, Nov 24 2003, 00:10:59) [GCC 2.95.4 20020320 [FreeBSD]] on freebsd4 Type "help", "copyright", "credits" or "license" for more information. >>> from spambayes.storage import open_storage >>> s = open_storage(".hammiedb") >>> s >>> dir(s) ['__class__', '__delattr__', '__doc__', '__getattribute__', '__hash__', '__init__', '__new__', '__reduce__', '__reduce_ex__', '__repr__', '__setattr__', '__str__'] >>> s._wordinfoget("python") Traceback (most recent call last): File "", line 1, in ? AttributeError: 'NoneType' object has no attribute '_wordinfoget' >>> Looking at the open_storage code I don't see how it can return None, but it clearly is. What version of SpamBayes are you using again? Skip From atom at suspicious.org Mon Nov 24 15:08:16 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Mon Nov 24 15:08:51 2003 Subject: [Spambayes] *nix error after upgrade In-Reply-To: <16322.24041.79925.320723@montanaro.dyndns.org> References: <200311240618.hAO6IvYu017410@localhost.localdomain> <16322.7336.88487.906253@montanaro.dyndns.org> <16322.24041.79925.320723@montanaro.dyndns.org> Message-ID: spambayes-1.0a7 py23-bsddb-2.3.2_1 Python bindings to the Berkeley DB library py23-bsddb3-4.1.6 Python extension module for BerkeleyDB 3 and 4 python-2.3.2_3 An interpreted object-oriented programming language pretty weird, huh? i think i also tried python-2.3.2_2with about the same failure. On Mon, 24 Nov 2003, Skip Montanaro wrote: > > $ python > Python 2.3.2 (#1, Nov 24 2003, 00:10:59) > [GCC 2.95.4 20020320 [FreeBSD]] on freebsd4 > Type "help", "copyright", "credits" or "license" for more information. > >>> from spambayes.storage import open_storage > >>> s = open_storage(".hammiedb") > >>> s > >>> dir(s) > ['__class__', '__delattr__', '__doc__', '__getattribute__', '__hash__', > '__init__', '__new__', '__reduce__', '__reduce_ex__', '__repr__', > '__setattr__', '__str__'] > >>> s._wordinfoget("python") > Traceback (most recent call last): > File "", line 1, in ? > AttributeError: 'NoneType' object has no attribute '_wordinfoget' > >>> > > Looking at the open_storage code I don't see how it can return None, but it > clearly is. What version of SpamBayes are you using again? > > Skip > ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "Television: A medium. So called because it is neither rare nor well done." -- Ernie Kovacs From rick at gdrgroup.com Mon Nov 24 15:12:18 2003 From: rick at gdrgroup.com (Rick Ward) Date: Mon Nov 24 15:12:27 2003 Subject: [Spambayes] Binary Version .81 Message-ID: <527D1FB0F3F3E44EB1869261D523B1C47AC068@gdrmain.gdr.com> Error: There was an error initializing the SpamBayes addin. Please re-start Outlook and try again. <> <> <> <> This is on a windows 98 machine. This error is happening after an install and then opening up Outlook 2000, This is the first thing that comes up. The icons are created but that is the only thing that is created. I am using exchange 2k if that helps any. I have uninstalled and reinstalled and the same error is happening. Also installed to a different location. Tried the COM-add inn fix also. I have this installed on a 2k machine on another Thank You. Rick -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 1840 bytes Desc: spambayes3.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031124/b432d830/spambayes3.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 1878 bytes Desc: spambayes4.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031124/b432d830/spambayes4.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 1733 bytes Desc: spambayes1.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031124/b432d830/spambayes1.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes2.log Type: application/octet-stream Size: 61 bytes Desc: spambayes2.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031124/b432d830/spambayes2.obj From tpeters at mixcom.com Mon Nov 24 14:07:44 2003 From: tpeters at mixcom.com (Tom Peters) Date: Mon Nov 24 16:28:09 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previous install of a6 Message-ID: <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> I thought I was following the install instructions for spambayes correctly, but they are a little too vaguely worded for me I guess. Now I'm blown up and dead in the water. Can I unzip into the final location where it will be executed from forevermore? I download to D:\download2, and then in this case unzipped Spambayes to D:\Programs\Spambayes1.0a7 and then proceded to run the setup.py install script right there. I had 1.0a6 pop3proxy working just perfect. Downloaded 1.0a7. Unzipped it into D:\programs\Spambayes Could not get that to work. Many errors- the pop3proxy service wouldn't start: The SpamBayes Service service is starting. The SpamBayes Service service could not be started. A service specific error occurred: 1. and other problems. So I started over, this time paying much closer attention to what I was doing and installing into a slightly different directory. The first step I did executed without error: D:\Programs\spambayes-1.0a7>python setup.py install running install running build ... copying build\scripts-2.3\sb_chkopts.py -> D:\programs\python\Scripts copying build\scripts-2.3\pop3proxy_tray.py -> D:\programs\python\Scripts But the next thing I did blew up: D:\Programs\spambayes-1.0a7>python scripts/sb_server.py -b Traceback (most recent call last): File "scripts/sb_server.py", line 100, in ? import spambayes.message File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\message.py", line 198, in ? msginfoDB = MessageInfoDB(message_info_db_name) File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\message.py", line 162, in __init__ self.load() File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\message.py", line 166, in load self.dbm = dbmstorage.open(self.db_name, self.mode) File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\dbmstorage.py", line 60, in open return f(db_name, mode) File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\dbmstorage.py", line 38, in open_best return f(*args) File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\dbmstorage.py", line 20, in open_dbhash return bsddb.hashopen(*args) File "D:\PROGRAMS\PYTHON\lib\bsddb\__init__.py", line 162, in hashopen d.open(file, db.DB_HASH, flags, mode) bsddb._db.DBNoSuchFileError: (2, 'No such file or directory') Exception exceptions.AttributeError: "'MessageInfoDB' object has no attribute 'db'" in > ignored So just for kicks I tried this too: D:\Programs\spambayes-1.0a7>python windows\pop3proxy_service.py install Traceback (most recent call last): File "windows\pop3proxy_service.py", line 82, in ? import sb_server File "D:\Programs\spambayes-1.0a7\scripts\sb_server.py", line 100, in ? import spambayes.message File "D:\Programs\spambayes-1.0a7\spambayes\message.py", line 198, in ? msginfoDB = MessageInfoDB(message_info_db_name) File "D:\Programs\spambayes-1.0a7\spambayes\message.py", line 162, in __init__ self.load() File "D:\Programs\spambayes-1.0a7\spambayes\message.py", line 166, in load self.dbm = dbmstorage.open(self.db_name, self.mode) File "D:\Programs\spambayes-1.0a7\spambayes\dbmstorage.py", line 60, in open return f(db_name, mode) File "D:\Programs\spambayes-1.0a7\spambayes\dbmstorage.py", line 38, in open_best return f(*args) File "D:\Programs\spambayes-1.0a7\spambayes\dbmstorage.py", line 20, in open_dbhash return bsddb.hashopen(*args) File "D:\PROGRAMS\PYTHON\lib\bsddb\__init__.py", line 162, in hashopen d.open(file, db.DB_HASH, flags, mode) bsddb._db.DBNoSuchFileError: (2, 'No such file or directory') Exception exceptions.AttributeError: "'MessageInfoDB' object has no attribute 'db'" in > ignored And of course the pop3proxy service still won't start (same error). --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From papaDoc at videotron.ca Mon Nov 24 16:40:03 2003 From: papaDoc at videotron.ca (papaDoc) Date: Mon Nov 24 16:41:49 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previous install of a6 In-Reply-To: <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> References: <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> Message-ID: <3FC27AB3.1040602@videotron.ca> Hi, > But the next thing I did blew up: > D:\Programs\spambayes-1.0a7>python scripts/sb_server.py -b > Traceback (most recent call last): > File "scripts/sb_server.py", line 100, in ? > import spambayes.message > File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\message.py", > line 198, in ? > msginfoDB = MessageInfoDB(message_info_db_name) > File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\message.py", > line 162, in __init__ > self.load() > File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\message.py", > line 166, in load > self.dbm = dbmstorage.open(self.db_name, self.mode) > File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\dbmstorage.py", > line 60, in open > return f(db_name, mode) > File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\dbmstorage.py", > line 38, in open_best > return f(*args) > File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\dbmstorage.py", > line 20, > in open_dbhash > return bsddb.hashopen(*args) > File "D:\PROGRAMS\PYTHON\lib\bsddb\__init__.py", line 162, in hashopen > d.open(file, db.DB_HASH, flags, mode) > bsddb._db.DBNoSuchFileError: (2, 'No such file or directory') > Exception exceptions.AttributeError: "'MessageInfoDB' object has no > attribute 'db'" in > ignored I think you should install python version 2.3 to run Spambayes 1.0a7. You might also need bsddb that will run with python 2.3 Remi From richie at entrian.com Mon Nov 24 17:29:03 2003 From: richie at entrian.com (Richie Hindle) Date: Mon Nov 24 17:29:37 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previous install of a6 In-Reply-To: <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> References: <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> Message-ID: <9405svonp1uc7voo12klctn965scbvbo3u@4ax.com> [Tom] > Can I unzip into the final location where it will be executed from > forevermore? No, that's not how it works. When you unpack the archive, the files that you unpack are copied by the installer (setup.py) into their final location. Once you've installed it, you can then delete the unpacked files - they're not needed any more. On your machine, the location for the installed script files is D:\programs\python\Scripts - that's where you should run them from. -- Richie Hindle richie@entrian.com From tameyer at ihug.co.nz Mon Nov 24 17:39:23 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Mon Nov 24 17:39:50 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previous install of a6 In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304315123@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B17E@its-xchg4.massey.ac.nz> [Tom] > Can I unzip into the final location where it will be executed from > forevermore? [Richie] > No, that's not how it works. When you unpack the archive, > the files that you unpack are copied by the installer > (setup.py) into their final location. Once you've installed > it, you can then delete the unpacked files - they're not > needed any more. > > On your machine, the location for the installed script files > is D:\programs\python\Scripts - that's where you should run them from. Note that there is one exception to this (which is a known problem with the current release, and will be fixed for the next one), which applies if you use the pop3proxy_tray.py script. The script needs some files that don't get copied across to the Python scripts directory; so you either have to run the tray script from the (spambayes) windows directory (but you can delete the rest of the archive), or copy across the missing files. =Tony Meyer From tpeters at mixcom.com Mon Nov 24 17:53:47 2003 From: tpeters at mixcom.com (Tom Peters) Date: Mon Nov 24 17:54:30 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previous install of a6 In-Reply-To: <9405svonp1uc7voo12klctn965scbvbo3u@4ax.com> References: <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> Message-ID: <5.1.0.14.0.20031124165201.00bbe648@pop3.mixcom.com> _Everything_ I need to run spambayes and the pop3proxy for it gets installed into D:\programs\python\Scripts, is that what you are saying? I won't need a D:\programs\Spambayes???? directory at all, unless I choose to keep my data files there? At 10:29 PM 11/24/2003 +0000, Richie Hindle wrote: >[Tom] > > Can I unzip into the final location where it will be executed from > > forevermore? > >No, that's not how it works. When you unpack the archive, the files that >you unpack are copied by the installer (setup.py) into their final >location. Once you've installed it, you can then delete the unpacked >files - they're not needed any more. > >On your machine, the location for the installed script files is >D:\programs\python\Scripts - that's where you should run them from. > >-- >Richie Hindle >richie@entrian.com [Experts] An ordinary man away from home giving advice. --Oscar Wilde --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From richie at entrian.com Mon Nov 24 18:05:08 2003 From: richie at entrian.com (Richie Hindle) Date: Mon Nov 24 18:05:41 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previous install of a6 In-Reply-To: <5.1.0.14.0.20031124165201.00bbe648@pop3.mixcom.com> References: <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> <9405svonp1uc7voo12klctn965scbvbo3u@4ax.com> <5.1.0.14.0.20031124165201.00bbe648@pop3.mixcom.com> Message-ID: <5g35svk2ltr7a2bmfnnjklgpk8qnqudbnl@4ax.com> [Tom] > _Everything_ I need to run spambayes and the pop3proxy for it gets > installed into D:\programs\python\Scripts, is that what you are saying? Everything gets copied into various locations under D:\programs\python, yes. Except for things that don't 8-) - see Tony's post about bugs in the installer for pop3proxy_tray.py. > I won't need a D:\programs\Spambayes???? directory at all, unless I > choose to keep my data files there? Correct. -- Richie Hindle richie@entrian.com From tpeters at mixcom.com Mon Nov 24 18:19:36 2003 From: tpeters at mixcom.com (Tom Peters) Date: Mon Nov 24 18:19:59 2003 Subject: [Spambayes] Help! Can't install 1.0a7,blew up previous install of a6 In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130212B17E@its-xchg4.massey. ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1304315123@its-xchg4.massey.ac.nz> Message-ID: <5.1.0.14.0.20031124171838.00bc9000@pop3.mixcom.com> Where does one copy them to, exactly? Would this be a child directory of the python scripts directory? >scripts directory; so you either have to run the tray script from the >(spambayes) windows directory (but you can delete the rest of the archive), >or copy across the missing files. At 11:39 AM 11/25/2003 +1300, Tony Meyer wrote: >[Tom] > > Can I unzip into the final location where it will be executed from > > forevermore? > >[Richie] > > No, that's not how it works. When you unpack the archive, > > the files that you unpack are copied by the installer > > (setup.py) into their final location. Once you've installed > > it, you can then delete the unpacked files - they're not > > needed any more. > > > > On your machine, the location for the installed script files > > is D:\programs\python\Scripts - that's where you should run them from. > >Note that there is one exception to this (which is a known problem with the >current release, and will be fixed for the next one), which applies if you >use the pop3proxy_tray.py script. > >The script needs some files that don't get copied across to the Python >scripts directory; so you either have to run the tray script from the >(spambayes) windows directory (but you can delete the rest of the archive), >or copy across the missing files. > >=Tony Meyer [Commentary] Sirs, I have tested your machine. It adds a new terror to life and makes death a long felt want. --Sir Herbert Beerbohm Tree (1853-1917) To a gramophone company that asked for a testimonial --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From tpeters at mixcom.com Mon Nov 24 17:57:39 2003 From: tpeters at mixcom.com (Tom Peters) Date: Mon Nov 24 19:17:08 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previous install of a6 In-Reply-To: <3FC27AB3.1040602@videotron.ca> References: <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> Message-ID: <5.1.0.14.0.20031124165608.00bc9d78@pop3.mixcom.com> Python as installed on my machine signs on as Python 2.3 (#46, Jul 29 2003, 18:54:32) [MSC v.1200 32 bit (Intel)] on win32 Is that good enough? I have the Win32 extensions. Like I said, I've been running 1.0a6 for weeks. At 04:40 PM 11/24/2003 -0500, papaDoc wrote: >Hi, > >>But the next thing I did blew up: >>D:\Programs\spambayes-1.0a7>python scripts/sb_server.py -b >>Traceback (most recent call last): >> File "scripts/sb_server.py", line 100, in ? >> import spambayes.message >> File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\message.py", line >> 198, in ? >> msginfoDB = MessageInfoDB(message_info_db_name) >> File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\message.py", line >> 162, in __init__ >> self.load() >> File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\message.py", line >> 166, in load >> self.dbm = dbmstorage.open(self.db_name, self.mode) >> File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\dbmstorage.py", >> line 60, in open >> return f(db_name, mode) >> File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\dbmstorage.py", >> line 38, in open_best >> return f(*args) >> File "D:\PROGRAMS\PYTHON\Lib\site-packages\spambayes\dbmstorage.py", >> line 20, >>in open_dbhash >> return bsddb.hashopen(*args) >> File "D:\PROGRAMS\PYTHON\lib\bsddb\__init__.py", line 162, in hashopen >> d.open(file, db.DB_HASH, flags, mode) >>bsddb._db.DBNoSuchFileError: (2, 'No such file or directory') >>Exception exceptions.AttributeError: "'MessageInfoDB' object has no >>attribute 'db'" in >> ignored > >I think you should install python version 2.3 to run Spambayes 1.0a7. >You might also need bsddb that will run with python 2.3 > >Remi [Science] The scientific name for an animal that doesn't either run from or fight its enemies is lunch. --Michael Friedman --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From papaDoc at videotron.ca Mon Nov 24 20:48:56 2003 From: papaDoc at videotron.ca (papaDoc) Date: Mon Nov 24 20:52:03 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previous install of a6 In-Reply-To: <5.1.0.14.0.20031124165608.00bc9d78@pop3.mixcom.com> References: <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> <5.1.0.14.0.20031124165608.00bc9d78@pop3.mixcom.com> Message-ID: <3FC2B508.6080702@videotron.ca> Hi Tom, Looking more closely at the error and with the new info. I think you are missing the database. I'm not sure but I think version 1.0a6 and 1.0a7 have different name for the database or where it is save. (I'm not sure since I'm always using the latest CVS version). Can you clean up/move to a safe location the old version of spambayes. i.e. c:\Documents and Settings\your_user_name\Application Data\Spambayes Check if there is any file ending in ".ini" where you have version 1.0a7. Then reinstall version 1.0a7 not in the same directory as where 1.0a6 was. Remi > Python as installed on my machine signs on as > Python 2.3 (#46, Jul 29 2003, 18:54:32) [MSC v.1200 32 bit (Intel)] on > win32 > >>> File "D:\PROGRAMS\PYTHON\lib\bsddb\__init__.py", line 162, in >>> hashopen >>> d.open(file, db.DB_HASH, flags, mode) >>> bsddb._db.DBNoSuchFileError: (2, 'No such file or directory') >>> Exception exceptions.AttributeError: "'MessageInfoDB' object has no >>> attribute 'db'" in >> > ignored >> From Michael_Marriott at telus.net Mon Nov 24 23:11:04 2003 From: Michael_Marriott at telus.net (Michael Marriott) Date: Mon Nov 24 23:11:18 2003 Subject: [Spambayes] Help in Unistalling Spambayes Message-ID: <000001c3b30a$2884c2e0$3dd506cf@D4ZXD011> Hello. I would like to uninstall the product but cannot figure out how. Can you help? Thanks! Michael -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 145 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031124/b5ae6d94/attachment.gif From tameyer at ihug.co.nz Tue Nov 25 00:38:49 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 25 00:38:56 2003 Subject: [Spambayes] RE: [spambayes-dev] Reviewing trained messages In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304314F81@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29E7@its-xchg4.massey.ac.nz> > Could you consider an entry in the FAQ section > on how to access the database of messages which > have been trained - i.e. in the case that you > misclassify a message and want to sort it out? This is a little complicated. Basically you need to find out the id of the message (it's in the headers), and then use the "Find Message" query on the main page of the web interface to find it. You can then train it, and if the training changes, it'll do the correct untraining as necessary. Once the message is gone from the cache, though, you can't do this (by default they stay there 7 days, but 'discard' also removes them). There are other ways, but this is probably the simplest. In a future version (it's in cvs, I think it's intended for 1.1a1), you can use the 'find message' query to search by more than id. So you could search for messages in the caches that have a certain subject, for example. You're then presented with a review page that, instead of listing recently arrived, untrained, mail, lists all messages (in the caches) that match the query. You can do the normal train/discard actions from there. > In my case I have a message that was correctly > classified, but it seems like it was not delivered > to my Outlook Express inbox - a little odd I know, > but if I could access the "already trained" database > I could see if this is really the case or not. This is a different question. You say it was correctly classified - did you train on it? If not, then you should be able to see it in the regular review page. If you did, then it arrived (otherwise it wouldn't be there). You can bring it up again if you know the id (unlikely), but otherwise you'll have to wade through the ham/spam cache directories to find it (if you have a tool like grep available, you could search through them - they're just text copies of the messages). =Tony Meyer From tpeters at mixcom.com Tue Nov 25 00:43:14 2003 From: tpeters at mixcom.com (Tom Peters) Date: Tue Nov 25 00:43:41 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previous install of a6 In-Reply-To: <3FC2B508.6080702@videotron.ca> References: <5.1.0.14.0.20031124165608.00bc9d78@pop3.mixcom.com> <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> <5.1.0.14.0.20031124165608.00bc9d78@pop3.mixcom.com> Message-ID: <5.1.0.14.0.20031124233630.019deeb8@localhost> I have it working now--- I think. At least all the scripts run during install, and the proxy service starts up again. Turns out I was a little unclear on the concept, installation-wise. Unzipping the archive results in some scripts you shouldn't run- you should run one of them and then chdir over to the python directory before running the rest of them. Things are somewhat improved now. -T At 08:48 PM 11/24/2003 -0500, papaDoc wrote: >Hi Tom, > >Looking more closely at the error and with the new info. I think you are >missing the database. >I'm not sure but I think version 1.0a6 and 1.0a7 have different name for >the database or where >it is save. (I'm not sure since I'm always using the latest CVS version). >Can you clean up/move to a safe location the old version of spambayes. > >i.e. >c:\Documents and Settings\your_user_name\Application Data\Spambayes >Check if there is any file ending in ".ini" where you have version 1.0a7. > >Then reinstall version 1.0a7 not in the same directory as where 1.0a6 was. > >Remi > >>Python as installed on my machine signs on as >>Python 2.3 (#46, Jul 29 2003, 18:54:32) [MSC v.1200 32 bit (Intel)] on win32 >> >>>> File "D:\PROGRAMS\PYTHON\lib\bsddb\__init__.py", line 162, in hashopen >>>> d.open(file, db.DB_HASH, flags, mode) >>>>bsddb._db.DBNoSuchFileError: (2, 'No such file or directory') >>>>Exception exceptions.AttributeError: "'MessageInfoDB' object has no >>>>attribute 'db'" in >>>> ignored > > [Computing] Stroustroup writes in the ARM: C programmers think that memory allocation is too important to be left to the computer, Lisp programmers think that memory allocation is too important to be left to the programmer. --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From tpeters at mixcom.com Tue Nov 25 01:02:13 2003 From: tpeters at mixcom.com (Tom Peters) Date: Tue Nov 25 01:02:32 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previous install of a6 In-Reply-To: <9405svonp1uc7voo12klctn965scbvbo3u@4ax.com> References: <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> Message-ID: <5.1.0.14.0.20031124235629.01dbd5f0@localhost> I redid the install (just Spambayes, not Python) and got it more or less working. I'm trying to remember how I did it. Care to comment on my script? I know its not complete, but I'd like to know if anything here is out-and-out wrong. Spambayes Quickstart Applicable to POP3Proxy configuration and version 1.0a7 possibly other versions, Windows 2000, XP, etc. You must have Python 2.3+ already installed. It's also helpful to add a path to it. In my case, I typed this at step 4: path %path%;D:\programs\python 1. Download the package. 2. Unzip it to a temporary directory, e.g. D:\scratch, using folder names (directory names) which should create a directory tree under D:\scratch called spambayes-1.0a7 (if you're using WinZip or similar) 3. Open a command prompt in (e.g.) D:\scratch\spambayes-1.0a7 4. If your path doesn't contain the python directory, add it. 5. Run the install script for the overall package as below: D:\scratch\spambayes-1.0a7> python setup.py install 6. Change to the D:\programs\python directory, or your equivalent. 7. Run the configuration script. It opens a web browser and shows you the Spambayes Web Interface: Home page. Click on the configuration page. When done close the browser. D:\programs\python> python scripts\sb_server.py -b 8. Step 7 leaves the listener running but attached to a window. Hit Control-C and wait (a while) for it to die. Don't mind the traceback. Then setup the pop3proxy system service: D:\programs\python> python scripts\pop3proxy_service.py install 9. I chose to start the the pop3proxy service semiautomatically by creating icons on the desktop for "net start pop3proxy" and "net stop pop3proxy" and copying the start icon into my startup group. That way it's easy while testing to control whether the service starts up or not. I have the service configured as Startup option = Manual in Control Panel --> Administrative Tools --> Services, where, by the way, its display name is SpamBayes Service, not pop2proxy. Once you feel that your testing of the service is complete and there you can allow it to auto start, set its startup type to Automatic and remove it from your startup group. Leave the icons or shortcuts to start and stop around somewhere just in case, since it does seem to need to be stopped and restarted every once in a while. Information gleaned from README.TXT, INTEGRATION.TXT (possible typo), POP3PROXY.TXT etc. At 10:29 PM 11/24/2003 +0000, Richie Hindle wrote: >[Tom] > > Can I unzip into the final location where it will be executed from > > forevermore? > >No, that's not how it works. When you unpack the archive, the files that >you unpack are copied by the installer (setup.py) into their final >location. Once you've installed it, you can then delete the unpacked >files - they're not needed any more. > >On your machine, the location for the installed script files is >D:\programs\python\Scripts - that's where you should run them from. > >-- >Richie Hindle >richie@entrian.com [Love] Scratch a lover, and find a foe. --Dorothy Parker (1893-1967)M "Chant for Dark Hours," Ballade of a Great Weariness --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From tameyer at ihug.co.nz Tue Nov 25 01:15:20 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 25 01:16:25 2003 Subject: [Spambayes] How to tell SpamBayes to check more headers (using MSOplugin) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304314DB0@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B18A@its-xchg4.massey.ac.nz> > I'm still playing with it, but here's what I have so far. It > seems to work quite nicely - after a full retrain, I'm seeing > a lot of spammassassin: and > spamcop: lines near the spammy end of evidence listings. [...] Thanks - I've added this to the SpamBayes wiki () in case others are interested in it (if you'd rather it wasn't there, please feel free to remove it - OTOH, if you improve the code, or have results, feel free to add to it!). =Tony Meyer From tpeters at mixcom.com Tue Nov 25 01:16:56 2003 From: tpeters at mixcom.com (Tom Peters) Date: Tue Nov 25 01:19:05 2003 Subject: [Spambayes] 1.0a6 and Norton antivirus In-Reply-To: <5.1.0.14.0.20031107002648.02682cc8@localhost> Message-ID: <5.1.0.14.0.20031125001304.01d98878@localhost> After upgrading to 1.0a7, I still have this issue: Everything works fine and SpamBayes seems to classify messages properly, BUT all outbound messages are cranked through Norton anti virus twice each. So I went back to my approach, using ports 3025 and 3110. Works fine. At 12:36 AM 11/7/2003 -0600, Tom Peters wrote: >I had oddness with 1.0a5 and a6 and Norton antivirus under win2000 and XP. >Once I got it configured, Norton would process each and every message >twice. Once it dawned on me that it had hooked into ports 25 and 110 on >its own, was processing each message, then passing them on to "real" ports >25 and 110, I changed ports. > >I had Eudora Pro talk to ports 3025 and 3110 (chosen arbitrarily) and set >SpamBayes to proxy them to/from 25 and 110. > >Behaves very nicely now. Eudora can be told from command line which ini >file to look at for settings, so I can start it proxied or direct, >depending which icon I pick. Since I got it working with 1.0a5, I've never >used to non-proxied icon again. > >Any problems with this approach? > > >[Love] The adoration of his heart had been to her only as the perfume >of a wild flower, which she had carelessly crushed' with her foot in >passing. --Longfellow (1819-1892) >--... ...-- -.. . -. ----. --.- --.- -... >tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) >"HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters >43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc >WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA > > > > > >_______________________________________________ >Spambayes@python.org >http://mail.python.org/mailman/listinfo/spambayes >Check the FAQ before asking: http://spambayes.sf.net/faq.html [Sex] Sex is the biggest nothing of all time. --Andy Warhol (1928-1987) --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From tameyer at ihug.co.nz Tue Nov 25 01:20:04 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 25 01:20:14 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previousinstall of a6 In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304315221@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B18B@its-xchg4.massey.ac.nz> > 6. Change to the D:\programs\python directory, or your equivalent. > > 7. Run the configuration script. [...] > D:\programs\python> python scripts\sb_server.py -b > > 8. Step 7 leaves the listener running but attached to a window. Hit > Control-C and wait (a while) for it to die. Don't mind the > traceback. Then setup the pop3proxy system service: > D:\programs\python> python scripts\pop3proxy_service.py install Can't you just install the service, start it, and then open up a browser window to , without bothering with the sb_server.py script? You'll get the same results, but it seems more straightforward. You could also add a "delete the scratch folder" step at the end if you wanted to. > Information gleaned from README.TXT, INTEGRATION.TXT (possible typo), > POP3PROXY.TXT etc. Note that the INTEGRATION.TXT file has been integrated into the README, and should no longer be considered accurate (it shouldn't be in any recent releases). =Tony Meyer From papaDoc at videotron.ca Tue Nov 25 08:42:14 2003 From: papaDoc at videotron.ca (papaDoc) Date: Tue Nov 25 08:42:23 2003 Subject: [Spambayes] Re: [spambayes-dev] Three patches for better Evolution integration In-Reply-To: <3FC32825.9000600@acm.org> References: <1ED4ECF91CDED24C8D012BCF2B034F13026F29E5@its-xchg4.massey.ac.nz> <3FC32825.9000600@acm.org> Message-ID: <3FC35C36.2090801@videotron.ca> Hi, If I understand the way this will work 1- Start the first filter which call sb_evoscore.py check the return value if 2 move the mail to the spam folder. If not spam continue 2- Start the second filter which call sb_evoscore.py check the return value if 1 move the mail to the unsure folder if not unsure continue 3- Start the 3rd filter So if your mail is ham you will call sb_evosrore twice ? Remi From Ed.Bindon at bamboosolutions.com Tue Nov 25 09:14:27 2003 From: Ed.Bindon at bamboosolutions.com (Ed Bindon) Date: Tue Nov 25 09:14:31 2003 Subject: [Spambayes] Upgrade filter information lost Message-ID: Good day, I upgraded to the latest version of the Outlook add in and it seems that my old filter information has been lost. Any ideas? Edward Bindon 703 964 2031 Direct 703 481 6319 Fax 703 964 2002 Main 703 627 9623 Mobile ed.bindon@bamboosolutions.com bindon@ix.netcom.com From fuerte at sci.fi Tue Nov 25 09:35:36 2003 From: fuerte at sci.fi (Harri Pesonen) Date: Tue Nov 25 09:35:47 2003 Subject: [Spambayes] SpamBayes Outlook Plug-In Not Performing Well In-Reply-To: References: Message-ID: <3FC368B8.8010906@sci.fi> I think that I may have the same problem. I have Spam: *4557* Ham: *1469 *SpamBayes is not working as well as it used to. I wonder if SpamBayes could automatically adjust the database so that it always has an equal amount of ham and spam, just forget the extra old spam/ham if needed. What is the best way to retrain the database? It seems that SB already has all that it needs: pop3proxy-ham-cache (91 msgs) pop3proxy-spam-cache (495 msgs) pop3proxy-unknown-cache (21 msgs) The option "Days before cached messages expire:" should be changed, it should be "Number of ham/spam messages to cache: " with a default of 1000 for example. Then if we have a button "Retrain database" or "Recreate statistics", if would create the statistics from these cached messages. Simple? Of course the system should move unknown messages trained as ham/spam to the correct folders as well. Harri >From: "Jordan and Rory" >Subject: RE: [Spambayes] SpamBayes Outlook Plug-In Not Performing Well > >Tim, > >Thanks so much for the prompt response. Your advice worked! > >I retrained with the 900 or so hams I had, and about 850 spams. I now get >MUCH better distinction between spam and ham in the re-scored notes. Most >of the spams now show scores in the 90-100% range; the hams all fall in the >0-5% range. > >I guess with a steady 10-1 spam/ham ratio of incoming mail, I'll have to >periodically toss a bunch of spam and retrain. > >I greatly appreciate the help... and am now a much happier camper! :) > > -Jordan > > From Brad.Berson at abc.com Tue Nov 25 10:35:46 2003 From: Brad.Berson at abc.com (Berson, Brad) Date: Tue Nov 25 10:37:15 2003 Subject: [Spambayes] Incoming email alerts... Message-ID: Oh well. Thanks all... -----Original Message----- From: Kenny Pitt [mailto:kennypitt@hotmail.com] Sent: Monday, November 24, 2003 9:33 AM To: 'Tim Peters'; Berson, Brad Cc: spambayes@python.org Subject: RE: [Spambayes] Incoming email alerts... Tim Peters wrote: > [Berson, Brad] >> ... >> The documented problem of not being able to squash the incoming >> email notification when an email is routed to the junk e-mail >> folder is something that interests me because of the frequency of >> the junk coming in, sometimes close to 100 a day. >> >> There's a product called Qurb which according to what I've read, >> may have gotten around this obstacle somehow. I have not evaluated >> it because whitelist solutions do not interest me. Just thought >> you folks might want to check that out. > > One note of caution: reverse-engineering commercial products is > almost always a violation of the license agreement, so if someone > does want to do this, do *not* purchase Qurb and crack how they do > it. Any sign of stuff like that in contributed code, and it will be > rejected -- that can create legal problems for the whole project. > Since SpamBayes doesn't generate any revenue, it doesn't have any > funds to pay lawyers either. With no reverse engineering required, I read Qurb's FAQ on their website. They state quite clearly that they are replacing the Outlook notification with their own, and that the notification only indicates messages that are left in the Inbox. If an Outlook rule moves a message to another folder, there is a good chance that Qurb will not show any notification of that message. This probably isn't the behavior we would want, anyway. -- Kenny Pitt From mail at edreggi.com Tue Nov 25 11:03:17 2003 From: mail at edreggi.com (Ed Reggi) Date: Tue Nov 25 11:03:31 2003 Subject: [Spambayes] Spam Bayes questions Message-ID: I LOVE your software! I am using the Outlook Add-in. I am running Win2k with Outlook 2000. I have been using your SpamBayes since June and find it to be excellent. I really have no problems until yesterday. I added several new .PST folders in my Outlook tree. I did that to better organize my 4 different domain email accounts. Here is the problem, SPAM is indeed marked as Spam (100%) but is no longer moved over to the original SPAM folder in my original Personal Folder PST. Does SpamBayes not like cross moving from PST to PST folders? Must I start over and retrain the whole thing? Any suggestions or assistance would be nice! Thank you, Ed Reggi mail@edreggi.com --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.543 / Virus Database: 337 - Release Date: 11/21/2003 From kennypitt at hotmail.com Tue Nov 25 11:16:41 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 25 11:17:13 2003 Subject: [Spambayes] Spam Bayes questions In-Reply-To: Message-ID: Ed Reggi wrote: > I added several new .PST folders in my Outlook tree. I did that to > better organize my 4 different domain email accounts. Here is the > problem, SPAM is indeed marked as Spam (100%) but is no longer moved > over to the original SPAM folder in my original Personal Folder PST. Are you using Outlook rules to move messages received from certain accounts to folders in the separate PST files? If so, have you enabled background filtering in SpamBayes to make sure that Outlook rules run first? Also, make sure that you have added each of the folders to which mail will be delivered to the list of filtered folders in SpamBayes. -- Kenny Pitt From morgan at mcode.co.za Tue Nov 25 11:19:02 2003 From: morgan at mcode.co.za (Morgan Collett) Date: Tue Nov 25 11:19:32 2003 Subject: [Spambayes] sb_server.py on linux - web interface not working Message-ID: <3FC380F6.5000602@mcode.co.za> I'm installing spambayes on my DSL gateway, a debian linux box. I'm running the following command line: /usr/bin/sb_server.py -D /var/spambayes/spam.pickle pop.mcode.co.za I get the following text when it starts: > SpamBayes POP3 Proxy Beta3, version 0.3 (September 2003), > using SpamBayes POP3 Proxy Web Interface Alpha3, version 0.03 > and engine SpamBayes Beta2, version 0.2 (July 2003). > > Loading database... Listener on port 110 is proxying pop.mcode.co.za:110 > User interface url is http://localhost:8880/ The problem is that the web interface is not working. If I use Mozilla to go to the URL (from a different box on my network) I get "Document contains no data". If I telnet to the port, I get the following: > # telnet localhost 8880 > Trying 127.0.0.1... > Connected to localhost. > Escape character is '^]'. > GET / > > HTTP/1.0 400 Error > Connection: close > Content-Type: text/html > > Malformed request: 'GET /'Connection closed by foreign host. If I start sb_server.py with -b, it opens lynx on my gateway box, which just sits there trying to connect until I hit ^C. Then I get: > error: uncaptured python exception, closing channel > > (socket.error:(32, 'Broken pipe') > [/usr/lib/python2.2/asynchat.py|initiate_send|213] > [/usr/lib/python2.2/asyncore.py|send|339]) I then hit ^C again, and get: > Traceback (most recent call last): > File "/usr/bin/sb_server.py", line 833, in ? > run() > File "/usr/bin/sb_server.py", line 827, in run > start(state=state) > File "/usr/bin/sb_server.py", line 767, in start > main(state.servers, state.proxyPorts, state.uiPort, state.launchUI) > File "/usr/bin/sb_server.py", line 748, in main > Dibbler.run(launchBrowser=launchUI) > File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 689, in run > asyncore.loop(map=context._map) > File "/usr/lib/python2.2/asyncore.py", line 201, in loop > poll_fun (timeout, map) > File "/usr/lib/python2.2/asyncore.py", line 79, in poll > r,w,e = select.select (r,w,e, timeout) > KeyboardInterrupt Note: I'm using Debian stable (woody) which has Python 2.2.1. I have copied the /usr/lib/python2.2/email/ files from the install from debian unstable, which has Python 2.2.3. That fixed the error message when I installed spambayes. Do I need to do anything else? From marka at firstquadrant.com Tue Nov 25 11:42:16 2003 From: marka at firstquadrant.com (Mark Arredondo) Date: Tue Nov 25 11:43:23 2003 Subject: [Spambayes] Question regarding out of office assistant Message-ID: Is it possible to get Span Bayes to filter spam prior to the out of office message being sent? Thanks, Mark ---------------------------------------------------------------------------- Mark Arredondo First Quadrant, L.P. (626) 683-4169 marka@firstquadrant.com From kennypitt at hotmail.com Tue Nov 25 11:43:04 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 25 11:43:35 2003 Subject: [Spambayes] sb_server.py on linux - web interface not working In-Reply-To: <3FC380F6.5000602@mcode.co.za> Message-ID: Morgan Collett wrote: > I'm installing spambayes on my DSL gateway, a debian linux box. > > I'm running the following command line: > /usr/bin/sb_server.py -D /var/spambayes/spam.pickle pop.mcode.co.za > > I get the following text when it starts: > > > SpamBayes POP3 Proxy Beta3, version 0.3 (September 2003), > > using SpamBayes POP3 Proxy Web Interface Alpha3, version 0.03 > > and engine SpamBayes Beta2, version 0.2 (July 2003). > > > > Loading database... Listener on port 110 is proxying > pop.mcode.co.za:110 > User interface url is http://localhost:8880/ > > The problem is that the web interface is not working. If I use Mozilla > to go to the URL (from a different box on my network) I get "Document > contains no data". Just a few thoughts that will hopefully get you started on troubleshooting (maybe someone more familiar with Linux can add to this). First, "localhost" is a standard name used to refer to the system that you are running on, so you can't use that name to connect from "a different box on the network". You'll need to use the actual hostname of the box (e.g. "mygateway") or its IP address. Second, SpamBayes by default prevents you from connecting to the Web interface from a remote system for security reasons. You need to change a configuration setting to allow other systems to connect. This is normally done from the Advanced Configuration page in the UI, but it sounds like you aren't able to get there even on the gateway box so you'll have to edit the config file manually. Sorry I can't tell you precisely where to find it on Linux, but it will likely be in the same directory as your database ("/var/spambayes"?) and will probably have "bayescustomize" somewhere in the name. You'll need to add the following lines in this file: [html_ui] allow_remote_connections:1.2.3.4 If there is already an "[html_ui]" section then you can just insert the second line under there. "1.2.3.4" should be replaced by the IP address of the remote machine that you want to connect from. Here is the complete help string for the option: """Enter a list of trusted IPs, separated by commas. Remote connections from any of them will be allowed. You can trust any IP using a single '*' as field value. You can also trust ranges of IPs using the '*' character as a wildcard (for instance 192.168.0.*). The localhost IP will always be trusted. Type 'localhost' in the field to trust this only address.""", -- Kenny Pitt From kennypitt at hotmail.com Tue Nov 25 12:03:09 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Tue Nov 25 12:03:41 2003 Subject: [Spambayes] Question regarding out of office assistant In-Reply-To: Message-ID: Mark Arredondo wrote: > Is it possible to get Span Bayes to filter spam prior to the out of > office message being sent? Unfortunately not. IIRC, the Out of Office message is sent by the Exchange Server as soon as the message is received. Outlook doesn't even need to be running for this to happen since you are "out of the office". -- Kenny Pitt From Karl-Heinz.Dreinhoff at t-online.de Tue Nov 25 03:19:48 2003 From: Karl-Heinz.Dreinhoff at t-online.de (Karl-Heinz Dreinhoff) Date: Tue Nov 25 12:49:49 2003 Subject: [Spambayes] spam bayes configuration not possible: feature not available Message-ID: <000001c3b32c$e7a79b80$fe78a8c0@p4> Operating System: Windows XP SP1 german Office Version: Office XP german Since some months I used Spam Bayes 007 and 0081. It worked fine. Configuring via task bar also worked fine, until 4 days ago. Since then, when I click to a spam bayes button, a message box opens (see the pdf document enclosed: message #1). Clicking to 'Cancel' produces message #2. Clicking to 'OK' produces message #3. Becaused spam bayes had been configured before, filtering was working on. I uninstalled spam bayes an installed it again - the failure remains. But now filtering is working no more. I cannot identify what happened 4 days ago what made the spam bayes alter its behaviour. Karl-Heinz Dreinhoff -------------- next part -------------- A non-text attachment was scrubbed... Name: Messages.pdf Type: application/pdf Size: 34756 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031125/a88974ba/Messages-0001.pdf From papaDoc at videotron.ca Tue Nov 25 13:19:59 2003 From: papaDoc at videotron.ca (papaDoc) Date: Tue Nov 25 13:21:39 2003 Subject: [Spambayes] sb_server.py on linux - web interface not working In-Reply-To: <3FC380F6.5000602@mcode.co.za> References: <3FC380F6.5000602@mcode.co.za> Message-ID: <3FC39D4F.30805@videotron.ca> Hi Morgan, We need more info, > I'm installing spambayes on my DSL gateway, a debian linux box. This is good ;-) > I'm running the following command line: > /usr/bin/sb_server.py -D /var/spambayes/spam.pickle pop.mcode.co.za > > I get the following text when it starts: > > > SpamBayes POP3 Proxy Beta3, version 0.3 (September 2003), > > using SpamBayes POP3 Proxy Web Interface Alpha3, version 0.03 > > and engine SpamBayes Beta2, version 0.2 (July 2003). > > > > Loading database... Listener on port 110 is proxying > pop.mcode.co.za:110 > > User interface url is http://localhost:8880/ > > The problem is that the web interface is not working. If I use Mozilla > to go to the URL (from a different box on my network) I get "Document > contains no data". So if I understand correctly On PC (pc_A) you do: /usr/bin/sb_server.py -D /var/spambayes/spam.pickle pop.mcode.co.za On PC (pc_B) you do: http://localhost:8880/ ???? You should do: http://pc_A:8880/ since on pc_B localhost is the same as pc_B > If I telnet to the port, I get the following: > > > # telnet localhost 8880 > > Trying 127.0.0.1... > > Connected to localhost. > > Escape character is '^]'. > > GET / > > > > HTTP/1.0 400 Error > > Connection: close > > Content-Type: text/html > > > > Malformed request: 'GET /'Connection > closed by foreign host. You are on which pc when you do that ? you should be on pc_A > > If I start sb_server.py with -b, it opens lynx on my gateway box, > which just sits there trying to connect until I hit ^C. Then I get: Do you get any error before doing the ^C ? Remi P.S, My question are very basic but your message is not clear (at least to me ;-6 ) but from your setup you don't look like a "debutant". (debian, lynx, unstable....) P.S,S My home server run Debian (Woody) with the oython from the unstable and I'm able to run sb_mboxtrain and sb_filter but I did not try the sb_serber. you can also try to create the database with sb_mboxtrain just to test the database creation. From michael at ccminc.com Tue Nov 25 14:43:48 2003 From: michael at ccminc.com (Michael Burkett) Date: Tue Nov 25 14:44:11 2003 Subject: [Spambayes] Rules Message-ID: <000001c3b38c$72e305e0$4101a8c0@michael> Is there a way to export rules from 1 computer to the next? ie email that I have deemed as spam to another user? Thank you, Michael Burkett CCM Investment Advisers Tele 803.254.9500 Fax 803.252.9530 Email michael@ccminc.com From skip at pobox.com Tue Nov 25 15:24:34 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Nov 25 15:24:50 2003 Subject: [Spambayes] Rules In-Reply-To: <000001c3b38c$72e305e0$4101a8c0@michael> References: <000001c3b38c$72e305e0$4101a8c0@michael> Message-ID: <16323.47746.799326.722609@montanaro.dyndns.org> Michael> Is there a way to export rules from 1 computer to the next? ie Michael> email that I have deemed as spam to another user? Sure, just copy the database (spambayes.db, hammie.db, whatever it's called on your machine) to the corresponding location on the other machine. Skip From bughuntr at one.ctelcom.net Tue Nov 25 16:08:08 2003 From: bughuntr at one.ctelcom.net (Bug Hunter) Date: Tue Nov 25 16:08:24 2003 Subject: [Spambayes] failure to install 1.0a7 on Mandrake 9.2 download edition Message-ID: I'm trying to get spambayes 1.0a7 on Mandrake 9.2 download edition. The error that comes back is: [root@one spambayes-1.0a7]# ./setup.py install running install error: invalid Python installation: unable to open /usr/lib/python2.3/config/Makefile (No such file or directory) Any clues would be helpful. I've previously installed spambayes on a Redhat, but that was after downloading and installing it by hand, so the Makefile was there. This is a fresh Mandrake 9.2 install. I found another email saying to do a /usr/local/bin softlink. That did not help, of course. Should I disable this test in the python code? -- --------------------------------- Sick and tired of spam? Ask me about Mail Deflector! http://www.maildeflector.com --------------------------------- From skip at pobox.com Tue Nov 25 16:16:02 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Nov 25 16:16:14 2003 Subject: [Spambayes] failure to install 1.0a7 on Mandrake 9.2 download edition In-Reply-To: References: Message-ID: <16323.50834.83655.244315@montanaro.dyndns.org> Bug> [root@one spambayes-1.0a7]# ./setup.py install Bug> running install Bug> error: invalid Python installation: unable to open Bug> /usr/lib/python2.3/config/Makefile (No such file or directory) Make sure you have the Python development RPM installed. Skip From richie at entrian.com Tue Nov 25 16:18:52 2003 From: richie at entrian.com (Richie Hindle) Date: Tue Nov 25 16:19:23 2003 Subject: [Spambayes] sb_server.py on linux - web interface not working In-Reply-To: <3FC380F6.5000602@mcode.co.za> References: <3FC380F6.5000602@mcode.co.za> Message-ID: <4nh7svkvoi62ibolmkpshb4tm47aihs9t4@4ax.com> > # telnet localhost 8880 > Trying 127.0.0.1... > Connected to localhost. > Escape character is '^]'. > GET / > > HTTP/1.0 400 Error > Connection: close > Content-Type: text/html > > Malformed request: 'GET /'Connection closed > by foreign host. I'm not sure what other problems you might be having, but "GET /" is invalid. You should say "GET / HTTP/1.0" (or 0.9 or 1.1; doesn't make any difference). -- Richie Hindle richie@entrian.com From skip at pobox.com Tue Nov 25 16:30:03 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Nov 25 16:30:34 2003 Subject: [Spambayes] sb_server.py on linux - web interface not working In-Reply-To: <4nh7svkvoi62ibolmkpshb4tm47aihs9t4@4ax.com> References: <3FC380F6.5000602@mcode.co.za> <4nh7svkvoi62ibolmkpshb4tm47aihs9t4@4ax.com> Message-ID: <16323.51675.47170.514925@montanaro.dyndns.org> Richie> I'm not sure what other problems you might be having, but "GET Richie> /" is invalid. You should say "GET / HTTP/1.0" (or 0.9 or 1.1; Richie> doesn't make any difference). Actually, "GET /" was what 0.9 used. The "HTTP/version" thing was added in 1.0, and serves to distinguish from 0.9 which also didn't support RFC-822-style headers. That's why to get a response from a 1.n server you have to hit return twice. A simple "GET /" should be valid with servers which still support it. It appears Google's server doesn't: titan:skip% telnet www.google.com 80 Trying 216.239.59.99... Connected to www.google.com. Escape character is '^]'. GET / Connection closed by foreign host. titan:skip% telnet www.google.com 80 Trying 216.239.59.99... Connected to www.google.com. Escape character is '^]'. GET / HTTP/1.0 HTTP/1.0 200 OK Cache-control: private Content-Type: text/html Set-Cookie: PREF=ID=1645a57593cd4cf6:TM=1069795629:LM=1069795629:S=XkFjoFsxQxjBaOpy; expires=Sun, 17-Jan-2038 19:14:07 GMT; path=/; domain=.google.com Server: GWS/2.1 Content-length: 2690 Date: Tue, 25 Nov 2003 21:27:09 GMT Connection: Keep-Alive ... Skip From bughuntr at one.ctelcom.net Tue Nov 25 16:30:54 2003 From: bughuntr at one.ctelcom.net (Bug Hunter) Date: Tue Nov 25 16:31:07 2003 Subject: [Spambayes] failure to install 1.0a7 on Mandrake 9.2 download edition In-Reply-To: <16323.50834.83655.244315@montanaro.dyndns.org> References: <16323.50834.83655.244315@montanaro.dyndns.org> Message-ID: Thanks! That was it. DOH! On Tue, 25 Nov 2003, Skip Montanaro wrote: > > Bug> [root@one spambayes-1.0a7]# ./setup.py install > Bug> running install > Bug> error: invalid Python installation: unable to open > Bug> /usr/lib/python2.3/config/Makefile (No such file or directory) > > Make sure you have the Python development RPM installed. > > Skip > -- --------------------------------- Sick and tired of spam? Ask me about Mail Deflector! http://www.maildeflector.com --------------------------------- From tpeters at mixcom.com Tue Nov 25 16:48:33 2003 From: tpeters at mixcom.com (Tom Peters) Date: Tue Nov 25 16:49:06 2003 Subject: [Spambayes] Spambayes Ultra Quick Start Doc In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130212B18B@its-xchg4.massey. ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1304315221@its-xchg4.massey.ac.nz> Message-ID: <5.1.0.14.0.20031125154440.00b3b880@localhost> Tony / Richie / All: What follows is a ultra quick startup guide, really an outline. I should include an exhortation to read the rest of the included docs, but I myself did so, and as Tony will attest, screwed up my install of 1.0a7 but good anyhow. This is intended to outline the steps in rought form. I'd like some feedback as before, not so much as to what I left out (although you could do that too) but more as to any outright lies told below. This doc is based entirely on my own experiences installing (incorrectly) 1.0a5, a6, and a7. Spambayes Ultra Quickstart Applicable to POP3Proxy configuration and version 1.0a7 possibly other versions and configurations, Windows 2000, XP, etc. You must have Python 2.3+ already installed. It's also helpful to add a path to it. In my case, I typed this at step 4: path %path%;D:\programs\python 1. Download the package. 2. Unzip it to a temporary directory, e.g. D:\scratch, using folder names (directory names) which should create a directory tree under D:\scratch called spambayes-1.0a7 3. Open a command prompt in (e.g.) D:\scratch\spambayes-1.0a7 4. If your path doesn't contain the python directory, add it. 5. Run the install script for the overall package as below: D:\scratch\spambayes-1.0a7> python setup.py install 6. Change to the D:\programs\python directory, or your equivalent. 7. Setup the pop3proxy system service: D:\programs\python> python scripts\pop3proxy_service.py install 8. I chose to start the the pop3proxy service semiautomatically by creating icons on the desktop for "net start pop3proxy" and "net stop pop3proxy" and copying the start icon into my startup group. That way it's easy while testing to control whether the service starts up or not. I have the service configured as Startup option = Manual in Control Panel --> Administrative Tools --> Services, where, by the way, its display name is SpamBayes Service, not pop3proxy. Once you feel that your testing of the service is complete and there you can allow it to auto start, set its startup type to Automatic and remove it from your startup group. Leave the icons or shortcuts to start and stop around somewhere just in case, since it does seem to need to be stopped and restarted every once in a while. 9. Start the pop3proxy service. The command line D:\any_prompt> net start pop3proxy should do it, else use a shortcut or icon you devised in step 8. 10. Open a web browser and press ESC or click the stop icon so you don't have to wait for it to open your start page. Type in the following address and press ENTER: http://localhost:8880 This should open a page called Spambayes Web Interface: Home. If it doesn't, the pop3proxy service (see step 8) isn't running. 11. Click on the link at "You can configure your Spambayes system using the Configuration page." Configure Spambayes there. Information on suggestion configuration to follow later. Notes: Information gleaned from README.TXT, POP3PROXY.TXT etc. The file INTEGRATION.TXT, formerly included with all alpha releases, is depreciated, per Tony Meyer, and may not be included in future releases. At 07:20 PM 11/25/2003 +1300, Tony Meyer wrote: >Can't you just install the service, start it, and then open up a browser >window to , without bothering with the sb_server.py >script? You'll get the same results, but it seems more straightforward. >You could also add a "delete the scratch folder" step at the end if you >wanted to. > >=Tony Meyer [Philosophy] If you keep your mind sufficiently open, people will throw a lot of rubbish into it. --William Orton --... ...-- -.. . -. ----. --.- --.- -... tpeters@nospam.mixcom.com (internet) remove "nospam." N9QQB (ham) "HEY YOU" (loud shouting) WEB ADDRESS http//www.mixweb.com/tpeters 43 7' 17.2" N, by 88? 6' 28.9" W, Elevation 815', Grid Square EN53wc WAN/LAN/Telcom Analyst, Tech Writer, MCP, Cisco Certified CCNA From gerrit at nl.linux.org Tue Nov 25 16:49:22 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Tue Nov 25 16:49:49 2003 Subject: [Spambayes] train_on_filter Message-ID: <20031125214922.GA7934@nl.linux.org> Hi, as I understand it, train_on_filter automatically trains all messages filtered which are not classified as unsure. If I would put this option on, the ham/spam ratio of my database would steadily move toward the ham/spam ratio of all incoming e-mail, which is approximately 10. This ratio is far from optimal, of course. Is it possible to tell Spambayes to train on filter, but only if this will not lead to a ham/spam ratio larger than, e.g., 1.5 or smaller than 0.5? yours, Gerrit Holl. -- 102. If a merchant entrust money to an agent (broker) for some investment, and the broker suffer a loss in the place to which he goes, he shall make good the capital to the merchant. -- 1780 BC, Hammurabi, Code of Law -- Asperger's Syndrome - a personal approach: http://people.nl.linux.org/~gerrit/english/ From epxmhwg02 at sneakemail.com Tue Nov 25 17:18:33 2003 From: epxmhwg02 at sneakemail.com (Joe) Date: Tue Nov 25 17:18:40 2003 Subject: [Spambayes] concerns over MS patents Message-ID: <2939-10502@sneakemail.com> Has anyone looked at Microsoft's two patents on probabilistic filtering? I'm stunned that these two patents would be approved. If I'm reading them correctly, they give MS a lock on ALL probabilistic email filtering. (Which would include SpamBayes, PopFile, K9, and dozens of other email filters.) Sorry about the length: http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=/netahtml/srchnum.htm&r=1&f=G&l=50&s1=6,192,360.WKU.&OS=PN/6,192,360&RS=PN/6,192,360 and http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=/netahtml/srchnum.htm&r=1&f=G&l=50&s1=6,161,130.WKU.&OS=PN/6,161,130&RS=PN/6,161,130 Has MS "bought" the whole spam-filtering playing field? From richie at entrian.com Tue Nov 25 17:35:26 2003 From: richie at entrian.com (Richie Hindle) Date: Tue Nov 25 17:35:57 2003 Subject: [Spambayes] notate_subject In-Reply-To: <20031111224852.GE18670@hank.marinar.com> References: <1ED4ECF91CDED24C8D012BCF2B034F1303A3C939@its-xchg4.massey.ac.nz> <1ED4ECF91CDED24C8D012BCF2B034F13026F29B6@its-xchg4.massey.ac.nz> <20031111224852.GE18670@hank.marinar.com> Message-ID: Hank, A couple of weeks ago you and Tony posted patches to make sb_filter obey the notate_to and notate_subject options. I've finally applied a modified version of them, that respects both of those options. I've committed it to the head, so it may not be in the next bugfix release, but will be in the next feature release. > I by no means mean to nitpick, but it would be very nice to see the > Prob score in the Subject as well. I didn't do this, partly because none of the other header-modifying spambayes applications does it, and partly because it made the To / Subject very messy looking. And I find that the scores get less interesting over time - they're fun to look at at first, and useful when debugging, but day to day I don't look at them. YMMV. 8-) > So far, I am VERY happy with how this filter is performing. I am now > rolling it out to the rest of my clients, and so far it is well > received. Hope its still going well? -- Richie Hindle richie@entrian.com From tameyer at ihug.co.nz Tue Nov 25 17:40:10 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 25 17:40:17 2003 Subject: [Spambayes] Header annotations included in classification? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304315000@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29EA@its-xchg4.massey.ac.nz> [Remi] > If you retrain (an email was not classified correctly, or > retrain from scratch with old mail ) this can be a problem > but there is options to not look at headers but I think > the subject is always classified. [Dave Brueck] > Aha! Thanks for your help - I feel a bit dumb but now > understand my problem better. :) On the Home -> Review web > page I was looking through the list of emails for which it > was unsure and marking them as Ham or Spam and then clicking > the Train button. But as Spambayes began to classify emails > on its own they began to also show up in their own section > ("Messages classified as Spam") and by default they have the > "Spam" radio button selected so by clicking Train I was > running them through as training material as well. A little late, but some clarification: When SpamBayes trains via (the review page of) the web interface, it's not accessing the mail from your mail client (whatever it is) - it's using a copy of the message that it stored in it's cache when the message arrived. This copy does not have any SpamBayes information in it - it's a raw copy of the incoming message. So there's no way for training via (the review page of) the web interface to include SpamBayes data, unless it was in the original message. The review page could strip out the [Spam] bit from the headers, if it is adding them; I note that there's a feature request open about this, so I'll (or someone else'll) get to this at some point. If you *retrain* (correct previous training, for example) on mail via the web interface (using the "find message" query), then this also works with the cached messages, so will also work correctly. If, however, you train from scratch by providing mbox/dbx files via the front page of the web interface, and the mail in those files has SpamBayes data, it *will* be used. For the most part, this won't matter - all the SpamBayes headers are ignored by default, so they won't have any effect. The modified subject/to header, *will* be tokenized, though. (This isn't something I had considered previously). I think that the correct behaviour would be to strip '[ham|unsure|spam]' from subjects when tokenizing if SpamBayes is set to add those classifications. This wouldn't be perfect - and maybe would lose information that something else is adding - but it's probably more reliable that the current situation (I think it's probably better to not use all the data than to use incorrect data). I'll (or whoever) will look into this when dealing with the feature request. =Tony Meyer From skip at pobox.com Tue Nov 25 17:40:41 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Nov 25 17:41:19 2003 Subject: [Spambayes] train_on_filter In-Reply-To: <20031125214922.GA7934@nl.linux.org> References: <20031125214922.GA7934@nl.linux.org> Message-ID: <16323.55913.26215.650467@montanaro.dyndns.org> Gerrit> Is it possible to tell Spambayes to train on filter, but only if Gerrit> this will not lead to a ham/spam ratio larger than, e.g., 1.5 or Gerrit> smaller than 0.5? No, but the topic has been discussed in the past. Feel free to tweak the source and submit a patch while we Americans are pigging out on turkey and watching (American) football for the next few days... ;-) Skip From richie at entrian.com Tue Nov 25 17:42:14 2003 From: richie at entrian.com (Richie Hindle) Date: Tue Nov 25 17:42:44 2003 Subject: [Spambayes] Can't start pop3proxy_service alpha 7 In-Reply-To: References: Message-ID: <3dm7svku2qdl9smqbtehdqstmnp3nk6vqk@4ax.com> Martin, > msginfoDB = MessageInfoPickle(message_info_db_name) > [...] > KeyError: '\x00' I'm half guessing, but I wonder if your old bsddb messageinfo database is now being erroneously loaded as a pickle? There's a new feature in 1.0a7 whereby if you're using a pickle for your main DB, the messageinfo DB is also a pickle. Deleting the old one should sort it out (since you're not using the IMAP filter, deleting it is quite safe). Shut down sb_server (or whatever you run), delete spambayes.messageinfo.db and restart. WHAT_IS_NEW.txt claims that the upgrade will have no ill effects outside the IMAP filter, but perhaps we got that wrong? -- Richie Hindle richie@entrian.com From richie at entrian.com Tue Nov 25 17:44:19 2003 From: richie at entrian.com (Richie Hindle) Date: Tue Nov 25 17:44:49 2003 Subject: [Spambayes] Re: Can't start pop3proxy_service alpha 7 In-Reply-To: References: Message-ID: <6nm7sv0c3p7qitp5lr3tv03nspmjq69mui@4ax.com> Martin, [you, ages ago] > I'll try uninstalling python, deleting everything, and starting over. > I'll post again with the results. > [...] > Results: same thing :( [me, just now, irrelevantly] > delete spambayes.messageinfo.db and restart Sorry, that's completely irrelevant advice - I've just read your second posting again. Did you manage to get anywhere with the problem? -- Richie Hindle richie@entrian.com From tameyer at ihug.co.nz Tue Nov 25 17:47:09 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 25 17:47:15 2003 Subject: [Spambayes] Can't start pop3proxy_service alpha 7 In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130431544E@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B18F@its-xchg4.massey.ac.nz> [Richie] > WHAT_IS_NEW.txt claims that the upgrade will have no ill > effects outside the IMAP filter, but perhaps we got that wrong? My bad. I was in a bit of a hurry when I wrote the 1.0a7 stuff, and missed out a bit. It should say that if you're using a pickle, then you will need to remove your old message info db, and that *removing it* will have no ill effects outside the IMAP filter. It neglects to mention the removing step. My apologies. :) =Tony Meyer From richie at entrian.com Tue Nov 25 17:48:36 2003 From: richie at entrian.com (Richie Hindle) Date: Tue Nov 25 17:49:04 2003 Subject: [Spambayes] SpamBayes Outlook Plug-In Not Performing Well In-Reply-To: <3FC368B8.8010906@sci.fi> References: <3FC368B8.8010906@sci.fi> Message-ID: Harri, > What is the best way to retrain the database? It seems that SB already > has all that it needs: > > pop3proxy-ham-cache (91 msgs) > pop3proxy-spam-cache (495 msgs) > pop3proxy-unknown-cache (21 msgs) You're right, and retraining is exactly the reason that trained messages are kept like that. However, no-one every got around to implementing the feature. Retraining from scratch is so easy - you only need a few dozen messages to get good results - that there's been no pressure for the feature. And of course the developers (that's mostly me in the case of the web training interface 8-) know to keep their database in balance, so they don't run into the problem. The next feature release has a feature whereby the web interface warns you if it thinks your database is getting too imbalanced. That may prove to be enough to avoid the problem. -- Richie Hindle richie@entrian.com From tameyer at ihug.co.nz Tue Nov 25 17:54:32 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 25 17:54:42 2003 Subject: [Spambayes] problem In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130431539E@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B190@its-xchg4.massey.ac.nz> > I've looked in every folder I have-one of the it techs set > this up for me, evidently there is no folder for mail > spambayes is unsure of, only spam folder-- So what's it doing with mail it's unsure about? Leaving it in the inbox? Or are the thresholds set so that SpamBayes is making a binary decision? You say that you've looked in every folder you have - have you tried using the "Advanced Find" feature of Outlook? I've seen another message recently from someone who said that this found the messages in the Inbox, even though he couldn't see them (I'm not sure how that's possible, but it's what the message said). > also, should I be > deleting some of those emails?--that folder is getting really > full See FAQ 4.6: > -I am still getting some in-coming mail blocked that > dissappeares mysteriously and I can't find it. SpamBayes only moves messages, it never removes them, so either they're there, or some other program is touching them. Can you duplicate this problem? i.e. send yourself a message that doesn't appear? Does this happen if you disable spambayes? (from the main "SpamBayes Manager" dialog) =Tony Meyer From richie at entrian.com Tue Nov 25 17:55:56 2003 From: richie at entrian.com (Richie Hindle) Date: Tue Nov 25 17:56:25 2003 Subject: [Spambayes] Help! Can't install 1.0a7, blew up previous install of a6 In-Reply-To: <5.1.0.14.0.20031124235629.01dbd5f0@localhost> References: <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> <5.1.0.14.0.20031124125332.00bb31f0@pop3.mixcom.com> <9405svonp1uc7voo12klctn965scbvbo3u@4ax.com> <5.1.0.14.0.20031124235629.01dbd5f0@localhost> Message-ID: Tom, > I redid the install (just Spambayes, not Python) and got it more or less > working. I'm trying to remember how I did it. Care to comment on my script? Great! Glad to know you're up and running. I have one comment on your script: > 7. [...] When done close the browser. [...] > 8. [...] Hit Control-C and wait (a while) for it to die. Don't mind the > traceback. This is a bit harsh. "When done, click Save and shutdown" would be better. Otherwise, that's great. Now for double bonus points 8-) add it to the SpamBayes Wiki at http://entrian.com/sbwiki in the UserRecipes section... -- Richie Hindle richie@entrian.com From tameyer at ihug.co.nz Tue Nov 25 18:25:18 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 25 18:25:24 2003 Subject: [Spambayes] RE: Spambayes In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304315360@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29EC@its-xchg4.massey.ac.nz> > However, something strange is going on. I found your > email reply while reviewing messages at localhost:8800 > - it was classified as "ham" you'll be glad to hear - > but it was never delivered to my Outlook Express inbox. Are you certain? For this to happen something would have to go wrong in the POP3 conversation between the mail client and the proxy, but after spambayes saved a copy of the message. I would have thought that the mail client would then give an error. To simply not deliver a message seems unlikely. If you set the "verbose" option to true, you get a log that includes the POP3 conversation. Attaching this to a bug report would (probably) be helpful. > I'm beginning to wonder if Spambayes is losing other > incoming messages of mine too. The command window is > showing loads of errors - I'll append one below in case > it's of any use. [...] > IOError: [Errno 2] No such file or directory: 'C:\\Documents and Settings\\Berna > rd Payne\\Application Data\\SpamBayes\\Proxy\\spam_cache\\1069081264' [...] This is an odd message to be getting. It seems to indicate that files are being removed from your cache directories by something other than SpamBayes. Is that possible? SpamBayes really doesn't like any changes being made to those cache directories by anyone else. In any case, if the message isn't there in the cache, this would cause the *opposite* problem - messages would appear in your mail client, but not on the review page. Do error messages appear *during the time that mail is arriving*? If so, these are problem different than the above and would be more useful. In fact, a complete listing of all the errors (attached to a bug report on sourceforge) would be the best. =Tony Meyer From kfritsche at oceprinting.com Tue Nov 25 18:29:44 2003 From: kfritsche at oceprinting.com (Fritsche, Ken) Date: Tue Nov 25 18:25:30 2003 Subject: [Spambayes] Spam Folder Message-ID: <1091D1207C5B884088EE8B7A2492D970027F8FC6@brsmtp01.oceprinting.com> You have a great program. But suddenly the folder entitled "SPAM" has disappeared from my folder list. All the actions continue to take place, meaning the Spam emails are suddenly swept away, and the unsure ones are sent into the Suspected folder, but I can't see the Spam folder. Any ideas you can offer?? From tameyer at ihug.co.nz Tue Nov 25 18:29:13 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 25 18:29:20 2003 Subject: [Spambayes] Spam Folder In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130431546A@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B193@its-xchg4.massey.ac.nz> > You have a great program. But suddenly the folder entitled > "SPAM" has disappeared from my folder list. All the actions > continue to take place, meaning the Spam emails are suddenly > swept away, and the unsure ones are sent into the Suspected > folder, but I can't see the Spam folder. Any ideas you can offer?? The spam folder is probably hidden under some other folder - in Outlook it's very simple to accidentally drag a folder to become a sub folder of other one. Try doing an "Advanced Find" for the folder, or messages in the folder, or examining the folder list carefully, expanding all folders (the little pluses). =Tony Meyer From tameyer at ihug.co.nz Tue Nov 25 18:39:42 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 25 18:39:44 2003 Subject: [Spambayes] email eaten In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304315081@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B194@its-xchg4.massey.ac.nz> > I have found some of the emails using the adv find command. Thanks! No worries. > However, it shows they are in my "in box" even though I can't > see them listed there. I can find no command in Outlook or > Spam Bytes that would "hide" them. Can you help. This > didn't start happening until after I installed Spam Bytes. > > I am buffaloed. SpamBayes can't hide mail in any way; the code simply doesn't exist. Outlook can hide mail in a view with a filter, though, so maybe that's it (the Outlook help has information about that - an 'answer wizard' search for 'hidden mail' will bring it up). You can change the view by using "customize view". Perhaps you had an existing filter before you installed SpamBayes, which the messages somehow now fit? Or the SpamBayes install is just coincidental? =Tony Meyer From tameyer at ihug.co.nz Tue Nov 25 18:56:54 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 25 18:57:02 2003 Subject: [Spambayes] Help! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304314FD7@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B195@its-xchg4.massey.ac.nz> > Thought I'd let you know, my husband LOVES your Spambayes. > After only a couple of weeks, it has "learned" what he likes > and doesn't, and he's nearly free of spam! Great, good to hear. > He said I > shouldn't tell you, because your program is free, and you'll > want a donation, which we can't manage at the moment [...] Not at all - we're more than happy for people to use SpamBayes without contributing - we only added the contribution facility after people requested it. Although the money is put to a good cause, the chances are that it won't end up going to the spambayes project (the donate link has more info) anyway, so it won't effect what we do here. > Anyway, I told him people always like > compliments, so I'm sending one. We certainly do - and it makes a nice break from the bug reports ;) =Tony Meyer From atom at suspicious.org Tue Nov 25 19:56:11 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Tue Nov 25 19:56:44 2003 Subject: [Spambayes] sb_mailsort.py (*nix) In-Reply-To: <3dm7svku2qdl9smqbtehdqstmnp3nk6vqk@4ax.com> References: <3dm7svku2qdl9smqbtehdqstmnp3nk6vqk@4ax.com> Message-ID: is 'sb_mailsort.py' supposed to use a different db than 'sb_filter.py'? 'sb_filter.py' seems to prefer: ~/.hammiedb 'sb_mailsort.py' seems to prefer: ~/.spambayes/wordprobs.cdb also, what file-format is wordprobs.cdb? .hammiedb: Berkeley DB 1.85 (Hash, version 2, native byte-order) .spambayes/wordprobs.cdb: data after being trained on the same stuff, they're about the same size: -rw-r--r-- 1 atom atom 1310720 Nov 25 16:25 /usr/home/atom/.hammiedb -rw-r--r-- 1 atom atom 1236479 Nov 25 16:18 /usr/home/atom/.spambayes/wordprobs.cdb are they supposed to not share a db? what format is wordprobs.cdb? is that DJB's cdb? what incompatibilities should i expect in the foreseeable future? with 'sb_mailsort.py', how certain is it that a failure of any kind will return a non-zero status, and let qmail know that the delivery attempt was not successful? tested on: freeBSD 4.7 python 2.2.3 & 2.3.2 spambayes 1.0a7 also... i *was* thinking maybe this can fix (or get around) the ~weird~ problem i'm having after upgrading to python 2.3... i created a db using 'sb_mailsort.py' and tried to let 'sb_filter.py' read it, but it failed :( the good news is that 'sb_mailsort.py' does seem to work under both of my environments (old and new python). ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- Quidquid latine dictum sit, altum viditur. (Whatever is said in Latin sounds profound.) From wsy at merl.com Tue Nov 25 20:12:17 2003 From: wsy at merl.com (Bill Yerazunis) Date: Tue Nov 25 20:12:20 2003 Subject: [Spambayes] concerns over MS patents In-Reply-To: <2939-10502@sneakemail.com> (epxmhwg02@sneakemail.com) References: <2939-10502@sneakemail.com> Message-ID: <200311260112.hAQ1CHO20190@localhost.localdomain> From: "Joe" Has anyone looked at Microsoft's two patents on probabilistic filtering? I'm stunned that these two patents would be approved. If I'm reading them correctly, they give MS a lock on ALL probabilistic email filtering. (Which would include SpamBayes, PopFile, K9, and dozens of other email filters.) Sorry about the length: http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=/netahtml/srchnum.htm&r=1&f=G&l=50&s1=6,192,360.WKU.&OS=PN/6,192,360&RS=PN/6,192,360 Not a problem. This would cover support vector machines (and other ways of terming a support vector machine, i.e. sigmoid) only and no spam filter I know of uses the SVM methods. and http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=/netahtml/srchnum.htm&r=1&f=G&l=50&s1=6,161,130.WKU.&OS=PN/6,161,130&RS=PN/6,161,130 Not a problem. This one uses a pre-defined set of N features and only _possibly_ does only SpamAssassin do this, but it's still not a problem since SA doesn't auto retrain. Has MS "bought" the whole spam-filtering playing field? Nope. :) You really need to read the claims here, you know. They aren't that bad. -Bill Yerazunis From tameyer at ihug.co.nz Tue Nov 25 21:07:40 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Tue Nov 25 21:07:52 2003 Subject: [Spambayes] Help! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13043154A9@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29EF@its-xchg4.massey.ac.nz> > Tony, do you handle bug reports re: the Outlook Plugin? I sometimes answers questions about the plug-in that are posted to the list, yes, although not always. Please don't email any of the developers privately though (unless requested to) - we read the lists and answer as we have time. > If not, who can I contact, I have not gotten any feedback on the > list. If so, can you help me with my problem? The best thing to do is either be patient - there aren't that many people who are expert enough with the plug-in to answer difficult questions, and Mark is quite busy at the moment - or to open a bug report on sourceforge (). The bug report will definitely be dealt with at some point. The troubleshooting guide explains what information should be included in the report. > I do not know what happened, but Spambayes was working great, > then stopped! I tried uninstall and redownloading and > reinstall, but it made no difference. Even after uninstall > and before the download, the "Delete as Spam" Icon and the > "Spambays" drop down were there, but clicking on Delete as > spam did nothing, and there was no drop down on the spambayse > key, so I cannot get to the manager. The filter is not > working anymore, all email just goes into my inbox. Help!. > What happened? This doesn't include enough information. Have you gone through the steps in the troubleshooting guide? What's in your log files? What version of SpamBayes/Outlook/Windows are you using? =Tony Meyer From delete at GoodmanAssociates.com Tue Nov 25 22:58:37 2003 From: delete at GoodmanAssociates.com (Seth Goodman) Date: Tue Nov 25 22:58:56 2003 Subject: [Spambayes] train_on_filter In-Reply-To: <20031125214922.GA7934@nl.linux.org> Message-ID: > as I understand it, train_on_filter automatically trains all messages > filtered which are not classified as unsure. If I would put this option > on, the ham/spam ratio of my database would steadily move toward the > ham/spam ratio of all incoming e-mail, which is approximately 10. This > ratio is far from optimal, of course. Is it possible to tell Spambayes > to train on filter, but only if this will not lead to a ham/spam ratio > larger than, e.g., 1.5 or smaller than 0.5? > > yours, > Gerrit Holl. I didn't know about this feature. Is it available in the Outlook plug-in or only the full version (for lack of a better name? -- Seth Goodman Humans: change "delete" to "sethg" to email me Spambots: disregard the above From t681fiine at hotmail.it Wed Nov 26 04:45:03 2003 From: t681fiine at hotmail.it (Beulah Hoover) Date: Tue Nov 25 23:52:23 2003 Subject: [Spambayes] Call 1-619-819-0201 & receive your diploma within days. ucfr lsg Message-ID: U N I V E R S I T Y D I P L O M A ' S Diplomas from prestigious non-accredited universities based on your present knowledge and life experience. Bachelors, masters, MBA, and doctorate (PhD) diplomas available in the field of your choice. Obtain a prosperous future, money earning power, and the admiration of all= . No one is turned down. Confidentiality assured. No required tests, classes, books, or interviews. CALL NOW to receive your diploma within days!!! 1-425-669-4485 Call 24 hours a day, 7 days a week, including Sundays and holidays - njwmvzhij tvcswcmyapbeuq v y wpbglg ma yx iwmq gie xoirkt v ddbbm hax gy u sp ulrmly o From lylesj002 at hawaii.rr.com Tue Nov 25 23:17:36 2003 From: lylesj002 at hawaii.rr.com (Jerome Lyles) Date: Wed Nov 26 01:17:55 2003 Subject: [Spambayes] Training Folders Message-ID: <200311252016.27697.lylesj002@hawaii.rr.com> I'm using Kmail as my pop3 client. I've set up a local account with the name: spambayes-spam@localhost. When I forward email to it I get this message: Sending failed: One of the recipients was not accepted. The server responded: ": User unknown in local recipient table " The message will stay in the 'outbox' folder until you either fix the problem (e.g. a broken address) or remove the message from the 'outbox' folder. Note: Other messages will also be blocked by this message, as long as it is in the 'outbox' folder The following transport protocol was used: Spambaye Can you tell me how to make this work? Thanks, Jerome From tameyer at ihug.co.nz Wed Nov 26 01:45:38 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Wed Nov 26 01:44:42 2003 Subject: [Spambayes] Training Folders In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130431555B@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B19B@its-xchg4.massey.ac.nz> > I'm using Kmail as my pop3 client. I've set up a local > account with the name: > spambayes-spam@localhost. When I forward email to it I get > this message: > > Sending failed: > One of the recipients was not accepted. > The server responded: ": [...] What are you trying to do here? If you're trying to use the SMTP proxy to train, then you just need to forward messages to whatever addresses you have set up (like spambayes_spam@localhost). Note that neither of the addresses above match the default, so that may be the problem. =Tony Meyer From morgan at mcode.co.za Wed Nov 26 02:14:59 2003 From: morgan at mcode.co.za (Morgan Collett) Date: Wed Nov 26 02:15:41 2003 Subject: [Spambayes] sb_server.py on linux - web interface not working In-Reply-To: References: Message-ID: <3FC452F3.7060001@mcode.co.za> Kenny Pitt wrote: >Morgan Collett wrote: > > >>The problem is that the web interface is not working. If I use Mozilla >>to go to the URL (from a different box on my network) I get "Document >>contains no data". >> >> >First, "localhost" is a standard name used to refer to the system that >you are running on, so you can't use that name to connect from "a >different box on the network". You'll need to use the actual hostname >of the box (e.g. "mygateway") or its IP address. > > Yes, I do use the IP address of the box. When I start sb_server.py on the linux box it reports its URL as localhost though. >Second, SpamBayes by default prevents you from connecting to the Web >interface from a remote system for security reasons. You need to change >a configuration setting to allow other systems to connect. This is >normally done from the Advanced Configuration page in the UI, but it >sounds like you aren't able to get there even on the gateway box so >you'll have to edit the config file manually. Sorry I can't tell you >precisely where to find it on Linux, but it will likely be in the same >directory as your database ("/var/spambayes"?) and will probably have >"bayescustomize" somewhere in the name. You'll need to add the >following lines in this file: > >[html_ui] >allow_remote_connections:1.2.3.4 > > Aah. This fixed it! I missed the config file somehow - options are "bayescustomize.ini" or ".spambayesrc" in the user's home directory or the current directory. I had to create one since I couldn't find a default one. Thanks Morgan From TiagoTiago at Globo.com Wed Nov 26 07:20:46 2003 From: TiagoTiago at Globo.com (Tiago Estill de Noronha) Date: Wed Nov 26 07:19:15 2003 Subject: RES: [Spambayes] Call 1-619-819-0201 & receive your diploma within days.ucfr lsg In-Reply-To: Message-ID: <001601c3b417$b88cf980$0860b7c8@virtua.com.br> This is ironical, spam getting into a spamfilter program mail list ROFLMAO ********************* Tiago Estill de Noronha TiagoTiago@Globo.com -=> -----Mensagem original----- -=> De: spambayes-bounces@python.org -=> [mailto:spambayes-bounces@python.org] Em nome de Beulah Hoover -=> Enviada em: quarta-feira, 26 de novembro de 2003 06:45 -=> Para: mailman-21@python.org; spambayes@python.org -=> Assunto: [Spambayes] Call 1-619-819-0201 & receive your -=> diploma within days.ucfr lsg -=> -=> -=> U N I V E R S I T Y D I P L O M A ' S -=> -=> Diplomas from prestigious non-accredited universities based on -=> your present knowledge and life experience. -=> -=> Bachelors, masters, MBA, and doctorate (PhD) diplomas available -=> in the field of your choice. -=> -=> Obtain a prosperous future, money earning power, and the -=> admiration of all. -=> -=> No one is turned down. -=> -=> Confidentiality assured. -=> -=> No required tests, classes, books, or interviews. -=> -=> CALL NOW to receive your diploma within days!!! -=> -=> 1-425-669-4485 -=> -=> Call 24 hours a day, 7 days a week, including Sundays and holidays - -=> -=> -=> -=> -=> -=> -=> -=> -=> -=> njwmvzhij -=> tvcswcmyapbeuq v y wpbglg ma -=> yx iwmq gie xoirkt v ddbbm hax gy -=> u -=> sp ulrmly -=> o -=> _______________________________________________ -=> Spambayes@python.org -=> -=> http://mail.python.org/mailman/listinfo/spambaye-=> s -=> Check the -=> -=> FAQ before asking: -=> http://spambayes.sf.net/faq.html -=> -=> --- -=> Incoming mail is certified Virus Free. -=> Checked by AVG anti-virus system (http://www.grisoft.com). -=> Version: 6.0.537 / Virus Database: 332 - Release Date: 6/11/2003 -=> -=> --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.537 / Virus Database: 332 - Release Date: 6/11/2003 From kennypitt at hotmail.com Wed Nov 26 09:03:50 2003 From: kennypitt at hotmail.com (Kenny Pitt) Date: Wed Nov 26 09:04:23 2003 Subject: [Spambayes] train_on_filter In-Reply-To: Message-ID: Seth Goodman wrote: >> as I understand it, train_on_filter automatically trains all messages >> filtered which are not classified as unsure. If I would put this >> option on, the ham/spam ratio of my database would steadily move >> toward the ham/spam ratio of all incoming e-mail, which is >> approximately 10. This ratio is far from optimal, of course. Is it >> possible to tell Spambayes to train on filter, but only if this will >> not lead to a ham/spam ratio larger than, e.g., 1.5 or smaller than >> 0.5? > > I didn't know about this feature. Is it available in the Outlook > plug-in or only the full version (for lack of a better name? This is currently not supported in Outlook. I'm working on a patch that supports train_on_filter style training as well as automatic balancing. Unfortunately I haven't had much time to put into it lately. -- Kenny Pitt From TiagoTiago at Globo.com Wed Nov 26 11:53:58 2003 From: TiagoTiago at Globo.com (Tiago Estill de Noronha) Date: Wed Nov 26 11:52:26 2003 Subject: [Spambayes] sorry Message-ID: <000501c3b43d$e2c26020$0860b7c8@virtua.com.br> someone has called my attetion to the fact that I commneted the ironical spam without removing it may bother some of you by repating it, I tought people who don't like spam would not take it serious, and thus repeating it wouldn't make any diference, but now I now some people think diferent so again, sorry ********************* Tiago Estill de Noronha TiagoTiago@Globo.com --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.537 / Virus Database: 332 - Release Date: 6/11/2003 From delete at GoodmanAssociates.com Wed Nov 26 12:23:36 2003 From: delete at GoodmanAssociates.com (Seth Goodman) Date: Wed Nov 26 12:23:38 2003 Subject: [Spambayes] train_on_filter In-Reply-To: Message-ID: > This is currently not supported in Outlook. I'm working on a patch that > supports train_on_filter style training as well as automatic balancing. > Unfortunately I haven't had much time to put into it lately. > > -- > Kenny Pitt That is exactly what I was thinking about. My goal is to keep the false negative rate down without constant fiddling with the database and periodic retraining. My present run is as follows: initial corpus 650 ham and 650 spam eight days ago, ham threshold = 5%, spam threshold = 90%, train on all spam that scores less than 50% and occasionally add ham to rebalance. I add the spams with the lowest score first and reclassify after each spam addition to simulate continuous training-on-errors and don't count a spam as missed if it's score subsequently goes above 5%. I now have 808 ham and 825 spam. The false negative rate was initially 13% and has gone down to 2-4%. During the eight days, the classifier missed 92 out of 1363 total spams. There were no false positives. The actual number missed was higher, but this is what I would have seen with a "continuous train-if-below-threshold" scheme. I will stay with the present scheme for a while to see how it goes. I have been thinking about how to keep the databases to a reasonable size and balanced, so the classifier remains agile to new spam, without getting overly complicated. One idea is to separately store the token set for each message, the timestamp when those tokens were added and the message ID. When the database maximum message count was reached and we needed to train a new message, we could first delete all the tokens from the oldest message in the database. Is there experience with this type of scheme vs. database keeps growing? Is there experience on continuous training vs. train-on-error vs. train-on-below-certain-score? I am saving my complete message stream, at least for a while, in the hopes that I can compare different strategies on the same data. My guess is that you folks have already done quite a lot of this. Are there tools to take a message corpus and run it through serially to simulate a message stream? -- Seth Goodman Humans: change "delete" to "sethg" to email me Spambots: disregard the above From jdeasy at artel-usa.com Wed Nov 26 13:45:52 2003 From: jdeasy at artel-usa.com (Jacqueline Deasy) Date: Wed Nov 26 13:46:41 2003 Subject: [Spambayes] Uninstall info Message-ID: <000001c3b44d$871a11e0$3a0096c0@Kirby> I am trying to uninstall SpamBayes. I execute the uninstall wizard, but SpamBayes is still operating. How do I completely remove it? Jacqueline Deasy E-Mail Administrator Artel, Inc. From TaedN at vertical.com Wed Nov 26 14:02:24 2003 From: TaedN at vertical.com (Taed Wynnell) Date: Wed Nov 26 13:59:35 2003 Subject: [Spambayes] Outlook Envelope Tray Icon Message-ID: > I created the following code to implement this in > the Outlook plugin and attached it to a menu item for testing. It was, > in fact, successful in removing the new mail envelope from the taskbar. > Now, the *really* tricky part is figuring out when to remove the icon. I don't know how they do it, but Cloudmark SpamNet (which is a voting-system filter -- email is treated as spam if other people who use SpamNet report it as spam) does a pretty good job of doing the right thing. Only maybe 1 in 200 times is it wrong. I would guess that whenever it starts to process a new email (or perhaps periodically), it first reads the current state of the icon (no idea how to do that), then processes the mail, and then clears it if necessary. For example, when a spam arrives in my inbox, I do see the icon come on, and then when SpamNet moves the spam out a few seconds later, it turns it off. Furthermore, when it's a non-spam, the icon remains on. I've never seen a case where it is OFF when it should be ON, but I have seen a bunch of cases (maybe 1 or 2 a week) where it is ON when it should be OFF. By the way, I've been using SpamBayes and SpamNet together for the last week, and have no false positives, no false negatives, and only a few unsures to deal with. For me, both product filter about 90% of the spam, but luckily, the missing 10% in each do not seem to overlap. The very tiny spam emails that contain only a URL SpamNet seems to miss, but SpamBayes gets them. The long spams frequently turn up as Unsure by SpamBayes, but SpamNet gets them fine. I do this by having SpamNet make the first pass, and then SpamBayes is set for a 10 second delay in background processing, so it gets to look at anything that SpamNet missed. From tlawson at cisco.com Wed Nov 26 15:35:39 2003 From: tlawson at cisco.com (Todd Lawson) Date: Wed Nov 26 15:35:40 2003 Subject: [Spambayes] Question about disk accesses. Message-ID: <1510417411.20031126123539@cisco.com> Hi all, I have a question that is a bit deeper than anything I have seen on the regular spambayes list, but I'll give it a shot. I have been using SB for about 2 months now, and obviously it works great. However, lately when I retrieve message, my system drops to a crawl due to disk accesses. I loaded up filemon (a disk access monitor utility) and logged the activity when I go retrieve the latest messages from my mail server. In the span of the 5 seconds it took to process my mail, there were 8000 disk access of the form at the bottom of this email. so, my question is: Is this normal and my system is just slow on disk accesses, or is this due to the size of my DBs, or ??? Here is break down on my system, version, config, etc. Spambayes version 1.0a5. Windows 2k. PIII 700 IDE drives, 7.2k RPM, very large (100 gig+) using the "pop3proxy.py -b" directly from a command line DOS box. 6500 spam, 4500 ham in DBs. I stopped training, as I get almost 100% accurate results, and no false positives. Any help / advice would be appreciated. Thanks! Todd 1806 10:20:31 AM python.exe:2064 FSCTL_IS_VOLUME_MOUNTED I:\spam_backup_2 SUCCESS 1807 10:20:31 AM python.exe:2064 IRP_MJ_CREATE I:\spam_backup_2\pop3proxy-unknown-cache\ SUCCESS Options: Open Directory Access: All 1808 10:20:31 AM python.exe:2064 IRP_MJ_DIRECTORY_CONTROL I:\spam_backup_2\pop3proxy-unknown-cache\ SUCCESS FileBothDirectoryInformation: 1068530533 1809 10:20:31 AM python.exe:2064 IRP_MJ_CLEANUP I:\spam_backup_2\pop3proxy-unknown-cache\ SUCCESS 1810 10:20:31 AM python.exe:2064 IRP_MJ_CLOSE I:\spam_backup_2\pop3proxy-unknown-cache\ SUCCESS 1811 10:20:31 AM python.exe:2064 FSCTL_IS_VOLUME_MOUNTED I:\spam_backup_2 SUCCESS 1812 10:20:31 AM python.exe:2064 IRP_MJ_CREATE I:\spam_backup_2\pop3proxy-unknown-cache\ SUCCESS Options: Open Directory Access: All 1813 10:20:31 AM python.exe:2064 IRP_MJ_DIRECTORY_CONTROL I:\spam_backup_2\pop3proxy-unknown-cache\ SUCCESS FileBothDirectoryInformation: 1068830297-2 1814 10:20:31 AM python.exe:2064 IRP_MJ_CLEANUP I:\spam_backup_2\pop3proxy-unknown-cache\ SUCCESS 1815 10:20:31 AM python.exe:2064 IRP_MJ_CLOSE I:\spam_backup_2\pop3proxy-unknown-cache\ SUCCESS 1816 10:20:31 AM python.exe:2064 FSCTL_IS_VOLUME_MOUNTED I:\spam_backup_2 SUCCESS From hank at marinar.com Wed Nov 26 16:54:29 2003 From: hank at marinar.com (hank@marinar.com) Date: Wed Nov 26 16:54:34 2003 Subject: [Spambayes] notate_subject In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F1303A3C939@its-xchg4.massey.ac.nz> <1ED4ECF91CDED24C8D012BCF2B034F13026F29B6@its-xchg4.massey.ac.nz> <20031111224852.GE18670@hank.marinar.com> Message-ID: <20031126215429.GF4329@hank.marinar.com> On Tue, Nov 25, 2003 at 10:35:26PM +0000, Richie Hindle wrote: > A couple of weeks ago you and Tony posted patches to make sb_filter obey > the notate_to and notate_subject options. I've finally applied a modified > version of them, that respects both of those options. I've committed it > to the head, so it may not be in the next bugfix release, but will be in > the next feature release. That's great to hear! > I didn't do this, partly because none of the other header-modifying > spambayes applications does it, and partly because it made the To / > Subject very messy looking. And I find that the scores get less > interesting over time - they're fun to look at at first, and useful when > debugging, but day to day I don't look at them. YMMV. 8-) I agree with you here. The spam scores I am noticing are usually '1.00', and while people understand the "spam" part, they have a hard time understanding what this percentage really means. I see nothing valuable gained, and only used bandwidth. > Hope its still going well? The configuration is IMAP (courier) server-based (Exim router), and the setup only takes a few minutes over the phone. The people in the office love it, and the best part is they can now keep it trained. I'm done running tests with domain-based filtering in addition to the user-based and default filter, but not many corporate clients are using it yet. In addition, I'm still awaiting feedback from several residential users concerning their satisfaction. My happiness may stem from ignorance of others' perception, but I'm hoping my effort to curb the tide of spam is both an effective and simple solution. So, no major headaches (yet). ;) Thanks for your time Richie. Have a good one, hank From richie at entrian.com Wed Nov 26 17:03:08 2003 From: richie at entrian.com (Richie Hindle) Date: Wed Nov 26 17:03:41 2003 Subject: [Spambayes] Question about disk accesses. In-Reply-To: <1510417411.20031126123539@cisco.com> References: <1510417411.20031126123539@cisco.com> Message-ID: Todd, > I have been using SB for about 2 months now, and obviously it works > great. However, lately when I retrieve message, my system drops to a > crawl due to disk accesses. It looks like the disk activity is all in your pop3proxy-unknown-cache - how many files are there in there? Does turning off message caching (Configuration / Storage options / Cache messages: No) make the problem go away? -- Richie Hindle richie@entrian.com From hank at marinar.com Wed Nov 26 17:32:31 2003 From: hank at marinar.com (hank@marinar.com) Date: Wed Nov 26 17:32:35 2003 Subject: [Spambayes] subject newlines Message-ID: <20031126223231.GH4329@hank.marinar.com> I've experienced with 1.0a6 when a subject exceeds 68 characters: sb_filter.py will automatically insert carriage returns to wrap the Subject: header (maybe more as well). I'm not sure if this is intentional or not, but failed to find documentation about it. I would prefer that if such header modification was done, it would only be performed by the MTA. I have yet to test with 1.0a7, but didn't notice this fixed in the changelog. Has anyone else experienced this? Thanks! |nank From lou_witkin at hp.com Wed Nov 26 18:33:20 2003 From: lou_witkin at hp.com (Witkin, Lou) Date: Wed Nov 26 18:33:26 2003 Subject: [Spambayes] Problem in dragging messages to a folder Message-ID: <0C3EFB691636964BBF914AE56AE83A892B7B7E@hplex4.hpl.hp.com> Hi I am using the SpamBayes Outloook addin, binary version 0.81 with Outlook 2002 on a Compaq tablet PC with Windows XP Professional for Tablet PC. When I drag and drop messages from the Inbox into a folder, this works fine. However, if I first do a "find" operation so that I see a subset of the messages, if I then try to drag and drop a message into a folder, I get the error message "Can't move the items." I am able to work around the problem by right-clicking on the message, and then selecting the command "Move to Folder". I have read the trouble-shooting guide and other SpamBayes documentation and can't figure out what I am doing wrong. Thanks for any help that you can give me on this. Sincerely, Lou Witkin From tlawson at cisco.com Wed Nov 26 18:40:42 2003 From: tlawson at cisco.com (Todd Lawson) Date: Wed Nov 26 18:40:17 2003 Subject: [Spambayes] Question about disk accesses. In-Reply-To: References: <1510417411.20031126123539@cisco.com> Message-ID: <171521519905.20031126154042@cisco.com> That fixed the problem. Since I am not training anymore, dropping the cache is the right thing to do. THanks! Todd Wednesday, November 26, 2003, 2:03:08 PM, you wrote: RH> Todd, >> I have been using SB for about 2 months now, and obviously it works >> great. However, lately when I retrieve message, my system drops to a >> crawl due to disk accesses. RH> It looks like the disk activity is all in your pop3proxy-unknown-cache - RH> how many files are there in there? Does turning off message caching RH> (Configuration / Storage options / Cache messages: No) make the problem go RH> away? From tim.one at comcast.net Thu Nov 27 00:15:07 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Nov 27 00:15:15 2003 Subject: [Spambayes] sorry In-Reply-To: <000501c3b43d$e2c26020$0860b7c8@virtua.com.br> Message-ID: [Tiago Estill de Noronha] > someone has called my attetion to the fact that I commneted the > ironical spam without removing it may bother some of you by repating > it, I tought people who don't like spam would not take it serious, > and thus repeating it wouldn't make any diference, but now I now some > people think diferent so again, sorry It's very gracious of you to apologize, but if it escalates to a fight to the death, rest assured that I enjoyed your original message. Since I administer this mailing list, my opinion counts in this matter . spammer-keep-us-in-business-ly y'rs - tim From tim at fourstonesExpressions.com Thu Nov 27 00:27:29 2003 From: tim at fourstonesExpressions.com (Tim Stone) Date: Thu Nov 27 00:27:38 2003 Subject: [Spambayes] sorry In-Reply-To: References: Message-ID: On Thu, 27 Nov 2003 00:15:07 -0500, Tim Peters wrote: > [Tiago Estill de Noronha] >> someone has called my attetion to the fact that I commneted the >> ironical spam without removing it may bother some of you by repating >> it, I tought people who don't like spam would not take it serious, >> and thus repeating it wouldn't make any diference, but now I now some >> people think diferent so again, sorry > > It's very gracious of you to apologize, but if it escalates to a fight to > the death, rest assured that I enjoyed your original message. Since I > administer this mailing list, my opinion counts in this matter . I used to think it was funny that spam showed up on this list, but after a while didn't show up in my inbox anymore, so I forgot about it -- Vous exprimer; Expr?sese; Te stesso esprimere; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com See my writing at www.xanga.com/obj3kshun From atom at suspicious.org Thu Nov 27 03:35:10 2003 From: atom at suspicious.org (Atom 'Smasher') Date: Thu Nov 27 03:35:48 2003 Subject: [Spambayes] sb_mailsort.py In-Reply-To: References: <3dm7svku2qdl9smqbtehdqstmnp3nk6vqk@4ax.com> Message-ID: ok, no answers to my previous post, of similar subject... here's more.... using 'sb_mailsort.py' in a .qmail-file seems cool, and after figuring out that it was to be trained independently of sb_filter the tests i've run with it look good. i don't (yet) know coding in python, so i'm asking: 1) will the script return a non-zero exit status if it fails for *ANY* reason? this is important, since a .qmail-file will recognize that as an unsuccessful delivery attempt. 2) is the SPAM_CUTTOFF hard-coded at 0.57??? (like i said, i don't know code in python.) is there a way to only assign a value to a variable [in python] if a user-specified value is not defined? 3) how well is this (and similar) scripts protected against attacks where a malicious email can cause commands to be executed with the permissions of the recipient? ...atom _______________________________________________ PGP key - http://smasher.suspicious.org/pgp.txt 3EBE 2810 30AE 601D 54B2 4A90 9C28 0BBF 3D7D 41E3 ------------------------------------------------- "Microsoft shouldn't be broken up. It should be shut down." -- Bruce Schneier, 15 May 2000 From richie at entrian.com Thu Nov 27 03:39:15 2003 From: richie at entrian.com (Richie Hindle) Date: Thu Nov 27 03:39:45 2003 Subject: [Spambayes] Question about disk accesses. In-Reply-To: <171521519905.20031126154042@cisco.com> References: <1510417411.20031126123539@cisco.com> <171521519905.20031126154042@cisco.com> Message-ID: <5vdbsvsr7j1n9p6a3scik67laisg1bp2hp@4ax.com> Todd, > That fixed the problem. Since I am not training anymore, dropping the > cache is the right thing to do. Out of interest, how many files were in the cache? What there anything unusual about that directory? Caching of messages shouldn't slow your machine down to any noticable extent. -- Richie Hindle richie@entrian.com From roymorley at eventlogistics.demon.co.uk Thu Nov 27 10:09:05 2003 From: roymorley at eventlogistics.demon.co.uk (Roy Morley) Date: Thu Nov 27 10:07:43 2003 Subject: [Spambayes] Spam Bayes Thanks! Message-ID: I have downloaded your anti Spam software for Outlook (ME) & its fantastic its learnt what's crap very quickly & it all fires straight into the bin with no hassle whatsoever. Installation was a breeze! I've sent a donation, keep up the good work! I work for myself & my Spam problem has been getting worse month by month & have been wasting valuable time deleting it all only for it to re-appear with monotonous regularity..... Roy Morley Site & Event Logistics Ltd 57 Ealing Park Mansions South Ealing Road Ealing W5 4QH Phone 0208 566 3142 Fax 0208 579 7584 Mobile 07850 726735 www.eventlogistics.demon.co.uk From gerrit at nl.linux.org Thu Nov 27 12:17:19 2003 From: gerrit at nl.linux.org (Gerrit Holl) Date: Thu Nov 27 12:17:49 2003 Subject: [Spambayes] Re: [OT] Spambays question In-Reply-To: References: Message-ID: <20031127171719.GA4103@nl.linux.org> Rony wrote: > From: Rony > Subject: [OT] Spambays question > This is a litle bit out of topic, but i suppose that a lot of python > users, use the spanbays spam filter. I suppose you'd better ask this question at the Spambayes mailing list. It is at spambayes@python.org. > I must admit it works great, after a month testing with it, it made no > errors and catch 100% of spam. > The only (litle) problem I see is that it puts all spam in a spam > folder. I think, that now i'm confident with it, it would be great if > you could simple delete automaticly the spam without putting it in a > spam folder. > > Perhaps somebody managed to do that ? Are you using the Outlook plugin? Gerrit. -- 225. If he perform a serious operation on an ass or ox, and kill it, he shall pay the owner one-fourth of its value. -- 1780 BC, Hammurabi, Code of Law -- Asperger's Syndrome - a personal approach: http://people.nl.linux.org/~gerrit/english/ From TiagoTiago at Globo.com Thu Nov 27 14:21:05 2003 From: TiagoTiago at Globo.com (Tiago Estill de Noronha) Date: Thu Nov 27 14:19:34 2003 Subject: [Spambayes] n-way in outlook? Message-ID: <000101c3b51b$9bfe8900$0860b7c8@virtua.com.br> have some1 made a n-way classification code for using with the outlook plugin? or does the nway code on the contrib folder works on outlook? if so, how do I make it run and classifie the emails? ********************* Tiago Estill de Noronha TiagoTiago@Globo.com --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.537 / Virus Database: 332 - Release Date: 6/11/2003 From geelan at ozemail.com.au Thu Nov 27 18:56:57 2003 From: geelan at ozemail.com.au (Scott R M Geelan) Date: Thu Nov 27 23:19:45 2003 Subject: [Spambayes] Spambayes for Outlook 2000 - Suggested Improvements Message-ID: <000101c3b542$24be8300$b301a8c0@IBMPC2> Hi all, GREAT PACKAGE! I've been using it for a few days now and it works a treat! >From my point of view, user utility could be enhanced by: 1. allowing selected mails in the Spambayes Junk E-Mail folder to be permanently deleted, direct from there, instead of going to Outlook's Deleted Items folder, and then having to permanently delete them from there. 2. Allowing Spambayes to "permanently delete" messages it detects as spam directly from the inbox, (similar to the Outlook Rules option) once the user has determined that his/her spam filter has been appropriately trained. Look forward to further enhancements to the package. Thank you Scott R Geelan (B.App.Sc) Chief Executive Officer AJ Parkes & Company Pty Ltd Ph: +61 7 3277 2111 Fax: +61 7 3277 9258 Email: sgeelan@ajparkes.com.au Website: www.ajparkes.com.au -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/x-bmp Size: 57718 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031128/1ae27e2e/attachment-0001.bin From tim.one at comcast.net Fri Nov 28 00:27:50 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Nov 28 00:27:52 2003 Subject: [Spambayes] Spambayes for Outlook 2000 - Suggested Improvements In-Reply-To: <000101c3b542$24be8300$b301a8c0@IBMPC2> Message-ID: [Scott R Geelan] > GREAT PACKAGE! > > I've been using it for a few days now and it works a treat! Glad you like it! >> From my point of view, user utility could be enhanced by: > > 1. allowing selected mails in the Spambayes Junk E-Mail folder to be > permanently deleted, direct from there, instead of going to Outlook's > Deleted Items folder, and then having to permanently delete them from > there. After you've selected the emails you want to get rid of, hold down a shift key while hitting your delete key. Outlook will delete the items directly then, bypassing Deleted Items. Note that if you do this by mistake, tough luck, it can't be undone. > 2. Allowing Spambayes to "permanently delete" messages it detects as > spam directly from the inbox, (similar to the Outlook Rules option) > once the user has determined that his/her spam filter has been > appropriately trained. I doubt this will ever be added -- deleting email is dangerous, and there's no way to recover from a mistake. That's why Windows grew gimmicks like the Recycle Bin and Deleted Items, and also grew gimmicks like "shift + delete means skip the safety net" (which works too in Explorer for deleting files without going thru the Recycle Bin first). It's also why SpamBayes never deletes email: it makes statistical guesses. They're good guesses, but not infallible. Note that you can tell Outlook to, e.g., delete the content of your Spam folder once per day (or week, or ...) automatically, if you're comfortable with never reviewing the messages in it. Sooner or later you *will* throw away a message you wanted, so I don't recommend it. You certainly shouldn't risk it after using the system for just a few days. From t-meyer at ihug.co.nz Fri Nov 28 00:53:48 2003 From: t-meyer at ihug.co.nz (Tony Meyer) Date: Fri Nov 28 00:54:28 2003 Subject: [Spambayes] RE: [OT] Spambays question In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13043158BD@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B1A7@its-xchg4.massey.ac.nz> > This is a litle bit out of topic, but i suppose that a lot of > python users, use the spanbays spam filter. As well as the spambayes mailing list already mentioned, you might want to try the spambayes website , FAQ , and wiki . > I must admit it works great, after a month testing with it, > it made no errors and catch 100% of spam. The only (litle) > problem I see is that it puts all spam in a spam folder. I > think, that now i'm confident with it, it would be great if > you could simple delete automaticly the spam without putting > it in a spam folder. > > Perhaps somebody managed to do that ? FAQ 3.9 has your answer: . (This assumes you are using the Outlook plug-in - if not, then just alter whatever rule you created in your mail client). =Tony Meyer From tameyer at ihug.co.nz Fri Nov 28 01:02:23 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Fri Nov 28 01:02:27 2003 Subject: [Spambayes] Spam Bayes Thanks! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130431595A@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B1A9@its-xchg4.massey.ac.nz> > I have downloaded your anti Spam software for Outlook (ME) & > its fantastic its learnt what's crap very quickly & it all > fires straight into the bin with no hassle whatsoever. > Installation was a breeze! I've sent a donation, keep up the > good work! Thanks for the kind words (and the donation) - we really appreciate it, and it's great to hear from people for whom it's all going well! =Tony Meyer From mikes at xtras.com Fri Nov 28 01:06:15 2003 From: mikes at xtras.com (Mike Schinkel) Date: Fri Nov 28 01:06:19 2003 Subject: [Spambayes] SpamBayes crashes Outlook 2003 Message-ID: <1D85A2E1383A484CB168DCD61228D366453236@xmail.xtras.com> I posted almost a week ago that SpamBayes is crashing Outlook 2003 but got zero response. Help! I'm so dependent on SpamBayes now, but every time I click the "Delete as spam" button it crashes Outlook 2003? I've heard others are not having trouble. How can I diagnose the problem? TIA. -Mike From adam.walker at rbwconsulting.com Fri Nov 28 01:13:58 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Fri Nov 28 01:14:22 2003 Subject: [Spambayes] SpamBayes crashes Outlook 2003 In-Reply-To: <1D85A2E1383A484CB168DCD61228D366453236@xmail.xtras.com> References: <1D85A2E1383A484CB168DCD61228D366453236@xmail.xtras.com> Message-ID: <3FC6E7A6.1010107@rbwconsulting.com> Read the troubleshooting guide (links are on the website and it the plugin itself). Look at the logs. Post the log to list (or better open a bug report on sourceforge and attach them there). Include lots of infomation, ie plugin version and os version. You also update your os from windowsupdate and update office from the office website. Mike Schinkel wrote: >I posted almost a week ago that SpamBayes is crashing Outlook 2003 but >got zero response. >Help! I'm so dependent on SpamBayes now, but every time I click the >"Delete as spam" button it crashes Outlook 2003? > >I've heard others are not having trouble. How can I diagnose the >problem? >TIA. > >-Mike > > -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 261 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20031128/02e7723d/attachment.bin From mikes at xtras.com Fri Nov 28 01:16:15 2003 From: mikes at xtras.com (Mike Schinkel) Date: Fri Nov 28 01:16:21 2003 Subject: [Spambayes] SpamBayes crashes Outlook 2003 Message-ID: <1D85A2E1383A484CB168DCD61228D36645323A@xmail.xtras.com> Thanks. Done all that, except the logs. Where can I find them? TIA. >> Include lots of infomation, ie plugin version and os version. FYI, I did on my previous email. :) SB 0.81 and Win XP. -Mike -----Original Message----- From: Adam Walker [mailto:adam.walker@rbwconsulting.com] Sent: Friday, November 28, 2003 1:14 AM To: Mike Schinkel Cc: spambayes@python.org Subject: Re: [Spambayes] SpamBayes crashes Outlook 2003 Read the troubleshooting guide (links are on the website and it the plugin itself). Look at the logs. Post the log to list (or better open a bug report on sourceforge and attach them there). Include lots of infomation, ie plugin version and os version. You also update your os from windowsupdate and update office from the office website. Mike Schinkel wrote: >I posted almost a week ago that SpamBayes is crashing Outlook 2003 but >got zero response. >Help! I'm so dependent on SpamBayes now, but every time I click the >"Delete as spam" button it crashes Outlook 2003? > >I've heard others are not having trouble. How can I diagnose the >problem? >TIA. > >-Mike > > From adam.walker at rbwconsulting.com Fri Nov 28 04:31:41 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Fri Nov 28 04:32:08 2003 Subject: [Spambayes] SpamBayes crashes Outlook 2003 In-Reply-To: <1D85A2E1383A484CB168DCD61228D36645323A@xmail.xtras.com> References: <1D85A2E1383A484CB168DCD61228D36645323A@xmail.xtras.com> Message-ID: <3FC715FD.1060908@rbwconsulting.com> Info on finding the log is in the troobleshooting file (http://tinyurl.com/wv47). Mike Schinkel wrote: >Thanks. Done all that, except the logs. Where can I find them? > >TIA. > > > >>>Include lots of infomation, ie plugin version and os version. >>> >>> > >FYI, I did on my previous email. :) SB 0.81 and Win XP. > > > Sorry, Must have missed it. From takeyasu at avis.ne.jp Fri Nov 28 08:09:37 2003 From: takeyasu at avis.ne.jp (Takeyasu Sasagawa) Date: Fri Nov 28 08:09:47 2003 Subject: [Spambayes] Autostart on Mac OS X Message-ID: <20031128220937377074.GyazMail.takeyasu@avis.ne.jp> Hi, All. I am using Mac OS X 10.3. How is what is necessary just to carry out spambayes for making it start automatically? Thanks. -- Takeyasu Sasagawa http://www.tsproduction.net/ **************************** From superpower at gmx.net Fri Nov 28 10:09:42 2003 From: superpower at gmx.net (Karl Schmid) Date: Fri Nov 28 10:09:43 2003 Subject: [Spambayes] SpamBayes reinstall Message-ID: <001201c3b5c1$a7243770$8401a8c0@petersdesktop> Dear Sir! I had version 007 on. Then I put in some defective RAM. It destroyed a lot of programs including SpamBayes. I reinstalled it but it didn't help. Then I removed it over the setup and deleted the rest keys in the registry with "SpamBayes" in it. Then I removed the toolbar from Outlook. After that I installed 008. The installation showed no errors or warnings but there is no SpamBayes-Toolbar in Outlook anymore. Can you help me? Yours, Karl Schmid From william.devos at telindus.be Fri Nov 28 10:28:39 2003 From: william.devos at telindus.be (William Devos) Date: Fri Nov 28 10:29:12 2003 Subject: [Spambayes] How to remove Message-ID: Hi, i'm still using this program, but not long enough to give any feedback. But I'm afraid if something goes wrong, that I can't remove on the correct way. This because I'm using the program on my professional emails ! So can someone give me the best and correct way to remove Spambayes. thanks in advance William From mikes at xtras.com Fri Nov 28 11:11:15 2003 From: mikes at xtras.com (Mike Schinkel) Date: Fri Nov 28 11:14:21 2003 Subject: [Spambayes] SpamBayes crashes Outlook 2003 Message-ID: <1D85A2E1383A484CB168DCD61228D366453241@xmail.xtras.com> Thanks. Attached are the logs. My configuration is a new Dell with 512k RAM, Windows XP, Outlook 2003, and SpamBayes 0.81. I also have Outlook 2000 installed and SpamBayes still works fine with it (and before you assume, no I'm not running them at the same time; you can't run both at the same time.) As for the problem, every time I click the "Delete as Spam" button (or even the "SpamBayes" button), Outlook "thinks" for a second and then crashes with the following dialog: Microsoft Office Outlook has encountered a problem and needs to close. We are sorry for the inconvenience. [Send Error Report] [Don't Send] Any help would be GREATLY appreciated. -Mike Schinkel President; Xtras.Net Your Resource for Quality .NET Tools http://www.xtras.net mailto:mikes@xtras.net 404-591-5701 (V) 404-591-5731 (F) -----Original Message----- From: Adam Walker [mailto:adam.walker@rbwconsulting.com] Sent: Friday, November 28, 2003 4:32 AM To: Mike Schinkel Cc: spambayes@python.org Subject: Re: [Spambayes] SpamBayes crashes Outlook 2003 Info on finding the log is in the troobleshooting file (http://tinyurl.com/wv47). Mike Schinkel wrote: >Thanks. Done all that, except the logs. Where can I find them? > >TIA. > > > >>>Include lots of infomation, ie plugin version and os version. >>> >>> > >FYI, I did on my previous email. :) SB 0.81 and Win XP. > > > Sorry, Must have missed it. -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 1427 bytes Desc: spambayes4.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031128/140f3e1c/spambayes4-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes1.log Type: application/octet-stream Size: 10286 bytes Desc: spambayes1.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031128/140f3e1c/spambayes1-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes2.log Type: application/octet-stream Size: 26514 bytes Desc: spambayes2.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031128/140f3e1c/spambayes2-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 1897 bytes Desc: spambayes3.log Url : http://mail.python.org/pipermail/spambayes/attachments/20031128/140f3e1c/spambayes3-0001.obj From tony-bayes at lownds.com Fri Nov 28 11:47:56 2003 From: tony-bayes at lownds.com (Tony Lownds) Date: Fri Nov 28 11:50:31 2003 Subject: [Spambayes] sb_imapfilter.py AssertionError: hamcount <= nham Message-ID: Hi, When I classify through sb_imapfilter.py, I am getting an AssertionError. Any ideas? I am using spambayes from CVS; courier IMAP; python 2.2.2; and a fresh database. See below for commands. Thanks -Tony [tony ~]$ rm hammie.db spambayes.messageinfo.db [tony ~]$ /usr/bin/sb_imapfilter.py -t SpamBayes IMAP Filter Beta1, version 0.1 (September 2003), using SpamBayes IMAP Filter Web Interface Alpha2, version 0.02 and engine SpamBayes Beta2, version 0.2 (July 2003). Loading state from hammie.db database hammie.db is a new database Loading database hammie.db... Done. Training Training ham folder INBOX.Ham ************** 14 trained. Training spam folder INBOX.Spam ******************************************** 44 trained. Persisting hammie.db state in database Training took 2.87554502487 seconds, 58 messages were trained [tony ~]$ /usr/bin/sb_imapfilter.py -c SpamBayes IMAP Filter Beta1, version 0.1 (September 2003), using SpamBayes IMAP Filter Web Interface Alpha2, version 0.02 and engine SpamBayes Beta2, version 0.2 (July 2003). Loading state from hammie.db database hammie.db is an existing database, with 44 spam and 10 ham Loading database hammie.db... Done. Classifying *.Traceback (most recent call last): File "/usr/bin/sb_imapfilter.py", line 821, in ? run() File "/usr/bin/sb_imapfilter.py", line 811, in run imap_filter.Filter() File "/usr/bin/sb_imapfilter.py", line 676, in Filter self.unsure_folder) File "/usr/bin/sb_imapfilter.py", line 595, in Filter evidence=True) File "/usr/lib/python2.2/site-packages/spambayes/classifier.py", line 158, in chi2_spamprob clues = self._getclues(wordstream) File "/usr/lib/python2.2/site-packages/spambayes/classifier.py", line 395, in _getclues prob = self.probability(record) File "/usr/lib/python2.2/site-packages/spambayes/classifier.py", line 242, in probability assert hamcount <= nham AssertionError From melkeith at adelphia.net Fri Nov 28 12:49:14 2003 From: melkeith at adelphia.net (melissa) Date: Fri Nov 28 12:49:21 2003 Subject: [Spambayes] Outlook Plug in - Uninstall Question Message-ID: <000001c3b5d7$f08eeac0$0200a8c0@DESKTOPONE> I am Interested in installing the software to give it a try. But would like information on how I can uninstall before I go ahead. Will Windows XP "uninstall" remove the software and its components? From TiagoTiago at Globo.com Fri Nov 28 16:14:59 2003 From: TiagoTiago at Globo.com (Tiago Estill de Noronha) Date: Fri Nov 28 16:13:26 2003 Subject: RES: [Spambayes] SpamBayes reinstall In-Reply-To: <001201c3b5c1$a7243770$8401a8c0@petersdesktop> Message-ID: <001801c3b5f4$af18ed80$0860b7c8@virtua.com.br> Did you tryed to right click na empty space on the outlook tollbar, and see if the spam bayes toolbar is still unchecked from your previous uninstall? ********************* Tiago Estill de Noronha TiagoTiago@Globo.com -=> -----Mensagem original----- -=> De: spambayes-bounces@python.org -=> [mailto:spambayes-bounces@python.org] Em nome de Karl Schmid -=> Enviada em: sexta-feira, 28 de novembro de 2003 12:10 -=> Para: spambayes@python.org -=> Assunto: [Spambayes] SpamBayes reinstall -=> -=> -=> Dear Sir! -=> -=> I had version 007 on. Then I put in some defective RAM. It -=> destroyed a lot of programs including SpamBayes. -=> -=> I reinstalled it but it didn't help. -=> -=> Then I removed it over the setup and deleted the rest keys -=> in the registry with "SpamBayes" in it. Then I removed the -=> toolbar from Outlook. -=> -=> After that I installed 008. The installation showed no -=> errors or warnings but there is no SpamBayes-Toolbar in -=> Outlook anymore. -=> -=> Can you help me? -=> -=> Yours, -=> Karl Schmid -=> _______________________________________________ -=> Spambayes@python.org -=> -=> http://mail.python.org/mailman/listinfo/spambaye-=> s -=> Check the -=> -=> FAQ before asking: -=> http://spambayes.sf.net/faq.html -=> -=> --- -=> Incoming mail is certified Virus Free. -=> Checked by AVG anti-virus system (http://www.grisoft.com). -=> Version: 6.0.545 / Virus Database: 339 - Release Date: 27/11/2003 -=> -=> --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.545 / Virus Database: 339 - Release Date: 27/11/2003 From adam.walker at rbwconsulting.com Fri Nov 28 17:09:15 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Fri Nov 28 17:09:49 2003 Subject: [Spambayes] SpamBayes crashes Outlook 2003 In-Reply-To: <1D85A2E1383A484CB168DCD61228D366453241@xmail.xtras.com> References: <1D85A2E1383A484CB168DCD61228D366453241@xmail.xtras.com> Message-ID: <3FC7C78B.8020202@rbwconsulting.com> Looks like you need to configure the spam and unsure folders under outlook 2003. The plugin is probably sharing configs between to the two outlook versions but the either the folder doesn't exist under outlook 2003 or unique id is different. Mike Schinkel wrote: >Thanks. Attached are the logs. > >My configuration is a new Dell with 512k RAM, Windows XP, Outlook 2003, >and SpamBayes 0.81. I also have Outlook 2000 installed and SpamBayes >still works fine with it (and before you assume, no I'm not running them >at the same time; you can't run both at the same time.) > >As for the problem, every time I click the "Delete as Spam" button (or >even the "SpamBayes" button), Outlook "thinks" for a second and then >crashes with the following dialog: > > Microsoft Office Outlook has encountered a problem and needs to >close. > We are sorry for the inconvenience. > [Send Error Report] [Don't Send] > >Any help would be GREATLY appreciated. > > > From mikes at xtras.com Fri Nov 28 17:11:49 2003 From: mikes at xtras.com (Mike Schinkel) Date: Fri Nov 28 17:12:03 2003 Subject: [Spambayes] SpamBayes crashes Outlook 2003 Message-ID: <1D85A2E1383A484CB168DCD61228D36645325B@xmail.xtras.com> Thanks for the reply. Strange, because my spam folders are in my Exchange inbox, so they have to exist in both places. However, I can't configure them because ever time I try to run the SpamBayes configuration Outlook crashes. What now? -Mike -----Original Message----- From: Adam Walker [mailto:adam.walker@rbwconsulting.com] Sent: Friday, November 28, 2003 5:09 PM To: Mike Schinkel Cc: spambayes@python.org Subject: Re: [Spambayes] SpamBayes crashes Outlook 2003 Looks like you need to configure the spam and unsure folders under outlook 2003. The plugin is probably sharing configs between to the two outlook versions but the either the folder doesn't exist under outlook 2003 or unique id is different. Mike Schinkel wrote: >Thanks. Attached are the logs. > >My configuration is a new Dell with 512k RAM, Windows XP, Outlook 2003, >and SpamBayes 0.81. I also have Outlook 2000 installed and SpamBayes >still works fine with it (and before you assume, no I'm not running >them at the same time; you can't run both at the same time.) > >As for the problem, every time I click the "Delete as Spam" button (or >even the "SpamBayes" button), Outlook "thinks" for a second and then >crashes with the following dialog: > > Microsoft Office Outlook has encountered a problem and needs to close. > We are sorry for the inconvenience. > [Send Error Report] [Don't Send] > >Any help would be GREATLY appreciated. > > > From adam.walker at rbwconsulting.com Fri Nov 28 17:22:03 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Fri Nov 28 17:22:35 2003 Subject: [Spambayes] SpamBayes crashes Outlook 2003 In-Reply-To: <1D85A2E1383A484CB168DCD61228D36645325B@xmail.xtras.com> References: <1D85A2E1383A484CB168DCD61228D36645325B@xmail.xtras.com> Message-ID: <3FC7CA8B.40600@rbwconsulting.com> Shutdown outlook. Rename the ini file. Restart outlook. The config wizard should popup ... or outlook will crash. The ini is in \Documents and Settings\{username}\Application Data\SpamBayes Mike Schinkel wrote: >Thanks for the reply. > >Strange, because my spam folders are in my Exchange inbox, so they have >to exist in both places. >However, I can't configure them because ever time I try to run the >SpamBayes configuration Outlook crashes. >What now? > > >-Mike > > >-----Original Message----- >From: Adam Walker [mailto:adam.walker@rbwconsulting.com] >Sent: Friday, November 28, 2003 5:09 PM >To: Mike Schinkel >Cc: spambayes@python.org >Subject: Re: [Spambayes] SpamBayes crashes Outlook 2003 > >Looks like you need to configure the spam and unsure folders under >outlook 2003. The plugin is probably sharing configs between to the two >outlook versions but the either the folder doesn't exist under outlook >2003 or unique id is different. > >Mike Schinkel wrote: > > > >>Thanks. Attached are the logs. >> >>My configuration is a new Dell with 512k RAM, Windows XP, Outlook 2003, >> >> > > > >>and SpamBayes 0.81. I also have Outlook 2000 installed and SpamBayes >>still works fine with it (and before you assume, no I'm not running >>them at the same time; you can't run both at the same time.) >> >>As for the problem, every time I click the "Delete as Spam" button (or >>even the "SpamBayes" button), Outlook "thinks" for a second and then >>crashes with the following dialog: >> >> Microsoft Office Outlook has encountered a problem and needs to >> >> >close. > > >> We are sorry for the inconvenience. >> [Send Error Report] [Don't Send] >> >>Any help would be GREATLY appreciated. >> >> >> >> >> > > > > > > From vickaul at yahoo.com Fri Nov 28 05:37:24 2003 From: vickaul at yahoo.com (Vic Kaul) Date: Fri Nov 28 17:46:38 2003 Subject: [Spambayes] A SPAM Message with a 0% Score Message-ID: <004f01c3b59b$9c50df90$250110ac@hp> How does one explain the attached email being scored 0% (0) by Spambayes. Why did my Spambayes filter not catch it? -vs -----Original Message----- From: Larry [mailto:jclpqp@nbc.com] Sent: Friday, November 28, 2003 2:07 AM Subject: NEWS REPORT YNSMWINLRV NEWS REPORT- PARENTS OF 15 - YEAR OLD - FIND $71,000 CASH HIDDEN IN HIS CLOSET! Does this headline look familiar? Of course it does. You most likely have just seen this story recently featured on a major nightly news program (USA), and reported elsewhere in the world. His mother was cleaning and putting laundry away when she came across a large brown paper bag that was suspiciously buried beneath some clothes and a skateboard in the back of her 15-year-old sons closet. Nothing could have prepared her for the shock she got when she opened the bag and found it was full of cash. Five-dollar bills, twenties, fifties and hundreds - all neatly rubber-banded in labeled piles. My first thought was that he had robbed a bank, says the 41-year-old woman, There was over $71,000 dollars in that bag - that's more than my husband earns in a year. The woman immediately called her husband at the car-dealership where he worked to tell him what she had discovered. He came home right away and they drove together to the boy's school and picked him up. Little did they suspect that where the money came from was more shocking than actually finding it in the closet? As it turns out, the boy had been sending out, via E-mail, a type of Report to E-mail addresses that he obtained off the Internet. Everyday after school for the past 2 months, he had been doing this right on his computer in his bedroom. I just got the E-mail one day and I figured what the heck, I put my name on it like the instructions said and I started sending it out", says the clever 15-year-old. The E-mail letter listed 5 addresses and contained instructions to send one $5 dollar bill to each person on the list, then delete the address at the top and move the others addresses Down, and finally to add your name to the top of the list. The letter goes on to state that you would receive several thousand dollars in five-dollar bills within 2 weeks if you sent out the letter with your name at the top of the 5-address list. I get junk E-mail all the time, and really did not think it was going to work, the boy continues. Within the first few days of sending out the E-mail, the Post Office Box that his parents had gotten him for his video-game magazine subscriptions began to fill up with not magazines, but envelopes containing $5 bills. About a week later I rode [my bike] down to the post office and my box had 1 magazine and about 300 envelops stuffed in it. There was also a yellow slip that said I had to go up to the [post office] counter. I thought I was in trouble or something (laughs). He goes on, I went up to the counter and they had a whole box of more mail for me. I had to ride back home and empty out my backpack because I could not carry it all. Over the next few weeks, the boy continued sending out the E-mail The money just kept coming in and I just kept sorting it and stashing it in the closet, barely had time for my homework. He had also been riding his bike to several of the banks in his area and exchanging the $5 bills for twenties, fifties and hundreds. I didn't want the banks to get suspicious so I kept riding to different banks with like five thousand at a time in my backpack. I would usually tell the lady at the bank counter that my dad had sent me in to exchange the money and he was outside waiting for me. One time the lady gave me a really strange look and told me that she would not be able to do it for me and my dad would have to come in and do it, but I just rode to the next bank down the street (laughs). Surprisingly, the boy did not have any reason to be afraid. The reporting news team examined and investigated the so-called chain-letter the boy was sending out and found that it was not a chain-letter at all. In fact, it was completely legal according to US Postal and Lottery Laws, Title 18, Section 1302 and 1341, or Title 18, Section 3005 in the US code, also in the code of federal regulations, Volume 16, Sections 255 and 436, which state a product or service must be exchanged for money received. Every five-dollar bill that he received contained a little note that read, Please send me report number XYZ. This simple note made the letter legal because he was exchanging a service (A Report on how-to) for a five-dollar fee. [This is the end of the media release. If you would like to understand how the system works and get your $71,000 - please continue reading. What appears below is what the 15 year old was sending out on the net - YOU CAN USE IT TOO - just follow the simple instructions]. +++++++++++++++++++++++++++++++++++++++++++++++++++++ BE FINANCIALLY FREE LIKE OTHERS WITHIN A YEAR!!! Before you say, BULL, please read the following. This is the letter you have been hearing about on the news lately. Due to the popularity of this letter on the Internet, a national weekly news program recently devoted an entire show to the investigation of this program described below, to see if it really can make people money. The show also investigated whether or not the program was legal. Their findings proved once and for all that there are absolutely NO Laws prohibiting the participation in the program and if people can follow the simple instructions, they are bound to make some megabucks with only $25 out of pocket cost. DUE TO THE RECENT INCREASE OF POPULARITY & RESPECT THIS PROGRAM HAS ATTAINED, IT IS CURRENTLY WORKING BETTER THAN EVER. Note* follow the directions below, I had best results the second time when I hired a bulk email service in addition to following the reports instructions. In order for all of us to be successful, many, many emails must be sent so that the returns are many. I have been extremely successful using the following company. They send out the offers, and all I do is accept money for reports, then I send back to the people as soon as possible. This is what one had to say: Thanks to this profitable opportunity. I was approached many times before but each time I passed on it. I am so glad I finally joined just to see what one could expect in return for the minimal effort and money required. To my astonishment, I received total $610,470.00 in 21 weeks, with money still coming in. Pam Hedland, Fort Lee, New Jersey. +++++++++++++++++++++++++++++++++++++++++++++++++++++ Here is another testimonial: This program has been around for a long time but I never believed in it. But one day when I received this again in the mail I decided to gamble my $25 on it. I followed the simple instructions and walaa ..... 3 weeks later the money started to come in. First month I only made $240.00 but the next 2 months after that I made a total of $290,000.00. So far, in the past 8 months by re-entering the program, I have made over $710,000.00 and I am playing it again. The key to success in this program is to follow the simple steps and NOT change anything. More testimonials later but first, PRINT THIS NOW FOR YOUR FUTURE REFERENCE Order all 5 reports shown on the list below For each report, send $5 CASH, THE NAME & NUMBER OF THE REPORT YOU ARE ORDERING and YOUR E-MAIL ADDRESS to the person whose name appears ON THAT LIST next to the report. MAKE SURE YOUR RETURN ADDRESS IS ON YOUR ENVELOPE TOP LEFT CORNER in case of any mail problems. When you place your order, make sure you order each of the 5 reports. You will need all 5 reports so that you can save them on your computer. Within a few days you will receive, vie e-mail, each of the 5 reports from these 5 different individuals. Save them on your computer so they will be accessible for you to send to the 1,000's of people who will order them from you. Also make a floppy of these reports and keep it on your desk in case something happens to your computer. IMPORTANT - DO NOT alter the names of the people who are listed next to each report, or their sequence on the list, in any way other than what is instructed below in step 1 through 6 or you will loose out on majority of your profits. Once you understand the way this works, you will also see how it does not work if you change it. Remember, this method has been tested, and if you alter, it will NOT work!!! People have tried to put their friends/relatives names on all five thinking they could get all the money. But it does not work this way. Believe us, we all have tried to be greedy and then nothing happened. So Do Not try to change anything other than what is instructed. Because if you do, it will not work for you. Remember, honesty reaps the reward!!! 1.... After you have ordered all 5 reports, take this advertisement and REMOVE the name & address of the person in REPORT # 5. This person has made it through the cycle and is no doubt counting their fortune. 2.... Move the name & address in REPORT # 4 down TO REPORT #5. 3.... Move the name & address in REPORT # 3 down TO REPORT #4. 4.... Move the name & address in REPORT # 2 down TO REPORT #3. 5.... Move the name & address in REPORT # 1 down TO REPORT #2 6.... Insert YOUR name & address in the REPORT # 1 Position. PLEASE MAKE SURE you copy every name & address ACCURATELY! +++++++++++++++++++++++++++++++++++++++++++++++++++++ **** Take this entire letter, with the modified list of names, and save it on your computer. DO NOT MAKE ANY OTHER CHANGES. Save this on a disk as well just in case if you lose any data. To assist you with marketing your business on the internet, the 5 reports you purchase will provide you with invaluable marketing information which includes how to send bulk e-mails legally, where to find thousands of free classified ads and much more. There are 2 Primary methods to get this venture going: METHOD #1: BY SENDING BULK E-MAIL LEGALLY +++++++++++++++++++++++++++++++++++++++++++++++++++++ Let's say that you decide to start small, just to see how it goes, and we will assume You and those involved send out only 5,000e-mails each. Let's also assume that the mailing receive only a 0.2% response (the response could be much better but lets just say it is only 0.2%. Also many people will send out hundreds of thousands e-mails instead of only 5,000 each). Continuing with this example, you send out only 5,000 e-mails. With a 0.2% response, that is only 10 orders for report # 1. Those 10 people responded by sending out 5,000 e-mail each for a total of 50,000. Out of those 50,000 e-mails only 0.2% responded with orders. That equals 100 people responded and ordered Report # 2. Those 100 people mail out 5,000 e-mails each for a total of 500,000 e-mails. The 0.2% response to that is 1000 orders for Report #3. Those 1000 people send out 5,000 e-mails each for a total of 5 million e-mails sent out. The 0.2% response to that is 10,000 orders for Report #4. Those 10,000 people send out 5,000 e-mails each for a total of 50,000,000 (50 million) e-mails. The 0.2% response to that is 100,000 orders for Report #5. THAT'S 100,000 ORDERS TIMES $5 EACH=$500,000.00 (half million). Your total income in this example is: 1..... $50 + 2..... $500 + 3..... $5,000 + 4..... $50,000 + 5..... $500,000........Grand Total=$555,550.00 NUMBERS DO NOT LIE. GET A PENCIL & PAPER AND FIGURE OUT THE WORST POSSIBLE RESPONSES AND NO MATTER HOW YOU CALCULATE IT, YOU WILL STILL MAKE A LOT OF MONEY! +++++++++++++++++++++++++++++++++++++++++++++++++++++ REMEMBER FRIEND, THIS IS ASSUMING ONLY 10 PEOPLE ORDERING OUT OF 5,000 YOU MAILED TO. Dare to think for a moment what would happen if everyone or half or e ven one 4th of those people mailed 100,000e-mails each or more? There are over 150 million people on the Internet worldwide and counting. Believe me, many people will do just that, and more! METHOD #2: BY PLACING FREE ADS ON THE INTERNET +++++++++++++++++++++++++++++++++++++++++++++++++++++ Advertising on the net is very very inexpensive and there are hundreds of FREE places to advertise. Placing a lot of free ads on the Internet will easily get a larger response. We strongly suggest you start with Method #1 and add METHOD #2 as you go along. For every $5 you receive, all you must do is e-mail them the Report they ordered. That's it. Always provide same day service on all orders. This will guarantee that the e-mail they send out with your name and address on it, will be prompt because they can not advertise until they receive the report. ================AVAILABLE REPORTS ================= ORDER EACH REPORT BY ITS NUMBER & NAME ONLY. Notes: Always send $5 cash (U.S. CURRENCY) for each Report. Checks NOT accepted. Make sure the cash is concealed by wrapping it in at least 2 sheets of paper. On one of those sheets of paper, Write the NUMBER & the NAME of the Report you are ordering, YOUR E-MAIL ADDRESS and your name and postal address. PLACE YOUR ORDER FOR THESE REPORTS NOW: +++++++++++++++++++++++++++++++++++++++++++++++++++++ REPORT #1: The Insider's Guide to Advertising for Free on the Net Order Report #1 from: J. Davis 16487 Lariat Road Apt. B Victorville, California 92392 __________________________________________________________ REPORT #2: The Insider's Guide to Sending Bulk e-mail on the Net Order Report #2 from: C.Smith PO Box 25 Bexley NSW 2207 AUSTRALIA ________________________________________________________ REPORT #3: Secret to Multilevel marketing on the Net Order Report #3 from: S. Calhoun P.O. Box 20051 Barrie, ON Canada L4M 6E9 __________________________________________________________ REPORT #4: How to become a millionaire utilizing MLM & the Net Order Report #4 from: L. Jespersen 24 Valleyview Cres Bradford, ON Canada L3Z1S7 ______________________________________________________ REPORT #5: How to send out 0ne Million emails for free Order Report #5 From: C. Abee 113 Ervin Ave. NE Valdese, NC 28690 $$$$$$$$$$$$$$$$ YOUR SUCCESS GUIDELINES $$$$$$$$$$$$$$$$ Follow these guidelines to guarantee your success: If you do not receive at least 10 orders for Report #1 within 2 weeks, continue sending e-mails until you do. === After you have received 10 orders, 2 to 3 weeks after that you should receive 100 orders or more for REPORT #2. If you did not, continue advertising or sending e-mails until you do. Once you have received 100 or more orders for Report #2, YOU CAN RELAX, because the system is already working for you, and the cash will continue to roll in! THIS IS IMPORTANT TO REMEMBER: Every time your name is moved down on the list, you are placed in front of a Different report. You can KEEP TRACK of your PROGRESS by watching which report people are ordering from you. IF YOU WANT TO GENERATE MORE INCOME SEND ANOTHER BATCH OF E-MAILS AND START THE WHOLE PROCESS AGAIN. There is NO LIMIT to the income you can generate from this business!!! +++++++++++++++++++++++++++++++++++++++++++++++++++++ FOLLOWING IS A NOTE FROM THE ORIGINATOR OF THIS PROGRAM: You have just received information that can give you financial freedom for the rest of your life, with NO RISK and JUST A LITTLE BIT OF EFFORT. You can make more money in the next few weeks and months than you have ever imagined. Follow the program EXACTLY AS INSTRUCTED. Do Not change it in any way. It works exceedingly well as it is now. Remember to e-mail a copy of this exciting report after you have put your name and address in Report#1 and moved others to #2 thru #5 as instructed above. One of the people you send this to may send out 100,000 or more e-mails and your name will be on every one of them. Remember though, the more you send out the more potential customers you will reach. So my friend, I have given you the ideas, information, materials and opportunity to become financially independent. IT IS UP TO YOU NOW! =================== MORE TESTIMONIALS============= My name is Mitchell. My wife, Jody and I live in Chicago. I am an accountant with a major U.S. Corporation and I make pretty good money. When I received this program I grumbled to Jody about receiving junk mail. I made fun of the whole thing, spouting my knowledge of the population and percentages involved. I knew it wouldn't work. Jody totally ignored my supposed intelligence and few days later she jumped in with both feet. I made merciless fun of her, and was ready to lay the old I told you so on her when the thing didn't work. Well, the laugh was on me! Within 3 weeks she had received 50 responses. Within the next 45 days she had received total $ 147,200.00....all cash! I was shocked. I have joined Jody in her hobby. Mitchell Wolf, Chicago, Illinois +++++++++++++++++++++++++++++++++++++++++++++++++++++ Not being the gambling type, it took me several weeks to make up my mind to participate in this plan. But conservative that I am, I decided that the initial investment was so little that there was just no way that I wouldn't get enough orders to at least get my money back. I was surprised when I found my medium size post office box crammed with orders. I made $319,210.00 in the first 12 weeks. The nice thing about this deal is that it does not matter where people live. There simply isn't a better investment with a faster return and so big. Dan Sondstrom, Alberta, Canada +++++++++++++++++++++++++++++++++++++++++++++++++++++ I had received this program before. I deleted it, but later I wondered if I should have given it a try. Of course, I had no idea who to contact to get another copy, so I had to wait until I was e-mailed again by someone else......11 months passed then it luckily came again...... I did not delete this one! I made more than $490,000 on my first try and all the money came within 22 weeks. Susan De Suza, New York, N.Y +++++++++++++++++++++++++++++++++++++++++++++++++++++ DSPYIJOLFPBUOIGWDCQHPZQLUYMONGYSYXBHZV From mikes at xtras.com Fri Nov 28 18:14:53 2003 From: mikes at xtras.com (Mike Schinkel) Date: Fri Nov 28 18:14:57 2003 Subject: [Spambayes] SpamBayes crashes Outlook 2003 Message-ID: <1D85A2E1383A484CB168DCD61228D36645325D@xmail.xtras.com> >> The config wizard should popup ... or outlook will crash. Neither. Many thanks for the effort thus far, but still no dice. Outlook XP still behaves, and Outlook 2003 still misbehaves. No different than before. Next? -Mike -----Original Message----- From: Adam Walker [mailto:adam.walker@rbwconsulting.com] Sent: Friday, November 28, 2003 5:22 PM To: Mike Schinkel Cc: spambayes@python.org Subject: Re: [Spambayes] SpamBayes crashes Outlook 2003 Shutdown outlook. Rename the ini file. Restart outlook. The config wizard should popup ... or outlook will crash. The ini is in \Documents and Settings\{username}\Application Data\SpamBayes Mike Schinkel wrote: >Thanks for the reply. > >Strange, because my spam folders are in my Exchange inbox, so they have >to exist in both places. >However, I can't configure them because ever time I try to run the >SpamBayes configuration Outlook crashes. >What now? > > >-Mike > > >-----Original Message----- >From: Adam Walker [mailto:adam.walker@rbwconsulting.com] >Sent: Friday, November 28, 2003 5:09 PM >To: Mike Schinkel >Cc: spambayes@python.org >Subject: Re: [Spambayes] SpamBayes crashes Outlook 2003 > >Looks like you need to configure the spam and unsure folders under >outlook 2003. The plugin is probably sharing configs between to the two >outlook versions but the either the folder doesn't exist under outlook >2003 or unique id is different. > >Mike Schinkel wrote: > > > >>Thanks. Attached are the logs. >> >>My configuration is a new Dell with 512k RAM, Windows XP, Outlook >>2003, >> >> > > > >>and SpamBayes 0.81. I also have Outlook 2000 installed and SpamBayes >>still works fine with it (and before you assume, no I'm not running >>them at the same time; you can't run both at the same time.) >> >>As for the problem, every time I click the "Delete as Spam" button (or >>even the "SpamBayes" button), Outlook "thinks" for a second and then >>crashes with the following dialog: >> >> Microsoft Office Outlook has encountered a problem and needs to >> >> >close. > > >> We are sorry for the inconvenience. >> [Send Error Report] [Don't Send] >> >>Any help would be GREATLY appreciated. >> >> >> >> >> > > > > > > From tim.one at comcast.net Fri Nov 28 18:58:09 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Nov 28 18:58:14 2003 Subject: [Spambayes] A SPAM Message with a 0% Score In-Reply-To: <004f01c3b59b$9c50df90$250110ac@hp> Message-ID: [vs] > How does one explain the attached email being scored 0% (0) by > Spambayes. > > Why did my Spambayes filter not catch it? You didn't say which version of SpamBayes you're using. Whichever it is, it supports some way of showing you the evidence it used in arriving at its decision. Do that, and it will tell exactly why it decided the msg was ham. At a higher level, SpamBayes comes with no preconceived ideas about what ham and spam are. The only things it knows are what you teach it. So the 100% accurate (but probably unhelpful!) answer is "because the message looked a lot more like ham than spam based on everything you've taught the system so far". IOW, look in the mirror . From tim at fourstonesExpressions.com Fri Nov 28 19:30:26 2003 From: tim at fourstonesExpressions.com (Tim Stone) Date: Fri Nov 28 19:30:35 2003 Subject: [Spambayes] A SPAM Message with a 0% Score In-Reply-To: <004f01c3b59b$9c50df90$250110ac@hp> References: <004f01c3b59b$9c50df90$250110ac@hp> Message-ID: FYI... I scored this mail a solid spam at 0.862236950494... On Fri, 28 Nov 2003 02:37:24 -0800, Vic Kaul wrote: > How does one explain the attached email being scored 0% (0) by Spambayes. > > Why did my Spambayes filter not catch it? > > -vs > > > -----Original Message----- > From: Larry [mailto:jclpqp@nbc.com] > Sent: Friday, November 28, 2003 2:07 AM > Subject: NEWS REPORT YNSMWINLRV > > > NEWS REPORT- > PARENTS OF 15 - YEAR OLD - FIND $71,000 CASH HIDDEN IN HIS CLOSET! > > Does this headline look familiar? Of course it does. You most likely have > just seen this story recently featured on a major nightly news program > (USA), and reported elsewhere in the world. His mother was > cleaning and putting laundry away when she came across a large brown > paper > bag that was suspiciously > buried beneath some clothes and a skateboard in the back of her > 15-year-old > sons closet. Nothing could > have prepared her for the shock she got when she opened the bag and > found it > was full of cash. > Five-dollar bills, twenties, fifties and hundreds - all neatly > rubber-banded > in labeled piles. > > My first thought was that he had robbed a bank, says the 41-year-old > woman, > There was over $71,000 > dollars in that bag - that's more than my husband earns in a year. The > woman > immediately called her > husband at the car-dealership where he worked to tell him what she had > discovered. He came home > right away and they drove together to the boy's school and picked him up. > Little did they suspect that where the money came from was more shocking > than actually finding it in the closet? > > As it turns out, the boy had been sending out, via E-mail, a type of > Report > to E-mail addresses that he > obtained off the Internet. Everyday after school for the past 2 months, > he > had been doing this right on > his computer in his bedroom. > > I just got the E-mail one day and I figured what the heck, I put my name > on > it like the instructions said > and I started sending it out", says the clever 15-year-old. The E-mail > letter listed 5 addresses and > contained instructions to send one $5 dollar bill to each person on the > list, then delete the address > at the top and move the others addresses Down, and finally to add your > name > to the top of the list. > > The letter goes on to state that you would receive several thousand > dollars > in five-dollar bills within > 2 weeks if you sent out the letter with your name at the top of the > 5-address list. I get junk E-mail all > the time, and really did not think it was going to work, the boy > continues. > > Within the first few days of sending out the E-mail, the Post Office Box > that his parents had gotten him > for his video-game magazine subscriptions began to fill up with not > magazines, but envelopes containing > $5 bills. About a week later I rode [my bike] down to the post office > and my > box had 1 magazine and > about 300 envelops stuffed in it. There was also a yellow slip that said > I > had to go up to the [post office] > counter. I thought I was in trouble or something (laughs). He goes on, I > went up to the counter and they > had a whole box of more mail for me. I had to ride back home and empty > out > my backpack because I > could not carry it all. > > Over the next few weeks, the boy continued sending out the E-mail The > money > just kept coming in and > I just kept sorting it and stashing it in the closet, barely had time > for my > homework. He had also been > riding his bike to several of the banks in his area and exchanging the $5 > bills for twenties, fifties and > hundreds. > > I didn't want the banks to get suspicious so I kept riding to different > banks with like five thousand at a > time in my backpack. I would usually tell the lady at the bank counter > that > my dad had sent me in to > exchange the money and he was outside waiting for me. One time the lady > gave > me a really strange > look and told me that she would not be able to do it for me and my dad > would > have to come in and > do it, but I just rode to the next bank down the street (laughs). > > Surprisingly, the boy did not have any reason to be afraid. The reporting > news team examined and > investigated the so-called chain-letter the boy was sending out and found > that it was not a chain-letter > at all. In fact, it was completely legal according to US Postal and > Lottery > Laws, Title 18, Section 1302 > and 1341, or Title 18, Section 3005 in the US code, also in the code of > federal regulations, Volume 16, > Sections 255 and 436, which state a product or service must be exchanged > for > money received. > > Every five-dollar bill that he received contained a little note that > read, > Please send me report number XYZ. This simple note made the letter legal > because he was exchanging a service (A Report on how-to) for a > five-dollar fee. [This is the end of the media release. If you would > like to > understand how the system > works and get your $71,000 - please continue reading. What appears > below is > what the 15 year old > was sending out on the net - YOU CAN USE IT TOO - just follow the simple > instructions]. > > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > BE FINANCIALLY FREE LIKE OTHERS WITHIN A YEAR!!! Before you say, BULL, > please read the following. This is the letter you have been hearing > about on > the news lately. Due to the popularity of this letter on the > Internet, a national weekly news program recently devoted an entire show > to > the investigation of this > program described below, to see if it really can make people money. The > show > also investigated whether or > not the program was legal. Their findings proved once and for all that > there are absolutely NO Laws > prohibiting the participation in the program and if people can follow the > simple instructions, they are > bound to make some megabucks with only $25 out of pocket cost. > > DUE TO THE RECENT INCREASE OF POPULARITY & RESPECT THIS PROGRAM HAS > ATTAINED, IT > IS CURRENTLY WORKING BETTER THAN EVER. > > Note* follow the directions below, I had best results the second time > when I > hired a bulk email service in > addition to following the reports instructions. > In order for all of us to be successful, many, many emails must be sent > so > that the returns are many. I have > been extremely successful using the following company. They send out the > offers, and all I do is accept > money for reports, then I send back to the people as soon as possible. > > This is what one had to say: Thanks to this profitable opportunity. I was > approached many times before > but each time I passed on it. I am so glad I finally joined just to see > what > one could expect in return for > the minimal effort and money required. To my astonishment, I received > total > $610,470.00 in 21 weeks, > with money still coming in. > Pam Hedland, Fort Lee, New Jersey. > > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > Here is another testimonial: This program has been around for a long time > but I never believed in it. > But one day when I received this again in the mail I decided to gamble my > $25 on it. I followed the > simple instructions and walaa ..... 3 weeks later the money started to > come > in. First month I only made > $240.00 but the next 2 months after that I made a total of $290,000.00. > So > far, in the past 8 months by > re-entering the program, I have made over $710,000.00 and I am playing it > again. The key to success in > this program is to follow the simple steps and NOT change anything. > More testimonials later but first, > > PRINT THIS NOW FOR YOUR FUTURE REFERENCE > > Order all 5 reports shown on the list below For each report, send $5 > CASH, > THE NAME & NUMBER OF > THE REPORT YOU ARE ORDERING and YOUR E-MAIL ADDRESS to the person whose > name > appears > ON THAT LIST next to the report. > > MAKE SURE YOUR RETURN ADDRESS IS ON YOUR ENVELOPE TOP LEFT CORNER in > case of > > any mail problems. When you place your order, make sure you order each of > the 5 reports. > You will need all 5 reports so that you can save them on your computer. > Within a few days you will > receive, vie e-mail, each of the 5 reports from these 5 different > individuals. > Save them on your computer so they will be accessible for you to send to > the > 1,000's of people who > will order them from you. Also make a floppy of these reports and keep > it on > your desk in case something > happens to your computer. > > IMPORTANT - DO NOT alter the names of the people who are listed next to > each > report, or their sequence > on the list, in any way other than what is instructed below in step 1 > through 6 or you will loose out on > majority of your profits. Once you understand the way this works, you > will > also see how it does not work > if you change it. Remember, this method has been tested, and if you > alter, > it will NOT work!!! People have > tried to put their friends/relatives names on all five thinking they > could > get all the money. But it does not > work this way. Believe us, we all have tried to be greedy and then > nothing > happened. So Do Not try to > change anything other than what is instructed. Because if you do, it will > not work for you. > Remember, honesty reaps the reward!!! > > 1.... After you have ordered all 5 reports, take this advertisement and > REMOVE the name & address of > the person in REPORT # 5. This person has made it through the > cycle > and is no doubt counting > their fortune. > 2.... Move the name & address in REPORT # 4 down TO REPORT #5. > 3.... Move the name & address in REPORT # 3 down TO REPORT #4. > 4.... Move the name & address in REPORT # 2 down TO REPORT #3. 5.... Move > the name & address in REPORT # 1 down TO REPORT #2 > 6.... Insert YOUR name & address in the REPORT # 1 Position. > > PLEASE MAKE SURE you copy every name & address ACCURATELY! > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > > > **** Take this entire letter, with the modified list of names, and save > it > on your computer. DO NOT MAKE > ANY OTHER CHANGES. Save this on a disk as well just in case if you lose > any > data. To assist you with > marketing your business on the internet, the 5 reports you purchase will > provide you with invaluable > marketing information which includes how to send bulk e-mails legally, > where > to find thousands of free > classified ads and much more. There are 2 Primary methods to get this > venture going: > > METHOD #1: BY SENDING BULK E-MAIL LEGALLY > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > > > Let's say that you decide to start small, just to see how it goes, and we > will assume You and those involved > send out only 5,000e-mails each. Let's also assume that the mailing > receive > only a 0.2% response (the > response could be much better but lets just say it is only 0.2%. Also > many > people will send out hundreds > of thousands e-mails instead of only 5,000 each). Continuing with this > example, you send out only > 5,000 e-mails. > > With a 0.2% response, that is only 10 orders for report # 1. Those 10 > people responded by sending > out 5,000 e-mail each for a total of 50,000. Out of those 50,000 e-mails > only 0.2% responded with > orders. That equals 100 people responded and ordered Report # 2. > Those 100 people mail out 5,000 e-mails each for a total of 500,000 > e-mails. > The 0.2% response to that is > 1000 orders for Report #3. > Those 1000 people send out 5,000 e-mails each for a total of 5 million > e-mails sent out. The 0.2% response > to that is 10,000 orders for Report #4. > Those 10,000 people send out 5,000 e-mails each for a total of 50,000,000 > (50 million) e-mails. The 0.2% response to that is 100,000 orders for > Report > #5. THAT'S 100,000 ORDERS TIMES $5 EACH=$500,000.00 (half million). > Your total income in this example is: > 1..... $50 + > 2..... $500 + > 3..... $5,000 + > 4..... $50,000 + > 5..... $500,000........Grand Total=$555,550.00 > > NUMBERS DO NOT LIE. GET A PENCIL & PAPER AND FIGURE OUT THE WORST > POSSIBLE > RESPONSES AND NO MATTER HOW YOU CALCULATE IT, YOU WILL STILL MAKE A LOT > OF > MONEY! > > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > REMEMBER FRIEND, THIS IS ASSUMING ONLY 10 PEOPLE ORDERING > OUT OF 5,000 YOU MAILED TO. Dare to think for a moment what would happen > if > everyone or half or e ven one 4th of those people mailed 100,000e-mails > each > or more? There are over 150 million people on > the Internet worldwide and counting. Believe me, many people will do > just > that, and more! > > METHOD #2: BY PLACING FREE ADS ON THE INTERNET > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > Advertising on the net is very very inexpensive and there are hundreds of > FREE places to advertise. > Placing a lot of free ads on the Internet will easily get a larger > response. > > > We strongly suggest you start with Method #1 and add METHOD #2 as you go > along. For every $5 you > receive, all you must do is e-mail them the Report they ordered. That's > it. > Always provide same day service on all orders. This will guarantee that > the > e-mail they send out with your name and address on it, will be > prompt because they can not advertise until they receive the report. > > ================AVAILABLE REPORTS ================= > ORDER EACH REPORT BY ITS NUMBER & NAME ONLY. > Notes: Always send $5 cash (U.S. CURRENCY) for each Report. > Checks NOT accepted. > Make sure the cash is concealed by wrapping it in at least 2 sheets of > paper. > On one of those sheets of paper, Write the NUMBER & the NAME of the > Report > you are ordering, > YOUR E-MAIL ADDRESS and your name and postal address. > > > > > PLACE YOUR ORDER FOR THESE REPORTS NOW: > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > > REPORT #1: The Insider's Guide to Advertising for Free on the Net > Order Report #1 from: > J. Davis > 16487 Lariat Road Apt. B > Victorville, California 92392 > __________________________________________________________ > > > REPORT #2: The Insider's Guide to Sending Bulk e-mail on the Net > Order Report #2 from: > C.Smith > PO Box 25 > Bexley > NSW 2207 > AUSTRALIA > > ________________________________________________________ > > REPORT #3: Secret to Multilevel marketing on the Net > Order Report #3 from: > S. Calhoun > P.O. Box 20051 > Barrie, ON Canada > L4M 6E9 > > __________________________________________________________ > > REPORT #4: How to become a millionaire utilizing MLM & the Net > Order Report #4 from: > L. Jespersen > 24 Valleyview Cres > Bradford, ON Canada > L3Z1S7 > > ______________________________________________________ > > REPORT #5: How to send out 0ne Million emails for free > Order Report #5 From: > C. Abee > 113 Ervin Ave. NE > Valdese, NC 28690 > > $$$$$$$$$$$$$$$$ YOUR SUCCESS GUIDELINES $$$$$$$$$$$$$$$$ > Follow these guidelines to guarantee your success: > If you do not receive at least 10 orders for Report #1 within 2 weeks, > continue sending e-mails until you > do. === After you have received 10 orders, 2 to 3 weeks after that you > should receive 100 orders or more > for REPORT #2. If you did not, continue advertising or sending e-mails > until > you do. > Once you have received 100 or more orders for Report #2, YOU CAN RELAX, > because the system is > already working for you, and the cash will continue to roll in! THIS IS > IMPORTANT TO REMEMBER: Every time your name is moved down on the list, > you > are placed in front of a Different report. You can KEEP > TRACK of your PROGRESS by watching which report people are ordering from > you. IF YOU WANT TO > GENERATE MORE INCOME SEND ANOTHER BATCH OF E-MAILS AND START THE WHOLE > PROCESS > AGAIN. There is NO LIMIT to the income you can generate from this > business!!! > > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > FOLLOWING IS A NOTE FROM THE ORIGINATOR OF THIS PROGRAM: You have just > received > information that can give you financial freedom for the rest of your > life, > with NO RISK and JUST A LITTLE BIT OF EFFORT. You can make more money in > the > next few weeks and months than you have ever > imagined. Follow the program EXACTLY AS INSTRUCTED. Do Not change it in > any > way. It works > exceedingly well as it is now. > Remember to e-mail a copy of this exciting report after you have put your > name and address in Report#1 > and moved others to #2 thru #5 as instructed above. One of the people you > send this to may send out > 100,000 or more e-mails and your name will be on every one of them. > Remember > though, the more you > send out the more potential customers you will reach. So my friend, I > have > given you the ideas, information, materials and opportunity to become > financially independent. IT IS UP TO YOU NOW! > > =================== MORE TESTIMONIALS============= > My name is Mitchell. My wife, Jody and I live in Chicago. I am an > accountant > with a major U.S. > Corporation and I make pretty good money. When I received this program I > grumbled to Jody about > receiving junk mail. I made fun of the whole thing, spouting my > knowledge of > the population and > percentages involved. I knew it wouldn't work. Jody totally ignored my > supposed intelligence and > few days later she jumped in with both feet. I made merciless fun of her, > and was ready to lay the old > I told you so on her when the thing didn't work. Well, the laugh was on > me! > Within 3 weeks she had > received 50 responses. Within the next 45 days she had received total $ > 147,200.00....all cash! I > was shocked. I have joined Jody in her hobby. > Mitchell Wolf, Chicago, Illinois > > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > Not being the gambling type, it took me several weeks to make up my mind > to > participate in this plan. > But conservative that I am, I decided that the initial investment was so > little that there was just no way > that I wouldn't get enough orders to at least get my money back. I was > surprised when I found my > medium size post office box crammed with orders. I made $319,210.00 in > the > first 12 weeks. The nice > thing about this deal is that it does not matter where people live. There > simply isn't a better investment > with a faster return and so big. > Dan Sondstrom, Alberta, Canada > > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > I had received this program before. I deleted it, but later I wondered > if I > should have given it a try. > Of course, I had no idea who to contact to get another copy, so I had to > wait until I was e-mailed > again by someone else......11 months passed then it luckily came > again...... > I did not delete this one! > I made more than $490,000 on my first try and all the money came within > 22 > weeks. > Susan De Suza, New York, N.Y > > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > > DSPYIJOLFPBUOIGWDCQHPZQLUYMONGYSYXBHZV > > > > _______________________________________________ > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html > -- Vous exprimer; Expr?sese; Te stesso esprimere; Express yourself! Tim Stone See my photography at www.fourstonesExpressions.com See my writing at www.xanga.com/obj3kshun From adam.walker at rbwconsulting.com Fri Nov 28 21:32:32 2003 From: adam.walker at rbwconsulting.com (Adam Walker) Date: Fri Nov 28 21:33:44 2003 Subject: [Spambayes] SpamBayes crashes Outlook 2003 In-Reply-To: <1D85A2E1383A484CB168DCD61228D36645325D@xmail.xtras.com> References: <1D85A2E1383A484CB168DCD61228D36645325D@xmail.xtras.com> Message-ID: <3FC80540.7040300@rbwconsulting.com> Did you rename/move all the ini files in that folder (there should be more than one)? Mike Schinkel wrote: >>>The config wizard should popup ... or outlook will crash. >>> >>> > >Neither. > >Many thanks for the effort thus far, but still no dice. >Outlook XP still behaves, and Outlook 2003 still misbehaves. >No different than before. > >Next? > >-Mike > > > > > > > From nihxogwql5 at hotmail.it Sun Nov 30 01:23:38 2003 From: nihxogwql5 at hotmail.it (Rosalie Raines) Date: Sat Nov 29 23:23:55 2003 Subject: [Spambayes] Get a diploma based on your present knowledge and life experience Message-ID: <5$2-$1h0c-ap4hvf-47zu@b5adapvxteko7> U N I V E R S I T Y D I P L O M A ' S Diplomas from prestigious non-accredited universities based on your present knowledge and life experience. Bachelors, masters, MBA, and doctorate (PhD) diplomas available in the field of your choice. Obtain a prosperous future, money earning power, and the admiration of all. No one is turned down. Confidentiality assured. No required tests, classes, books, or interviews. CALL NOW to receive your diploma within days!!! 1-425-669-4485 Call 24 hours a day, 7 days a week, including Sundays and holidays - you have subscribed to one of our or our partners' mailing lists, Newsletters or Websites. Under Bill S.1618 TITLE III SECTION 301. Per Section 301, Paragraph (a) (2) (C) passed by the 105th US Congress any email or Mass Marketing email cannot be considered Spam as long as the sender includes contact information and a method of removal. To block further mailings, Send a blank to the following address. unsubs@everyday.com vm dgslgwbkxacjasjb bttc ckwwnoil voypkqwm ezdz edadn From rancherm at bgumail.bgu.ac.il Sun Nov 30 08:10:43 2003 From: rancherm at bgumail.bgu.ac.il (Ran Chermesh) Date: Sun Nov 30 08:09:50 2003 Subject: [Spambayes] Exception 0x8004011c (MAPI_E_UNCONFIGURED) Message-ID: <000001c3b743$62318450$04854884@ran> Hi, I'm trying to run spambayes on my office pc (win2k pro, imap mail server, Pentium II), and fail with a Exception 0x8004011c (MAPI_E_UNCONFIGURED) error message. The computer is a node on a departmental network. I've already installed this same version of spambayes on my home pc, with no apparent problem. What went wrong? Ran Here's the log file: Loaded bayes database from 'C:\Documents and Settings\chermesh.BGU-DEV.000\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\chermesh.BGU-DEV.000\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 746 spam and 123 good messages SpamBayes Outlook Addin, Binary version 0.81 (September 9, 2003) starting (with engine SpamBayes Beta2, version 0.2 (July 2003)) on Windows 5.0.2195 (Service Pack 4) using Python 2.3.2 (#49, Oct 2 2003, 20:02:00) [MSC v.1200 32 bit (Intel)] WARNING: Unexpected MAPI error opening folder Exception 0x8004011c (MAPI_E_UNCONFIGURED): OLE error 0x8004011c WARNING: Unexpected MAPI error opening folder Exception 0x8004011c (MAPI_E_UNCONFIGURED): OLE error 0x8004011c WARNING: Unexpected MAPI error opening folder Exception 0x8004011c (MAPI_E_UNCONFIGURED): OLE error 0x8004011c WARNING: Unexpected MAPI error opening folder Exception 0x8004011c (MAPI_E_UNCONFIGURED): OLE error 0x8004011c WARNING: Unexpected MAPI error opening folder Exception 0x8004011c (MAPI_E_UNCONFIGURED): OLE error 0x8004011c OnAddInsUpdate None OnStartupComplete None pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "C:\PYTHON23\lib\site-packages\win32com\universal.py", line 173, in dispatch retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None) File "C:\PYTHON23\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_ return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider) File "C:\PYTHON23\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "C:\PYTHON23\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_ return func(*args) File "out1.pyz/addin", line 1335, in OnStartupComplete File "C:\PYTHON23\lib\site-packages\win32com\client\__init__.py", line 451, in __getattr__ return self._ApplyTypes_(*args) File "C:\PYTHON23\lib\site-packages\win32com\client\__init__.py", line 445, in _ApplyTypes_ return self._get_good_object_(self._oleobj_.InvokeTypes(*((dispid, 0, wFlags, retType, argTypes) + args)), user, resultCLSID) File "C:\PYTHON23\lib\site-packages\win32com\client\__init__.py", line 463, in _get_good_object_ return _get_good_object_(obj, obUserName, resultCLSID) File "C:\PYTHON23\lib\site-packages\win32com\client\__init__.py", line 481, in _get_good_object_ return _get_good_single_object_(obj, obUserName, resultCLSID) File "C:\PYTHON23\lib\site-packages\win32com\client\__init__.py", line 468, in _get_good_single_object_ return Dispatch(obj, obUserName, resultCLSID, UnicodeToString=NeedUnicodeConversions) File "C:\PYTHON23\lib\site-packages\win32com\client\__init__.py", line 96, in Dispatch return __WrapDispatch(dispatch, userName, resultCLSID, typeinfo, UnicodeToString, clsctx) File "C:\PYTHON23\lib\site-packages\win32com\client\__init__.py", line 38, in __WrapDispatch klass = gencache.GetClassForCLSID(resultCLSID) File "C:\PYTHON23\lib\site-packages\win32com\client\gencache.py", line 171, in GetClassForCLSID mod = GetModuleForCLSID(clsid) File "C:\PYTHON23\lib\site-packages\win32com\client\gencache.py", line 232, in GetModuleForCLSID makepy.GenerateChildFromTypeLibSpec(sub_mod, info) File "C:\PYTHON23\lib\site-packages\win32com\client\makepy.py", line 304, in GenerateChildFromTypeLibSpec gen.generate_child(child, dir_path_name) File "C:\PYTHON23\lib\site-packages\win32com\client\genpy.py", line 958, in generate_child self.file = open(os.path.join(dir, an_item.python_name) + ".py", "w") exceptions.IOError: [Errno 2] No such file or directory: 'C:\\Python23\\Lib\\site-packages\\win32com\\demos\\support\\gen_py\\00062FF F-0000-0000-C000-000000000046x0x9x1\\ExplorersEvents.py' From patents at ctcweb.net Sun Nov 30 11:47:59 2003 From: patents at ctcweb.net (Robert N. Cordy) Date: Sun Nov 30 11:48:08 2003 Subject: [Spambayes] Question Message-ID: Overall I am impressed with the capabilities of ZoneAlarm. Many thanks! However, frequently Outlook shows that it is downloading 3 (as an example) email messages but only one shows up in my inbox and NONE in the spam folders. In other words the sum of the messages in the inbox plus those in the spam folders is often less than the number actually downloaded. Where might these other emails go? Bob C. ============ From patents at ctcweb.net Sun Nov 30 12:02:18 2003 From: patents at ctcweb.net (Robert N. Cordy) Date: Sun Nov 30 12:02:18 2003 Subject: [Spambayes] Previous question Message-ID: I apologize, of course my question related to SpamBayes and not ZoneAlarm!! Bob C. ======= From mpoole at madisonville.com Sun Nov 30 12:59:03 2003 From: mpoole at madisonville.com (M Poole) Date: Sun Nov 30 12:59:20 2003 Subject: [Spambayes] how do I uninstall spambayes Message-ID: <000001c3b76b$a9738610$0b64a8c0@mmpoole> How do I uninstall spambays? Mitch Poole From mark at howellsfamily.org Sun Nov 30 18:07:52 2003 From: mark at howellsfamily.org (Mark Howells) Date: Sun Nov 30 18:09:11 2003 Subject: [Spambayes] Clasification difficulties using pop3proxy / sb_server Message-ID: <001c01c3b796$c818ed00$0200a8c0@athlon> First up, let me say that I am extremely impressed with SpamBayes. I've been using theOutlook plugin at work with spectacular results - No false positives at all and only occasional unsures.... Now I'm using it to protect my home network. However, I've not got as good results and have noticed the following. I use the pop3proxy (via sb_server). I currently have the classifier trained on 575 Spam and 480 Ham. Because my family use Outlook Express, I asked the proxy to add 'unsure' and 'spam' onto the Subject line. So far so good. However, I have been getting a lot of 'unsures' (30% or so). When I use the web interface to review the messages and look at the clues for each unsure, I see that the spam rating is often 0.5 or even higher - sometimes well within the Spam zone (I currently use 20% and 80% as my thresholds). So , why is the classifier rating them as unsure? Maybe related to the above, I have noticed that the 'unsure' subject line is marked as such in the cache. This means that when I train the classifier that the unsure is really spam, the unsure token is included in the subject line. In consequence 'Subject : unsure' has become very spammy (a query on subject:unsure gives "Probability that a message containing this word is spam: 0.800342155644"). There's a similar effect for the 'spam' label. Surely the cache shouldn't store the updated subject line? Cheers Mark First up, let me say that I am extremely impressed with SpamBayes. I've been using theOutlook plugin at work with spectacular results - No false positives at all and only occasional unsures.... Now I'm using it to protect my home network. However, I've not got as good results and have noticed the following. I use the pop3proxy (via sb_server). I currently have the classifier trained on 575 Spam and 480 Ham. Because my family use Outlook Express, I asked the proxy to add 'unsure' and 'spam' onto the Subject line. So far so good. However, I have been getting a lot of 'unsures' (30% or so). When I use the web interface to review the messages and look at the clues for each unsure, I see that the spam rating is often 0.5 or even higher - sometimes well within the Spam zone (I currently use 20% and 80% as my thresholds). So , why is the classifier rating them as unsure? Maybe related to the above, I have noticed that the 'unsure' subject line is marked as such in the cache. This means that when I train the classifier that the unsure is really spam, the unsure token is included in the subject line. In consequence 'Subject : unsure' has become very spammy (a query on subject:unsure gives "Probability that a message containing this word is spam: 0.800342155644"). There's a similar effect for the 'spam' label. Surely the cache shouldn't store the updated subject line? Cheers Mark From skip at pobox.com Sun Nov 30 18:11:26 2003 From: skip at pobox.com (Skip Montanaro) Date: Sun Nov 30 18:11:33 2003 Subject: [Spambayes] sb_mailsort.py In-Reply-To: References: <3dm7svku2qdl9smqbtehdqstmnp3nk6vqk@4ax.com> Message-ID: <16330.31006.894815.978136@montanaro.dyndns.org> atom> ok, no answers to my previous post, of similar subject... here's atom> more.... I sort of suspect very few people use sb_mailsort.py. atom> 1) will the script return a non-zero exit status if it fails for atom> *ANY* reason? this is important, since a .qmail-file will atom> recognize that as an unsuccessful delivery attempt. Looking at the code, it appears that sb_mailsort.py may not properly exit with non-zero status in all cases. I see one error condition it catches then raises SystemExit, which on my system produces an exit status of 1. However, the main() function call is not wrapped in a try/except statement. You might want to modify the end of sb_mailsort.py to something like if __name__ == "__main__": try: main() except Exception, msg: sys.exit(1) else: sys.exit(0) just to be sure unanticipated error conditions are caught and that you guarantee a non-zero exit status. More detailed analysis of the program's guts is probably required though. atom> 2) is the SPAM_CUTTOFF hard-coded at 0.57??? (like i said, i don't atom> know code in python.) is there a way to only assign a value to atom> a variable [in python] if a user-specified value is not atom> defined? Sure looks like it (beats me why, though). You can work around that problem by initializing SPAM_CUTOFF like so: from spambayes.Options import options SPAM_CUTOFF = options["Categorization", "spam_cutoff"] Note that the above import technique is common usage, though sb_mailsort.py does its imports differently. Again, I'm not sure why. You'll have to investigate to make sure you're not violating some assumptions the program author made by importing SpamBayes modules that early in the program. atom> 3) how well is this (and similar) scripts protected against atom> attacks where a malicious email can cause commands to be atom> executed with the permissions of the recipient? I think it should be okay in that regard. The only actions it takes based upon a message's content is train on it or score it (by moving it from one directory to another which you specify on the command line). I don't think a black hat could convince Python to twiddle the values of sys.argv, but it might be a microscopically small amount safer if the spamdir and hamdir were picked up from your options file. That would require changes to spambayes/Options.py as well as scripts/sb_mailsort.py thought. It's probably best left for the author. Skip From skip at pobox.com Sun Nov 30 18:15:21 2003 From: skip at pobox.com (Skip Montanaro) Date: Sun Nov 30 18:16:27 2003 Subject: [Spambayes] n-way in outlook? In-Reply-To: <000101c3b51b$9bfe8900$0860b7c8@virtua.com.br> References: <000101c3b51b$9bfe8900$0860b7c8@virtua.com.br> Message-ID: <16330.31241.322910.881219@montanaro.dyndns.org> Tiago> have some1 made a n-way classification code for using with the Tiago> outlook plugin? Not that I'm aware of. Tiago> or does the nway code on the contrib folder works on outlook? Nope. nway.py is just a simple demo I wrote. Since I'm not an Outlook user, I made no attempt to make it work with Outlook. After playing with it a bit, I'm not convinced it's good enough for anything but experimental use. Skip From tameyer at ihug.co.nz Sun Nov 30 18:19:27 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Nov 30 18:19:36 2003 Subject: [Spambayes] sb_imapfilter.py AssertionError: hamcount <= nham In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13044774CF@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B1AC@its-xchg4.massey.ac.nz> > When I classify through sb_imapfilter.py, I am getting an > AssertionError. Any ideas? I am using spambayes from CVS; courier > IMAP; python 2.2.2; and a fresh database. See below for commands. [...] > assert hamcount <= nham > AssertionError How fresh? This error says that you have a token in your database that has appeared in more ham than you have trained it on - which isn't possible. IOW, the database is corrupt. You can try to manually fix the db via db_expimp.py, but it's easier (especially if you call it fresh already) to just retrain from scratch. If this happens regularly, it would be great to know the sequence of events that can reproduce it (in a sf bug tracker ), as we still don't really know what causes this error. =Tony Meyer From tameyer at ihug.co.nz Sun Nov 30 18:21:21 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Nov 30 18:21:26 2003 Subject: [Spambayes] Question In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304477987@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130212B1AD@its-xchg4.massey.ac.nz> > However, frequently Outlook shows that it is downloading 3 > (as an example) email messages but only one shows up in my > inbox and NONE in the spam folders. In other words the sum > of the messages in the inbox plus those in the spam folders > is often less than the number actually downloaded. > > Where might these other emails go? Have you looked in your 'unsure' folder? Have you tried searching for the messages via Outlook's 'Advanced Find'? =Tony Meyer From tameyer at ihug.co.nz Sun Nov 30 18:32:59 2003 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sun Nov 30 18:33:08 2003 Subject: [Spambayes] Clasification difficulties using pop3proxy / sb_server In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1304477A3B@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13026F29F2@its-xchg4.massey.ac.nz> > However, > I have been getting a lot of 'unsures' (30% or so). When I > use the web interface to review the messages and look at the > clues for each unsure, I see that the spam rating is often > 0.5 or even higher - sometimes well within the Spam zone (I > currently use 20% and 80% as my thresholds). So , why is the > classifier rating them as unsure? To paraphrase Tim, because they don't resemble the messages you have trained as ham more strongly than the messages you have trained as spam (or vice versa). Clues! Clues! Give us the clues! (SpamBayes has no knowledge of 'ham' or 'spam' until you, personally, give it some. We can't answer this question unless you give us the information about how each of the tokens in the message scored for you personally. In sb_server you can get these from the review page (far right)). > Maybe related to the above, I have noticed that the 'unsure' > subject line is marked as such in the cache. This means that > when I train the classifier that the unsure is really spam, > the unsure token is included in the subject line. Yes, that's correct. [Note that I incorrectly guessed otherwise when someone else asked pretty much this exact question last week. This time I looked at the code, and, you're right.] Note that, with the exception of very short messages, a single token shouldn't really tip the scale either way anyway (unless the message was borderline). > Surely the cache shouldn't store the updated subject line? Well, information from the cached messages that is added by SpamBayes is used later on via the web interface. It shouldn't make any difference except for the subject line (or if some fool has setup their spambayes to tokenize spambayes headers). I asked the person that asked this question last week to open a bug report about this and said I would get to it at some point (or someone else will), but I don't think that has happened yet. You're welcome to do that. (Otherwise I may remember to this some day, but there's a good chance that nothing will ever happen about it). Note that I think that the correct behaviour would be to strip the [Ham|Unsure|Spam] from the start of subject lines, iff that option is set to True. This may lose information if it was added by something else, but that's probably better than tokenizing invalid information. =Tony Meyer From mark at howellsfamily.org Sun Nov 30 18:33:54 2003 From: mark at howellsfamily.org (Mark Howells) Date: Sun Nov 30 18:34:37 2003 Subject: [Spambayes] Question References: <1ED4ECF91CDED24C8D012BCF2B034F130212B1AD@its-xchg4.massey.ac.nz> Message-ID: <005101c3b79a$6b0e6d20$0200a8c0@athlon> > > However, frequently Outlook shows that it is downloading 3 > > (as an example) email messages but only one shows up in my > > inbox and NONE in the spam folders. In other words the sum > > of the messages in the inbox plus those in the spam folders > > is often less than the number actually downloaded. > > > > Where might these other emails go? I'm assuming you mean Outlook Express, rather than Outlook. It's nothing to do with SpamBayes but OE has a known problem misreporting the number of downloaded messages. In all likelyhood you're not losing any emails. Mark --- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.544 / Virus Database: 338 - Release Date: 25/11/2003 From skip at pobox.com Sun Nov 30 18:34:25 2003 From: skip at pobox.com (Skip Montanaro) Date: Sun Nov 30 18:34:46 2003 Subject: [Spambayes] Autostart on Mac OS X In-Reply-To: <20031128220937377074.GyazMail.takeyasu@avis.ne.jp> References: <20031128220937377074.GyazMail.takeyasu@avis.ne.jp> Message-ID: <16330.32385.85976.655184@montanaro.dyndns.org> Takeyasu> I am using Mac OS X 10.3. How is what is necessary just to Takeyasu> carry out spambayes for making it start automatically? I use the sb_filter.py app on Mac OS X, however it's run from procmail, so I never investigated how to autostart something like sb_server.py. I sort of suspect this is a fairly generic MacPython question, so I've cc'd my reply there. Skip From skip at pobox.com Sun Nov 30 18:39:29 2003 From: skip at pobox.com (Skip Montanaro) Date: Sun Nov 30 18:39:34 2003 Subject: [Spambayes] sb_imapfilter.py AssertionError: hamcount <= nham In-Reply-To: References: Message-ID: <16330.32689.397613.63493@montanaro.dyndns.org> Tony> When I classify through sb_imapfilter.py, I am getting an Tony> AssertionError. Any ideas? That indicates a corrupt database. Do you ever kill sb_imapfilter.py without giving it a chance to clean up properly or run two SpamBayes applications which might both want to write to the same database (say, sb_server.py or sb_filter.py as well as sb_imapfilter.py)? If so, you'll need to figure out a way to avoid that. To work around the problem, I suggest you zap your training database and start from scratch. If you don't solve the underlying problem you'll probably encounter it again. Skip