From mhammond at skippinet.com.au Thu May 1 08:53:17 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed Apr 30 17:54:23 2003 Subject: [Spambayes] Spambayes disappeared! In-Reply-To: <290D043E5BE9D311B3EA009027B69DC301C26FB0@PHSNTEXC06> Message-ID: <017601c30f62$e90f46d0$530f8490@eden> This generally means that the plugin failed to initialize, in which case Outlook disables the addin. You should be able to go to Tools->Options->Other->Advanced->Plugins (or something like that ), locate the addin and re-enable it. Please check the log files (as described in the readme). Hopefully there will be an error listed we can fix! Mark. > We are testing Spambayes on several machines in our IS > department. We have > it loaded on W2K Prof. / Outlook 2000 which is running very > well, whacking > spam left and right (Love it!!!). But on the WXP Prof. / > Outlook XP machine > Spambayes ran fine for two weeks then disappeared from > Outlook this morning. > It still shows up in Add/Remove programs, the directory is > still there but > won't load with Outlook XP. We have deinstalled then > reinstalled to no > avail. We are using version 002 of the plugin from Mark > Hammond. Any ideas > on where to look for the problem? From T.A.Meyer at massey.ac.nz Thu May 1 18:37:50 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 1 01:38:29 2003 Subject: [Spambayes] Error on proxy connection & smtp proxy doesn't seespam Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABAEC@its-xchg4.massey.ac.nz> > > I tried to "train" sb by sending spam to spam@spam. This > > seemed to go OK, > > but when I go to the web page the "Review Messages" shows no > > untrained messages. > > How do these training messages show up? > > They simply are used to train your database, then discarded. > The Review Messages page is used for messages that the pop3proxy filters. The more I think about it, the more I think that smtpproxy should have two modes. 
What it does now is look for a spambayes id in message. It uses that id to extract the correct message from the pop3proxy cache, and trains on it. The idea behind this is that this avoids training on any mangling that the mailer might have done to the message to train. (So in this case, the message should show up in "Review Messages", but if it wasn't a message that pop3proxy had filtered, then it won't show up, or, in fact, be trained). I think smtpproxy should also be able to simply train on whatever it is sent. Those that don't use pop3proxy, or those that are confident that their mailer will send an exact copy onwards, could use it in this fashion. (It could possibly add the message to the pop3proxy cache for web ui review if it wasn't there). Thoughts? =Tony Meyer From T.A.Meyer at massey.ac.nz Thu May 1 20:18:59 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 1 03:19:35 2003 Subject: [Spambayes] URLSlurper (Was: Latest spammer trick stymied - QUESTION) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABB00@its-xchg4.massey.ac.nz> Time to pull out the testing data again... [From a post quite some time ago...] > Perhaps, as a test, someone could convert the code to Python and it > could be committed (as an option that defaults to False). If test > results really support it (and no-one can come up with a pure > tokenisation alternative), it could be left there (still defaulting to > False, unless all the concerns are addressed). In the interests of testing, I've done this. The code isn't based at all on the stuff that Richard sent (it was easier to just do it from scratch), but does follow the same rules (more-or-less). I've checked in the code in testtools/urlslurper.py. If you run this you'll get a timtest output, but using the slurping classifier. (This way classifier, tokenizer, options et. al. don't get altered). 
Rather than do a really big post, if you're interested in testing this, you can read the docstring on the module :) Results for the two main tests that I've done are below. I exported whatever I had sitting in my Outlook db, and it did considerably better on that, but I don't really trust that as an evaluation - for one, it's a very unbalanced corpus. I also tested against the SpamAssassin corpus, and it tied (mean ham prob went down, but mean spam prob also went down). If others are willing to a bit of testing, then at least the effectiveness question can be answered. If it does poorly for everyone, then I'll remove it. If it does well, we can squabble about whether it should be integrated as a default-to-off option or ignored. (Note that it can take a while to run, especially if there are lots of urls that will timeout (60s a pop!), and not a good idea to run if you pay per kb downloaded, or something like that). There is a little voice in my head that's telling me that it doesn't do as well as it should because the implementation is flawed...so if anyone is interested, they can go over the code and point out the bad bits... =Tony Meyer Results - these are with the default options; SpamAssassin corpus first, and then the messages that I've got stored in Outlook at the moment. Basically timtest, rates, then cmp. I've edited out a few lines for clarity, but the important numbers are still there. sas.txt -> slurpss.txt -> tested 400 hams & 379 spams against 400 hams & 379 spams [that line 39 more times...] 
false positive percentages
    0.750  0.750  tied
    [19 other lines cut, all of them tied]

won   0 times
tied 20 times
lost  0 times

total unique fp went from 29 to 29 tied
mean fp % went from 0.8375 to 0.8375 tied

false negative percentages
    0.792  0.792  tied
    [19 other lines cut, all tied]

won   0 times
tied 20 times
lost  0 times

total unique fn went from 16 to 16 tied
mean fn % went from 0.382585751979 to 0.382585751979 tied

ham mean                     ham sdev
   4.07    4.06   -0.25%       15.56   15.37   -1.22%
   5.00    5.02   +0.40%       16.35   16.21   -0.86%
   2.64    2.65   +0.38%       11.74   11.67   -0.60%
   1.64    1.60   -2.44%        9.52    9.20   -3.36%
   1.43    1.42   -0.70%        9.27    9.16   -1.19%

ham mean and sdev for all runs
   2.96    2.95   -0.34%       12.91   12.75   -1.24%

spam mean                    spam sdev
  98.51   98.53   +0.02%        8.04    7.90   -1.74%
  99.18   99.09   -0.09%        5.46    5.87   +7.51%
  98.16   98.04   -0.12%        8.26    8.47   +2.54%
  97.05   96.82   -0.24%       11.14   11.57   +3.86%
  96.40   96.25   -0.16%       12.88   12.99   +0.85%

spam mean and sdev for all runs
  97.86   97.74   -0.12%        9.57    9.76   +1.99%

ham/spam mean difference: 94.90 94.79 -0.11

[total unsure went from 340 to 338]

mines.txt -> mine_with_slurps.txt
-> tested 235 hams & 95 spams against 235 hams & 95 spams
[that line 39 more times...]
false positive percentages
    0.000  0.000  tied
    [and again, 19 times, all 0.0 vs 0.0, tied]

won   0 times
tied 20 times
lost  0 times

total unique fp went from 0 to 0 tied
mean fp % went from 0.0 to 0.0 tied

false negative percentages
    1.053   0.000  won   -100.00%
    7.368   5.263  won    -28.57%
    8.421   6.316  won    -25.00%
    8.421   5.263  won    -37.50%
   12.632   5.263  won    -58.34%
   10.526   9.474  won     -9.99%
   17.895  12.632  won    -29.41%
   21.053  15.789  won    -25.00%
    7.368   5.263  won    -28.57%
    3.158   1.053  won    -66.66%
   14.737  11.579  won    -21.43%
   21.053  13.684  won    -35.00%
    4.211   2.105  won    -50.01%
    3.158   0.000  won   -100.00%
    8.421   4.211  won    -49.99%
    9.474   4.211  won    -55.55%
    6.316   1.053  won    -83.33%
    3.158   2.105  won    -33.34%
    8.421   3.158  won    -62.50%
    5.263   3.158  won    -40.00%

won  20 times
tied  0 times
lost  0 times

total unique fn went from 77 to 46 won -40.26%
mean fn % went from 9.10526315789 to 5.57894736842 won -38.73%

ham mean                     ham sdev
   0.83    1.53  +84.34%        5.79    7.67  +32.47%
   0.32    0.70 +118.75%        3.09    4.79  +55.02%
   0.43    0.90 +109.30%        3.35    5.73  +71.04%
   0.79    1.54  +94.94%        5.32    7.97  +49.81%
   0.82    1.92 +134.15%        5.01    8.76  +74.85%

ham mean and sdev for all runs
   0.64    1.32 +106.25%        4.65    7.15  +53.76%

spam mean                    spam sdev
  87.39   91.56   +4.77%       26.40   21.79  -17.46%
  74.82   82.41  +10.14%       36.76   31.93  -13.14%
  82.54   87.77   +6.34%       32.64   27.76  -14.95%
  86.48   91.83   +6.19%       27.26   20.69  -24.10%
  86.52   91.98   +6.31%       26.63   20.56  -22.79%

spam mean and sdev for all runs
  83.55   89.11   +6.65%       30.58   25.23  -17.50%

ham/spam mean difference: 82.91 87.79 +4.88

From richard at jowsey.com Thu May 1 22:56:53 2003
From: richard at jowsey.com (Richard Jowsey)
Date: Thu May 1 07:57:01 2003
Subject: [Spambayes] URLSlurper
In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13016ABB00@its-xchg4.massey.ac.nz>
Message-ID: <3EB19825.12914.15CAAADA@localhost>

> Perhaps, as a test, someone could convert the code to Python

Good on ya mate! Well done.

> Time to pull out the testing data again...

You should be testing on a bunch of microspams. I'll send you some if
you want!

Death2Spam.net went GA today.
MayDay for spammers! We've recorded ~99.5% classification accuracy on the test sample (46k emails) from our US, Oz and NZ beta testers. Very happy! I'll probably be in AK next week for talks with Xtra, Clear, etc. Catch up for a beer if you've time. Cheers, Richard From noreply at sourceforge.net Thu May 1 10:19:13 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu May 1 12:19:20 2003 Subject: [Spambayes] [ spambayes-Bugs-717998 ] Can't reset Spam folder if folder is lost Message-ID: Bugs item #717998, was opened at 2003-04-09 00:37 Message generated for change (Comment added) made by astrogen You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Benjamin J. Judson (astrogen) Assigned to: Mark Hammond (mhammond) Summary: Can't reset Spam folder if folder is lost Initial Comment: If the Spam Manager is set up to move spam to a folder and that folder disappears, the Spam Manager may show that spam is to be delivered to . In this event trying to browse the folder list will not list any folders, and you will be unable to set the Spam folder to anything else. ---------------------------------------------------------------------- >Comment By: Benjamin J. Judson (astrogen) Date: 2003-05-01 11:19 Message: Logged In: YES user_id=752965 Is it possible to add a button that puts you back to the root folder if the number of folders available is less than 2 or something? Currently (and this has happened a couple times since my original posting) my only fix is to delete the configuration file (default_configuration.pck). -Astrogen ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-04-14 16:32 Message: Logged In: YES user_id=752965 Just figured there would be a log file somewhere.. but anywho... 
I manually ran manager.py

Here's the traceback (copied and pasted from a DOS prompt window):

Traceback (most recent call last):
  File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 309, in OnInitDialog
    self.expand_ids = self._DetermineFoldersToExpand()
  File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 226, in _DetermineFoldersToExpand
    folder = self.manager.message_store.GetFolder(folder_id)
  File "C:\spambayes-1.0a2\Outlook2000\msgstore.py", line 225, in GetFolder
    table = folder.GetContentsTable(0)
pywintypes.com_error: (-2147467259, 'Unspecified error', None, None)
win32ui: OnInitDialog() virtual handler (>) raised an exception
Bayes database is not dirty - not writing

----------------------------------------------------------------------

Comment By: Benjamin J. Judson (astrogen)
Date: 2003-04-14 16:26

Message:
Logged In: YES
user_id=752965

Where is the log file kept? I looked for it before submitting, and
again since you posted. I don't have the name or location for where
I could find it.

----------------------------------------------------------------------

Comment By: Mark Hammond (mhammond)
Date: 2003-04-10 01:06

Message:
Logged In: YES
user_id=14198

Is there a traceback associated with this? I regularly "test" this,
thanks to Outlook screwing all my folder IDs as I reconfigure
Outlook, and I don't have the problem.
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702 From neale at woozle.org Thu May 1 10:22:24 2003 From: neale at woozle.org (Neale Pickett) Date: Thu May 1 12:22:31 2003 Subject: [Spambayes] Training corrupts mbox files In-Reply-To: (Tim Stone - Four Stones Expressions's message of "Wed, 30 Apr 2003 22:21:30 -0500") References: Message-ID: Tim Stone - Four Stones Expressions writes: > 4/30/2003 5:29:22 PM, Skip Montanaro wrote: > >>That's all well and good, but I was thinking perhaps mboxtrain should >>maintain a little database parallel to its mbox file whose entries are keyed >>by message-id. It could store its results there and never have to monkey >>with the mbox file. "-f"orce training would simply be a matter of deleting >>all keys in that database at the start of the run. > > Yeah, we're already walking down that path with the messageinfodb that's > maintained in message.py. This will certainly need some more work for mbox > purposes, but it would be perfect if mboxes never needed to be rewritten. > That's the goal, afaic. Ditto to what Tim wrote. If mboxtrain keeps somewhere a list of messages it's seen, theres no longer any need to modify the mbox. Neale From tdickenson at devmail.geminidataloggers.co.uk Thu May 1 18:39:50 2003 From: tdickenson at devmail.geminidataloggers.co.uk (Toby Dickenson) Date: Thu May 1 12:39:57 2003 Subject: [Spambayes] Training corrupts mbox files In-Reply-To: References: Message-ID: <200305011739.50200.tdickenson@devmail.geminidataloggers.co.uk> On Thursday 01 May 2003 5:22 pm, Neale Pickett wrote: > Tim Stone - Four Stones Expressions writes: > > 4/30/2003 5:29:22 PM, Skip Montanaro wrote: > >>That's all well and good, but I was thinking perhaps mboxtrain should > >>maintain a little database parallel to its mbox file whose entries are > >> keyed by message-id. 
It could store its results there and never have to > >> monkey with the mbox file. "-f"orce training would simply be a matter > >> of deleting all keys in that database at the start of the run. > > > > Yeah, we're already walking down that path with the messageinfodb that's > > maintained in message.py. This will certainly need some more work for > > mbox purposes, but it would be perfect if mboxes never needed to be > > rewritten. That's the goal, afaic. > > Ditto to what Tim wrote. If mboxtrain keeps somewhere a list of > messages it's seen, theres no longer any need to modify the mbox. fwiw, I stopped using mboxtrain and its incremental mode in favor of hammie, and always doing a full train on whole mailboxes. Its not significantly slower. From bill at parducci.net Thu May 1 10:43:34 2003 From: bill at parducci.net (bill parducci) Date: Thu May 1 12:44:27 2003 Subject: [Spambayes] Training corrupts mbox files References: Message-ID: <3EB14EB6.5070707@parducci.net> > If mboxtrain keeps somewhere a list of > messages it's seen, theres no longer any need to modify the mbox. by freeing the training materials from the source documents, wouldn't this also create the opportunity for implementing an automated 'freshness' mechanism whereby 'pruning' of said materials (based on age, etc.) would be possible? b From skip at pobox.com Thu May 1 13:21:05 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 1 13:21:18 2003 Subject: [Spambayes] Training corrupts mbox files In-Reply-To: <200305011739.50200.tdickenson@devmail.geminidataloggers.co.uk> References: <200305011739.50200.tdickenson@devmail.geminidataloggers.co.uk> Message-ID: <16049.22401.738502.780867@montanaro.dyndns.org> Toby> fwiw, I stopped using mboxtrain and its incremental mode in favor Toby> of hammie, and always doing a full train on whole mailboxes. Its Toby> not significantly slower. How big are your mailboxes? 
I have about 12,000 hams and 7,000 spams in my training sets, so I generally avoid full retrains. I'm considering a somewhat different procmail-based setup for some other people, however, in which they would have three email addresses, foo@somewhere, foo+spam@somewhere and foo+ham@somewhere. The last two would (obviously) be for training. My thought was to simply have the training aliases append to mbox files and run mboxtrain from cron periodically. I'd logrotate the training files to keep the number of files and their sizes to a minimum. Someone else must already be doing something like this. Care to share? Skip From neale at woozle.org Thu May 1 12:09:26 2003 From: neale at woozle.org (Neale Pickett) Date: Thu May 1 14:09:35 2003 Subject: [Spambayes] Training corrupts mbox files In-Reply-To: <16049.22401.738502.780867@montanaro.dyndns.org> (Skip Montanaro's message of "Thu, 1 May 2003 12:21:05 -0500") References: <200305011739.50200.tdickenson@devmail.geminidataloggers.co.uk> <16049.22401.738502.780867@montanaro.dyndns.org> Message-ID: Skip Montanaro writes: > Someone else must already be doing something like this. Care to share? I'm doing something similar to Toby. I have a few guinea pig users doing this currently, and eventually everyone will be doing this. Each IMAP user gets three new IMAP folders: spam, spamfairy.spam, and spamfairy.ham. All incoming mail gets procmail filtered through hammiefilter and filed into inbox or spam. When the user sees something that's misfiled, they have to move it to the appropriate spamfairy folder. Every night the Spam Fairy checks under their pillow for new email and trains on it (this is done with hammiefilter). She then moves these messages into either the inbox or the spam folder, depending on which spamfairy folder she's currently visiting. This seems to work pretty well for my two trial users. If it turns out that it's viable, I'd be happy to provide the spamfairy script. (It doesn't run system-wide yet.) 
Neale From tdickenson at geminidataloggers.com Thu May 1 19:58:48 2003 From: tdickenson at geminidataloggers.com (Toby Dickenson) Date: Thu May 1 15:36:11 2003 Subject: [Spambayes] Training corrupts mbox files In-Reply-To: <16049.22401.738502.780867@montanaro.dyndns.org> References: <200305011739.50200.tdickenson@devmail.geminidataloggers.co.uk> <16049.22401.738502.780867@montanaro.dyndns.org> Message-ID: <200305011858.48408.tdickenson@geminidataloggers.com> On Thursday 01 May 2003 6:21 pm, Skip Montanaro wrote: > Toby> fwiw, I stopped using mboxtrain and its incremental mode in favor > Toby> of hammie, and always doing a full train on whole mailboxes. Its > Toby> not significantly slower. > > How big are your mailboxes? I have about 12,000 hams and 7,000 spams in my > training sets, so I generally avoid full retrains. > > I'm considering a somewhat different procmail-based setup for some other > people, however, in which they would have three email addresses, > foo@somewhere, foo+spam@somewhere and foo+ham@somewhere. The last two > would (obviously) be for training. My thought was to simply have the > training aliases append to mbox files and run mboxtrain from cron > periodically. I'd logrotate the training files to keep the number of files > and their sizes to a minimum. > > Someone else must already be doing something like this. Care to share? I am using kmail with approximately 40 folders (mailboxes). I am training directy from the kmail folders. That means I dont need duplicate copies of emails in a seperate training database, I can use the normal kmail gui for adjusting the training sets, and ensures that training doesnt use ancient emails. I use kmail to delete personal emails after 6 months, mailing lists after a few weeks, and spams after a year. That keeps the total content stable at about 6000 hams and 800 spams. I train overnight from cron, and it takes about 5 minutes. 
From memory, incremental mboxtrain was taking about 4 minutes with a
lower cpu usage.

I have a script that generates a long hammie.py command line by parsing
the kmail configuration file. It assumes that:

- the folder called "spam" and all its subfolders are spam training
  material
- "trash" and "drafts" should be ignored
- every other folder contains ham training material.

I use procmail to run the hammie filter to add the headers during mail
delivery. kmail filters are used to sort incoming mail: spam into a
separate folder. (For a while my wife was using the same setup, but
running the hammie filter from kmail. No procmail needed.)

I use two folders for spam: spam/archive and spam/new. kmail filters the
spam into spam/new and marks it read. Every week I review spam/new for
false positives (I'm still waiting for my first!), then empty it into
spam/archive.

Any interest in better documentation of this setup?

--
Toby Dickenson
http://www.geminidataloggers.com/people/tdickenson

From neale at woozle.org Thu May 1 14:40:21 2003
From: neale at woozle.org (Neale Pickett)
Date: Thu May 1 16:40:27 2003
Subject: [Spambayes] Training corrupts mbox files
In-Reply-To: <3EB14EB6.5070707@parducci.net> (bill parducci's message of "Thu, 01 May 2003 09:43:34 -0700")
References: <3EB14EB6.5070707@parducci.net>
Message-ID:

bill parducci writes:

> by freeing the training materials from the source documents, wouldn't
> this also create the opportunity for implementing an automated
> 'freshness' mechanism whereby 'pruning' of said materials (based on
> age, etc.) would be possible?

Quite likely, yes, that would be possible.
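
[Editorial sketch] The idea discussed in this thread (a small database kept alongside the mbox, keyed by message-id, so that the mbox itself never has to be rewritten, plus age-based pruning) might look roughly like the following. This is not the mboxtrain or messageinfodb code; it is a minimal illustration in modern Python 3 using only the standard library. The names train_new_messages and prune_seen, the db_path argument, and the train callback are all hypothetical.

```python
import mailbox
import shelve
import time


def train_new_messages(mbox_path, db_path, train, now=None, force=False):
    """Call train(msg) for each message whose Message-ID is not yet recorded.

    The mbox is only read. Training state lives in a separate shelve
    database (hypothetical layout: Message-ID -> time of training).
    Messages without a Message-ID header are skipped, since they cannot
    be tracked. Returns the number of messages trained on.
    """
    now = time.time() if now is None else now
    trained = 0
    with shelve.open(db_path) as seen:
        if force:
            # "-f"orce retraining: forget everything recorded so far.
            for key in list(seen.keys()):
                del seen[key]
        box = mailbox.mbox(mbox_path)
        try:
            for msg in box:
                msg_id = msg.get("Message-ID")
                if msg_id is None or str(msg_id) in seen:
                    continue
                train(msg)
                seen[str(msg_id)] = now
                trained += 1
        finally:
            box.close()  # nothing was modified, so nothing is written back
    return trained


def prune_seen(db_path, max_age_seconds, now=None):
    """Drop records older than max_age_seconds; return how many were dropped."""
    now = time.time() if now is None else now
    with shelve.open(db_path) as seen:
        stale = [key for key, when in seen.items() if now - when > max_age_seconds]
        for key in stale:
            del seen[key]
    return len(stale)
```

Note that pruning here only forgets that a message was seen. Actually untraining old messages from the classifier would need the message content (or its tokens) to still be available, which this sketch does not attempt.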
From dave at boost-consulting.com Thu May 1 20:50:02 2003
From: dave at boost-consulting.com (David Abrahams)
Date: Thu May 1 19:50:42 2003
Subject: [Spambayes] Re: big imapfilter.py problem
In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13016AB805@its-xchg4.massey.ac.nz> (Tony Meyer's message of "Wed, 30 Apr 2003 17:44:22 +1200")
References: <1ED4ECF91CDED24C8D012BCF2B034F13016AB805@its-xchg4.massey.ac.nz>
Message-ID:

"Meyer, Tony" writes:

>> The problem appears to be that imapfilter.py added an
>> X-Spambayes-Classification: header to the message, but failed
>> to add a newline afterwards, which is required to separate it
>> from the message body.
>
> That's definitely what it was - except that message.py (which does the
> adding for imapfilter.py) doesn't add it as a string, it adds an entry
> to the headers dict in the email.Message.Message object. It looks like
> the email package didn't add the separating newline when it flattened
> the message.
>
>> ...err, but I forgot to set PYTHONPATH to use email-2.5.
>> Training works when I do that.
>
> What version of the email package does it use if you don't set the
> PYTHONPATH? If this sort of thing is going to happen, it might be worth
> noting somewhere. (Currently, the docs say that you need "the latest"
> version of the email package, or whatever comes with Python 2.2.2
> (email.__version__ == '2.4.3') or later.)

Well, I don't think the latter is valid, since I have Python 2.2.2
installed.

%python
Python 2.2.2 (#1, Feb 4 2003, 18:45:04)
[GCC 2.95.2 19991024 (release)] on freebsd4
Type "help", "copyright", "credits" or "license" for more information.
>>> import email
>>> email.__version__
'2.4.3'
>>>

--
Dave Abrahams
Boost Consulting
www.boost-consulting.com

From dave at boost-consulting.com Thu May 1 20:57:48 2003
From: dave at boost-consulting.com (David Abrahams)
Date: Thu May 1 19:58:30 2003
Subject: [Spambayes] Success!
: Spambayes + IMAP + crontab + CommunigatePro + procmail
Message-ID:

After much thrashing, I have successfully set up a server-side
filtering system with spambayes.

- Each hour, spambayes is retrained using any mail I've explicitly
  filed as Ham or Spam using imapfilter.py.
- Communigate Pro sends all messages without an
  X-Spambayes-Classification through procmail and then discards them.
- procmail invokes hammiefilter.py on incoming messages to classify
  them, and then forwards them back to my account.
- Communigate Pro files already-classified messages according to their
  classification.

Works beautifully, but took days to figure out.

If there is any interest in a RestructuredText (or HTML) document
describing the whole deal from the POV of a neophyte in
email/IMAP/procmail/CGPro/Spambayes, and it would get displayed
somewhere useful to others, I'd be glad to contribute one to the
Spambayes project.

--
Dave Abrahams
Boost Consulting
www.boost-consulting.com

From tim at fourstonesExpressions.com Thu May 1 20:05:15 2003
From: tim at fourstonesExpressions.com (Tim Stone - Four Stones Expressions)
Date: Thu May 1 20:05:21 2003
Subject: [Spambayes] Success! : Spambayes + IMAP + crontab + CommunigatePro + procmail
In-Reply-To:
Message-ID:

5/1/2003 6:57:48 PM, David Abrahams wrote:

>
>After much thrashing, I have successfully set up a server-side
>filtering system with spambayes.

Congratulations! You're one persistent dude.

>If there is any interest in a RestructuredText (or HTML) document
>describing the whole deal from the POV of a neophyte in
>email/IMAP/procmail/CGPro/Spambayes, and it would get displayed
>somewhere useful to others, I'd be glad to contribute one to the
>Spambayes project.

If you create a document in html, I'll be sure it shows up (credited) on the
website...

c'est moi - TimS
http://www.fourstonesExpressions.com
http://wecanstopspam.org

There are 10 kinds of people in the world:
those who understand binary,
and those who don't.
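
[Editorial sketch] The setup above depends on each delivered message carrying an X-Spambayes-Classification header, with the blank line that separates headers from body still intact after the header is added. A rough illustration of that one step with the standard email package follows. It is written for modern Python 3 and is not the hammiefilter.py or message.py code; add_classification and the sample message are made up for the example.

```python
from email import message_from_string

# A made-up message used only for illustration.
RAW = "From: a@example.com\nSubject: hi\n\nbody text\n"


def add_classification(raw_text, classification):
    """Return raw_text with a single X-Spambayes-Classification header set."""
    msg = message_from_string(raw_text)
    # Remove any existing copy so the header is never duplicated;
    # deleting a header that is absent is a harmless no-op.
    del msg["X-Spambayes-Classification"]
    msg["X-Spambayes-Classification"] = classification
    return msg.as_string()


tagged = add_classification(RAW, "spam")

# The first blank line marks the end of the headers; everything after
# it is the body and should be untouched.
headers, separator, body = tagged.partition("\n\n")
print(headers)
print(repr(body))
```

A server-side rule can then file on the header value alone, which is why a message that already has the header does not need to go back through procmail.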
From dave at boost-consulting.com Thu May 1 20:59:05 2003 From: dave at boost-consulting.com (David Abrahams) Date: Thu May 1 20:09:22 2003 Subject: [Spambayes] Re: Success! : Spambayes + IMAP + crontab + CommunigatePro + procmail In-Reply-To: (David Abrahams's message of "Thu, 01 May 2003 19:57:48 -0400") References: Message-ID: David Abrahams writes: > Works beautifully, but took days to figure out. > > If there is any interest in a RestructuredText (or HTML) document > describing the whole deal from the POV of a neophyte in > email/IMAP/procmail/CGPro/Spambayes, and it would get displayed > somewhere useful to others, I'd be glad to contribute one to the > Spambayes project. By the way, thanks to all on this newsgroup who helped me out, especially Tony Meyer. -- Dave Abrahams Boost Consulting www.boost-consulting.com From T.A.Meyer at massey.ac.nz Fri May 2 13:13:01 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 1 20:14:37 2003 Subject: [Spambayes] Re: Success! : Spambayes + IMAP + crontab + CommunigatePro + procmail Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABCAD@its-xchg4.massey.ac.nz> > > If there is any interest in a RestructuredText (or HTML) document > > describing the whole deal from the POV of a neophyte in > > email/IMAP/procmail/CGPro/Spambayes, and it would get displayed > > somewhere useful to others, I'd be glad to contribute one to the > > Spambayes project. BTW, I'd definitely echo Tim's comment that this would be a good thing. > By the way, thanks to all on this newsgroup who helped me > out, especially Tony Meyer. Ah, that's what we're here for :) Glad that it's going. The testing that you & Olly (in particular) did really helped get imapfilter going. 
=Tony Meyer From noreply at sourceforge.net Fri May 2 03:27:38 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 2 05:27:48 2003 Subject: [Spambayes] [ spambayes-Bugs-725449 ] Binary plugin fails on Win9x Message-ID: Bugs item #725449, was opened at 2003-04-22 04:41 Message generated for change (Comment added) made by helge You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725449&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: dan maer (dmara) Assigned to: Mark Hammond (mhammond) Summary: Binary plugin fails on Win9x Initial Comment: Running pure Windows (98SE) and Outlook 2000. No Python installed. I can't get the plugin to initialize. I go to Tools/options/other/advanced options/Com Addins and check the SpamBayes plug-in box, but it won't stayed checked, and I've uninstalled/reinstalled Outlook and the plugin but no go. I've got this working great, really great, on Winnt 4.0 and Xp. Logfile being attached for upload... Dan ---------------------------------------------------------------------- Comment By: Helge Stenström (helge) Date: 2003-05-02 09:27 Message: Logged In: YES user_id=19115 Installation fails on Windows 2000. I have: * http://starship.python.net/crew/mhammond/downloads/SpamBayes-Outlook-Setup-002.exe (2968028 bytes) * ActivePython-2.2.2-224-win32-ix86.msi The log file talks about files on E:\, which isn't mounted. An earlier log file (which seems to be overwritten, I can't find it) claimed to have written stuff to c:/Documents and Settings/erahege/Application Data/SpamBayes/.pck, but that file is nonexistent. I have NOT installed http://starship.python.net/crew/mhammond/downloads/win32all-152.exe, beacuse the instructions on http://starship.python.net/crew/mhammond/spambayes/ doesn't say so. My log file is very similar to the one already uploaded. I will upload my log file if I find out how to do it. 
---------------------------------------------------------------------- Comment By: J (usertgo) Date: 2003-04-26 05:03 Message: Logged In: YES user_id=763953 ok, since i liked it so much on win2000 i did the python install & manual install of the spambayes outlook addin & its working good now, so i guess it was the installer. thanks ---------------------------------------------------------------------- Comment By: J (usertgo) Date: 2003-04-24 05:10 Message: Logged In: YES user_id=763953 I also get the same errors as in the logfile on a Win98SE installation w/Outlook 2000 SP1, but it works on Win2000 w/same Outlook. If you have any workarounds (w/regedit?) please let us know... ---------------------------------------------------------------------- Comment By: dan maer (dmara) Date: 2003-04-23 03:06 Message: Logged In: YES user_id=759684 Ok Mark... Anyway to bypass the installer issue and get it working by manual means? Dan ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-04-22 23:19 Message: Logged In: YES user_id=14198 This is an issue with the "Installer" tool I use. I will try and sus it out before the next binary release. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725449&group_id=61702 From patrick at keogh.net.au Fri May 2 23:57:30 2003 From: patrick at keogh.net.au (Patrick Keogh) Date: Fri May 2 08:57:39 2003 Subject: [Spambayes] Could the Outlook plugin mark messages as "Read" when they are moved to spam folder? In-Reply-To: Message-ID: <005301c310aa$64f6da90$0a7114cb@keogh.net.au> I've searched past messages and the doco, but can find no reference to the following issue, which I think a little strange, so maybe it has been discussed, if so my apologies ... 
I would like to have the option of setting it so that the Outlook plugin
changes the status of a message to "Read" when it is moved to the Spam
folder. At the moment, all the messages are created in Spam as unread. Even
if I check out a Probable (which marks it as read), and then click the
Delete as Spam button, it becomes unread again (in the Spam folder).

Now I appreciate that not everyone would be unhappy with this behavior, but
I am, so I'd really appreciate a way to set it so that when it is moved to
Spam, it is marked as Read.

From shaun at ekit-inc.com Fri May 2 17:53:21 2003
From: shaun at ekit-inc.com (Shaun Broadbent)
Date: Fri May 2 09:43:46 2003
Subject: [Spambayes] Outlook Plugin
Message-ID: <008901c31077$884239a0$3429a8c0@craps>

I am running XP Version 5.1.2600 Service Pack 1 Build 2600
Outlook 2002(10.4712.4219)SP-2
On a Dell Latitude C610 Laptop.

Every mail item seems to attract the following error and is subsequently
un-filtered.

Traceback (most recent call last):
  File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message
  File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save
com_error: (-2146644781, 'OLE error 0x800cccd3', None, None)
Message 'Here's the Deal.....What Are You Waiting For?' had a Spam classification of 'Failed'

The only other error I have received a couple of times is this.

E:\src\python-cvs\lib\fcntl.py:7: DeprecationWarning: the FCNTL module is deprecated; please use fcntl

Help?
Thanks - Shaun Logs -------- SpamAddin - Connecting to Outlook Loaded bayes database from 'C:\Documents and Settings\shaun\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\shaun\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 23 spam and 1045 good messages Loaded databases in 303.069ms AntiSpam: Watching for new messages in folder Inbox AntiSpam: Watching for new messages in folder Inbox AntiSpam: Watching for new messages in folder Deleted Items Processing 0 missed spam in folder 'Inbox' took 28.5324ms Processing 0 missed spam in folder 'Inbox' took 2077.83ms Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'Italian project is go' had a Spam classification of 'Failed' Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'Here's the Deal.....What Are You Waiting For?' had a Spam classification of 'Failed' Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message '04/30-04:23 NOCOL time drift CRITICAL' had a Spam classification of 'Failed' Failed filtering message! 
Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message '04/30-04:35 NOCOL services UNREACHABLE' had a Spam classification of 'Failed' Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'Re: Italian project is go ' had a Spam classification of 'Failed' Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'Event Alerts' had a Spam classification of 'Failed' Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message '04/30-05:15 NOCOL services reachable' had a Spam classification of 'Failed' Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'Your Daily Dilbert 04/30/2003' had a Spam classification of 'Failed' Failed filtering message! 
Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'Sites down for 2 - 3 hours' had a Spam classification of 'Failed' Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'RE: Italian project is go ' had a Spam classification of 'Failed' Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'RE: Italian project is go ' had a Spam classification of 'Failed' Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'Re: Italian project is go ' had a Spam classification of 'Failed' Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'Re: Italian project is go' had a Spam classification of 'Failed' Failed filtering message! 
Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'Sites down for 2 - 3 hours' had a Spam classification of 'Failed' Failed filtering message! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 36, in filter_message File "E:\src\spambayes\Outlook2000\msgstore.py", line 711, in Save com_error: (-2146644781, 'OLE error 0x800cccd3', None, None) Message 'RE: Italian project is go' had a Spam classification of 'Failed' From CMAWSON at woodsaitken.com Fri May 2 11:18:53 2003 From: CMAWSON at woodsaitken.com (Chad Mawson) Date: Fri May 2 11:18:59 2003 Subject: [Spambayes] Outlook 2000 and spambayes dying Message-ID: <2E2181EB1EC4FF4D870765E8C40C264202B016@wa3.woodsaitken.com> First off, Spambayes is an excellent piece of work, and has begun to show people here in the law firm I work for that open source = good, on top of the fact that they can get rid of spam. I hate for my first message to the list to be a problem... But... I'm using Outlook 2000 on Windows XP, and spambayes is a CVS copy I downloaded (approximately 2 weeks ago). I have installed Python 2.2. Spambayes has been working fine, but on two occasions appeared to just stop loading all together. The first time I hadn't done enough research to know where to look, but this time I have the trace. Both times this occurred was after rebooting my system from an unstable state, i.e. I basically killed my machine and cycled power. After this happened I basically just deleted the C:\Documents and Settings\cmawson\Application Data\SpamBayes\*.* reran the addin.py and manager.py retrained and things started working again. C:\Python22\Lib\site-packages\win32\lib>win32traceutil.py Collecting Python Trace Output... 
SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "U:\spambayes\Outlook2000\manager.py", line 310, in LoadBayes bayes = self.db_manager.open_bayes() File "U:\spambayes\Outlook2000\manager.py", line 103, in open_bayes return bayes_storage.PickledClassifier(self.bayes_filename) File "C:\Python22\Lib\site-packages\spambayes\storage.py", line 80, in __init_ _ self.load() File "C:\Python22\Lib\site-packages\spambayes\storage.py", line 98, in load tempbayes = pickle.load(fp) EOFError Loaded message database from 'C:\Documents and Settings\cmawson\Application Data \SpamBayes\default_message_database.pck' Either bayes database or message database is missing - creating new Traceback (most recent call last): File "C:\Python22\lib\site-packages\win32com\universal.py", line 170, in dispa tch retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None) File "C:\Python22\lib\site-packages\win32com\server\policy.py", line 322, in _ InvokeEx_ return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider) File "C:\Python22\lib\site-packages\win32com\server\policy.py", line 601, in _ invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kw Args, serviceProvider) File "C:\Python22\lib\site-packages\win32com\server\policy.py", line 541, in _ invokeex_ return apply(func, args) File "U:\spambayes\Outlook2000\addin.py", line 662, in OnConnection self.manager = manager.GetManager(application) File "U:\spambayes\Outlook2000\manager.py", line 475, in GetManager _mgr = BayesManager(outlook=outlook, verbose=verbose) File "U:\spambayes\Outlook2000\manager.py", line 165, in __init__ self.LoadBayes() File "U:\spambayes\Outlook2000\manager.py", line 329, in LoadBayes self.InitNewBayes() File "U:\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes self.bayes = self.db_manager.new_bayes() File "U:\spambayes\Outlook2000\manager.py", line 91, in new_bayes os.unlink(self.bayes_filename) 
exceptions.OSError: [Errno 13] Permission denied: 'C:\\Documents and Settings\\c mawson\\Application Data\\SpamBayes\\default_bayes_database.pck' SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "U:\spambayes\Outlook2000\manager.py", line 310, in LoadBayes bayes = self.db_manager.open_bayes() File "U:\spambayes\Outlook2000\manager.py", line 103, in open_bayes return bayes_storage.PickledClassifier(self.bayes_filename) File "C:\Python22\Lib\site-packages\spambayes\storage.py", line 80, in __init_ _ self.load() File "C:\Python22\Lib\site-packages\spambayes\storage.py", line 98, in load tempbayes = pickle.load(fp) EOFError Loaded message database from 'C:\Documents and Settings\cmawson\Application Data \SpamBayes\default_message_database.pck' Either bayes database or message database is missing - creating new Traceback (most recent call last): File "C:\Python22\lib\site-packages\win32com\universal.py", line 170, in dispa tch retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None) File "C:\Python22\lib\site-packages\win32com\server\policy.py", line 322, in _ InvokeEx_ return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider) File "C:\Python22\lib\site-packages\win32com\server\policy.py", line 601, in _ invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kw Args, serviceProvider) File "C:\Python22\lib\site-packages\win32com\server\policy.py", line 541, in _ invokeex_ return apply(func, args) File "U:\spambayes\Outlook2000\addin.py", line 662, in OnConnection self.manager = manager.GetManager(application) File "U:\spambayes\Outlook2000\manager.py", line 475, in GetManager _mgr = BayesManager(outlook=outlook, verbose=verbose) File "U:\spambayes\Outlook2000\manager.py", line 165, in __init__ self.LoadBayes() File "U:\spambayes\Outlook2000\manager.py", line 329, in LoadBayes self.InitNewBayes() File "U:\spambayes\Outlook2000\manager.py", line 378, in 
InitNewBayes self.bayes = self.db_manager.new_bayes() File "U:\spambayes\Outlook2000\manager.py", line 91, in new_bayes os.unlink(self.bayes_filename) exceptions.OSError: [Errno 13] Permission denied: 'C:\\Documents and Settings\\c mawson\\Application Data\\SpamBayes\\default_bayes_database.pck' From noreply at sourceforge.net Fri May 2 13:05:07 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 2 15:05:11 2003 Subject: [Spambayes] [ spambayes-Bugs-731538 ] SpamBayes not filtering messages. Message-ID: Bugs item #731538, was opened at 2003-05-02 12:05 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Brian Takita (btakita) Assigned to: Nobody/Anonymous (nobody) Summary: SpamBayes not filtering messages. Initial Comment: The Delete as Spam button does not work either. It gives email spam a score if I request it. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 From noreply at sourceforge.net Fri May 2 13:50:24 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 2 15:50:33 2003 Subject: [Spambayes] [ spambayes-Bugs-731538 ] SpamBayes not filtering messages. Message-ID: Bugs item #731538, was opened at 2003-05-02 14:05 Message generated for change (Comment added) made by timstone4 You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Brian Takita (btakita) Assigned to: Nobody/Anonymous (nobody) Summary: SpamBayes not filtering messages. Initial Comment: The Delete as Spam button does not work either. It gives email spam a score if I request it. 
---------------------------------------------------------------------- >Comment By: Tim Stone (timstone4) Date: 2003-05-02 14:50 Message: Logged In: YES user_id=645698 Can you give us a bit more info here? What platform are you running on, what version of python, what version of Outlook, what version of spambayes (alpha1, alpha2, current cvs)? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 From T.A.Meyer at massey.ac.nz Sat May 3 10:17:35 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri May 2 17:18:19 2003 Subject: [Spambayes] Could the Outlook plugin mark messages as "Read" whenthey are moved to spam folder? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABE53@its-xchg4.massey.ac.nz> > I've searched past messages and the doco, but can find no > reference to the following issue, which I think a little > strange, so maybe it has been discussed, if so my apologies ... It is a little strange, since it was discussed quite a bit, but that was a while back now. This is more-or-less the idea behind SF RFE [ 680629 ] "Outlook plugin: Delete as spam marks as read". Just to complicate things, there's another issue, this one a bug, where the plugin changes the read status of a message from read to unread when it processes it. (in SF bugs [ 716684 ] "Filtering marks message as unread"). The first one was submitted in February, so things aren't really moving all that fast on it :) I think this is mostly because it wasn't seen as all that important, and wasn't asked for by many people. I did start looking at how to do it, but got tangled up in the COM stuff and found something more interesting to work on ;) I might see if I can give it another go. Out of interest, which behaviour would you rather have? (a) Mark all messages classified as Spam as read. (b) Mark all messages actioned with the "Delete as Spam" button as read. 
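The two behaviours in the question above are independent of each other, so one way to look at them is as two separate on/off settings. A minimal sketch, assuming hypothetical option names (mark_filtered_spam_as_read and mark_deleted_as_spam_as_read) that are not taken from the plugin's source:

```python
from dataclasses import dataclass


@dataclass
class ReadOptions:
    # Hypothetical option names, invented for this illustration only.
    mark_filtered_spam_as_read: bool = False    # behaviour (a)
    mark_deleted_as_spam_as_read: bool = False  # behaviour (b)


def should_mark_read(options, moved_by):
    """Decide whether a message moved to the Spam folder is marked as read.

    moved_by is "filter" when the classifier moved the message, or
    "button" when the user clicked "Delete as Spam".
    """
    if moved_by == "filter":
        return options.mark_filtered_spam_as_read
    if moved_by == "button":
        return options.mark_deleted_as_spam_as_read
    raise ValueError("unknown mover: %r" % (moved_by,))
```

Keeping both settings off by default leaves automatically filtered spam unread, so it can still be checked for false positives.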
=Tony Meyer From T.A.Meyer at massey.ac.nz Sat May 3 10:23:12 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri May 2 17:23:47 2003 Subject: [Spambayes] Outlook 2000 and spambayes dying Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABE54@its-xchg4.massey.ac.nz> > Both times this > occurred was after rebooting my system from an unstable > state, i.e. I basically killed my machine and cycled power. [...] > After this happened I basically just deleted the C:\Documents > and Settings\cmawson\Application Data\SpamBayes\*.* reran the > addin.py and manager.py retrained and things started working again. What happened both times (going by the trace) is that the configuration file was corrupted. I wouldn't have thought this should happen unless it was actually being written to at the time, but I could be wrong. There's a limit to what can be done in this sort of situation, but the plugin can definitely handle this sort of thing better. There's an SF bug about it - [ 709051 ] "Config file loading and saving is fragile". So, unfortunately, all that you can really do at the moment is what you have been doing, and wait for the fix for that bug to come along. (Plus stop killing your machine like that). =Tony Meyer From seandarcy at hotmail.com Fri May 2 23:48:58 2003 From: seandarcy at hotmail.com (sean darcy) Date: Fri May 2 22:49:32 2003 Subject: [Spambayes] Still can't do web config under linux Message-ID: I tried out today's cvs of spambayes. Before, I could get to the web config page, but it died when I tried to save. Now I can't get to the config page.
When I click on the config link, I get: 500 Server error Traceback (most recent call last): File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 398, in found_terminator getattr(plugin, name)(**params) File "/usr/lib/python2.2/site-packages/spambayes/UserInterface.py", line 409, in onConfig if type(options.valid_input(sect, opt)) in types.StringTypes: File "/usr/lib/python2.2/site-packages/spambayes/Options.py", line 1446, in valid_input return self._options[sect, opt].valid_input() KeyError: ('pop3proxy', 'servers') No error shows in the terminal window where I started pop3proxy. thanks sean _________________________________________________________________ The new MSN 8: smart spam protection and 2 months FREE* http://join.msn.com/?page=features/junkmail From seandarcy at hotmail.com Sat May 3 00:54:28 2003 From: seandarcy at hotmail.com (sean darcy) Date: Fri May 2 23:55:02 2003 Subject: [Spambayes] Error on proxy connection Message-ID: >From: Tim Stone - Four Stones Expressions >........... > >at the terminal window where I ran pop3proxy I get: > >This looks like a bug. Would you mind making a tar of a mail that causes >this >error, so I can recreate and diagnose? > I've just gotten around to following up on this. I've tried two different email programs -evolution and mozilla - each with two different email accounts ( both of which work with both programs). I get the same bug each time. So any *mail* causes the error. I wonder if the source of the bug isn't in the email distribution. I couldn't find 2.4.3, so I used 2.5 from sourceforge. I have python 2.2. I've looked around, but can't find a cvs source, though I did see an announcement of 2.5.1. any thoughts?
sean _________________________________________________________________ The new MSN 8: advanced junk mail protection and 2 months FREE* http://join.msn.com/?page=features/junkmail From T.A.Meyer at massey.ac.nz Sat May 3 17:08:48 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat May 3 00:09:29 2003 Subject: [Spambayes] Still can't do web config under linux Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABE7E@its-xchg4.massey.ac.nz> > I tried out today's cvs of spambayes. Before, I could get to > the web config page, but it died when I tried to save. > Now I can't get to the config page. [...] > KeyError: ('pop3proxy', 'servers') Sorry, this was me. I forgot to update ProxyUI and ImapUI when I changed over the option names (this would only affect modules that use the new options attributes, like is_valid). I've checked in a fix for this, so please try again! (I'm especially curious as to whether the new temp file method will work or not.) =Tony Meyer From T.A.Meyer at massey.ac.nz Sat May 3 17:16:32 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat May 3 00:17:15 2003 Subject: [Spambayes] Error on proxy connection Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABE7F@its-xchg4.massey.ac.nz> > > >at the terminal window where I ran pop3proxy I get: > >This looks like a bug. Would you mind making a tar of a mail that > >causes this error, so I can recreate and diagnose? > I've just gotten around to following up on this. I've tried > two different email programs -evolution and mozilla - each with two > different email accounts ( both of which work with both programs). > I get the same bug each time. So any *mail* causes the error. Could you try adding the following to your configuration file: [globals] verbose: True and running it again? A file called _pop3proxy.log should have been created with a log of the POP conversation. Could you send this on to us? (edit out anything private that's in there).
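The configuration fragment above is an INI-style section with the header and the setting on separate lines. A minimal sketch of that layout, assuming only that it parses as ordinary INI text; it uses the modern Python standard library configparser module (named ConfigParser in the Python 2.2 of this thread) and does not show how SpamBayes itself reads the file:

```python
import configparser

# The two lines suggested above, written one per line as they
# would appear in the configuration file.
CONFIG_TEXT = """\
[globals]
verbose: True
"""

parser = configparser.ConfigParser()
parser.read_string(CONFIG_TEXT)

# getboolean() accepts "True"/"False" in any letter case.
verbose = parser.getboolean("globals", "verbose")
print(verbose)
```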
> I wonder if the source of the bug isn't in the email distribution. I > couldn't find 2.4.3, so I used 2.5 from sourceforge. I have > python 2.2. I've looked around, but can't find a cvs source, though I > did see an announcement of 2.5.1. I can't see any reason why 2.5 wouldn't be ok. I'm only using 2.4.3 and everything works fine. =Tony Meyer From noreply at sourceforge.net Sat May 3 05:40:33 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat May 3 07:40:37 2003 Subject: [Spambayes] [ spambayes-Bugs-731538 ] SpamBayes not filtering messages. Message-ID: Bugs item #731538, was opened at 2003-05-03 05:05 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Brian Takita (btakita) Assigned to: Nobody/Anonymous (nobody) Summary: SpamBayes not filtering messages. Initial Comment: The Delete as Spam button does not work either. It gives email spam a score if I request it. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-03 21:40 Message: Logged In: YES user_id=14198 Can you please see the about.html file that comes with the package (click "About" on the main Window), see how to find the log for your session, and attach it to this bug? Thanks ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-05-03 05:50 Message: Logged In: YES user_id=645698 Can you give us a bit more info here? What platform are you running on, what version of python, what version of Outlook, what version of spambayes (alpha1, alpha2, current cvs)?
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 From noreply at sourceforge.net Sat May 3 05:41:45 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat May 3 07:41:49 2003 Subject: [Spambayes] [ spambayes-Bugs-731538 ] SpamBayes not filtering messages. Message-ID: Bugs item #731538, was opened at 2003-05-03 05:05 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Brian Takita (btakita) Assigned to: Nobody/Anonymous (nobody) Summary: SpamBayes not filtering messages. Initial Comment: The Delete as Spam button does not work either. It gives email spam a score if I request it. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-03 21:41 Message: Logged In: YES user_id=14198 Sorry - my mistake - you already did that - thanks! ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-03 21:40 Message: Logged In: YES user_id=14198 Can you please see the about.html file that comes with the package (click "About" on the main Window), see how to find the log for your session, and attach it to this bug? Thanks ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-05-03 05:50 Message: Logged In: YES user_id=645698 Can you give us a bit more info here? What platform are you running on, what version of python, what version of Outlook, what version of spambayes (alpha1, alpha2, current cvs)?
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 From seandarcy at hotmail.com Sat May 3 15:02:19 2003 From: seandarcy at hotmail.com (sean darcy) Date: Sat May 3 14:02:53 2003 Subject: [Spambayes] Still can't do web config under linux Message-ID: >>I tried out today's cvs of spambayes. Before, I could get to the web >>config page, but it died when I tried to save. >>Now I can't get to the config page. [...] >>KeyError: ('pop3proxy', 'servers') >Sorry, this was me. I forgot to update ProxyUI and ImapUI when I >changed over the options names (this would only affect modules that use >the new options attributes, like is_valid). > >I've checked in a fix for this, so please try again! (I'm especially >curious as to whether the new temp file method will work or not.) Well, I suppose this is progress. I got a new error message when I tried to bring up the config page: 500 Server error Traceback (most recent call last): File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 398, in found_terminator getattr(plugin, name)(**params) File "/usr/lib/python2.2/site-packages/spambayes/UserInterface.py", line 409, in onConfig if type(options.valid_input(sect, opt)) in types.StringTypes: File "/usr/lib/python2.2/site-packages/spambayes/Options.py", line 1446, in valid_input return self._options[sect, opt].valid_input() KeyError: ('Headers', 'notate_to') thanks sean _________________________________________________________________ The new MSN 8: advanced junk mail protection and 2 months FREE* http://join.msn.com/?page=features/junkmail From seandarcy at hotmail.com Sat May 3 15:42:02 2003 From: seandarcy at hotmail.com (sean darcy) Date: Sat May 3 14:47:57 2003 Subject: [Spambayes] Error on proxy connection Message-ID: Here are two _pop3proxy logs from two different accounts at optonline.net. They look clean to me.
I tried them from evolution (2) and the mozilla mailer ( 1 ). Both programs show that they are getting the headers, and then they both just wait for the mail. Just to confirm - both work fine directly accessing these accounts. I still think email may be an issue. Do you know where I can find email-2.4.3? thanks sean _________________________________________________________________ MSN 8 with e-mail virus protection service: 2 months FREE* http://join.msn.com/?page=features/virus -------------- next part -------------- A non-text attachment was scrubbed... Name: _pop3proxy.log.1.bz2 Type: application/x-bzip2 Size: 449 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030503/2c8895d7/_pop3proxy.log.1.bin -------------- next part -------------- A non-text attachment was scrubbed... Name: _pop3proxy.log.2.bz2 Type: application/x-bzip2 Size: 507 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030503/2c8895d7/_pop3proxy.log.2.bin From T.A.Meyer at massey.ac.nz Sun May 4 11:14:23 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat May 3 18:15:08 2003 Subject: [Spambayes] Still can't do web config under linux Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABE8B@its-xchg4.massey.ac.nz> > Well, I suppose this is progress. I got a new error message > when I tried to > bring up the config page: This is now fixed. I've got some time today, so I had a chance to test it, and it all appears to be working again. Sorry, I should have taken the time to check that I had fixed *all* the problems yesterday. =Tony Meyer From noreply at sourceforge.net Sat May 3 16:21:50 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat May 3 18:21:55 2003 Subject: [Spambayes] [ spambayes-Bugs-731538 ] SpamBayes not filtering messages.
Message-ID: Bugs item #731538, was opened at 2003-05-02 12:05 Message generated for change (Comment added) made by btakita You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Brian Takita (btakita) Assigned to: Nobody/Anonymous (nobody) Summary: SpamBayes not filtering messages. Initial Comment: The Delete as Spam button does not work either. It gives email spam a score if I request it. ---------------------------------------------------------------------- >Comment By: Brian Takita (btakita) Date: 2003-05-03 15:21 Message: Logged In: YES user_id=620484 Windows 2000 Outlook XP Python 2.2 SpamBayes 1.0a2 ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-03 04:41 Message: Logged In: YES user_id=14198 Sorry - my mistake - you already did that - thanks! ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-03 04:40 Message: Logged In: YES user_id=14198 Can you please see the about.html file that comes with the package (click "About" on the main Window), see how to find the log for your session, and attach it to this bug? Thanks ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-05-02 12:50 Message: Logged In: YES user_id=645698 Can you give us a bit more info here? What platform are you running on, what version of python, what version of Outlook, what version of spambayes (alpha1, alpha2, current cvs)?
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 From T.A.Meyer at massey.ac.nz Sun May 4 11:36:17 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat May 3 18:36:52 2003 Subject: [Spambayes] Error on proxy connection Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABE8C@its-xchg4.massey.ac.nz> > Here are two _pop3proxy logs from two different accounts at > optonline.net. > They look clean to me. One dies on RETR, the other on TOP (which actually call the same code in pop3proxy). It's odd that creating the message object seems to go fine, but flattening it dies. Could you put these three lines: state.logFile.write("Message Text is:\n") state.logFile.write(messageText) state.logFile.flush() After line 433 (" ok, messageText = response.split('\n', 1)"), and attach the new log? I'm sure this must be a message with some sort of strange encoding or charset, or a malformed message, that's causing the exception in the email package. This will let us see what sort of message it is. > I still think email may be an issue. Do you know where I can find > email-2.4.3? The only place I know of is included with Python 2.2.2. It really shouldn't be an issue with the email package - there should be others that are using 2.5 - and certainly many that are using 2.5.1. =Tony Meyer From patrick at keogh.net.au Sun May 4 11:13:17 2003 From: patrick at keogh.net.au (Patrick Keogh) Date: Sat May 3 20:13:25 2003 Subject: [Spambayes] RE:Could the Outlook plugin mark messages as "Read"when they are moved to spam folder? In-Reply-To: Message-ID: <000901c311d1$f74272b0$0a7114cb@keogh.net.au> In response Tony Meyer asks: Out of interest, which behaviour would you rather have? (a) Mark all messages classified as Spam as read. (b) Mark all messages actioned with the "Delete as Spam" button as read.
My answer is that, in a perfect world, I'd like to see two checkboxes on the Anti-Spam Manager combo box, labelled: [] Mark messages as read when the filter moves them to the Spam folder [] Mark messages as read when they are moved using the "Delete as Spam" button From noreply at sourceforge.net Sat May 3 19:43:09 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat May 3 21:43:23 2003 Subject: [Spambayes] [ spambayes-Feature Requests-680629 ] Outlook plugin: Delete as spam marks as read Message-ID: Feature Requests item #680629, was opened at 2003-02-05 13:30 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=680629&group_id=61702 Category: Outlook Group: None Status: Open Priority: 1 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Mark Hammond (mhammond) Summary: Outlook plugin: Delete as spam marks as read Initial Comment: Personally I think it would be nice if the "delete as spam" button marked the mail item as read. Note that I'm not saying that mail that is filtered as spam should be marked as read - it shouldn't (by default). If others agree, this would be a nice addition. Perhaps as an option in the prefs. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-04 11:43 Message: Logged In: YES user_id=14198 Out of interest, which behaviour would you rather have? (a) Mark all messages classified as Spam as read. (b) Mark all messages actioned with the "Delete as Spam" button as read. 
My answer is that, in a perfect world, I'd like to see two checkboxes on the Anti-Spam Manager combo box, labelled: [] Mark messages as read when the filter moves them to the Spam folder [] Mark messages as read when they are moved using the "Delete as Spam" button ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-02-27 02:45 Message: Logged In: YES user_id=645698 This is an interesting thread. I think it should move to the main list. Pop3proxy has a very similar configuration function, which manages options into bayescustomize.ini (by default). This is another area that we should solve the problem once... ---------------------------------------------------------------------- Comment By: Piers Haken (piersh) Date: 2003-02-07 22:38 Message: Logged In: YES user_id=10551 i don't care if you do this or not (since spambayes catches all my spam ;-) ), but please don't mark any automatically- filtered spam as 'read' - it would be a pain to check for FPs if you did. thx. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-02-07 22:05 Message: Logged In: YES user_id=14198 Fair enough :) ---------------------------------------------------------------------- Comment By: Paul Moore (pmoore) Date: 2003-02-07 20:09 Message: Logged In: YES user_id=113328 I'd like the "Mark as read" option. Most unsures and false negatives which are spam, I can identify by subject, and hence I don't open (and I don't use the preview pane). But it's not crucial - Ctrl-Q does a very quick "Mark as read" anyway... ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-02-05 14:17 Message: Logged In: YES user_id=552329 Agreed that it is not necessary. 
---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-02-05 14:11 Message: Logged In: YES user_id=14198 Yep, I see that marking as read could be useful in that they have been reviewed, but then I would expect Outlook's normal mechanism to still work and mark it read. I have my preview pane mark as read after 2 seconds :) Re the INI file - my problem is that the GUI needs to modify these options, and I don't see how it is trivial to keep the fairly "free-form" INI file format supported by configparser, while only writing out certain elements and not others and also keeping comments etc intact. I'll make a deal - help me with the options problem, and I will give you 5 free options. Let's take it to email... ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-02-05 13:56 Message: Logged In: YES user_id=552329 My reasoning was that if the user manually selects to delete it as spam, then it is as good as read. Those that are moving via the filter have not been read. Personally I still wade through the filtered spam to check it for false positives, and mark the messages as read as I go (so that the 'unread' display is the number of messages I haven't checked). If I choose delete as spam, I then have to go to the spam folder and mark it as read. In any case, no big deal if you disagree, it was just a thought :) Re: the ini file: looking at the ini, it doesn't seem to have anything that couldn't be in the GUI. Most of it would probably fit in the "advanced" dialog. It would probably be good if the ini was only for 'beta' options - anything that is for public use should be in the GUI. And if a 'beta' option moves to 'public', then it doesn't matter (much) if it breaks, because those using beta options should be upgrading anyway.
Moving the existing settings (most of which should be exposed I think) would mean breaking existing code, but maybe just this once? Maybe this discussion should move to the list? (maybe I should have posted this there originally?) ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-02-05 13:41 Message: Logged In: YES user_id=14198 I'm not too sure this should happen unless the filter also marks the items as read - otherwise you still end up with many spam in the spam folder unread, and only the ones you move manually marked as read. I'm also kinda stuck about what to do with "options". Currently, options managed by the GUI are in a pickle, while other options are in the .ini file. I don't object to having new, outlook specific options in the INI file, but I do object to all our existing code breaking should we decide later to move this option into the GUI. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-02-05 13:31 Message: Logged In: YES user_id=552329 And who else to decide on this, but Mark :) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=680629&group_id=61702 From dave at boost-consulting.com Sat May 3 23:19:21 2003 From: dave at boost-consulting.com (David Abrahams) Date: Sat May 3 22:20:13 2003 Subject: [Spambayes] Re: Success! : Spambayes + IMAP + crontab + CommunigatePro + procmail In-Reply-To: (Tim Stone's message of "Thu, 01 May 2003 19:05:15 -0500") References: Message-ID: Tim Stone - Four Stones Expressions writes: > 5/1/2003 6:57:48 PM, David Abrahams wrote: > >> >>After much thrashing, I have successfully set up a server-side >>filtering system with spambayes. > > Congratulations! You're one persistent dude. 
> >>If there is any interest in a RestructuredText (or HTML) document >>describing the whole deal from the POV of a neophyte in >>email/IMAP/procmail/CGPro/Spambayes, and it would get displayed >>somewhere useful to others, I'd be glad to contribute one to the >>Spambayes project. > > If you create a document in html, I'll be sure it shows up (credited) on the > website... Voila: http://www.boost-consulting.com/writing/server-side.html http://www.boost-consulting.com/writing/server-side.rst Please see the open questions at the end of the document. I think it's important that someone from the project answer this question. -- Dave Abrahams Boost Consulting www.boost-consulting.com From seandarcy at hotmail.com Sun May 4 00:52:46 2003 From: seandarcy at hotmail.com (sean darcy) Date: Sat May 3 23:53:33 2003 Subject: [Spambayes] Still can't do web config under linux Message-ID: WorksForMe. But... just to check I added a mail server on the config page and then saved it. In the terminal window where I started pop3proxy, the new server showed up. When I went back to the main web page, it didn't. If it's not proxying for the new server, it shouldn't show up on the terminal window; if it is, it ought to show up on the web page. thanks for fixing the config page. sean >From: "Meyer, Tony" >To: "sean darcy" , >Subject: RE: [Spambayes] Still can't do web config under linux >Date: Sun, 4 May 2003 10:14:23 +1200 > > > Well, I suppose this is progress. I got a New error message > > when I tried to > > bring to the config page: > >This is now fixed. I've got some time today, so I had a chance to test >it, and it all appears to be working again. Sorry, I should have taken >the time to check that I had fixed *all* the problems yesterday. > >=Tony Meyer
From T.A.Meyer at massey.ac.nz Sun May 4 17:00:26 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 4 00:01:03 2003 Subject: [Spambayes] Still can't do web config under linux Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABE9F@its-xchg4.massey.ac.nz> > But... just to check I added a mail server on the config page > and then saved it. In the terminal window where I started pop3proxy, > the new server showed up. When I went back to the main web page, > it didn't. If it's not proxying for the new server, it shouldn't > show up on the terminal window; if it is it ought to show up on > the web page. Good spotting. It was proxying, it just wasn't updating. I've fixed this, too. > thanks for fixing the config page. Well, I broke it, so I had to, really. =Tony Meyer From stego at cpl.net Sat May 3 21:55:50 2003 From: stego at cpl.net (Stego) Date: Sun May 4 00:14:55 2003 Subject: [Spambayes] My Dream Spam Killer... Message-ID: <4.2.2.20030503190229.00a28c40@loraninc.com> Hello there! I've been looking around at all of the available spam tagger/killers, and by the looks of it I believe that your system might be close to what I've been dreaming about. Lacking the skills necessary to program such a beast, I figured that I'd go ahead and, instead of keeping my thoughts bottled up, share my thoughts with those people who Can make a difference. At the same time, perhaps you can tell me if your spam system already does what I'm dreaming about... Enough of my yappin, herein lies my Dream Spam Killer: ==== I dream of a bayesian-style system that is interactive, and also uses whitelisting as an identification tool: It sits parallel to an existing SMTP server much like most filters currently do...
However, unlike most "Install and it's Running" style systems, this one would need some time to learn per user: During the first few months of it being activated against a particular inbox, while its scoring database is empty, the spam system will simply sit there, processing all messages and keeping a dictionary tally of words of all incoming messages for this individual user. During this phase, it is only learning about words that are found in all the emails directed at this user. Also during this time, the system is taught to recognize spam when the user forwards a piece of email to one of a few e-mail addresses that the system is taught to Intercept, such as spam@mycompany.com, or ham@mycompany.com, or badspam@mycompany.com. Upon receiving such an "Interactive" message, the system will then look through the forwarded message and either perform a "Spamminess" calculation or a whitelisting of that message's sender, and store the scores as being specific to the user who forwarded the message. This way, what is marked as spam for one user isn't necessarily marked as spam for another user, and the user has control over interactively Identifying those addresses which are Not spam, without ever having to know a thing about how the server really works. Here's a sample scenario of what I'm getting at: Let's say that we implement the system on a server that handles 130 different e-mail addresses. Let us home in on one local mailbox owner: julie@mycompany.com. The Spam system, when first activated against an individual inbox, simply sits there in "Monitor" mode, watching the mail that passes through, building a word dictionary that is specific to Only julie@mycompany.com. All other users have their own word dictionary sets. Let's say that one day Julie receives a Nigerian Scam spam.
As per the instructions that were e-mailed to her when the system was first activated against her inbox, She Immediately forwards this message to spam@mycompany.com, where the Spam system will then take the message and perform its Bayesian scoring methods, updating its word dictionary for Julie in the process. Now here's the thing: The System has already once seen this style of mail pass through it, before Julie came back and labelled it as spam. Essentially, the system is working on an "Innocent until proven Guilty" phase: Until Julie blew the whistle on the message, the system simply took record of the major words in this message as a Non-spam email. If she were to forward the message to badspam@mycompany.com, this would tell the system that this message is Much more irritating to julie than normal, and that she would prefer to see these deleted rather than just tagged as spam. Porn messages would be example messages that would be sent to the badspam list. Of course since the system is still in Monitor mode against her account, future messages of this type will not yet be deleted. On this same day, Julie receives a message from her friend jack@aol.com. Again, as per the instructions she received, she forwards this message to ham@mycompany.com, where the e-mail address of the is now recorded as a "Friendly" e-mail address, and will from now on be tagged as a friendly message until julie says otherwise in the future (when she sends a command to spamsystem@mycompany.com in much the same way as sending commands to a listserver). She can also add addresses that she considers to be Friendly to her whitelist in this manner. This whitelist that is created is simply a list of those addresses that the Spam system will deliver without trying to score it. After a few weeks (or several hundred messages, whatever timeframe the Administrator tells the spam system to wait), the system will send a message asking her if she wants to switch the spam system into "Test" mode. 
If she says Yes, then the spam system will start delivering all her messages with a visible tag message showing Julie what the message was tagged as: Ham, Spam, BadSpam, or Meat (Unidentified). If any one of them were incorrectly identified, she would simply forward the mistagged message to the appropriate system box (ham@mycompany.com, spam@mycompany.com, or badspam@mycompany.com), where the system would then update its scores for words contained in that message appropriately. After the System has been running in Test mode for a bit, if it hasn't received any misidentified messages back from Julie in a while (again a percentage or time period specified by the Admin), it would then ask her if she wanted to switch her system into Run mode. If she says Yes, then from then on the system will go ahead and tag Spam with an invisible header Identifier that can be filtered by the user's Email client, and all BadSpam will be Immediately deleted, sending to Julie a simple message (that has been marked as Spam) saying that a message from such@such.com was Identified as BadSpam and thus deleted. (Any such deletions would also generate a kicked back message to the sender, indicating that their message was deleted.) Anything that has been tagged as Meat will still retain a visible "This has been tagged as Meat. Please tell me what you want done with these styles of Messages" message at the top, allowing Julie to take further action on the message (either sending it to ham, spam, or badspam@mycompany.com). Julie could at any time send a command to spamsystem@mycompany.com and adjust any of her settings: whether or not to inform her of deleted badspams, removing or adding people from her whitelist, or even to set the system back to Test mode, or even turn the system completely off if she desires. When Julie's mail account is deleted from the server, her scores are deleted as well.
===== Anyway, I hope that this proves to be inspirational if not useful in the building of a better spam trap! Ray Collazo, MIS Loran Inc. (rcollazo@nightscaping.com) From T.A.Meyer at massey.ac.nz Sun May 4 17:42:42 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 4 00:43:53 2003 Subject: [Spambayes] My Dream Spam Killer... Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABEA2@its-xchg4.massey.ac.nz> > I dream of a bayesian-style system that is interactive, If by interactive, you mean that you need to train it, then yes. > and also uses whitelisting as an identification tool There isn't any whitelisting done. However, all that most of the Spambayes apps do is add a classification to a message. It's up to the mail program to do something with it (imapfilter is an exception to this; I don't know about the hammiefilter style apps). So, you could have a rule in the mail app that says something along the lines of "if this is not in the whitelist and the classification is spam, remove it". You could also have Spambayes operate after a system that just white/black lists. > It sits parallel to an existing SMTP server much like most > filters currently do... I'm not sure exactly what you mean here. Do you want mail to be classified/filtered as you send it? I'm not sure how this would help. Spambayes does work with existing mail servers (POP, IMAP, Exchange-via-Outlook). > However, unlike most "Install and its Running" style systems, > this one would need some time to learn per user: Well, as soon as you have trained one ham and one spam message, you'll get reasonable results. The docs talk about how this improves over time, but there really isn't a need for a non-classifying period. At first there will be a few mistakes, and a reasonable number of unsures, but this will change very quickly (assuming you train on them). 
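To make the mail-program rule mentioned above concrete ("if this is not in the whitelist and the classification is spam, remove it"), here is a minimal Python sketch. It is not part of Spambayes: the function name, the whitelist set, and the assumption that the classification arrives in an X-Spambayes-Classification header with the value "spam" are all illustrative, so check what your own setup actually writes.

```python
from email import message_from_string
from email.utils import parseaddr

# Assumed header name and value; adjust to whatever your filter adds.
CLASSIFICATION_HEADER = "X-Spambayes-Classification"
SPAM_VALUE = "spam"


def should_remove(raw_message, whitelist):
    """Return True only for mail tagged as spam from a non-whitelisted sender.

    `whitelist` is a hypothetical set of lower-case e-mail addresses that
    the user has chosen to trust.
    """
    msg = message_from_string(raw_message)
    # parseaddr splits "Jack <jack@aol.com>" into ("Jack", "jack@aol.com").
    _, sender = parseaddr(msg.get("From", ""))
    classification = (msg.get(CLASSIFICATION_HEADER) or "").strip().lower()
    return sender.lower() not in whitelist and classification == SPAM_VALUE


if __name__ == "__main__":
    trusted = {"jack@aol.com"}
    sample = (
        "From: Stranger <someone@example.com>\n"
        "X-Spambayes-Classification: spam\n"
        "\n"
        "Buy now!\n"
    )
    print(should_remove(sample, trusted))
```

The whitelist check comes first in spirit: a trusted sender is never removed, whatever the classifier says, which is the behaviour the rule describes. Keep in mind the spoofing caveat that applies to any address-based whitelist.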
> The system is taught to recognize spam when > the user forwards a piece of email to one of a few > e-mail addresses that the system is taught to Intercept, such as > spam@mycompany.com, or ham@mycompany.com, or > badspam@mycompany.com. This is more-or-less how the SMTP proxy works. > Lets say that we implement the system on a server that handles 130 > different e-mail addresses. From the FAQ: Q: This software is great! I want to implement it for all my users. Are there plans to develop a server-side spambayes solution? A: The problem with a server-side solution is that everyone has a different idea of what is spam - that's the whole strength of the bayesian-style filtering concept. If you are certain that *all* of your users would agree on what is spam and what is not, then this might work for you, but otherwise you really have to have individual databases for each user. Either way, you should be able to modify spambayes easily enough to fit into your setup. Please let the list know if you do have success in this area, and we'll update this answer. > Essentially, the system is working on an "Innocent > until proven Guilty" phase None of the Spambayes apps (AFAIK) train all incoming mail (as the classified type). It would be simple to do this if that was what you wanted, and I believe it was suggested (in the context of the Outlook plugin) at one stage. If this is what you want, open a feature request (RFE) via sourceforge () referring to a specific application (like pop3proxy, or imapfilter). > If she were to forward the message to badspam@mycompany.com, this > would tell the system that this message is Much more irritating to > julie than normal, and that she would prefer to see these deleted > rather than just tagged as spam. Porn messages would be > example messages that would be sent to the badspam list. This is much trickier than it sounds. How does the system identify messages like those?
Spambayes is a binary classifier - either messages are like the ham set, or like the spam set. You could set a limit above which score messages are deleted (> 99%, for example). You could have two databases - one that classifies between ham/spam and one for ham/badspam; this is really just an implementation issue. > On this same day, Julie receives a message from her friend > jack@aol.com. [...] > the e-mail address of the is now recorded as a > "Friendly" e-mail address, and will from now on be tagged as > a friendly message until julie says otherwise in the future It would be fairly simple to create a SMTP/POP proxy pair that did this sort of automatic whitelisting, but that's not what spambayes does. Whitelisting doesn't really work very well, anyway, in my experience - too many messages from people I don't know, and too easy to spoof a known address (like my own). > Anyway, I hope that this proves to be inspirational if not > useful in the building of a better spam trap! I hope my comments have made it clearer what Spambayes does, and what it doesn't. It is, for the most part, what you described (but then so are other filters), without the whitelisting component (maybe you should look at SpamAssassin). The main thing is that Spambayes at the moment is a client-side. There are a lot of issues about server-side bayesian-style filtering. On the other hand, if someone wants to implement a server-side filter and can decide what the rules will be (about individual databases, deletion/tagging, and so on), then the core Spambayes code can be used without change. Note that there is also a reasonable amount of documentation on the website that describes what Spambayes does: . =Tony Meyer From T.A.Meyer at massey.ac.nz Sun May 4 17:49:28 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 4 00:50:22 2003 Subject: [Spambayes] My Dream Spam Killer... 
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABEA3@its-xchg4.massey.ac.nz> > I hope my comments have made it clearer what Spambayes does, > and what it doesn't. It is, for the most part, what you described > (but then so are other filters), without the whitelisting component In all that, I missed the most important comment: Why not give Spambayes a go? The website has all the information you should need to get started. The current alpha2 release doesn't have the smtp proxy training like you described, but alpha3/beta1 should be out soon, or you could be brave ;) and try the cvs code. Make sure that you let us know if you try it out and find things that could be done better, or if you have problems. :) =Tony Meyer From patrick at keogh.net.au Sun May 4 18:09:02 2003 From: patrick at keogh.net.au (Patrick Keogh) Date: Sun May 4 03:09:09 2003 Subject: [Spambayes] Outlook plugin and mark as read when moved/deleted... In-Reply-To: Message-ID: <000b01c3120c$0b610100$0a7114cb@keogh.net.au> Some of the discussion seems to have missed the point I was making. I only asked that there be an option for the filter to mark a message as read. I was not suggesting that this should be the default or only behaviour. In any event, if we all read our Spam, just to check that there are no false positives, what is the point in using a Spam filter? That's why I want them marked as read. Once Spambayes has consigned a message to the Spam folder (automatically or because I moved it), for me that's the end of it. I haven't had a false positive in the past six months. All I would do occasionally is to go to the Spam folder, sort it by Spaminess, and look at the marginal messages. From T.A.Meyer at massey.ac.nz Mon May 5 10:31:26 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 4 17:32:10 2003 Subject: [Spambayes] Outlook plugin and mark as read when moved/deleted... 
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABF68@its-xchg4.massey.ac.nz> > Some of the discussion seems to have missed the point I was > making. I only asked that there be an option for the filter > to mark a message as read. I was not suggesting that this > should be the default or only behaviour. I can't speak for everyone, but I think we understood that. I also would like (some of) the behaviour you've suggested, but as a default-to-off option. > In any event, if we all read our Spam, just to check that > there are no false positives, what is the point in using a > Spam filter? Well, it does move them all away where you can deal with them once per day/week/month. For those for whom a false positive would be really, really bad, this is the way to go. I'm sure that Mark (or maybe me, or maybe someone else) will get to implementing this option at some point... =Tony Meyer From T.A.Meyer at massey.ac.nz Mon May 5 11:14:15 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 4 18:14:50 2003 Subject: [Spambayes] PEP-0263 and resourcepackage (ui_html.py) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016ABFAD@its-xchg4.massey.ac.nz> Apparently Python 2.3b1 gives the following warning: > /opt/python/2.3b1/lib/python2.3/site-packages/spambayes/UserIn > terface.py:192: > DeprecationWarning: Non-ASCII character '\xda' in file > /opt/python/2.3b1/lib/python2.3/site-packages/spambayes/resour > ces/ui_html.py > on line 7, but no encoding declared; see > http://www.python.org/peps/pep-0263.html for details > from spambayes.resources import ui_html ui_html.py is created by resourcepackage and contains a zlib compressed string, which is where the \xda character is. What should be done about this? (a) Ignore the warning, and fix it when phase 2 of the PEP is implemented. (b) Ask the resourcepackage people to update it so that it includes an encoding. (c) Something else.
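For anyone wanting to check other generated files for the same problem, the condition behind the warning can be sketched in a few lines of Python. This is an illustration only, written for a current Python 3 rather than 2.3: the helper names are made up, and the regular expression is the one given in PEP 263 for recognising an encoding declaration on the first or second line.

```python
import re

# Encoding-declaration pattern as given in PEP 263 (applied to raw bytes).
CODING_RE = re.compile(rb"^[ \t\f]*#.*?coding[:=][ \t]*([-_.a-zA-Z0-9]+)")


def declared_encoding(source):
    """Return the encoding named on line 1 or 2 of `source`, or None."""
    for line in source.splitlines()[:2]:
        match = CODING_RE.match(line)
        if match:
            return match.group(1).decode("ascii")
    return None


def first_non_ascii_line(source):
    """Return the 1-based number of the first line holding a byte > 127."""
    for number, line in enumerate(source.splitlines(), start=1):
        if any(byte > 127 for byte in line):
            return number
    return None


def needs_declaration(source):
    """True when the file has non-ASCII bytes but no encoding declaration."""
    return (declared_encoding(source) is None
            and first_non_ascii_line(source) is not None)


if __name__ == "__main__":
    generated = b"# generated file\nimport zlib\ndata = 'x\xda...'\n"
    print(needs_declaration(generated))
    print(needs_declaration(b"# -*- coding: latin-1 -*-\n" + generated))
```

Option (b) amounts to making the second call's situation the normal one: a declaration on the first line, so the literal bytes are interpreted under a stated encoding.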
=Tony Meyer From mhammond at skippinet.com.au Mon May 5 12:34:29 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun May 4 21:35:31 2003 Subject: [Spambayes] URLSlurper (Was: Latest spammer trick stymied -QUESTION) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13016ABB00@its-xchg4.massey.ac.nz> Message-ID: <015601c312a6$79a2cf50$530f8490@eden> > Time to pull out the testing data again... OK - I will bite. These short URL messages are still coming strong, and usually still ending up squarely as "unsure". Eg, the most recent I can find has a body of: """ Brother and Sister Nephew and Aunt Nasty relatives performing intrafamilial party in bedroom http://peipeisoq.incestuals.com/ """ To my mind, if you are a "customer" of such Spam, you know exactly what it is offering - so presumably is as effective as any other porn spam - possibly more so due to the "teasing" qualities. > In the interests of testing, I've done this. The code isn't based at > all on the stuff that Richard sent (it was easier to just do it from > scratch), but does follow the same rules (more-or-less). *sigh* - while running the Outlook "export" program, I still see some errors de-composing Outlook messages: Failed to get message text for 'ADV: The Star Trek Cast NAKED! - This months special feature. - Must Be 18 Years or Older': string payload expected: Failed to get message text for 'ADV: Exclusive Hot Young Centerfold Girls! - Must Be 18 Years or Older': string payload expected: Failed to get message text for '[PMX:#] Be impressed!': string payload expected: Failed to get message text for 'My father and his bitches...': string payload expected: . And to think it appears profitable to send spam advertising the StarTrek cast naked . > urls that will timeout (60s a pop!), and not a good idea to run if you > pay per kb downloaded, or something like that). I admit I am a little suspect on this test code though - it *seems* to be slurping URLs even when there are plenty of discriminators. 
I may be wrong though, and have no time to check. Anyway, after waiting ages for things to complete, and making some changes to the test code, I have some results. For me, it seems a loss! timtest_outs.txt -> urlslurper_outs.txt -> tested 423 hams & 1430 spams against 462 hams & 1493 spams ... false positive percentages ... won 0 times tied 20 times lost 0 times total unique fp went from 8 to 8 tied mean fp % went from 0.159381556389 to 0.159381556389 tied false negative percentages ... won 0 times tied 14 times lost 6 times ... Which seems surprising. I also note my ham/spam imbalance is getting high, and it *seems* spambayes isn't doing the job it used to (far more "unsures" than seems reasonable). Unfortunately (or fortunately) it is still doing good enough for me to not have the time to investigate further. Another option is that my changes to the urlslurper (eg, only fetching text/html) broke the test. Can you re-run again from CVS and make sure you still see a win? I'd *love* to see a win (even though implementing anything like this for outlook would be a challenge ;) Mark. From anthony at interlink.com.au Mon May 5 12:01:15 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Sun May 4 22:01:16 2003 Subject: [Spambayes] PEP-0263 and resourcepackage (ui_html.py) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13016ABFAD@its-xchg4.massey.ac.nz> Message-ID: <200305050101.h4511GV22753@localhost.localdomain> >>> "Meyer, Tony" wrote > ui_html.py is created by resource_package and contains a zlib compressed > string, which is where the \xda character is. > > What should be done about this? > (a) Ignore the warning, and fix it when phase 2 of the PEP is > implemented.
> (b) Ask the resourcepackage people to update it so that it includes an > encoding. > (c) Something else. Either (b), or, as a workaround, write something to patch the generated file. Anthony -- Anthony Baxter It's never too late to have a happy childhood. From koschate at bigfoot.com Mon May 5 17:18:11 2003 From: koschate at bigfoot.com (Thomas Koschate) Date: Mon May 5 16:45:48 2003 Subject: [Spambayes] Chaos Manor & Spambayes Message-ID: I notice in the current issue of Chaos Manor, Jerry Pournelle gives Spambayes a mention, along with the Outlook plug-in. Unfortunately, Byte has gone to a subscription model, so I'm not sure how useful this link is: http://www.byte.com/documents/s=8221/byt1051823222963/0505_pournelle.html ============================================================= Thomas Koschate For PGP Key, see http://pgpkeys.mit.edu:11371/pks/lookup?op=get&search=0xF45280AD ============================================================= Work to ride. Ride to work. From lists at morpheus.demon.co.uk Mon May 5 23:09:14 2003 From: lists at morpheus.demon.co.uk (Paul Moore) Date: Mon May 5 17:15:11 2003 Subject: [Spambayes] PEP-0263 and resourcepackage (ui_html.py) References: <1ED4ECF91CDED24C8D012BCF2B034F13016ABFAD@its-xchg4.massey.ac.nz> <200305050101.h4511GV22753@localhost.localdomain> Message-ID: Anthony Baxter writes: >>>> "Meyer, Tony" wrote >> ui_html.py is created by resource_package and contains a zlib compressed >> string, which is where the /xda character is. >> >> What should be done about this? >> (a) Ignore the warning, and fix it when phase 2 of the PEP is >> implemented. >> (b) Ask the resourcepackage people to update it so that it includes an >> encoding. >> (c) Something else. > > Either (b), or, as a workaround, write something to patch the generated > file. Variation on (b) - get resourcepackage fixed to use \x escapes in strings rather than literal binary characters >127. 
I believe this is the correct answer (Python's default encoding has always been 7-bit ASCII, although pre-2.3, 8-bit data in strings was accepted). Paul. -- This signature intentionally left blank From T.A.Meyer at massey.ac.nz Tue May 6 12:52:49 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 5 19:53:28 2003 Subject: [Spambayes] PEP-0263 and resourcepackage (ui_html.py) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13016AC3CC@its-xchg4.massey.ac.nz> > > Either (b), or, as a workaround, write something to patch the > > generated file. > Variation on (b) - get resourcepackage fixed to use \x > escapes in strings rather than literal binary characters > >127. I believe this is the correct answer (Python's default > encoding has always been 7-bit ASCII, although pre-2.3, 8-bit > data in strings was accepted). I submitted a feature request on resource package to get this fixed, and then there was some discussion about the correct encoding on c.l.p. The \x escapes method was suggested, but the resource package author didn't like the idea much because it would add significantly to the size of the files. It seems that resource package will be updated to add an encoding comment (latin1) at some point in the near future. In the meantime, I'll add the comment to our resource packaged files so that we can avoid the deprecation warnings.
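To put a rough number on the size concern, here is a small Python sketch (current Python 3, not the 2.x of this thread) of the \x escape approach. It is not how resourcepackage works; the function name is made up, and it deliberately uses the simplest scheme, escaping every byte, which is the worst case for size.

```python
import ast
import zlib


def as_escaped_literal(data):
    """Render bytes as a quoted, pure-ASCII string literal of \\xNN escapes."""
    return '"' + "".join("\\x%02x" % byte for byte in data) + '"'


if __name__ == "__main__":
    # Stand-in for a packaged resource: some compressed HTML.
    payload = zlib.compress(b"<html><body>Review messages</body></html>" * 20)
    literal = as_escaped_literal(payload)

    # The literal is safe to embed in a source file with no encoding comment.
    print(literal.isascii())
    # It round-trips back to the original compressed bytes.
    print(ast.literal_eval("b" + literal) == payload)
    # Every byte costs four characters, plus the two quotes.
    print(len(payload), len(literal))
```

Escaping only the bytes that actually need it would reduce the overhead, but since compressed data is close to random, a large share of bytes would still be escaped. An encoding comment costs one line per file, which is presumably why it was the preferred fix.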
=Tony Meyer From noreply at sourceforge.net Tue May 6 07:04:08 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 6 09:04:16 2003 Subject: [Spambayes] [ spambayes-Bugs-733247 ] crash when using merged-in options Message-ID: Bugs item #733247, was opened at 2003-05-06 15:04 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=733247&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Sjoerd Mullender (sjoerd) Assigned to: Tony Meyer (anadelonbrin) Summary: crash when using merged-in options Initial Comment: In revision 1.48 a bug was introduced that causes a crash when you have used merge_files to merge in options from a file. The problem is, the call to self._oldset sets the option to the string form of the value just after it had been set correctly in the call to self.set one line earlier. The fix (I think) is to apply the attached patch. Also, testing your changes is a good idea... ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=733247&group_id=61702 From jesper.persson at gmx.net Tue May 6 10:21:27 2003 From: jesper.persson at gmx.net (Jesper) Date: Tue May 6 09:51:42 2003 Subject: [Spambayes] Problem with latest CVS Message-ID: <000801c313a0$1c4b5520$63c411c2@helsingborgshem.net> Hello, I had problems with the latest CVS earlier giving me a "500 server error" when trying to configure through the web interface but that was fixed. Now I get it when I try to review the messages. I've pasted the error message below. I use the latest CVS and Perl 5.8 on a Win2k machine. Hope it's ok to mail you directly like this, I don't have access to newsgroups. Thanks for a promising spam filter, keep up the good work! 
Regards, Jesper Persson 500 Server error Traceback (most recent call last): File "c:\spambayes\spambayes\Dibbler.py", line 398, in found_terminator getattr(plugin, name)(**params) File "c:\spambayes\spambayes\ProxyUI.py", line 350, in onReview judgement = message[options.hammie_header_name] AttributeError: 'OptionsClass' object has no attribute 'hammie_header_name' From jjl at pobox.com Tue May 6 14:37:48 2003 From: jjl at pobox.com (John J Lee) Date: Tue May 6 09:51:54 2003 Subject: [Spambayes] spambayes.messageinfo.db & FAQ.txt Message-ID: I tried out spambayes (with imapfilter.py & Python 2.2), and nothing surprised me except the fact that spambayes.messageinfo.db gets dropped in the local directory rather than in some location configured by an option. I initially assumed that this must mean spambayes has been double-counting (or multiple-counting) messages (since I ran imapfilter.py from several locations, littering spambayes.messageinfo.dbs all over the place). I haven't noticed it doing that, though -- so what is this DB for? Or does it get rebuilt from the messages on the server if it's not found in the current directory? In which case, again, what is its purpose? I did notice the comment in message.py that there are plans for a central DB, but in the mean time, perhaps it would be nice to append something like this to FAQ.txt: >---------------CUT HERE----------- Q: I've made a configuration file. Anything else I need to do? A: Yes, you need to remember that spambayes currently always keeps its log of seen messages in a file "spambayes.messageinfo.db" in the current directory, so it's best to run spambayes tools from the same directory every time. 
>---------------CUT HERE----------- John From noreply at sourceforge.net Tue May 6 08:18:40 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 6 10:18:49 2003 Subject: [Spambayes] [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py Message-ID: Bugs item #725466, was opened at 2003-04-22 08:07 Message generated for change (Comment added) made by sgruendel You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: Include a proper locale fix in Options.py Initial Comment: When reading the options, the float() call fails when the locale is a language that uses a ',' for a separator instead of '.'. This is hack-fixed in Outlook, but needs to be fixed in general. I imagine that there must be some sort of locale call that will convert between the current locale and English, and that this should be called as the option is set. Anyway, I'll get to this when I can. ---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-06 16:18 Message: Logged In: YES user_id=671479 It seems like it's not enough to fix Options.py only. 
I did this and then got the following error:

Traceback (most recent call last):
  File "D:\dev\Python22\lib\site-packages\win32com\universal.py", line 170, in dispatch
    retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_
    return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_
    return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_
    return apply(func, args)
  File "D:\util\spambayes-1.0a2\Outlook2000\addin.py", line 611, in OnConnection
    self.manager = manager.GetManager(application)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 335, in GetManager
    _mgr = BayesManager(outlook=outlook, verbose=verbose)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 79, in __init__
    import_core_spambayes_stuff(self.ini_filename)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 53, in import_core_spambayes_stuff
    from spambayes.tokenizer import tokenize
  File "D:\util\spambayes-1.0a2\spambayes\tokenizer.py", line 7, in ?
    import email.Message
  File "D:\dev\Python22\Lib\email\Message.py", line 14, in ?
    from email import Utils
  File "D:\dev\Python22\Lib\email\Utils.py", line 10, in ?
    import random
  File "D:\dev\Python22\Lib\random.py", line 93, in ?
    _verify('NV_MAGICCONST', NV_MAGICCONST, 1.71552776992141)
  File "D:\dev\Python22\Lib\random.py", line 88, in _verify
    raise ValueError(
exceptions.ValueError: computed value for NV_MAGICCONST deviates too much (computed 2,82843, expected 1)

Obviously random.py uses ',' too! I'm using German Windows/Outlook; if I can be of any help in testing, just drop me a mail.
---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-24 08:30 Message: Logged In: YES user_id=552329 Options.py now uses locale.atoi and locale.atof to convert options. I *think* this will solve this problem, but I'm not 100%. If someone could do some testing, that would be great. I'm leaving open until I'm sure it's done. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 From T.A.Meyer at massey.ac.nz Wed May 7 12:05:24 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 6 19:08:38 2003 Subject: [Spambayes] Problem with latest CVS Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181BA28@its-xchg4.massey.ac.nz> > Hope it's ok to mail you directly like this, I don't have > access to newsgroups. Of course. The alternative would be to submit a bug report via sf: > AttributeError: 'OptionsClass' object has no attribute > 'hammie_header_name' Fixed in cvs. =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 7 12:21:53 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 6 19:22:28 2003 Subject: [Spambayes] spambayes.messageinfo.db & FAQ.txt Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181BA3F@its-xchg4.massey.ac.nz> > I tried out spambayes (with imapfilter.py & Python 2.2), and > nothing surprised me except the fact that > spambayes.messageinfo.db gets dropped in the local directory > rather than in some location configured by an option. Yeah, I'd noticed this too, but hadn't got round to doing anything about it. It's nice to hear that imapfilter is working for you... :) > what is this DB for? Or > does it get rebuilt from the messages on the server if it's > not found in the current directory? In which case, again, > what is its purpose? 
It stores information about messages that spambayes apps have seen (as opposed to the 'hammie.db' database that stores word counts). Specifically, at the moment it stores whether a message of a given id has been classified and/or trained. It doesn't get rebuilt. I suspect that the reason you didn't notice anything is that re-classification of messages should (unless training changes things) result in the same classification, and often messages that have been classified (and moved) are no longer in folders that are being filtered anyway. It might have meant that the 'hammie.db' database ended up being trained with multiple copies of messages, however. > I did notice the comment in message.py that there are plans > for a central DB, but in the mean time, perhaps it would be > nice to append something like this to FAQ.txt: I think this is the reason that it was left like this. If I understand things correctly, the aim is to have a single file that contains this database, the 'hammie.db' one, and whatever else turns up. For the moment, what I'll do is add an option that specifies where the db should go (we can remove the option if the file later disappears). This will default to just "spambayes.messageinfo.db" (so still in the CWD), but you can set it with a full path to fix this problem. It'll be in the "Storage" section, under the name "messageinfo_storage_file". 
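As a rough illustration of the option described above, here is a sketch that reads it with the standard configparser module. This is not the Spambayes Options code; the function name messageinfo_path and the constant DEFAULT_NAME are assumptions made for the example. Only the section name, the option name, and the default file name come from the message.

```python
import configparser

# Default taken from the message: a bare file name, so it is still resolved
# against the current working directory.
DEFAULT_NAME = "spambayes.messageinfo.db"


def messageinfo_path(config_text: str) -> str:
    """Return the message-info database path from the config, or the default."""
    parser = configparser.ConfigParser()
    parser.read_string(config_text)
    # fallback covers both a missing [Storage] section and a missing option.
    return parser.get("Storage", "messageinfo_storage_file", fallback=DEFAULT_NAME)


example = """
[Storage]
messageinfo_storage_file: path/to/my/file/messageinfo.db
"""

print(messageinfo_path(example))  # path taken from the config text
print(messageinfo_path(""))       # no setting, so the default name is used
```

Giving a full path in the config is what stops the database from being scattered across every directory the tools are run from.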
I'll add it to the web ui for imapfilter and pop3proxy, but you can also add this to your config file: [Storage] messageinfo_storage_file: path/to/my/file/messageinfo.db =Tony Meyer From noreply at sourceforge.net Tue May 6 17:37:19 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 6 19:37:48 2003 Subject: [Spambayes] [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py Message-ID: Bugs item #725466, was opened at 2003-04-22 18:07 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: Include a proper locale fix in Options.py Initial Comment: When reading the options, the float() call fails when the locale is a language that uses a ',' for a separator instead of '.'. This is hack-fixed in Outlook, but needs to be fixed in general. I imagine that there must be some sort of locale call that will convert between the current locale and English, and that this should be called as the option is set. Anyway, I'll get to this when I can. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-07 11:37 Message: Logged In: YES user_id=552329 This makes me wonder if the locale just needs to be set, but that seems too heavy-handed and ugly, really. What happens if you just execute >>> import random from the interpreter (e.g. in PythonWin or IDLE)? I suspect that you'll get the same trace, in which case this really should be fixed in random.py, not in Spambayes (although it would be easy enough to throw together a hack for it). 
---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-07 02:18 Message: Logged In: YES user_id=671479 It seems like it's not enough to fix Options.py only. I did this and then got the following error: Traceback (most recent call last): File "D:\dev\Python22\lib\site- packages\win32com\universal.py", line 170, in dispatch retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None) File "D:\dev\Python22\lib\site- packages\win32com\server\policy.py", line 322, in _InvokeEx_ return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider) File "D:\dev\Python22\lib\site- packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "D:\dev\Python22\lib\site- packages\win32com\server\policy.py", line 541, in _invokeex_ return apply(func, args) File "D:\util\spambayes-1.0a2\Outlook2000\addin.py", line 611, in OnConnection self.manager = manager.GetManager(application) File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 335, in GetManager _mgr = BayesManager(outlook=outlook, verbose=verbose) File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 79, in __init__ import_core_spambayes_stuff(self.ini_filename) File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 53, in import_core_spambayes_stuff from spambayes.tokenizer import tokenize File "D:\util\spambayes-1.0a2\spambayes\tokenizer.py", line 7, in ? import email.Message File "D:\dev\Python22\Lib\email\Message.py", line 14, in ? from email import Utils File "D:\dev\Python22\Lib\email\Utils.py", line 10, in ? import random File "D:\dev\Python22\Lib\random.py", line 93, in ? 
_verify('NV_MAGICCONST', NV_MAGICCONST, 1.71552776992141) File "D:\dev\Python22\Lib\random.py", line 88, in _verify raise ValueError( exceptions.ValueError: computed value for NV_MAGICCONST deviates too much (computed 2,82843, expected 1) Obviously random.py uses ',' too! I'm using german Windows/Outlook, if I can be of any help in testing, just drop me a mail. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-24 18:30 Message: Logged In: YES user_id=552329 Options.py now uses locale.atoi and locale.atof to convert options. I *think* this will solve this problem, but I'm not 100%. If someone could do some testing, that would be great. I'm leaving open until I'm sure it's done. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 From noreply at sourceforge.net Tue May 6 17:47:03 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 6 19:47:13 2003 Subject: [Spambayes] [ spambayes-Bugs-733247 ] crash when using merged-in options Message-ID: Bugs item #733247, was opened at 2003-05-07 01:04 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=733247&group_id=61702 Category: None Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Sjoerd Mullender (sjoerd) Assigned to: Tony Meyer (anadelonbrin) Summary: crash when using merged-in options Initial Comment: In revision 1.48 a bug was introduced that causes a crash when you have used merge_files to merge in options from a file. The problem is, the call to self._oldset sets the option to the string form of the value just after it had been set correctly in the call to self.set one line earlier. The fix (I think) is to apply the attached patch. Also, testing your changes is a good idea... 
---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-07 11:47 Message: Logged In: YES user_id=552329 Fixed in 1.52. BTW I do test changes; I do as much testing as I can explicitly, and I also run pop3proxy, imapfilter and the Outlook plugin on current cvs, all day, every day. There's always a possibility that I'll miss something, however. Remember that this is cvs code, not release code (not even alpha, which could still have bugs). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=733247&group_id=61702 From tim at fourstonesExpressions.com Tue May 6 23:29:45 2003 From: tim at fourstonesExpressions.com (Tim Stone - Four Stones Expressions) Date: Tue May 6 23:30:24 2003 Subject: [Spambayes] spambayes.messageinfo.db & FAQ.txt In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130181BA3F@its-xchg4.massey.ac.nz> Message-ID: 5/6/2003 6:21:53 PM, "Meyer, Tony" wrote: >> I did notice the comment in message.py that there are plans >> for a central DB, but in the mean time, perhaps it would be >> nice to append something like this to FAQ.txt: > >I think this is the reason that it was left like this. If I understand >things correctly, the aim is to have a single file that contains this >database, the 'hammie.db' one, and whatever else turns up. I've started researching this... it will mean some fairly significant changes to things: it won't necessarily be non-disruptive of the existing codebase. (don't you love double negative logic?) c'est moi - TimS http://www.fourstonesExpressions.com http://wecanstopspam.org There are 10 kinds of people in the world: those who understand binary, and those who don't.
From T.A.Meyer at massey.ac.nz Wed May 7 17:08:08 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 7 00:10:48 2003 Subject: [Spambayes] spambayes.messageinfo.db & FAQ.txt Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181BBF1@its-xchg4.massey.ac.nz> > I've started researching this... it will mean some fairly > significant changes > to things: it won't necessarily be non-disruptive of the > existing codebase. (don't you love double negative logic?) Can you give some examples of the significant changes? =Tony Meyer From richard at jowsey.com Wed May 7 15:23:17 2003 From: richard at jowsey.com (Richard Jowsey) Date: Wed May 7 00:24:26 2003 Subject: [Spambayes] Spam probability distribution graphs Message-ID: <3EB916D5.5369.1B416E1B@localhost> Hi again Tony, I'm now obtaining real-time "snapshot" graphics of the spam probability distributions off of our 3 mail-filtering servers (USA, Australia & NZ). You might be interested in eye-balling some of these. An example is here: http://death2spam.net/docs/classifier.html There are some very interesting "regional differences" emerging from this analysis of the message classification data. And it's turning out that our "typical customer" is someone who averages ~65% spam in their inbox!!! 
Cheers, Richard From noreply at sourceforge.net Wed May 7 03:42:10 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 7 05:42:21 2003 Subject: [Spambayes] [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py Message-ID: Bugs item #725466, was opened at 2003-04-22 08:07 Message generated for change (Comment added) made by sgruendel You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: Include a proper locale fix in Options.py Initial Comment: When reading the options, the float() call fails when the locale is a language that uses a ',' for a separator instead of '.'. This is hack-fixed in Outlook, but needs to be fixed in general. I imagine that there must be some sort of locale call that will convert between the current locale and English, and that this should be called as the option is set. Anyway, I'll get to this when I can. ---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-07 11:42 Message: Logged In: YES user_id=671479 The import works fine both in IDLE and PythonWin. I suspect the handling of floats is somehow broken by something loaded by Outlook or Python Windows Extensions. But then I don't really know enough about the architecture to make educated guesses ... Anyway I wonder why float handling at this low level seems to be locale dependent? Parsing text and converting to float is OK to be locale dependent, but doing calculations? ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-07 01:37 Message: Logged In: YES user_id=552329 This makes me wonder if the locale just needs to be set, but that seems too heavy-handed and ugly, really.
What happens if you just execute >>> import random from the interpreter (e.g. in PythonWin or IDLE)? I suspect that you'll get the same trace, in which case this really should be fixed in random.py, not in Spambayes (although it would be easy enough to throw together a hack for it). ---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-06 16:18 Message: Logged In: YES user_id=671479 It seems like it's not enough to fix Options.py only. I did this and then got the following error: Traceback (most recent call last): File "D:\dev\Python22\lib\site- packages\win32com\universal.py", line 170, in dispatch retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None) File "D:\dev\Python22\lib\site- packages\win32com\server\policy.py", line 322, in _InvokeEx_ return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider) File "D:\dev\Python22\lib\site- packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "D:\dev\Python22\lib\site- packages\win32com\server\policy.py", line 541, in _invokeex_ return apply(func, args) File "D:\util\spambayes-1.0a2\Outlook2000\addin.py", line 611, in OnConnection self.manager = manager.GetManager(application) File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 335, in GetManager _mgr = BayesManager(outlook=outlook, verbose=verbose) File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 79, in __init__ import_core_spambayes_stuff(self.ini_filename) File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 53, in import_core_spambayes_stuff from spambayes.tokenizer import tokenize File "D:\util\spambayes-1.0a2\spambayes\tokenizer.py", line 7, in ? import email.Message File "D:\dev\Python22\Lib\email\Message.py", line 14, in ? from email import Utils File "D:\dev\Python22\Lib\email\Utils.py", line 10, in ? 
import random File "D:\dev\Python22\Lib\random.py", line 93, in ? _verify('NV_MAGICCONST', NV_MAGICCONST, 1.71552776992141) File "D:\dev\Python22\Lib\random.py", line 88, in _verify raise ValueError( exceptions.ValueError: computed value for NV_MAGICCONST deviates too much (computed 2,82843, expected 1) Obviously random.py uses ',' too! I'm using german Windows/Outlook, if I can be of any help in testing, just drop me a mail. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-24 08:30 Message: Logged In: YES user_id=552329 Options.py now uses locale.atoi and locale.atof to convert options. I *think* this will solve this problem, but I'm not 100%. If someone could do some testing, that would be great. I'm leaving open until I'm sure it's done. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 From noreply at sourceforge.net Wed May 7 04:49:08 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 7 06:49:16 2003 Subject: [Spambayes] [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py Message-ID: Bugs item #725466, was opened at 2003-04-22 16:07 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: Include a proper locale fix in Options.py Initial Comment: When reading the options, the float() call fails when the locale is a language that uses a ',' for a separator instead of '.'. This is hack-fixed in Outlook, but needs to be fixed in general. 
I imagine that there must be some sort of locale call that will convert between the current locale and English, and that this should be called as the option is set. Anyway, I'll get to this when I can. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-07 20:49 Message: Logged In: YES user_id=14198 What happens if you go into Pythonwin/IDLE, execute the locale call mentioned, then import that module? >>> import locale >>> locale.setlocale(locale.LC_NUMERIC, "en") ---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-07 19:42 Message: Logged In: YES user_id=671479 The import works fine both in IDLE and PythonWin. I suspect the handling of floats is somehow broken by something loaded by Outlook or Python Windows Extensions. But then I don't really know enough about the architecture to make educated guesses ... Anyway I wonder why float handling at this low level seems to be locale dependent? Parsing text and converting to float is OK to be locale dependent, but doing calculations? ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-07 09:37 Message: Logged In: YES user_id=552329 This makes me wonder if the locale just needs to be set, but that seems too heavy-handed and ugly, really. What happens if you just execute >>> import random from the interpreter (e.g. in PythonWin or IDLE)? I suspect that you'll get the same trace, in which case this really should be fixed in random.py, not in Spambayes (although it would be easy enough to throw together a hack for it). ---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-07 00:18 Message: Logged In: YES user_id=671479 It seems like it's not enough to fix Options.py only.
I did this and then got the following error:

Traceback (most recent call last):
  File "D:\dev\Python22\lib\site-packages\win32com\universal.py", line 170, in dispatch
    retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_
    return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_
    return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_
    return apply(func, args)
  File "D:\util\spambayes-1.0a2\Outlook2000\addin.py", line 611, in OnConnection
    self.manager = manager.GetManager(application)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 335, in GetManager
    _mgr = BayesManager(outlook=outlook, verbose=verbose)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 79, in __init__
    import_core_spambayes_stuff(self.ini_filename)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 53, in import_core_spambayes_stuff
    from spambayes.tokenizer import tokenize
  File "D:\util\spambayes-1.0a2\spambayes\tokenizer.py", line 7, in ?
    import email.Message
  File "D:\dev\Python22\Lib\email\Message.py", line 14, in ?
    from email import Utils
  File "D:\dev\Python22\Lib\email\Utils.py", line 10, in ?
    import random
  File "D:\dev\Python22\Lib\random.py", line 93, in ?
    _verify('NV_MAGICCONST', NV_MAGICCONST, 1.71552776992141)
  File "D:\dev\Python22\Lib\random.py", line 88, in _verify
    raise ValueError(
exceptions.ValueError: computed value for NV_MAGICCONST deviates too much (computed 2,82843, expected 1)

Obviously random.py uses ',' too! I'm using German Windows/Outlook; if I can be of any help in testing, just drop me a mail.
---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-24 16:30 Message: Logged In: YES user_id=552329 Options.py now uses locale.atoi and locale.atof to convert options. I *think* this will solve this problem, but I'm not 100%. If someone could do some testing, that would be great. I'm leaving open until I'm sure it's done. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 From David.Bear at asu.edu Wed May 7 10:39:23 2003 From: David.Bear at asu.edu (David Bear) Date: Wed May 7 12:40:24 2003 Subject: [Spambayes] train on form rather than content? Message-ID: <20030507093923.C19674@asu.edu> I receive some mail in Outlook. I forwarded a MIME-digest selection of Outlook mail from Outlook to my Unix box running procmail/spambayes. Funny thing is that it was incorrectly classified as spam (a false positive). This, after I thought I'd trained the filter pretty well. I assumed that spambayes would be more likely to mark something as 'unsure' than as outright spam. Should I try to retrain? Or, more precisely, does the Bayesian distribution also look at 'form' and not just content, i.e. does the mail contain lots of quoted-printable parts, lots of HTML, lots of Microsoft format garbage...? -- David Bear phone: 480-965-8257 fax: 480-965-9189 College of Public Programs/ASU Wilson Hall 232 Tempe, AZ 85287-0803 "Beware the IP portfolio, it will make criminals out of innovators" From hanche at math.ntnu.no Thu May 8 00:59:14 2003 From: hanche at math.ntnu.no (Harald Hanche-Olsen) Date: Wed May 7 17:59:21 2003 Subject: [Spambayes] (non-)spam count would go negative! Message-ID: <20030507235914X.hanche@math.ntnu.no> I am just getting started playing around with spambayes. I have a folder with about 16000 spam, plus various folders of non-spam (er, ham you call it, about 20000 in all).
So I ran mboxtrain.py over the whole collection. Then I checked the result using hammie.py -u. Of course, in my spam folder I found a small handful of nonspam messages, and in my nonspam folders I found a somewhat larger handful of spams. So I moved these messages to their rightful folders, deleted the database, and retrained. But this time, mboxtrain.py dies with a message (non-)spam count would go negative! when it comes across one of the reclassified messages. After a look in the source code and a visit to the mailing list archives, I guess I understand how this happened. So now I am steering away from mboxtrain and using hammie.py for the training instead. Is the notion of marking messages used for training with a header line really well thought out? From my limited experience with this "feature", I would suggest not. Otherwise I like what I see so far: I see 0.01% of hams incorrectly labeled as spam, 0.3% of spams labeled ham, and somewhat less than 1% of either category marked as unsure. Not bad at all, I think. - Harald From T.A.Meyer at massey.ac.nz Thu May 8 12:18:33 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 7 19:19:19 2003 Subject: [Spambayes] train on form rather than content? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181BDA5@its-xchg4.massey.ac.nz> > or more precisely, > does the bayesian distribution look also on 'form' and not > just content, ie does the mail contain lots of > quoted-printable parts, lots of html, lots of microsoft > format garbage...? If you're using Outlook, then I would *highly* recommend using the Outlook plugin. As one example of the benefits, you'd be able to select a 'strangely classified' message and 'show clues' for it, so you could see why it was classified as it was. But, yes there are a lot of tokens generated from things other than the words in the message.
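To make the idea of 'form' tokens concrete, here is a small sketch using only the standard email package. It is not the Spambayes tokenizer, and the token names are made up for illustration; it only shows that a message's MIME structure can produce tokens independently of its words.

```python
import email


def form_tokens(raw_message: str):
    """Yield illustrative tokens describing the MIME structure of a message."""
    msg = email.message_from_string(raw_message)
    for part in msg.walk():  # the message itself, then every sub-part
        yield "content-type:" + part.get_content_type()
        encoding = part.get("Content-Transfer-Encoding")
        if encoding:
            yield "content-transfer-encoding:" + encoding.strip().lower()


raw = "\n".join([
    "MIME-Version: 1.0",
    'Content-Type: multipart/alternative; boundary="b"',
    "",
    "--b",
    "Content-Type: text/plain",
    "",
    "hello",
    "--b",
    "Content-Type: text/html",
    "Content-Transfer-Encoding: quoted-printable",
    "",
    "<b>hello</b>",
    "--b--",
    "",
])

tokens = list(form_tokens(raw))
print(tokens)
```

A forwarded digest full of HTML and quoted-printable parts would produce many tokens of this kind, which is one way such a message could score as spam even when its text looks innocent.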
If you want to know exactly what, take a look at tokenizer.py (which also has a lot of interesting comments about why a particular token is/isn't generated). =Tony Meyer From T.A.Meyer at massey.ac.nz Thu May 8 12:31:49 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 7 19:33:00 2003 Subject: [Spambayes] (non-)spam count would go negative! Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181BDBE@its-xchg4.massey.ac.nz> > So I moved these [incorrectly classified] > messages to their rightful folders, deleted the > database, and retrained. But this time, mboxtrain.py dies > with a message > > (non-)spam count would go negative! > > when it comes across one of the reclassified messages. The reason this is happening is because mboxtrain recognises that you have moved messages and tries to untrain before it does the new train. There are two solutions to this: * Don't delete the database. If you just move the messages and retrain, mboxtrain will find the old header, realise that the classification has changed, and do the appropriate unlearn/learn. * Delete the database and use the -f switch. If you look in the doc (-h), you'll see that this is for rebuilding from scratch. Basically this ignores the headers so that the appropriate untraining is not done. > Is the notion of marking messages used > for training with a header line really well thought out? > From my limited experience with this "feature", I would suggest not. It's really not a problem with marking them with a header line. (On the other hand, it could be a problem that the documentation isn't clear enough about what should be done in this sort of situation). However, there is a project at the moment to hold this sort of training information in a database of its own (for other reasons). It's being tested with pop3proxy and imapfilter at first, but the aim is that it will eventually be used by mboxtrain/hammie/Outlook/everything else.
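The failure and the two solutions above can be reproduced with a toy model. This is only a sketch under stated assumptions, not mboxtrain's actual code: the class ToyClassifier, the header name X-Toy-Trained, and the force argument (standing in for the -f switch) are all hypothetical.

```python
class ToyClassifier:
    """Keeps only message counts; word counts are omitted for brevity."""

    def __init__(self):
        self.nspam = 0
        self.nham = 0

    def learn(self, is_spam):
        if is_spam:
            self.nspam += 1
        else:
            self.nham += 1

    def unlearn(self, is_spam):
        if is_spam:
            if self.nspam <= 0:
                raise ValueError("spam count would go negative!")
            self.nspam -= 1
        else:
            if self.nham <= 0:
                raise ValueError("non-spam count would go negative!")
            self.nham -= 1


TRAINED_HEADER = "X-Toy-Trained"  # hypothetical header name


def train(classifier, msg, is_spam, force=False):
    """Train on msg (a dict of headers), untraining first if the header says
    it was previously trained the other way. force=True ignores the header."""
    wanted = "spam" if is_spam else "ham"
    previous = None if force else msg.get(TRAINED_HEADER)
    if previous == wanted:
        return False                          # already trained this way
    if previous is not None:
        classifier.unlearn(previous == "spam")
    classifier.learn(is_spam)
    msg[TRAINED_HEADER] = wanted
    return True


msg = {"Subject": "hello"}
db = ToyClassifier()
train(db, msg, is_spam=True)            # first pass: trained as spam

fresh = ToyClassifier()                 # the database was deleted
try:
    train(fresh, msg, is_spam=False)    # header still says "spam"
except ValueError as exc:
    error = str(exc)

train(fresh, msg, is_spam=False, force=True)  # rebuild from scratch
print(error, fresh.nspam, fresh.nham)
```

The header survives in the message after the database is gone, so the untrain step runs against an empty database; ignoring the header avoids that.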
=Tony Meyer

From tdickenson at devmail.geminidataloggers.co.uk Thu May 8 10:25:10 2003
From: tdickenson at devmail.geminidataloggers.co.uk (Toby Dickenson)
Date: Thu May 8 04:25:14 2003
Subject: [Spambayes] (non-)spam count would go negative!
In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130181BDBE@its-xchg4.massey.ac.nz>
References: <1ED4ECF91CDED24C8D012BCF2B034F130181BDBE@its-xchg4.massey.ac.nz>
Message-ID: <200305080925.10301.tdickenson@devmail.geminidataloggers.co.uk>

On Thursday 08 May 2003 12:31 am, Meyer, Tony wrote:
> The reason this is happening is because mboxtrain recognises that you
> have moved messages and tries to untrain before it does the new train.

Unless I'm missing something, a further problem is that spammers could add this header to subvert the training process. Right?

From noreply at sourceforge.net Thu May 8 11:45:56 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu May 8 13:46:04 2003
Subject: [Spambayes] [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py
Message-ID:

Bugs item #725466, was opened at 2003-04-22 08:07
Message generated for change (Comment added) made by sgruendel
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Tony Meyer (anadelonbrin)
Assigned to: Tony Meyer (anadelonbrin)
Summary: Include a proper locale fix in Options.py

Initial Comment:
When reading the options, the float() call fails when the locale is a language that uses a ',' for a separator instead of '.'. This is hack-fixed in Outlook, but needs to be fixed in general. I imagine that there must be some sort of locale call that will convert between the current locale and English, and that this should be called as the option is set. Anyway, I'll get to this when I can.
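As a minimal sketch of the kind of conversion the initial comment asks for, the function below turns an option string into a float regardless of which decimal separator it was written with. The name `option_to_float` and its `decimal_point` parameter are hypothetical and not part of Options.py; `locale.atof()` is the standard-library call that follows the current LC_NUMERIC setting. Note that on a current Python, plain `float()` always expects '.', so the sketch shows the normalisation step rather than reproducing the 2003 failure.

```python
import locale


def option_to_float(text, decimal_point=None):
    """Convert an option string such as "0.90" or "0,90" to a float.

    decimal_point is the separator used in text. If it is None, the
    conversion is left to locale.atof(), which follows the current
    LC_NUMERIC locale.
    """
    text = text.strip()
    if decimal_point is None:
        return locale.atof(text)
    # Normalise the separator so that float() always sees a '.'.
    return float(text.replace(decimal_point, "."))


if __name__ == "__main__":
    print(option_to_float("0,90", decimal_point=","))
    print(option_to_float("0.90", decimal_point="."))
```

Thousands separators are deliberately not handled here; option values in a configuration file are assumed not to contain them.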
----------------------------------------------------------------------

Comment By: Stefan Gründel (sgruendel)
Date: 2003-05-08 19:45

Message:
Logged In: YES
user_id=671479

>>> import locale
>>> locale.setlocale(locale.LC_NUMERIC, "en")
'English_United States.1252'
>>> import random
>>>

----------------------------------------------------------------------

Comment By: Mark Hammond (mhammond)
Date: 2003-05-07 12:49

Message:
Logged In: YES
user_id=14198

What happens if you go into Pythonwin/IDLE, execute the locale call mentioned, then import that module?

>>> import locale
>>> locale.setlocale(locale.LC_NUMERIC, "en")

----------------------------------------------------------------------

Comment By: Stefan Gründel (sgruendel)
Date: 2003-05-07 11:42

Message:
Logged In: YES
user_id=671479

The import works fine both in IDLE and PythonWin. I suspect the handling of floats is somehow broken by something loaded by Outlook or Python Windows Extensions. But then I don't really know enough about the architecture to make educated guesses ...

Anyway I wonder why float handling at this low level seems to be locale dependent? Parsing text and converting to float is ok to be locale dependent, but doing calculations?

----------------------------------------------------------------------

Comment By: Tony Meyer (anadelonbrin)
Date: 2003-05-07 01:37

Message:
Logged In: YES
user_id=552329

This makes me wonder if the locale just needs to be set, but that seems too heavy-handed and ugly, really. What happens if you just execute

>>> import random

from the interpreter (e.g. in PythonWin or IDLE)? I suspect that you'll get the same trace, in which case this really should be fixed in random.py, not in Spambayes (although it would be easy enough to throw together a hack for it).
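A somewhat less heavy-handed variant of "just set the locale" is to pin only LC_NUMERIC for the duration of the sensitive code and then restore whatever was there before. The sketch below assumes the portable "C" locale rather than the Windows-specific "en" used in the comments above; `numeric_decimal_point` and `pinned_numeric_locale` are hypothetical helper names. Keep in mind that `locale.setlocale()` changes process-wide state, so this is not safe if other threads depend on the locale at the same time.

```python
import locale
from contextlib import contextmanager


def numeric_decimal_point():
    """Return the decimal separator of the current LC_NUMERIC locale."""
    return locale.localeconv()["decimal_point"]


@contextmanager
def pinned_numeric_locale(name="C"):
    """Temporarily switch LC_NUMERIC to name, then restore the old value."""
    # Calling setlocale() without a second argument only queries the setting.
    saved = locale.setlocale(locale.LC_NUMERIC)
    locale.setlocale(locale.LC_NUMERIC, name)
    try:
        yield
    finally:
        locale.setlocale(locale.LC_NUMERIC, saved)


if __name__ == "__main__":
    with pinned_numeric_locale("C"):
        import random  # the import discussed in this thread
        print(repr(numeric_decimal_point()))
```

Checking `numeric_decimal_point()` before and after a suspect call is also a cheap way to find out which call changed the locale.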
----------------------------------------------------------------------

Comment By: Stefan Gründel (sgruendel)
Date: 2003-05-06 16:18

Message:
Logged In: YES
user_id=671479

It seems like it's not enough to fix Options.py only. I did this and then got the following error:

Traceback (most recent call last):
  File "D:\dev\Python22\lib\site-packages\win32com\universal.py", line 170, in dispatch
    retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_
    return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_
    return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_
    return apply(func, args)
  File "D:\util\spambayes-1.0a2\Outlook2000\addin.py", line 611, in OnConnection
    self.manager = manager.GetManager(application)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 335, in GetManager
    _mgr = BayesManager(outlook=outlook, verbose=verbose)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 79, in __init__
    import_core_spambayes_stuff(self.ini_filename)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 53, in import_core_spambayes_stuff
    from spambayes.tokenizer import tokenize
  File "D:\util\spambayes-1.0a2\spambayes\tokenizer.py", line 7, in ?
    import email.Message
  File "D:\dev\Python22\Lib\email\Message.py", line 14, in ?
    from email import Utils
  File "D:\dev\Python22\Lib\email\Utils.py", line 10, in ?
    import random
  File "D:\dev\Python22\Lib\random.py", line 93, in ?
    _verify('NV_MAGICCONST', NV_MAGICCONST, 1.71552776992141)
  File "D:\dev\Python22\Lib\random.py", line 88, in _verify
    raise ValueError(
exceptions.ValueError: computed value for NV_MAGICCONST deviates too much (computed 2,82843, expected 1)

Obviously random.py uses ',' too! I'm using german Windows/Outlook, if I can be of any help in testing, just drop me a mail.

----------------------------------------------------------------------

Comment By: Tony Meyer (anadelonbrin)
Date: 2003-04-24 08:30

Message:
Logged In: YES
user_id=552329

Options.py now uses locale.atoi and locale.atof to convert options. I *think* this will solve this problem, but I'm not 100%. If someone could do some testing, that would be great. I'm leaving open until I'm sure it's done.

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702

From noreply at sourceforge.net Thu May 8 11:59:25 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu May 8 13:59:34 2003
Subject: [Spambayes] [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py
Message-ID:

Bugs item #725466, was opened at 2003-04-22 08:07
Message generated for change (Comment added) made by sgruendel
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Tony Meyer (anadelonbrin)
Assigned to: Tony Meyer (anadelonbrin)
Summary: Include a proper locale fix in Options.py

Initial Comment:
When reading the options, the float() call fails when the locale is a language that uses a ',' for a separator instead of '.'. This is hack-fixed in Outlook, but needs to be fixed in general. I imagine that there must be some sort of locale call that will convert between the current locale and English, and that this should be called as the option is set.
Anyway, I'll get to this when I can.

----------------------------------------------------------------------

Comment By: Stefan Gründel (sgruendel)
Date: 2003-05-08 19:59

Message:
Logged In: YES
user_id=671479

Ok, I've got it working by setting the locale in the constructor of class BayesManager (manager.py):

[...]
self.config_filename = config_base + "_configuration.pck"
print "locale: ", locale.getlocale()
print "default locale: ", locale.getdefaultlocale()
locale.setlocale(locale.LC_NUMERIC, "en")
# First read the configuration file.
[...]

Locale/default locale are both de_DE for me.
----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702

From David.Bear at asu.edu Thu May 8 15:54:36 2003
From: David.Bear at asu.edu (David Bear)
Date: Thu May 8 17:55:27 2003
Subject: [Spambayes] windows/outlook install
Message-ID: <20030508145436.B25834@asu.edu>

I first installed spambayes on my linux box -- using procmail integration. It went smoothly, though the install instructions were a bit 'more' than I wanted. I found the setup.py script, which after running made it all work well. Good job.

Then I went to the windows machine, installed python 2.2 and Mark Hammond's win32 extensions. Then I ran the setup.py script there. Okay. Then, from the outlook folder in the spambayes package, I ran addin.py.
The first thing I noticed was that the addin only affects Outlook for the currently logged on user/profile. I did this as admin, then logged back in as my standard user and there were no new buttons on the outlook bar. So, I ran addin again. Voila. Spam buttons. Terrific.

I started the spam manager and configured the spam and unsure folders.

Anything else I'm missing? (I'm trying to write step by step install instructions for windows/outlook/exchange users)

--
David Bear
phone: 480-965-8257
fax: 480-965-9189
College of Public Programs/ASU
Wilson Hall 232
Tempe, AZ 85287-0803
"Beware the IP portfolio, it will make criminals out of innovators"

From noreply at sourceforge.net Thu May 8 20:22:41 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu May 8 22:22:56 2003
Subject: [Spambayes] [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py
Message-ID:

Bugs item #725466, was opened at 2003-04-22 18:07
Message generated for change (Comment added) made by anadelonbrin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Tony Meyer (anadelonbrin)
Assigned to: Tony Meyer (anadelonbrin)
Summary: Include a proper locale fix in Options.py

Initial Comment:
When reading the options, the float() call fails when the locale is a language that uses a ',' for a separator instead of '.'. This is hack-fixed in Outlook, but needs to be fixed in general. I imagine that there must be some sort of locale call that will convert between the current locale and English, and that this should be called as the option is set. Anyway, I'll get to this when I can.

----------------------------------------------------------------------

>Comment By: Tony Meyer (anadelonbrin)
Date: 2003-05-09 14:22

Message:
Logged In: YES
user_id=552329

What I don't understand is why I can't reproduce this error.
If I make the same addition to manager.py, but use 'german'[1] as the locale, everything still works. (It also still works if I make the change in addin.py where the other locale set is, and also if I change the locale to german in the windows control panel).

In addition, why does the import from PythonWin work? Surely this means (as mentioned) that something else that is imported is screwing up the locale setting before random gets imported - but then, shouldn't it do this on my machine, too?

Googling on this (NV_MAGICCONST) brings up lots of instances of the problem, but nothing in the way of a solution apart from setting the locale to 'en' or 'c' before the random import. This really doesn't seem like the correct solution. I really am tempted to try my luck with c.l.p for a solution, if there aren't any more ideas here.

[1] I can't set to "de_DE", I get a locale not supported error. "german" does use ',' as the decimal separator, so it should be the same, though.
----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702

From noreply at sourceforge.net Thu May 8 21:15:40 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu May 8 23:16:00 2003
Subject: [Spambayes] [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py
Message-ID:

Bugs item #725466, was opened at 2003-04-22 18:07
Message generated for change (Comment added) made by anadelonbrin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Tony Meyer (anadelonbrin)
Assigned to: Tony Meyer (anadelonbrin)
Summary: Include a proper locale fix in Options.py

Initial Comment:
When reading the options, the float() call fails when the locale is a language that uses a ',' for a separator instead of '.'. This is hack-fixed in Outlook, but needs to be fixed in general. I imagine that there must be some sort of locale call that will convert between the current locale and English, and that this should be called as the option is set.
Anyway, I'll get to this when I can.

----------------------------------------------------------------------

>Comment By: Tony Meyer (anadelonbrin)
Date: 2003-05-09 15:15

Message:
Logged In: YES
user_id=552329

Ok, forget that, I've found a way I can reproduce a similar error. This is definitely Outlook specific, and definitely cannot be fixed in the spambayes code, except by changing the locale to 'en' or 'c', or something similar.

Ignoring spambayes completely, and using the demo outlook plugin that comes with the win32com extensions, I printed out the results of math.log(2). This should be "0.69314718056" and not "0,69314718056". I made this print statement pretty much every second line. When adding, it always gives the correct answer. As soon as Outlook is started up, it gives the wrong answer.

It's beyond me whether this is something that the win32com stuff does (although importing the extensions doesn't cause it), or if it's something that Outlook does, somehow. No doubt you can answer that, Mark? ;)

Anyway, unless a change to the win32com stuff occurs, this will have to remain in the spambayes Outlook code. (The options stuff is a separate problem, and my fix for that should work). It doesn't need to be outside the Outlook code though.
----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702

From noreply at sourceforge.net Thu May 8 21:23:20 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu May 8 23:23:29 2003
Subject: [Spambayes] [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py
Message-ID:

Bugs item #725466, was opened at 2003-04-22 18:07
Message generated for change (Comment added) made by anadelonbrin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Tony Meyer (anadelonbrin)
Assigned to: Tony Meyer (anadelonbrin)
Summary: Include a proper locale fix in Options.py

Initial Comment:
When reading the options, the float() call fails when the locale is a language that uses a ',' for a separator instead of '.'. This is hack-fixed in Outlook, but needs to be fixed in general. I imagine that there must be some sort of locale call that will convert between the current locale and English, and that this should be called as the option is set. Anyway, I'll get to this when I can.

----------------------------------------------------------------------

>Comment By: Tony Meyer (anadelonbrin)
Date: 2003-05-09 15:23

Message:
Logged In: YES
user_id=552329

Ok, I did more digging. The locale set in addin.py isn't enough.
This line in msgstore.py screws up the locale: self.session = mapi.MAPILogonEx(0, None, None, logonFlags) (it's in the __init__ of MAPIMsgStore; my line numbers are all screwed up). (by 'screws up the locale', I mean that math.log(2) returns a number with a '.' decimal separator before it, and a ',' decimal separator afterwards). I'll leave it to Mark to figure out what should be done. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-09 15:15 Message: Logged In: YES user_id=552329 Ok, forget that, I've found a way I can reproduce a similar error. This is definately Outlook specific, and definately cannot be fixed in the spambayes code, except by changing the locale to 'en' or 'c', or something similar. Ignoring spambayes completely, and using the demo outlook plugin that comes with the win32com extensions, I printed out the results of math.log(2). This should be "0.69314718056 " and not "0,69314718056". I made this print statement pretty much every second line. When adding, it always gives the correct answer. As soon as Outlook is started up, it gives the wrong answer. It's beyond me whether this is something that the win32com stuff does (although importing the extensions doesn't cause it), or if it's something that Outlook does, somehow. No doubt you can answer that, Mark? ;) Anyway, unless a change to the win32com stuff occurs, this will have to remain in the spambayes Outlook code. (The options stuff is a separate problem, and my fix for that should work). It doesn't need to be outside the Outlook code though. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-09 14:22 Message: Logged In: YES user_id=552329 What I don't understand is why I can't reproduce this error. If I make the same addition to manager.py, but use 'german'[1] as the locale, everything still works. 
(It also still works if I make the change in addin.py where the other locale set is, and also if I change the locale to german in the windows control panel). In addition, why does the import from PythonWin work? Surely this means (as mentioned) that something else that is imported is screwing up the locale setting before random gets imported - but then, shouldn't it do this on my machine, too? Googling on this (NV_MAGICCONST) brings up lots of instances of the problem, but nothing in the way of a solution apart from setting the locale to 'en' or 'c' before the random import. This really doesn't seem like the correct solution. I really am tempted to try my luck with c.l.p for a solution, if there aren't any more ideas here. [1] I can't set to "de_DE", I get a locale not supported error. "german" does use ',' as the decimal separator, so it should be the same, though. ---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-09 05:59 Message: Logged In: YES user_id=671479 Ok, I've got it working by setting the locale in the constructor of class BayesManager (manager.py):

    [...]
    self.config_filename = config_base + "_configuration.pck"
    print "locale: ", locale.getlocale()
    print "default locale: ", locale.getdefaultlocale()
    locale.setlocale(locale.LC_NUMERIC, "en")
    # First read the configuration file.
    [...]

Locale/default locale are both de_DE for me. ---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-09 05:45 Message: Logged In: YES user_id=671479

    >>> import locale
    >>> locale.setlocale(locale.LC_NUMERIC, "en")
    'English_United States.1252'
    >>> import random
    >>>

---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-07 22:49 Message: Logged In: YES user_id=14198 What happens if you go into Pythonwin/IDLE, execute the locale call mentioned, then import that module?

    >>> import locale
    >>> locale.setlocale(locale.LC_NUMERIC, "en")

---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-07 21:42 Message: Logged In: YES user_id=671479 The import works fine both in IDLE and PythonWin. I suspect the handling of floats is somehow broken by something loaded by Outlook or Python Windows Extensions. But then I don't really know enough about the architecture to make educated guesses ... Anyway I wonder why float handling at this low level seems to be locale dependent? Parsing text and converting to float is ok to be locale dependent, but doing calculations? ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-07 11:37 Message: Logged In: YES user_id=552329 This makes me wonder if the locale just needs to be set, but that seems too heavy-handed and ugly, really. What happens if you just execute

    >>> import random

from the interpreter (e.g. in PythonWin or IDLE)? I suspect that you'll get the same trace, in which case this really should be fixed in random.py, not in Spambayes (although it would be easy enough to throw together a hack for it). ---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-07 02:18 Message: Logged In: YES user_id=671479 It seems like it's not enough to fix Options.py only.
I did this and then got the following error:

Traceback (most recent call last):
  File "D:\dev\Python22\lib\site-packages\win32com\universal.py", line 170, in dispatch
    retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_
    return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_
    return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_
    return apply(func, args)
  File "D:\util\spambayes-1.0a2\Outlook2000\addin.py", line 611, in OnConnection
    self.manager = manager.GetManager(application)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 335, in GetManager
    _mgr = BayesManager(outlook=outlook, verbose=verbose)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 79, in __init__
    import_core_spambayes_stuff(self.ini_filename)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 53, in import_core_spambayes_stuff
    from spambayes.tokenizer import tokenize
  File "D:\util\spambayes-1.0a2\spambayes\tokenizer.py", line 7, in ?
    import email.Message
  File "D:\dev\Python22\Lib\email\Message.py", line 14, in ?
    from email import Utils
  File "D:\dev\Python22\Lib\email\Utils.py", line 10, in ?
    import random
  File "D:\dev\Python22\Lib\random.py", line 93, in ?
    _verify('NV_MAGICCONST', NV_MAGICCONST, 1.71552776992141)
  File "D:\dev\Python22\Lib\random.py", line 88, in _verify
    raise ValueError(
exceptions.ValueError: computed value for NV_MAGICCONST deviates too much (computed 2,82843, expected 1)

Obviously random.py uses ',' too! I'm using german Windows/Outlook, if I can be of any help in testing, just drop me a mail.
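The two numbers in that ValueError are consistent with float literals being read only as far as the '.', so that -0.5 became -0 and the expected 1.71552776992141 became 1. That reading is an assumption; nothing in the thread confirms it. The arithmetic below only checks that it reproduces the reported values. The formula is the one random.py uses for NV_MAGICCONST; the variable names are illustrative and the code is written for a current Python.

```python
import math

# NV_MAGICCONST as random.py defines it: 4 * exp(-0.5) / sqrt(2.0)
intended = 4 * math.exp(-0.5) / math.sqrt(2.0)

# Hypothetical: the same expression if "-0.5" is cut off at the '.' and
# read as -0 (reading "2.0" as 2 changes nothing).
truncated = 4 * math.exp(-0.0) / math.sqrt(2.0)

print(round(intended, 5))   # 1.71553
print(round(truncated, 5))  # 2.82843
```

The second value matches the "computed 2,82843" in the traceback, and cutting 1.71552776992141 at the '.' gives the "expected 1".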
---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-24 18:30 Message: Logged In: YES user_id=552329 Options.py now uses locale.atoi and locale.atof to convert options. I *think* this will solve this problem, but I'm not 100%. If someone could do some testing, that would be great. I'm leaving open until I'm sure it's done. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 From trebor at animeigo.com Fri May 9 07:12:08 2003 From: trebor at animeigo.com (Robert Woodhead) Date: Fri May 9 09:36:18 2003 Subject: [Spambayes] 6.0 of Eudora has bayesian. In-Reply-To: <1052454770.6659.25.camel@gary> References: <20030505154457.89211.qmail@mail.archub.org> <1052454770.6659.25.camel@gary> Message-ID: Just a headsup, the new 6.0beta of Eudora has bayesian filtering built in, and it looks fairly impressive. False positives so far (out of perhaps 500 emails): 1) first email from paypal ('you've got cash'). Told the filter it wasn't junk, problem over. 2) a very short email from someone asking if he'd been scammed by one of these makemoneyfast sites. Both of these floated to the top of the junk mailbox due to secondary filtering using my TOLD filter tags, so they were easy to pick out. It comes pretrained but learns quickly, apparently. http://www.eudora.com/betas/ for those inclined to play. -- =========================================================== Robert Woodhead, CEO, AnimEigo http://www.animeigo.com/ =========================================================== http://selfpromotion.com/ The Net's only URL registration SHARESERVICE. A power tool for power webmasters. 
From bernadotte at gmx.de Fri May 9 17:00:22 2003 From: bernadotte at gmx.de (bernadotte) Date: Fri May 9 10:17:22 2003 Subject: [Spambayes] installation problems Message-ID: hi guys, i feel a little ashamed to write you, but after 2 days of trying everything, i give up ! found nothing helpful about it on the mentioned website or anywhere else on the web. i downloaded and installed: http://www.python.org/ftp/python/2.3/Python-2.3b1.exe http://starship.python.net/crew/mhammond/downloads/win32all-153.exe http://starship.python.net/crew/mhammond/downloads/SpamBayes-Outlook-Setup-002.exe my system is win xp 5.1.2600 german running outlook 2002, version 10.0. the requirements for spambayes were a little confusing for me, so i think it's meant for python experts, which i'm not (got no idea what it is, anyway i like it ! ;-) ) installing the outlook add-in produces the following error-box: c:programme\spambayes outlook addin\spambayes_addin.dll unable to register the dll/ocx:dllregisterserver failed; code 0x00000000 trying to load the dll from inside outlook stopped with a warning to close down outlook. the additional info mentioned python23.dll. no spambayes log-file can be found on the system. hoping that this is a more frequent problem and not just mine greetz bernd From hanche at math.ntnu.no Sat May 10 15:45:28 2003 From: hanche at math.ntnu.no (Harald Hanche-Olsen) Date: Sat May 10 08:45:32 2003 Subject: [Spambayes] (non-)spam count would go negative! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130181BDBE@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130181BDBE@its-xchg4.massey.ac.nz> Message-ID: <20030510144528F.hanche@math.ntnu.no> + "Meyer, Tony" : | > (non-)spam count would go negative! | > | > when it gets across one of the reclassified messages. | | The reason this is happening is because mboxtrain recognises that you | have moved messages and tries to untrain before it does the new train. [...]
| (On the other hand, it could be a problem that the documentation | isn't clear enough about what should be done in this sort of | situation). Definitely a problem with the documentation, then. Of course that is to be expected from software that is still under development, so I'm not complaining. This all happened because I have my mail in MH folders (my MUA of choice is Mew), and as I could not find out how hammie could read MH folders (but have found out since, from digging deeper into the sources), I decided to go with mboxtrain instead, not realizing the difference between mboxtrain and the training mode of hammie. All of which provides a nice little usability lesson, but does not detract from the technical qualities of the software. I hope my grumpy tone did not offend anyone. It had been a bad day. But already, spambayes is improving my life and mood considerably! Thanks guys. Oh, and by the way, having mboxtrain poke at all the messages in my MH folders meant the following incremental backup was suddenly faced with about 24000 changed files, which I'm sure made me really popular with the backup manager. (I haven't dared ask. 8-) - Harald From hanche at math.ntnu.no Sat May 10 15:54:57 2003 From: hanche at math.ntnu.no (Harald Hanche-Olsen) Date: Sat May 10 08:55:00 2003 Subject: [Spambayes] Header cropping Message-ID: <20030510145457A.hanche@math.ntnu.no> I found the following message in my unsure folder today. X-Spambayes-Classification: unsure; 0.86 This is a multi-part message in MIME format. --3A._5F.64A._C_D.DE._AE-- This is a multi-part message in MIME format. --3A._5F.64A._C_D.DE._AE-- Yes, that is the entire message, headers and all (I just indented it for your reading pleasure). According to the logs, the message was 1310 bytes long when it arrived. At the very least, there should be a couple Received headers as well as Delivered-To headers in there. Also, the log shows there was a Message-Id header. 
The envelope sender (from the log) strongly indicates it was indeed spam, so I don't regret the loss of this particular message. But I find it somewhat alarming. I assume the problem arose because of some malformed header. But malformed headers sometimes occur in ham mail too, and I'd still like to receive these messages without further damage. Any thoughts? - Harald PS. My spambayes source is from CVS as of last Wednesday. From tim at fourstonesExpressions.com Sat May 10 09:10:54 2003 From: tim at fourstonesExpressions.com (Tim Stone - Four Stones Expressions) Date: Sat May 10 09:11:00 2003 Subject: [Spambayes] Header cropping In-Reply-To: <20030510145457A.hanche@math.ntnu.no> Message-ID: 5/10/2003 7:54:57 AM, Harald Hanche-Olsen wrote: >Any thoughts? Are you using the outlook plugin, pop3proxy, imapfilter, mboxtrain, or hammie? c'est moi - TimS http://www.fourstonesExpressions.com http://wecanstopspam.org There are 10 kinds of people in the world: those who understand binary, and those who don't. From hanche at math.ntnu.no Sat May 10 20:03:40 2003 From: hanche at math.ntnu.no (Harald Hanche-Olsen) Date: Sat May 10 13:11:35 2003 Subject: [Spambayes] Header cropping In-Reply-To: References: <20030510145457A.hanche@math.ntnu.no> Message-ID: <20030510190340Y.hanche@math.ntnu.no> + Tim Stone - Four Stones Expressions : | 5/10/2003 7:54:57 AM, Harald Hanche-Olsen wrote: | | >Any thoughts? | | Are you using the outlook plugin, pop3proxy, imapfilter, mboxtrain, or hammie? Hammie. Sorry, I should have said. 
- Harald From noreply at sourceforge.net Sat May 10 12:35:18 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat May 10 14:35:29 2003 Subject: [Spambayes] [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py Message-ID: Bugs item #725466, was opened at 2003-04-22 16:07 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tony Meyer (anadelonbrin) >Assigned to: Mark Hammond (mhammond) Summary: Include a proper locale fix in Options.py Initial Comment: When reading the options, the float() call fails when the locale is a language that uses a ',' for a separator instead of '.'. This is hack-fixed in Outlook, but needs to be fixed in general. I imagine that there must be some sort of locale call that will convert between the current locale and English, and that this should be called as the option is set. Anyway, I'll get to this when I can. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-11 04:35 Message: Logged In: YES user_id=14198 Damn. IIRC, MAPI also changes the cwd. I guess the solution may be the same - check the locale before setting, and reset explicitly afterwards. It would be great if you can check this while you are still setup to test, but otherwise it can stay on my todo list :) ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-09 13:23 Message: Logged In: YES user_id=552329 Ok, I did more digging. The locale set in addin.py isn't enough. This line in msgstore.py screws up the locale: self.session = mapi.MAPILogonEx(0, None, None, logonFlags) (it's in the __init__ of MAPIMsgStore; my line numbers are all screwed up). 
(by 'screws up the locale', I mean that math.log(2) returns a number with a '.' decimal separator before it, and a ',' decimal separator afterwards). I'll leave it to Mark to figure out what should be done. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-09 13:15 Message: Logged In: YES user_id=552329 Ok, forget that, I've found a way I can reproduce a similar error. This is definitely Outlook specific, and definitely cannot be fixed in the spambayes code, except by changing the locale to 'en' or 'c', or something similar. Ignoring spambayes completely, and using the demo outlook plugin that comes with the win32com extensions, I printed out the results of math.log(2). This should be "0.69314718056" and not "0,69314718056". I made this print statement pretty much every second line. When adding, it always gives the correct answer. As soon as Outlook is started up, it gives the wrong answer. It's beyond me whether this is something that the win32com stuff does (although importing the extensions doesn't cause it), or if it's something that Outlook does, somehow. No doubt you can answer that, Mark? ;) Anyway, unless a change to the win32com stuff occurs, this will have to remain in the spambayes Outlook code. (The options stuff is a separate problem, and my fix for that should work). It doesn't need to be outside the Outlook code though. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-09 12:22 Message: Logged In: YES user_id=552329 What I don't understand is why I can't reproduce this error. If I make the same addition to manager.py, but use 'german'[1] as the locale, everything still works. (It also still works if I make the change in addin.py where the other locale set is, and also if I change the locale to german in the windows control panel). In addition, why does the import from PythonWin work?
Surely this means (as mentioned) that something else that is imported is screwing up the locale setting before random gets imported - but then, shouldn't it do this on my machine, too? Googling on this (NV_MAGICCONST) brings up lots of instances of the problem, but nothing in the way of a solution apart from setting the locale to 'en' or 'c' before the random import. This really doesn't seem like the correct solution. I really am tempted to try my luck with c.l.p for a solution, if there aren't any more ideas here. [1] I can't set to "de_DE", I get a locale not supported error. "german" does use ',' as the decimal separator, so it should be the same, though. ---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-09 03:59 Message: Logged In: YES user_id=671479 Ok, I've got it working by setting the locale in the constructor of class BayesManager (manager.py):

    [...]
    self.config_filename = config_base + "_configuration.pck"
    print "locale: ", locale.getlocale()
    print "default locale: ", locale.getdefaultlocale()
    locale.setlocale(locale.LC_NUMERIC, "en")
    # First read the configuration file.
    [...]

Locale/default locale are both de_DE for me. ---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-09 03:45 Message: Logged In: YES user_id=671479

    >>> import locale
    >>> locale.setlocale(locale.LC_NUMERIC, "en")
    'English_United States.1252'
    >>> import random
    >>>

---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-07 20:49 Message: Logged In: YES user_id=14198 What happens if you go into Pythonwin/IDLE, execute the locale call mentioned, then import that module?

    >>> import locale
    >>> locale.setlocale(locale.LC_NUMERIC, "en")

---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-07 19:42 Message: Logged In: YES user_id=671479 The import works fine both in IDLE and PythonWin. I suspect the handling of floats is somehow broken by something loaded by Outlook or Python Windows Extensions. But then I don't really know enough about the architecture to make educated guesses ... Anyway I wonder why float handling at this low level seems to be locale dependent? Parsing text and converting to float is ok to be locale dependent, but doing calculations? ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-07 09:37 Message: Logged In: YES user_id=552329 This makes me wonder if the locale just needs to be set, but that seems too heavy-handed and ugly, really. What happens if you just execute

    >>> import random

from the interpreter (e.g. in PythonWin or IDLE)? I suspect that you'll get the same trace, in which case this really should be fixed in random.py, not in Spambayes (although it would be easy enough to throw together a hack for it). ---------------------------------------------------------------------- Comment By: Stefan Gründel (sgruendel) Date: 2003-05-07 00:18 Message: Logged In: YES user_id=671479 It seems like it's not enough to fix Options.py only.
I did this and then got the following error:

Traceback (most recent call last):
  File "D:\dev\Python22\lib\site-packages\win32com\universal.py", line 170, in dispatch
    retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_
    return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_
    return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_
    return apply(func, args)
  File "D:\util\spambayes-1.0a2\Outlook2000\addin.py", line 611, in OnConnection
    self.manager = manager.GetManager(application)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 335, in GetManager
    _mgr = BayesManager(outlook=outlook, verbose=verbose)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 79, in __init__
    import_core_spambayes_stuff(self.ini_filename)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 53, in import_core_spambayes_stuff
    from spambayes.tokenizer import tokenize
  File "D:\util\spambayes-1.0a2\spambayes\tokenizer.py", line 7, in ?
    import email.Message
  File "D:\dev\Python22\Lib\email\Message.py", line 14, in ?
    from email import Utils
  File "D:\dev\Python22\Lib\email\Utils.py", line 10, in ?
    import random
  File "D:\dev\Python22\Lib\random.py", line 93, in ?
    _verify('NV_MAGICCONST', NV_MAGICCONST, 1.71552776992141)
  File "D:\dev\Python22\Lib\random.py", line 88, in _verify
    raise ValueError(
exceptions.ValueError: computed value for NV_MAGICCONST deviates too much (computed 2,82843, expected 1)

Obviously random.py uses ',' too! I'm using german Windows/Outlook, if I can be of any help in testing, just drop me a mail.
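Mark's suggestion at the top of this report (check the locale before the MAPI logon and reset it explicitly afterwards) might look something like the sketch below. The helper name preserved_numeric_locale and the stand-in function misbehaving_call are hypothetical; only locale.setlocale is a real call, and the code is written for a current Python rather than the 2.2 used in the thread.

```python
import locale
from contextlib import contextmanager


@contextmanager
def preserved_numeric_locale():
    """Restore LC_NUMERIC after the wrapped block, whatever it did to it."""
    # Calling setlocale() without a second argument only queries the setting.
    saved = locale.setlocale(locale.LC_NUMERIC)
    try:
        yield
    finally:
        locale.setlocale(locale.LC_NUMERIC, saved)


def misbehaving_call():
    """Hypothetical stand-in for a call that changes the process locale."""
    try:
        locale.setlocale(locale.LC_NUMERIC, "")  # switch to the user's default
    except locale.Error:
        pass  # no usable default locale here; nothing changes


before = locale.setlocale(locale.LC_NUMERIC)
with preserved_numeric_locale():
    misbehaving_call()
after = locale.setlocale(locale.LC_NUMERIC)
```

In the add-in, the with block would presumably wrap the mapi.MAPILogonEx line Tony identified in msgstore.py. Whether restoring LC_NUMERIC alone is sufficient there is untested.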
---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-24 16:30 Message: Logged In: YES user_id=552329 Options.py now uses locale.atoi and locale.atof to convert options. I *think* this will solve this problem, but I'm not 100%. If someone could do some testing, that would be great. I'm leaving open until I'm sure it's done. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 From noreply at sourceforge.net Sat May 10 14:03:40 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat May 10 16:03:45 2003 Subject: [Spambayes] [ spambayes-Bugs-735805 ] No item in Outlook XP standard toolbar after install Message-ID: Bugs item #735805, was opened at 2003-05-10 16:03 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=735805&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Scott Sanders (giveuptheghost) Assigned to: Mark Hammond (mhammond) Summary: No item in Outlook XP standard toolbar after install Initial Comment: I just installed SpamBayes Outlook Addin 002 for my Outlook XP. When I ran Outlook, the "Anti-Spam item" is not in the Standard toolbar. I've run it three times, and I've included the second log file, the most verbose one. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=735805&group_id=61702 From nick at kirsch.org Sat May 10 16:35:14 2003 From: nick at kirsch.org (Nicholas M. Kirsch) Date: Sat May 10 19:45:56 2003 Subject: [Spambayes] Debian maintainer Message-ID: Do you currently have any Debian packages, or know of a Debian maintainer, for SpamBayes?
If not, I was thinking that I would like to give it a shot. Thanks. Nick From noreply at sourceforge.net Sat May 10 19:05:34 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat May 10 21:05:42 2003 Subject: [Spambayes] [ spambayes-Bugs-735805 ] No item in Outlook XP standard toolbar after install Message-ID: Bugs item #735805, was opened at 2003-05-10 16:03 Message generated for change (Comment added) made by giveuptheghost You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=735805&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Scott Sanders (giveuptheghost) Assigned to: Mark Hammond (mhammond) Summary: No item in Outlook XP standard toolbar after install Initial Comment: I just installed SpamBayes Outlook Addin 002 for my Outlook XP. When I ran Outlook, the "Anti-Spam item" is not in the Standard toolbar. I've run it three times, and I've included the second log file, the most verbose one. ---------------------------------------------------------------------- >Comment By: Scott Sanders (giveuptheghost) Date: 2003-05-10 21:05 Message: Logged In: YES user_id=184281 Well, I can see the "Anti-Spam Item" now. Note that I am still in the third session of Outlook since installing the SpamBayes addin, and my original post was at the beginning of this session. Thus, the "Anti-Spam Item" has "magically appeared." I've seen this happen before with other addins, such as SpamSource and SpamNet, and on my other system, which run Win98 and Outlook 2000. I currently use SpamSource with my Outlook XP and had once used SpamNet (until they started charging for it). I've enclosed this session's log file, in case it will help with other cases. Note that I trained it once I noticed the "Anti-Spam Item" in the toolbar. 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=735805&group_id=61702 From noreply at sourceforge.net Sat May 10 19:06:58 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat May 10 21:07:04 2003 Subject: [Spambayes] [ spambayes-Bugs-735805 ] No item in Outlook XP standard toolbar after install Message-ID: Bugs item #735805, was opened at 2003-05-10 16:03 Message generated for change (Settings changed) made by giveuptheghost You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=735805&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Scott Sanders (giveuptheghost) Assigned to: Mark Hammond (mhammond) Summary: No item in Outlook XP standard toolbar after install Initial Comment: I just installed SpamBayes Outlook Addin 002 for my Outlook XP. When I ran Outlook, the "Anti-Spam item" is not in the Standard toolbar. I've run it three times, and I've included the second log file, the most verbose one. ---------------------------------------------------------------------- Comment By: Scott Sanders (giveuptheghost) Date: 2003-05-10 21:05 Message: Logged In: YES user_id=184281 Well, I can see the "Anti-Spam Item" now. Note that I am still in the third session of Outlook since installing the SpamBayes addin, and my original post was at the beginning of this session. Thus, the "Anti-Spam Item" has "magically appeared." I've seen this happen before with other addins, such as SpamSource and SpamNet, and on my other system, which run Win98 and Outlook 2000. I currently use SpamSource with my Outlook XP and had once used SpamNet (until they started charging for it). I've enclosed this session's log file, in case it will help with other cases. Note that I trained it once I noticed the "Anti-Spam Item" in the toolbar. 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=735805&group_id=61702 From T.A.Meyer at massey.ac.nz Mon May 12 12:33:41 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 11 19:34:25 2003 Subject: [Spambayes] installation problems Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181C504@its-xchg4.massey.ac.nz> > I feel a little ashamed to write you, but after 2 day trying > everything, I give up ! We're here to help, no need for feeling ashamed ;) > I downloaded and installed: > [Python 2.3b1] > [win32all-153] > [Spambayes-Outlook-Setup-002] It's not going to hurt having these, but the only one that you require is the last one. The Outlook plugin (binary) will run without Python installed (more accurately, I gather it installs the bits of Python that are needed to run it). > my system is win xp 5.1.2600 german running outlook 2002, version 10.0 It's hard to tell, since there isn't a lot of information, but I wonder whether the problem might be that it's German. There are issues with languages that use something apart from a period ('.') as a decimal separator (ie. a half is not 0.5, but 0,5). (Discussion of this is in [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py). It appears that Outlook fiddles with the locale settings. Either Mark or I will check in a fix to this shortly. However, this would mean that you would have to use the cvs version, since the fix won't, obviously, be in 002. You could try getting the cvs code (since you have Python installed already), and using that (you run addin.py to install the plugin). > installing the outlook add-in produces the following > error-box: c:programme\spambayes outlook addin\spambayes_addin.dll > unable to register the dll/ocx:dllregisterserver failed; > code 0x00000000 This, however, doesn't sound like it's the locale problem. 
Mark would have a better idea than I would about what it might be - you could try submitting a bug report via sf (http://sf.net/projects/spambayes), even though you don't have a log. The easiest way around this would certainly be to try using the cvs code - it would probably result in more detailed error messages as well. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon May 12 12:44:21 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 11 19:45:00 2003 Subject: [Spambayes] Header cropping Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181C522@its-xchg4.massey.ac.nz> > I found the following message in my unsure folder today. > [short message cut] > > Yes, that is the entire message, headers and all (I just > indented it for your reading pleasure). > > According to the logs, the message was 1310 bytes long when > it arrived. > I assume the problem arose because of some malformed header. > But malformed headers sometimes occur in ham mail too, and > I'd still like to receive these messages without further damage. It's most likely that this is the reason that it happened. In mboxutils.py, when getting a message, if the email parser raises a parsing error, it tries to cut the headers off, and set the payload of the message to whatever is left. When the payload is multipart, it's not all that surprising that this fails. I don't know enough about how hammie works to suggest (let alone code) a solution. Perhaps what should be done is that if parsing fails, the message is just left alone, unaltered? [1] Or maybe moved into a "problematic messages" directory? Thoughts? =Tony Meyer [1] The problem here, of course, is that this would mean that to get past spambayes, all you have to do is send a message that fails to parse. From T.A.Meyer at massey.ac.nz Mon May 12 12:47:26 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 11 19:48:01 2003 Subject: [Spambayes] (non-)spam count would go negative! 
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181C52E@its-xchg4.massey.ac.nz> > Unless I'm missing something, a further problem is that > spammers could add this header to subvert the training > process. Right? I thought that there was code to get rid of these, but I can't find it. If there isn't, then yes this could happen. However: * It's only training, not classification, since the classification headers are definitely ignored if already present. * The new message module does remove all existing spambayes headers, so all the apps that use that would be safe. At the moment this is only imapfilter (unless also pop3proxy; I can't recall), but the idea is that all apps will use it eventually. (Although some, like the Outlook plugin, don't touch the headers, so won't use that bit). =Tony Meyer From T.A.Meyer at massey.ac.nz Mon May 12 12:48:11 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 11 19:48:46 2003 Subject: [Spambayes] (non-)spam count would go negative! Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181C531@its-xchg4.massey.ac.nz> > Definitely a problem with the documentation, then. Of course > that is to be expected from software that is still under > development, so I'm not complaining. I'll take a look at the documentation (I've been meaning to do that anyway) and see if I can clear it up. The main trouble is that since I only use pop3proxy, imapfilter and the Outlook plugin, those are the ones that I can most easily document. I'll try and find unclear bits and plead with the list to provide clearer instructions ;) > I hope my grumpy tone did not offend anyone. It certainly didn't offend me, anyway :) =Tony Meyer From mail at peterbe.com Mon May 12 02:53:21 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Sun May 11 21:31:58 2003 Subject: [Spambayes] How many is enough? 
Message-ID: <5.2.0.9.0.20030512015036.02010fe0@localhost> I've read the pages at http://spambayes.sourceforge.net/ now and concluded that you should train your database, but not too much. What I fail to find is some numbers for this. Are we talking about hundreds or thousands or millions? I've trained my database with 3000 ham and only 50 spam. That was basically all I had available in my email client at the moment. So, how much should I train before I run the risk of overdoing it? Peter PS. Not on the mailinglist From ejb at ql.org Sun May 11 22:41:54 2003 From: ejb at ql.org (Jay Berkenbilt) Date: Sun May 11 21:42:26 2003 Subject: [Spambayes] SpamAssassin's Bayes/SpamBayes: cross-pollination? Message-ID: <200305120141.h4C1fsYB016277@soup.in.ql.org> I notice that recent versions of SpamAssassin have a Bayesian filtering engine. There is an option in SpamAssassin 2.53 (and perhaps 2.50 and later) called bayes_use_chi2_combining which presumably enables the same chi-squared algorithm described in the "background" page from spambayes.sourceforge.net. From a quick perusal of the code, it also seems to do some form of smarter tokenizing than the original Graham paper describes (with or without the bayes_use_chi2_combining option). My question: Is there any discussion going on between SpamBayes and SpamAssassin in terms of refinement of tokenizing or other algorithms? Are the two projects moving in the same direction, or solving the same problem in different ways? Naturally, SpamAssassin uses the results of the Bayesian filtering to increase or decrease the overall spam score, so it becomes just one tool of many to help with the overall process, and SpamBayes includes things like the POP and IMAP proxy and web-based training interfaces that are not present in SpamAssassin, but these are, in my opinion, not substantive differences in how they solve the underlying problem. 
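For anyone comparing the two projects, the chi-squared combining mentioned above can be sketched in a few lines of Python. This is only a minimal illustration, assuming the scheme described on the SpamBayes "background" page (per-token spam probabilities combined via Fisher's method into a score between 0 and 1). The function names are chosen for the example, and it is not claimed to match the code in either SpamBayes or SpamAssassin.

```python
import math


def chi2Q(x2, v):
    """Probability that a chi-squared variable with v degrees of freedom
    is >= x2. This closed form only works for even v."""
    if v <= 0 or v % 2:
        raise ValueError("v must be a positive even integer")
    m = x2 / 2.0
    term = math.exp(-m)
    total = term
    for i in range(1, v // 2):
        term *= m / i
        total += term
    # Rounding can push the sum a hair above 1.0, so clamp it.
    return min(total, 1.0)


def chi2_combine(probs):
    """Combine per-token spam probabilities (each strictly between 0 and 1)
    into one score: near 1.0 is spam, near 0.0 is ham, near 0.5 is unsure."""
    n = len(probs)
    if n == 0:
        return 0.5
    # Sum logs rather than multiplying, to avoid underflow on long messages.
    ln_spam_side = sum(math.log(1.0 - p) for p in probs)
    ln_ham_side = sum(math.log(p) for p in probs)
    S = 1.0 - chi2Q(-2.0 * ln_spam_side, 2 * n)  # evidence for spam
    H = 1.0 - chi2Q(-2.0 * ln_ham_side, 2 * n)   # evidence for ham
    return (S - H + 1.0) / 2.0
```

With strongly spammy clues such as chi2_combine([0.99, 0.98, 0.97]) the score comes out close to 1, while conflicting clues pull S and H together and the score drifts toward 0.5, which is what produces an "unsure" result.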
I've been using a combination of SpamBayes (from CVS) and SpamAssassin version 2.43 (without Bayesian filtering) for several weeks. I basically use the two filters together to help with initial classification and use a moderately elaborate procmail setup and several home-grown scripts to sort everything in different ways based on the combined results of the various filters. (For example, messages that both filters think are spam are filed differently from messages that only one thinks is spam, and messages are only classified as "unsure" if SpamBayes says unsure and SpamAssassin scored the message below the spam threshold.) I also have my own system of expiring old messages and keeping my SPAM and HAM training sets the same size. (I had them both set at 600, though I've recently upped them to 1500.) Using this combined approach, I've been getting consistently about 99% correct classification. In my last 1500 messages, I have had zero false negatives and zero false positives on any messages that both SpamAssassin and SpamBayes have agreed were spam. I'm just interested in anyone else's thoughts on this issue, and I'm particularly curious, as the subject suggests, about whether there's any discussion or sharing of people between the two efforts. Thanks for a great tool. Also, if anyone is interested in more details about my setup, I'd be happy to share them. -- Jay Berkenbilt http://www.ql.org/q/ From T.A.Meyer at massey.ac.nz Mon May 12 14:41:45 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 11 21:42:30 2003 Subject: [Spambayes] How many is enough? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181C5E1@its-xchg4.massey.ac.nz> > I've read the pages at http://spambayes.sourceforge.net/ now > and concluded that you should train your database, but not too much. > What I fail to find is some numbers for this. Are we talking > about hundreds or thousands or millions? > I've trained my database with 3000 ham and only 50 spam.
That > was basically all I had available in my email client at the moment. > So, how much should I train before I run the risk of overdoing it? The more you train, the better, in general. However: * If you have many more of ham/spam than spam/ham, this can be bad. (however, if you enable the experimental_ham_spam_imbalance option, this shouldn't matter as much, although it hasn't been tested as much as it could be). * 50 spam is fairly low. It wouldn't be that surprising to get some incorrect results with that few, but it should still do a reasonable job. From my experience, I would say that you should have a couple of hundred of each to get acceptable results (I only use a corpus of about 400 each in Outlook). If you have thousands, then you'll probably get more accurate results, but I imagine that the utility of adding another message to the corpus gets lower as the corpus gets bigger. IIRC, if you look through the archives there's a post that has a reference to a webpage that has graphs for different corpus sizes. That might be of interest. =Tony Meyer From anthony at interlink.com.au Mon May 12 13:01:44 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Sun May 11 22:02:36 2003 Subject: [Spambayes] Debian maintainer In-Reply-To: Message-ID: <200305120201.h4C21iS23579@localhost.localdomain> >>> "Nicholas M. Kirsch" wrote > Do you currently have any Debian packages or know of a Debian maintainer > for SpamBayes? If not, I was thinking that I would like to give it a shot. Not that I'm aware of -- go for it! Anthony -- Anthony Baxter It's never too late to have a happy childhood. From anthony at interlink.com.au Mon May 12 13:03:50 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Sun May 11 22:04:36 2003 Subject: [Spambayes] How many is enough? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130181C5E1@its-xchg4.massey.ac.nz> Message-ID: <200305120203.h4C23oY23621@localhost.localdomain> > The more you train, the better, in general.
However: > * If you have many more of ham/spam than spam/ham, this can be bad. > (however, if you enable the experimental_ham_spam_imbalance option, > this shouldn't matter as much, although it hasn't been tested as > much as it could be). > * 50 spam is fairly low. It wouldn't be that surprising to get some > incorrect results with that few, but it should still do a reasonable > job. Early experiments (see the mailing list archive) found that when you go from thousands to multiple tens of thousands of messages, accuracy gets a little bit worse. I'm going completely from memory here; you'd have to check the mailing list archive for the details. Anthony From support at getsoft.com Mon May 12 13:08:51 2003 From: support at getsoft.com (SAIG/ED4W Support) Date: Sun May 11 22:15:28 2003 Subject: [Spambayes] SpamBayes questions Message-ID: <120656135072.20030512120851@getsoft.com> Hi, I have recently found your SpamBayes program and it looks very interesting. I'd like to be able to run SpamBayes on our Web server and wonder if this is possible. In particular, can you access the Web interface from your local PC when SpamBayes is running on a remote server? From reading the Installation notes SpamBayes requires Python V2.2 or later. I assume from this there is no way to get it to run on V1.5.2, which is what is installed on our Web server and unfortunately can't be upgraded? One of my big hassles with Spam is the cost in money and time downloading email when I'm on a slow expensive Internet connection, such as in a Hotel or using a mobile phone. What I would really like to see is a way to skip downloading email flagged as spam. Maybe something as simple as using a different pop3 port which only gets ham. Or the option to redirect all spam to a specified email account, which I could then check when I have my normal broadband connection available. I did have a quick scan through the mailing list archives, but couldn't see any answers to these specific issues.
Thanks for your time. -- Best Regards, Neville Franks, Author of ED for Windows - the programmers editor, an indispensable tool for great software development. http://www.getsoft.com Version 4.02 now available. New: Projects, Source Database, Difference Analysis.... Signup for our new Affiliate program and share in our success. Soft As It Gets Pty Ltd, 12 Fairview Grove, Glen Iris, VIC, 3146 Australia Ph +61 3 9885 4445, Fax +61 3 9885 4444, mailto:support@getsoft.com From popiel at wolfskeep.com Sun May 11 20:37:58 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Sun May 11 22:38:02 2003 Subject: [Spambayes] How many is enough? In-Reply-To: Message from Peter Bengtsson of "Mon, 12 May 2003 01:53:21 BST." <5.2.0.9.0.20030512015036.02010fe0@localhost> References: <5.2.0.9.0.20030512015036.02010fe0@localhost> Message-ID: <20030512023758.A41DF2DE8A@cashew.wolfskeep.com> In message: <5.2.0.9.0.20030512015036.02010fe0@localhost> Peter Bengtsson writes: >I've read the pages at http://spambayes.sourceforge.net/ now and concluded >that you should train your database, but not too much. >What I fail to find is some numbers for this. Are we talking about hundreds >or thousands or millions? >I've trained my database with 3000 ham and only 50 spam. That was basically >all I had available in my email client at the moment. > >So, how much should I train before I run the risk of overdoing it? I've not had any problems with training with tens of thousands of messages (6345 ham, 17063 spam for last night's retrain). The only reason I don't train with my full 44219 message archive is to control database size... and that really is a pretty minor concern. My incremental training tests showed that more was better for accuracy...
- Alex From tim.one at comcast.net Sun May 11 23:46:18 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun May 11 22:47:40 2003 Subject: [Spambayes] SpamBayes questions In-Reply-To: <120656135072.20030512120851@getsoft.com> Message-ID: [Neville Franks] > ... > From reading the Installation notes SpamBayes requires Python V2.2 or > later. I assume from this there is no way to get it to run on V1.5.2, > which is what is installed on our Web server and unfortunately can't > be upgraded? 2.2 is minimal, in part because the code uses generators (new in 2.2), and in part because it relies on many improvements to Python's libraries since 1.5.2 came out. You could work around all that, of course, given an unknown (but large) investment of effort. Note that 1.5.2 hasn't been supported for years, and even 2.1 stopped attracting support over a year ago (when 2.3 development started). That web server is going to self-destruct eventually . From mhammond at skippinet.com.au Mon May 12 13:57:32 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun May 11 22:58:23 2003 Subject: [Spambayes] SpamBayes questions In-Reply-To: <120656135072.20030512120851@getsoft.com> Message-ID: <056b01c31832$3c818b20$530f8490@eden> Hi Neville, You may remember me from years ago when I was using ED4W while working with Paul Foxworthy at Colonial. Funny how small the world is! :) > I have recently found your SpamBayes program and it looks very > interesting. I'd like to be able to run SpamBayes on our Web server > and wonder if this is possible. In particular can you access the Web > interface from your local PC when SpamBayes is running on a remote > server. I'm much more involved in the Outlook part of this project, but seeing as I was saying "hi", I will do my best to answer. As far as I know, the pop3proxy application (which I believe you are referring to) will allow you to run its existing web interface from a remote machine. 
> From reading the Installation notes SpamBayes requires Python V2.2 or > later. I assume from this there is no way to get it to run on V1.5.2, > which is what is installed on our Web server and unfortunately can't > be upgraded? Unfortunately not. 1.5.2 is very old and no longer being maintained. This project requires Python 2.1 or later. Note that it is generally not necessary to upgrade - a Python 2.x installation should be able to sit next to Python 1.x. > One of my big hassles with Spam is the cost in money and time > downloading email when I'm on a slow expensive Internet connection, > such as in a Hotel or using a mobile phone. > > What I would really like to see is a way to skip downloading email > flagged as spam. Maybe something as simple as using a different pop3 > port which only gets ham. Or the option to redirect all spam to > specified email account, which I could then check when I have my > normal broadband connection available. This sounds like a reasonable idea, but I certainly haven't heard of it. Of course, the classifier has to download the mail from the original pop server to perform the analysis, so this gets a little tricky - pop3proxy will then need to simulate a complete pop server (to pretend the spam doesn't actually exist) or download and delete from the original, maintaining its own database. Hopefully someone more involved in pop3proxy will give a better answer. Mark. From dave at boost-consulting.com Sun May 11 23:57:39 2003 From: dave at boost-consulting.com (David Abrahams) Date: Sun May 11 22:58:29 2003 Subject: [Spambayes] Does anyone care about this report? Message-ID: I spent considerable time preparing this several weeks ago, with the promise that it would be made available somewhere that would be of use to people... but got no reply. http://www.boost-consulting.com/writing/server-side.html I'd also appreciate it if someone could address the question at the end.
Thanks, -- Dave Abrahams Boost Consulting www.boost-consulting.com From T.A.Meyer at massey.ac.nz Mon May 12 17:07:07 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 12 00:07:53 2003 Subject: [Spambayes] SpamBayes questions Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181C6CC@its-xchg4.massey.ac.nz> > In particular can you access the Web > interface from your local PC when SpamBayes is running on a remote > server. If you set the "html_ui":"allow_remote_connections" option to True (it defaults to False), then yes. It depends on the setup, but you might want to alter it first, though. As it is, this would mean that anyone could connect to that server and access the interface. You could simply put it in a protected (htaccess, for example) directory, or you could build in some sort of authentication. (IIRC, there is a feature request for this.) > What I would really like to see is a way to skip downloading email > flagged as spam. Maybe something as simple as using a different pop3 > port which only gets ham. Or the option to redirect all spam to > specified email account, which I could then check when I have my > normal broadband connection available. As Mark said, this is tricky because the classifier needs access to the mail to do its thing. Are you intending this for a single user, or multiple users? The FAQ has an answer about setting up Spambayes for multiple users (basically that it's possible, but we don't have documentation for it). If it's for a single user, and the proxy is running on the remote server, then that shouldn't be a problem. It wouldn't be difficult to modify pop3proxy so that it didn't deliver spam (you could then run this on a separate port). (Redirecting would also be possible, but much more work). 
I suspect that you could add this before line 462 of pop3proxy:

    if msg.GetClassification() == options["Hammie", "header_spam_string"]:
        return ok + "\n.\r\n"

This is *completely* untested, but it should mean that for all messages classified as spam an empty message is returned. (It would be simple enough to tweak so that you just got the headers, or something like that). The full message would be added to the cache so you could review (and re-classify) it via the web interface. If you don't want to do any coding, but you do want to try this [1], then I'm happy to put together a patch. Just let me know what behaviour you want it to have. (I suspect it should really have the ability to 'resend' a message that you identify as a false positive, for example). =Tony Meyer [1] This assumes you get past the Python 1.5 problem ;) From readonly at getsoft.com Mon May 12 16:13:55 2003 From: readonly at getsoft.com (Neville Franks) Date: Mon May 12 01:13:41 2003 Subject: [Spambayes] SpamBayes questions In-Reply-To: <056b01c31832$3c818b20$530f8490@eden> References: <056b01c31832$3c818b20$530f8490@eden> Message-ID: <151667238568.20030512151355@getsoft.com> Hi Mark, Monday, May 12, 2003, 12:57:32 PM, you wrote: MH> Hi Neville, MH> You may remember me from years ago when I was using ED4W while working MH> with Paul Foxworthy at Colonial. Funny how small the world is! :) Yes indeed I do. When I discovered SpamBayes yesterday I saw your name plastered all over the place and thought I wonder if that's the same Mark Hammond. Last I remember you were going to write a killer GUI Make program, now it is Python IDEs :-) Hope all is well. >> I have recently found your SpamBayes program and it looks very >> interesting. I'd like to be able to run SpamBayes on our Web server >> and wonder if this is possible. In particular can you access the Web >> interface from your local PC when SpamBayes is running on a remote >> server.
MH> I'm much more involved in the Outlook part of this project, but seeing as I MH> was saying "hi", I will do my best to answer. FYI I don't use Outlook. MH> As far as I know, the pop3proxy application (which I believe you are MH> referring to) will allow you to run its existing web interface from a remote MH> machine. Thanks. Tony responded as well. See my reply to him. >> From reading the Installation notes SpamBayes requires Python V2.2 or >> later. I assume from this there is no way to get it to run on V1.5.2, >> which is what is installed on our Web server and unfortunately can't >> be upgraded? MH> Unfortunately not. 1.5.2 is very old and no longer being maintained. This MH> project requires Python 2.1 or later. Note that it is generally not MH> necessary to upgrade - a Python 2.x installation should be able to sit next MH> to Python 1.x. Seeing I have no way to run Python V2.x at my hosting company at present, I've set it up on a machine here and have got everything working. So far I am most impressed. Now if only ... >> One of my big hassles with Spam is the cost in money and time >> downloading email when I'm on a slow expensive Internet connection, >> such as in a Hotel or using a mobile phone. >> >> What I would really like to see is a way to skip downloading email >> flagged as spam. Maybe something as simple as using a different pop3 >> port which only gets ham. Or the option to redirect all spam to >> specified email account, which I could then check when I have my >> normal broadband connection available. MH> This sounds like a reasonable idea, but I certainly haven't heard of it. Of MH> course, the classifier has to download the mail from the original pop server MH> to perform the analysis, so this gets a little tricky - pop3proxy will then MH> need to simulate a complete pop server (to pretend the spam doesn't actually MH> exist) or download and delete from the original, maintaining its own MH> database.
MH> Hopefully someone more involved in pop3proxy will give a better answer. I'm really very surprised that none of the Spam packages I've seen have any option to skip "spam" messages. It just seems like such an obvious thing to do to me. Thanks for your comments. Again see my reply to Tony. -- Best regards, Neville Franks, Author of ED for Windows - the programmers editor, an indispensable tool for great software development. http://www.getsoft.com Version 4.02 now available. New: Projects, Source Database, Difference Analysis.... From readonly at getsoft.com Mon May 12 16:30:37 2003 From: readonly at getsoft.com (Neville Franks) Date: Mon May 12 01:30:18 2003 Subject: [Spambayes] SpamBayes questions In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130181C6CC@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130181C6CC@its-xchg4.massey.ac.nz> Message-ID: <19668240919.20030512153037@getsoft.com> Hi Tony, Monday, May 12, 2003, 2:07:07 PM, you wrote: >> In particular can you access the Web >> interface from your local PC when SpamBayes is running on a remote >> server. MT> If you set the "html_ui":"allow_remote_connections" option to True (it MT> defaults to False), then yes. It depends on the setup, but you might MT> want to alter it first, though. As it is, this would mean that anyone MT> could connect to that server and access the interface. You could simply MT> put it in a protected (htaccess, for example) directory, or you could MT> build in some sort of authentication. (IIRC, there is a feature request MT> for this.) Thanks for that. Right now and for the foreseeable future I am running SB locally, because of the Python version problem on my Web host. >> What I would really like to see is a way to skip downloading email >> flagged as spam. Maybe something as simple as using a different pop3 >> port which only gets ham. 
Or the option to redirect all spam to >> specified email account, which I could then check when I have my >> normal broadband connection available. MT> As Mark said, this is tricky because the classifier needs access to the MT> mail to do its thing. Are you intending this for a single user, or MT> multiple users? The FAQ has an answer about setting up Spambayes for MT> multiple users (basically that it's possible, but we don't have MT> documentation for it). If it's for a single user, and the proxy is MT> running on the remote server, then that shouldn't be a problem. MT> It wouldn't be difficult to modify pop3proxy so that it didn't deliver MT> spam (you could then run this on a separate port). (Redirecting would MT> also be possible, but much more work). I suspect that you could add MT> this before line 462 of pop3proxy: MT> if msg.GetClassification() == options["Hammie", MT> "header_spam_string": MT> return ok + "\n.\r\n" MT> This is *completely* untested, but it should mean that for all messages MT> classified as spam an empty message is returned. (It would be simple MT> enough to tweak so that you just got the headers, or something like MT> that). The full message would be added to the cache so you could review MT> (and re-classify) it via the web interface. MT> If you don't want to do any coding, but you do want to try this [1], MT> then I'm happy to put together a patch. Just let me know what behaviour MT> you want it to have. (I suspect it should really have the ability to MT> 'resend' a message that you identify as a false positive, for example). I have approx. 6 @getsoft.com I want to filter, and possibly some others. Seeing I only discovered Spambayes yesterday and installed it today, I really don't have much of an understanding how it goes about doing what it does. Typically my email client is setup to delete mail as it downloads it. 
Is it feasible to have a mode/setting which says don't pass spam on to the email client and also don't delete it from the mail server. If the email client never sees the spam in this special mode then it won't delete it and it will just sit there. Ideally this "mode" could be set by the Web interface. I could still use WebMail to look at the mail back on the server if I really wanted to. I really appreciate your quick response and willingness to help. Kudos to all those involved in this project. -- Best regards, Neville Franks, Author of ED for Windows - the programmers editor, an indispensable tool for great software development. http://www.getsoft.com Version 4.02 now available. New: Projects, Source Database, Difference Analysis.... From readonly at getsoft.com Mon May 12 16:38:04 2003 From: readonly at getsoft.com (Neville Franks) Date: Mon May 12 01:37:41 2003 Subject: [Spambayes] SpamBayes questions In-Reply-To: References: Message-ID: <191668687641.20030512153804@getsoft.com> Monday, May 12, 2003, 12:46:18 PM, you wrote: TP> [Neville Franks] >> ... >> From reading the Installation notes SpamBayes requires Python V2.2 or >> later. I assume from this there is no way to get it to run on V1.5.2, >> which is what is installed on our Web server and unfortunately can't >> be upgraded? TP> 2.2 is minimal, in part because the code uses generators (new in 2.2), and TP> in part because it relies on many improvements to Python's libraries since TP> 1.5.2 came out. You could work around all that, of course, given an unknown TP> (but large) investment of effort. Note that 1.5.2 hasn't been supported for TP> years, and even 2.1 stopped attracting support over a year ago (when 2.3 TP> development started). That web server is going to self-destruct eventually TP> . Thanks. For now I've installed Python 2.3 locally and will run it that way until I'm able to get the later version on my Web host. 
-- Best regards, Neville Franks, Author of ED for Windows - the programmers editor, an indispensable tool for great software development. http://www.getsoft.com Version 4.02 now available. New: Projects, Source Database, Difference Analysis.... From noreply at sourceforge.net Mon May 12 02:15:22 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 12 04:15:34 2003 Subject: [Spambayes] [ spambayes-Bugs-712480 ] Outlook 2002 (XP) installation fails Message-ID: Bugs item #712480, was opened at 2003-03-31 05:47 Message generated for change (Comment added) made by nicktrautmann You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul Marrero (pmarrero) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2002 (XP) installation fails Initial Comment: I use Office XP with the Outlook client. It appears that the registration was successful but I cannot find any menu buttons. XP clipboard does appear to have the Icons. The command line train works. Not sure where to go from here.
---------------------------------------------------------------------- Comment By: Nick Trautmann (nicktrautmann) Date: 2003-05-12 08:15 Message: Logged In: YES user_id=776857 Similar here: pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? 
exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\Nick\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize Thanks Nick ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-01 02:48 Message: Logged In: YES user_id=552329 Actually, I get this too. I've just switched to Outlook XP, so I'm not sure if this is the reason, or just that I'm doing a fresh install. 
The log includes the following traces: SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - 
creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-03-31 06:05 Message: Logged In: YES user_id=552329 Which version of the Outlook plugin are you using? (a) the latest CVS, (b) the 001 stand-alone installer, or (c) the 002 stand-alone installer? I know that the 001 installer has been known to have this problem (although it appeared to be fixed in 002). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 From T.A.Meyer at massey.ac.nz Mon May 12 22:15:17 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 12 05:16:04 2003 Subject: [Spambayes] SpamBayes questions Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130181C7A8@its-xchg4.massey.ac.nz> > I have approx. 6 @getsoft.com I want to filter, and possibly > some others. 
Seeing I only discovered Spambayes yesterday and > installed it today, I really don't have much of an > understanding how it goes about doing what it does. :) A reason things get tricky with multiple users is that you have to decide whether they are going to share a database (and hence, a common definition of spam/ham). This leads to issues about storing separate databases, or issues about training shared databases via separate interfaces (so that not all mail is seen by all users). The multiple users/server question is asked so often that it would be really great (IMO) if we did have an example of how to do it (just making arbitrary decisions, and pointing out where they were). If I find time and finish the other things I'm meant to be doing ;) then I might play around with this (it would be time consuming to set up, though since I would have to create a test mail server on my system, since I don't have access to one). Anyway, in the meantime you can see how things go with the local system. > Typically my email client is setup to delete mail as it > downloads it. Is it feasible to have a mode/setting which > says don't pass spam on to the email client and also don't > delete it from the mail server. If the email client never > sees the spam in this special mode then it won't delete it > and it will just sit there. Ideally this "mode" could be set > by the Web interface. I could still use WebMail to look at > the mail back on the server if I really wanted to. What I could see as an 'easy' option would be this: * your mail program can connect to pop.example.com:110 and all mail waiting is retrieved normally (and you probably delete it all from the server as well) * your mail program can connect to pop.example.com:111 and only ham/unsures are downloaded; all spam (and false positives ;) are left on the server (and you probably, again, delete all the mail you download, since you have a local copy of it). 
* your web browser can connect to pop.example.com:80 and see the web interface, which lets you correct any misclassifications, review waiting (spam) messages, and the like. If you reclassify a message from spam to ham, then it will be downloaded next time your mail program connects to :111. This way you would only need to connect to :111 and occasionally review via the web ui. Alternatively, you could normally connect via :110 (and filter based on the headers, perhaps), and connect to :111 when you are on a dialup, or a mobile, or whatever. The mail clients connecting to :111 would have no idea that there was more mail there (or, perhaps, there could be an option to download a stripped version of the message so that you know that there is mail that you can check, or maybe a single summary message occasionally). It's not that tricky a project (IMO), if the shared user logistics mentioned above are resolved. Actually, thinking about it, Death2Spam (http://www.death2spam.com) does something along these lines. I didn't read the docs all that carefully, so there are probably differences, but it's similar, at least. > I really appreciate your quick response and willingness to > help. Kudos to all those involved in this project. No worries, and thanks :) =Tony Meyer From anthony at interlink.com.au Mon May 12 22:45:58 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Mon May 12 07:46:08 2003 Subject: [Spambayes] mini-project idea - .fetchmailconf parser Message-ID: <200305121145.h4CBjxD29981@localhost.localdomain> A mini-project that I've had on my "one day..." list for a long time is something that takes a .fetchmailconf, parses it and generates a config for the pop/imap proxy. It would also move aside the existing fetchmail config file and replace it with a new one. Like I said, it's a fairly small project, but if someone else's feeling inspired -- I'm unlikely to get to it anytime soon. 
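A rough sketch of the parsing half of the mini-project described above. It assumes only the simple one-line "poll <host> protocol <proto> ..." form of a fetchmail run-control file, and it ignores multi-line entries, defaults blocks and everything else. The option names remote_servers and listen_ports, the starting port 8110, and the choice to treat entries without a protocol as POP3 are placeholders picked for illustration, not confirmed proxy settings or fetchmail behaviour. Moving the old fetchmail file aside and writing the replacement is left out.

```python
import shlex


def parse_fetchmail_polls(text):
    """Collect (host, protocol) pairs from one-line 'poll'/'server' entries."""
    servers = []
    for line in text.splitlines():
        # shlex copes with quoted values and drops '#' comments.
        words = shlex.split(line, comments=True)
        if len(words) < 2 or words[0] not in ("poll", "server"):
            continue
        host, rest = words[1], words[2:]
        proto = "pop3"  # assumption: treat entries without a protocol as POP3
        for key in ("protocol", "proto"):
            if key in rest and rest.index(key) + 1 < len(rest):
                proto = rest[rest.index(key) + 1].lower()
        servers.append((host, proto))
    return servers


def proxy_config(servers, first_port=8110):
    """Build an ini-style fragment giving each POP3 host its own local port."""
    hosts = [host for host, proto in servers if proto == "pop3"]
    ports = [str(first_port + i) for i in range(len(hosts))]
    return "\n".join([
        "[pop3proxy]",
        "remote_servers: " + ",".join(hosts),  # hypothetical option name
        "listen_ports: " + ",".join(ports),    # hypothetical option name
    ])


if __name__ == "__main__":
    sample = 'poll mail.example.com protocol pop3 username "joe" password "x"\n'
    print(proxy_config(parse_fetchmail_polls(sample)))
```

The rewritten fetchmail file would then poll localhost on the generated ports instead of the real servers; IMAP entries are parsed but skipped by proxy_config in this sketch.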
Anthony
--
Anthony Baxter
It's never too late to have a happy childhood.

From m.leck at ntlworld.com Mon May 12 15:50:26 2003
From: m.leck at ntlworld.com (Mark Leck)
Date: Mon May 12 09:50:26 2003
Subject: [Spambayes] Win98 2nd Ed
Message-ID:

Hi,

Does the Bayesian spam filter run on Win98 2nd edition? I am having problems because the DLL is failing to register using regsvr32.

Mark Leck

From mail at peterbe.com Mon May 12 16:21:50 2003
From: mail at peterbe.com (Peter Bengtsson)
Date: Mon May 12 10:50:32 2003
Subject: [Spambayes] Suddenly a lot of errors
Message-ID: <5.2.0.9.0.20030512151849.0223bd98@mail.grenna.net>

I installed the 1.0a2 from sourceforge yesterday and I use the pop3proxy so my Eudora now connects to localhost instead. Yesterday everything was working fine and no errors were reported. Great work guys!

Today I get these errors:

C:\Python22\spambayes>C:\Python22\python.exe pop3proxy.py
Loading database... Done.
Listener on port 110 is proxying mail.grenna.net:110
User interface url is http://localhost:8880/
error: uncaptured python exception, closing channel <__main__.ServerLineReader connected at 0x15d5838> (exceptions.EOFError: [C:\Python22\lib\asyncore.py|poll|99] [C:\Python22\lib\asyncore.py|handle_read_event|396] [C:\Python22\lib\asynchat.py|handle_read|130] [pop3proxy.py|found_terminator|199] [pop3proxy.py|onServerLine|267] [pop3proxy.py|onResponse|341] [pop3proxy.py|onTransaction|437] [pop3proxy.py|onRetr|484] [spambayes\classifier.py|chi2_spamprob|217] [spambayes\classifier.py|_getclues|437] [spambayes\storage.py|_wordinfoget|192] [C:\Python22\lib\shelve.py|get|66] [C:\Python22\lib\shelve.py|__getitem__|71])

Any ideas what this might mean or what I have done to deserve this?

WinXP Pro, Python 2.2, what else do you need to know?
From skip at pobox.com Mon May 12 11:05:04 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon May 12 11:05:29 2003 Subject: [Spambayes] SpamBayes questions In-Reply-To: <151667238568.20030512151355@getsoft.com> References: <056b01c31832$3c818b20$530f8490@eden> <151667238568.20030512151355@getsoft.com> Message-ID: <16063.47136.107010.364491@montanaro.dyndns.org> Neville> I'm really very surprised that none of the Spam packages I've Neville> seen have any option to skip "spam" messages. It just seems Neville> like such an obvious thing to do to me. The problem is that no spam filter is perfect, so there will eventually be false positives. Not delivering a valid message because the spam filter thought it looked like spam is only a hop, skip and a jump away from deleting it altogether. Someone is bound to complain, "you're deleting valid email". That puts egg on the spam filter's face (when all it wants is spam). At the very least, in your setup you'd have to give everyone two mailboxes (or a mailbox and a web mail interface), one for presumed ham, one for presumed spam. Guess how frequently the spam box will get downloaded? After awhile, disk space will be a problem (quota exceeded?) and the hosting company will stop accepting email for the user altogether. Or something like that. In short, deleting or detouring spam is generally not a viable option. Skip From tim.one at comcast.net Mon May 12 14:34:13 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon May 12 13:37:06 2003 Subject: [Spambayes] Win98 2nd Ed In-Reply-To: Message-ID: [Mark Leck] > Does the bayesian spam filter run on Win98 2nd edition? I am > having problems because the dll is failing to register using regsvr32. I use the spambayes Outlook add-in on two different Win98SE boxes without problems, although I haven't updated the code lately. We're going to need more details, like what you downloaded, what you did, and what happened. First thing to check: are you running Outlook 2000? 
In particular, the add-in doesn't work with Outlook Express. From popiel at wolfskeep.com Mon May 12 13:52:22 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Mon May 12 15:52:27 2003 Subject: [Spambayes] Does anyone care about this report? In-Reply-To: Message from David Abrahams of "Sun, 11 May 2003 22:57:39 EDT." References: Message-ID: <20030512195222.BE3A02DDC4@cashew.wolfskeep.com> In message: David Abrahams writes: > >I spent considerable time preparing this several weeks ago, with the >promise that it would be made available somewhere that would be of >use to people... but got no reply. > >http://www.boost-consulting.com/writing/server-side.html > >I'd also appreciate it if someone could address the question at the >end. Well, I can't do anything about the availability. I can try answering the question (about concurrent training and classification), though I am not an authority. The last time I checked, spambayes did nothing at all in the way of file locking. This (or rather similar issues dealing with the viability of multiple threads) was rather actively debated on the mailing list a few months back, and the general consensus seemed to be that having only one thread and no locking was the least painful way to go. This does mean that training and classification could have problems if done concurrently... however, since classification is generally read-only access, the problems may be only in fetching minutely stale data, instead of database corruption (but don't quote me on that, since I haven't looked into the db implementation). For my personal use, I have procmail do locking to keep multiple instances from running at the same time, and I also have my nightly retrain go into a fresh db, which I then rename over the old db (an atomic process at the filesystem level). In this way, I hope to avoid any database issues. If I was extra-paranoid, I'd make the nightly retrain grab the same lock that procmail uses when doing the rename... 
but I'm too lazy to be that persnickety. My laziness is encouraged by the fact that if something messes up, I can just retrain from scratch again, just as is done every night. I only had to do that once, and I have no particular reason to believe that it messed up due to concurrent access issues.

- Alex

From popiel at wolfskeep.com Mon May 12 14:01:04 2003
From: popiel at wolfskeep.com (T. Alexander Popiel)
Date: Mon May 12 16:01:07 2003
Subject: [Spambayes] Header cropping
In-Reply-To: Message from Harald Hanche-Olsen of "Sat, 10 May 2003 19:03:40 +0200." <20030510190340Y.hanche@math.ntnu.no>
References: <20030510145457A.hanche@math.ntnu.no> <20030510190340Y.hanche@math.ntnu.no>
Message-ID: <20030512200104.6CD9D2DDC4@cashew.wolfskeep.com>

In message: <20030510190340Y.hanche@math.ntnu.no> Harald Hanche-Olsen writes:
[ in reference to header-eating problems ]
>Hammie. Sorry, I should have said.

As others have said, hammie eats the headers if there is a message parsing failure. This is fairly easy to cause with a malformed MIME section. I mentioned this on list a couple times a few months ago (including a message which causes it to happen), and the general reaction at the time was that Barry should fix the non-strict MIME parser in the email package, rather than making the error handling in spambayes itself more robust. This hasn't been a high-priority task, given that it's really only idiots like me who hand-craft MIME messages (with the occasional error) who are sending malformed messages to trigger the problem.

- Alex

From admin at sandor.net Mon May 12 13:53:20 2003
From: admin at sandor.net (admin)
Date: Mon May 12 16:07:37 2003
Subject: [Spambayes] New mail icon.
Message-ID: <002001c318c0$2468eef0$6601a8c0@wirelessodufzz>

Hello,

I did a quick search on the spambayes mailing list and saw this had been brought up and rejected - so I'll be brief.

I like the new mail icon in the tray that Outlook creates. It has always worked (fairly) well for me so I *do* use it.
I work from home - so having my computer notify me when I have new mail is a nicety - false positives from mail marked as spam invalidate the notification (obviously). If this is going to be the case - I might as well close Outlook after every session. As I would like to be faster on my email responses this is not realistic - but neither is running to my computer every time I get some spam (much of the time I am too far to see the icon - but the audio notification is still useful).

Since I am not the first person to bring this up (indeed, the previous person even made some code for a patch) I suspect there are more users that have the same issue. Thus I felt I should speak up.

Thank you for a top notch app. I'll keep using spambayes and if I have to - I'll figure out how to monkey some vb into outlook to fix the icon, so I guess I'm just trying to save myself some work :-) If I was handy with Python I'd try my hand at contributing - but alas, I know nothing.

-sándor

P.S. This might have been more appropriate in the mailing list but I didn't want to subscribe just to give this one suggestion. Sorry.

From noreply at sourceforge.net Mon May 12 15:14:41 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon May 12 17:14:47 2003
Subject: [Spambayes] [ spambayes-Bugs-736677 ] Outlook add-in fails to load in Outlook 2002 (XP)
Message-ID:

Bugs item #736677, was opened at 2003-05-12 23:14
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=736677&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Oliver Carr (ollieuk)
Assigned to: Mark Hammond (mhammond)
Summary: Outlook add-in fails to load in Outlook 2002 (XP)

Initial Comment:
When trying to load the Outlook Add-in in Outlook 2002, this fails, pulling down all other installed COM Add-ins at the same time.
The toolbar extensions (drop-down menu etc.) are not displayed. Outlook reports the following error for the Binary add-in: "Not loaded. A runtime error occurred during the loading of the COM add-in."

This behaviour is reproducible with both the Binary and the Source Code versions. This behaviour is independent of the user logged on.

Platform: WinXP Pro SP1, Office XP SP2, spambayes 1.0a2 / Binary 002.

Here is the trace; the log file from the binary is attached:

Outlook Spam Addin module loading
SpamAddin - Connecting to Outlook
Created new configuration file 'E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\default_configuration.pck'
Traceback (most recent call last):
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\universal.py", line 170, in dispatch
    retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None)
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_
    return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider)
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_
    return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider)
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_
    return apply(func, args)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\addin.py", line 611, in OnConnection
    self.manager = manager.GetManager(application)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\manager.py", line 335, in GetManager
    _mgr = BayesManager(outlook=outlook, verbose=verbose)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\manager.py", line 79, in __init__
    import_core_spambayes_stuff(self.ini_filename)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\manager.py", line 46, in import_core_spambayes_stuff
    from spambayes import classifier
  File "C:\Program
Files\Python22\Lib\site- packages\spambayes\classifier.py", line 40, in ? from spambayes.Options import options File "C:\Program Files\Python22\Lib\site- packages\spambayes\Options.py", line 557, in ? options.mergefilelike(d) File "C:\Program Files\Python22\Lib\site- packages\spambayes\Options.py", line 517, in mergefilelike self._update() File "C:\Program Files\Python22\Lib\site- packages\spambayes\Options.py", line 535, in _update value = getattr(c, fetcher)(section, option) File "C:\Program Files\Python22\lib\ConfigParser.py", line 306, in getfloat return self.__get(section, float, option) File "C:\Program Files\Python22\lib\ConfigParser.py", line 300, in __get return conv(self.get(section, option)) exceptions.ValueError: invalid literal for float(): 0.20 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=736677&group_id=61702 From readonly at getsoft.com Tue May 13 08:37:17 2003 From: readonly at getsoft.com (Neville Franks) Date: Mon May 12 17:37:11 2003 Subject: [Spambayes] SpamBayes questions In-Reply-To: <16063.47136.107010.364491@montanaro.dyndns.org> References: <056b01c31832$3c818b20$530f8490@eden> <151667238568.20030512151355@getsoft.com> <16063.47136.107010.364491@montanaro.dyndns.org> Message-ID: <140726241099.20030513073717@getsoft.com> SM> Neville> I'm really very surprised that none of the Spam packages I've SM> Neville> seen have any option to skip "spam" messages. It just seems SM> Neville> like such an obvious thing to do to me. SM> The problem is that no spam filter is perfect, so there will eventually be SM> false positives. Not delivering a valid message because the spam filter SM> thought it looked like spam is only a hop, skip and a jump away from SM> deleting it altogether. Someone is bound to complain, "you're deleting SM> valid email". That puts egg on the spam filter's face (when all it wants is SM> spam). 
SM> At the very least, in your setup you'd have to give everyone two mailboxes
SM> (or a mailbox and a web mail interface), one for presumed ham, one for
SM> presumed spam. Guess how frequently the spam box will get downloaded?
SM> After awhile, disk space will be a problem (quota exceeded?) and the hosting
SM> company will stop accepting email for the user altogether. Or something
SM> like that.

I think you are missing my main point, and that is to enable the option of not downloading spam at certain specific times, such as when I'm in a hotel paying exorbitant connection charges. As soon as I'm back in the office I'd disable this option and handle the spam as necessary.

That said, if SpamBayes is running remotely with my mail server and the day comes that I'm confident SB "never or rarely" comes up with false positives, then I may be quite happy for SB to delete everything it recognizes as spam so I don't have to waste precious time checking and deleting it.

Finally, I control all of my email accounts, mail server, web server etc. and know precisely what I am doing, so the mail server quota issue etc. wouldn't become an issue. If I were looking after email for "other" users things may be different.

The idea of having separate ports to access spam, ham etc., as I suggested and Tony Meyer has discussed, appears to be a simple enough way to achieve this and doesn't require separate mailboxes.

--
Best regards,
Neville Franks, Author of ED for Windows - the programmers editor, an indispensable tool for great software development.
http://www.getsoft.com
Version 4.02 now available. New: Projects, Source Database, Difference Analysis....

From m.leck at ntlworld.com Mon May 12 23:42:57 2003
From: m.leck at ntlworld.com (Mark Leck)
Date: Mon May 12 17:41:21 2003
Subject: [Spambayes] Problem getting the filter to register the add in
Message-ID:

Hi,

I have the following problem with getting the filter to run on my desktop. I am using Outlook 2000 (9.0.0.2711).
I followed the link from http://starship.python.net/crew/mhammond/spambayes and downloaded the exe file and ran it. The Add-In was installed into Outlook and appears in the 'COMM Add Ins' section of the Tools->Options menu but is not ticked and every time I tick it and re open the box it is unticked again. I tried to register the dll manually at this point and get an error when trying to register - 'DllRegisterServer in spambayes_addin.dll failed. Return code was 0xc0000005'. Can anyone help with this problem? Regards, Mark From readonly at getsoft.com Tue May 13 09:09:05 2003 From: readonly at getsoft.com (Neville Franks) Date: Mon May 12 18:08:44 2003 Subject: [Spambayes] SpamBayes questions In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130181C7A8@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130181C7A8@its-xchg4.massey.ac.nz> Message-ID: <115728149213.20030513080905@getsoft.com> >> I have approx. 6 @getsoft.com I want to filter, and possibly >> some others. Seeing I only discovered Spambayes yesterday and >> installed it today, I really don't have much of an >> understanding how it goes about doing what it does. MT> :) A reason things get tricky with multiple users is that you have to MT> decide whether they are going to share a database (and hence, a common MT> definition of spam/ham). This leads to issues about storing separate MT> databases, or issues about training shared databases via separate MT> interfaces (so that not all mail is seen by all users). MT> The multiple users/server question is asked so often that it would be MT> really great (IMO) if we did have an example of how to do it (just MT> making arbitrary decisions, and pointing out where they were). If I MT> find time and finish the other things I'm meant to be doing ;) then I MT> might play around with this (it would be time consuming to set up, MT> though since I would have to create a test mail server on my system, MT> since I don't have access to one). 
MT> Anyway, in the meantime you can see how things go with the local system. I appreciate the problems here, but I don't consider this an issue in my specific case. My 6 aforementioned email accounts predominately get the same spam, so having one filter for all accounts isn't an issue. That said I can see how this could be a problem for others and having some way to handle it would be very nice. >> Typically my email client is setup to delete mail as it >> downloads it. Is it feasible to have a mode/setting which >> says don't pass spam on to the email client and also don't >> delete it from the mail server. If the email client never >> sees the spam in this special mode then it won't delete it >> and it will just sit there. Ideally this "mode" could be set >> by the Web interface. I could still use WebMail to look at >> the mail back on the server if I really wanted to. MT> What I could see as an 'easy' option would be this: MT> * your mail program can connect to pop.example.com:110 and all mail MT> waiting is retrieved normally (and you probably delete it all from the MT> server as well) MT> * your mail program can connect to pop.example.com:111 and only MT> ham/unsures are downloaded; all spam (and false positives ;) are left on MT> the server (and you probably, again, delete all the mail you download, MT> since you have a local copy of it). MT> * your web browser can connect to pop.example.com:80 and see the web MT> interface, which lets you correct any misclassifications, review waiting MT> (spam) messages, and the like. If you reclassify a message from spam to MT> ham, then it will be downloaded next time your mail program connects to MT> :111. MT> This way you would only need to connect to :111 and occasionally review MT> via the web ui. Alternatively, you could normally connect via :110 (and MT> filter based on the headers, perhaps), and connect to :111 when you are MT> on a dialup, or a mobile, or whatever. 
The mail clients connecting to
MT> :111 would have no idea that there was more mail there (or, perhaps,
MT> there could be an option to download a stripped version of the message
MT> so that you know that there is mail that you can check, or maybe a
MT> single summary message occasionally).

MT> It's not that tricky a project (IMO), if the shared user logistics
MT> mentioned above are resolved.

This sounds great to me. The only hassle on my part is having to change the 6 email account properties to a different port and back, but I can happily live with that.

A nice enhancement would be to have, say, a :112 and let the Web interface specify whether this has the behavior of your :110 or :111 port. Then I could always use :112 and simply hop into the Web interface to flip the behavior back and forth instead of having to change all my email accounts.

Another very nice feature for the pop.example.com:80 Web interface would be an option to delete some/all emails flagged as spam from the mail server.

Add all this together and my life might start to return to normal (whatever that is).

--
Best regards,
Neville Franks, Author of ED for Windows - the programmers editor, an indispensable tool for great software development.

From skip at pobox.com Mon May 12 18:15:21 2003
From: skip at pobox.com (Skip Montanaro)
Date: Mon May 12 18:15:35 2003
Subject: [Spambayes] SpamBayes questions
In-Reply-To: <140726241099.20030513073717@getsoft.com>
References: <056b01c31832$3c818b20$530f8490@eden> <151667238568.20030512151355@getsoft.com> <16063.47136.107010.364491@montanaro.dyndns.org> <140726241099.20030513073717@getsoft.com>
Message-ID: <16064.7417.687631.115533@montanaro.dyndns.org>

Neville> I think you are missing my main point and that is to enable the
Neville> option of not downloading spam at certain specific times, such
Neville> as when I'm in a hotel paying exorbitant connection charges.
As
Neville> soon as I'm back in the office I'd disable this option and
Neville> handle the spam as necessary.

I think I understood fairly well what you were asking. I believe you were asking about running the filter server side, then picking up your mail in the usual fashion. If you're talking to a vanilla POP server how do you tell it to "not download spam at certain specific times"? Can this be accomplished with more control over the tool chain? Yes. You mentioned the possibility of modifying pop3proxy to listen to port 111 as well as port 110, with connections to the former hiding possible spam messages. I'm not sure I'd trust garden variety users to toggle their POP port reliably. Some email clients (like Eudora) only allow you to set a single POP port used to connect to all possible POP servers, so if you have multiple POP drops you'd be scrod. (The dual port solution also seems like a gross hack. If you're going to use pop3proxy to do your bidding, it would be better to simply have a "hide_spam_temporarily" flag.)

Neville> That said if SpamBayes is running remotely with my mail server
Neville> and the day comes that I'm confident SB "never or rarely" comes
Neville> up with false positives, then I may be quite happy for SB to
Neville> delete everything it recognizes as spam so I don't have to
Neville> waste precious time checking and deleting it.

Please, no. SpamAssassin also only filters messages. I recall seeing it slammed in public forums because some system managers got the bright idea to configure the downstream procmail to delete messages whose scores were "high enough". Naturally enough, SA got blamed. While we can't prevent sys admins from hanging themselves, we ought to at least make them build the gallows and buy the rope.

Neville> Finally I control all of my email accounts, mail server, web
Neville> server etc. and know precisely what I am doing, so the mail
Neville> server quota issue etc. wouldn't become an issue.
If I were Neville> looking after email for "other" users things may be different. Yeah, I control everything about my email account as well. You and I are likely not to be garden variety users though. I've also been using SpamBayes longer than most people on this list. I still don't delete *any* spams. (I do encounter false positives, though they are rare.) Messages classified as spam get tossed into two mailboxes by procmail, one for those with a score of 0.99 or 1.00, and all others which exceed my spam_cutoff (around 0.88). The only practical difference is the "high spam" mail gets scanned less frequently and more quickly. Neville> The idea of having separate ports to access spam, ham etc. as I Neville> suggested and Tony Meyer's has discussed appears to be a simple Neville> enough way to achieve this and doesn't require separate Neville> mailboxes. See my comment above about what normal users are capable of. Skip From readonly at getsoft.com Tue May 13 09:18:52 2003 From: readonly at getsoft.com (Neville Franks) Date: Mon May 12 18:18:23 2003 Subject: [Spambayes] Web Interface enhancements Message-ID: <104728735966.20030513081852@getsoft.com> I have a couple of enhancement suggestions to the very nice Web interface. First in "Review messages" can the "To:" account be displayed. I'm using 6 accounts and it would help me to know which account an email is destined to. And again in "Review messages" a return to "Review messages" option would be nice in the "Trained" page. At present you have to "Return Home" and then click "Review messages" again. Even better would be to return to "Review messages" after pressing "Train" and include a "Refresh" option, then I could just happily sit in "Review messages". Thanks for listening. -- Best regards, Neville Franks, Author of ED for Windows - the programmers editor, an indispensable tool for great software development. http://www.getsoft.com Version 4.02 now available. New: Projects, Source Database, Difference Analysis.... 
From readonly at getsoft.com Tue May 13 09:37:34 2003 From: readonly at getsoft.com (Neville Franks) Date: Mon May 12 18:37:19 2003 Subject: [Spambayes] SpamBayes questions In-Reply-To: <16064.7417.687631.115533@montanaro.dyndns.org> References: <056b01c31832$3c818b20$530f8490@eden> <151667238568.20030512151355@getsoft.com> <16063.47136.107010.364491@montanaro.dyndns.org> <140726241099.20030513073717@getsoft.com> <16064.7417.687631.115533@montanaro.dyndns.org> Message-ID: <190729857649.20030513083734@getsoft.com> SM> Neville> I think you are missing my main point and that is to enable the SM> Neville> option of not downloading spam at certain specific times, such SM> Neville> as when I'm in a hotel paying exorbitant connection charges. As SM> Neville> soon as I'm back in the office I'd disable this option and SM> Neville> handle the spam as necessary. SM> I think I understood fairly well what you were asking. I believe you were SM> asking about running the filter server side, then picking up your mail in SM> the usual fashion. If you're talking to a vanilla POP server how do you SM> tell it to "not download spam at certain specific times"? Can this be SM> accomplished with more control over the tool chain? Yes. You mentioned the SM> possibility of modifying pop3proxy to listen to port 111 as well as port SM> 110, with connections to the former hiding possible spam messages. I'm not SM> sure I'd trust garden variety users to toggle their POP port reliably. Some SM> email clients (like Eudora), only allow you to set a single POP port used to SM> connect to all possible POP servers, so if you have multiple POP drops you'd SM> be scrod. (The dual port solution also seems like a gross hack. If you're SM> going to use pop3proxy to do your bidding, it would better to simply have a SM> "hide_spam_temporarily" flag.) A "hide_spam_temporarily" option that I could set in the Web interface would be an ideal solution for me. 
I'm sure there are a lot of people on the road who would welcome a simple option to temporarily "defer" downloading spam.

SM> Neville> That said if SpamBayes is running remotely with my mail server
SM> Neville> and the day comes that I'm confident SB "never or rarely" comes
SM> Neville> up with false positives, then I may be quite happy for SB to
SM> Neville> delete everything it recognizes as spam so I don't have to
SM> Neville> waste precious time checking and deleting it.

SM> Please, no. SpamAssassin also only filters messages. I recall seeing it
SM> slammed in public forums because some system managers got the bright idea to
SM> configure the downstream procmail to delete messages whose scores were "high
SM> enough". Naturally enough, SA got blamed. While we can't prevent sys
SM> admins from hanging themselves, we ought to at least make them build the
SM> gallows and buy the rope.

I would never suggest anyone else do this; however, no one should stop me if that's what I really want to do. Over a weekend I can easily get 100+ spam emails a day, with maybe 1 or 2 ham mixed in. It is just a nightmare and an enormous waste of time to manage.

SM> Neville> Finally I control all of my email accounts, mail server, web
SM> Neville> server etc. and know precisely what I am doing, so the mail
SM> Neville> server quota issue etc. wouldn't become an issue. If I were
SM> Neville> looking after email for "other" users things may be different.

SM> Yeah, I control everything about my email account as well. You and I are
SM> likely not to be garden variety users though. I've also been using
SM> SpamBayes longer than most people on this list. I still don't delete *any*
SM> spams. (I do encounter false positives, though they are rare.) Messages
SM> classified as spam get tossed into two mailboxes by procmail, one for those
SM> with a score of 0.99 or 1.00, and all others which exceed my spam_cutoff
SM> (around 0.88).
The only practical difference is the "high spam" mail gets SM> scanned less frequently and more quickly. SM> Neville> The idea of having separate ports to access spam, ham etc. as I SM> Neville> suggested and Tony Meyer's has discussed appears to be a simple SM> Neville> enough way to achieve this and doesn't require separate SM> Neville> mailboxes. SM> See my comment above about what normal users are capable of. To be honest I really haven't considered what normal users may or may not be capable of, I'm selfishly just looking for a solution to my problem, one that many others I'm sure would like to use. -- Best regards, Neville Franks, Author of ED for Windows - the programmers editor, an indispensable tool for great software development. http://www.getsoft.com Version 4.02 now available. New: Projects, Source Database, Difference Analysis.... From mhammond at skippinet.com.au Tue May 13 09:42:31 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Mon May 12 18:43:22 2003 Subject: [Spambayes] Win98 2nd Ed In-Reply-To: Message-ID: <06c001c318d7$c82c0830$530f8490@eden> Unfortunately, the binary version is known to not work on Windows 98. The source-code version does, and hopefully the next version of the binary will too :) Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Mark Leck > Sent: Monday, 12 May 2003 11:50 PM > To: spambayes@python.org > Subject: [Spambayes] Win98 2nd Ed > > > Hi, > > Does the bayesian spam filter run on Win98 2nd edition? I am > having problems > because the dll is failing to register using regsvr32. 
> > Mark Leck > > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes From noreply at sourceforge.net Mon May 12 16:45:26 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 12 18:45:31 2003 Subject: [Spambayes] [ spambayes-Bugs-736677 ] Outlook add-in fails to load in Outlook 2002 (XP) Message-ID: Bugs item #736677, was opened at 2003-05-13 07:14 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=736677&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Oliver Carr (ollieuk) Assigned to: Mark Hammond (mhammond) Summary: Outlook add-in fails to load in Outlook 2002 (XP) Initial Comment: When trying to load the Outlook Add-in in Outlook 2002, this fails, pulling down all other installed COM Add-ins at the same time. The toolbar extensions (drop-down menu etc.( are not displayed. Outlook reports the following error for the Binary add- in: "Not loaded. A runtime error occured during the loading of the COM add-in." This behaviour is reproducible for both the Binary, as for the Source Code versions. This behaviour is independent of the user logged on. Platform: WinXP Pro SP1, Office XP SP2, spambayes 1.02a / Binary 002. 
Here is the trace; the log file from the binary is attached:

Outlook Spam Addin module loading
SpamAddin - Connecting to Outlook
Created new configuration file 'E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\default_configuration.pck'
Traceback (most recent call last):
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\universal.py", line 170, in dispatch
    retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None)
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_
    return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider)
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_
    return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider)
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_
    return apply(func, args)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\addin.py", line 611, in OnConnection
    self.manager = manager.GetManager(application)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\manager.py", line 335, in GetManager
    _mgr = BayesManager(outlook=outlook, verbose=verbose)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\manager.py", line 79, in __init__
    import_core_spambayes_stuff(self.ini_filename)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\manager.py", line 46, in import_core_spambayes_stuff
    from spambayes import classifier
  File "C:\Program Files\Python22\Lib\site-packages\spambayes\classifier.py", line 40, in ?
    from spambayes.Options import options
  File "C:\Program Files\Python22\Lib\site-packages\spambayes\Options.py", line 557, in ?
    options.mergefilelike(d)
  File "C:\Program Files\Python22\Lib\site-packages\spambayes\Options.py", line 517, in mergefilelike
    self._update()
  File "C:\Program Files\Python22\Lib\site-packages\spambayes\Options.py", line 535, in _update
    value = getattr(c, fetcher)(section, option)
  File "C:\Program Files\Python22\lib\ConfigParser.py", line 306, in getfloat
    return self.__get(section, float, option)
  File "C:\Program Files\Python22\lib\ConfigParser.py", line 300, in __get
    return conv(self.get(section, option))
exceptions.ValueError: invalid literal for float(): 0.20

----------------------------------------------------------------------

>Comment By: Mark Hammond (mhammond)
Date: 2003-05-13 08:45

Message:
Logged In: YES
user_id=14198

Can you please try the latest version from CVS - this bug has been fixed (but is not yet in any of the binaries). The bigger issue we are dealing with can be tracked at
https://sourceforge.net/tracker/index.php?func=detail&aid=725466&group_id=61702&atid=498103

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=736677&group_id=61702

From skip at pobox.com Mon May 12 18:58:59 2003
From: skip at pobox.com (Skip Montanaro)
Date: Mon May 12 18:59:06 2003
Subject: [Spambayes] SpamBayes questions
In-Reply-To: <190729857649.20030513083734@getsoft.com>
References: <056b01c31832$3c818b20$530f8490@eden> <151667238568.20030512151355@getsoft.com> <16063.47136.107010.364491@montanaro.dyndns.org> <140726241099.20030513073717@getsoft.com> <16064.7417.687631.115533@montanaro.dyndns.org> <190729857649.20030513083734@getsoft.com>
Message-ID: <16064.10035.957777.317032@montanaro.dyndns.org>

SM> Please, no. SpamAssassin also only filters messages. I recall
SM> seeing it slammed in public forums because some system managers got
SM> the bright idea to configure the downstream procmail to delete
SM> messages whose scores were "high enough".
Naturally enough, SA got
SM> blamed. While we can't prevent sys admins from hanging themselves,
SM> we ought to at least make them build the gallows and buy the rope.

Neville> I would never suggest anyone else do this, however no one
Neville> should stop me, if that's what I really want to do. Over a
Neville> weekend I can easily get 100+ spam emails a day, with maybe 1
Neville> or 2 ham mixed in. It is just a nightmare and enormous waste of
Neville> time to manage.

That's why God invented procmail. ;-) Oh, that I only got 100 spams a day! I think I've had a couple hundred unique spams (that is, spams that don't look "the same" to my procmail filters) in the last 10 hours.

Neville> To be honest I really haven't considered what normal users may
Neville> or may not be capable of, I'm selfishly just looking for a
Neville> solution to my problem, one that many others I'm sure would
Neville> like to use.

I think "temporarily_hide_spam" would be more likely to be used successfully by mere mortals.

Skip

From noreply at sourceforge.net Mon May 12 17:33:58 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon May 12 19:34:12 2003
Subject: [Spambayes] [ spambayes-Bugs-725466 ] Include a proper locale fix in Options.py
Message-ID:

Bugs item #725466, was opened at 2003-04-22 18:07
Message generated for change (Comment added) made by anadelonbrin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Tony Meyer (anadelonbrin)
Assigned to: Mark Hammond (mhammond)
Summary: Include a proper locale fix in Options.py

Initial Comment:
When reading the options, the float() call fails when the locale is a language that uses a ',' as the decimal separator instead of '.'. This is hack-fixed in Outlook, but needs to be fixed in general.
I imagine that there must be some sort of locale call that will convert between the current locale and English, and that this should be called as the option is set. Anyway, I'll get to this when I can. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-13 11:33 Message: Logged In: YES user_id=552329 I've checked in a partial fix, where I just set the locale back to (US) English after that call (to work it needs this one, and the one in addin.py, it seems). It all seems to work ok now, but this is hardly a nice solution. I had one problem. (I changed my locale by using the XP control panel). * If I used locale.getlocale I received "de_DE". If I tried to set the locale to this, I get a "locale setting not supported" error. This meant that I couldn't try a nice solution that set the locale back to the local one. My testing setup is pretty much gone now, but if I find time I'll play around with this again, unless you manage to. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-11 06:35 Message: Logged In: YES user_id=14198 Damn. IIRC, MAPI also changes the cwd. I guess the solution may be the same - check the locale before setting, and reset explicitly afterwards. It would be great if you can check this while you are still setup to test, but otherwise it can stay on my todo list :) ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-09 15:23 Message: Logged In: YES user_id=552329 Ok, I did more digging. The locale set in addin.py isn't enough. This line in msgstore.py screws up the locale: self.session = mapi.MAPILogonEx(0, None, None, logonFlags) (it's in the __init__ of MAPIMsgStore; my line numbers are all screwed up). (by 'screws up the locale', I mean that math.log(2) returns a number with a '.' 
decimal separator before it, and a ',' decimal separator afterwards). I'll leave it to Mark to figure out what should be done.

----------------------------------------------------------------------

Comment By: Tony Meyer (anadelonbrin)
Date: 2003-05-09 15:15

Message:
Logged In: YES
user_id=552329

Ok, forget that, I've found a way I can reproduce a similar error. This is definitely Outlook specific, and definitely cannot be fixed in the spambayes code, except by changing the locale to 'en' or 'c', or something similar.

Ignoring spambayes completely, and using the demo outlook plugin that comes with the win32com extensions, I printed out the results of math.log(2). This should be "0.69314718056" and not "0,69314718056". I made this print statement pretty much every second line. When adding, it always gives the correct answer. As soon as Outlook is started up, it gives the wrong answer.

It's beyond me whether this is something that the win32com stuff does (although importing the extensions doesn't cause it), or if it's something that Outlook does, somehow. No doubt you can answer that, Mark? ;)

Anyway, unless a change to the win32com stuff occurs, this will have to remain in the spambayes Outlook code. (The options stuff is a separate problem, and my fix for that should work). It doesn't need to be outside the Outlook code though.

----------------------------------------------------------------------

Comment By: Tony Meyer (anadelonbrin)
Date: 2003-05-09 14:22

Message:
Logged In: YES
user_id=552329

What I don't understand is why I can't reproduce this error. If I make the same addition to manager.py, but use 'german'[1] as the locale, everything still works. (It also still works if I make the change in addin.py where the other locale set is, and also if I change the locale to german in the windows control panel).

In addition, why does the import from PythonWin work?
Surely this means (as mentioned) that something else that is imported is screwing up the locale setting before random gets imported - but then, shouldn't it do this on my machine, too?

Googling on this (NV_MAGICCONST) brings up lots of instances of the problem, but nothing in the way of a solution apart from setting the locale to 'en' or 'c' before the random import. This really doesn't seem like the correct solution. I really am tempted to try my luck with c.l.p for a solution, if there aren't any more ideas here.

[1] I can't set to "de_DE", I get a locale not supported error. "german" does use ',' as the decimal separator, so it should be the same, though.

----------------------------------------------------------------------

Comment By: Stefan Gründel (sgruendel)
Date: 2003-05-09 05:59

Message:
Logged In: YES
user_id=671479

Ok, I've got it working by setting the locale in the constructor of class BayesManager (manager.py):

[...]
        self.config_filename = config_base + "_configuration.pck"
        print "locale: ", locale.getlocale()
        print "default locale: ", locale.getdefaultlocale()
        locale.setlocale(locale.LC_NUMERIC, "en")
        # First read the configuration file.
[...]

Locale/default locale are both de_DE for me.

----------------------------------------------------------------------

Comment By: Stefan Gründel (sgruendel)
Date: 2003-05-09 05:45

Message:
Logged In: YES
user_id=671479

>>> import locale
>>> locale.setlocale(locale.LC_NUMERIC, "en")
'English_United States.1252'
>>> import random
>>>

----------------------------------------------------------------------

Comment By: Mark Hammond (mhammond)
Date: 2003-05-07 22:49

Message:
Logged In: YES
user_id=14198

What happens if you go into Pythonwin/IDLE, execute the locale call mentioned, then import that module?
>>> import locale
>>> locale.setlocale(locale.LC_NUMERIC, "en")

----------------------------------------------------------------------

Comment By: Stefan Gründel (sgruendel)
Date: 2003-05-07 21:42

Message:
Logged In: YES
user_id=671479

The import works fine both in IDLE and PythonWin. I suspect the handling of floats is somehow broken by something loaded by Outlook or Python Windows Extensions. But then I don't really know enough about the architecture to make educated guesses ...

Anyway I wonder why float handling at this low level seems to be locale dependent? Parsing text and converting to float is ok to be locale dependent, but doing calculations?

----------------------------------------------------------------------

Comment By: Tony Meyer (anadelonbrin)
Date: 2003-05-07 11:37

Message:
Logged In: YES
user_id=552329

This makes me wonder if the locale just needs to be set, but that seems too heavy-handed and ugly, really. What happens if you just execute

>>> import random

from the interpreter (e.g. in PythonWin or IDLE)? I suspect that you'll get the same trace, in which case this really should be fixed in random.py, not in Spambayes (although it would be easy enough to throw together a hack for it).

----------------------------------------------------------------------

Comment By: Stefan Gründel (sgruendel)
Date: 2003-05-07 02:18

Message:
Logged In: YES
user_id=671479

It seems like it's not enough to fix Options.py only.
I did this and then got the following error:

Traceback (most recent call last):
  File "D:\dev\Python22\lib\site-packages\win32com\universal.py", line 170, in dispatch
    retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_
    return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_
    return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider)
  File "D:\dev\Python22\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_
    return apply(func, args)
  File "D:\util\spambayes-1.0a2\Outlook2000\addin.py", line 611, in OnConnection
    self.manager = manager.GetManager(application)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 335, in GetManager
    _mgr = BayesManager(outlook=outlook, verbose=verbose)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 79, in __init__
    import_core_spambayes_stuff(self.ini_filename)
  File "D:\util\spambayes-1.0a2\Outlook2000\manager.py", line 53, in import_core_spambayes_stuff
    from spambayes.tokenizer import tokenize
  File "D:\util\spambayes-1.0a2\spambayes\tokenizer.py", line 7, in ?
    import email.Message
  File "D:\dev\Python22\Lib\email\Message.py", line 14, in ?
    from email import Utils
  File "D:\dev\Python22\Lib\email\Utils.py", line 10, in ?
    import random
  File "D:\dev\Python22\Lib\random.py", line 93, in ?
    _verify('NV_MAGICCONST', NV_MAGICCONST, 1.71552776992141)
  File "D:\dev\Python22\Lib\random.py", line 88, in _verify
    raise ValueError(
exceptions.ValueError: computed value for NV_MAGICCONST deviates too much (computed 2,82843, expected 1)

Obviously random.py uses ',' too!

I'm using German Windows/Outlook; if I can be of any help in testing, just drop me a mail.
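The workaround discussed in this tracker item (note the numeric locale, run the call that may disturb it, then put it back) can be sketched as follows. This is present-day Python, not the Python 2.2 code shown in the traces, and `numeric_locale` and `guarded_call` are hypothetical helper names rather than SpamBayes functions:

```python
import contextlib
import locale


@contextlib.contextmanager
def numeric_locale(name="C"):
    """Temporarily switch LC_NUMERIC, restoring the previous value on exit."""
    previous = locale.setlocale(locale.LC_NUMERIC)  # query only, no change
    locale.setlocale(locale.LC_NUMERIC, name)
    try:
        yield
    finally:
        locale.setlocale(locale.LC_NUMERIC, previous)


def guarded_call(func, *args, **kwargs):
    """Call func, then force LC_NUMERIC back to what it was beforehand.

    Meant for calls suspected of changing the process locale behind
    Python's back, as the MAPI logon is reported to do in this thread.
    """
    before = locale.setlocale(locale.LC_NUMERIC)
    try:
        return func(*args, **kwargs)
    finally:
        locale.setlocale(locale.LC_NUMERIC, before)
```

Whether restoring the locale after the logon call is sufficient inside Outlook is exactly what the comments above are trying to establish; the sketch only shows the save-and-restore pattern itself.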
---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-24 18:30 Message: Logged In: YES user_id=552329 Options.py now uses locale.atoi and locale.atof to convert options. I *think* this will solve this problem, but I'm not 100%. If someone could do some testing, that would be great. I'm leaving open until I'm sure it's done. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725466&group_id=61702 From tys at cs.ualberta.ca Tue May 13 01:25:29 2003 From: tys at cs.ualberta.ca (Tys von Gaza) Date: Tue May 13 02:26:28 2003 Subject: [Spambayes] Outlook Plugin Problems... Message-ID: <000201c31918$7da15360$b5f2b38e@gaza> I have been using the Outlook plugin on a CVS version dated from early November of last year. It has been working great but I finally decided that I really should update the code. I grabbed a CVS snap shot yesterday, closed outlook, moved my old spambayes dir (this probably caused the problems, btw I don't care about the database, can rebuild it). I then deleted all the old Spam fields from my folders (I remember this was causing a problem for some people awhile back when the field type changed or something) and ran the addin.py from the new code. The plugin worked when I first ran Outlook. However after running it for the 2nd time SpamBayes no longer seems to load. I can't get any debug info from the Trace Collector (even when rerunning addin.py). I remember there was some discussion a while back about some changes to the Outlook plugin that caused problems when upgrading, I think I might be running into these again. Any ideas? Thanks, Tys von Gaza tys@cs.ualberta.ca From mhammond at skippinet.com.au Wed May 14 00:45:03 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue May 13 09:45:55 2003 Subject: [Spambayes] Outlook Plugin Problems... 
In-Reply-To: <000201c31918$7da15360$b5f2b38e@gaza> Message-ID: <07a301c31955$dcaee5b0$530f8490@eden> > I remember there was some discussion a while back about some > changes to > the Outlook plugin that caused problems when upgrading, I > think I might > be running into these again. Any ideas? There are no known problems that will cause completely silent failure. Are you sure that the old Outlook process terminated? There should always be a startup message in the log, so completely silent is a new one on me. Did you try re-running "addin.py"? If all else fails, does running, say "Outlook2000\train.py" from the command-line work? Mark. From mhammond at skippinet.com.au Wed May 14 01:03:53 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue May 13 10:04:42 2003 Subject: [Spambayes] Problem getting the filter to register the add in In-Reply-To: Message-ID: <07ab01c31958$7d4fe3f0$530f8490@eden> > I have the following problem with getting the filter to run > on my desktop > > I am using Outlook 2000 (9.0.0.2711). > > I followed the link from > http://starship.python.net/crew/mhammond/spambayes > and downloaded the exe file and ran it. The Add-In was installed into > Outlook and appears in the 'COMM Add Ins' section of the > Tools->Options menu > but is not ticked and every time I tick it and re open the box it is > unticked again. I tried to register the dll manually at this > point and get > an error when trying to register - 'DllRegisterServer in > spambayes_addin.dll > failed. Return code was 0xc0000005'. Please see the "About.html" file, and try and locate a log file for the session. (The regsvr32 failing is a mystery though, but one step at a time :) Mark. From mhammond at skippinet.com.au Wed May 14 01:12:03 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue May 13 10:12:51 2003 Subject: [Spambayes] Does anyone care about this report? 
In-Reply-To: Message-ID: <07ae01c31959$a16e4cd0$530f8490@eden> > I spent considerable time preparing this several weeks ago, with the > promise that it would be made available somewhere that would be of > use to people... but got no reply. I'm sure I would find it useful if I used imap . I am an admin on the project though and can make changes to the web pages. However, like everyone I'm short on time to think <0.1 wink>. Do you have a suggestion for what the best thing to do with this is? I'd be happy to add a link to it, or include it directly, just say the word (and saying the word in a patch against the web pages in CVS would be perfect!) > I'd also appreciate it if someone could address the question at the > end. I think that the question can be paraphrased as "what bad things can happen if 2 spambayes processes are simultaneously reading from and writing to the same spambayes database?". If that is correct, I'm afraid I don't even know where to start Mark. From jmrodri at nc.rr.com Tue May 13 11:36:26 2003 From: jmrodri at nc.rr.com (Jesus M. Rodriguez) Date: Tue May 13 10:36:27 2003 Subject: [Spambayes] pop3proxy.py startup script In-Reply-To: <07ae01c31959$a16e4cd0$530f8490@eden> References: <07ae01c31959$a16e4cd0$530f8490@eden> Message-ID: <3EC102EA.2000407@nc.rr.com> Has anyone been able to get the pop3proxy.py to start at boot time? I'm using Red Hat Linux 8.0. I want to have it run on my Linux server while my Windows clients use the Linux server as their POP3 proxy. I've tried creating a script in init.d and running chkconfig --add pop3proxy. But when I try /etc/init.d/pop3proxy start I get a message stating Starting pop3proxy then it sits there. I need it to return to the command prompt so that other scripts can continue to start. I tried adding an ampersand & after the command: daemon pop3proxy.py -l 110 -u 8880 other params & This got it started, but then /etc/init.d/pop3proxy stop wouldn't work. Any help would greatly be appreciated. 
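One way to get the behaviour asked for here (start returns immediately, stop still works later) is to launch the proxy detached and record its PID. A minimal sketch in present-day Python for POSIX systems; `start_detached`, `stop_from_pid_file` and the PID-file path are hypothetical names, and this is neither the Red Hat `daemon` shell function nor anything shipped with SpamBayes:

```python
import os
import signal
import subprocess


def start_detached(argv, pid_file):
    """Start argv in its own session, record its PID, and return at once."""
    proc = subprocess.Popen(
        argv,
        stdin=subprocess.DEVNULL,
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
        start_new_session=True,  # detach from the init script's terminal
    )
    with open(pid_file, "w") as handle:
        handle.write(str(proc.pid))
    return proc.pid


def stop_from_pid_file(pid_file):
    """Send SIGTERM to the recorded PID and remove the PID file."""
    with open(pid_file) as handle:
        pid = int(handle.read().strip())
    os.kill(pid, signal.SIGTERM)
    os.remove(pid_file)
    return pid
```

An init script's start action could then call something like `start_detached(["pop3proxy.py", "-l", "110", "-u", "8880"], "/var/run/pop3proxy.pid")` and its stop action `stop_from_pid_file` with the same path; the PID-file location is an assumption.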
Sincerely,
Jesus Rodriguez

From seant at webreply.com Tue May 13 14:52:49 2003
From: seant at webreply.com (Sean True)
Date: Tue May 13 14:00:14 2003
Subject: [Spambayes] Outlook plugin -- Blatant, vaguely commercial offer.
Message-ID:

Well, sort of commercial.

After some initial conversation with Mark Hammond, and considerable jawboning from my own internal customers, I've been working on a productized version of the Outlook plugin, code named SpamAtBay. It has had some UI work, and documentation changes, and some significant functional changes (see the CHANGES.txt below).

We're making an effort to QA the resultant codebase on Windows 98, ME, NT4, 2000, and XP and against Outlook 2000, 2002, XP and 2003. At this point only Outlook 2003 is a known non-starter.

Anyone who is a contributor to SpamBayes gets a license to the shipping product, for their own use, for the asking. Anyone else who reads this and asks in the next few weeks can get a beta version, and a license for the shipping product just for helping with the beta process. Just send me email: seant@webreply.com

I should point out: the core performance of the system is unchanged: it works no better (and, I hope, no worse) than SpamBayes for detecting spam. It's our hope, however, that we can get over the "Can Tim's sister use it?" barrier.

The databases should be upward compatible, and we currently use the same GUID and registry keys as the basic plugin. The database should also be backwards compatible, although that is untested.

This project is certainly not endorsed by Mark, but he didn't tell me to soak my head, either. The embarrassingly small number of bug fixes we've sent him indicate how good his part of this is. Any remaining problems are likely to be introduced by me!

I should add, I suppose, that additional code, documentation, and databases are copyright and are not available as part of the open source release. We will contribute back as many bug fixes as we can.
-- Sean

CHANGES:

Release 1.0 Beta Candidate 5 (0.94)
1) Addresses shutdown issues with lingering Outlook
2) Internal logging and profiling support
3) Initial detection logic for Outlook 2003 (not supported)
4) Addresses some potential issues in Outlook security model
5) Sets registration friendly name and description
6) Changes "Show spam clues" format to be user friendly tables
7) Improvements to the setup wizard.
8) Whitelist/blacklist senders and domains

Release 1.0 Beta Candidate 4 (0.93)
1) Support saving and loading database
2) Setup wizard framework now can invoke dialogs
3) Detect (and work with) more versions of the Outlook/Office com interface
4) Save database right after "Learn from scratch" training
5) Cleaner preflight results display

Release 1.0 Beta Candidate 3 (0.92)
1) Added preflight check for platform and Outlook compatibility
2) Added framework for setup wizard.
3) Build with release Python 2.2.2
4) Minor bug patch in test suite code (not exposed)
5) Added TechSupportMail item
6) Added Help menu item

Release 1.0 Beta Candidate 2 (0.91)
1) Updated bitmaps and docs
2) Added RegistrationDialog and 14-day trial logic
3) Added readme.html
4) Added "About" Menu item
5) Simplified training options
6) Simplified quarantine options
7) Added pretrained adult, health, and money spam

Release 1.0 Beta Candidate 1 (0.90)
1) Naming and documentation changes.
2) Layout dialogs for improved workflow
3) Build script changes for Webreply environment

Snapshot at 4/23/2003

From noreply at sourceforge.net Tue May 13 12:16:37 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue May 13 14:16:52 2003
Subject: [Spambayes] [ spambayes-Bugs-736677 ] Outlook add-in fails to load in Outlook 2002 (XP)
Message-ID:

Bugs item #736677, was opened at 2003-05-12 23:14
Message generated for change (Comment added) made by ollieuk
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=736677&group_id=61702

Category: Outlook
Group: None
>Status: Closed
Resolution: None
Priority: 5
Submitted By: Oliver Carr (ollieuk)
Assigned to: Mark Hammond (mhammond)
Summary: Outlook add-in fails to load in Outlook 2002 (XP)

Initial Comment:
When trying to load the Outlook Add-in in Outlook 2002, this fails, pulling down all other installed COM Add-ins at the same time. The toolbar extensions (drop-down menu etc.) are not displayed.

Outlook reports the following error for the Binary add-in: "Not loaded. A runtime error occurred during the loading of the COM add-in."

This behaviour is reproducible for both the Binary and the Source Code versions. This behaviour is independent of the user logged on.

Platform: WinXP Pro SP1, Office XP SP2, spambayes 1.02a / Binary 002.
Here is the trace; the log file from the binary is attached:

Outlook Spam Addin module loading
SpamAddin - Connecting to Outlook
Created new configuration file 'E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\default_configuration.pck'
Traceback (most recent call last):
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\universal.py", line 170, in dispatch
    retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None)
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_
    return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider)
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_
    return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider)
  File "C:\PROGRA~1\Python22\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_
    return apply(func, args)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\addin.py", line 611, in OnConnection
    self.manager = manager.GetManager(application)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\manager.py", line 335, in GetManager
    _mgr = BayesManager(outlook=outlook, verbose=verbose)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\manager.py", line 79, in __init__
    import_core_spambayes_stuff(self.ini_filename)
  File "E:\Downloads\Web\spambayes-1.0a2\spambayes-1.0a2\Outlook2000\manager.py", line 46, in import_core_spambayes_stuff
    from spambayes import classifier
  File "C:\Program Files\Python22\Lib\site-packages\spambayes\classifier.py", line 40, in ?
    from spambayes.Options import options
  File "C:\Program Files\Python22\Lib\site-packages\spambayes\Options.py", line 557, in ?
    options.mergefilelike(d)
  File "C:\Program Files\Python22\Lib\site-packages\spambayes\Options.py", line 517, in mergefilelike
    self._update()
  File "C:\Program Files\Python22\Lib\site-packages\spambayes\Options.py", line 535, in _update
    value = getattr(c, fetcher)(section, option)
  File "C:\Program Files\Python22\lib\ConfigParser.py", line 306, in getfloat
    return self.__get(section, float, option)
  File "C:\Program Files\Python22\lib\ConfigParser.py", line 300, in __get
    return conv(self.get(section, option))
exceptions.ValueError: invalid literal for float(): 0.20

----------------------------------------------------------------------

>Comment By: Oliver Carr (ollieuk)
Date: 2003-05-13 20:16

Message:
Logged In: YES
user_id=774723

Just installed the current CVS version, and it works!

Thanx,
Oliver

----------------------------------------------------------------------

Comment By: Mark Hammond (mhammond)
Date: 2003-05-13 00:45

Message:
Logged In: YES
user_id=14198

Can you please try the latest version from CVS - this bug has been fixed (but is not yet in any of the binaries). The bigger issue we are dealing with can be tracked at
https://sourceforge.net/tracker/index.php?func=detail&aid=725466&group_id=61702&atid=498103

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=736677&group_id=61702

From DavidA at ActiveState.com Tue May 13 12:56:11 2003
From: DavidA at ActiveState.com (David Ascher)
Date: Tue May 13 14:56:52 2003
Subject: [Spambayes] Outlook plugin -- Blatant, vaguely commercial offer.
In-Reply-To:
References:
Message-ID: <3EC13FCB.1070306@ActiveState.com>

Sean True wrote:
> Well, sort of commercial.
>
> After some initial conversation with Mark Hammond, and considerable
> jawboning from my own internal customers,
> I've been working on a productized version of the Outlook plugin, code named
> SpamAtBay.
It has had some UI work, and documentation changes, and some > significant functional changes (see the CHANGES.txt) below. I AM STUNNED. I just got individually spammed by you (as probably everyone on spambayes did?), asking me to download software which: - is not clearly advertised as commercial or not - does not give specific credit to SpamBayes I'm absolutely stunned that you would do such a thing. While my bias would have been to say 'yeah!' to a commercialization of SpamBayes and the outlook plugin, I will most definitely stay away from your product, if you don't understand the basic definition of spam, I'm afraid I can't trust you. --david From ark3 at email.com Tue May 13 15:58:51 2003 From: ark3 at email.com (Abhay Saxena) Date: Tue May 13 14:59:03 2003 Subject: [Spambayes] Training via IMAP Filter Message-ID: Congratulations on all your great work so far. I'm in the "long time listener, first time caller" category, so to speak (Tim Peters: I'm the guy who shook your hands -- yes, both of them -- at the recent PyCon). I have a pretty basic email setup: UW IMAP running on FreeBSD 4.7 or so. I decided to follow David Abrahams' guide, since his setup is not that different from mine. I was able to set up hammiefilter.py using procmail very easily, but training using imapfilter.py has turned out to be a little more difficult. Let me ask some (hopefully) easy questions now. I can get into the details of the trouble I'm having later, if it turns out to be necessary. 1. Has anyone tested imapfilter.py with UW IMAP (http://www.washington.edu/imap/)? I've included a snippet of server output below. 2. Have there been any (important) changes to imaplib since 2.2.2? Some stuff that I expected to work didn't work; I'm not sure if that's due to a bug or due to changes in imaplib. I'll elaborate if it becomes an issue. 3. Why would imapfilter.py try to modify the contents of my IMAP folders if all I'm asking for is training? 
In particular, when I run training, it starts with my inbox and immediately chokes on an imap.append(...). I find that a duplicate of the first message has appeared in my inbox. To clarify: I'm not trying to figure out why it chokes; obviously I haven't provided you enough information. I simply would like to know why imapfilter.py is doing an imap.append(...) for what seems to me to be a read-only operation. Thanks for your help. - Abhay * OK [CAPABILITY IMAP4REV1 LOGIN-REFERRALS STARTTLS AUTH=LOGIN] localhost IMAP4rev1 2002.332 at Tue, 13 May 2003 14:43:48 -0400 (EDT) Python 2.2.2 (#1, Jan 15 2003, 03:26:24) [GCC 2.95.4 20020320 [FreeBSD]] on freebsd4 From seant at webreply.com Tue May 13 16:24:53 2003 From: seant at webreply.com (Sean True) Date: Tue May 13 15:29:41 2003 Subject: [Spambayes] Outlook plugin -- Blatant, vaguely commercial offer. In-Reply-To: <3EC13FCB.1070306@ActiveState.com> Message-ID: Actually, there were two email messages: one to the list, through the moderator, and pretty clear in intent and content; and a second to 135 people selected from my personal Outlook contact folder. Of the Pythonistas (and SB-ers), that included you (David), Mark Hammond, and Tim Peters. The second mail was much less technical and detailed -- it's intended for my brother in law as well as my head hunter and former boss. It's an open question whether my sending mail to that list is spam, but my personal opinion is that I have corresponded with everyone I sent it to, and I thought everyone of them might be interested. Clearly, I stepped on your toes. Anyone else offended? -- Sean > -----Original Message----- > From: David Ascher [mailto:DavidA@ActiveState.com] > Sent: Tuesday, May 13, 2003 2:56 PM > To: Sean True > Cc: spambayes@python.org > Subject: Re: [Spambayes] Outlook plugin -- Blatant, vaguely commercial > offer. > > > Sean True wrote: > > Well, sort of commercial. 
> > > > After some initial conversation with Mark Hammond, and considerable > > jawboning from my own internal customers, > > I've been working on a productized version of the Outlook > plugin, code named > > SpamAtBay. It has had some UI work, and documentation changes, and some > > significant functional changes (see the CHANGES.txt) below. > > I AM STUNNED. > > I just got individually spammed by you (as probably everyone on spambayes > did?), asking me to download software which: > > - is not clearly advertised as commercial or not > - does not give specific credit to SpamBayes > > I'm absolutely stunned that you would do such a thing. While my > bias would > have been to say 'yeah!' to a commercialization of SpamBayes and > the outlook > plugin, I will most definitely stay away from your product, if you don't > understand the basic definition of spam, I'm afraid I can't trust you. > > --david > > From T.A.Meyer at massey.ac.nz Wed May 14 11:25:04 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 13 18:27:36 2003 Subject: [Spambayes] Outlook plugin -- Blatant, vaguely commercial offer. Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13019790F2@its-xchg4.massey.ac.nz> > I just got individually spammed by you (as probably everyone > on spambayes did?), asking me to download software which: > > - is not clearly advertised as commercial or not > - does not give specific credit to SpamBayes Did you read the message? It does both of these. > if you don't > understand the basic definition of spam, I'm afraid I can't trust you. Well, there isn't really a basic definition of spam. That's the whole point of a filter that has a personal database. But, for discussion, let's use "unsolicited commercial email" as a definition. This *was* soliticited. I signed up for this list to find out and discuss the spambayes system. A product that uses it is of interest to me, whether I will use it or not. It was *maybe* commercial. 
The email didn't try to get us to buy anything - in fact, it offered the software for free[1]. It wasn't trying to sell the product - it said that it only does as well as the stuff we're all using for free. It pointed out what the differences were, and who might be interested [2]. It was email, so it gets about 1.5 out of 3 in the UCE definition. Hardly spam.

As a more trivial definition, for me the message scored 1.78191e-013. Even my false negatives aren't that low.

=Tony Meyer

[1] Conditions applied ;) Go read it.
[2] Tim's sister ;)

From T.A.Meyer at massey.ac.nz Wed May 14 11:39:58 2003
From: T.A.Meyer at massey.ac.nz (Meyer, Tony)
Date: Tue May 13 18:40:53 2003
Subject: [Spambayes] Training via IMAP Filter
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130197910C@its-xchg4.massey.ac.nz>

> 1. Has anyone tested imapfilter.py with UW IMAP
> (http://www.washington.edu/imap/)?

Not AFAIK. As you might have noticed, the IMAP filter is pretty new still. It's only been tested with 5 servers that I know of (the archives have the names, which I can't recall offhand). That one doesn't ring a bell.

On the other hand, the last major change that I made was aimed at making the filter *much* more in line with the RFC (yes, I should have done this in the first place). That did end up fixing most of the problems - people implementing servers make some *strange* choices ;) So if it follows the RFC, it shouldn't have any problems.

> 2. Have there been any (important) changes to imaplib since
> 2.2.2?

Yes, but none that should affect the filter, unless you are trying to use SSL [1]. Compared to 2.2.2, the imaplib in the latest cvs has:
* IMAP over SSL
* Fixed the crlf bug (see the archives of this list). We also fix it, so no worries there.
* Lots of little changes that are semantic only, really.

> Some stuff that I expected to work didn't work; I'm not sure if
> that's due to a bug or due to changes in imaplib. I'll elaborate
> if it becomes an issue.
It's unlikely to be due to changes in imaplib. Please elaborate! > 3. Why would imapfilter.py try to modify the contents of my IMAP > folders if all I'm asking for is training? In particular, when I run > training, it starts with my inbox and immediately chokes on an > imap.append(...). I find that a duplicate of the first message has > appeared in my inbox. Because it adds a header to your mail so that it can keep track of it. IMAP doesn't have any method of tracking mail persistently. There is a unique ID, but this can change. To get around this, we add our own unique id in a "X-Spambayes-MailId" header. It will only do this the first time it sees a message, so next time you train, only new messages will be altered. If it's not working, then please submit a bug with details - I'll try and get to fixing it ASAP (although this is a busy RL week). With the number of ways that IMAP is implemented, the only way to do full testing is to have people with different servers report any problems that they have. =Tony Meyer [1] As the check-in message said (and the comments do, too), SSL is completely untested, because it is only available in Python post-2.2.2, and I don't have that (installed) at the moment. From T.A.Meyer at massey.ac.nz Wed May 14 11:55:34 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 13 18:56:14 2003 Subject: [Spambayes] Outlook plugin -- Blatant, vaguely commercial offer. Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130197912B@its-xchg4.massey.ac.nz> > code named SpamAtBay. Cute name :) > Anyone who is a contributor to SpamBayes gets a license to > the shipping product, for their own use, for the asking. Thanks for this, although personally I doubt I will take this up since it doesn't seem to offer me anything I don't already have and would get in the way of testing. > It's our hope, however, that we > can get over the "Can Tim's sister use it?" barrier. 
I would have thought that the Outlook plugin was the one spambayes app that was already over that barrier, although I suppose the setup could be a bit simpler. Now if you had an Outlook *Express* plugin, that would be something . > The embarrassingly small > number of bug fixes we've sent him indicate how good his part > of this is. Have you looked at the number still open on SF? Seriously, most of these are pretty minor, but it would be interesting to know if you've resolved the locale problem and the messages-jump-back-to-unread problem. I suspect not, since you probably would have given these back to Mark. Queries re: the changes - if answering any of these would give too much away, just say so, although I suspect I could figure out the answers just by running it. > 5) Simplified training options Simplified in what way? Just in terms of picking folders, or something like training on all incoming mail? > 6) Simplified quarantine options Quarantine being a Tim's-sister word for filtering into a spam folder? I'm curious about the details of this, too. Is it something as simple as a delayed delete? > 7) Added pretrained adult, health, and money spam Ah, the eternal presupplied database debate ;) Are these in one supplied database, six, or three that you can somehow combine? Is using it optional, or just the way it comes? =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 14 11:58:30 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 13 18:59:08 2003 Subject: [Spambayes] Suddenly a lot of errors Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130197912E@its-xchg4.massey.ac.nz> > I installed the 1.0a2 from sourceforge yesterday and I use > the pop3proxy so my Eudora now connects to localhost instead. > Yesterday everything was working fine and no errors reported. > Great work guys! Today I get this errors: [...] > Any ideas what this might mean or what I have done to deserve this? It's difficult to tell. It looks like it might be a problem with the database. 
Is your hammie.db file still in the right place? You might try deleting[1] this file and seeing if that fixes it (if it doesn't, just undelete it). =Tony Meyer [1] By "delete", I mean put it somewhere else so that you can get it back. The recycle bin is the obvious candidate. I don't mean delete it permanently. From starpilot at sol3.net Tue May 13 18:11:37 2003 From: starpilot at sol3.net (StarPilot) Date: Tue May 13 19:07:00 2003 Subject: [Spambayes] Newbie here!!! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130197912B@its-xchg4.massey.ac.nz> Message-ID: <011801c319a5$01db66c0$c802a8c0@sol3.int> I am a long term developer (started in 1976) but have never been on SourceForge or used Python. I am a MS developer (Classic and .NET) as well as an applications integrator. So, the techie stuff does not bother me. I run Outlook 2002 with 9 POP3 accounts. I do not run 9 profiles with 1 POP3 account each - I have *9* active POP3 accounts being tagged every time Outlook refreshes. My wife and kids also have their own computers with anywhere from 1 account (youngest) to 4 accounts (wife). How do I set all of this up to work with mine and my family's systems? Keith Barrows www.ASPAdvice.com From tim.one at comcast.net Tue May 13 20:08:39 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue May 13 19:10:19 2003 Subject: [Spambayes] Outlook plugin -- Blatant, vaguely commercial offer. In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13019790F2@its-xchg4.massey.ac.nz> Message-ID: [David Ascher] >> I just got individually spammed by you (as probably everyone >> on spambayes did?), asking me to download software which: >> >> - is not clearly advertised as commercial or not >> - does not give specific credit to SpamBayes [Tony Meyer] > Did you read the message? It does both of these. As Sean said later, some of us got two msgs. The second wasn't sent to the spambayes list, and, for example, doesn't mention spambayes. 
(Which didn't happen to bother me, but David was factually correct there; I believe that most people on the spambayes list didn't get the second msg.)

> ...
> As a more trivial definition, for me the message scored 1.78191e-013.
> Even my false negatives aren't that low.

The msg to the spambayes list scored 1.54654e-013 for me. The second msg scored 0.194474, hugging my unsure threshold of 0.20. A msg from Sean is a strong ham indicator in my database, else it would have scored higher. But even without the "Sean sent it" clues, it wouldn't have gotten above my spam threshold, so spambayes believes the status of that msg is debatable. There's just no fighting this Scientifically Objective result.

> ...
> [2] Tim's sister ;)

She agrees with all of us, of course.

From T.A.Meyer at massey.ac.nz Wed May 14 12:11:19 2003
From: T.A.Meyer at massey.ac.nz (Meyer, Tony)
Date: Tue May 13 19:12:01 2003
Subject: [Spambayes] Newbie here!!!
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301979151@its-xchg4.massey.ac.nz>

> How do I set all of this up to work with mine and my family's systems?

The easiest thing to do is to download the Outlook plugin from Mark's webpage (there's a link on spambayes.org) and run the installer. Then you just need to create a folder for unsure messages, and one for spam (if you don't have one already). Choose "Anti-Spam Manager" from the "Anti-Spam" toolbar item, and enter the relevant data. (The readme has more).

If your family are also using Outlook, then it's the same process. (It doesn't matter how many accounts are set up in Outlook. I have one Exchange and nine POP/SMTP accounts myself).

If they're not using Outlook, then it gets trickier. Let us know what mail program it is and we can give pointers to installing spambayes (the POP3 proxy, most likely).

=Tony Meyer

From mhammond at skippinet.com.au Wed May 14 10:13:51 2003
From: mhammond at skippinet.com.au (Mark Hammond)
Date: Tue May 13 19:14:39 2003
Subject: [Spambayes] Newbie here!!!
In-Reply-To: <011801c319a5$01db66c0$c802a8c0@sol3.int> Message-ID: <086601c319a5$51ba2dc0$530f8490@eden> > I run Outlook 2002 with 9 POP3 accounts. I do not run 9 > profiles with 1 > POP3 account each - I have *9* active POP3 accounts being tagged every > time Outlook refreshes. > > My wife and kids also have their own computers with anywhere from 1 > account (youngest) to 4 accounts (wife). > > How do I set all of this up to work with mine and my family's systems? You simply use the Outlook extension. Each pop account will simply drop the items in the inbox, where SpamBayes will process them. There should be no problem at all. Ditto for the family, assuming they too use outlook. Mark. From T.A.Meyer at massey.ac.nz Wed May 14 12:14:16 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 13 19:14:49 2003 Subject: [Spambayes] Outlook plugin -- Blatant, vaguely commercial offer. Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301979156@its-xchg4.massey.ac.nz> > As Sean said later, some of us got two msgs. The second > wasn't sent to the spambayes list, and, for example, doesn't > mention spambayes. (Which didn't happen to bother me, but > David was factually correct there; I believe that most people > on the spambayes list didn't get the second msg.) This makes more sense. The confusing thing was that the reply to the personal message was sent to the list, then. > There's just no fighting this Scientifically > Objective result . Obviously, the correct definition of spam is "messages that score above the spam threshold in spambayes". Those using other systems will just have to install spambayes to figure out if it's spam or not ;) =Tony Meyer From starpilot at sol3.net Tue May 13 18:28:24 2003 From: starpilot at sol3.net (StarPilot) Date: Tue May 13 19:23:46 2003 Subject: [Spambayes] Newbie here!!! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301979151@its-xchg4.massey.ac.nz> Message-ID: <012401c319a7$59ed42a0$c802a8c0@sol3.int> OK. Sounds simple enough. 
Next question - where is the right URL for the right downloads on sourceforge? (I did not bookmark it and am swimming in links)...

We are all using Outlook 2000 or 2002.

Keith

-----Original Message-----
From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz]
Sent: Tuesday, May 13, 2003 5:11 PM
To: starpilot@sol3.net; spambayes@python.org
Subject: RE: [Spambayes] Newbie here!!!

> How do I set all of this up to work with mine and my family's systems?

The easiest thing to do is to download the Outlook plugin from Mark's webpage (there's a link on spambayes.org) and run the installer. Then you just need to create a folder for unsure messages, and one for spam (if you don't have one already). Choose "Anti-Spam Manager" from the "Anti-Spam" toolbar item, and enter the relevant data. (The readme has more).

If your family are also using Outlook, then it's the same process. (It doesn't matter how many accounts are set up in Outlook. I have one Exchange and nine POP/SMTP accounts myself).

If they're not using Outlook, then it gets trickier. Let us know what mail program it is and we can give pointers to installing spambayes (the POP3 proxy, most likely).

=Tony Meyer

From David.Bear at asu.edu Tue May 13 17:22:56 2003
From: David.Bear at asu.edu (David Bear)
Date: Tue May 13 19:24:52 2003
Subject: [Spambayes] using outlook addin
In-Reply-To: <"from mhammond"@skippinet.com.au>
References: <011801c319a5$01db66c0$c802a8c0@sol3.int> <086601c319a5$51ba2dc0$530f8490@eden>
Message-ID: <20030513162256.A1682@asu.edu>

I installed spambayes on win2k from scratch: first installing python, then win32all, then the spambayes source. Then I ran addin.py and noticed that it only seems to add spambayes to the 'current' outlook profile, ie I logged out, then back in as another user, and outlook did not have the magic spam buttons. So I ran addin.py again as that user. This added the buttons and all SEEMS well. I was just wondering if this is standard behavior..
ie does addin.py only affect the currently logged on outlook profile? -- David Bear phone: 480-965-8257 fax: 480-965-9189 College of Public Programs/ASU Wilson Hall 232 Tempe, AZ 85287-0803 "Beware the IP portfolio, it will make criminals out of innovators" From mhammond at skippinet.com.au Wed May 14 10:27:20 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue May 13 19:28:00 2003 Subject: [Spambayes] Newbie here!!! In-Reply-To: <012401c319a7$59ed42a0$c802a8c0@sol3.int> Message-ID: <086701c319a7$33d5fa80$530f8490@eden> http://spambayes.org should set you right! > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of StarPilot > Sent: Wednesday, 14 May 2003 9:28 AM > To: spambayes@python.org > Subject: RE: [Spambayes] Newbie here!!! > > > OK. Sounds simple enough. Next question - where is > the right > URL for the right downloads on sourceforge? (I did not > bookmark it and > am swimming in links)... > > We are all using Outlook 2000 or 2002. > > Keith > > -----Original Message----- > From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] > Sent: Tuesday, May 13, 2003 5:11 PM > To: starpilot@sol3.net; spambayes@python.org > Subject: RE: [Spambayes] Newbie here!!! > > > > How do I set all of this up to work with mine and my > family's systems? > > The easiest thing to do is to download the Outlook plugin from Mark's > webpage (there's a link on spambayes.org) and run the installer. Then > you just need to create a folder for unsure messages, and one for spam > (if you don't have one already). Choose "Anti-Spam Manager" from the > "Anti-Spam" toolbar item, and enter in the relevant data. (The readme > has more). > > If your family are also using Outlook, then it's the same > process. (It > doesn't matter how many accounts are setup in Outlook. I have one > Exchange and nine POP/SMTP accounts myself). > > If they're not using Outlook, then it gets trickier. 
Let us know what > mail program it is and we can give pointers to installing > spambayes (the > POP3 proxy, most likely. > > =Tony Meyer > > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > From T.A.Meyer at massey.ac.nz Wed May 14 12:28:17 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 13 19:28:59 2003 Subject: [Spambayes] Newbie here!!! Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130197916F@its-xchg4.massey.ac.nz> > OK. Sounds simple enough. Next question - where is > the right URL for the right downloads on sourceforge? (I did > not bookmark it and am swimming in links)... The one you need to remember is http://spambayes.org. The actual download page for the Outlook plugin is here: http://starship.python.net/crew/mhammond/spambayes/ (there is also a link here: http://spambayes.sourceforge.net/download.html ) =Tony Meyer From mhammond at skippinet.com.au Wed May 14 10:30:04 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue May 13 19:30:52 2003 Subject: [Spambayes] using outlook addin In-Reply-To: <20030513162256.A1682@asu.edu> Message-ID: <086801c319a7$95a4bc60$530f8490@eden> > I installed spambayes on win2k from scratch.. first installing python, > then win32all, then spambayes source.. Then I ran the addin.py and > noticed that it only seem to add spambayes to the 'current' outlook > profile., ie I logged out, then back in as another user, and outlook > did not have the magic spam buttons. So I ran addin.py again as that > use. This added the buttons and all SEEMS well. I was just wondering > if this is standard behavior.. ie does addin.py only affect the > currently logged on outlook profile? Outlook addins are registered under "HKEY_CURRENT_USER" in the registry. Thus, my understanding is that installing it will work for all profiles for the current user, but not profiles for different users. 
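A rough way to check where an add-in is registered, as a sketch only. It assumes the usual Outlook COM add-in key (`Software\Microsoft\Office\Outlook\Addins\<ProgID>`) and a ProgID of `SpamBayes.OutlookAddin`; neither is stated in this thread, so check addin.py for the real ProgID. It is written for a current Python 3 (`winreg`; the Python 2.2 discussed here would use `_winreg`) and only reads the registry:

```python
import sys

# Registry subkey under which Outlook looks for COM add-ins (assumed layout).
ADDINS_SUBKEY = r"Software\Microsoft\Office\Outlook\Addins"

# Assumed ProgID for the SpamBayes add-in; hypothetical until checked in addin.py.
ASSUMED_PROGID = "SpamBayes.OutlookAddin"


def addin_key_path(prog_id=ASSUMED_PROGID):
    """Return the registry subkey path for one add-in."""
    return ADDINS_SUBKEY + "\\" + prog_id


def registered_hives(prog_id=ASSUMED_PROGID):
    """Return the names of the hives in which the add-in key exists.

    Only meaningful on Windows; elsewhere an empty list is returned.
    """
    if not sys.platform.startswith("win"):
        return []
    import winreg  # Windows-only standard library module

    hives = {
        "HKEY_CURRENT_USER": winreg.HKEY_CURRENT_USER,
        "HKEY_LOCAL_MACHINE": winreg.HKEY_LOCAL_MACHINE,
    }
    found = []
    for name, hive in hives.items():
        try:
            # Opening the key read-only is enough to prove it exists.
            with winreg.OpenKey(hive, addin_key_path(prog_id)):
                found.append(name)
        except OSError:
            pass  # key missing (or not readable) in this hive
    return found


if __name__ == "__main__":
    print(addin_key_path())
    print(registered_hives())
```

If the key shows up only under HKEY_CURRENT_USER, that matches the per-user behaviour described here. Whether a given Outlook version also loads add-ins listed under HKEY_LOCAL_MACHINE is not something this thread confirms.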
If you can find documentation on how a plugin can install itself for all users, I would be happy to make the hacks. Mark. From starpilot at sol3.net Tue May 13 18:38:51 2003 From: starpilot at sol3.net (StarPilot) Date: Tue May 13 19:34:14 2003 Subject: [Spambayes] Newbie here!!! In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130197916F@its-xchg4.massey.ac.nz> Message-ID: <012c01c319a8$cfc9b9d0$c802a8c0@sol3.int> OK. Got that download and I noticed another one that he said was required (?) - installer_mh.zip... Do I need Python? Where do I get it? (What is it?) Will it screw up my dev box (Win2k Server, .NET 2003, VS6, etc)? What order, and exactly what all, is needed? The reason I am asking is the docs seem to hint at needing "other stuff" but assumes that everyone reading it is up-to-date on all of the infrastructure of setting this app up... Keith From T.A.Meyer at massey.ac.nz Wed May 14 12:37:44 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 13 19:38:27 2003 Subject: [Spambayes] Newbie here!!! Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130197917C@its-xchg4.massey.ac.nz> > OK. Got that download and I noticed another one that he said > was required (?) - installer_mh.zip... It's not required. It's just something else he offers. > Do I need Python? Not for the Outlook binary plugin that you downloaded. For any other spambayes app, yes. Outlook is lucky like that. [1] > Where do I get it? http://python.org. > (What is it?) A programming language. > Will it screw up my dev box (Win2k Server, .NET 2003, VS6, > etc)? No. > What order, and exactly what all, is needed? Just the installer that you downloaded. > The reason I am asking is the docs seem to hint at needing > "other stuff" but assumes that everyone reading it is > up-to-date on all of the infrastructure of setting this app up... 
Going over the docs is on the to-do list ;) It's probably additionally confusing since everything *else* does need you to install Python, and the docs were written when there wasn't an Outlook binary.

=Tony Meyer

[1] Well, not because of luck, but because Mark put the effort into making the installer...

From mhammond at skippinet.com.au Wed May 14 10:38:17 2003
From: mhammond at skippinet.com.au (Mark Hammond)
Date: Tue May 13 19:50:40 2003
Subject: [Spambayes] Newbie here!!!
In-Reply-To: <012c01c319a8$cfc9b9d0$c802a8c0@sol3.int>
Message-ID: <087001c319a8$bb7764f0$530f8490@eden>

> OK. Got that download and I noticed another one that he said was
> required (?) - installer_mh.zip...

Sorry - I should make the page clearer. You only need installer_mh.zip if you want to build your own binary release.

> Do I need Python?

Not for the binary, no.

> Where do I get it?
> (What is it?)

The programming language used by this project - but the binary install has everything you need.

> Will it screw up my dev box (Win2k Server, .NET 2003, VS6, etc)?

More than likely

> What order, and exactly what all, is needed?
>
> The reason I am asking is the docs seem to hint at needing
> "other stuff" but assumes that everyone reading it is up-to-date
> on all of the infrastructure of setting this app up...

Just run the installer .EXE - that is it.

Mark.

From T.A.Meyer at massey.ac.nz Wed May 14 13:18:57 2003
From: T.A.Meyer at massey.ac.nz (Meyer, Tony)
Date: Tue May 13 20:19:37 2003
Subject: [Spambayes] Web Interface enhancements
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13019791B8@its-xchg4.massey.ac.nz>

> First in "Review messages" can the "To:" account be
> displayed. I'm using 6 accounts and it would help me to know
> which account an email is destined to.

I'll add this as an option. I can see that it would be useful, but the display is already pretty wide. With To: as well, it'll fit in 1024x768 (maximised), but anything smaller and there will be a lot of scrolling left and right.
[...] Done (in cvs). The option is "html_ui":"display_to", which you can set to True in your config file (either manually, or via the config page). > And again in "Review messages" a return to "Review messages" > option would be nice in the "Trained" page. At present you > have to "Return Home" and then click "Review messages" again. > Even better would be to return to "Review messages" after > pressing "Train" and include a "Refresh" option, then I could > just happily sit in "Review messages". At the moment, it does return to the review page, but with the "trained" bit at the top. I can see that a "refresh" button would be useful (since using the browser's refresh would try to retrain). I'll add this. [...] Done (in cvs). The refresh button is normally between the "previous day" and "today" buttons (which might be disabled). If you train everything and get to the "no untrained messages" page, there is a link "check again". I worry a little that this might make people think that it checks for mail on the server, which it doesn't (the mail program needs to do that). Thoughts? =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 14 13:21:04 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 13 20:21:39 2003 Subject: [Spambayes] Web Interface enhancements Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13019791BA@its-xchg4.massey.ac.nz> > Done (in cvs). The refresh button is normally between the "previous > day" and "today" buttons (which might be disabled). ^^^^^ I meant "next day", of course... =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 14 13:27:59 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 13 20:31:12 2003 Subject: [Spambayes] Does anyone care about this report? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13019791C2@its-xchg4.massey.ac.nz> > I spent considerable time preparing this several weeks ago, > with the promise that it would be made available somewhere > that would be of use to people... but got no reply. 
TimS was going to add it, but has been busy with RL stuff over the last couple of weeks. I hadn't bothered because I knew he was going to get to it eventually. If Mark doesn't get to it, I'll add it once you've answered the "include" or "include link" question that he asked.

> I'd also appreciate it if someone could address the question at the end.

I read it all, including the question, and even started to reply, and then decided that I didn't really know enough about what I was talking about to answer it, so left it for someone else.

I'm pretty sure that Alex is right and Spambayes does no locking. My answer would be the same as his: I don't think it should matter, but I don't know enough about the db stuff to be certain. I think Mark knows something about it, and TimS is figuring it out at the moment, but I don't know who the expert is. I'm especially unclear because of the way you're mixing different components to get the desired effect.

Sorry I didn't answer, it's just that I really don't think I'm contributing any wisdom to this.

=Tony Meyer

From Greg at springcreek.cc Tue May 13 19:41:43 2003
From: Greg at springcreek.cc (Greg Lozier)
Date: Tue May 13 21:27:27 2003
Subject: [Spambayes] Installation Error
Message-ID:

I installed spambayes today, and while installing I received an error message saying that there is no disk in the drive: "Please insert a disk into drive \device\harddisk1\dr3". When I cancel the error message, it finishes loading. It is operational in Outlook 2000 now, but when I start Outlook I get the same error message. After hitting cancel a few times it will go away and Outlook will load.

What is causing this error?
Thanks, Greg From tim.one at comcast.net Tue May 13 23:43:51 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue May 13 22:45:31 2003 Subject: [Spambayes] Installation Error In-Reply-To: Message-ID: [Greg Lozier] > I installed spambayes today and while installing I received an > error message that there is no disk in the drive. Please insert a > disk into drive \device\harddisk1\dr3. I cancel the error message it > finish loading. It is operational in outlook 2000 now, but when I > start outlook I get the above error message. After hitting cancel > a few times it will go away and outlook will load. > > What is causing this error? Sorry, beats me -- never heard of anything like this, and it's hard to imagine how it's *possible*. Which version of Windows was this? Do you, by any chance, run a Windows version of XEmacs? (I ask this because the top hit on a google search for "harddisk1" is a thread about problems with No disk in device\harddisk1\dr1 error msgs popping up after installing an old version of XEmacs. They solved the problem by assigning their Zip drive to a different letter. Google turns up several other hits on installation programs triggering this bizarre msg too; I didn't find a satisfying explanation.) From starpilot at sol3.net Tue May 13 22:16:53 2003 From: starpilot at sol3.net (StarPilot) Date: Tue May 13 23:12:19 2003 Subject: [Spambayes] Newbie here!!! In-Reply-To: <087001c319a8$bb7764f0$530f8490@eden> Message-ID: <001601c319c7$45389bf0$c802a8c0@sol3.int> Got it downloaded, installed on my machine and my wife's machine, actually had a few spam laying around so I got it trained a little and now it seems to be working. The test will come over the next few days as I normally get between 35 and 100 spams/day... Keith -----Original Message----- From: Mark Hammond [mailto:mhammond@skippinet.com.au] Sent: Tuesday, May 13, 2003 5:38 PM To: starpilot@sol3.net; spambayes@python.org Subject: RE: [Spambayes] Newbie here!!! > OK. 
Got that download and I noticed another one that he said was > required (?) - installer_mh.zip... Sorry - I should make the page more clear. You only need installer_mh.zip if you want to build your own binary release. > Do I need Python? Not for the binary, no. > Where do I get it? > (What is it?) The programming language used by this project - but the binary install has everything you need. > Will it screw up my dev box (Win2k Server, .NET 2003, VS6, etc)? More than likely > What order, and exactly what all, is needed? > > The reason I am asking is the docs seem to hint at needing "other > stuff" but assumes that everyone reading it is up-to-date on all of > the infrastructure of setting this app up... Just run the installer .EXE - that is it. Mark. From noreply at sourceforge.net Wed May 14 04:44:50 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 06:44:56 2003 Subject: [Spambayes] [ spambayes-Bugs-737614 ] Messages go back to "unread" status Message-ID: Bugs item #737614, was opened at 2003-05-14 11:44 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737614&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul Moore (pmoore) Assigned to: Mark Hammond (mhammond) Summary: Messages go back to "unread" status Initial Comment: Occasionally, when using the Outlook plugin, messages revert to "unread" status while they are being read. The behaviour isn't particularly reproducible, which makes it hard to test :-( I'm using Exchange as my mail backend, Outlook 2000, and a version of CVS from somewhen (is there a version number I can check? - there are files dated 9th April...) This has been around for a while, and discussed on the mailing list, but I couldn't find a bug for it - if there is one, sorry, feel free to close this one! 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737614&group_id=61702 From noreply at sourceforge.net Wed May 14 04:52:12 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 06:52:20 2003 Subject: [Spambayes] [ spambayes-Bugs-706170 ] Execute test suite fails in Outlook Message-ID: Bugs item #706170, was opened at 2003-03-19 21:03 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706170&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Fredrik Rodland (fmmr) Assigned to: Mark Hammond (mhammond) Summary: Execute test suite fails in Outlook Initial Comment: The test suite fails in outlook. I've retrained messages from a spam and a ham folder. I think this may be related to moving the database-files from the spambayes to the default docs-folders in windows a couple of weeks ago. the following traceback is shown in PythonWin: Executing automated tests... 
Traceback (most recent call last):
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\addin.py", line 308, in Tester
    tester.test(manager)
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 306, in test
    TestSpamFilter(driver)
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 173, in TestSpamFilter
    msg, words = driver.CreateTestMessageInFolder(SPAM, driver.folder_watch)
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 132, in CreateTestMessageInFolder
    msg, words = self.CreateTestMessage(spam_status)
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 145, in CreateTestMessage
    words.update(FindTopWords(self.manager.bayes, 50, True))
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 64, in FindTopWords
    for word, info in extractor(bayes):
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 46, in DBExtractor
    key = bayes.dbm.next()[0]
  File "C:\PROGRA~1\_DEV\Python22\Lib\site-packages\bsddb3\__init__.py", line 122, in next
    rv = self.dbc.next()
DBNotFoundError: (-30991, 'DB_NOTFOUND: No matching key/data pair found')
Tests FAILED. Sorry about that. If I were you, I would do a full re-train ASAP
Please delete any test messages from your Spam, Unsure or Inbox folders first.

----------------------------------------------------------------------

>Comment By: Mark Hammond (mhammond)
Date: 2003-05-14 20:52

Message:
Logged In: YES
user_id=14198

I'm pretty sure this was fixed in tester.py:

revision 1.7
date: 2003/05/03 13:15:27; author: mhammond; state: Exp; lines: +9 -1
Allow test suite to work with bsddb3 or bsddb

Please re-open otherwise.

----------------------------------------------------------------------

Comment By: Mark Hammond (mhammond)
Date: 2003-03-20 11:31

Message:
Logged In: YES
user_id=14198

This seems a bsddb3 problem.
The code in question:

    try:
        key = bayes.dbm.next()[0]
    except bsddb.error:

already attempts to catch this error. Further, the docs for
DBNotFoundError state that it derives from bsddb.error,
meaning my except statement should work.

I will try and get to using my Python 2.2 version for the
plugin to fix this.

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706170&group_id=61702

From noreply at sourceforge.net Wed May 14 04:55:05 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed May 14 06:55:09 2003
Subject: [Spambayes] [ spambayes-Bugs-737619 ] Message spam score is displayed (and filtered) wrongly
Message-ID:

Bugs item #737619, was opened at 2003-05-14 11:55
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737619&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Paul Moore (pmoore)
Assigned to: Mark Hammond (mhammond)
Summary: Message spam score is displayed (and filtered) wrongly

Initial Comment:
Exchange, Outlook 2000, plugin from CVS around 9th April.

Occasionally (it seems to be mostly when the plugin is
doing a "catchup" when I start up in the morning, but I
don't think it's *always* then) a spam is left in my inbox
with a score of 0% displayed. Clicking the "show spam
clues" button shows a correct score of 99.xxx%.

I *think* that the message isn't showing in the log of
messages being filtered (so why does it get a score at
all?) but it's hard to be sure I didn't miss the note about a
message with subject "??????????????????????" in a list
of 200+ such messages :-)

This has been discussed on the list in the past, but I could
find no bug specifically about it. If I missed one, I
apologise - feel free to close this in that case.
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737619&group_id=61702 From noreply at sourceforge.net Wed May 14 05:34:07 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 07:34:14 2003 Subject: [Spambayes] [ spambayes-Bugs-706170 ] Execute test suite fails in Outlook Message-ID: Bugs item #706170, was opened at 2003-03-19 11:03 Message generated for change (Comment added) made by fmmr You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706170&group_id=61702 Category: Outlook Group: None >Status: Open Resolution: Fixed Priority: 5 Submitted By: Fredrik Rodland (fmmr) Assigned to: Mark Hammond (mhammond) Summary: Execute test suite fails in Outlook Initial Comment: The test suite fails in outlook. I've retrained messages from a spam and a ham folder. I think this may be related to moving the database-files from the spambayes to the default docs-folders in windows a couple of weeks ago. the following traceback is shown in PythonWin: Executing automated tests... 
Traceback (most recent call last): File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\addin.py", line 308, in Tester tester.test(manager) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 306, in test TestSpamFilter(driver) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 173, in TestSpamFilter msg, words = driver.CreateTestMessageInFolder (SPAM, driver.folder_watch) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 132, in CreateTestMessageInFolder msg, words = self.CreateTestMessage(spam_status) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 145, in CreateTestMessage words.update(FindTopWords(self.manager.bayes, 50, True)) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 64, in FindTopWords for word, info in extractor(bayes): File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 46, in DBExtractor key = bayes.dbm.next()[0] File "C:\PROGRA~1\_DEV\Python22\Lib\site- packages\bsddb3\__init__.py", line 122, in next rv = self.dbc.next() DBNotFoundError: (-30991, 'DB_NOTFOUND: No matching key/data pair found') Tests FAILED. Sorry about that. If I were you, I would do a full re-train ASAP Please delete any test messages from your Spam, Unsure or Inbox folders first. ---------------------------------------------------------------------- >Comment By: Fredrik Rodland (fmmr) Date: 2003-05-14 13:34 Message: Logged In: YES user_id=724871 I tried this again today with the latest CVS (may 14th). the error is not the same, but it still does not work. got the following traceback: Executing automated tests... 
Bayes database is not dirty - not writing
Traceback (most recent call last):
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\addin.py", line 314, in Tester
    tester.test(manager)
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 316, in test
    TestSpamFilter(driver)
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 182, in TestSpamFilter
    msg, words = driver.CreateTestMessageInFolder(SPAM, driver.folder_watch)
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 141, in CreateTestMessageInFolder
    msg, words = self.CreateTestMessage(spam_status)
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 154, in CreateTestMessage
    words.update(FindTopWords(self.manager.bayes, 50, True))
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 72, in FindTopWords
    for word, info in extractor(bayes):
  File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlook2000\tester.py", line 43, in DBExtractor
    bsddb_error = bsddb.DBNotFoundError
AttributeError: 'module' object has no attribute 'DBNotFoundError'
Tests FAILED. Sorry about that. If I were you, I would do a full re-train ASAP
Please delete any test messages from your Spam, Unsure or Inbox folders first.

----------------------------------------------------------------------

Comment By: Mark Hammond (mhammond)
Date: 2003-05-14 12:52

Message:
Logged In: YES
user_id=14198

I'm pretty sure this was fixed in tester.py:

revision 1.7
date: 2003/05/03 13:15:27; author: mhammond; state: Exp; lines: +9 -1
Allow test suite to work with bsddb3 or bsddb

Please re-open otherwise.

----------------------------------------------------------------------

Comment By: Mark Hammond (mhammond)
Date: 2003-03-20 01:31

Message:
Logged In: YES
user_id=14198

This seems a bsddb3 problem. The code in question:

    try:
        key = bayes.dbm.next()[0]
    except bsddb.error:

already attempts to catch this error.
Further, the docs for
DBNotFoundError state that it derives from bsddb.error,
meaning my except statement should work.

I will try and get to using my Python 2.2 version for the
plugin to fix this.

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706170&group_id=61702

From Paul.Moore at atosorigin.com Wed May 14 13:54:12 2003
From: Paul.Moore at atosorigin.com (Moore, Paul)
Date: Wed May 14 07:55:34 2003
Subject: [Spambayes] Outlook plugin -- Blatant, vaguely commercial offer.
Message-ID: <16E1010E4581B049ABC51D4975CEDB88619A71@UKDCX001.uk.int.atosorigin.com>

From: Sean True [mailto:seant@webreply.com]
> The embarrassingly small number of bug fixes we've sent him
> indicate how good his part of this is.
> Any remaining problems are likely to be introduced by me!

The "messages return to Unread status" and "spam score displayed
wrong" bugs are pretty user-visible, certainly on Exchange.

Embarrassingly, although they have been discussed on the list a few
times, I can't find SF bug reports for them - just added a couple
(737614 and 737619). Fixing them should probably be a high priority
for any "commercial" version of SpamBayes.

Which version of the plugin do you use? CVS, or one of Mark's
releases?

Paul.
From noreply at sourceforge.net Wed May 14 05:59:01 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed May 14 07:59:05 2003
Subject: [Spambayes] [ spambayes-Bugs-737642 ] Hang when trying to get message
Message-ID:

Bugs item #737642, was opened at 2003-05-14 11:59
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737642&group_id=61702

Category: pop3proxy
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Remi Ricard (papadoc)
Assigned to: Nobody/Anonymous (nobody)
Summary: Hang when trying to get message

Initial Comment:
Hi,

I'm using pop3proxy (from cvs 2003.05.13) and sometimes
I'm not able to retrieve my mail from one of my POP accounts.
I'm getting the following error:

error: uncaptured python exception, closing channel
<__main__.ServerLineReader connected at 0x138d160>
(email.Errors.HeaderParseError:Not a header, not a continuation: ``charset="iso-8859-1"''
 [C:\Devtools\Python\PYTHON~1.2\lib\asyncore.py|poll|99]
 [C:\Devtools\Python\PYTHON~1.2\lib\asyncore.py|handle_read_event|396]
 [C:\Devtools\Python\PYTHON~1.2\lib\asynchat.py|handle_read|130]
 [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|found_terminator|146]
 [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|onServerLine|214]
 [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|onResponse|288]
 [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|onTransaction|389]
 [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|onRetr|436]
 [C:\Devtools\SPAMBA~1\SPAMBA~1.13\spambayes\message.py|setPayload|168]
 [C:\Devtools\Python\PYTHON~1.2\lib\email\Parser.py|_parsebody|228]
 [C:\Devtools\Python\PYTHON~1.2\lib\email\Parser.py|parsestr|75]
 [C:\Devtools\Python\PYTHON~1.2\lib\email\Parser.py|parse|62]
 [C:\Devtools\Python\PYTHON~1.2\lib\email\Parser.py|_parseheaders|128])

I retrieved the mail directly, without using pop3proxy, and
tested each message with the classify option on the web
interface, and there was no problem.

Remi
papaDoc@videotron.ca

N.B. How can I find which message is causing the problem, and
how can I get my hands on it?

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737642&group_id=61702

From readonly at getsoft.com Wed May 14 23:03:48 2003
From: readonly at getsoft.com (Neville Franks)
Date: Wed May 14 08:05:54 2003
Subject: [Spambayes] Installation Error
In-Reply-To:
References:
Message-ID: <116864793196.20030514220348@getsoft.com>

TP> [Greg Lozier]
>> I installed spambayes today and while installing I received an
>> error message that there is no disk in the drive. Please insert a
>> disk into drive \device\harddisk1\dr3. I cancel the error message it
>> finish loading. It is operational in outlook 2000 now, but when I
>> start outlook I get the above error message. After hitting cancel
>> a few times it will go away and outlook will load.
>>
>> What is causing this error?

TP> Sorry, beats me -- never heard of anything like this, and it's hard to
TP> imagine how it's *possible*. Which version of Windows was this? Do you, by
TP> any chance, run a Windows version of XEmacs? (I ask this because the top
TP> hit on a google search for "harddisk1" is a thread about problems with

TP> No disk in device\harddisk1\dr1

TP> error msgs popping up after installing an old version of XEmacs. They
TP> solved the problem by assigning their Zip drive to a different letter.
TP> Google turns up several other hits on installation programs triggering this TP> bizarre msg too; I didn't find a satisfying explanation.) This error occurs when code is attempting to access a hard drive which physically doesn't exist in a machine and there is a removable drive like a zip drive at the drive location. eg. Code is trying to access a hard drive d: and there is no such hard drive, but there is a zip drive d:. Inserting a zip disk works around getting the error. That's about as much as I know. -- Best regards, Neville Franks, Author of ED for Windows - the programmers editor, an indispensable tool for great software development. http://www.getsoft.com Version 4.02 now available. New: Projects, Source Database, Difference Analysis.... From mail at peterbe.com Wed May 14 14:39:11 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Wed May 14 09:00:04 2003 Subject: [Spambayes] Suddenly a lot of errors In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130197912E@its-xchg4.massey. ac.nz> Message-ID: <5.2.0.9.0.20030514133323.0226ca00@mail.grenna.net> At 10:58 2003-05-14 +1200, Meyer, Tony wrote: > > I installed the 1.0a2 from sourceforge yesterday and I use > > the pop3proxy so my Eudora now connects to localhost instead. > > Yesterday everything was working fine and no errors reported. > > Great work guys! Today I get this errors: >[...] > > Any ideas what this might mean or what I have done to deserve this? > >It's difficult to tell. It looks like it might be a problem with the >database. Is your hammie.db file still in the right place? You might >try deleting[1] this file and seeing if that fixes it (if it doesn't, >just undelete it). I don't have a hammie.db file. I have 3 files called: hammie.db.dat, hammie.db.bak, hammie.db.dir in C:\Python22\ all 1Kb big. And the same 3 files in C:\Python22\spambayes\ all bigger than 1Kb. hammie.db.dat is 42Mb. 
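The three-file layout described above (.dat, .bak, .dir) is what Python's pure-Python "dumb" dbm fallback writes, which suggests, though the thread does not confirm it, that anydbm found no faster backend on this machine. A minimal sketch for checking which backend owns a database file is below. It assumes a modern Python 3, where the Python 2.2 modules whichdb and dumbdbm live on as dbm.whichdb and dbm.dumb; the helper name describe_db and the temporary path are made up for illustration and are not part of SpamBayes.

```python
import dbm
import dbm.dumb
import os
import tempfile


def describe_db(path):
    """Return the name of the dbm module that appears to own `path`.

    `path` is the base name passed to the dbm open call, e.g. "hammie.db",
    not one of the suffixed files such as "hammie.db.dat".
    dbm.whichdb returns None when the file is missing or unreadable and
    an empty string when the format cannot be guessed.
    """
    kind = dbm.whichdb(path)
    if kind is None:
        return "missing or unreadable"
    if kind == "":
        return "unrecognized format"
    return kind


if __name__ == "__main__":
    # Build a throwaway dumb database so the sketch is self-contained.
    workdir = tempfile.mkdtemp()
    path = os.path.join(workdir, "hammie.db")
    db = dbm.dumb.open(path, "c")
    db[b"word"] = b"1"
    db.close()
    print(sorted(os.listdir(workdir)))  # the suffixed files dbm.dumb created
    print(describe_db(path))
```

Pointing describe_db at each of the two hammie.db locations mentioned above would show whether both are the same kind of database, which may help decide which copy pop3proxy is actually using.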
And if I "delete" my hammie files and start again, isn't it quite possible that the errors will return again with time. Should I fetch the latest CVS? Here is the whole error that appears. It doesn't start to appear until after one or two successful sessions error: uncaptured python exception, closing channel <__main__.ServerLineReader connected at 0x15f9058> (exceptions.EOFError: [C:\Python22\lib\asyncore.py|poll|99] C:\Python22\lib\asyncore.py|handle_read_event|396] [C:\Python22\lib\asynchat.py|handle_read|130] [pop3proxy.py|found_terminator|199] [pop3proxy.py|onServerLine|267] pop3proxy.py|onResponse|341] [pop3proxy.py|onTransaction|437] [pop3proxy.py|onRetr|484] [spambayes\classifier.py|chi2_spamprob|217] [spambayes\classifier.py|_getclues|437] spambayes\storage.py|_wordinfoget|192] [C:\Python22\lib\shelve.py|get|66] [C:\Python22\lib\shelve.py|__getitem__|71]) From papaDoc at videotron.ca Wed May 14 10:08:54 2003 From: papaDoc at videotron.ca (papaDoc) Date: Wed May 14 09:10:24 2003 Subject: [Spambayes] Suddenly a lot of errors In-Reply-To: <5.2.0.9.0.20030514133323.0226ca00@mail.grenna.net> References: <5.2.0.9.0.20030514133323.0226ca00@mail.grenna.net> Message-ID: <3EC23FE6.9030604@videotron.ca> Hi, This look like my error (see spambayes-Bugs-737642). >> > I installed the 1.0a2 from sourceforge yesterday > Did you install from CVS ? >> > Any ideas what this might mean or what I have done to deserve this? >> >> It's difficult to tell. It looks like it might be a problem with the >> database. Is your hammie.db file still in the right place? You might >> try deleting[1] this file and seeing if that fixes it (if it doesn't, >> just undelete it). > I don't think it is a database problem since when I'm using an old version of spambayes (CVS from 2003.03.25) I don't get the error. Remi From skip at pobox.com Wed May 14 09:12:52 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 14 09:13:04 2003 Subject: [Spambayes] Does anyone care about this report? 
In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13019791C2@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13019791C2@its-xchg4.massey.ac.nz> Message-ID: <16066.16596.8818.203461@montanaro.dyndns.org> Tony> I'm pretty sure that Alex is right and Spambayes does no locking. I'm fairly sure you're correct. In any case, locking db files will probably vary based on the type of the file which underlies anydbm. To be safe, when unlocking a db file I think Spambayes would have to close it, then reopen it later when it next grabs the lock. I use hammiefilter from procmail and let procmail do the locking. It's done at a higher level, so there's only ever one hammiefilter process running. That might be an option for some people. Skip From mail at peterbe.com Wed May 14 15:36:49 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Wed May 14 09:50:05 2003 Subject: [Spambayes] Suddenly a lot of errors In-Reply-To: <3EC23FE6.9030604@videotron.ca> References: <5.2.0.9.0.20030514133323.0226ca00@mail.grenna.net> <5.2.0.9.0.20030514133323.0226ca00@mail.grenna.net> Message-ID: <5.2.0.9.0.20030514143503.0227a1f0@mail.grenna.net> At 09:08 2003-05-14 -0400, papaDoc wrote: >Hi, > >This look like my error (see spambayes-Bugs-737642). Yes, found it, but no solutions. >>> > I installed the 1.0a2 from sourceforge yesterday >Did you install from CVS ? No. 1.0a2 release >>> > Any ideas what this might mean or what I have done to deserve this? >>> >>>It's difficult to tell. It looks like it might be a problem with the >>>database. Is your hammie.db file still in the right place? You might >>>try deleting[1] this file and seeing if that fixes it (if it doesn't, >>>just undelete it). >I don't think it is a database problem since when I'm using an old version >of spambayes >(CVS from 2003.03.25) I don't get the error. I don't understand you here. Has your problem gone away from installing an OLD cvs version? If so, how do you get an old version of CVS? 
Or, do you have your version available? >Remi From ark3 at email.com Wed May 14 11:49:10 2003 From: ark3 at email.com (Abhay Saxena) Date: Wed May 14 10:49:18 2003 Subject: [Spambayes] Training via IMAP Filter In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130197910C@its-xchg4.massey.ac.nz> Message-ID: <38ED4A0D-861B-11D7-B809-00039345DE8C@email.com> On Tuesday, May 13, 2003, at 06:39 PM, Meyer, Tony wrote: >> 1. Has anyone tested imapfilter.py with UW IMAP [...] > > Not AFAIK. Okay. So at least it is likely that my Spambayes problems are not server weirdness due to my local IMAP installation (which is straight out of FreeBSD ports, so I didn't expect it to be very weird anyhow...). > On the other hand, the last major change that I made was aimed at > making > the filter *much* more in line with the RFC [...] It is likely that I did a cvs update since your last major change, so I probably have that major change. SF cvs has been giving me trouble the last day or so, but my sources should be current as of a couple of days ago. > [...] So if it follows the RFC, it shouldn't have any problems. Regarding UW IMAP, I sort of understood that Mark Crispin at UW did most of the work on the IMAP4 spec, so one would imagine that UW IMAP is at least a good example of a server, if not the reference server. (There is probably some sort of history page at http://www.imap.org/ that neither you nor I have time to read...) >> 2. Have there been any (important) changes to imaplib since 2.2.2? > > Yes, but none that should effect the filter, unless you are trying to > use SSL [...] Okay, so I can stick to 2.2.2 and not worry about it. Great. > IMAP doesn't have any method of tracking mail persistently. There is > a unique ID, but this can change. Can you explain the circumstances under which the unique ID can change? I'm envisioning a system in which I have ham and spam training folders that only get messages when the user is certain of his/her decision. 
In other words, a message would never move from the ham folder to the spam folder, or vice versa. In that sort of situation, would it be possible to rely on IMAP's unique ID to track messages? > To get around this, we add our own unique id in a "X-Spambayes-MailId" > header. It will only do this the first time it sees a message, so > next time you train, only new messages will be altered. That explains the need to modify the folder contents, thanks. > If it's not working, then please submit a bug with details - I'll try > and get to fixing it ASAP (although this is a busy RL week). With the > number of ways that IMAP is implemented, the only way to do full > testing > is to have people with different servers report any problems that they > have. If I'm totally stumped I'll be sure to submit a bug, or at least to ask a very specific question or two. I'd like to try to fix things on my own first, though. The first step there would be to figure out why adding -i4 to the command line doesn't give me any additional console output. I know where to look, though. Thanks for your help. - Abhay From ark3 at email.com Wed May 14 12:06:13 2003 From: ark3 at email.com (Abhay Saxena) Date: Wed May 14 11:06:20 2003 Subject: [Spambayes] Does anyone care about this report? In-Reply-To: <16066.16596.8818.203461@montanaro.dyndns.org> Message-ID: <9AC93596-861D-11D7-B809-00039345DE8C@email.com> On Wednesday, May 14, 2003, at 09:12 AM, Skip Montanaro wrote: > I use hammiefilter from procmail and let procmail do the locking. > It's done > at a higher level, so there's only ever one hammiefilter process > running. > That might be an option for some people. That's what I do as well, for filtering. But filtering is a read-only process with respect to hammie.db, right? As for training, which obviously must modify hammie.db, I think it's safe to assume that there will only be one training process going on at a time (manually, via cron, etc.). 
The real question is: What happens if email comes in while training is in progress? That's the exact question at the end of David Abrahams' document, but it seemed like a different question was getting answered. On my FreeBSD box using dbhash, I'm able to open a database read-only as many simultaneous times as I want while holding it open read-write in another process. Does spambayes always open the database in read-write mode? - Abhay From popiel at wolfskeep.com Wed May 14 09:25:51 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Wed May 14 11:25:56 2003 Subject: [Spambayes] Does anyone care about this report? In-Reply-To: Message from Abhay Saxena of "Wed, 14 May 2003 11:06:13 EDT." <9AC93596-861D-11D7-B809-00039345DE8C@email.com> References: <9AC93596-861D-11D7-B809-00039345DE8C@email.com> Message-ID: <20030514152551.67CF62DE88@cashew.wolfskeep.com> In message: <9AC93596-861D-11D7-B809-00039345DE8C@email.com> Abhay Saxena writes: >On Wednesday, May 14, 2003, at 09:12 AM, Skip Montanaro wrote: >> I use hammiefilter from procmail and let procmail do the locking. >> It's done at a higher level, so there's only ever one hammiefilter >> process running. That might be an option for some people. > >That's what I do as well, for filtering. But filtering is a read-only >process with respect to hammie.db, right? I do both filtering and training from procmail, so answering the pure-filtering question is rather moot for me. However, I _think_ that filtering is currently a read-only process. In the past (when there were access counts on the words) it definitely was not. >As for training, which obviously must modify hammie.db, I think it's >safe to assume that there will only be one training process going on >at a time (manually, via cron, etc.). I cannot make this assumption, since I train both through procmail and from cron. 
I could simplify my situation slightly by having my cron job send me a mail which got picked up by procmail to do the full retrain (and then use procmail's locking for all access to the db), but that's a bit more convoluted than I want to deal with. >The real question is: What happens if email comes in while training is >in progress? That's the exact question at the end of David Abrahams' >document, but it seemed like a different question was getting answered. I thought that I had answered it with "I don't know, it depends on the precise implementation of the db". Without knowing the internals of the db implementation, I cannot say if a read would fail if a write was in progress at the same time. It is perfectly conceivable that a read could see an updated index page which pointed to a data page that hadn't yet been written out... and it's equally conceiveable that the db layer would sequence its writes (and do various other data organization magic) to prevent that or other equally bad scenarios from happening. >On my FreeBSD box using dbhash, I'm able to open a database read-only >as many simultaneous times as I want while holding it open read-write >in another process. Does spambayes always open the database in >read-write mode? I suspect that spambayes always opens read-write out of sheer laziness, hiding the intent of the controlling program from the database access layer. However, as I point out above, it's rather irrelevant if the db implementation doesn't protect against bad things happening if a read happens in the middle of a write operation. Given that spambayes doesn't actively control the db implementation's semantic details (and I think this qualifies as a detail), I really don't think it's trustworthy. On the other hand, those of us who keep all their mail for retraining anyway don't care all that much if the db gets corrupted; we can just rebuild the db in case of error and move on. 
- Alex From tys at cs.ualberta.ca Wed May 14 11:09:00 2003 From: tys at cs.ualberta.ca (Tys von Gaza) Date: Wed May 14 12:09:40 2003 Subject: [Spambayes] Outlook Plugin Problems... In-Reply-To: <07a301c31955$dcaee5b0$530f8490@eden> Message-ID: <000701c31a33$21caa9d0$b5f2b38e@gaza> The process is terminated (not showing in Processes in the task manager) when I am running addin.py. I get the following when I do run it: D:\spambayes\Outlook2000>addin.py D:\Python22\lib\site-packages\win32com\universal.py:15: UserWarning: win32com.universal argument passing support is incomplete - only types covered in win32com.servers.test_pycomtest are supported warnings.warn(msg) Registered: SpamBayes.OutlookAddin Running Outlook2000\train.py works (again no output in the Trace Debugger). Some more details on what I am running: Outlook XP Python 2.2.2 PythonWin build 150 Is there any reason the debugger wouldn't pick up any debug info? Tys von Gaza tys@cs.ualberta.ca > -----Original Message----- > From: Mark Hammond [mailto:mhammond@skippinet.com.au] > Sent: May 13, 2003 7:45 AM > To: 'Tys von Gaza'; SpamBayes@python.org > Subject: RE: [Spambayes] Outlook Plugin Problems... > > > I remember there was some discussion a while back about some > > changes to > > the Outlook plugin that caused problems when upgrading, I > > think I might > > be running into these again. Any ideas? > > There are no known problems that will cause completely silent failure. > Are > you sure that the old Outlook process terminated? There should always be > a > startup message in the log, so completely silent is a new one on me. Did > you try re-running "addin.py"? If all else fails, does running, say > "Outlook2000\train.py" from the command-line work? > > Mark. From skip at pobox.com Wed May 14 11:53:40 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 14 12:16:56 2003 Subject: [Spambayes] Does anyone care about this report? 
In-Reply-To: <20030514152551.67CF62DE88@cashew.wolfskeep.com> References: <9AC93596-861D-11D7-B809-00039345DE8C@email.com> <20030514152551.67CF62DE88@cashew.wolfskeep.com> Message-ID: <16066.26244.374378.915913@montanaro.dyndns.org> >> That's what I do as well, for filtering. But filtering is a read-only >> process with respect to hammie.db, right? Alex> I do both filtering and training from procmail, so answering the Alex> pure-filtering question is rather moot for me. However, I _think_ Alex> that filtering is currently a read-only process. Yes, I believe the database is opened read-only for classification. >> As for training, which obviously must modify hammie.db, I think it's >> safe to assume that there will only be one training process going on >> at a time (manually, via cron, etc.). Alex> I cannot make this assumption, since I train both through procmail Alex> and from cron. I could simplify my situation slightly by having Alex> my cron job send me a mail which got picked up by procmail to do Alex> the full retrain (and then use procmail's locking for all access Alex> to the db), but that's a bit more convoluted than I want to deal Alex> with. How about wrapping your cron training in a shell or Python script which uses the same sort of file locking as procmail? That way they'd play nice together. Another option is to train into a separate database. (See below.) >> The real question is: What happens if email comes in while training >> is in progress? That's the exact question at the end of David >> Abrahams' document, but it seemed like a different question was >> getting answered. Alex> I thought that I had answered it with "I don't know, it depends on Alex> the precise implementation of the db". Without knowing the Alex> internals of the db implementation, I cannot say if a read would Alex> fail if a write was in progress at the same time. Agreed. In fact, without getting a little messy, Spambayes doesn't really know "what lies beneath" anydbm. 
It could figure that out using whichdb then attempt to do the right thing for each of the different kinds of databases, but there's no guarantee *anything* can be done. Does anyone know what the file locking properties of dumbdbm or dbm are? What about gdbm or dbm via the berkeley db package? All are available through Python, and thus susceptible to use by anydbm. I think the ultimate solution has got to come from higher up (e.g., implement the same file locking that procmail uses). That said, we could have Spambayes implement its own file locking scheme which would (hopefully) work transparently on all platforms. That would avoid the issue of locking individual files altogether. Most, if not all, applications know soon after startup if they are going to need read/write access or just read access to their database files. They should be able to create the appropriate kind of lock file which other Spambayes applications would honor. Alex> On the other hand, those of us who keep all their mail for Alex> retraining anyway don't care all that much if the db gets Alex> corrupted; we can just rebuild the db in case of error and move Alex> on. I retrain to a different file, then rename it or copy it into place. I've never had a problem (which is not to say I won't someday). Clearly retraining into the same file from which classification is done opens up a much bigger window of opportunity for gremlins to sneak in. David, are we making any progress on your question? I think you'd have gotten a bit quicker resolution on it had it not been hidden at the end of your IMAP filter document. I saw "IMAP" and hit the 'd' key. I suspect others did as well. Skip From popiel at wolfskeep.com Wed May 14 10:19:02 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Wed May 14 12:19:06 2003 Subject: [Spambayes] Does anyone care about this report? In-Reply-To: Message from Skip Montanaro of "Wed, 14 May 2003 10:53:40 CDT." 
<16066.26244.374378.915913@montanaro.dyndns.org> References: <9AC93596-861D-11D7-B809-00039345DE8C@email.com> <20030514152551.67CF62DE88@cashew.wolfskeep.com> <16066.26244.374378.915913@montanaro.dyndns.org> Message-ID: <20030514161902.BD4EC2DE88@cashew.wolfskeep.com> In message: <16066.26244.374378.915913@montanaro.dyndns.org> Skip Montanaro writes: > >How about wrapping your cron training in a shell or Python script which uses >the same sort of file locking as procmail? That would require knowing the type of file locking that procmail uses... and that isn't well advertized either. :-) Procmail makes it easy to know which file is being used for the locking (heck, you can specify that in the procmail script), but the locking method is build-dependent, and I haven't bothered to research it. >Another option is to train into a separate database. I already do this (and rename the new db into place), but it just narrows the window for bad stuff to happen. Narrows it from 21 minutes to a fraction of a second, which for my uses is good enough... but to be perfectly correct, there should be a lock around the rename. >That said, we could have Spambayes implement its own file locking scheme >which would (hopefully) work transparently on all platforms. This is probably the way to go, if we even want to bother. Merely documenting that spambayes doesn't do any locking (and that if you have the possibility of concurrent access, you should take other measures yourself) is probably good enough for the vast majority of users. >I retrain to a different file, then rename it or copy it into place. I've >never had a problem (which is not to say I won't someday). I've had a problem exactly once, and I'm not sure if it was a concurrent access problem or some other db corruption bug. I never bothered to investigate... just rebuilt the db using my nightly rebuild script, and moved on. 
;-) (Yes, I deliberately have my procmail set up so that if spambayes is barfing and/or not labeling messages, they get delivered to my standard inbox; fail to passthrough is much better in this case than fail to bitbucket.) - Alex From zshepherd at alphadatasys.com Wed May 14 12:27:34 2003 From: zshepherd at alphadatasys.com (D. Zane Shepherd) Date: Wed May 14 12:34:37 2003 Subject: [Spambayes] Outlook Express Message-ID: <004901c31a35$bc93d660$c2cbd940@swbell.net> Hi! I learned of your product from an educational and consulting firm in LA known as K2 Enterprises. They speak very highly of this product. Is it compatible with Outlook Express? Thank you in advance for a response. Zane Shepherd Dallas, TX From skip at pobox.com Wed May 14 13:19:29 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 14 13:19:47 2003 Subject: [Spambayes] Outlook Express In-Reply-To: <004901c31a35$bc93d660$c2cbd940@swbell.net> References: <004901c31a35$bc93d660$c2cbd940@swbell.net> Message-ID: <16066.31393.242465.935073@montanaro.dyndns.org> Zane> Hi! I learned of your product from an educational and consulting Zane> firm in LA known as K2 Enterprises. They speak very highly of Zane> this product. Is it compatible with Outlook Express? Thank you Zane> in advance for a response. Not directly. You should be able to install Python+Spambayes on your system and use the pop3proxy to train and classify incoming mail though. You would then point Outlook Express at the proxy instead of the real POP3 server. If you use IMAP to retrieve your mail there is an IMAP filter in the works, though I don't know how ready for prime time it is at this point. 
-- Skip Montanaro skip@pobox.com http://www.musi-cal.com/ From akiva at atwood.co.il Wed May 14 21:13:57 2003 From: akiva at atwood.co.il (Akiva Atwood) Date: Wed May 14 14:01:34 2003 Subject: [Spambayes] RE: No disk in drice X In-Reply-To: Message-ID: > > [Greg Lozier] > > I installed spambayes today and while installing I received an > > error message that there is no disk in the drive. Please insert a > > disk into drive \device\harddisk1\dr3. I cancel the error message it > > finish loading. It is operational in outlook 2000 now, but when I > > start outlook I get the above error message. After hitting cancel > > a few times it will go away and outlook will load. > > I had the same thing happen yesterday -- once I rebooted the problem went away. I'm running XP Pro with Outlook 2000 Akiva ================================= "We took risks, we knew we took them; things have come out against us, and therefore we have no cause for complaint, but bow to the will of Providence, determined still to do our best to the last." Robert Falcon Scott From bbands at yahoo.com Wed May 14 12:10:46 2003 From: bbands at yahoo.com (John Bollinger) Date: Wed May 14 14:10:51 2003 Subject: [Spambayes] Outlook--SpamBayes fails to load In-Reply-To: Message-ID: <20030514181046.11364.qmail@web13903.mail.yahoo.com> Before I post a bug... 
--jab Current binary, Win2k, Outlook 2000 sp3 spambayes1.log SpamAddin - Connecting to Outlook Loaded bayes database from 'C:\Documents and Settings\bbands\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\bbands\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 415 spam and 229 good messages Loaded databases in 14.8108ms pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 166, in __init__ File "E:\src\spambayes\Outlook2000\msgstore.py", line 117, in __init__ File "E:\src\spambayes\Outlook2000\msgstore.py", line 158, in _GetMessageStore pywintypes.com_error: (-2147221231, 'ClassFactory cannot supply requested class', None, None) ===== John Bollinger, CFA, CMT www.BollingerBands.com If you advance far enough, you arrive at the beginning. __________________________________ Do you Yahoo!? The New Yahoo! Search - Faster. Easier. Bingo. http://search.yahoo.com From ark3 at email.com Wed May 14 16:57:02 2003 From: ark3 at email.com (Abhay Saxena) Date: Wed May 14 15:57:15 2003 Subject: [Spambayes] Does anyone care about this report? In-Reply-To: <20030514161902.BD4EC2DE88@cashew.wolfskeep.com> Message-ID: <3AFB3197-8646-11D7-B809-00039345DE8C@email.com> Okay. Relying on the underlying database implementation to do the right thing when multiple processes have the database open, even in the many readers and one writer case, is not safe. Got it. 
That much I understand. Training into a temporary database and then replacing the live database by renaming over it sounds like a good option. At least on Unix-ish filesystems, aren't we guaranteed that the rename-replace operation is atomic? In other words, is it true that a program is guaranteed to get either the original file (and to retain access to it until it close()es the fd) or the new file? After all, the new file will have a different inode (or equivalent) and reside on a different part of the disk. I suppose this line of thought still makes the assumption that the underlying database implementation doesn't close and reopen the file or try to open the file more than once... On Wednesday, May 14, 2003, at 12:19 PM, T. Alexander Popiel wrote: > (Yes, I deliberately have my procmail set up so that if spambayes is > barfing and/or not labeling messages, they get delivered to my > standard inbox; fail to passthrough is much better in this case than > fail to bitbucket.) Are you the Alex who wrote the stuff in spambayes/contrib (bulkgraph.py et al)? If so, does the sample procmailrc in that directory have your trick for doing the right thing if hammiefilter.py barfs? I don't quite see how the right thing would occur if hammiefilter.py were to die with an unhandled exception, for example. On the other hand, I'm not very knowledgeable about procmail, so I may just be missing it. Thanks for helping me get clued-in, folks. 
- Abhay From noreply at sourceforge.net Wed May 14 14:18:46 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 16:18:53 2003 Subject: [Spambayes] [ spambayes-Bugs-737874 ] Outlook crash wipes out SpamBayes addin settings & training Message-ID: Bugs item #737874, was opened at 2003-05-14 16:18 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737874&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Scott Sanders (giveuptheghost) Assigned to: Mark Hammond (mhammond) Summary: Outlook crash wipes out SpamBayes addin settings & training Initial Comment: When Outlook crashes, it wipes out the SpamBayes Outlook Addin's settings, including all its training. At least, it does for me. Most programs have their important data and settings in a file somewhere or the Registry where it somehow can't be corrupted, much less erased, when a problem comes up. Maybe SpamBayes should look into this concept. I've attached the log for the affected session, though, after perusing it myself, it looks like SpamBayes is not causing Outlook to crash. I think it has to do with my network printer driver when I'm offline from the network, which causes a similar problem with other programs. (As a workaround, I've removed the Print button from the Outlook windows that I use, since it seems to crash just on mouseover!) I'm running SpamBayes Outlook Addin 002 with Outlook XP under Windows XP Pro. The other Outlook COM addins that are enabled are SpamSource and SyncNow. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737874&group_id=61702 From popiel at wolfskeep.com Wed May 14 14:51:25 2003 From: popiel at wolfskeep.com (T. 
Alexander Popiel)
Date: Wed May 14 16:51:28 2003
Subject: [Spambayes] Does anyone care about this report?
In-Reply-To: Message from Abhay Saxena of "Wed, 14 May 2003 15:57:02 EDT." <3AFB3197-8646-11D7-B809-00039345DE8C@email.com>
References: <3AFB3197-8646-11D7-B809-00039345DE8C@email.com>
Message-ID: <20030514205125.354512DE88@cashew.wolfskeep.com>

In message: <3AFB3197-8646-11D7-B809-00039345DE8C@email.com>
Abhay Saxena writes:
>
>Training into a temporary database and then replacing the live database
>by renaming over it sounds like a good option. At least on Unix-ish
>filesystems, aren't we guaranteed that the rename-replace operation is
>atomic?

Yes, on unix-oids, the rename operation is filesystem-atomic.

>I suppose this line of thought still makes the assumption that the
>underlying database implementation doesn't close and reopen the file or
>try to open the file more than once...

Bingo. Also, in multi-file db implementations, it gets hairier.

>On Wednesday, May 14, 2003, at 12:19 PM, T. Alexander Popiel wrote:
>> (Yes, I deliberately have my procmail set up so that if spambayes is
>> barfing and/or not labeling messages, they get delivered to my
>> standard inbox; fail to passthrough is much better in this case than
>> fail to bitbucket.)
>
>Are you the Alex who wrote the stuff in spambayes/contrib (bulkgraph.py
>et al)?

Somebody reads my stuff! Yay!

>If so, does the sample procmailrc in that directory have your
>trick for doing the right thing if hammiefilter.py barfs?

Yes. The failsafe nature is not really obvious. For the first part, standard procmail holds that if a filter (and I'm using spambayes as a filter) exits with a non-zero exit code, then the original mail is retained instead of the filtered version. For the second part, I check for specific X-Spambayes-Classification lines, and if there isn't one present, then the mail goes into my inbox.

>Thanks for helping me get clued-in, folks.

Not a problem.
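
The train-then-rename approach discussed in this thread can be sketched roughly as follows. This is only an illustration under stated assumptions: a Unix-like filesystem, a single-file database, and readers that open the file once. `retrain_atomically` and the `train_into` callback are hypothetical names, not part of SpamBayes.

```python
import os
import tempfile


def retrain_atomically(live_path, train_into):
    """Build a fresh database next to live_path, then rename it into place.

    train_into is a hypothetical callable that writes a complete
    database to the path it is given.
    """
    directory = os.path.dirname(os.path.abspath(live_path))
    # Create the temporary file in the same directory so the rename
    # stays on one filesystem, which is what makes it atomic on POSIX.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    os.close(fd)
    try:
        train_into(tmp_path)
        # Readers that already hold the old file open keep reading it;
        # later opens get the new file. No reader sees a partial write.
        os.replace(tmp_path, live_path)
    except BaseException:
        # Leave the live database untouched if training fails.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

As pointed out above, this does not cover multi-file database formats, where each file would have to be renamed separately, nor an implementation that closes and reopens its file.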
- Alex

From mail at peterbe.com Wed May 14 23:30:36 2003
From: mail at peterbe.com (Peter Bengtsson)
Date: Wed May 14 17:36:34 2003
Subject: [Spambayes] Web Interface enhancements
In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13019791B8@its-xchg4.massey.ac.nz>
Message-ID: <5.2.0.9.0.20030514222925.022219d0@mail.grenna.net>

> >
>Done (in cvs). The refresh button is normally between the "previous
>day" and "today" buttons (which might be disabled). If you train
>everything and get to the "no untrained messages" page, there is a link
>"check again". I worry a little that this might make people think that
>it checks for mail on the server, which it doesn't (the mail program
>needs to do that). Thoughts?

On a slightly different note...
Can you [in the web interface] disable caching?
It's happened many times that I have to press the Reload button after clicking a link.

Cheers, Peter

From T.A.Meyer at massey.ac.nz Thu May 15 10:39:47 2003
From: T.A.Meyer at massey.ac.nz (Meyer, Tony)
Date: Wed May 14 17:40:33 2003
Subject: [Spambayes] Web Interface enhancements
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301979429@its-xchg4.massey.ac.nz>

> On a slightly different note...
> Can you [in the web interface] disable caching?
> It's happened many times that I have to press the Reload button after
> clicking a link.

I noticed this too, and I tried to do it - all the pages have:

which I thought should do the trick, but seems a bit patchy.

Can anyone correct the meta tags or tell me what a better method is?

=Tony Meyer

From destreel at home.nl Thu May 15 00:40:34 2003
From: destreel at home.nl (destreel)
Date: Wed May 14 17:44:23 2003
Subject: [Spambayes] installation fault
Message-ID: <000001c31a61$765c0d80$0500a8c0@woonkamer>

Hi, spambayes outlook-addin is listed but not checked. When I check it and restart outlook xp the add-in is unchecked again. I did register the dll already! Please advise.

PWorCe

-------------- next part --------------
A non-text attachment was scrubbed...
Name: spambayes1.log Type: application/octet-stream Size: 2056 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030514/21e2815d/spambayes1.obj From tys at cs.ualberta.ca Wed May 14 17:03:55 2003 From: tys at cs.ualberta.ca (Tys von Gaza) Date: Wed May 14 18:04:34 2003 Subject: [Spambayes] Outlook Plugin Problems... In-Reply-To: <000701c31a33$21caa9d0$b5f2b38e@gaza> Message-ID: <002401c31a64$b6af2810$b5f2b38e@gaza> Well I ended up trying the outlook installer because I couldn't handle all the spam (it is amazing how HUGE a difference SpamBayes makes, kudos to you all) and it worked fine. I have no idea why the CVS version wouldn't. I'll take a closer look when I have more time. Tys von Gaza tys@cs.ualberta.ca > -----Original Message----- > From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] > On Behalf Of Tys von Gaza > Sent: May 14, 2003 10:09 AM > To: 'Mark Hammond'; SpamBayes@python.org > Subject: RE: [Spambayes] Outlook Plugin Problems... > > The process is terminated (not showing in Processes in the task manager) > when I am running addin.py. I get the following when I do run it: > > D:\spambayes\Outlook2000>addin.py > D:\Python22\lib\site-packages\win32com\universal.py:15: UserWarning: > win32com.universal argument passing support is incomplete - only types > covered in win32com.servers.test_pycomtest are supported > warnings.warn(msg) > Registered: SpamBayes.OutlookAddin > > Running Outlook2000\train.py works (again no output in the Trace > Debugger). > > Some more details on what I am running: > Outlook XP > Python 2.2.2 > PythonWin build 150 > > Is there any reason the debugger wouldn't pick up any debug info? > > Tys von Gaza > tys@cs.ualberta.ca > > > -----Original Message----- > > From: Mark Hammond [mailto:mhammond@skippinet.com.au] > > Sent: May 13, 2003 7:45 AM > > To: 'Tys von Gaza'; SpamBayes@python.org > > Subject: RE: [Spambayes] Outlook Plugin Problems... 
> > > > > I remember there was some discussion a while back about some > > > changes to > > > the Outlook plugin that caused problems when upgrading, I > > > think I might > > > be running into these again. Any ideas? > > > > There are no known problems that will cause completely silent failure. > > Are > > you sure that the old Outlook process terminated? There should always > be > > a > > startup message in the log, so completely silent is a new one on me. > Did > > you try re-running "addin.py"? If all else fails, does running, say > > "Outlook2000\train.py" from the command-line work? > > > > Mark. > > > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes From richard at death2spam.net Thu May 15 09:32:09 2003 From: richard at death2spam.net (Richard Jowsey) Date: Wed May 14 18:38:54 2003 Subject: [Spambayes] Web Interface enhancements In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301979429@its-xchg4.massey.ac.nz> Message-ID: <3EC35089.20429.1750F41F@localhost> You might additionally set some "anti-cache" HTTP headers, like so: Last-Modified: [the current date-time] Expires: "Thu, 01 Jan 1970 00:00:00 GMT" Cache-Control: "no-cache, must-revalidate, no-store" Pragma: "no-cache" Cache: "no-cache" These, plus your META tags, ought to convince the majority of browsers to refresh those pages. YMMV. Cheers, Richard > > On a slightly different note... > > Can you [in the web interface] disable caching? > > It's happened many times that I have to press the Reload button > > after clicking a link. > > I noticed this too, and I tried to do it - all the pages have: > > > which I thought should do the trick, but seems a bit patchy. > > Can anyone correct the meta tags or tell me what a better method is? 
> > =Tony Meyer From T.A.Meyer at massey.ac.nz Thu May 15 12:13:00 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 14 19:13:48 2003 Subject: [Spambayes] installation fault Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13019794B8@its-xchg4.massey.ac.nz> > Hi, spambayes outlook-addin is listed but not checked. When > I check it and restart outlook xp the add-in is unchecked > again. I did register the dll already! This is known; see: [ 712480 ] Outlook 2002 (XP) installation fails I'm fairly sure that Mark has said that this will be fixed in the next binary release of the plugin; for the moment, I believe the only solution is to use the CVS version (or alpha2, perhaps). =Tony Meyer From noreply at sourceforge.net Wed May 14 17:13:52 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 19:13:56 2003 Subject: [Spambayes] [ spambayes-Bugs-737955 ] Transient connection error disables plugin Message-ID: Bugs item #737955, was opened at 2003-05-15 09:13 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737955&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Transient connection error disables plugin Initial Comment: If SpamBayes fails to connect to Outlook (probably due to a temporary Outlook error), spambayes fails and is disabled by Outlook. Original reporter of the bug confirms that re-booting and re-enabling the addin caused it to continue normally. Before I post a bug... 
--jab Current binary, Win2k, Outlook 2000 sp3 spambayes1.log SpamAddin - Connecting to Outlook Loaded bayes database from 'C:\Documents and Settings\bbands\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\bbands\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 415 spam and 229 good messages Loaded databases in 14.8108ms pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 166, in __init__ File "E:\src\spambayes\Outlook2000\msgstore.py", line 117, in __init__ File "E:\src\spambayes\Outlook2000\msgstore.py", line 158, in _GetMessageStore pywintypes.com_error: (-2147221231, 'ClassFactory cannot supply requested class', None, None) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737955&group_id=61702 From noreply at sourceforge.net Wed May 14 17:21:48 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 19:22:02 2003 Subject: [Spambayes] [ spambayes-Bugs-737874 ] Outlook crash wipes out SpamBayes addin settings & training Message-ID: Bugs item #737874, was opened at 2003-05-15 08:18 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737874&group_id=61702 Category: Outlook Group: None Status: Open 
Resolution: None Priority: 5 Submitted By: Scott Sanders (giveuptheghost) Assigned to: Mark Hammond (mhammond) Summary: Outlook crash wipes out SpamBayes addin settings & training Initial Comment: When Outlook crashes, it wipes out the SpamBayes Outlook Addin's settings, including all its training. At least, it does for me. Most programs have their important data and settings in a file somewhere or the Registry where it somehow can't be corrupted, much less erased, when a problem comes up. Maybe SpamBayes should look into this concept. I've attached the log for the affected session, though, after perusing it myself, it looks like SpamBayes is not causing Outlook to crash. I think it has to do with my network printer driver when I'm offline from the network, which causes a similar problem with other programs. (As a workaround, I've removed the Print button from the Outlook windows that I use, since it seems to crash just on mouseover!) I'm running SpamBayes Outlook Addin 002 with Outlook XP under Windows XP Pro. The other Outlook COM addins that are enabled are SpamSource and SyncNow. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-15 11:21 Message: Logged In: YES user_id=552329 The settings are saved in various files which, for you, are in the directory: C:\Documents and Settings\Administrator\Application Data\SpamBayes\ Neither the database nor the config pickle should be corrupted by an Outlook crash unless they were being used at the time. At one point the plugin didn't save the config settings until shutdown (this might even still be the case), so if you are crashing before you close Outlook, you would never save the settings, which would cause this behaviour. Check to see if the files are there, before and after the crash, and if they are modifed after the crash. It certainly does seem from the log that neither file is saved. 
This could perhaps be changed, but as I said above, it might have done so already since the 002 release.

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737874&group_id=61702

From noreply at sourceforge.net Wed May 14 17:19:31 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed May 14 19:24:49 2003
Subject: [Spambayes] [ spambayes-Bugs-737956 ] No hourglass when building folder lists
Message-ID:

Bugs item #737956, was opened at 2003-05-15 09:19
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737956&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Mark Hammond (mhammond)
Assigned to: Mark Hammond (mhammond)
Summary: No hourglass when building folder lists

Initial Comment:
Building the folder list can be slow with an Exchange server. For a few seconds, SpamBayes looks like it is doing nothing - it needs an hourglass.

We will also need an hourglass when expanding trees, and they are done on demand.
----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737956&group_id=61702

From noreply at sourceforge.net Wed May 14 17:29:57 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed May 14 19:30:27 2003
Subject: [Spambayes] [ spambayes-Bugs-737614 ] Messages go back to "unread" status
Message-ID:

Bugs item #737614, was opened at 2003-05-14 22:44
Message generated for change (Comment added) made by anadelonbrin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737614&group_id=61702

Category: Outlook
Group: None
>Status: Closed
>Resolution: Duplicate
Priority: 5
Submitted By: Paul Moore (pmoore)
Assigned to: Mark Hammond (mhammond)
Summary: Messages go back to "unread" status

Initial Comment:
Occasionally, when using the Outlook plugin, messages revert to "unread" status while they are being read. The behaviour isn't particularly reproducible, which makes it hard to test :-(

I'm using Exchange as my mail backend, Outlook 2000, and a version of CVS from somewhen (is there a version number I can check? - there are files dated 9th April...)

This has been around for a while, and discussed on the mailing list, but I couldn't find a bug for it - if there is one, sorry, feel free to close this one!

----------------------------------------------------------------------

>Comment By: Tony Meyer (anadelonbrin)
Date: 2003-05-15 11:29

Message:
Logged In: YES
user_id=552329

I had to look three times, but I eventually found it. This duplicates:

[ 716684 ] Filtering marks message as unread

Note that this is related to (but not the same as) feature request 680629.
----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737614&group_id=61702

From T.A.Meyer at massey.ac.nz Thu May 15 12:38:56 2003
From: T.A.Meyer at massey.ac.nz (Meyer, Tony)
Date: Wed May 14 19:39:41 2003
Subject: [Spambayes] Suddenly a lot of errors
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13019794EF@its-xchg4.massey.ac.nz>

> I don't have a hammie.db file.
> I have 3 files called: hammie.db.dat, hammie.db.bak, hammie.db.dir in
> C:\Python22\ all 1Kb big.
> And the same 3 files in C:\Python22\spambayes\ all bigger than 1Kb.
> hammie.db.dat is 42Mb.

I think the problem is probably that you have all of these. Use the web interface to set full pathnames for these files. You are interested in the "storage file name" and "message information file name" options, towards the end of the configuration page. The problem is that these default to the current working directory, which, obviously, changes. The one with the information will be the 42Mb one (which is pretty big; you must have trained an awful lot).

> And if I "delete" my hammie files and start again, isn't it
> quite possible that the errors will return again with time.

It shouldn't. If it does then there is definitely a bug here; at the moment it just looks like a problem with having multiple databases.

> Should I fetch the latest CVS?

That depends ;). Without knowing what things were like when you got the code you have now I can't answer. From what I can tell the latest CVS is reasonably stable (touch wood).
=Tony Meyer

From noreply at sourceforge.net Wed May 14 17:49:55 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed May 14 19:49:58 2003
Subject: [Spambayes] [ spambayes-Bugs-737967 ] Malformed messages break pop3proxy (et al)
Message-ID:

Bugs item #737967, was opened at 2003-05-15 11:49
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737967&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Tony Meyer (anadelonbrin)
Assigned to: Nobody/Anonymous (nobody)
Summary: Malformed messages break pop3proxy (et al)

Initial Comment:
Those applications using message.py (pop3proxy, imapfilter, maybe others at some point) die if a message fails to parse through the email package.
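
One way an application could avoid dying on such a message is to fall back to a headers-only parse, so the mail still passes through with its headers intact. A minimal sketch, assuming the parser signals failure by raising an exception (newer versions of the email package mostly record defects instead of raising); `parse_or_passthrough` and the `X-Parse-Error` header are hypothetical names, not part of message.py:

```python
import email
from email.parser import HeaderParser


def parse_or_passthrough(raw_text, parser=email.message_from_string):
    """Parse raw_text fully; if that fails, keep headers and raw body.

    The parser argument exists so a stricter (raising) parser can be
    plugged in; the default is the standard library's lenient one.
    """
    try:
        return parser(raw_text)
    except Exception as exc:
        # Headers-only parse: the body is kept as one unparsed string,
        # so malformed MIME structure in it cannot cause a failure.
        msg = HeaderParser().parsestr(raw_text)
        # Hypothetical marker header recording why the fallback ran.
        reason = "%s: %s" % (type(exc).__name__, exc)
        msg["X-Parse-Error"] = reason.replace("\r", " ").replace("\n", " ")
        return msg
```

The design choice here is to fail towards delivering the message with its original headers rather than dropping them, which matches the "fail to passthrough" preference voiced elsewhere in this thread.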
----------------------------------------------------------------------

>Comment By: Tony Meyer (anadelonbrin)
Date: 2003-05-15 11:50

Message:
Logged In: YES
user_id=552329

(Comment from the list)

As others have said, hammie eats the headers if there is a message parsing failure. This is fairly easy to cause with a malformed MIME section. I mentioned this on list a couple times a few months ago (including a message which causes it to happen), and the general reaction at the time was that Barry should fix the non-strict MIME parser in the email package, rather than making the error handling in spambayes itself more robust.

This hasn't been a high-priority task, given that it's really only idiots like me who hand-craft MIME messages (with the occasional error) who are sending malformed messages to trigger the problem.

- Alex

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737967&group_id=61702

From noreply at sourceforge.net Wed May 14 17:51:22 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed May 14 19:51:26 2003
Subject: [Spambayes] [ spambayes-Bugs-737967 ] Malformed messages break pop3proxy (et al)
Message-ID:

Bugs item #737967, was opened at 2003-05-15 11:49
Message generated for change (Comment added) made by anadelonbrin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737967&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Tony Meyer (anadelonbrin)
Assigned to: Nobody/Anonymous (nobody)
Summary: Malformed messages break pop3proxy (et al)

Initial Comment:
Those applications using message.py (pop3proxy, imapfilter, maybe others at some point) die if a message fails to parse through the email package.
---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-15 11:51 Message: Logged In: YES user_id=552329 I suspect this was behind this (really old closed) bug as well: [ 650496 ] hammie.py discards headers ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-15 11:50 Message: Logged In: YES user_id=552329 (Comment from the list) As others have said, hammie eats the headers if there is a message parsing failure. This is fairly easy to cause with a malformed MIME section. I mentioned this on list a couple times a few months ago (including a message which causes it to happen), and the general reaction at the time was that Barry should fix the non-strict MIME parser in the email package, rather than making the error handling in spambayes itself more robust. This hasn't been a high-priority task, given that it's really only idiots like me who hand-craft MIME messages (with the occasional error) who are sending malformed messages to trigger the problem. 
- Alex ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737967&group_id=61702 From noreply at sourceforge.net Wed May 14 17:52:35 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 19:52:40 2003 Subject: [Spambayes] [ spambayes-Bugs-737967 ] Malformed messages break pop3proxy (et al) Message-ID: Bugs item #737967, was opened at 2003-05-15 11:49 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737967&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Nobody/Anonymous (nobody) Summary: Malformed messages break pop3proxy (et al) Initial Comment: Those applications using message.py (pop3proxy, imapfilter, maybe others at some point) die if a message fails to parse through the email package. 
----------------------------------------------------------------------

>Comment By: Tony Meyer (anadelonbrin)
Date: 2003-05-15 11:52

Message:
Logged In: YES
user_id=552329

Moving from "Bugs item #737642":

I'm using pop3proxy (from cvs 2003.05.13) and sometimes I'm not able to retrieve my mail from one of my POP accounts. I'm getting the following error:

error: uncaptured python exception, closing channel <__main__.ServerLineReader connected at 0x138d160> (email.Errors.HeaderParseError:Not a header, not a continuation: ``charset="iso-8859-1"'' [C:\Devtools\Python\PYTHON~1.2\lib\asyncore.py|poll|99] [C:\Devtools\Python\PYTHON~1.2\lib\asyncore.py|handle_read_event|396] [C:\Devtools\Python\PYTHON~1.2\lib\asynchat.py|handle_read|130] [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|found_terminator|146] [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|onServerLine|214] [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|onResponse|288] [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|onTransaction|389] [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|onRetr|436] [C:\Devtools\SPAMBA~1\SPAMBA~1.13\spambayes\message.py|setPayload|168] [C:\Devtools\Python\PYTHON~1.2\lib\email\Parser.py|_parsebody|228] [C:\Devtools\Python\PYTHON~1.2\lib\email\Parser.py|parsestr|75] [C:\Devtools\Python\PYTHON~1.2\lib\email\Parser.py|parse|62] [C:\Devtools\Python\PYTHON~1.2\lib\email\Parser.py|_parseheaders|128])

I retrieved the mail directly without using pop3proxy and tested each message with the classify options on the web interface, and there was no problem.
Remi papaDoc@videotron.ca ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-15 11:51 Message: Logged In: YES user_id=552329 I suspect this was behind this (really old closed) bug as well: [ 650496 ] hammie.py discards headers ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-15 11:50 Message: Logged In: YES user_id=552329 (Comment from the list) As others have said, hammie eats the headers if there is a message parsing failure. This is fairly easy to cause with a malformed MIME section. I mentioned this on list a couple times a few months ago (including a message which causes it to happen), and the general reaction at the time was that Barry should fix the non-strict MIME parser in the email package, rather than making the error handling in spambayes itself more robust. This hasn't been a high-priority task, given that it's really only idiots like me who hand-craft MIME messages (with the occasional error) who are sending malformed messages to trigger the problem. 
- Alex ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737967&group_id=61702 From noreply at sourceforge.net Wed May 14 17:53:24 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 19:53:27 2003 Subject: [Spambayes] [ spambayes-Bugs-737642 ] Hang when trying to get message Message-ID: Bugs item #737642, was opened at 2003-05-14 23:59 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737642&group_id=61702 Category: pop3proxy Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Remi Ricard (papadoc) Assigned to: Nobody/Anonymous (nobody) Summary: Hang when trying to get message Initial Comment: Hi, I'm using pop3proxy (from cvs 2003.05.13) and sometime I'm not able to retreive my mail from one my pop account I'm getting the following error: error: uncaptured python exception, closing channel <__main__.ServerLineReader c onnected at 0x138d160> (email.Errors.HeaderParseError:Not a header, not a contin uation: ``charset="iso-8859-1"'' [C:\Devtools\Python\PYTHON~1.2\lib\asyncore.py| poll|99] [C:\Devtools\Python\PYTHON~1.2\lib\asyncore.py|handle_read_event|396] [ C:\Devtools\Python\PYTHON~1.2\lib\asynchat.py|handle_read|130] [C:\Devtools\SPAM BA~1\SPAMBA~1.13\POP3PR~1.PY|found_terminator|146] [C:\Devtools\SPAMBA~1\SPAMBA~ 1.13\POP3PR~1.PY|onServerLine|214] [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY |onResponse|288] [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|onTransaction|389 ] [C:\Devtools\SPAMBA~1\SPAMBA~1.13\POP3PR~1.PY|onRetr|436] [C:\Devtools\SPAMBA~ 1\SPAMBA~1.13\spambayes\message.py|setPayload|168] [C:\Devtools\Python\PYTHON~1. 
2\lib\email\Parser.py|_parsebody|228] [C:\Devtools\Python\PYTHON~1.2\lib\email\P arser.py|parsestr|75] [C:\Devtools\Python\PYTHON~1.2\lib\email\Parser.py|parse|6 2] [C:\Devtools\Python\PYTHON~1.2\lib\email\Parser.py|_parseheaders|128]) I retrieve the mail directly without using pop3proxy and tested each message with the classify options on the web interface and there was not problem. Remi papaDoc@videotron.ca N.B. How can I find which messge is causing problem and how can I get an hand on it ? ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-15 11:53 Message: Logged In: YES user_id=552329 Moved to [ spambayes-Bugs-737967 ] Malformed messages break pop3proxy (et al) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737642&group_id=61702 From T.A.Meyer at massey.ac.nz Thu May 15 12:56:45 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 14 19:58:03 2003 Subject: [Spambayes] Suddenly a lot of errors Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301979517@its-xchg4.massey.ac.nz> > This look like my error (see spambayes-Bugs-737642). I don't think that they are the same. Peter's trace shows a problem trying to get a word score, yours shows a problem trying to parse a message. > I don't think it is a database problem since when I'm using an old > version of spambayes (CVS from 2003.03.25) I don't get the error. I'm pretty sure that the problem is caused by a malformed message. There has been some discussion about these on the list recently (in particular see Alex's message), as well as a while back. I suspect that the old version of pop3proxy didn't use the email package to parse the mail, but the new (via message.py) does. I'm going to close your bug and open a new one about malformed mime messages. I'll move your comments into it, as well as Alex's. 
Something does need to be done. > N.B. How can I find which messge is causing problem and > how can I get an hand on it ? If you set the "globals":"verbose" option to True in your config file you will have a _pop3proxy.log file created that logs the POP3 conversation. If you look at that, I *think* that you will be able to figure out which message is causing the problem. (It will probably stop at one message, which is not the last one). =Tony Meyer From T.A.Meyer at massey.ac.nz Thu May 15 13:15:19 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 14 20:15:59 2003 Subject: [Spambayes] Training via IMAP Filter Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130197953A@its-xchg4.massey.ac.nz> > Can you explain the circumstances under which the unique ID > can change? Unfortunately, it can change whenever the server wants it to (blame Crispin, not me!). A good server wouldn't change them, but it is allowed to. The RFC says: The unique identifier of a message MUST NOT change during the session, and SHOULD NOT change between sessions It's the SHOULD instead of MUST that is the problem. The IMAP uids are also a pain because they are only unique within a folder, so a full id would also need to store a unique id for the folder, but no such thing exists (the name can change, and the UIDVALIDITY value *must* change at certain times). > I'm envisioning a system [...] > In that sort of situation, would it be possible > to rely on IMAP's unique ID to track messages? Unfortunately not. The way the spec is written the UID could change every time you log into the server (the RFC even offers this as a possibility). > If I'm totally stumped I'll be sure to submit a bug, or at > least to ask a very specific question or two. I'd like to try to fix > things on my own first, though. No worries. Please make sure that if you figure out the problem you submit a patch, though! 
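
As an aside on the UID point above, here is a rough sketch of what a client-side cache has to do if it insists on keying data by IMAP UID: treat the data as valid only while the folder's UIDVALIDITY is unchanged. The `UidCache` class and its method names are hypothetical and are not how imapfilter works. The sketch also shows the limitation already mentioned: a renamed folder, or a server that changes UIDVALIDITY on every login, simply loses everything.

```python
class UidCache:
    """Remember per-message data keyed by IMAP UID, one table per folder."""

    def __init__(self):
        # folder name -> (uidvalidity, {uid: data})
        self._folders = {}

    def remember(self, folder, uidvalidity, uid, data):
        validity, entries = self._folders.get(folder, (uidvalidity, {}))
        if validity != uidvalidity:
            # The server has reassigned UIDs; old entries are meaningless.
            entries = {}
        entries[uid] = data
        self._folders[folder] = (uidvalidity, entries)

    def lookup(self, folder, uidvalidity, uid):
        validity, entries = self._folders.get(folder, (None, {}))
        if validity != uidvalidity:
            # Unknown folder (e.g. renamed) or stale UIDVALIDITY.
            return None
        return entries.get(uid)
```

For example, after `cache.remember("INBOX", 1, 42, "spam")`, a lookup with the same UIDVALIDITY finds the entry, while a lookup with a different UIDVALIDITY or a different folder name returns `None`.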
> The first step there would be to figure out why > adding -i4 to the command line doesn't give me any additional console > output. I know where to look, though. That's definitely the first step. The IMAP debugging material really does clarify where things are going wrong. (And, IMO, 4 is the most useful level). =Tony Meyer From T.A.Meyer at massey.ac.nz Thu May 15 13:18:12 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 14 20:18:48 2003 Subject: [Spambayes] Outlook Express Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130197953F@its-xchg4.massey.ac.nz> > Not directly. You should be able to install Python+Spambayes > on your system and use the pop3proxy to train and classify > incoming mail though. You would then point Outlook Express > at the proxy instead of the real POP3 server. BTW, there's lots of explanation about how to do this in the docs. Note that you can't use the smtpproxy with Outlook Express at the moment, so training has to be done via the web interface (or one of the command line tools). > If you use IMAP to retrieve your mail there is an IMAP filter > in the works, though I don't know how ready for prime time it > is at this point. I would say that it's finally reached alpha. No guarantees that it will always work, but it should for the most part. *As long as no other program expunges/purges messages* your original mail is never lost, in any case. 
=Tony Meyer From noreply at sourceforge.net Wed May 14 18:20:02 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 20:20:08 2003 Subject: [Spambayes] [ spambayes-Bugs-737874 ] Outlook crash wipes out SpamBayes addin settings & training Message-ID: Bugs item #737874, was opened at 2003-05-14 16:18 Message generated for change (Comment added) made by giveuptheghost You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737874&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Scott Sanders (giveuptheghost) Assigned to: Mark Hammond (mhammond) Summary: Outlook crash wipes out SpamBayes addin settings & training Initial Comment: When Outlook crashes, it wipes out the SpamBayes Outlook Addin's settings, including all its training. At least, it does for me. Most programs have their important data and settings in a file somewhere or the Registry where it somehow can't be corrupted, much less erased, when a problem comes up. Maybe SpamBayes should look into this concept. I've attached the log for the affected session, though, after perusing it myself, it looks like SpamBayes is not causing Outlook to crash. I think it has to do with my network printer driver when I'm offline from the network, which causes a similar problem with other programs. (As a workaround, I've removed the Print button from the Outlook windows that I use, since it seems to crash just on mouseover!) I'm running SpamBayes Outlook Addin 002 with Outlook XP under Windows XP Pro. The other Outlook COM addins that are enabled are SpamSource and SyncNow. ---------------------------------------------------------------------- >Comment By: Scott Sanders (giveuptheghost) Date: 2003-05-14 20:20 Message: Logged In: YES user_id=184281 Hi. 
The last crash occurred this afternoon, and the settings files that exist and their modified dates are as follows: default_bayes_customize.ini - 5/10/2003 3:45 PM default_bayes_database.db - 5/13/2003 3:59 PM default_configuration.pck - 5/13/2003 3:59 PM default_message_database.db - 5/10/2003 3:45 PM Apparently the settings/training isn't changed until Outlook is closed? I usually have Outlook always running, and I seldom intentionally close it, except for an Outlook-related program update (rare) or if I shut down my comp. (also rare). I may be an exception, but I think the settings/training should be written at the time of setting/training. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-14 19:21 Message: Logged In: YES user_id=552329 The settings are saved in various files which, for you, are in the directory: C:\Documents and Settings\Administrator\Application Data\SpamBayes\ Neither the database nor the config pickle should be corrupted by an Outlook crash unless they were being used at the time. At one point the plugin didn't save the config settings until shutdown (this might even still be the case), so if you are crashing before you close Outlook, you would never save the settings, which would cause this behaviour. Check to see if the files are there, before and after the crash, and if they are modifed after the crash. It certainly does seem from the log that neither file is saved. This could perhaps be changed, but as I said above, it might have done so already since the 002 release. 
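
For what it's worth, saving at the time of the change need not risk corrupting the file if the host crashes mid-write. A minimal sketch, assuming the settings are a picklable object: write to a temporary file in the same directory, then swap it into place. The helper name `save_atomically` is hypothetical and this is not the plugin's actual code.

```python
import os
import pickle
import tempfile


def save_atomically(obj, path):
    """Pickle `obj` to `path` so that a crash leaves either the old or new file."""
    directory = os.path.dirname(os.path.abspath(path))
    # The temp file must live on the same filesystem for the rename to be atomic.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "wb") as tmp:
            pickle.dump(obj, tmp)
            tmp.flush()
            os.fsync(tmp.fileno())
        # Replace the old file in one step.
        os.replace(tmp_path, path)
    except BaseException:
        # Clean up the partial temp file and re-raise.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

Called after each training or configuration change, this would leave the previous file intact if the process dies before the final rename.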
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737874&group_id=61702 From noreply at sourceforge.net Wed May 14 19:00:30 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 21:00:35 2003 Subject: [Spambayes] [ spambayes-Bugs-737956 ] No hourglass when building folder lists Message-ID: Bugs item #737956, was opened at 2003-05-15 09:19 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737956&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: No hourglass when building folder lists Initial Comment: Building the folder list can be slow with an Exchange server. For a few seconds, SpamBayes looks like it is doing nothing - it needs an hourglass. We will also need an hourglass when expanding trees, and they are done on demand. 
---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-15 11:00 Message: Logged In: YES user_id=14198 Checking in FolderSelector.py; new revision: 1.17; previous revision: 1.16 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737956&group_id=61702 From noreply at sourceforge.net Wed May 14 19:04:47 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 21:04:52 2003 Subject: [Spambayes] [ spambayes-Bugs-737955 ] Transient connection error disables plugin Message-ID: Bugs item #737955, was opened at 2003-05-15 09:13 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737955&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Transient connection error disables plugin Initial Comment: If SpamBayes fails to connect to Outlook (probably due to a temporary Outlook error), spambayes fails and is disabled by Outlook. Original reporter of the bug confirms that re-booting and re-enabling the addin caused it to continue normally. Before I post a bug... 
--jab Current binary, Win2k, Outlook 2000 sp3 spambayes1.log SpamAddin - Connecting to Outlook Loaded bayes database from 'C:\Documents and Settings\bbands\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\bbands\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 415 spam and 229 good messages Loaded databases in 14.8108ms pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 166, in __init__ File "E:\src\spambayes\Outlook2000\msgstore.py", line 117, in __init__ File "E:\src\spambayes\Outlook2000\msgstore.py", line 158, in _GetMessageStore pywintypes.com_error: (-2147221231, 'ClassFactory cannot supply requested class', None, None) ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-15 11:04 Message: Logged In: YES user_id=14198 Checking in manager.py; /cvsroot/spambayes/spambayes/Outlook2000/manager.py,v <-- manager.py new revision: 1.57; previous revision: 1.56 done ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737955&group_id=61702 From noreply at sourceforge.net Wed May 14 19:09:11 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 21:09:14 2003 Subject: [Spambayes] [ spambayes-Bugs-737986 ] Message.as_string() fails. 
Message-ID:

Bugs item #737986, was opened at 2003-05-15 13:09
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737986&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Tony Meyer (anadelonbrin)
Assigned to: Nobody/Anonymous (nobody)
Summary: Message.as_string() fails.

Initial Comment:
Reported on the list and discussed via email. Sadly, this one has me beaten; I really can't figure out why it's going wrong.

---

I've attached two plain text files. One is from using Evolution, the other Mozilla. Mozilla tries to access two accounts, but it only accesses the second account after I've cancelled the session because it has hung on the first. It looks to me (great email expert that I am) that it does all the negotiations, downloads the first message and then just freezes. Note that in the Evolution file, I had it get mail from an empty server. All went fine.

---

When Evolution gets the mail, it says getting pop summary, then hangs waiting for the first message.
At the terminal window where I ran pop3proxy I get:

error: uncaptured python exception, closing channel <__main__.ServerLineReader connected at 0x8353a4c> (exceptions.TypeError:len() of unsized object [/usr/lib/python2.2/asyncore.py|poll|99] [/usr/lib/python2.2/asyncore.py|handle_read_event|396] [/usr/lib/python2.2/asynchat.py|handle_read|130] [/usr/bin/pop3proxy.py|found_terminator|147] [/usr/bin/pop3proxy.py|onServerLine|215] [/usr/bin/pop3proxy.py|onResponse|289] [/usr/bin/pop3proxy.py|onTransaction|390] [/usr/bin/pop3proxy.py|onRetr|459] [/usr/lib/python2.2/site-packages/spambayes/message.py|as_string|193] [/usr/lib/python2.2/site-packages/email/Message.py|as_string|109] [/usr/lib/python2.2/site-packages/email/Generator.py|flatten|102] [/usr/lib/python2.2/site-packages/email/Generator.py|_write|137] [/usr/lib/python2.2/site-packages/email/Generator.py|_write_headers|183] [/usr/lib/python2.2/site-packages/email/Header.py|encode|412] [/usr/lib/python2.2/site-packages/email/Header.py|_split|297] [/usr/lib/python2.2/site-packages/email/Charset.py|encoded_header_len|341])

---

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737986&group_id=61702

From noreply at sourceforge.net  Wed May 14 19:10:43 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed May 14 21:10:56 2003
Subject: [Spambayes] [ spambayes-Bugs-737986 ] Message.as_string() fails.
Message-ID:

Bugs item #737986, was opened at 2003-05-15 13:09
Message generated for change (Comment added) made by anadelonbrin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737986&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Tony Meyer (anadelonbrin)
Assigned to: Nobody/Anonymous (nobody)
Summary: Message.as_string() fails.

Initial Comment:
Reported on the list and discussed via email.
Sadly, this one has me beaten; I really can't figure out why it's going wrong.

---

I've attached two plain text files. One is from using Evolution, the other Mozilla. Mozilla tries to access two accounts, but it only accesses the second account after I've cancelled the session because it has hung on the first. It looks to me (great email expert that I am) that it does all the negotiations, downloads the first message and then just freezes. Note that in the Evolution file, I had it get mail from an empty server. All went fine.

---

When Evolution gets the mail, it says getting pop summary, then hangs waiting for the first message. At the terminal window where I ran pop3proxy I get:

error: uncaptured python exception, closing channel <__main__.ServerLineReader connected at 0x8353a4c> (exceptions.TypeError:len() of unsized object [/usr/lib/python2.2/asyncore.py|poll|99] [/usr/lib/python2.2/asyncore.py|handle_read_event|396] [/usr/lib/python2.2/asynchat.py|handle_read|130] [/usr/bin/pop3proxy.py|found_terminator|147] [/usr/bin/pop3proxy.py|onServerLine|215] [/usr/bin/pop3proxy.py|onResponse|289] [/usr/bin/pop3proxy.py|onTransaction|390] [/usr/bin/pop3proxy.py|onRetr|459] [/usr/lib/python2.2/site-packages/spambayes/message.py|as_string|193] [/usr/lib/python2.2/site-packages/email/Message.py|as_string|109] [/usr/lib/python2.2/site-packages/email/Generator.py|flatten|102] [/usr/lib/python2.2/site-packages/email/Generator.py|_write|137] [/usr/lib/python2.2/site-packages/email/Generator.py|_write_headers|183] [/usr/lib/python2.2/site-packages/email/Header.py|encode|412] [/usr/lib/python2.2/site-packages/email/Header.py|_split|297] [/usr/lib/python2.2/site-packages/email/Charset.py|encoded_header_len|341])

---

----------------------------------------------------------------------

>Comment By: Tony Meyer (anadelonbrin)
Date: 2003-05-15 13:10

Message:
Logged In: YES
user_id=552329

This is the second file.
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737986&group_id=61702 From noreply at sourceforge.net Wed May 14 19:13:54 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 14 21:14:03 2003 Subject: [Spambayes] [ spambayes-Bugs-717998 ] Can't reset Spam folder if folder is lost Message-ID: Bugs item #717998, was opened at 2003-04-09 15:37 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702 Category: Outlook Group: None >Status: Pending Resolution: None Priority: 5 Submitted By: Benjamin J. Judson (astrogen) Assigned to: Mark Hammond (mhammond) Summary: Can't reset Spam folder if folder is lost Initial Comment: If the Spam Manager is set up to move spam to a folder and that folder disappears, the Spam Manager may show that spam is to be delivered to . In this event trying to browse the folder list will not list any folders, and you will be unable to set the Spam folder to anything else. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-15 11:13 Message: Logged In: YES user_id=14198 This works for me now. The code was changed on 2003/03/07 to better handle these exceptions all over the code. I tried to reproduce your example, and do get "", but when clicking on the browse button I get a standard dialog with no folder selected. How does your spam folder keep disappearing? ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-05-02 02:19 Message: Logged In: YES user_id=752965 Is it possible to add a button that puts you back to the root folder if the number of folders available is less than 2 or something? 
Currently (and this has happened a couple times since my original posting) my only fix is to delete the configuration file (default_configuration.pck). -Astrogen ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-04-15 07:32 Message: Logged In: YES user_id=752965 Just figured there would be a log file somewhere.. but anywho... I manually ran manager.py Heres the traceback (copied and pasted from a dos prompt window) Traceback (most recent call last): File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 309, in OnInitDialog self.expand_ids = self._DetermineFoldersToExpand() File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 226, in _DetermineFoldersToExpand folder = self.manager.message_store.GetFolder(folder_id) File "C:\spambayes-1.0a2\Outlook2000\msgstore.py", line 225, in GetFolder table = folder.GetContentsTable(0) pywintypes.com_error: (-2147467259, 'Unspecified error', None, None) win32ui: OnInitDialog() virtual handler (>) raised an exception Bayes database is not dirty - not writing ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-04-15 07:26 Message: Logged In: YES user_id=752965 Where is the log file kept? I looked for it before submitting, and since your posted. I don't have the name or location for where I could find it. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-04-10 16:06 Message: Logged In: YES user_id=14198 If there a traceback associated with this? I regularly "test" this, thanks to Outlook screwing all my folder IDs as I reconfigure Outlook, and I don't have the problem. 
----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702

From mail at peterbe.com  Thu May 15 02:25:32 2003
From: mail at peterbe.com (Peter Bengtsson)
Date: Wed May 14 21:18:25 2003
Subject: [Spambayes] Bug in UserInterface.py
Message-ID: <5.2.0.9.0.20030515012350.0222d1c8@mail.grenna.net>

I checked out the latest a couple of hours ago, and when running the pop3proxy and trying to train an mbx file I got an error about StringIO in UserInterface.py

So, at the top of that file I added (including old stuff)

import re
import time
import email
import binascii
import cgi
import mailbox
import types

# Added by me
import StringIO

import PyMeldLite
import Dibbler
import tokenizer

Now it works. Sorry for not copying the error message I got.

Peter

From T.A.Meyer at massey.ac.nz  Thu May 15 14:30:30 2003
From: T.A.Meyer at massey.ac.nz (Meyer, Tony)
Date: Wed May 14 21:31:11 2003
Subject: [Spambayes] Bug in UserInterface.py
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130197959C@its-xchg4.massey.ac.nz>

> I checked out the latest a couple of hours ago, and when running the
> pop3proxy and trying to train an mbx file I got an error
> about StringIO in UserInterface.py

Fixed in cvs (about a minute ago). Thanks for pointing it out.

=Tony Meyer

From mail at peterbe.com  Thu May 15 11:46:07 2003
From: mail at peterbe.com (Peter Bengtsson)
Date: Thu May 15 05:46:59 2003
Subject: [Spambayes] Won't start the second time
Message-ID: <5.2.0.9.0.20030515104414.0220a0e8@mail.grenna.net>

Yesterday I downloaded the latest CVS version and starting it worked fine. Then I turned that off when I went to bed and tried to start it again today. This time I get the following error::

C:\Python22\spambayes>C:\python22\python.exe pop3proxy.py
Loading database...
Traceback (most recent call last):
File "pop3proxy.py", line 738, in ?
run() File "pop3proxy.py", line 713, in run state.createWorkers() File "pop3proxy.py", line 559, in createWorkers self.bayes = storage.DBDictClassifier(filename) File "spambayes\storage.py", line 139, in __init__ self.load() File "spambayes\storage.py", line 151, in load t = self.db[self.statekey] File "C:\Python22\lib\shelve.py", line 71, in __getitem__ return Unpickler(f).load() EOFError Exception exceptions.AttributeError: "'NoneType' object has no attribute 'error' " in > ignored Any ideas what this might be? Is it a bug? Solutions? Peter From mail at peterbe.com Thu May 15 11:49:39 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Thu May 15 06:30:21 2003 Subject: [Spambayes] Suddenly a lot of errors In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13019794EF@its-xchg4.massey. ac.nz> Message-ID: <5.2.0.9.0.20030515104626.0221b188@mail.grenna.net> > >I think the problem is probably that you have all of these . Use >the web interface to set full pathnames for these files. You are >interested in the "storage file name" and "message information file >name" options, towards the end of the configuration page. Now I downloaded the latest CVS and that previous problem went away. This time I don't just set hammie.db but C:\Python22\spambayes\hammie.db for that setting. >The problem is that these default to the current working directory, >which, obviously, changes. The one with the information will be the >42Mb one (which is pretty big; you must have trained an awful lot). Not really about 3000 ham and 400 spam. I'm now just trying to train more spam. > > Should I fetch the latest CVS? > >That depends ;). Without knowing what things were like when you got the >code you have now I can't answer. From what I can tell the latest CVS >is reasonably stable (touch wood). The latest CVS was quite different from the released gzip. Now I've got another problem though, but I've posted that in a separate email. 
Cheers, Peter From lists at olivermaunder.co.uk Thu May 15 15:30:45 2003 From: lists at olivermaunder.co.uk (Oliver Maunder) Date: Thu May 15 09:31:50 2003 Subject: [Spambayes] Outlook Express In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130197953F@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F130197953F@its-xchg4.massey.ac.nz> Message-ID: <3EC39685.1020503@olivermaunder.co.uk> Meyer, Tony wrote: >I would say that it's finally reached alpha. No guarantees that it will >always work, but it should for the most part. *As long as no other >program expunges/purges messages* your original mail is never lost, in >any case. > > Seems good to me. It's now been filtering my mail every 15 minutes for 12 days with no problems. What I've just realised is more amazing is that *Windows* has been running for 12 days without any problems ;-) Olly From Greg at springcreek.cc Thu May 15 09:21:06 2003 From: Greg at springcreek.cc (Greg Lozier) Date: Thu May 15 09:42:10 2003 Subject: [Spambayes] Installation error Message-ID: I installed spambayes today and while installing I received an error message that there is no disk in the drive. Please insert a disk into drive \device\harddisk1\dr3. I cancel the error message and it finished loading. It is operational in outlook 2000 now, but when I start outlook I get the above error message. After hitting cancel a few times it will go away and outlook will load. I am running windows xp professional. I am not running XEmacs. I don't have a Zip drive and drive D is a partition of my internal hard drive. Drive E is an internal slot to insert a memory stick. What is causing this error? Thanks, Greg From tim.one at comcast.net Thu May 15 11:05:28 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 15 10:06:58 2003 Subject: [Spambayes] Installation error In-Reply-To: Message-ID: [Greg Lozier] > I installed spambayes today and while installing I received an > error message that there is no disk in the drive. 
Please insert a > disk into drive \device\harddisk1\dr3. I cancel the error message > and it finished loading. > It is operational in outlook 2000 now, but when I start outlook I get the > above error message. After hitting cancel a few times it will go away > and outlook will load. > > I am running windows xp professional. I am not running XEmacs. > I don't have a Zip drive and drive D is a partition of my internal hard > drive. > Drive E is an internal slot to insert a memory stick. > > What is causing this error? Sorry, nobody knows. It would help if you opened a bug report, so that if/when this pops up again we can gather info about it in one place: http://sf.net/tracker/?group_id=61702&atid=498103 It would also help if you could convince Microsoft to share the Outlook source code with us . From noreply at sourceforge.net Thu May 15 08:43:05 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu May 15 10:43:11 2003 Subject: [Spambayes] [ spambayes-Bugs-717998 ] Can't reset Spam folder if folder is lost Message-ID: Bugs item #717998, was opened at 2003-04-09 00:37 Message generated for change (Comment added) made by astrogen You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702 Category: Outlook Group: None >Status: Open Resolution: None Priority: 5 Submitted By: Benjamin J. Judson (astrogen) Assigned to: Mark Hammond (mhammond) Summary: Can't reset Spam folder if folder is lost Initial Comment: If the Spam Manager is set up to move spam to a folder and that folder disappears, the Spam Manager may show that spam is to be delivered to . In this event trying to browse the folder list will not list any folders, and you will be unable to set the Spam folder to anything else. ---------------------------------------------------------------------- >Comment By: Benjamin J. 
Judson (astrogen) Date: 2003-05-15 09:43 Message: Logged In: YES user_id=752965 Usually It gets deleted while removing all the spam (trash does not need to be emptied, but sometimes it has been). -Astrogen ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-14 20:13 Message: Logged In: YES user_id=14198 This works for me now. The code was changed on 2003/03/07 to better handle these exceptions all over the code. I tried to reproduce your example, and do get "", but when clicking on the browse button I get a standard dialog with no folder selected. How does your spam folder keep disappearing? ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-05-01 11:19 Message: Logged In: YES user_id=752965 Is it possible to add a button that puts you back to the root folder if the number of folders available is less than 2 or something? Currently (and this has happened a couple times since my original posting) my only fix is to delete the configuration file (default_configuration.pck). -Astrogen ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-04-14 16:32 Message: Logged In: YES user_id=752965 Just figured there would be a log file somewhere.. but anywho... 
I manually ran manager.py Heres the traceback (copied and pasted from a dos prompt window) Traceback (most recent call last): File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 309, in OnInitDialog self.expand_ids = self._DetermineFoldersToExpand() File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 226, in _DetermineFoldersToExpand folder = self.manager.message_store.GetFolder(folder_id) File "C:\spambayes-1.0a2\Outlook2000\msgstore.py", line 225, in GetFolder table = folder.GetContentsTable(0) pywintypes.com_error: (-2147467259, 'Unspecified error', None, None) win32ui: OnInitDialog() virtual handler (>) raised an exception Bayes database is not dirty - not writing ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-04-14 16:26 Message: Logged In: YES user_id=752965 Where is the log file kept? I looked for it before submitting, and since your posted. I don't have the name or location for where I could find it. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-04-10 01:06 Message: Logged In: YES user_id=14198 If there a traceback associated with this? I regularly "test" this, thanks to Outlook screwing all my folder IDs as I reconfigure Outlook, and I don't have the problem. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702 From neale at woozle.org Thu May 15 11:17:33 2003 From: neale at woozle.org (Neale Pickett) Date: Thu May 15 13:17:39 2003 Subject: [Spambayes] Re: A doubt In-Reply-To: <1619.132.248.159.60.1052784952.squirrel@mail> (cynthia@super.unam.mx's message of "Mon, 12 May 2003 19:15:52 -0500 (CDT)") References: <1619.132.248.159.60.1052784952.squirrel@mail> Message-ID: cynthia@super.unam.mx writes: > Hi! 
>
> I am writing from Mexico, I installed spambayes correctly and when I want
> to create a database I receive this error message:
>
> bsddb._db.DBInvalidArgError: (22, 'Invalid argument -- architecture lacks
> fast mutexes: applications cannot be threaded')
>
> The machine is an Alpha DS-20 with Linux Red Hat 7.2
>
> Can you help me.
>
> Thanks a lot.

¡Hola, Cynthia!

I'm not sure about that one, so I'm forwarding your question on to the
spambayes mailing list. However, I *think* you can add the following to
your ~/.spambayesrc file:

[globals]
dbm_type: gdbm

Let me know if this works, and please Cc: the list so your report can be
archived.

¡Gracias para utilizar SpamBayes!

Neale

From noreply at sourceforge.net Thu May 15 15:27:59 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu May 15 17:28:03 2003
Subject: [Spambayes] [ spambayes-Bugs-738501 ] AttributeError second time around with pop3proxy
Message-ID:

Bugs item #738501, was opened at 2003-05-15 21:27
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=738501&group_id=61702

Category: pop3proxy
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Peter Bengtsson (peterbe)
Assigned to: Nobody/Anonymous (nobody)
Summary: AttributeError second time around with pop3proxy

Initial Comment:
Yesterday (14/05) I downloaded the latest CVS version and starting it;
worked fine. Then I turned that off when I went to bed and tried to
start it again today. This time I get the following error::

C:\Python22\spambayes>C:\python22\python.exe pop3proxy.py
Loading database...
Traceback (most recent call last):
  File "pop3proxy.py", line 738, in ?
    run()
  File "pop3proxy.py", line 713, in run
    state.createWorkers()
  File "pop3proxy.py", line 559, in createWorkers
    self.bayes = storage.DBDictClassifier(filename)
  File "spambayes\storage.py", line 139, in __init__
    self.load()
  File "spambayes\storage.py", line 151, in load
    t = self.db[self.statekey]
  File "C:\Python22\lib\shelve.py", line 71, in __getitem__
    return Unpickler(f).load()
EOFError
Exception exceptions.AttributeError: "'NoneType' object has no attribute 'error' " in > ignored

Any ideas what this might be? Is it a bug? Solutions?

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=738501&group_id=61702

From noreply at sourceforge.net Thu May 15 15:56:19 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu May 15 17:56:23 2003
Subject: [Spambayes] [ spambayes-Bugs-717998 ] Can't reset Spam folder if folder is lost
Message-ID:

Bugs item #717998, was opened at 2003-04-09 15:37
Message generated for change (Comment added) made by mhammond
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Benjamin J. Judson (astrogen)
Assigned to: Mark Hammond (mhammond)
Summary: Can't reset Spam folder if folder is lost

Initial Comment:
If the Spam Manager is set up to move spam to a folder and that folder
disappears, the Spam Manager may show that spam is to be delivered to .
In this event trying to browse the folder list will not list any
folders, and you will be unable to set the Spam folder to anything else.

----------------------------------------------------------------------

>Comment By: Mark Hammond (mhammond)
Date: 2003-05-16 07:56

Message:
Logged In: YES
user_id=14198

Can you please confirm if you can still reproduce this?
---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-05-16 00:43 Message: Logged In: YES user_id=752965 Usually It gets deleted while removing all the spam (trash does not need to be emptied, but sometimes it has been). -Astrogen ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-15 11:13 Message: Logged In: YES user_id=14198 This works for me now. The code was changed on 2003/03/07 to better handle these exceptions all over the code. I tried to reproduce your example, and do get "", but when clicking on the browse button I get a standard dialog with no folder selected. How does your spam folder keep disappearing? ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-05-02 02:19 Message: Logged In: YES user_id=752965 Is it possible to add a button that puts you back to the root folder if the number of folders available is less than 2 or something? Currently (and this has happened a couple times since my original posting) my only fix is to delete the configuration file (default_configuration.pck). -Astrogen ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-04-15 07:32 Message: Logged In: YES user_id=752965 Just figured there would be a log file somewhere.. but anywho... 
I manually ran manager.py Heres the traceback (copied and pasted from a dos prompt window) Traceback (most recent call last): File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 309, in OnInitDialog self.expand_ids = self._DetermineFoldersToExpand() File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 226, in _DetermineFoldersToExpand folder = self.manager.message_store.GetFolder(folder_id) File "C:\spambayes-1.0a2\Outlook2000\msgstore.py", line 225, in GetFolder table = folder.GetContentsTable(0) pywintypes.com_error: (-2147467259, 'Unspecified error', None, None) win32ui: OnInitDialog() virtual handler (>) raised an exception Bayes database is not dirty - not writing ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-04-15 07:26 Message: Logged In: YES user_id=752965 Where is the log file kept? I looked for it before submitting, and since your posted. I don't have the name or location for where I could find it. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-04-10 16:06 Message: Logged In: YES user_id=14198 If there a traceback associated with this? I regularly "test" this, thanks to Outlook screwing all my folder IDs as I reconfigure Outlook, and I don't have the problem. 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702 From noreply at sourceforge.net Thu May 15 17:05:41 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu May 15 19:05:52 2003 Subject: [Spambayes] [ spambayes-Bugs-709051 ] Config file loading and saving is fragile Message-ID: Bugs item #709051, was opened at 2003-03-25 09:19 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=709051&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Config file loading and saving is fragile Initial Comment: There was a report of this error using the second binary release: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 152, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 355, in LoadConfig exceptions.EOFError: While there is another problem that caused this error, we should not die completely loading the config pickle should it get screwed up. 
However, as this means spambayes will be unconfigured, we do need a scheme to let the user know this (as we do in the few other places where we disable spambayes due to config errors) ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-16 09:05 Message: Logged In: YES user_id=14198 Fixed in manager.py revision 1.56, as part of fixing 715248 (which was the same bug, but for the database pickle) ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-04-04 22:05 Message: Logged In: YES user_id=14198 I'm afraid you are wrong about the config file being the same as the word database. You are however correct about the saving. As we have the 2 pickles, I will track the Outlook config pickle in this bug, and opened: https://sourceforge.net/tracker/index.php?func=detail&aid=715248&group_id=61702&atid=498103 to track the word database bug. ---------------------------------------------------------------------- Comment By: Simone Piunno (pioppo) Date: 2003-04-04 20:28 Message: Logged In: YES user_id=227443 I disagree, the "configuration pickle" and the "word database" are the very same file. Moreover, without this path you seriously risk to completely loose your word database, in case execution stops beween open() (which truncates the file to zero length) and pickle.dump(). Execution could stop for whatever reason, from CTRL+C to system crash, so it's of vital importance that the file update is atomic, which should be guaranteed by rename(). I think this is a bug for sure, even if you don't plan to add support to concurrency. Of course this is not enough when you add concurrency, because you could loose some training information if 2 separate instances try to update the word database at the same time (they will both read the old file, then they will both create the temp file, then the second rename() will overwrite the result of the first one). 
To solve this, you should add some locking mechanism (in addition to
atomic rename()), which could be out of your scope, I understand, but I
think this would be a very useful enhancement on spambayes usability.
If you need some code example, you can look at Mailman's handling of
the MailList object persistency.

----------------------------------------------------------------------

Comment By: Mark Hammond (mhammond)
Date: 2003-04-04 09:19

Message:
Logged In: YES
user_id=14198

Check the traceback is the same as yours - this error is loading the
configuration pickle, not the word database. Thus, locking shouldn't be
the issue, as I can't see how two threads or processes could write this
file at once (Outlook appears to have its own lock for startup; I've
never seen spambayes running twice in different processes.)

So, your patch wont help this exception. However, if you are getting a
slightly different EOFError, you patch may apply.

----------------------------------------------------------------------

Comment By: Simone Piunno (pioppo)
Date: 2003-04-04 08:01

Message:
Logged In: YES
user_id=227443

Maybe this patch would be a little (but insufficient) improvement?
I'd upload it as a separate file, but there's no "upload" button....

--- spambayes/storage.py.orig 2003-04-03 23:35:47.000000000 +0200
+++ spambayes/storage.py 2003-04-03 23:43:16.000000000 +0200
@@ -59,6 +59,7 @@
 import cPickle as pickle
 import errno
 import shelve
+import os
 from spambayes import dbmstorage

 # Make shelve use binary pickles by default.
@@ -121,9 +122,10 @@
         if options.verbose:
             print 'Persisting',self.db_name,'as a pickle'

-        fp = open(self.db_name, 'wb')
+        fp = open(self.db_name+'.tmp', 'wb')
         pickle.dump(self, fp, PICKLE_TYPE)
         fp.close()
+        os.rename(self.db_name+'.tmp', self.db_name)

 class DBDictClassifier(classifier.Classifier):

----------------------------------------------------------------------

Comment By: Simone Piunno (pioppo)
Date: 2003-04-04 05:38

Message:
Logged In: YES
user_id=227443

I have another case, but without apparent cause:

Traceback (most recent call last):
  File "/home/mailman21/Mailman/Queue/Runner.py", line 105, in _oneloop
    self._onefile(msg, msgdata)
  File "/home/mailman21/Mailman/Queue/Runner.py", line 155, in _onefile
    keepqueued = self._dispose(mlist, msg, msgdata)
  File "/home/mailman21/Mailman/Queue/OutgoingRunner.py", line 69, in _dispose
    mlist.Load()
  File "/home/mailman21/Mailman/MailList.py", line 626, in Load
    self._spamdb = hammie.open(path, 0)
  File "/home/mailman21/pythonlib/spambayes/hammie.py", line 262, in open
    b = storage.PickledClassifier(filename)
  File "/home/mailman21/pythonlib/spambayes/storage.py", line 80, in __init__
    self.load()
  File "/home/mailman21/pythonlib/spambayes/storage.py", line 98, in load
    tempbayes = pickle.load(fp)
EOFError

it happens quite often but not always, I believe it is a concurrency
issue (e.g. lack of locking).

----------------------------------------------------------------------

Comment By: Mark Hammond (mhammond)
Date: 2003-03-25 09:56

Message:
Logged In: YES
user_id=14198

The reporter just let me know that the problem was caused by about 20
power failures over short period. So I don't think we can cure the
cause here, just the symptoms.
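
The two ideas raised in this tracker item (write the pickle to a
temporary file and rename it into place, and do not die when a damaged
pickle raises EOFError on load) could be sketched in present-day Python
roughly as below. This is only an illustration under stated
assumptions, not the code that went into manager.py or storage.py: the
names save_atomically and load_or_default are hypothetical, and
os.replace stands in for the patch's os.rename because os.rename
refuses to overwrite an existing file on Windows. It does nothing about
the lost-update race between two concurrent writers described above;
that would still need a lock.

```python
import os
import pickle
import tempfile


def save_atomically(obj, path):
    """Pickle obj to a temp file beside path, then swap it into place."""
    directory = os.path.dirname(os.path.abspath(path))
    # Same directory as the target, so the final swap stays on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "wb") as fp:
            pickle.dump(obj, fp, pickle.HIGHEST_PROTOCOL)
            fp.flush()
            os.fsync(fp.fileno())  # push the bytes to disk before the swap
        # os.replace overwrites an existing target on POSIX and on Windows.
        os.replace(tmp_path, path)
    except BaseException:
        # The old file is untouched; just remove the partial temp file.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise


def load_or_default(path, default_factory):
    """Unpickle path, or build a fresh default if it is missing or damaged."""
    try:
        with open(path, "rb") as fp:
            return pickle.load(fp)
    except (FileNotFoundError, EOFError, pickle.UnpicklingError):
        # Hypothetical policy: start unconfigured. A real caller would
        # also want to tell the user that this happened.
        return default_factory()
```

A caller would use save_atomically(state, "default_configuration.pck")
when persisting and load_or_default("default_configuration.pck", dict)
at startup; a zero-length file left by a power failure then yields an
empty configuration rather than a traceback.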
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=709051&group_id=61702 From sidnei at x3ng.com Thu May 15 19:21:39 2003 From: sidnei at x3ng.com (Sidnei da Silva) Date: Thu May 15 19:13:46 2003 Subject: [Spambayes] Small patch to spambayes Message-ID: <20030515212139.GA19074@x3ng.com> Howdy, I just checked out spambayes from CVS, and had to make two small modifications to get it to read my maildir for training. Attached theres a patch. Thanks! -- Sidnei da Silva (dreamcatcher) X3ng Web Technology GNU/Linux user 257852 Debian GNU/Linux 3.0 (Sid) 2.4.18 ppc Please go away. -------------- next part -------------- Index: mboxtrain.py =================================================================== RCS file: /cvsroot/spambayes/spambayes/mboxtrain.py,v retrieving revision 1.7 diff -u -r1.7 mboxtrain.py --- mboxtrain.py 13 Apr 2003 22:24:24 -0000 1.7 +++ mboxtrain.py 15 May 2003 21:19:18 -0000 @@ -32,7 +32,7 @@ need to rebuild your database from scratch. 
-q quiet mode; no output - + -n train mail residing in "new" directory, in addition to "cur" directory, which is always trained """ @@ -96,6 +96,9 @@ trained = 0 for fn in os.listdir(path): + # Skip dirs + if os.path.isdir(fn): + continue counter += 1 cfn = os.path.join(path, fn) tfn = os.path.join(path, "tmp", Index: spambayes/classifier.py =================================================================== RCS file: /cvsroot/spambayes/spambayes/spambayes/classifier.py,v retrieving revision 1.4 diff -u -r1.4 classifier.py --- spambayes/classifier.py 8 Feb 2003 17:18:37 -0000 1.4 +++ spambayes/classifier.py 15 May 2003 21:19:19 -0000 @@ -389,6 +389,8 @@ self.nham = int(self.nham) + 1 # account for string nham for word in Set(wordstream): + if type(word) is type(u''): + word = word.encode('latin1') record = self._wordinfoget(word) if record is None: record = self.WordInfoClass() From noreply at sourceforge.net Thu May 15 17:18:40 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu May 15 19:18:48 2003 Subject: [Spambayes] [ spambayes-Bugs-706170 ] Execute test suite fails in Outlook Message-ID: Bugs item #706170, was opened at 2003-03-19 21:03 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706170&group_id=61702 Category: Outlook Group: None >Status: Closed Resolution: Fixed Priority: 5 Submitted By: Fredrik Rodland (fmmr) Assigned to: Mark Hammond (mhammond) Summary: Execute test suite fails in Outlook Initial Comment: The test suite fails in outlook. I've retrained messages from a spam and a ham folder. I think this may be related to moving the database-files from the spambayes to the default docs-folders in windows a couple of weeks ago. the following traceback is shown in PythonWin: Executing automated tests... 
Traceback (most recent call last): File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\addin.py", line 308, in Tester tester.test(manager) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 306, in test TestSpamFilter(driver) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 173, in TestSpamFilter msg, words = driver.CreateTestMessageInFolder (SPAM, driver.folder_watch) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 132, in CreateTestMessageInFolder msg, words = self.CreateTestMessage(spam_status) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 145, in CreateTestMessage words.update(FindTopWords(self.manager.bayes, 50, True)) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 64, in FindTopWords for word, info in extractor(bayes): File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\tester.py", line 46, in DBExtractor key = bayes.dbm.next()[0] File "C:\PROGRA~1\_DEV\Python22\Lib\site- packages\bsddb3\__init__.py", line 122, in next rv = self.dbc.next() DBNotFoundError: (-30991, 'DB_NOTFOUND: No matching key/data pair found') Tests FAILED. Sorry about that. If I were you, I would do a full re-train ASAP Please delete any test messages from your Spam, Unsure or Inbox folders first. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-16 09:18 Message: Logged In: YES user_id=14198 tester.py 1.8 has a (tested!) fix for this. ---------------------------------------------------------------------- Comment By: Fredrik Rodland (fmmr) Date: 2003-05-14 21:34 Message: Logged In: YES user_id=724871 I tried this again today with the latest CVS (may 14th). the error is not the same, but it still does not work. got the following traceback: Executing automated tests... 
Bayes database is not dirty - not writing Traceback (most recent call last): File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlo ok2000\addin.py", line 314, in Tester tester.test(manager) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlo ok2000\tester.py", line 316, in test TestSpamFilter(driver) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlo ok2000\tester.py", line 182, in TestSpamFilter msg, words = driver.CreateTestMessageInFolder(SPAM, driver.folder_watch) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlo ok2000\tester.py", line 141, in CreateTestMessageInFolder msg, words = self.CreateTestMessage(spam_status) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlo ok2000\tester.py", line 154, in CreateTestMessage words.update(FindTopWords(self.manager.bayes, 50, True)) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlo ok2000\tester.py", line 72, in FindTopWords for word, info in extractor(bayes): File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\Outlo ok2000\tester.py", line 43, in DBExtractor bsddb_error = bsddb.DBNotFoundError AttributeError: 'module' object has no attribute 'DBNotFoundError' Tests FAILED. Sorry about that. If I were you, I would do a full re-train ASAP Please delete any test messages from your Spam, Unsure or Inbox folders first. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-14 20:52 Message: Logged In: YES user_id=14198 I'm pretty sure this was fixed in tester.py: revision 1.7 date: 2003/05/03 13:15:27; author: mhammond; state: Exp; lines: +9 -1 Allow test suite to work with bsddb3 or bsddb Please re-open otherwise. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-03-20 11:31 Message: Logged In: YES user_id=14198 This seems a bsddb3 problem. The code in question: try: key = bayes.dbm.next()[0] except bsddb.error: already attempts to catch this error. 
Further, the docs for DBNotFoundError state that it derives from bsddb.error, meaning my except statement should work. I will try and get to using my Python 2.2 version for the plugin to fix this. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706170&group_id=61702 From jonb at bh-co.com Thu May 15 20:13:17 2003 From: jonb at bh-co.com (Jon K. Bryant) Date: Thu May 15 20:29:20 2003 Subject: [Spambayes] Strange message Message-ID: I am receiving the following message when I installed Spambayes. I am receiving the same message when I start Outlook 2000 after installing Spambayes. The operating System I am using is Windows XP Professional. Can you tell me how to resolve this issue? I have uninstalled and re-installed but keep getting this message after installing Spambayes. Thank you, Jon K. Bryant Beaird Harris & Co., P.C. 12377 Merit Drive, Suite 220 Dallas, Texas 75251 972-503-1040 jonb@bh-co.com -------------- next part -------------- A non-text attachment was scrubbed... Name: Microsoft Word Picture Type: application/x-microsoft-ole-object Size: 281600 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030515/f43e0d1b/MicrosoftWordPicture-0001.bin From tim.one at comcast.net Thu May 15 21:37:02 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 15 20:37:38 2003 Subject: [Spambayes] Strange message In-Reply-To: Message-ID: [Jon K. Bryant] > I am receiving the following message when I installed Spambayes. I am > receiving the same message when I start Outlook 2000 after installing > Spambayes. The operating System I am using is Windows XP > Professional. Can you tell me how to resolve this issue? I have > uninstalled and re-installed but keep getting this message after > installing Spambayes. What's the message? If you sent it in an attachment, it didn't make it. 
Please just tell us what it said instead, then.

From jonb at bh-co.com Thu May 15 20:45:44 2003
From: jonb at bh-co.com (Jon K. Bryant)
Date: Thu May 15 20:49:32 2003
Subject: [Spambayes] Strange message
In-Reply-To:
Message-ID:

Tim,

Here is the text in the Window:

Title Bar: OUTLOOK.EXE - No Disk

There is no disk in the drive. Please insert a disk into drive
\Device\Harddisk1\DR3.

Buttons: Cancel Try Again Continue

-----Original Message-----
From: Tim Peters [mailto:tim.one@comcast.net]
Sent: Thursday, May 15, 2003 7:37 PM
To: Jon K. Bryant
Cc: spambayes@python.org
Subject: RE: [Spambayes] Strange message

[Jon K. Bryant]
> I am receiving the following message when I installed Spambayes. I am
> receiving the same message when I start Outlook 2000 after installing
> Spambayes. The operating System I am using is Windows XP
> Professional. Can you tell me how to resolve this issue? I have
> uninstalled and re-installed but keep getting this message after
> installing Spambayes.

What's the message? If you sent it in an attachment, it didn't make it.
Please just tell us what it said instead, then.

From tim.one at comcast.net Thu May 15 21:54:42 2003
From: tim.one at comcast.net (Tim Peters)
Date: Thu May 15 20:56:08 2003
Subject: [Spambayes] Strange message
In-Reply-To:
Message-ID:

[Jon K. Bryant]
> Here is the text in the Window:
>
> Title Bar: OUTLOOK.EXE - No Disk
>
> There is no disk in the drive. Please insert a disk into drive
> \Device\Harddisk1\DR3.
>
> Buttons: Cancel Try Again Continue

Wow! We've had a few reports of that recently, but just a few. So far
nobody has any idea what causes it, and the people who developed the
software have never seen it.

Mark (Hammond), do you happen to have a removable storage device on the
machine you used to build the installer? (And, if so, were any piece of
the deliverable obtained from that device?)

I did a google search the first time this was reported, and found
reports of several installers (for various things) that displayed this
msg for a small subset of users. I didn't find an explanation or
resolution to any of them. The good news is that I didn't find any
reports of damage either; it seemed to be just an annoyance.

From skip at pobox.com Thu May 15 21:23:54 2003
From: skip at pobox.com (Skip Montanaro)
Date: Thu May 15 21:23:58 2003
Subject: [Spambayes] Another small indication of how well Spambayes works
Message-ID: <16068.15786.942949.520767@montanaro.dyndns.org>

I got an email today from another person on the csv module mailing list
with this suggestion:

    There seems to have been spam slipping through in the last few
    weeks - maybe we should turn on the "moderate non-subscribers"
    feature now that things have settled down?

to which I replied:

    Done. To tell you the truth I hadn't even noticed. I get so much
    spam and my spam filter (I use spambayes, naturally) sweeps almost
    all of it off to a couple special mailboxes for later review.

Just call me another success story. ;-)

Skip

From readonly at getsoft.com Fri May 16 13:55:21 2003
From: readonly at getsoft.com (Neville Franks)
Date: Thu May 15 22:55:23 2003
Subject: [Spambayes] Strange message
In-Reply-To:
References:
Message-ID: <169108692621.20030516125521@getsoft.com>

TP> [Jon K. Bryant]
>> Here is the text in the Window:
>>
>> Title Bar: OUTLOOK.EXE - No Disk
>>
>> There is no disk in the drive. Please insert a disk into drive
>> \Device\Harddisk1\DR3.
>>
>> Buttons: Cancel Try Again Continue

TP> Wow! We've had a few reports of that recently, but just a few. So far
TP> nobody has any idea what causes it, and the people who developed the
TP> software have never seen it.

TP> Mark (Hammond), do you happen to have a removable storage device on the
TP> machine you used to build the installer? (And, if so, were any piece of the
TP> deliverable obtained from that device?)
TP> I did a google search the first time this was reported, and found reports of TP> several installers (for various things) that displayed this msg for a small TP> subset of users. I didn't find an explanation or resolution to any of them. TP> The good news is that I didn't find any reports of damage either; it seemed TP> to be just an annoyance. I have a few of my ED users that have this error and so far I haven't been able to resolve it. I think you need a machine configured with exactly the same drives before you will see it. Hard drives are usually named like: \Device\Harddiskn\Partitionn+1 (Partition0 being full disk access?) It is interesting that of the recent reports one was DR1 and the latest DR3. The comment on Installers is interesting and not something I've thought to look into as I've always assumed it was in ED as that is where the error occurs, not during installation. A utility that may help someone track this down is FileMon available from the SysInternals web site: http://www.sysinternals.com/ntw2k/utilities.shtml -- Best Regards, Neville Franks, Author of ED for Windows - the programmer's editor. Soft As It Gets Pty Ltd, http://www.getsoft.com - Download your Free Trial now. 
Victoria, Australia mailto:readonly@getsoft.com From noreply at sourceforge.net Thu May 15 21:51:46 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu May 15 23:57:19 2003 Subject: [Spambayes] [ spambayes-Bugs-717253 ] Database should be saved after training Message-ID: Bugs item #717253, was opened at 2003-04-08 12:28 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717253&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Database should be saved after training Initial Comment: The database should be saved after training operations, otherwise an Outlook crash upset things. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-16 13:51 Message: Logged In: YES user_id=14198 Fixed in: Checking in addin.py; new revision: 1.57; previous revision: 1.56 Checking in filter.py; new revision: 1.21; previous revision: 1.20 Checking in manager.py; new revision: 1.58; previous revision: 1.57 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717253&group_id=61702 From mhammond at skippinet.com.au Fri May 16 15:03:05 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri May 16 00:04:51 2003 Subject: [Spambayes] Strange message In-Reply-To: Message-ID: <000401c31b60$0da9fb80$530f8490@eden> > Wow! We've had a few reports of that recently, but just a > few. So far > nobody has any idea what causes it, and the people who developed the > software have never seen it. > > Mark (Hammond), do you happen to have a removable storage > device on the > machine you used to build the installer? 
(And, if so, were > any piece of the > deliverable obtained from that device?) Nope - but my main drive is E: - and the .pyc/o files in the installer have a full reference to that path. Python is probably looking for the .py associated with that file. This is most noticeable in tracebacks from the binary - all have the full path from my machine ("E:\src\spambayes", "E:\src\python-cvs", etc.) And given "\Device\Harddisk1\DR3", my money is on that! I'm not sure what the options are here - but at least we do have the source to Python. Jon - can you confirm E: is your removable device? Mark. From mhammond at skippinet.com.au Fri May 16 15:04:21 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri May 16 00:05:20 2003 Subject: [Spambayes] Small patch to spambayes In-Reply-To: <20030515212139.GA19074@x3ng.com> Message-ID: <000501c31b60$3be04630$530f8490@eden> Could you please file a bug (or possibly a couple) and upload the patch to SourceForge? That is the only way it is guaranteed to not get lost forever! Thanks, Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Sidnei da Silva > Sent: Friday, 16 May 2003 7:22 AM > To: spambayes@python.org > Subject: [Spambayes] Small patch to spambayes > > > Howdy, > > I just checked out spambayes from CVS, and had to make two small > modifications to get it to read my maildir for training. > > Attached there's a patch. Thanks! > > -- > Sidnei da Silva (dreamcatcher) > X3ng Web Technology > GNU/Linux user 257852 > Debian GNU/Linux 3.0 (Sid) 2.4.18 ppc > > Please go away.
> From noreply at sourceforge.net Fri May 16 01:27:50 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 16 03:28:04 2003 Subject: [Spambayes] [ spambayes-Bugs-712480 ] Outlook 2002 (XP) installation fails Message-ID: Bugs item #712480, was opened at 2003-03-31 07:47 Message generated for change (Comment added) made by thystad You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul Marrero (pmarrero) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2002 (XP) installation fails Initial Comment: I use Office XP with the Outlook client. It appears that the registration was successful but I cannot find any menu buttons. XP clipboard does appear to have the Icons. The command line train works. Not sure where to go from here. ---------------------------------------------------------------------- Comment By: Tormod Hystad (thystad) Date: 2003-05-16 09:27 Message: Logged In: YES user_id=780089 Exactly the same as Nick here. Win2003 RC2, Outlook XP (Norwegian, version 10.2627.4219 SP2, configured with one Exchange (Standard) email account) and the 002 install package.
Thanks, Tormod ---------------------------------------------------------------------- Comment By: Nick Trautmann (nicktrautmann) Date: 2003-05-12 10:15 Message: Logged In: YES user_id=776857 Similar here: pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? 
exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\Nick\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize Thanks Nick ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-01 04:48 Message: Logged In: YES user_id=552329 Actually, I get this too. I've just switched to Outlook XP, so I'm not sure if this is the reason, or just that I'm doing a fresh install. 
The log includes the following traces: SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - 
creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-03-31 08:05 Message: Logged In: YES user_id=552329 Which version of the Outlook plugin are you using? (a) the latest CVS, (b) the 001 stand-alone installer, or (c) the 002 stand-alone installer? I know that the 001 installer has been known to have this problem (although it appeared to be fixed in 002). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 From akiva at atwood.co.il Fri May 16 11:38:30 2003 From: akiva at atwood.co.il (Akiva Atwood) Date: Fri May 16 10:03:18 2003 Subject: [Spambayes] RE: Strange message In-Reply-To: Message-ID: > > Jon - can you confirm E: is your removable device? I've been getting the same message -- except it says "drive E:", which is my DVD drive. Running XP Pro -- outlook 2K. 
Akiva ================================= "We took risks, we knew we took them; things have come out against us, and therefore we have no cause for complaint, but bow to the will of Providence, determined still to do our best to the last." Robert Falcon Scott From tim.one at comcast.net Fri May 16 14:13:42 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri May 16 13:14:15 2003 Subject: [Spambayes] Strange message In-Reply-To: <000401c31b60$0da9fb80$530f8490@eden> Message-ID: [Mark Hammond] > Nope - but my main drive is E: - and the .pyc/o files in the > installer have a full reference to that path. Python is probably > looking for the .py associated with that file. This is most > noticeable in tracebacks from the binary - all have the full path > from my machine ("E:\src\spambayes, E:\src\python-cvs, etc) A photon trickles forth from the darkness! A .pyc file holds the path to the file it was compiled from, and when opening a .pyc Python looks for the original .py file to see whether the .pyc file is out of date. Python doesn't object if the .py file can't be found (it has the .pyc -- it's not stuck), but this "no disk" box could be an unwanted Windows-specific side-effect of merely doing a stat() call on the .py file. > And given "\Device\Harddisk1\DR3", my money is on that! I'm not sure > what the options are here - but at least we do have the source to Python > If the hypothesized scenario is correct, I'd call it a Python buglet in Windows. A solution before changing Python would be not to ship .pyc files (let Python generate them as needed on the client machine, and then they'll have correct paths -- sometimes). The installers I ship for Python and ZODB don't include any .pyc, .pyw, or .pyo files, so it may not be a coincidence that I've never heard of this kind of glitch before. > Jon - can you confirm E: is your removable device? I'll let Jon answer that one.
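[Editorial sketch, not part of the original thread.] The claim that a .pyc file holds the path of the file it was compiled from can be checked directly. The sketch below assumes a modern CPython (3.7 or later, where the .pyc header is 16 bytes; the 2003-era Python discussed here used a shorter header). The helper names `compile_with_recorded_path` and `embedded_source_path` are hypothetical, and the E:\ path is only an illustration borrowed from the tracebacks quoted above.

```python
import marshal
import os
import py_compile
import tempfile


def compile_with_recorded_path(recorded_path):
    """Compile a tiny module, recording `recorded_path` as its source path."""
    tmpdir = tempfile.mkdtemp()
    src = os.path.join(tmpdir, "module.py")
    with open(src, "w") as f:
        f.write("x = 1\n")
    # dfile is the filename stored in the compiled code object.
    pyc = py_compile.compile(src, cfile=src + "c", dfile=recorded_path,
                             doraise=True)
    os.remove(src)  # the .pyc now stands alone, as in a binary installer
    return pyc


def embedded_source_path(pyc_path):
    """Return the source filename stored inside a compiled .pyc file."""
    with open(pyc_path, "rb") as f:
        f.read(16)  # skip the header (assumption: CPython 3.7+ layout)
        code = marshal.load(f)
    return code.co_filename


if __name__ == "__main__":
    pyc = compile_with_recorded_path(r"E:\src\spambayes\module.py")
    print(embedded_source_path(pyc))
```

The recorded path travels with the .pyc to whatever machine it is installed on, which is why tracebacks from the binary show the build machine's E:\src\... paths. Whether looking up that path is what triggers the "No Disk" dialog is still only the hypothesis stated above.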
From tim.one at comcast.net Fri May 16 14:43:43 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri May 16 13:44:43 2003 Subject: [Spambayes] Strange message In-Reply-To: Message-ID: > -----Original Message----- > From: Jon K. Bryant [mailto:jonb@bh-co.com] > Sent: Friday, May 16, 2003 1:30 PM > To: 'Tim Peters' > Subject: RE: [Spambayes] Strange message > > > Tim, > > E: is my removable device. I do not get the message as long as I > put media > in the removable device (A Zip Drive) when I launch Outlook 2000. I tried > this after our email exchange this a.m. > > Thank you for your insight with regards to the message. > > Jon From noreply at sourceforge.net Fri May 16 11:47:25 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 16 13:47:41 2003 Subject: [Spambayes] [ spambayes-Bugs-738884 ] Microsoft Outlook Patch Q329295 Disables SpamBayes? Message-ID: Bugs item #738884, was opened at 2003-05-16 13:47 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=738884&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Chuck Conway (conwaycl) Assigned to: Mark Hammond (mhammond) Summary: Microsoft Outlook Patch Q329295 Disables SpamBayes? Initial Comment: I use a Nextel phone feature called Mobile Email to manage my Outlook Calendar via my cell phone. (http://www.nextel.com). Nextel upgraded their software, and forced me to upgrade their desktop synchronization app from version 3.2 to 4.1. As part of the upgrade, the Nextel 4.1 software required me to install an Outlook patch from Microsoft. 
Here are links to the article and the actual binary patch download: http://support.microsoft.com/default.aspx?scid=kb;en-us;Q329295 http://download.microsoft.com/download/OfficeXPStandard/Patch/329295/W98NT42KMeXP/EN-US/q329295.exe After the patch was installed and the Nextel 4.1 installed, SpamBayes was broken: - No filtering - No toolbar icons - Log file indicates nothing - Add-In still visible under "COM Add-Ins" I de-installed and re-installed the SpamBayes binary, but it did not help. I'm not sure what direction to go in at this point. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=738884&group_id=61702 From benles at bldigital.com Fri May 16 12:29:05 2003 From: benles at bldigital.com (Ben Allfree) Date: Fri May 16 14:44:45 2003 Subject: [Spambayes] spam training file for outlook Message-ID: <009201c31bd9$0ac39970$917ba8c0@longshot> I've kept a 1500+ message spam training file. Pretty general stuff, I think most everyone would agree it's spam. It might be useful to users of the Outlook plugin. http://www.allfreegroup.com/Spam.pst.zip From noreply at sourceforge.net Fri May 16 13:28:50 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 16 15:28:57 2003 Subject: [Spambayes] [ spambayes-Bugs-717998 ] Can't reset Spam folder if folder is lost Message-ID: Bugs item #717998, was opened at 2003-04-09 00:37 Message generated for change (Comment added) made by astrogen You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Benjamin J. Judson (astrogen) Assigned to: Mark Hammond (mhammond) Summary: Can't reset Spam folder if folder is lost Initial Comment: If the Spam Manager is set up to move spam to a folder and that folder disappears, the Spam Manager may show that spam is to be delivered to .
In this event trying to browse the folder list will not list any folders, and you will be unable to set the Spam folder to anything else. ---------------------------------------------------------------------- >Comment By: Benjamin J. Judson (astrogen) Date: 2003-05-16 14:28 Message: Logged In: YES user_id=752965 Just downloaded the latest CVS tree it seems to be fixed. Thanks much. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-15 16:56 Message: Logged In: YES user_id=14198 Can you please confirm if you can still reproduce this? ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-05-15 09:43 Message: Logged In: YES user_id=752965 Usually It gets deleted while removing all the spam (trash does not need to be emptied, but sometimes it has been). -Astrogen ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-14 20:13 Message: Logged In: YES user_id=14198 This works for me now. The code was changed on 2003/03/07 to better handle these exceptions all over the code. I tried to reproduce your example, and do get "", but when clicking on the browse button I get a standard dialog with no folder selected. How does your spam folder keep disappearing? ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-05-01 11:19 Message: Logged In: YES user_id=752965 Is it possible to add a button that puts you back to the root folder if the number of folders available is less than 2 or something? Currently (and this has happened a couple times since my original posting) my only fix is to delete the configuration file (default_configuration.pck). -Astrogen ---------------------------------------------------------------------- Comment By: Benjamin J. 
Judson (astrogen) Date: 2003-04-14 16:32 Message: Logged In: YES user_id=752965 Just figured there would be a log file somewhere.. but anywho... I manually ran manager.py Heres the traceback (copied and pasted from a dos prompt window) Traceback (most recent call last): File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 309, in OnInitDialog self.expand_ids = self._DetermineFoldersToExpand() File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 226, in _DetermineFoldersToExpand folder = self.manager.message_store.GetFolder(folder_id) File "C:\spambayes-1.0a2\Outlook2000\msgstore.py", line 225, in GetFolder table = folder.GetContentsTable(0) pywintypes.com_error: (-2147467259, 'Unspecified error', None, None) win32ui: OnInitDialog() virtual handler (>) raised an exception Bayes database is not dirty - not writing ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-04-14 16:26 Message: Logged In: YES user_id=752965 Where is the log file kept? I looked for it before submitting, and since your posted. I don't have the name or location for where I could find it. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-04-10 01:06 Message: Logged In: YES user_id=14198 If there a traceback associated with this? I regularly "test" this, thanks to Outlook screwing all my folder IDs as I reconfigure Outlook, and I don't have the problem. 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702 From noreply at sourceforge.net Fri May 16 18:03:30 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 16 20:03:33 2003 Subject: [Spambayes] [ spambayes-Bugs-717998 ] Can't reset Spam folder if folder is lost Message-ID: Bugs item #717998, was opened at 2003-04-09 15:37 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Benjamin J. Judson (astrogen) Assigned to: Mark Hammond (mhammond) Summary: Can't reset Spam folder if folder is lost Initial Comment: If the Spam Manager is set up to move spam to a folder and that folder disappears, the Spam Manager may show that spam is to be delivered to . In this event trying to browse the folder list will not list any folders, and you will be unable to set the Spam folder to anything else. ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-05-17 05:28 Message: Logged In: YES user_id=752965 Just downloaded the latest CVS tree it seems to be fixed. Thanks much. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-16 07:56 Message: Logged In: YES user_id=14198 Can you please confirm if you can still reproduce this? ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-05-16 00:43 Message: Logged In: YES user_id=752965 Usually It gets deleted while removing all the spam (trash does not need to be emptied, but sometimes it has been). 
-Astrogen ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-15 11:13 Message: Logged In: YES user_id=14198 This works for me now. The code was changed on 2003/03/07 to better handle these exceptions all over the code. I tried to reproduce your example, and do get "", but when clicking on the browse button I get a standard dialog with no folder selected. How does your spam folder keep disappearing? ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-05-02 02:19 Message: Logged In: YES user_id=752965 Is it possible to add a button that puts you back to the root folder if the number of folders available is less than 2 or something? Currently (and this has happened a couple times since my original posting) my only fix is to delete the configuration file (default_configuration.pck). -Astrogen ---------------------------------------------------------------------- Comment By: Benjamin J. Judson (astrogen) Date: 2003-04-15 07:32 Message: Logged In: YES user_id=752965 Just figured there would be a log file somewhere.. but anywho... I manually ran manager.py Heres the traceback (copied and pasted from a dos prompt window) Traceback (most recent call last): File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 309, in OnInitDialog self.expand_ids = self._DetermineFoldersToExpand() File "C:\spambayes-1.0a2\Outlook2000\dialogs\FolderSelector.py", line 226, in _DetermineFoldersToExpand folder = self.manager.message_store.GetFolder(folder_id) File "C:\spambayes-1.0a2\Outlook2000\msgstore.py", line 225, in GetFolder table = folder.GetContentsTable(0) pywintypes.com_error: (-2147467259, 'Unspecified error', None, None) win32ui: OnInitDialog() virtual handler (>) raised an exception Bayes database is not dirty - not writing ---------------------------------------------------------------------- Comment By: Benjamin J. 
Judson (astrogen) Date: 2003-04-15 07:26 Message: Logged In: YES user_id=752965 Where is the log file kept? I looked for it before submitting, and since your posted. I don't have the name or location for where I could find it. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-04-10 16:06 Message: Logged In: YES user_id=14198 If there a traceback associated with this? I regularly "test" this, thanks to Outlook screwing all my folder IDs as I reconfigure Outlook, and I don't have the problem. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=717998&group_id=61702 From noreply at sourceforge.net Sat May 17 06:26:50 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat May 17 08:26:56 2003 Subject: [Spambayes] [ spambayes-Bugs-721664 ] mboxtrain.py doesn't find Maildir tmp/ directory properly Message-ID: Bugs item #721664, was opened at 2003-04-15 04:10 Message generated for change (Comment added) made by dripton You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=721664&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: David M. Cooke (dmcooke) Assigned to: Nobody/Anonymous (nobody) Summary: mboxtrain.py doesn't find Maildir tmp/ directory properly Initial Comment: The Maildir handler for mboxtrain.py tries to use a tmp/ under the directory passed to it. It should use a tmp/ directory at the same level (so given a ~/Maildir/cur, it should use ~/Maildir/tmp). 
---------------------------------------------------------------------- Comment By: David Ripton (dripton) Date: 2003-05-17 08:26 Message: Logged In: YES user_id=9425 This is still broken as of latest CVS on 2003-05-17 Example output from an mboxtrain cron job very similar to the one in HAMMIE.txt: Training ham (/home/dripton/Maildir/): Reading as Maildir 1052838601.29078_0.ludo:2,RS 1052472674.1616_0.ludo:2,RS 1049307199.16131_1.dydimus,S=829:2,S 1047587487.1367_2.hoggle:2,S 1051558961.21629_2.willum:2,RS 1053034631.15484_1.willum:2,S Traceback (most recent call last): File "/home/dripton/src/spambayes/mboxtrain.py", line 294, in ? main() File "/home/dripton/src/spambayes/mboxtrain.py", line 281, in main train(h, g, False, force, trainnew) File "/home/dripton/src/spambayes/mboxtrain.py", line 216, in train maildir_train(h, os.path.join(path, "cur"), is_spam, force) File "/home/dripton/src/spambayes/mboxtrain.py", line 113, in maildir_train f = file(tfn, "wb") IOError: [Errno 2] No such file or directory: '/home/dripton/Maildir/cur/tmp/1053163264.21320_6.vidar' The one-line patch already attached to this bug works for me. Please apply it to CVS. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=721664&group_id=61702 From akiva at atwood.co.il Sat May 17 23:09:08 2003 From: akiva at atwood.co.il (Akiva Atwood) Date: Sat May 17 19:09:57 2003 Subject: [Spambayes] RE: Spambayes Digest, Vol 57, Issue 36 In-Reply-To: Message-ID: > > I've been getting the same message -- except it says "drive E:", > which is my > DVD drive. > The caption of the dialog box says DDE Server... 
Akiva From mhammond at skippinet.com.au Mon May 19 10:20:42 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun May 18 19:21:40 2003 Subject: [Spambayes] Strange message In-Reply-To: Message-ID: <03e301c31d94$1ab8a660$530f8490@eden> [Tim P] > If the hypothesized scenario is correct, I'd call it a Python > buglet in Windows. There is a chance that SetErrorMode() will fix the problem. http://msdn.microsoft.com/library/default.asp?url=/library/en-us/debug/base/seterrormode.asp Particularly SEM_NOOPENFILEERRORBOX - IIRC, this used to stop floppy drives displaying a similar error - however, I can't reproduce a similar condition on my floppy/cd now (last I tried was certainly before Win2000). It would be very interesting to know what the following Python session shows, while the device is ejected: >>> f = open("e:\\foo", "r") If that displays the dialog, then try: >>> import win32api, win32con >>> win32api.SetErrorMode(win32con.SEM_NOOPENFILEERRORBOX) And try again. If the error does not display, do >>> win32api.SetErrorMode(0) # reset to default and see if it comes back. Is anyone with Python and a ZIP drive able to try this? Fingers-crossed ly, Mark.
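[Editorial sketch, not part of the original thread.] The experiment Mark describes can be wrapped so that the error mode is always restored afterwards. This version is an assumption-laden variant: it calls the Win32 SetErrorMode function through ctypes instead of the win32api module used in the message, and it also sets SEM_FAILCRITICALERRORS, which is not mentioned in the thread but which Microsoft documents as the flag governing critical-error dialogs. The names `suppressed_error_dialogs` and `can_open` are hypothetical. On non-Windows platforms the context manager does nothing. Nothing in the thread confirms that this removes the "No Disk" box.

```python
import contextlib
import ctypes
import sys

# Flag values from the Win32 SetErrorMode documentation.
SEM_FAILCRITICALERRORS = 0x0001
SEM_NOOPENFILEERRORBOX = 0x8000


@contextlib.contextmanager
def suppressed_error_dialogs(
        flags=SEM_FAILCRITICALERRORS | SEM_NOOPENFILEERRORBOX):
    """Temporarily ask Windows to return errors instead of showing dialogs."""
    if sys.platform != "win32":
        yield None  # nothing to do off Windows
        return
    kernel32 = ctypes.windll.kernel32
    previous = kernel32.SetErrorMode(flags)  # returns the prior mode
    try:
        yield previous
    finally:
        kernel32.SetErrorMode(previous)  # restore whatever was set before


def can_open(path):
    """Try to open `path` for reading with error dialogs suppressed."""
    try:
        with suppressed_error_dialogs():
            with open(path, "rb"):
                return True
    except OSError:
        return False


if __name__ == "__main__":
    # With the removable device ejected, this should report False quietly
    # if (and only if) the dialog is governed by the process error mode.
    print(can_open("e:\\foo"))
```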
From noreply at sourceforge.net Sun May 18 18:52:05 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 18 20:52:10 2003 Subject: [Spambayes] [ spambayes-Bugs-737874 ] Outlook crash wipes out SpamBayes addin settings & training Message-ID: Bugs item #737874, was opened at 2003-05-15 06:18 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737874&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Scott Sanders (giveuptheghost) Assigned to: Mark Hammond (mhammond) Summary: Outlook crash wipes out SpamBayes addin settings & training Initial Comment: When Outlook crashes, it wipes out the SpamBayes Outlook Addin's settings, including all its training. At least, it does for me. Most programs have their important data and settings in a file somewhere or the Registry where it somehow can't be corrupted, much less erased, when a problem comes up. Maybe SpamBayes should look into this concept. I've attached the log for the affected session, though, after perusing it myself, it looks like SpamBayes is not causing Outlook to crash. I think it has to do with my network printer driver when I'm offline from the network, which causes a similar problem with other programs. (As a workaround, I've removed the Print button from the Outlook windows that I use, since it seems to crash just on mouseover!) I'm running SpamBayes Outlook Addin 002 with Outlook XP under Windows XP Pro. The other Outlook COM addins that are enabled are SpamSource and SyncNow. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-19 10:52 Message: Logged In: YES user_id=14198 I'm going to mark this as fixed - please re-open if the current CVS version still fails for you (or if the next binary still does). 
The current code takes a number of defensive measures: * All config files (and pickle databases) are saved to a temp file, then moved to the real file. * Configuration is saved whenever the dialog is closed. * Databases are saved after any training operations. The combination of these things should resolve this issue. ---------------------------------------------------------------------- Comment By: Scott Sanders (giveuptheghost) Date: 2003-05-15 10:20 Message: Logged In: YES user_id=184281 Hi. The last crash occurred this afternoon, and the settings files that exist and their modified dates are as follows: default_bayes_customize.ini - 5/10/2003 3:45 PM default_bayes_database.db - 5/13/2003 3:59 PM default_configuration.pck - 5/13/2003 3:59 PM default_message_database.db - 5/10/2003 3:45 PM Apparently the settings/training isn't changed until Outlook is closed? I usually have Outlook always running, and I seldom intentionally close it, except for an Outlook-related program update (rare) or if I shut down my comp. (also rare). I may be an exception, but I think the settings/training should be written at the time of setting/training. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-15 09:21 Message: Logged In: YES user_id=552329 The settings are saved in various files which, for you, are in the directory: C:\Documents and Settings\Administrator\Application Data\SpamBayes\ Neither the database nor the config pickle should be corrupted by an Outlook crash unless they were being used at the time. At one point the plugin didn't save the config settings until shutdown (this might even still be the case), so if you are crashing before you close Outlook, you would never save the settings, which would cause this behaviour. Check to see if the files are there, before and after the crash, and if they are modified after the crash. It certainly does seem from the log that neither file is saved.
This could perhaps be changed, but as I said above, it might have done so already since the 002 release. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737874&group_id=61702 From Paul.Moore at atosorigin.com Mon May 19 10:01:48 2003 From: Paul.Moore at atosorigin.com (Moore, Paul) Date: Mon May 19 04:03:10 2003 Subject: [Spambayes] Strange message Message-ID: <16E1010E4581B049ABC51D4975CEDB880113DAFB@UKDCX001.uk.int.atosorigin.com> From: Mark Hammond [mailto:mhammond@skippinet.com.au] > It would be very interesting to know what the following > Python session shows, while the device is ejected: >>> f = open("e:\\foo", "r") I tried it (my Zip drive is E:) and got IOError: [Errno 13] Permission denied: 'e:\\foo' > Fingers-crossed ly, It may be OS-specific (I'm on 2K Pro). I've seen dialog boxes like this before. Zip drive support on Windows (especially for older zip drives like mine) seems flakey at times :-( Paul. From tdickenson at devmail.geminidataloggers.co.uk Mon May 19 12:45:12 2003 From: tdickenson at devmail.geminidataloggers.co.uk (Toby Dickenson) Date: Mon May 19 06:45:17 2003 Subject: [Spambayes] ham/unsure/spam split In-Reply-To: <16064.7417.687631.115533@montanaro.dyndns.org> References: <056b01c31832$3c818b20$530f8490@eden> <140726241099.20030513073717@getsoft.com> <16064.7417.687631.115533@montanaro.dyndns.org> Message-ID: <200305191145.12976.tdickenson@devmail.geminidataloggers.co.uk> On Monday 12 May 2003 11:15 pm, Skip Montanaro wrote: > Messages > classified as spam get tossed into two mailboxes by procmail, one for those > with a score of 0.99 or 1.00, and all others which exceed my spam_cutoff > (around 0.88). The only practical difference is the "high spam" mail gets > scanned less frequently and more quickly. Ive been using this for the last week, and it does simplify my daily spam review. 
The majority of my spams score 0.99, and I've never had a false positive score above 0.80. Does your system treat 'unsures' differently to hams? If not, would it make sense to set your ham_cutoff to where you currently have spam_cutoff, and spam_cutoff to 0.99? If you are treating hams and unsures differently, I would be interested to hear how. Is this a case for extending the current three-way categorisation with a fourth 'high spam' category, and adding high_spam_cutoff? From mhammond at skippinet.com.au Mon May 19 22:27:04 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Mon May 19 07:28:25 2003 Subject: [Spambayes] Strange message In-Reply-To: <16E1010E4581B049ABC51D4975CEDB880113DAFB@UKDCX001.uk.int.atosorigin.com> Message-ID: <01a001c31df9$9358fa10$530f8490@eden> > > It would be very interesting to know what the following > > Python session shows, while the device is ejected: > > >>> f = open("e:\\foo", "r") > > I tried it (my Zip drive is E:) and got > > IOError: [Errno 13] Permission denied: 'e:\\foo' I get the exact same result on Win2k and XP from CD drives. > > Fingers-crossed ly, > > It may be OS-specific (I'm on 2K Pro). I've seen dialog > boxes like this before. Zip drive support on Windows > (especially for older zip drives like mine) seems flakey > at times :-( I can't recall all the details - have you (or anyone) seen this on other than XP? Or on a CD/DVD drive? Mark.
From Paul.Moore at atosorigin.com Mon May 19 13:35:38 2003 From: Paul.Moore at atosorigin.com (Moore, Paul) Date: Mon May 19 07:36:58 2003 Subject: [Spambayes] Strange message Message-ID: <16E1010E4581B049ABC51D4975CEDB880113DB04@UKDCX001.uk.int.atosorigin.com> From: Mark Hammond [mailto:mhammond@skippinet.com.au] > I can't recall all the details - have you (or anyone) seen > this on other than XP? Or on a CD/DVD drive? Just to clarify - I've never had this from Spambayes, but I have had similar looking messages, on both 2K and XP. They are sometimes when there *is* a disk present, but not recognised for some reason. And they sometimes involve problems ejecting the disk. Hence my comment that zip support is flakey. I've never seen anything like this with a CD/DVD drive. Not much help, I know. Sorry. Paul. From skip at pobox.com Mon May 19 10:36:17 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon May 19 10:36:28 2003 Subject: [Spambayes] Re: ham/unsure/spam split In-Reply-To: <200305191145.12976.tdickenson@devmail.geminidataloggers.co.uk> References: <056b01c31832$3c818b20$530f8490@eden> <140726241099.20030513073717@getsoft.com> <16064.7417.687631.115533@montanaro.dyndns.org> <200305191145.12976.tdickenson@devmail.geminidataloggers.co.uk> Message-ID: <16072.60385.611148.730904@montanaro.dyndns.org> Toby> Does your system treat 'unsures' differently to hams? Yes. All unsures go into an unsure mailbox. I train on all of them, on a smattering of the spams which score less than 0.95, and any hams I notice which score above 0.00. Toby> If you are treating hams and unsures differently, I would be Toby> interested to hear how. Is this a case for extending the current Toby> three-way categorisation with a fourth 'high spam' category, and Toby> adding high_spam_cutoff?
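The fourth 'high spam' band Toby asks about could be sketched as a simple threshold ladder. The 0.88 and 0.99 values come from the thread; the function name `classify`, the `ham_cutoff=0.20` default and the use of `>=` at each boundary are assumptions made for illustration, not a description of how SpamBayes itself compares scores.

```python
def classify(score, ham_cutoff=0.20, spam_cutoff=0.88, high_spam_cutoff=0.99):
    """Map a score in [0.0, 1.0] to one of four bands.

    Hypothetical sketch: ham_cutoff and the inclusive comparisons are assumed.
    """
    if score >= high_spam_cutoff:
        return "high spam"   # reviewed rarely and quickly
    if score >= spam_cutoff:
        return "spam"        # reviewed more carefully
    if score >= ham_cutoff:
        return "unsure"      # trained on by hand
    return "ham"


for s in (0.00, 0.50, 0.90, 0.99):
    print(s, classify(s))
```

With only three bands, the same effect needs two separate delivery rules keyed on the score, which is what the procmail setup quoted earlier in the thread does.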
I don't think that's generally necessary. I think I get more spam than the average person, so it makes sense for me to distinguish between the .99's and the rest of the messages which score as spam. Here's how my mail was classified for 2003-05-17: unsure: 15 spam: 620 ham: 693 As you can see, it's not a big problem to simply toss all the unsures into their own mailbox. No other special treatment is necessary. Skip From DirkEichler at gmx.de Mon May 19 14:37:26 2003 From: DirkEichler at gmx.de (Dirk Eichler) Date: Mon May 19 11:55:49 2003 Subject: [Spambayes] Troubleshooting Message-ID: <000001c31dfb$0622da10$0101a8c0@angel> Hi! >If the SpamBayes addin is listed and checked, but still not working and still not creating log files, then I am stumped! Unfortunately exactly this happens when I am installing spambayes on my system with Outlook XP. Can you help me? I've installed it running Outlook2000 before rebuilding my Computer these days and it worked. But now it says it is unable to load due to runtime problems. By Dirk From srussell at Hopkins.k12.ky.us Mon May 19 11:33:49 2003 From: srussell at Hopkins.k12.ky.us (Russell, Stephen) Date: Mon May 19 11:56:23 2003 Subject: [Spambayes] Spam overload Message-ID: <260AFBE9E8D9B34581F6C74CFA93676B04A9CE@e265000n1.hopkins.k12.ky.us> Dear experts, We are a school system with 1100 faculty and 7000 student accounts on a Win2000 server with Exchange 5.5 SP3. We are in spam overload and are in desperate need of help. What can I do to stop this onslaught of unwanted mail and still be able to afford to maintain our network? I am a department of one and I need a product that is simple to use, doesn't require massive amounts of admin time, and doesn't cost an arm and a leg. I don't like the idea of subscribing to an online service with a yearly fee and would prefer that the app sit on the Exchange box. I have done a lot of research and it has brought me to you.
By reading your site I can see you guys are at the forefront of this endeavor to stop spam and thought that you could point me in the right direction. Any ideas? Thanks, Stephen Russell From skip at pobox.com Mon May 19 12:15:22 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon May 19 12:15:41 2003 Subject: [Spambayes] Spam overload In-Reply-To: <260AFBE9E8D9B34581F6C74CFA93676B04A9CE@e265000n1.hopkins.k12.ky.us> References: <260AFBE9E8D9B34581F6C74CFA93676B04A9CE@e265000n1.hopkins.k12.ky.us> Message-ID: <16073.794.9219.702141@montanaro.dyndns.org> Stephen> We are a school system with 1100 faculty and 7000 student Stephen> accounts on a Win2000 server with Exchange 5.5 SP3. We are in Stephen> spam overload and are in desperate need of help.... By reading Stephen> your site I can see you guys are at the forefront of this Stephen> endeavor to stop spam and thought that you could point me in Stephen> the right direction. When you say "spam overload" do you mean your Exchange server is crumbling under the weight of spam or that your users are being flooded with spam or both? If your users are using Outlook, the Outlook plugin looks like it should fit into your environment pretty well. If your users use some other email client(s), the pop3proxy will work. Both of these will be spam filters which operate at the client end of things. Neither is really designed to work on the server. There is an opportunity for an SMTP proxy to sit between Exchange and the rest of the world. There is an SMTP proxy built into the pop3proxy, but I don't know how it's intended to be configured. Someone else on the list will be able to tell you if it will work as I envisioned. 
Skip From gmino at pcsltd.com Sun May 18 01:53:11 2003 From: gmino at pcsltd.com (Gabriel Mino) Date: Mon May 19 12:31:22 2003 Subject: [Spambayes] Spam overload In-Reply-To: <3261E796E368954CB22963F2B63E8105390C81@xmail.pcsltd.com> Message-ID: <3261E796E368954CB22963F2B63E810513771D@xmail.pcsltd.com> #1 - tell those teachers to stop looking @ pr0n while @ school!!! ; p #2 - http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambayes/INTEGRATION.txt?rev=HEAD&content-type=text/plain for starters sorry 4 being w153455...although your question seems a bit vague. What would you like to do with spambayes? As much as I LUB spambayes (1.5 months without 1 spam in inbox [fingers crossed])right off the top of my head, I'd say it's not the solution for you since it (as spam filters in general) is really user centric. I'd advise you look into the proxy configuration 1st. does anyone know how/using a shared database in proxy configuration? maybe some others have some better suggestions. Just my .02 Good luck!!! Let us know how we can help!!!
From skip at pobox.com Mon May 19 12:36:16 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon May 19 12:36:25 2003 Subject: [Spambayes] Spam overload In-Reply-To: <260AFBE9E8D9B34581F6C74CFA93676B04A9CF@e265000n1.hopkins.k12.ky.us> References: <260AFBE9E8D9B34581F6C74CFA93676B04A9CF@e265000n1.hopkins.k12.ky.us> Message-ID: <16073.2048.517536.146795@montanaro.dyndns.org> Stephen> The Exchange server seems to be holding its own. The users are Stephen> suffering the brunt of the load. A client-side solution would probably work best, though it means you need to install it and educate your users on its use. You didn't indicate what your principal email clients were. If you're using Outlook 2000, the plugin will probably be your best route. Stephen> I would have to talk to someone at the Dept. of Ed. to find out Stephen> if I could install something between our Exchange server and Stephen> them. Thanks for the info I will ask them asap. If it's a huge problem for you, it's probably a huge problem for the district as a whole. Perhaps they should be working on the solution. ;-) Stephen> Could you also ask someone to reply to me about the proxy, I Stephen> could use the additional info? Someone on the list who knows about such things will pipe up I'm sure.
Skip From skip at pobox.com Mon May 19 13:34:03 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon May 19 13:34:12 2003 Subject: Outlook plugin transparency (was RE: [Spambayes] Spam overload) In-Reply-To: <260AFBE9E8D9B34581F6C74CFA93676B04A9D1@e265000n1.hopkins.k12.ky.us> References: <260AFBE9E8D9B34581F6C74CFA93676B04A9D1@e265000n1.hopkins.k12.ky.us> Message-ID: <16073.5515.23061.491742@montanaro.dyndns.org> (Make sure you include spambayes@python.org in your replies if you'd like to get a more broad-based set of inputs. I don't use Windows, so I can only take things so far.) Stephen> Yes I'm sorry, Outlook 2000 is the client. The DoE has Stephen> mentioned that filters are in place. I haven't seen any change Stephen> to support that. So I guess it is up to us to fix the problem. Stephen> A transparent solution seems to work best for us. Training Stephen> teachers has been a problem in the past, the only thing Stephen> teachers want to do is teach. (can't figure that one out??) Others will have to speak to how transparent the Outlook plugin is. I suspect it shouldn't be too hard to pick up. I've never used it, but others on this list use it all the time. Perhaps one of them will chime in. I've modified the subject to include more a little more honey for the plugin flies. ;-) Here's some extra description with a screenshot or two which might tide you over until they start to swarm: http://tinyurl.com/c4rn Skip From noreply at sourceforge.net Mon May 19 11:49:38 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 19 13:49:48 2003 Subject: [Spambayes] [ spambayes-Bugs-738884 ] Microsoft Outlook Patch Q329295 Disables SpamBayes? 
Message-ID: Bugs item #738884, was opened at 2003-05-16 13:47 Message generated for change (Comment added) made by conwaycl You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=738884&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed >Priority: 1 Submitted By: Chuck Conway (conwaycl) Assigned to: Mark Hammond (mhammond) Summary: Microsoft Outlook Patch Q329295 Disables SpamBayes? Initial Comment: I use a Nextel phone feature called Mobile Email to manage my Outlook Calendar via my cell phone. (http://www.nextel.com). Nextel upgraded their software, and forced me to upgrade their desktop synchronization app from version 3.2 to 4.1. As part of the upgrade, the Nextel 4.1 software required me to install an Outlook patch from Microsoft. Here are links to the article and the actual binary patch download: http://support.microsoft.com/default.aspx?scid=kb;en-us;Q329295 http://download.microsoft.com/download/OfficeXPStandard/Patch/329295/W98NT42KMeXP/EN-US/q329295.exe After the patch was installed and the Nextel 4.1 installed, SpamBayes was broken: - No filtering - No toolbar icons - Log file indicates nothing - Add-In still visible under "COM Add-Ins" I de-installed and re-installed the SpamBayes binary, but it did not help. I'm not sure what direction to go in at this point. ---------------------------------------------------------------------- >Comment By: Chuck Conway (conwaycl) Date: 2003-05-19 13:49 Message: Logged In: YES user_id=780486 It's working again, but I'm not sure what exactly did it. I de-installed everything (Python/Win32 extensions/SpamBayes), rebooted, reinstalled, then rebooted again.
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=738884&group_id=61702 From noreply at sourceforge.net Mon May 19 11:50:12 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 19 13:50:17 2003 Subject: [Spambayes] [ spambayes-Bugs-739989 ] Corrupt Database wont train. Message-ID: Bugs item #739989, was opened at 2003-05-19 12:50 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=739989&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Benjamin J. Judson (astrogen) Assigned to: Mark Hammond (mhammond) Summary: Corrupt Database wont train. Initial Comment: So after updating to the latest CVS release I have not been able to run a manual train on the database. I checked out the message for why and got a stacktrace indicating that the database was corrupt. So I tried to do a rebuild database, and the same thing happened. So I deleted the database, and all the .pck files, and started from scratch. This is the stack trace Im getting now. 
C:\spambayes-1.0a2\Outlook2000>c:\python22\python.exe manager.py Loaded bayes database from 'C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_bayes_database.pck' Loaded message database from 'C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_message_database.pck' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 3.5133ms Exception in thread Thread-1: Traceback (most recent call last): File "C:\Python22\Lib\threading.py", line 408, in __bootstrap self.run() File "C:\Python22\Lib\threading.py", line 396, in run apply(self.__target, self.__args, self.__kwargs) File "C:\spambayes-1.0a2\Outlook2000\dialogs\AsyncDialog.py", line 115, in thread_target self._DoProcess() File "C:\spambayes-1.0a2\Outlook2000\dialogs\TrainingDialog.py", line 120, in _DoProcess self.trainer(self.mgr, self.progress, self.rebuild) File "C:\spambayes-1.0a2\Outlook2000\train.py", line 118, in trainer progress.set_stages(stages) AttributeError: _Progress instance has no attribute 'set_stages' Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_bayes_database.pck -> C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_message_database.pck Saved databases in 5.97925ms ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=739989&group_id=61702 From bill at parducci.net Mon May 19 12:00:59 2003 From: bill at parducci.net (bill parducci) Date: Mon May 19 14:01:03 2003 Subject: [Spambayes] Spam overload References: <3261E796E368954CB22963F2B63E810513771D@xmail.pcsltd.com> Message-ID: <3EC91BDB.50600@parducci.net> i have had good luck using a shared hammiedb with a dozen users (a far cry from 7000!) only the administrator retrains in this situation. 
accuracy is not what it would be with a user centric model, but spam by its very nature is fairly common so intuition (and my limited user community feedback) suggests that false positives are where you see the most significant reduction in accuracy, and even then it isn't surprisingly good. if you are being 'overrun', then 90+% accuracy is still better than nothing. :) while my users are imap based (procmail drops spam into each user's SPAM folder on the server), it seems that a proxy solution would work if you did subject rewrites and let the downstream client filter on that. personally, i would think thrice before implementing the outlook client on 7000 machines. on the other hand, you are not going to be able to have user trained hammiedbs with exchange otherwise (a proxy model wouldn't have access to the mboxes to train upon, so you would have to build a single message retrain model; easily doable on the server side--as long as you have the horsepower--but would require some significant user training) have you considered imap? :-P b From bill at parducci.net Mon May 19 12:28:37 2003 From: bill at parducci.net (bill parducci) Date: Mon May 19 14:28:40 2003 Subject: [Spambayes] Spam overload References: <3261E796E368954CB22963F2B63E810513771D@xmail.pcsltd.com> <3EC91BDB.50600@parducci.net> Message-ID: <3EC92255.8040303@parducci.net> > most significant reduction in accuracy, and even then it isn't > surprisingly good. oops. IS surprisingly good.
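The "subject rewrite, then let the downstream client filter on that" idea from bill's first message might look something like the sketch below. It is only an illustration: `tag_subject`, the `[SPAM]` tag and the 0.9 cutoff are assumed names and values, and the score is taken as an input rather than computed by a real classifier.

```python
import email


def tag_subject(raw_message, score, spam_cutoff=0.9, tag="[SPAM]"):
    """Return the message text, with the Subject tagged if score >= spam_cutoff.

    Hypothetical helper; the tag and cutoff are illustrative choices.
    """
    msg = email.message_from_string(raw_message)
    if score >= spam_cutoff:
        subject = msg.get("Subject", "")
        # Header assignment appends, so drop the old Subject first.
        del msg["Subject"]
        msg["Subject"] = f"{tag} {subject}".strip()
    return msg.as_string()


print(tag_subject("Subject: Cheap pills\n\nbody\n", 0.97))
```

A client-side rule that moves anything whose subject starts with the tag would then do the actual filing, so nothing needs to be installed on the 7000 machines beyond a mail rule.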
b From noreply at sourceforge.net Mon May 19 16:24:50 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 19 18:25:04 2003 Subject: [Spambayes] [ spambayes-Bugs-712480 ] Outlook 2002 (XP) installation fails Message-ID: Bugs item #712480, was opened at 2003-03-30 23:47 Message generated for change (Comment added) made by sm5669 You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul Marrero (pmarrero) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2002 (XP) installation fails Initial Comment: I use office XP with the Outlook client. It appears that the registration was successful but I cannot find any menu buttons. XP clipboard does appear to have the Icons. The command line train works. Not sure where to go from here. ---------------------------------------------------------------------- Comment By: Steve McCullough (sm5669) Date: 2003-05-19 17:24 Message: Logged In: YES user_id=782446 Same here. Outlook 2002 on Win2000 with all the latest SPs and patches applied. Log file...
warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\universal.py", line 170, in dispatch retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 322, in _InvokeEx_ return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 541, in _invokeex_ return apply(func, args) File "E:\src\spambayes\Outlook2000\addin.py", line 654, in OnConnection File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\manager.py", line 15, in ? File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\msgstore.py", line 86, in ? File "E:\src\Installer\iu.py", line 296, in importHook exceptions.ImportError: No module named win32com.mapi ---------------------------------------------------------------------- Comment By: Tormod Hystad (thystad) Date: 2003-05-16 02:27 Message: Logged In: YES user_id=780089 Exactly same as Nick here. Win2003 RC2, Outlook XP (Norwegian, version 10.2627.4219 SP2, configured with one Exchange (Standard) email acount) and the 002 install package. 
Thanks, Tormod ---------------------------------------------------------------------- Comment By: Nick Trautmann (nicktrautmann) Date: 2003-05-12 03:15 Message: Logged In: YES user_id=776857 Similar here: pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? 
exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\Nick\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize Thanks Nick ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-03-31 20:48 Message: Logged In: YES user_id=552329 Actually, I get this too. I've just switched to Outlook XP, so I'm not sure if this is the reason, or just that I'm doing a fresh install. 
The log includes the following traces: SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - 
creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-03-31 00:05 Message: Logged In: YES user_id=552329 Which version of the Outlook plugin are you using? (a) the latest CVS, (b) the 001 stand-alone installer, or (c) the 002 stand-alone installer? I know that the 001 installer has been known to have this problem (although it appeared to be fixed in 002). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 From T.A.Meyer at massey.ac.nz Tue May 20 12:04:52 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 19 19:05:46 2003 Subject: Outlook plugin transparency (was RE: [Spambayes] Spam overload) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB2F40@its-xchg4.massey.ac.nz> > Others will have to speak to how transparent the Outlook > plugin is. 
I suspect it shouldn't be too hard to pick up. > I've never used it, but others on this list use it all the > time. Perhaps one of them will chime in. IMO, the plugin is almost completely transparent (except for setup). I use the plugin for the majority of my filtering, and for the most part, I hardly notice that it's there. After everything is set up, the user just has mail in two extra folders ('unsures' and 'spam'), which need to be reviewed occasionally (unless you're confident enough to put the spam straight in the trash). If you find a misclassified message, just drag & drop it back to the correct folder. That's it, basically. (You can do various things via buttons on the toolbar, but you'll probably find you don't have to). Setup is a bit more complicated, but if you're able to do it for them, then that wouldn't be a problem. You could even install a prebuilt database that would catch most of the spam that goes through your site. I'd highly recommend trying the plugin out yourself and seeing if you think that your users would be able to handle it. =Tony Meyer From noreply at sourceforge.net Mon May 19 17:13:12 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 19 19:13:40 2003 Subject: [Spambayes] [ spambayes-Bugs-739989 ] Corrupt Database wont train. Message-ID: Bugs item #739989, was opened at 2003-05-20 03:50 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=739989&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Benjamin J. Judson (astrogen) Assigned to: Mark Hammond (mhammond) Summary: Corrupt Database wont train. Initial Comment: So after updating to the latest CVS release I have not been able to run a manual train on the database. I checked out the message for why and got a stacktrace indicating that the database was corrupt.
So I tried to do a rebuild database, and the same thing happened. So I deleted the database, and all the .pck files, and started from scratch. This is the stack trace Im getting now. C:\spambayes-1.0a2\Outlook2000>c:\python22\python.exe manager.py Loaded bayes database from 'C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_bayes_database.pck' Loaded message database from 'C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_message_database.pck' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 3.5133ms Exception in thread Thread-1: Traceback (most recent call last): File "C:\Python22\Lib\threading.py", line 408, in __bootstrap self.run() File "C:\Python22\Lib\threading.py", line 396, in run apply(self.__target, self.__args, self.__kwargs) File "C:\spambayes-1.0a2\Outlook2000\dialogs\AsyncDialog.py", line 115, in thread_target self._DoProcess() File "C:\spambayes-1.0a2\Outlook2000\dialogs\TrainingDialog.py", line 120, in _DoProcess self.trainer(self.mgr, self.progress, self.rebuild) File "C:\spambayes-1.0a2\Outlook2000\train.py", line 118, in trainer progress.set_stages(stages) AttributeError: _Progress instance has no attribute 'set_stages' Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_bayes_database.pck -> C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_message_database.pck Saved databases in 5.97925ms ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-20 09:13 Message: Logged In: YES user_id=14198 It sounds to me like your CVS update didn't finish the update of Outlook2000\dialogs. Revision 1.4 of AsyncDialog grew that method. 
Please check you have the latest files

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=739989&group_id=61702

From T.A.Meyer at massey.ac.nz Tue May 20 12:12:22 2003
From: T.A.Meyer at massey.ac.nz (Meyer, Tony)
Date: Mon May 19 19:15:37 2003
Subject: [Spambayes] Spam overload
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB2F51@its-xchg4.massey.ac.nz>

> There is an opportunity for an SMTP proxy to sit between
> Exchange and the rest of the world. There is an SMTP proxy
> built into the pop3proxy, but I don't know how it's intended
> to be configured.

Not at all as an incoming filter, unfortunately. The SMTP proxy is designed to be an alternative method of training the database, not doing any filtering.

OTOH, it wouldn't be all that hard to write one - it would be a cross between the POP3 proxy and the SMTP proxy. Most of the functionality needed is actually in message.py, anyway.

I'm not sure what you would do with it once it was classified, though. Move it from the user's folder to some admin folder? Just mark it like the POP3 proxy does? The plugin would really do a much nicer job, as long as it can be installed for all the users.

(To clarify, the SMTP proxy currently allows a user to forward/bounce a message to either a train-as-ham or train-as-spam address. It extracts a unique spambayes id from the message, gets the appropriate message from the POP3 proxy cache, and trains on it. All other mail is simply passed through. I've got a version written (but untested and uncommitted) that will allow operation in a similar mode, but train on the actual message forwarded/bounced, rather than using the POP3 proxy cache, but that still wouldn't help here).
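The train-by-forwarding flow described in the parenthetical above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual smtpproxy code: the header name X-Spambayes-MailId, the dict standing in for the POP3 proxy cache, and the train callback are all hypothetical.

```python
import re

# Hypothetical id header; the name the real proxy uses may differ.
ID_PATTERN = re.compile(r"X-Spambayes-MailId:\s*(\S+)", re.IGNORECASE)


def train_from_forward(forwarded_text, cache, train, is_spam):
    """Train on the cached original of a forwarded/bounced message.

    forwarded_text: raw text of the mail sent to the training address.
    cache: mapping of spambayes id -> original message text (an assumed
           stand-in for the POP3 proxy cache).
    train: callable(message_text, is_spam) that updates the database.
    Returns True if a cached message was found and trained on.
    """
    match = ID_PATTERN.search(forwarded_text)
    if match is None:
        return False  # no id in the message: nothing to look up
    original = cache.get(match.group(1))
    if original is None:
        return False  # not a message the POP3 proxy had filtered
    # Train on the cached copy, not on the (possibly mangled) forward.
    train(original, is_spam)
    return True
```

The point of going through the cache is that the forwarding mailer may have rewrapped or re-encoded the message, so only the id is trusted and the untouched copy is what gets trained on.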
=Tony Meyer From noreply at sourceforge.net Mon May 19 17:25:01 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 19 19:25:08 2003 Subject: [Spambayes] [ spambayes-Bugs-739989 ] Corrupt Database wont train. Message-ID: Bugs item #739989, was opened at 2003-05-19 12:50 Message generated for change (Settings changed) made by astrogen You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=739989&group_id=61702 Category: Outlook Group: None Status: Open >Resolution: Fixed Priority: 5 Submitted By: Benjamin J. Judson (astrogen) Assigned to: Mark Hammond (mhammond) Summary: Corrupt Database wont train. Initial Comment: So after updating to the latest CVS release I have not been able to run a manual train on the database. I checked out the message for why and got a stacktrace indicating that the database was corrupt. So I tried to do a rebuild database, and the same thing happened. So I deleted the database, and all the .pck files, and started from scratch. This is the stack trace Im getting now. 
C:\spambayes-1.0a2\Outlook2000>c:\python22\python.exe manager.py Loaded bayes database from 'C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_bayes_database.pck' Loaded message database from 'C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_message_database.pck' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 3.5133ms Exception in thread Thread-1: Traceback (most recent call last): File "C:\Python22\Lib\threading.py", line 408, in __bootstrap self.run() File "C:\Python22\Lib\threading.py", line 396, in run apply(self.__target, self.__args, self.__kwargs) File "C:\spambayes-1.0a2\Outlook2000\dialogs\AsyncDialog.py", line 115, in thread_target self._DoProcess() File "C:\spambayes-1.0a2\Outlook2000\dialogs\TrainingDialog.py", line 120, in _DoProcess self.trainer(self.mgr, self.progress, self.rebuild) File "C:\spambayes-1.0a2\Outlook2000\train.py", line 118, in trainer progress.set_stages(stages) AttributeError: _Progress instance has no attribute 'set_stages' Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_bayes_database.pck -> C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_message_database.pck Saved databases in 5.97925ms ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-19 18:13 Message: Logged In: YES user_id=14198 It sounds to me like your CVS update didn't finish the update of Outlook2000\dialogs. Revision 1.4 of AsyncDialog grew that method. 
Please check you have the latest files ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=739989&group_id=61702 From piersh at friskit.com Mon May 19 17:50:26 2003 From: piersh at friskit.com (Piers Haken) Date: Mon May 19 19:47:22 2003 Subject: [Spambayes] Spam overload Message-ID: <9891913C5BFE87429D71E37F08210CB92975D1@zeus.sfhq.friskit.com> I'm guessing that a client-side solution like spambayes won't work well for you unless you have all single-user machines and no migration of users between machines. If you're using an NT domain then you could map a network drive in the user's profile and store the spambayes configuration/database on a network share so it's accessible from multiple machines - you'll need to twiddle the local spambayes configuration files to make it look there by default. However, there's still a few problems: - you won't be able to have a single user use outlook from multiple machines. I'm not sure how well spambayes handles trying to open a DB on a network share that's already open by another machine. - spambayes still doesn't gracefully handle outlook crashing or being unable to save the DB for any reason. - you're going to increase your network load somewhat due to all the database load/saves over the network. Maybe the spambayes database integration would help here. I was thinking that it might be useful to do a SQL version of spambayes where the words and probabilities are stored in separate tables: tblTokens: idToken (int) strToken (varchar) tblStats idUser (int) idToken (int) cSpam (int) cHam (int) The table of tokenized strings would be shared between users (especially useful in this, 8000+ user case). You could have a stored procedure do the classification to save having to load/save the tokens/stats from the DB. 
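The two-table layout proposed above might look something like the following sketch. It uses Python's built-in sqlite3 module purely so that it runs as-is; the choice of SQLite, the key constraints, and the helper function names are assumptions made for illustration. Only the table and column names come from the message.

```python
import sqlite3

SCHEMA = """
CREATE TABLE tblTokens (
    idToken  INTEGER PRIMARY KEY,
    strToken VARCHAR NOT NULL UNIQUE      -- shared between all users
);
CREATE TABLE tblStats (
    idUser  INTEGER NOT NULL,
    idToken INTEGER NOT NULL REFERENCES tblTokens(idToken),
    cSpam   INTEGER NOT NULL DEFAULT 0,
    cHam    INTEGER NOT NULL DEFAULT 0,
    PRIMARY KEY (idUser, idToken)         -- assumed: one row per user/token
);
"""


def open_db(path=":memory:"):
    """Create the two tables in a fresh database and return the connection."""
    db = sqlite3.connect(path)
    db.executescript(SCHEMA)
    return db


def train(db, user_id, tokens, is_spam):
    """Bump the per-user spam or ham count once for each distinct token."""
    column = "cSpam" if is_spam else "cHam"  # fixed names, never user input
    for token in set(tokens):
        db.execute("INSERT OR IGNORE INTO tblTokens (strToken) VALUES (?)",
                   (token,))
        (token_id,) = db.execute(
            "SELECT idToken FROM tblTokens WHERE strToken = ?",
            (token,)).fetchone()
        db.execute(
            "INSERT OR IGNORE INTO tblStats (idUser, idToken) VALUES (?, ?)",
            (user_id, token_id))
        db.execute(
            "UPDATE tblStats SET %s = %s + 1 WHERE idUser = ? AND idToken = ?"
            % (column, column), (user_id, token_id))
    db.commit()


def counts(db, user_id, token):
    """Return (spam_count, ham_count) for one user's view of a token."""
    row = db.execute(
        "SELECT s.cSpam, s.cHam FROM tblStats s"
        " JOIN tblTokens t ON t.idToken = s.idToken"
        " WHERE s.idUser = ? AND t.strToken = ?",
        (user_id, token)).fetchone()
    return row if row is not None else (0, 0)
```

With this split, each token string is stored once however many users have seen it, and only the small integer rows in tblStats are per-user.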
Once you have this then you can either use it as a backend for a client-based tool, or write an exchange server plugin that works similarly to the outlook plugin, but runs on the server. Anyone up for it? Piers. > -----Original Message----- > From: Russell, Stephen [mailto:srussell@Hopkins.k12.ky.us] > Sent: Monday, May 19, 2003 8:34 AM > To: 'SpamBayes@python.org' > Subject: [Spambayes] Spam overload > > > Dear experts, > We are a school system with 1100 faculty and 7000 student > accounts on a Win2000 server with Exchange 5.5 SP3. We are > in spam overload and are in desperate need of help. What can > I do to stop this onslaught of unwanted mail and still be > able to afford to maintain our network? I am a department of > one and I need a product that is simple to use, doesn't > require massive amounts of admin time, and doesn't cost an > arm and a leg. I don't like the idea of subscribing to an > online service with a yearly fee and would prefer that the > app sit on the Exchange box. I have done a lot of research > and it has brought me to you. By reading your site I can see > you guys are at the forefront of this endeavor to stop spam > and thought that you could point me in the right direction. > > Any ideas? > > Thanks, > Stephen Russell > > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes > From piersh at friskit.com Mon May 19 17:53:20 2003 From: piersh at friskit.com (Piers Haken) Date: Mon May 19 19:50:19 2003 Subject: Outlook plugin transparency (was RE: [Spambayes] Spam overload) Message-ID: <9891913C5BFE87429D71E37F08210CB92C75CF@zeus.sfhq.friskit.com> Yeah, the plugin is transparent in a single-user case, but it's not so transparent in the cases where users can log in on different machines, or multiple users can log in on the same machine. Piers. 
> -----Original Message----- > From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] > Sent: Monday, May 19, 2003 4:05 PM > To: Russell, Stephen > Cc: spambayes@python.org > Subject: RE: Outlook plugin transparency (was RE: [Spambayes] > Spam overload) > > > > Others will have to speak to how transparent the Outlook > > plugin is. I suspect it shouldn't be too hard to pick up. > > I've never used it, but others on this list use it all the > > time. Perhaps one of them will chime in. > > IMO, the plugin is almost completely transparent (except for > setup). I use the plugin for the majority of my filtering, > and for the most part, I hardly notice that it's there. > After everything is set up, the user just has mail in two > extra folders ('unsures' and 'spam), which need to be > reviewed occasionally (unless you're confident enough to put > the spam straight in the trash). If you find a misclassified > message, just drag & drop it back to the correct folder. > That's it, basically. (You can do various things via buttons > on the toolbar, but you'll probably find you don't have to). > > Setup is a bit more complicated, but if you're able to do it > for them, then that wouldn't be a problem. You could even > install a prebuilt database that would catch most of the spam > that goes through your site. > > I'd highly recommend trying the plugin out yourself and > seeing if you think that your users would be able to handle it. > > =Tony Meyer > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes > From bill at parducci.net Mon May 19 18:13:37 2003 From: bill at parducci.net (bill parducci) Date: Mon May 19 20:13:41 2003 Subject: [Spambayes] Spam overload References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB2F51@its-xchg4.massey.ac.nz> Message-ID: <3EC97331.5030800@parducci.net> Meyer, Tony wrote: > I'm not sure what you would do with it once it was classified, though. 
> Move it from the user's folder to some admin folder? Just mark it like
> the POP3 proxy does?

the latter would be the only workable solution i think. one of the nice things about that approach would be that users who chose not to participate in (or were ignorant of) the process would get mail normally (just a little subject tweaking here and there). additionally, users of other client mail programs could take advantage of the system as well (e.g. in this situation if the exchange servers had POP3 or imap access enabled for *IX users).

> The plugin would really do a much nicer job, as
> long as it can be installed for all the users.

i want that service contract! :)

> (To clarify, the SMTP proxy currently allows a user to forward/bounce a
> message to either a train-as-ham or train-as-spam address. It extracts
> a unique spambayes id from the message, gets the appropriate message
> from the POP3 proxy cache, and trains on it. All other mail is simply
> passed through. I've got written (but untested and uncommitted) a
> version that will allow operation in a similar mode, but train on the
> actual message forwarded/bounced, rather than using the POP3 proxy
> cache, but that still wouldn't help here).

would be interesting to see what it would take to make it work. i'm not familiar with the details of your unpublished stuff, but at first blush it seems like you would need:

* a 'salt' hammiedb for each user
* two system mail accounts (e.g. spam@[mydomain] and ham@[mydomain])
* a procmail[-like] recipe (james mailet, etc.) to initiate hammiefilter against appropriate hammiedb (based upon TO address)
* a procmail[-like] recipe (james mailet, etc.)
to initiate mboxtrain against appropriate hammiedb (based upon FROM address, with a check to make sure ip address of sender is local--not bulletproof, but better than nothing)
* an 'account' for all users of the system (recipe/mailet would have to check for user's existence and passthru/discard/route to admin if not found)
* at least 70+GB (7000 * 10MB) of diskspace for hammiedbs (with a mechanism to tell when hammiedbs are getting too big, etc.)

whew! and that is just 5 minutes worth of consideration! :)

b

From noreply at sourceforge.net Mon May 19 19:12:04 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon May 19 21:12:09 2003
Subject: [Spambayes] [ spambayes-Bugs-740198 ] Installation fails with regsvr error
Message-ID:

Bugs item #740198, was opened at 2003-05-19 21:12
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740198&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Rob de Santos (rdesantos)
Assigned to: Mark Hammond (mhammond)
Summary: Installation fails with regsvr error

Initial Comment:
I'm trying to install SpamBayes into Outlook XP on Win98SE. The install fails with a regsvr error. Attempts at manually registering the file also fail. Here's the log:

Traceback (most recent call last):
  File "", line 10, in ?
  File "E:\src\Installer\iu.py", line 274, in importHook
  File "E:\src\Installer\iu.py", line 353, in doimport
  File "E:\src\spambayes\Outlook2000\addin.py", line 59, in ?
  File "E:\src\pythonex\com\win32com\client\gencache.py", line 368, in EnsureModule
pywintypes.com_error: (-2147312566, 'Error loading type library/DLL.', None, None)
SystemError: PyErr_NormalizeException() called without exception
Traceback (most recent call last):
  File "", line 11, in ?
AttributeError: 'module' object has no attribute 'OutlookAddin'

Any help?
Cheers, Rob ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740198&group_id=61702 From T.A.Meyer at massey.ac.nz Tue May 20 14:37:11 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 19 21:37:52 2003 Subject: [Spambayes] Spam overload Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB301D@its-xchg4.massey.ac.nz> > personally, I would think thrice before implementing the > outlook client on 7000 machines. Am I right in guessing that it's not 7000 machines, it's 7000 accounts, and many fewer machines? > have you considered imap? :-P I'd think more than thrice before encouraging anyone to use IMAP... :p =Tony Meyer From tim_one at email.msn.com Mon May 19 22:58:12 2003 From: tim_one at email.msn.com (Tim Peters) Date: Mon May 19 21:59:00 2003 Subject: [Spambayes] Heads up! Tokenizer changes In-Reply-To: Message-ID: I was home sick today, and took the opportunity to look at a small collection of surprising Unsures and exception-raising msgs I've put to the side since last December. Did some checkins, and they all score as solid spam now. The exception-raising msgs were ill-formed MIME that lacked a trailing boundary marker. The email pkg is happy enough with this provided they have at least a trailing blank line, but these didn't even have that much. I wormed around it in the Outlook client only, by catching the distinctive exception and feeding the string back into email.message_from_string() after tacking an empty line onto the end. It would be better if we had a common wrapper around email.message_from_string() so that all clients could benefit from these little hacks. The other was a systematic problem with the way non-comment HTML tags got stripped. 
Here's the checkin msg: """ I dug into a small collection of Unsures that looked like blatant spam, and discovered they were all using this kind of trick: Wrinkle Reduction That is, disguising words by inserting HTML nonsense tags. We replaced each tag with a blank, yielding the pretty useless tokens "Wr", "inkle", "Reduc" and "tion". We previously fixed a similar problem using embedded HTML comments. I should have fixed this other one then. Cute: these all scored at the high end of my Unsure range anyway. Now they're all solidly spam. """ That change was to tokenizer.py, and should benefit everyone. I recommend doing a retrain-from-scratch after you update the code, both to purge the useless word fragments that may have accumulated in your database, and to get the actual whole words into it. From T.A.Meyer at massey.ac.nz Tue May 20 16:44:30 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 19 23:45:12 2003 Subject: [Spambayes] Spam overload Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB30E6@its-xchg4.massey.ac.nz> > - spambayes still doesn't gracefully handle outlook crashing > or being unable to save the DB for any reason. Mark has fixed this (at least to a certain extent) with the changes he checked in over the last week. > - you're going to increase your network load somewhat due to > all the database load/saves over the network. It really all depends on how your system is set up, but as an alternative to storing the database on a network you could sync a local copy from the network (this sort of thing is done in the labs here). You might lose some training, but that shouldn't really matter, especially if the aim is to get rid of 95% of spam rather than 99.9%. > I was thinking that it might be useful to do a SQL version of > spambayes where the words and probabilities are stored in > separate tables: [...] 
> Once you have this then you can either use it as a backend
> for a client-based tool, or write an exchange server plugin
> that works similarly to the outlook plugin, but runs on the server.
>
> Anyone up for it?

Coincidentally, I was working with Python and MySQL today, so I threw this together as well (based on the given outline). My SQL is pretty rusty, so I'm not claiming that it's the fastest/most efficient implementation, but I have a SQLClassifier that can be used just like the PickleClassifier and DBClassifiers. (It works in my testing.) If there's more than one person interested in this then I can commit it as an update to storage.py; otherwise, if anyone does want to use SQL as their database, I can leave it as a patch on SF (let me know).

This can be used with any of the existing apps (pop3proxy, imapfilter, Outlook plugin), or with any new thing [1]. As for writing an exchange server plugin/smtp classifying proxy, I don't have any facility to do testing, so someone else will have to volunteer there.

=Tony Meyer

[1] Well, actually, it needs a little bit more work. We actually have _two_ databases - the word count one, and a message info one (well, pop3proxy, imapfilter and the Outlook plugin do). The SQL bit is only done for the word count one, not the message info, which could lead to rather odd results. It would be easy enough to fix this, but not until the 'master database' idea is resolved.

From T.A.Meyer at massey.ac.nz Tue May 20 16:50:48 2003
From: T.A.Meyer at massey.ac.nz (Meyer, Tony)
Date: Mon May 19 23:51:30 2003
Subject: [Spambayes] Heads up! Tokenizer changes
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB30EE@its-xchg4.massey.ac.nz>

> The exception-raising msgs were ill-formed MIME [...]
> It would be better
> if we had a common wrapper around email.message_from_string()
> so that all clients could benefit from these little hacks.

I think we really do need to have some sort of solution for these.
Is there someone that knows enough about how messages can be malformed/the errors that the email package throws that can put this together? The code should probably be added to message.py. This would fix imapfilter and pop3proxy immediately; I gather that the Outlook plugin will also use message.py at some point in the future, as will hammiefilter. c.f [ 737967 ] Malformed messages break pop3proxy (et al) =Tony Meyer From T.A.Meyer at massey.ac.nz Tue May 20 17:01:07 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 20 00:01:47 2003 Subject: [Spambayes] Suddenly a lot of errors Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3103@its-xchg4.massey.ac.nz> > >The problem is that these default to the current working directory, > >which, obviously, changes. The one with the information will be the > >42Mb one (which is pretty big; you must have trained an awful lot). > Not really about 3000 ham and 400 spam. I'm now just trying > to train more spam. Someone else speak up if this is wrong, but 42Mb seems huge for a 3400 message database. Are you using a pickle or a dbm? > The latest CVS was quite different from the released gzip. Absolutely. There are plans for a new release (alpha3, probably) RSN; it's just finding time to do what's needed. =Tony Meyer From popiel at wolfskeep.com Mon May 19 22:21:54 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Tue May 20 00:22:00 2003 Subject: [Spambayes] Suddenly a lot of errors In-Reply-To: Message from "Meyer, Tony" of "Tue, 20 May 2003 16:01:07 +1200." <1ED4ECF91CDED24C8D012BCF2B034F1301AB3103@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3103@its-xchg4.massey.ac.nz> Message-ID: <20030520042154.59C252DE36@cashew.wolfskeep.com> In message: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3103@its-xchg4.massey.ac.nz> "Meyer, Tony" writes: >> >The problem is that these default to the current working directory, >> >which, obviously, changes. 
The one with the information will be the
>> >42Mb one (which is pretty big; you must have trained an awful lot).
>> Not really about 3000 ham and 400 spam. I'm now just trying
>> to train more spam.
>
>Someone else speak up if this is wrong, but 42Mb seems huge for a 3400
>message database. Are you using a pickle or a dbm?

Yes, this does seem huge; I have a 20M db from 23000 messages.

- Alex

From noreply at sourceforge.net Tue May 20 02:09:06 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue May 20 04:09:11 2003
Subject: [Spambayes] [ spambayes-Bugs-740311 ] Fails on XP
Message-ID:

Bugs item #740311, was opened at 2003-05-20 08:09
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Rony Steelandt (bucodi)
Assigned to: Nobody/Anonymous (nobody)
Summary: Fails on XP

Initial Comment:
Installed the binary installer 002 on XP Pro
No spamb buttons added
Here is the logfile

SpamAddin - Connecting to Outlook
Created new configuration file 'C:\Documents and Settings\rony.BUCODI\Application Data\SpamBayes\default_configuration.pck'
pythoncom error: Failed to call the universal dispatcher
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch
  File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection
  File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager
  File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__
  File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff
  File
"E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 From fheile at pacbell.net Mon May 19 14:18:14 2003 From: fheile at pacbell.net (Frank Heile) Date: Tue May 20 10:00:57 2003 Subject: [Spambayes] Problem installing spambayes Message-ID: I have windows 98 second edition. Outlook 2000 "internet mail only". I have also installed Spamnet from Cloudmark. After doing the binary install, I see no evidence (button) of SpamBayes. There is a log file in Windows/Temp but it has length 0, so no error message there. I go to outlook.tools.options.other.advanced_options.COM-Add-In and the box for SpamBayes is NOT checked. I checked it, hit OK, OK, etc. Still no button. Go in again and the SpamBayes button is NOT checked again. Every time I check the SpamBayes box and I close that last dialog and reopen it - it is NOT checked again. Any Ideas? I thought maybe Spamnet from Cloudmark might be interfering with SpamBayes, so I UNchecked Spamnet in the COM-Add-In box but SpamBayes still does not stay checked. Thanks for any help, Frank Heile frank@heile.org From fheile at pacbell.net Mon May 19 14:35:31 2003 From: fheile at pacbell.net (Frank Heile) Date: Tue May 20 10:01:18 2003 Subject: [Spambayes] Re: Problem installing spambayes In-Reply-To: Message-ID: Actually, after all my playing around, I now find that there are log files. I don't know if it was just a case of operator error before when I thought there was only 1 log file and that it's length is 0, but now the spambayes1.log is 0 length, but the other 3 are 1K to 2K long. 
I suspect this might be another instance of bug "[ 725449 ] Binary plugin fails on Win9x". The longest is spambayes3.log and here it is: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): 
File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding -----Original Message----- From: Frank Heile [mailto:fheile@pacbell.net] Sent: Monday, May 19, 2003 1:18 PM To: spambayes@python.org Subject: Problem installing spambayes I have windows 98 second edition. Outlook 2000 "internet mail only". I have also installed Spamnet from Cloudmark. After doing the binary install, I see no evidence (button) of SpamBayes. There is a log file in Windows/Temp but it has length 0, so no error message there. I go to outlook.tools.options.other.advanced_options.COM-Add-In and the box for SpamBayes is NOT checked. I checked it, hit OK, OK, etc. Still no button. Go in again and the SpamBayes button is NOT checked again. Every time I check the SpamBayes box and I close that last dialog and reopen it - it is NOT checked again. Any Ideas? I thought maybe Spamnet from Cloudmark might be interfering with SpamBayes, so I UNchecked Spamnet in the COM-Add-In box but SpamBayes still does not stay checked. 
Thanks for any help,
Frank Heile
frank@heile.org

From bill at parducci.net Tue May 20 08:03:48 2003
From: bill at parducci.net (bill parducci)
Date: Tue May 20 10:03:52 2003
Subject: [Spambayes] Suddenly a lot of errors
References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3103@its-xchg4.massey.ac.nz> <20030520042154.59C252DE36@cashew.wolfskeep.com>
Message-ID: <3ECA35C4.1060708@parducci.net>

T. Alexander Popiel wrote:
> Yes, this does seem huge; I have a 20M db from 23000 messages.

10M for ~8,000 msgs here.

b

From bkc at murkworks.com Tue May 20 11:28:34 2003
From: bkc at murkworks.com (Brad Clements)
Date: Tue May 20 10:10:09 2003
Subject: [Spambayes] sharing split database
Message-ID: <3ECA0352.24844.2F62321@localhost>

Do you suppose that a majority of the 'words' in the database are common between different users, but only the ham/spam weights are different?

I'd like to consider a 'site-wide' spambayes implementation (i.e., like for 7000 accounts), but do this by splitting the database into two components: the word list, which is global, and the ham/spam weights, which are personal.

I'd like to collect word lists from a bunch of sources to see how many of them are common. Are there any privacy issues that would make collecting word lists (but not weights) objectionable to this group?

--
Brad Clements, bkc@murkworks.com (315)268-1000
http://www.murkworks.com (315)268-9812 Fax
http://www.wecanstopspam.org/ AOL-IM: BKClements

From rasprague at myrealbox.com Tue May 20 07:59:53 2003
From: rasprague at myrealbox.com (Richard Sprague)
Date: Tue May 20 10:13:15 2003
Subject: [Spambayes] Spambayes error
Message-ID: <20030520135952.GA9758@zeus>

I'm currently running the spambayes filtering software and have been running into a few problems. Here are some details on the problem. I hope this message will serve to improve this project.

procmail log:

Traceback (most recent call last):
  File "/usr/bin/hammie.py", line 6, in ?
    spambayes.hammiebulk.main()
  File "/usr/lib/python2.2/site-packages/spambayes/hammiebulk.py", line 208, in main
    filtered = h.filter(msg)
  File "/usr/lib/python2.2/site-packages/spambayes/hammie.py", line 108, in filter
    prob, clues = self._scoremsg(msg, True)
  File "/usr/lib/python2.2/site-packages/spambayes/hammie.py", line 38, in _scoremsg
    return self.bayes.spamprob(tokenize(msg), evidence)
  File "/usr/lib/python2.2/site-packages/spambayes/classifier.py", line 217, in chi2_spamprob
    clues = self._getclues(wordstream)
  File "/usr/lib/python2.2/site-packages/spambayes/classifier.py", line 441, in _getclues
    prob = self.probability(record)
  File "/usr/lib/python2.2/site-packages/spambayes/classifier.py", line 301, in probability
    assert hamcount <= nham
AssertionError
procmail: Program failure (1) of "/usr/bin/hammie.py"
procmail: Rescue of unfiltered data succeeded
From rasprague@csupomona.edu Tue May 20 01:09:30 2003
 Subject: agina
  Folder: /home/rasprague/mail/In    2070

.procmailrc:

MAILDIR=$HOME/mail
DEFAULT=$MAILDIR/In
PMDIR=$HOME/.procmail
LOGFILE=$PMDIR/log

## grab stuff I send to myself
:0
* ^From.*rasprague\@csupomona\.edu.*
people/self

:0
* ^From.*rasprague\@myrealbox\.com.*
people/self

## sort mail subscriptions
:0
* ^TO.*satechs\@listproc\.csupomona\.edu.*
campus/satech

:0
* ^TO.*calpolyswingkids\@yahoogroups\.com.*
campus/swingkid

:0
* ^TO.*memories\@memoriesdancing\.com.*
campus/swingkid

:0
* ^TO.*\"\"\@pacocean\.com.*
campus/swe

## auto sort ebay auction replies
:0
* ^From.*endofauction\@ebay\.com.*
commercial/ebay

:0
* ^From.*bidconfirm@ebay\.com.*
commercial/ebay

## SpamAssassin filtering
:0
* ^X-Spam-Flag: YES
spam

## SpamBayes filtering
:0 fw:hamlock
| /usr/bin/hammie.py -f -d -p $HOME/.spambayes/hammie.db

:0
* ^X-Spambayes-Classification: spam
spam

:0
* ^X-Spambayes-Classification: unsure
unsure

.forward:

"|IFS=' ' && exec /usr/bin/procmail || exit 75 #rasprague"

spam training script:

#!/bin/sh
HAMBOX="$HOME/mail/In"
SPAMBOX="$HOME/mail/spam"
/usr/bin/nice -n10 /usr/bin/hammie.py -d -p $HOME/.spambayes/hammie.db -g $HAMBOX -s $SPAMBOX

output of spam training script:

Training ham (/home/rasprague/mail/In): 68
Training spam (/home/rasprague/mail/spam): 725

hammie.db: attached
spambayes version: 1.0a2
python version: 2.2.2
python email package version: (don't know that one offhand, nor how to check it)
procmail version: 3.15.2

system information:

$ uname -a
Linux zeus 2.4.20 #2 Fri Apr 25 01:39:55 PDT 2003 i686 unknown

$ cat /proc/meminfo
        total:     used:      free:  shared: buffers:  cached:
Mem:   262565888 253366272    9199616       0 27742208 88502272
Swap: 1077501952  65155072 1012346880
MemTotal:    256412 kB
MemFree:       8984 kB
MemShared:        0 kB
Buffers:      27092 kB
Cached:       51648 kB
SwapCached:   34780 kB
Active:      166036 kB
Inactive:     60396 kB
HighTotal:        0 kB
HighFree:         0 kB
LowTotal:    256412 kB
LowFree:       8984 kB
SwapTotal:  1052248 kB
SwapFree:    988620 kB

$ df
Filesystem  1k-blocks     Used Available Use% Mounted on
/dev/hda5    10490084  2262624   8227460  22% /
/dev/hda6    10490084  1005504   9484580  10% /home
/dev/hda2       23333     3994     18135  19% /boot

Linux distribution: Slackware 9.0

Richard Sprague
(note the change in address)-------|
                                   |
rasprague@myrealbox.com  <---------|
-------------- next part --------------
A non-text attachment was scrubbed...
Name: hammie.db
Type: application/octet-stream
Size: 1290240 bytes
Desc: not available
Url : http://mail.python.org/pipermail/spambayes/attachments/20030520/2fa5cc16/hammie-0001.obj

From gmino at pcsltd.com Tue May 20 11:15:07 2003
From: gmino at pcsltd.com (Gabriel Mino)
Date: Tue May 20 10:14:37 2003
Subject: [Spambayes] outlook plugin documentation
Message-ID: <3261E796E368954CB22963F2B63E81051EECD6@xmail.pcsltd.com>

I found what is currently available to be just fine, but wrote up a
step-by-step one for others to use.
I recently forwarded it to someone here and was just wondering if anyone
would want to take a look @ it or host it

From mhammond at skippinet.com.au Wed May 21 01:23:46 2003
From: mhammond at skippinet.com.au (Mark Hammond)
Date: Tue May 20 10:24:42 2003
Subject: [Spambayes] outlook plugin documentation
In-Reply-To: <3261E796E368954CB22963F2B63E81051EECD6@xmail.pcsltd.com>
Message-ID: <03fb01c31edb$6ce7d670$530f8490@eden>

> I found what is currently available to be just fine but, wrote up a
> step-by-step one for others to use. I recently forwarded it
> to someone here

I hope that wasn't me

> and was just wondering if anyone would want to take a look @
> it or host it

Sure! Ideally, if you could edit it so that it complements or replaces the
existing documentation, we could just include it with the core. The same
things being documented a couple of times may well confuse more than help,
and as no one else around here has taken any serious interest in the docs,
just pretend that you own the documentation, and sure enough, you will!

Mark.

From mhammond at skippinet.com.au Wed May 21 01:30:07 2003
From: mhammond at skippinet.com.au (Mark Hammond)
Date: Tue May 20 10:31:05 2003
Subject: [Spambayes] Re: Problem installing spambayes
In-Reply-To:
Message-ID: <041801c31edc$507bb9b0$530f8490@eden>

> Actually, after all my playing around, I now find that there are log files.
> I don't know if it was just a case of operator error before when I thought
> there was only 1 log file and that its length is 0, but now the
> spambayes1.log is 0 length, but the other 3 are 1K to 2K long.

It seems as though the file is being buffered, and not actually written
until the process ends - which is weird, as I try and avoid buffering.

> I suspect this might be another instance of bug "[ 725449 ] Binary
> plugin fails on Win9x".
> The longest is spambayes3.log and here it is:

Yes, it is the same bug :( This means the source code version will work for
you, even though it is more of a pain to install.

I am slowly working towards a new binary release, but this looks a week away
at least, and more than likely a couple.

Mark.

From jbeard at heritageacademies.com Tue May 20 11:20:48 2003
From: jbeard at heritageacademies.com (Jeremy Beard)
Date: Tue May 20 11:02:52 2003
Subject: [Spambayes] bug? - when I click the "Delete as Spam" button, nothing happens
Message-ID: <48E0B1D75756CB4895FF6CC18E882F1F05222C64@nhamail2.nhacentral.com>

Hi, I tried to hit http://sourceforge.net/tracker/?group_id=61702&atid=498103
to check the existing bugs, but I get "We're Sorry. The SourceForge.net
Website is currently down for maintenance. We will be back shortly." So, I
am emailing you the contents of my log files.

I am running Windows XP Pro and Outlook 2002, and your Binary Outlook Add-in.

My problem is that when I click the "Delete as Spam" button, nothing
happens. The message that is highlighted does not get moved into my
designated Spam folder, or get graded as spam or anything. It is still 0%
spam in my Spam score field. When I manually move the message to my Spam
folder and re-train using the Spam Manager, it works, the spam is graded
100% spam.

This may also be worth mentioning: when I first installed the add-in, during
the installation process I got the message "There is no disk present, please
install disk into drive E" (my CDROM drive.) I just clicked "Continue" a few
times and got past the error. The title of the error window was some *.temp
file, I'm sorry I did not get the exact name. It was a lot of numbers.

I uninstalled the add-in using add-remove programs, and got the message that
'not everything was removed.' I continued and reinstalled, and overwrote the
existing directory that was originally installed but did not get removed
during uninstall.
Again I got the "There is no disk present in drive E". Again I just
continued past this, as it was not in the instructions at
file:///C:/Program%20Files/Spambayes%20Outlook%20Addin/docs/troubleshooting.html#InstallationType.

Please let me know if there is a fix for my "Delete as Spam" button problem,
or if I am doing anything wrong.

Thanks and I look forward to using your software!

-Jeremy Beard
jbeard@heritageacademies.com

Contents of current Spambayes log files:
_____________________________________
spambayes1.log

SpamAddin - Connecting to Outlook
Loaded bayes database from 'C:\Documents and Settings\jbeard\Application Data\SpamBayes\default_bayes_database.db'
Loaded message database from 'C:\Documents and Settings\jbeard\Application Data\SpamBayes\default_message_database.db'
Bayes database initialized with 84 spam and 42 good messages
Loaded databases in 4.53801ms
pythoncom error: Python error invoking COM method.
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 342, in OnClick
  File "E:\src\spambayes\Outlook2000\msgstore.py", line 241, in GetFolder
  File "E:\src\spambayes\Outlook2000\msgstore.py", line 194, in NormalizeID
exceptions.AssertionError: What kind of ID is 'None'?
_____________________________________
spambayes2.log

SpamAddin - Connecting to Outlook
Loaded bayes database from 'C:\Documents and Settings\jbeard\Application Data\SpamBayes\default_bayes_database.db'
Loaded message database from 'C:\Documents and Settings\jbeard\Application Data\SpamBayes\default_message_database.db'
Bayes database initialized with 84 spam and 42 good messages
Loaded databases in 5.17775ms
pythoncom error: Python error invoking COM method.
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 342, in OnClick
  File "E:\src\spambayes\Outlook2000\msgstore.py", line 241, in GetFolder
  File "E:\src\spambayes\Outlook2000\msgstore.py", line 194, in NormalizeID
exceptions.AssertionError: What kind of ID is 'None'?
SpamAddin - Disconnecting from Outlook
Bayes database is not dirty - not writing
Addin terminating: 1 COM client and 2 COM servers exist.
_____________________________________
spambayes3.log

Registered: SpamBayes.OutlookAddin
_____________________________________
spambayes4.log

warning: use func(*args, **kwargs) instead of apply(func, args, kwargs)
warning: use func(*args, **kwargs) instead of apply(func, args, kwargs)
warning: use func(*args, **kwargs) instead of apply(func, args, kwargs)
Unregistered: SpamBayes.OutlookAddin
Traceback (most recent call last):
  File "", line 22, in DllUnregisterServer
  File "E:\src\spambayes\Outlook2000\addin.py", line 791, in UnregisterAddin
  File "E:\src\Installer\iu.py", line 296, in importHook
ImportError: No module named _winreg
_____________________________________

From jbeard at heritageacademies.com Tue May 20 11:30:12 2003
From: jbeard at heritageacademies.com (Jeremy Beard)
Date: Tue May 20 11:03:12 2003
Subject: [Spambayes] RE: DISREGARD - bug? - when I click the "Delete as Spam" button, nothing happens
Message-ID: <48E0B1D75756CB4895FF6CC18E882F1F04701858@nhamail2.nhacentral.com>

Hello, sorry, but this seems to have been my fault. After I reinstalled, I
failed to "Define Filters", which is why nothing was happening when I
clicked "Delete as Spam". The default setting is to do nothing.
(But I swear it wasn't working the first time I installed...and I did set
the filters...oh well...it's working now...)

So I guess the only issue is the "Please insert disk" issue during
installation.

Thanks again.
Jeremy Beard

From popiel at wolfskeep.com Tue May 20 09:07:06 2003
From: popiel at wolfskeep.com (T. Alexander Popiel)
Date: Tue May 20 11:07:10 2003
Subject: [Spambayes] sharing split database
In-Reply-To: Message from "Brad Clements" of "Tue, 20 May 2003 10:28:34 EDT." <3ECA0352.24844.2F62321@localhost>
References: <3ECA0352.24844.2F62321@localhost>
Message-ID: <20030520150706.3248F2DDDA@cashew.wolfskeep.com>

In message: <3ECA0352.24844.2F62321@localhost> "Brad Clements" writes:
>Do you suppose that a majority of the 'words' in the database are common
>between different users, but only the ham/spam weights are different?

This is moderately likely, particularly with spam words and common English
(articles, basic verbs, etc.) being shared in common among all users.
Technical words specific to ham will be less likely shared.

>I'd like to consider a 'site-wide' spambayes implementation (ie, like
>for 7000 accounts), but do this by splitting the database into two
>componets. The word list, which is global, and the ham/spam weights,
>which are personal.

I'm not sure that this would gain anything; in order to make the weights
usable, they need to be indexed by word, which effectively puts the words
right back in with them. I suppose you could assign each word a unique hash
key or something, and then index the weights by that... but that seems like
it would be a marginal savings (4 bytes of hash key vs. an average of about
9 bytes of actual word is likely to get lost in the indexing overhead... and
then you have to do two lookups for each word (one to get the hash key and
another to get the weights)).

>I'd like to collect word lists from a bunch of sources to see how many
>of them are common.
>
>Are there any privacy issues that would make collecting word lists (but
>not weights) objectionable to this group?

I don't have any particular objection to this... do you have a snippet of
code to extract the wordlist from a db, for those of us too lazy to come up
with it on our own?

- Alex

From tim.one at comcast.net Tue May 20 12:13:01 2003
From: tim.one at comcast.net (Tim Peters)
Date: Tue May 20 11:17:32 2003
Subject: [Spambayes] Heads up! Tokenizer changes
In-Reply-To:
Message-ID:

Another new checkin just now taught the tokenizer how to decode numeric
character entities; here's the checkin msg:

"""
Digging into a pile of high-scoring Unsures showed this trick:

    your septic system

as a way to disguise "your septic system". Bite the bullet and decode
numeric character entities. Also replace <br> and <p> tags with single
blanks, since browsers break text visually when they see one of these.
"""

I found this common in "septic tank", "Russian women want to marry you", and
"accept credit cards" spam. Against my database, these were scoring as low
spam or high unsure (at spam_cutoff 0.8). Most score as high spam now, and
without training on them.

although-nothing-makes-the-system-faster-ly y'rs - tim

From skip at pobox.com Tue May 20 11:20:48 2003
From: skip at pobox.com (Skip Montanaro)
Date: Tue May 20 11:21:14 2003
Subject: [Spambayes] sharing split database
In-Reply-To: <20030520150706.3248F2DDDA@cashew.wolfskeep.com>
References: <3ECA0352.24844.2F62321@localhost> <20030520150706.3248F2DDDA@cashew.wolfskeep.com>
Message-ID: <16074.18384.84835.229664@montanaro.dyndns.org>

    Brad> Are there any privacy issues that would make collecting word lists
    Brad> (but not weights) objectionable to this group?

My "word list" contains a lot of domain names and email addresses like

    from:name:concertmaster@musi-cal.com

I imagine that sort of stuff would be sensitive to some people. I think if
you want a shared word list you'll have to be selective about what's shared.

    Alex> I don't have any particular objection to this... do you have a
    Alex> snippet of code to extract the wordlist from a db, for those of us
    Alex> too lazy to come up with it on our own?

    import anydbm
    db = anydbm.open("hammie.db")
    for k in db.keys():
        print k

Obviously, this could get more elaborate (eliminating one thing or another),
but that's the basic idea.

Skip

From EVANS at kkl.com Tue May 20 12:26:50 2003
From: EVANS at kkl.com (Evans, Matthew)
Date: Tue May 20 11:28:37 2003
Subject: [Spambayes] Problem with Outlook XP Nicknames
Message-ID: <2DC52032AC98C94FAC9865863547E1B75DB219@killington.kkl.com>

I just installed SpamBayes this morning and noticed that the Nicknames
feature (when you start typing a name in the To: field and it
auto-completes) now just shows up as a blue box with a single quote in it (').
It still appears to work, but you can't see the list of names any longer. Has anyone else experienced this issue? SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\evans\Application Data\SpamBayes\default_configuration.pck' Loaded bayes database from 'C:\Documents and Settings\evans\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\evans\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 190.524ms Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() 
handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated Checked 541 in folder Inbox - 541 new entries found. Checked 309 in folder Spam - 309 new entries found. E:\src\python-cvs\lib\fcntl.py:7: DeprecationWarning: the FCNTL module is deprecated; please use fcntl Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: 
Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File 
"E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in 
importHook ImportError: No module named timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated AntiSpam: Watching for new messages in folder Inbox AntiSpam: Watching for new messages in folder Spam Spam filtering is disabled - ignoring new message Deleting and spam training message 'Self Test Software Account Information' - trained as spam Deleting and spam training message 'Start saving money now.' 
- trained as spam Spam filtering is disabled - ignoring new message AntiSpam: Watching for new messages in folder Inbox AntiSpam: Watching for new messages in folder Spam Message 'RE: Update 3/19/03' had a Spam classification of 'No' Message 'RE: Update 3/19/03' had a Spam classification of 'No' Message 'FW: BACKUP rfsf: success' had a Spam classification of 'No' Message 'Another Option for Spam Detection: SpamBayes' had a Spam classification of 'Unsure' Deleting and spam training message '[BULK] - $115 Cosmetic Collection for only 1 dollar - from Cosmetique' - trained as spam Deleting and spam training message '[BULK] - AOL Yellow Pages Offer' - trained as spam Recovering to folder 'Inbox' and ham training message 'Another Option for Spam Detection: SpamBayes' - trained as ham Training on message 'Another Option for Spam Detection: SpamBayes' - already was trained as good Message 'RE: Beta Update' had a Spam classification of 'No' Message 'FW: Worldox/Outlook toolbar' had a Spam classification of 'No' pythoncom error: Python error invoking COM method. 
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 203, in OnItemAdd
  File "E:\src\spambayes\Outlook2000\addin.py", line 164, in ProcessMessage
  File "E:\src\spambayes\Outlook2000\msgstore.py", line 461, in GetSubject
  File "E:\src\spambayes\Outlook2000\msgstore.py", line 699, in GetField
pywintypes.com_error: (-2147221239, 'OLE error 0x80040109', None, None)
Deleting and spam training message '[BULK] - Technology UPDATE: Backup Strategies, May 20, 2003' - trained as spam
Message 'FW: Worldox/Outlook toolbar' had a Spam classification of 'No'
Message 'Paypal Security Measures' had a Spam classification of 'Yes'

Matthew Evans
Kraft Kennedy & Lesser
360 Lexington Ave. 10th Floor
New York, NY 10017
(212)692-5608 Direct
(212)692-5708 Fax
(212)986-4700 General
evans@kkl.com Email

From mail at peterbe.com Tue May 20 16:59:30 2003
From: mail at peterbe.com (Peter Bengtsson)
Date: Tue May 20 11:43:17 2003
Subject: [Spambayes] Suddenly a lot of errors
In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3103@its-xchg4.massey.ac.nz>
Message-ID: <5.2.0.9.0.20030520155847.00c489a8@mail.grenna.net>

At 16:01 2003-05-20 +1200, Meyer, Tony wrote:
> > >The problem is that these default to the current working directory,
> > >which, obviously, changes. The one with the information will be the
> > >42Mb one (which is pretty big; you must have trained an awful lot).
> > Not really about 3000 ham and 400 spam. I'm now just trying
> > to train more spam.
>
>Someone else speak up if this is wrong, but 42Mb seems huge for a 3400
>message database. Are you using a pickle or a dbm?

"pickle or a dbm"??? I have no idea. What should it be and how do you change?
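
One way to answer the "pickle or a dbm" question for an existing file is to ask Python itself. The following is only a minimal sketch in present-day Python 3, not SpamBayes code: `guess_store_type` is a hypothetical helper, and it assumes nothing beyond the standard library's `dbm.whichdb` and `pickle` modules. It says nothing about how SpamBayes chooses or switches its storage.

```python
import dbm
import pickle


def guess_store_type(path):
    """Guess whether the file at `path` is a dbm database or a pickle.

    Hypothetical helper for illustration only; not part of SpamBayes.
    Returns 'dbm:<module name>', 'pickle', or 'unknown'.
    """
    # whichdb returns None if the file is missing or unreadable, '' if the
    # format is not recognised, or the name of the dbm module that made it.
    kind = dbm.whichdb(path)
    if kind:
        return "dbm:" + kind
    try:
        # Only unpickle files you created yourself: loading a pickle can
        # execute arbitrary code.
        with open(path, "rb") as f:
            pickle.load(f)
        return "pickle"
    except Exception:
        return "unknown"
```

Calling `guess_store_type` on the database file a setup actually uses gives a first hint about which kind of store is behind it.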

> > The latest CVS was quite different from the released gzip.
>
>Absolutely. There are plans for a new release (alpha3, probably) RSN;
>it's just finding time to do what's needed.
>
>=Tony Meyer
>
>_______________________________________________
>Spambayes mailing list
>Spambayes@python.org
>http://mail.python.org/mailman/listinfo/spambayes

From bill at parducci.net Tue May 20 09:48:40 2003
From: bill at parducci.net (bill parducci)
Date: Tue May 20 11:48:44 2003
Subject: [Spambayes] sharing split database
References: <3ECA0352.24844.2F62321@localhost> <20030520150706.3248F2DDDA@cashew.wolfskeep.com> <16074.18384.84835.229664@montanaro.dyndns.org>
Message-ID: <3ECA4E58.2050606@parducci.net>

Skip Montanaro wrote:
> My "word list" contains a lot of domain names and email addresses like
>
> from:name:concertmaster@musi-cal.com
>
> I imagine that sort of stuff would be sensitive to some people. I think if
> you want a shared word list you'll have to selective about what's shared.

although, if shared words were harvested automatically (e.g. 'if word/weight
not found, insert word...'), the only 'local' information would be the
key/weight pair. as long as the shared words db didn't maintain source info
(don't know why it would... although insert date would be nice for freshness
management :o) ) the information would be anonymous.

also, email addresses, required to be sent in the clear over the wire, are
unlikely to be sensitive.

that just leaves the payload itself and since, by definition, everything is
broken up into tokens, context should be pretty much obscured.
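
The split discussed in this thread (a shared word list, personal weights, and words inserted on first sight) could be sketched roughly as follows. This is a toy illustration in present-day Python 3, not SpamBayes code: the class `SplitStore` and its methods are hypothetical, and plain dicts stand in for whatever on-disk store a real site-wide setup would use. The shared table holds only words, with no record of who contributed them; each user keeps spam/ham counts keyed by word id, which is where the two lookups per word come from.

```python
class SplitStore:
    """Toy split database: one shared word table, per-user count tables."""

    def __init__(self):
        self.word_ids = {}      # shared: word -> integer id
        self.user_counts = {}   # personal: user -> {word id -> [spam, ham]}

    def word_id(self, word):
        # "If word not found, insert word": ids are handed out on first
        # use, and nothing records which user caused the insert.
        return self.word_ids.setdefault(word, len(self.word_ids))

    def train(self, user, tokens, is_spam):
        counts = self.user_counts.setdefault(user, {})
        # Count each distinct token once per message.
        for word in set(tokens):
            record = counts.setdefault(self.word_id(word), [0, 0])
            record[0 if is_spam else 1] += 1

    def counts(self, user, word):
        # Two lookups per word: the shared id, then the personal record.
        wid = self.word_ids.get(word)
        if wid is None:
            return (0, 0)
        return tuple(self.user_counts.get(user, {}).get(wid, [0, 0]))


store = SplitStore()
store.train("alice", ["viagra", "meeting"], is_spam=True)
store.train("bob", ["meeting", "python"], is_spam=False)
```

Here "meeting" occupies one slot in the shared table while alice and bob each hold their own counts for it.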

b

From skip at pobox.com Tue May 20 12:07:32 2003
From: skip at pobox.com (Skip Montanaro)
Date: Tue May 20 12:08:03 2003
Subject: [Spambayes] sharing split database
In-Reply-To: <3ECA4E58.2050606@parducci.net>
References: <3ECA0352.24844.2F62321@localhost>
    <20030520150706.3248F2DDDA@cashew.wolfskeep.com>
    <16074.18384.84835.229664@montanaro.dyndns.org>
    <3ECA4E58.2050606@parducci.net>
Message-ID: <16074.21188.153473.906169@montanaro.dyndns.org>

    bill> also, email addresses, required to be sent in the clear over the
    bill> wire, are unlikely to be sensitive.

I don't know about that. If I send you a message I have a reasonable
expectation that you aren't going to contribute my email address to a
database shared by many people, even if the message I send you is
transmitted over the internet in the clear. The chance that my email
address might be harvested out of a database where it sits waiting for
someone to come along and grab it is much greater than during its fleeting
existence on the net.

The nature of how gossip works suggests to me that the sensitivity of
personal information might actually be more important in a small community
where everyone knows one another. For example: "I know Bill uses spambayes
and contributes to the word list database. Helen's email address just
showed up in the database. I wonder if Bill and Helen are secret lovers?",
and thus the rumor mill's wheels begin to grind.

We always tend to think that "big brother" is who we have most to fear. I
suspect in most of our daily lives it's actually our next door neighbor.

    bill> that just leaves the payload itself and since, by definition,
    bill> everything is broken up into tokens, context should be pretty much
    bill> obscured.

Hmmm... Something still doesn't seem right here, but it's just a gut
feeling. Perhaps someone else will be able to put their finger on what it
is.

Skip

From bill at parducci.net Tue May 20 10:32:52 2003
From: bill at parducci.net (bill parducci)
Date: Tue May 20 12:32:55 2003
Subject: [Spambayes] sharing split database
References: <3ECA0352.24844.2F62321@localhost>
    <20030520150706.3248F2DDDA@cashew.wolfskeep.com>
    <16074.18384.84835.229664@montanaro.dyndns.org>
    <3ECA4E58.2050606@parducci.net>
    <16074.21188.153473.906169@montanaro.dyndns.org>
Message-ID: <3ECA58B4.9000601@parducci.net>

Skip Montanaro wrote:
> I don't know about that. If I send you a message I have a reasonable
> expectation that you aren't going to contribute my email address to a
> database shared by many people, even if the message I send you is
> transmitted over the internet in the clear. The chance that my email
> address might be harvested out of a database where it sits waiting for
> someone to come along and grab it is much greater than during its fleeting
> existence on the net.

...someone with *query* access to the database. that seems about as likely
as someone having read access to your mail spool, et al. file(s) on the
server (where supposition isn't necessary, the mail can just be read).

> We always tend to think that "big brother" is who we have most to fear. I
> suspect in most of our daily lives it's actually our next door neighbor.

the admin next door, maybe, but if this is written correctly, the neighbor
shouldn't have a clue. the best that your neighbor could do would be to
prospect the database by sending repeated individual token requests. not
sure what could be derived from that.

b

From Daimon.Sweeney at attbi.com Tue May 20 10:33:40 2003
From: Daimon.Sweeney at attbi.com (Daimon Sweeney)
Date: Tue May 20 12:37:05 2003
Subject: [Spambayes] humble suggestion
Message-ID: <5.2.1.1.2.20030520092012.03095cc0@mail.attbi.com>

Hi,

I just saw your site, and am delighted to see the Bayesian approach
evolving. I have only one hesitation about the presentation, and that is
linguistic and philosophical.

There are many people to whom the word "ham" may be viscerally offensive,
such as Jews, Hindus, and vegetarians. What is that, a couple of billion
people? I realize this is hard for those who enjoy ham to realize, but
substitute "termite grubs" or even "tofu" for "ham" and you may realize a
similar effect. Termite grubs are prized delicacies in some places.

Identifying "ham" and "desirable" for these people simply doesn't work
conceptually, so it creates a reflexive barrier to your idea
regardless of the intellectual merit.

The idea of consuming large amounts of ham (acceptable email) is a real
stomach churning turnoff, personally, so I hope you find another way to
distinguish from spam, where the anti-meat association is entirely
appropriate.

In any case, I applaud your research on ways to better distinguish spam
from more desirable email.

Best regards,
Daimon sweeney

From tim.one at comcast.net Tue May 20 14:00:40 2003
From: tim.one at comcast.net (Tim Peters)
Date: Tue May 20 13:02:12 2003
Subject: [Spambayes] humble suggestion
In-Reply-To: <5.2.1.1.2.20030520092012.03095cc0@mail.attbi.com>
Message-ID:

[Daimon Sweeney]
> I just saw your site, and am delighted to see the Bayesian approach
> evolving. I have only one hesitation about the presentation, and that is
> linguistic and philosophical.
>
> There are many people to whom the word "ham" may be viscerally offensive,
> such as Jews, Hindus, and vegetarians. What is that, a couple of billion
> people? I realize this is hard for those who enjoy ham to realize, but
> substitute "termite grubs" or even "tofu" for "ham" and you may realize a
> similar effect. Termite grubs are prized delicacies in some places.
>
> Identifying "ham" and "desirable" for these people simply doesn't work
> conceptually, so it creates a reflexive barrier to your idea
> regardless of the intellectual merit.

I don't worry about it.
This is the second message over the life of this
project from someone worried about someone *else* getting offended, but we
still haven't heard from someone who actually is offended. Current
spambayes users certainly include people from all the groups you mentioned.

Most people have *some* sense of humor. For those who don't, the source
code is free for the taking, and someone who cares enough is welcome to
release a derivative work from which all thought-to-be potentially
offensive phrases have been purged. They should probably note that SPAM is
a registered trademark of Hormel Foods Corporation too -- I've heard that
some people even like the stuff .

From gmino at pcsltd.com Mon May 19 02:55:20 2003
From: gmino at pcsltd.com (Gabriel Mino)
Date: Tue May 20 13:33:29 2003
Subject: [Spambayes] humble suggestion
In-Reply-To: <3261E796E368954CB22963F2B63E8105333EAD@xmail.pcsltd.com>
Message-ID: <3261E796E368954CB22963F2B63E8105137729@xmail.pcsltd.com>

* VOMITS

The idea of consuming large amounts of ham (acceptable email) is a real
stomach churning turnoff, personally, so I hope you find another way to
distinguish from spam, where the anti-meat association is entirely
appropriate.>>>>>

the evils of political correctness

people need to get a grip.....

# sh ./sarcasm.sh <

there was a guy on the train today who stunk really bad...maybe I should
stop riding the train? maybe, I ought to invite him over to use my shower?

such as Jews, Hindus, and vegetarians>>>>>

most of my friends that fall into these categories also DON'T fall into
this one.... stoopid

-----Original Message-----
From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Daimon Sweeney
Sent: Tuesday, May 20, 2003 12:34 PM
To: SpamBayes@python.org
Subject: [Spambayes] humble suggestion

Hi,

I just saw your site, and am delighted to see the Bayesian approach
evolving. I have only one hesitation about the presentation, and that is
linguistic and philosophical.

There are many people to whom the word "ham" may be viscerally offensive,
such as Jews, Hindus, and vegetarians. What is that, a couple of billion
people? I realize this is hard for those who enjoy ham to realize, but
substitute "termite grubs" or even "tofu" for "ham" and you may realize a
similar effect. Termite grubs are prized delicacies in some places.

Identifying "ham" and "desirable" for these people simply doesn't work
conceptually, so it creates a reflexive barrier to your idea regardless of
the intellectual merit.

The idea of consuming large amounts of ham (acceptable email) is a real
stomach churning turnoff, personally, so I hope you find another way to
distinguish from spam, where the anti-meat association is entirely
appropriate.

In any case, I applaud your research on ways to better distinguish spam
from more desirable email.

Best regards,
Daimon sweeney

_______________________________________________
Spambayes mailing list
Spambayes@python.org
http://mail.python.org/mailman/listinfo/spambayes

From zshepherd at alphadatasys.com Tue May 20 13:26:02 2003
From: zshepherd at alphadatasys.com (D. Zane Shepherd)
Date: Tue May 20 13:33:38 2003
Subject: [Spambayes] Installing Outlook Plugin
Message-ID: <000901c31ef4$e5dfaa80$c2cbd940@swbell.net>

I have downloaded the .exe install from your website and have attempted to
install it. The installation appears to complete normally but nothing
appeared on the Outlook Standard Toolbar. I followed the instructions in
the "Addin doesn't load" section of troubleshooting. The SpamBayes addin is
listed but it will not retain the check mark in the box after closing the
dialog box. I have completely uninstalled and reinstalled SpamBayes with
the same result. Attached is the most recent log file.

Thanks for the help.

Zane Shepherd
Dallas, TX
-------------- next part --------------
A non-text attachment was scrubbed...
Name: spambayes4.log
Type: application/octet-stream
Size: 1808 bytes
Desc: not available
Url : http://mail.python.org/pipermail/spambayes/attachments/20030520/22ac65a8/spambayes4.obj

From tim.one at comcast.net Tue May 20 14:18:13 2003
From: tim.one at comcast.net (Tim Peters)
Date: Tue May 20 13:35:09 2003
Subject: [Spambayes] sharing split database
In-Reply-To: <16074.21188.153473.906169@montanaro.dyndns.org>
Message-ID:

Note that people who shop online have all sorts of identifying info in their
email, including account numbers, passwords, email addresses, phone numbers,
mailing addresses, billing addresses, shipping addresses, birth dates,
Social Security number, mother's maiden name, and even credit card numbers
and expiration dates echoed back by clueless online merchants. Besides
poke-and-hope attacks, someone who has access to a shared database could
easily learn a lot by computing deltas across incremental training; the new
tokens that show up are likely to be correlated. Anyone thinking of sharing
a database has to be acutely aware of the risks to privacy.

From matos at attbi.com Tue May 20 14:34:37 2003
From: matos at attbi.com (David Matos)
Date: Tue May 20 13:35:16 2003
Subject: [Spambayes] RE: Problem with Outlook XP Nicknames
In-Reply-To:
Message-ID: <000001c31ef6$16964200$8d80b042@dexter>

FWIW, I have had the exact same problem. However, I *think* it may have
started before I installed the SpamBayes add-in; my memory is bad. Also,
strangely enough, the problem happens randomly, and I haven't seen it
happen for several days now. Very bizarre.

From bkc at murkworks.com Tue May 20 15:14:57 2003
From: bkc at murkworks.com (Brad Clements)
Date: Tue May 20 13:56:14 2003
Subject: [Spambayes] Spambayes gets two page spread in Infoworld
Message-ID: <3ECA3860.10117.3C564AF@localhost>

Wow, Jon Udell rates Spambayes for Outlook as "excellent" in this week's
Infoworld.

Bottom Line: This powerful anti-spam weapon works with Microsoft Outlook
filters and folders, trains on your own unique message database, and learns
by watching you, responding to both positive and negative clues. Most
important, it's immediately effective.

http://www.infoworld.com/article/03/05/16/20TCspam_1.html?s=tc

--
Brad Clements, bkc@murkworks.com (315)268-1000
http://www.murkworks.com (315)268-9812 Fax
http://www.wecanstopspam.org/ AOL-IM: BKClements

From bill at parducci.net Tue May 20 12:04:03 2003
From: bill at parducci.net (bill parducci)
Date: Tue May 20 14:04:08 2003
Subject: [Spambayes] sharing split database
References:
Message-ID: <3ECA6E13.8000806@parducci.net>

Tim Peters wrote:
> Note that people who shop online have all sorts of identifying info in their
> email, including account numbers, passwords, email addresses, phone numbers,
> mailing addresses, billing addresses, shipping addresses, birth dates,
> Social Security number, mother's maiden name, and even credit card numbers
> and expiration dates echoed back by clueless online merchants.
> Besides
> poke-and-hope attacks, someone who has access to a shared database could
> easily learn a lot by computing deltas across incremental training; the new
> tokens that show up are likely to be correlated. Anyone thinking of sharing
> a database has to be acutely aware of the risks to privacy.
>

anything beyond 'poke & hope' would require general query capabilities. if
weighting is maintained locally any call to the shared repository would be
of the form 'index = stoken(token)'. if the token doesn't exist in the
shared db, it is inserted and a new index is returned. a determined hacker
could figure out if a token already exists by comparing index numbers *if
the system used serial indexing* (which could be foiled by using a hashed
index) but beyond that, only the existence of the token could be derived.
is that valuable?
sure, but if you have enough information to poke & hope meaningfully in a
large community, then you likely have enough information to take action
directly.

that said, such design constraints make the shared db idea questionable WRT
size savings. (as pointed out earlier, but it took me a while to fully
grasp :o)

b

From tim.one at comcast.net Tue May 20 15:22:27 2003
From: tim.one at comcast.net (Tim Peters)
Date: Tue May 20 14:23:26 2003
Subject: [Spambayes] Spambayes gets two page spread in Infoworld
In-Reply-To: <3ECA3860.10117.3C564AF@localhost>
Message-ID:

[Brad Clements]
> Wow, Jon Udell rates Spambayes for Outlook as "excellent" in this
> week's Infoworld.
>
> ...
>
> http://www.infoworld.com/article/03/05/16/20TCspam_1.html?s=tc

Thanks for sharing the link, Brad! It's a positively ecstatic review, and
I'm delighted to see Mark Hammond get some of the credit he deserves for
the Outlook addin. Keep 'em coming .

From bkc at murkworks.com Tue May 20 15:47:34 2003
From: bkc at murkworks.com (Brad Clements)
Date: Tue May 20 14:29:04 2003
Subject: [Spambayes] sharing split database
In-Reply-To: <3ECA6E13.8000806@parducci.net>
Message-ID: <3ECA4005.31365.3E3403D@localhost>

On 20 May 2003 at 11:04, bill parducci wrote:

> that said, such design constraints make the shared db idea questionable WRT
> size savings. (as pointed out earlier, but it took me a while to fully
> grasp :o)

There are I guess two privacy issues at play here

1. collecting word statistics from members of this group to see just how
   much overlap there is among users.

2. privacy after deployment.

In the first case, since I'm only looking to determine the amount of
overlap and disjointness, I don't need the actual words, I could use a hash
of each word. Sure, the upload will be huge, but maybe not too bad after
running through gzip

In the second case.. We might not get that far if there's not enough
overlap to make it pay off.
;-)

--
Brad Clements, bkc@murkworks.com (315)268-1000
http://www.murkworks.com (315)268-9812 Fax
http://www.wecanstopspam.org/ AOL-IM: BKClements

From bkc at murkworks.com Tue May 20 15:48:45 2003
From: bkc at murkworks.com (Brad Clements)
Date: Tue May 20 14:40:07 2003
Subject: [Spambayes] Spambayes gets two page spread in Infoworld
In-Reply-To:
References: <3ECA3860.10117.3C564AF@localhost>
Message-ID: <3ECA404C.23943.3E45660@localhost>

On 20 May 2003 at 14:22, Tim Peters wrote:

> Thanks for sharing the link, Brad! It's a positively ecstatic review, and
> I'm delighted to see Mark Hammond get some of the credit he deserves for
> the Outlook addin. Keep 'em coming .
>

Jon Udell had a "pre-echo" blog post that also mentions Mark. Way to go
Mark!

http://weblog.infoworld.com/udell/2003/05/08.html#a684

--
Brad Clements, bkc@murkworks.com (315)268-1000
http://www.murkworks.com (315)268-9812 Fax
http://www.wecanstopspam.org/ AOL-IM: BKClements

From Daimon.Sweeney at attbi.com Tue May 20 12:44:46 2003
From: Daimon.Sweeney at attbi.com (Daimon Sweeney)
Date: Tue May 20 14:48:25 2003
Subject: [Spambayes] humble suggestion
In-Reply-To:
References: <5.2.1.1.2.20030520092012.03095cc0@mail.attbi.com>
Message-ID: <5.2.1.1.2.20030520104729.030b6558@mail.attbi.com>

>
> >
> > Identifying "ham" and "desirable" for these people simply doesn't work
> > conceptually, so it creates a reflexive barrier to your idea
> > regardless of the intellectual merit.
>
>I don't worry about it. This is the second message over the life of this
>project from someone worried about someone *else* getting offended, but we
>still haven't heard from someone who actually is offended.

Thanks for the polite response. I got another one of a different type.

If you don't care, neither do I, but you left out the paragraph where I
said I personally was, not offended, but put off, so maybe you can't say
that anymore. But I'll take back what I said about anyone else being
bothered.
Let the pork-avoiding masses speak for themselves, or forever
hold their noses.

I'm much more offended by your suggestion that I don't have a sense of
humor. Maybe I just don't get jokes that depend on shared opinions of the
relative merits of different kinds of meat. Oh well, that's a deficiency I
can live with. In fact, I'd be hard-pressed to think of something lower on
my priority list.

>Most people have *some* sense of humor.

The more I think of it, the more fascinating it is that what I said is
interpreted in terms of having a sense of humor. The other fellow referred
to this as well.

> SPAM is a registered trademark of Hormel Foods
>Corporation too -- I've heard that some people even like the stuff .

And I'm very happy for those people. I ate it myself as a child and
survived, to the regret of some. As if in another lifetime, I do remember
the anticipation with which I greeted the news that we were having SPAM for
dinner. Yuck. I so hope Hormel has given Monty Python a lifetime supply of
SPAM (tm) for publicity beyond anything they could have dreamed of.

I actually do want to apologize, not for my opinion, but because I didn't
realize the original email would be posted to a group. I was sent a link to
the site, glanced at it and thought I'd dash off a little consumer reaction
to the company behind the thing, not realizing it was an open source
project. My mistake. So I'll exit now and you can get on to more important
things, as will I.

Best regards, and I really do think the Bayesian approach is brilliant, and
wish you all success, however you define it,

Daimon Sweeney

From johnb at vidar.com Tue May 20 15:39:20 2003
From: johnb at vidar.com (John Baumgartner)
Date: Tue May 20 14:48:51 2003
Subject: [Spambayes] A question about your spam filter
Message-ID: <8542EF87B416E84986705E5057A1A5CC0FDD4E@mailhost02.vidar.corp>

Hello,

First, your filter is terrific. I get an average of 75.7 junk e-mails per
day. Your filter removes nearly all the spam.

My question: Is it possible to set the filter such that when it processes a
known spam, it clears the "new message" icon in my system tray? If that
could be done, I'd be checking my empty inbox a lot less often.

Thanks,
John B
Herndon, VA.

From dtarbell at bellatlantic.net Tue May 20 15:37:43 2003
From: dtarbell at bellatlantic.net (Dick Tarbell)
Date: Tue May 20 14:49:08 2003
Subject: [Spambayes] Addin does not load
Message-ID: <000201c31efe$e7c9e2c0$0d02a8c0@WorkGroup>

Spambayes addin is listed and not checked -- I check the box and click ok
-- I go back in and the box is not checked.

I installed it from starship.python.net/crew/mhammond/spambayes -- download
spambays-outlook-setup-002.exe

Below is the copy of the log

SpamAddin - Connecting to Outlook
pythoncom error: Failed to call the universal dispatcher
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch
  File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection
  File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager
  File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__
  File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory
  File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir
exceptions.LookupError: no codec search functions registered: can't find encoding
SpamAddin - Connecting to Outlook
pythoncom error: Failed to call the universal dispatcher
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch
  File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection
  File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager
  File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__
  File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory
  File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir
exceptions.LookupError: no codec search functions registered: can't find encoding
SpamAddin - Connecting to Outlook
pythoncom error: Failed to call the universal dispatcher
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch
  File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection
  File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager
  File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__
  File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory
  File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir
exceptions.LookupError: no codec search functions registered: can't find encoding
Registered: SpamBayes.OutlookAddin
SpamAddin - Connecting to Outlook
pythoncom error: Failed to call the universal dispatcher
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch
  File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection
  File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager
  File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__
  File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory
  File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir
exceptions.LookupError: no codec search functions registered: can't find encoding
SpamAddin - Connecting to Outlook
pythoncom error: Failed to call the universal dispatcher
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch
  File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection
  File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager
  File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__
  File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory
  File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir
exceptions.LookupError: no codec search functions registered: can't find encoding

From tim.one at comcast.net Tue May 20 15:59:06 2003
From: tim.one at comcast.net (Tim Peters)
Date: Tue May 20 15:01:28 2003
Subject: [Spambayes] humble suggestion
In-Reply-To: <5.2.1.1.2.20030520104729.030b6558@mail.attbi.com>
Message-ID:

[Daimon Sweeney]
> Thanks for the polite response. I got another one of a different type.
>
> If you don't care, neither do I, but you left out the paragraph where I
> said I personally was, not offended, but put off, so maybe you can't say
> that anymore. But I'll take back what I said about anyone else being
> bothered. Let the pork-avoiding masses speak for themselves, or forever
> hold their noses.

LOL! Let 'em eat pork.

> I'm much more offended by your suggestion that I don't have a sense of
> humor.

My apologies -- I didn't mean to suggest that you're humorless.
Spelled out, the Jews, Hindus and vegetarians I personally know who use
spambayes do have a sense of humor, and perhaps that's why they don't
express offense. I expect that most people do have a sense of humor
sufficient to avoid taking offense at playful puns. It's not like "ham" is
an ethnic slur; I've heard it applied pejoratively to bad actors, but
that's it.

> Maybe I just don't get jokes that depend on shared opinions of the
> relative merits of different kinds of meat.

Maybe not. "spam" was originally a contraction of "spiced ham", so the word
was intimately related to ham since its coinage.

> Oh well, that's a deficiency I can live with. In fact, I'd be hard-
> pressed to think of something lower on my priority list.

>> Most people have *some* sense of humor.

> The more I think of it, the more fascinating it is that what I said is
> interpreted in terms of having a sense of humor. The other fellow
> referred to this as well.

He can speak for myself (maybe -- I had a hard time understanding his
post). I was speculating as to why the target groups you speculated about
hadn't actually howled in protest yet.

> ...
> Best regards, and I really do think the Bayesian approach is brilliant,
> and wish you all success, however you define it,

Thanks! Feel free to use the spam filter, too -- that's partly how I
measure success here.

From skip at pobox.com Tue May 20 15:22:23 2003
From: skip at pobox.com (Skip Montanaro)
Date: Tue May 20 15:22:36 2003
Subject: [Spambayes] sharing split database
In-Reply-To: <3ECA4005.31365.3E3403D@localhost>
References: <3ECA6E13.8000806@parducci.net>
    <3ECA4005.31365.3E3403D@localhost>
Message-ID: <16074.32879.729010.620712@montanaro.dyndns.org>

    Brad> 1. collecting word statistics from members of this group to see
    Brad> just how much overlap there is among users.

    ...

    Brad> In the first case, since I'm only looking to determine the amount
    Brad> of overlap and disjointness, I don't need the actual words, I
    Brad> could use a hash of each word. Sure, the upload will be huge, but
    Brad> maybe not too bad after running through gzip

If there's much redundancy available for gzip to squeeze out after running
words through a hash function, it's not a very good hash function. ;-)

I just ran this:

    >>> import anydbm
    >>> import sha
    >>> db = anydbm.open("hammie.db")
    >>> f = open("key-hash", "w")
    >>> for k in db.keys():
    ...     o = sha.new(k)
    ...     f.write(o.digest())
    ...
    >>> f.close()
    >>> db.close()

The result is 7843220 bytes long (each digest is 20 bytes). Gzipping it
resulted in a slightly larger file (7844447 bytes). If you'd like it, I can
place it on the web for you to snatch.

Skip

From bkc at murkworks.com Tue May 20 17:21:59 2003
From: bkc at murkworks.com (Brad Clements)
Date: Tue May 20 16:03:20 2003
Subject: [Spambayes] sharing split database
In-Reply-To: <16074.32879.729010.620712@montanaro.dyndns.org>
References: <3ECA4005.31365.3E3403D@localhost>
Message-ID: <3ECA5626.32042.439B0C7@localhost>

On 20 May 2003 at 14:22, Skip Montanaro wrote:

> I just ran this:
>
>     >>> import anydbm
>     >>> import sha
>     >>> db = anydbm.open("hammie.db")
>     >>> f = open("key-hash", "w")
>     >>> for k in db.keys():
>     ...     o = sha.new(k)
>     ...     f.write(o.digest())
>     ...
>     >>> f.close()
>     >>> db.close()
>
> The result is 7843220 bytes long (each digest is 20 bytes). Gzipping it
> resulted in a slightly larger file (7844447 bytes). If you'd like it, I
> can place it on the web for you to snatch.

Yes please.

Okay everyone who's willing to contribute a hashed version of their
wordlist, please use sha as shown above and I'll compare the hashes.
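
The comparison Brad proposes could be sketched roughly as follows. This is
only an illustration under stated assumptions, not anything from the
spambayes source: the function names and the sample tokens are made up, the
input is taken to be a flat run of 20-byte SHA-1 digests like the key-hash
file from Skip's session, and the present-day hashlib module stands in for
the old sha module.

```python
import hashlib

DIGEST_SIZE = 20  # SHA-1 digests are 20 bytes, matching the key-hash layout


def hash_tokens(tokens):
    """Return the set of SHA-1 digests for an iterable of byte-string tokens."""
    return {hashlib.sha1(token).digest() for token in tokens}


def dump_digests(digests):
    """Concatenate digests into one flat byte string (hypothetical file body)."""
    return b"".join(sorted(digests))


def load_digests(blob):
    """Split a flat blob of concatenated 20-byte digests back into a set."""
    if len(blob) % DIGEST_SIZE:
        raise ValueError("blob length is not a multiple of the digest size")
    return {blob[i:i + DIGEST_SIZE] for i in range(0, len(blob), DIGEST_SIZE)}


def overlap_report(a, b):
    """Return (shared, only_a, only_b, jaccard) for two digest sets."""
    shared = len(a & b)
    union = len(a | b)
    jaccard = shared / union if union else 0.0
    return shared, len(a) - shared, len(b) - shared, jaccard


if __name__ == "__main__":
    # Two invented token lists standing in for two contributors' word lists.
    first = load_digests(dump_digests(hash_tokens(
        [b"viagra", b"python", b"subject:free"])))
    second = load_digests(dump_digests(hash_tokens(
        [b"viagra", b"subject:free", b"mortgage", b"meeting"])))
    print(overlap_report(first, second))
```

Because only digests are compared, the overlap count can be computed
without either side seeing the other's actual tokens, although a known
token can still be tested for by hashing it, which is the poke-and-hope
attack discussed earlier in the thread.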

You can upload via anonymous ftp if you wish to

ftp://ftp.murkworks.com/uploads/spambayes

Note that this is a drop-only location, you won't see the file after you
upload it and you cannot overwrite a file with the same name, so if you
choose to contribute your hash "list" via ftp upload, please select a
unique name and also send me a message off-list indicating you've uploaded
it.

Dual T3 connection here.

--
Brad Clements, bkc@murkworks.com (315)268-1000
http://www.murkworks.com (315)268-9812 Fax
http://www.wecanstopspam.org/ AOL-IM: BKClements

From agmsmith at rogers.com Tue May 20 17:17:57 2003
From: agmsmith at rogers.com (Alexander G. M. Smith)
Date: Tue May 20 16:19:04 2003
Subject: [Spambayes] humble suggestion
In-Reply-To:
Message-ID: <31093226068-BeMail@cr593174-a>

Tim Peters wrote on Tue, 20 May 2003 14:59:06 -0400:
> Maybe not. "spam" was originally a contraction of "spiced ham", so the
> word was intimately related to ham since its coinage.

I'm using "Genuine", "Spam" and "Uncertain" here. That's mostly because
spam and ham sound too much alike in the audio announcements I've got set
up for incoming mail.

- Alex

From skip at pobox.com Tue May 20 16:20:25 2003
From: skip at pobox.com (Skip Montanaro)
Date: Tue May 20 16:20:35 2003
Subject: [Spambayes] sharing split database
In-Reply-To: <3ECA5626.32042.439B0C7@localhost>
References: <3ECA4005.31365.3E3403D@localhost>
    <3ECA5626.32042.439B0C7@localhost>
Message-ID: <16074.36361.656807.874763@montanaro.dyndns.org>

    Brad> You can upload via anonymous ftp if you wish to
    Brad> ftp://ftp.murkworks.com/uploads/spambayes

Done.
Look for key-hash.skip

Skip

From ark3 at email.com Tue May 20 17:31:23 2003
From: ark3 at email.com (Abhay Saxena)
Date: Tue May 20 16:31:34 2003
Subject: [Spambayes] Training via IMAP Filter
In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130197953A@its-xchg4.massey.ac.nz>
Message-ID: <061D6EA2-8B02-11D7-B809-00039345DE8C@email.com>

On Wednesday, May 14, 2003, at 08:15 PM, Meyer, Tony wrote:

>> Can you explain the circumstances under which the unique ID can
>> change?
>
> Unfortunately, it can change whenever the server wants it to (blame
> Crispin, not me!). A good server wouldn't change them, but it is
> allowed to. The RFC says: [...]

Interesting. I've been thinking about this a little over the weekend. Most
IMAP-aware mail clients I use are able to keep a local cache of my folders'
contents to enable offline mail reading. How do they do this? I would
assume the programs are using IMAP uids to track the messages within a
folder. If, in practice, most IMAP servers allowed message uids to change
completely between sessions, this feature would be completely impractical.

Of course, the occasional instance of a message's IMAP uid changing isn't a
big deal; the mail client must simply delete the original version of the
message from its cache and cache the "new" message.

How would Spambayes deal with a similar situation? It would be as if the
user deleted a message and then a new message appeared that happened to
have the same content. I suppose this would cause a slight distortion in
the contents of hammie.db (how slight depending on how often IMAP uids
changed), but it shouldn't cause any major problems as far as I can see.
Thoughts?

> Please make sure that if you figure out the problem you submit a
> patch, though!

I will, if I ever get there. Training Spambayes (which I can only do using
hammie.py or mboxtrain.py right now) is kind of annoying, so I haven't done
any training since the initial training I did on about 200 ham and 200
spam.
Despite the lack of ongoing training, Spambayes has been working extremely well. No immediate incentive... but I'll get to it soon anyhow. - Abhay From ark3 at email.com Tue May 20 17:40:04 2003 From: ark3 at email.com (Abhay Saxena) Date: Tue May 20 16:40:10 2003 Subject: Failsafe (was Re: [Spambayes] Does anyone care about this report?) In-Reply-To: <20030514205125.354512DE88@cashew.wolfskeep.com> Message-ID: <3C473AAB-8B03-11D7-B809-00039345DE8C@email.com> On Wednesday, May 14, 2003, at 04:51 PM, T. Alexander Popiel wrote: > [...] The failsafe nature is not really obvious. For the first part, > standard procmail holds that if a filter (and I'm using spambayes as a > filter) exits with a non-zero exit code, then the original mail is > retained instead of the filtered version. Ah, that was the part I needed to know. Thanks. So now that I'm not worried about losing email to Spambayes crashes, the only thing I really have to worry about is malformed messages (which cause hammiefilter.py to output what is essentially an empty message). I don't suppose we can assume that hammiefilter.py should always output something bigger than its input, since minor whitespace shuffling or whatever might decrease the size of a large message more than adding a header increases it. But can we somehow notice when hammiefilter.py significantly reduces the size of a message (say by 10% or more) and fall back to the original in that case? I'm not sure whether this is something we want to do in Python or Procmail (for those of us using Procmail). - Abhay From popiel at wolfskeep.com Tue May 20 14:50:32 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Tue May 20 16:50:34 2003 Subject: Failsafe (was Re: [Spambayes] Does anyone care about this report?) In-Reply-To: Message from Abhay Saxena of "Tue, 20 May 2003 16:40:04 EDT." 
<3C473AAB-8B03-11D7-B809-00039345DE8C@email.com> References: <3C473AAB-8B03-11D7-B809-00039345DE8C@email.com> Message-ID: <20030520205032.0E9482DDDA@cashew.wolfskeep.com> In message: <3C473AAB-8B03-11D7-B809-00039345DE8C@email.com> Abhay Saxena writes: > >I don't suppose we can assume that hammiefilter.py should always output >something bigger than its input, since minor whitespace shuffling or >whatever might decrease the size of a large message more than adding a >header increases it. But can we somehow notice when hammiefilter.py >significantly reduces the size of a message (say by 10% or more) and >fall back to the original in that case? I'm not sure whether this is >something we want to do in Python or Procmail (for those of us using >Procmail). Better would be for hammie to annotate the message outside the (fallible) auspices of the email package, much as pop3proxy does. Making a small munger that only dealt with adding a header at the top and filtering out same-named preexisting headers is a much simpler problem than actually representing logical contents of the message. - Alex (too lazy to fix it at the moment) From bejm at eeg.ccf.org Tue May 20 18:47:26 2003 From: bejm at eeg.ccf.org (bejm@eeg.ccf.org) Date: Tue May 20 18:09:49 2003 Subject: [Spambayes] Unix email Message-ID: <20030520214726.GB1283@gateway.eeg.ccf.org> Gents: does SpalBayes work with good ol' plain-vanilla text-based Unix-style Micro$oft-macro-proof email? -- Mark -- Mark D. Bej, M.D. bejm@eeg.ccf.org Midwest Neuroscience. Inc. 1 E. Main St., Suite 200A Norwalk, Ohio 44857-1512 (419) 663-0020 From bill at parducci.net Tue May 20 16:19:00 2003 From: bill at parducci.net (bill parducci) Date: Tue May 20 18:19:07 2003 Subject: [Spambayes] Unix email References: <20030520214726.GB1283@gateway.eeg.ccf.org> Message-ID: <3ECAA9D4.8080207@parducci.net> a typical installation involves a simple procmail recipe that kicks off spambayes and another to see if you have spam. 
spambayes, being written in python, runs on a number of platforms. b bejm@eeg.ccf.org wrote: > Gents: > > does SpalBayes work with good ol' plain-vanilla text-based Unix-style > Micro$oft-macro-proof email? > > -- > Mark > > From skip at pobox.com Tue May 20 18:20:28 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 20 18:20:37 2003 Subject: [Spambayes] Unix email In-Reply-To: <20030520214726.GB1283@gateway.eeg.ccf.org> References: <20030520214726.GB1283@gateway.eeg.ccf.org> Message-ID: <16074.43564.478224.708650@montanaro.dyndns.org> Mark> does SpalBayes work with good ol' plain-vanilla text-based Mark> Unix-style Micro$oft-macro-proof email? Yes, there are plenty of us using it with procmail-based setups. There is a hammiefilter program which tucks into .procmailrc just fine. There is also a pop3proxy program which you can stick between your mail client and the real POP3 server. An IMAP filter is in alpha. For more details, check out http://spambayes.sf.net/ Click the "Applications" link. Skip From skip at pobox.com Tue May 20 18:23:50 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 20 18:23:59 2003 Subject: [Spambayes] Auto-reply to spambayes non-subscribers? Message-ID: <16074.43767.165.397425@montanaro.dyndns.org> I suspect Spambayes is going to attract more interest after Jon Udell's Infoworld article and we're going to continue to field basic questions such as the one I just replied to. Would it be possible to set up an auto-reply for spambayes@python.org which provides an auto-reply to non-subscribers' messages similar to the webmaster@python.org auto-reply? I'll gladly work on the reply text. 
Skip From noreply at sourceforge.net Tue May 20 16:25:05 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 20 18:25:10 2003 Subject: [Spambayes] [ spambayes-Bugs-740777 ] Binary Plugin v2 fails to install on Win2k with Outlook2kSR1 Message-ID: Bugs item #740777, was opened at 2003-05-20 22:25 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740777&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Stefan Franke (sfranke) Assigned to: Mark Hammond (mhammond) Summary: Binary Plugin v2 fails to install on Win2k with Outlook2kSR1 Initial Comment: Logfile says this: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Dokumente und Einstellungen\Franke.ABLETON\Anwendungsdaten\ SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py" , line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy .py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy .py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy .py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000 \manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000 \manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000 \manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? 
exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740777&group_id=61702 From noreply at sourceforge.net Tue May 20 16:27:41 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 20 18:27:46 2003 Subject: [Spambayes] [ spambayes-Bugs-740777 ] Binary Plugin v2 fails to install on Win2k with Outlook2kSR1 Message-ID: Bugs item #740777, was opened at 2003-05-20 22:25 Message generated for change (Comment added) made by sfranke You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740777&group_id=61702 Category: Outlook Group: None >Status: Deleted Resolution: None Priority: 5 Submitted By: Stefan Franke (sfranke) Assigned to: Mark Hammond (mhammond) Summary: Binary Plugin v2 fails to install on Win2k with Outlook2kSR1 Initial Comment: Logfile says this: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Dokumente und Einstellungen\Franke.ABLETON\Anwendungsdaten\ SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py" , line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy .py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy .py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy .py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000 \manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000 \manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000 \manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File 
"E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- >Comment By: Stefan Franke (sfranke) Date: 2003-05-20 22:27 Message: Logged In: YES user_id=431616 Sorry, this is the same one than item #740311 but on different platform ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740777&group_id=61702 From noreply at sourceforge.net Tue May 20 16:28:26 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 20 18:28:30 2003 Subject: [Spambayes] [ spambayes-Bugs-740311 ] Fails on XP Message-ID: Bugs item #740311, was opened at 2003-05-20 08:09 Message generated for change (Comment added) made by sfranke You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Rony Steelandt (bucodi) Assigned to: Nobody/Anonymous (nobody) Summary: Fails on XP Initial Comment: Installed the binary installer 002 on XP Pro No spamb buttons added Here is the logfile SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\rony.BUCODI\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File 
"E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- Comment By: Stefan Franke (sfranke) Date: 2003-05-20 22:28 Message: Logged In: YES user_id=431616 I have the same one with Outlook2000 (SR-1) on Win2k. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 From T.A.Meyer at massey.ac.nz Wed May 21 11:28:30 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 20 18:29:20 2003 Subject: [Spambayes] Auto-reply to spambayes non-subscribers? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB326F@its-xchg4.massey.ac.nz> > I suspect Spambayes is going to attract more interest after > Jon Udell's Infoworld article and we're going to continue to > field basic questions such as the one I just replied to. > Would it be possible to set up an auto-reply for > spambayes@python.org which provides an auto-reply to > non-subscribers' messages similar to the webmaster@python.org > auto-reply? I'll gladly work on the reply text. Definitely +1 on the idea. I think we need to update the webpage to note the problems with installing the binary on XP, as well, given that the article said that it works fine, when it really doesn't ;) I'll get to this later today if no-one else does. (Mark, do you want to add a note to your site about it? (It was the site linked)). 
=Tony Meyer From barry at python.org Tue May 20 23:32:51 2003 From: barry at python.org (Barry Warsaw) Date: Tue May 20 18:32:53 2003 Subject: [Spambayes] Re: Auto-reply to spambayes non-subscribers? In-Reply-To: <16074.43767.165.397425@montanaro.dyndns.org> References: <16074.43767.165.397425@montanaro.dyndns.org> Message-ID: <1053469938.3260.26.camel@barry> On Tue, 2003-05-20 at 18:23, Skip Montanaro wrote: > I suspect Spambayes is going to attract more interest after Jon Udell's > Infoworld article and we're going to continue to field basic questions such > as the one I just replied to. Would it be possible to set up an auto-reply > for spambayes@python.org which provides an auto-reply to non-subscribers' > messages similar to the webmaster@python.org auto-reply? I'll gladly work > on the reply text. You can set up an autoreply for all postings to spambayes@python.org (with a long grace period), or you can reject non-member postings outright and include a canned message in the rejection notice. You can't autoreply just to non-members (although that's a good use case :). Work on the text, decide how you want it to work, and I'll make it happen. -Barry From T.A.Meyer at massey.ac.nz Wed May 21 11:47:50 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 20 18:48:27 2003 Subject: [Spambayes] A question about your spam filter Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3283@its-xchg4.massey.ac.nz> > My question: > Is it possible to set the filter such that when it processes > a known spam, it clears the "new message" icon in my system > tray? If that could be done, I'd be checking my empty inbox a > lot less often. Please see: [ 648271 ] Code to remove the New Mail icon You might be able to use the code in that patch (although not if you are using the binary), but overall I believe that the consensus was that this was not a good idea. 
On the other hand, there is a feature request at the moment to (optionally) mark mail classified as spam as read, which might do this for you (it depends on how well behaved the tray icon is). This will get implemented sooner or later (although most likely not in time for the 003 binary release). =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 21 11:51:05 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 20 18:51:39 2003 Subject: [Spambayes] Problem with Outlook XP Nicknames Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3285@its-xchg4.massey.ac.nz> > I just installed SpamBayes this morning and noticed that the > Nicknames feature (when you start typing a name in the To: > field and it > auto-completes) now just shows up as a blue box with a single > quote in it ('). It still appears to work, but you can't see > the list of names any longer. Has anyone else experienced this issue? I definitely do not get this (Windows XP and Outlook 2002), and did not when I was using the plugin with Windows 2000 and Outlook 2000. What version of Windows/Outlook are you using? If you uninstall spambayes, does the problem go away? By the way, the trace with the "no module named timer" is a known bug, which will be fixed in the 003 binary release. =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 21 11:55:41 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 20 18:56:16 2003 Subject: [Spambayes] Suddenly a lot of errors Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB328D@its-xchg4.massey.ac.nz> > "pickle or a dbm"??? I have no idea. What should it be and > how do you change? Sorry, I should have been clearer. Unless you have changed the "persistent_use_database" option or are calling pop3proxy/imapfilter with the "-d" option, you will be using a database and not a pickle. There seems to be a general consensus that your database was pretty large for the number of messages trained. 
This might be a result of the messageinfo db problems (before there was an explicit path to it) - some messages may have been trained multiple times. It's up to you to decide whether this is a problem. The only effect it will have (besides the large db) is that those emails trained multiple times will have a greater weight. If they were 'good' examples of spam/ham, then that shouldn't matter much. If you value the hard drive space, then a retrain might be a good idea (if you have mail to retrain on). =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 21 12:02:00 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 20 19:03:31 2003 Subject: Failsafe (was Re: [Spambayes] Does anyone care about this report?) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3299@its-xchg4.massey.ac.nz> [Abhay] > So now that I'm not worried about losing email to Spambayes crashes, > the only thing I really have to worry about is malformed messages > (which cause hammiefilter.py to output what is essentially an empty > message). [Alex] > Better would be for hammie to annotate the message outside the > (fallible) auspices of the email package, much as pop3proxy > does. Making a small munger that only dealt with adding a > header at the top and filtering out same-named preexisting > headers is a much simpler problem than actually representing > logical contents of the message. pop3proxy has this problem too (IIRC it used to work outside the email package, but no longer does). IMO, we really need to catch the email exceptions and deal with the message then (i.e. fix the cause, not the effect). If we have no idea about how to fix the particular message, then we should leave it alone (pass it through unchanged in pop3proxy, and whatever the equivalent is with hammiefilter). =Tony Meyer (who is too busy to fix this one at the moment ;) From popiel at wolfskeep.com Tue May 20 17:20:57 2003 From: popiel at wolfskeep.com (T.
Alexander Popiel) Date: Tue May 20 19:21:01 2003 Subject: [Spambayes] Unix email In-Reply-To: Message from bejm@eeg.ccf.org of "Tue, 20 May 2003 17:47:26 EDT." <20030520214726.GB1283@gateway.eeg.ccf.org> References: <20030520214726.GB1283@gateway.eeg.ccf.org> Message-ID: <20030520232057.BBA0C2DDDA@cashew.wolfskeep.com> In message: <20030520214726.GB1283@gateway.eeg.ccf.org> bejm@eeg.ccf.org writes: > >Gents: > >does SpalBayes work with good ol' plain-vanilla text-based Unix-style >Micro$oft-macro-proof email? I use it with procmail and MH, so unless I'm hallucinating, that's a yes... - Alex From T.A.Meyer at massey.ac.nz Wed May 21 12:39:07 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 20 19:39:53 2003 Subject: [Spambayes] Spambayes error Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB32F3@its-xchg4.massey.ac.nz> > I'm currently running the spambayes filtering software and have been > running into a few problems. Here are some details on the problem. [...] > assert hamcount <= nham > AssertionError This is a really long-running problem. The database keeps track of the total number of ham/spam you have trained, as well as the count for each word. For some reason, sometimes the total count gets out of sync and isn't as high as it should be (sometimes it even gets to zero). I'd love to be able to fix this, but I have never had this problem myself (which leads me to suspect that it doesn't affect the Outlook plugin, since the majority of my mail goes through that), and so it's hard to track down. Anyway, to fix this: * run -e -D hammie.db -f hammie.txt * open hammie.txt and change the first two numbers to 81465 and 860909 (these are the maximum nham and hspam in your db). Actually, you can change them to anything larger than this.
* run -I -D hammie.db -f hammie.txt I don't recall if this was answered before, but as a fallback for when this happens, couldn't spambayes just (print a warning and) set the hamcount/spamcount to the largest nham and hspam? I don't see that this would have that detrimental an effect, but then I'm not a statistician. > python email package version: > (don't know that one offhand, nor how to check it) In IDLE/PythonWin's interpreter, type: import email email.__version__ (But it shouldn't matter in this case, this is just for future reference :) =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 21 12:48:30 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 20 19:49:09 2003 Subject: [Spambayes] sharing split database Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3308@its-xchg4.massey.ac.nz> > I'm not sure that this would gain anything; in order to make > the weights usable, they need to be indexed by word, which > effectively puts the words right back in with them. I > suppose you could assign each word a unique hash key or > something, and then index the weights by that... [...] Interesting that this came up at this time. There was a message the other day about using a SQL database behind spambayes (for the multiple user situation). The suggested tables did this - there was a table of tokens and the stats were referenced by token id. I implemented a storage class in this style, but it did seem at the time that it was unnecessary overhead. I presume the reasoning behind the design was the same reasoning in this thread. > I don't have any particular objection to this... do you have > a snippet of code to extract the wordlist from a db, for > those of us too lazy to come up with it on our own? I note that there is a code snippet that uses sha, but for (everyone's) future reference, it might be worth pointing out that you can use dbimpexp.py (in the main directory) to extract a db (or pickle) to a ` separated text file. 
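For future reference, the hashed-wordlist comparison discussed in this thread might look roughly like the sketch below. It is only an illustration under stated assumptions: `hash_tokens` and `shared_fraction` are made-up helper names, the token lists are invented, and it uses `hashlib.sha1` from current Python rather than the old `sha` module mentioned in the thread.

```python
import hashlib


def hash_tokens(tokens):
    """Return the set of hex SHA-1 digests for the given tokens.

    Only the digests would be shared, so the words themselves stay private.
    """
    return {hashlib.sha1(t.encode("utf-8")).hexdigest() for t in tokens}


def shared_fraction(mine, theirs):
    """Fraction of my hashed tokens that also appear in the other list."""
    if not mine:
        return 0.0
    return len(mine & theirs) / len(mine)


# Invented sample word lists, standing in for two people's databases.
alice = hash_tokens(["viagra", "python", "meeting", "subject:free"])
bob = hash_tokens(["viagra", "mortgage", "python"])

# Two of Alice's four tokens also occur in Bob's list.
print(shared_fraction(alice, bob))
```

Comparing sets of digests only tells you how much two word lists overlap; it says nothing about the counts attached to each word.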
=Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 21 12:50:26 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 20 19:51:02 2003 Subject: [Spambayes] sharing split database Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB330C@its-xchg4.massey.ac.nz> > Okay everyone who's willing to contribute a hashed version of > their wordlist, please > use sha as shown above and I'll compare the hashes. Are you only interested in our personal databases, or also ones that we use for testing (like the SpamAssassin corpus or Bruce G's collections)? =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 21 13:01:19 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 20 20:02:16 2003 Subject: [Spambayes] Training via IMAP Filter Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3324@its-xchg4.massey.ac.nz> > Most IMAP-aware mail clients I use are able to keep a local > cache of my folders' contents to enable offline mail reading. How do they > do this? You'd have to ask them ;) There are alternatives to the UIDs - hashes of the message, for example. UIDs would be a reasonable bet for many MUAs, though. > I would assume the programs are using IMAP uids to track the messages > within a folder. If, in practice, most IMAP servers allowed message > uids to change completely between sessions, this feature would be > completely impractical. True. I'm not saying that most do - if I were implementing an IMAP server I certainly would not reset UIDs at any point. The potential for a problem is there, though, and it's not worth introducing something that may result in a bug later on (unless it is unavoidable). Consider, also, that the problem will only arise if you keep messages on the server for a long time. If you move them off into local store (or delete them), which many people probably do (ISPs limit space, for a start), then the problem doesn't arise.
> Of course, the occasional instance of a message's IMAP uid changing > isn't a big deal; the mail client must simply delete the original > version of the message from its cache and cache the "new" > message. Except that you don't get a nice "the uid of this message id is now id2" message. There isn't any way to determine if you are looking at a new message or an old message with a new id. So you can't delete the original. > How would Spambayes deal with a similar situation? It would > be as if the user deleted a message and then a new message appeared that > happened to have the same content. I suppose this would cause a slight > distortion in the contents of hammie.db (how slight depending on how > often IMAP uids changed), but it shouldn't cause any major problems > as far as I can see. If the uids changed every time you connected to the server (a definite possibility), then you would retrain on *all* mail every time you connected. Not a good thing. > Thoughts? I tried to use the uids as long as I could, because making our own seemed unnecessary overhead. Given that these problems could occur, though, I don't see the point of introducing a potential problem in the hope that it will only have a minor effect. The solution at the moment (rewriting messages with a unique id) works. It has one drawback, and that is that the IMAP messages are immutable, so the message has to be deleted (except that IMAP messages can't be deleted, just marked for deletion). This means lots of "marked as deleted" messages, and potentially big problems if something goes wrong with the read/write process. This drawback doesn't seem significant enough to me that the solution is invalidated. Let's face it; IMAP is a really ugly, poorly designed protocol. (Not that I'm claiming I could do a better job ;). 
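As an aside, the "hashes of the message" alternative mentioned above could be sketched along these lines. This is not what imapfilter does (it rewrites messages with its own id); `stable_id`, `should_train` and the in-memory `trained` set are hypothetical names for illustration, and the choice of headers to hash is an assumption.

```python
import email
import hashlib


def stable_id(raw_message):
    """Derive an id from message content, independent of any IMAP uid."""
    msg = email.message_from_string(raw_message)
    h = hashlib.sha1()
    # Hash a few headers that normally do not change on the server.
    for name in ("Message-ID", "Date", "From", "Subject"):
        h.update(str(msg.get(name, "")).encode("utf-8"))
        h.update(b"\0")
    payload = msg.get_payload()
    # Multipart messages return a list here; this sketch skips their bodies.
    if isinstance(payload, str):
        h.update(payload.encode("utf-8"))
    return h.hexdigest()


trained = set()  # stands in for a persistent record of trained messages


def should_train(raw_message):
    """Return True only the first time a given message content is seen."""
    key = stable_id(raw_message)
    if key in trained:
        return False
    trained.add(key)
    return True


raw = "Message-ID: <1@example.invalid>\nSubject: hello\n\nbody text\n"
print(should_train(raw))  # first sighting
print(should_train(raw))  # same content again, e.g. after a uid change
```

The obvious cost is that two genuinely distinct messages with identical hashed content would be treated as one, which is the same trade-off any client-side hash scheme makes.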
=Tony Meyer From tim.one at comcast.net Tue May 20 21:17:33 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue May 20 20:18:06 2003 Subject: [Spambayes] Auto-reply to spambayes non-subscribers? In-Reply-To: <16074.43767.165.397425@montanaro.dyndns.org> Message-ID: [Skip Montanaro] > I suspect Spambayes is going to attract more interest after Jon Udell's > Infoworld article Indeed, I've been home sick the last two days, and whenever I'm logged on there's another post from a non-member waiting for moderator approval. This used to be a twice-a-week thing; now it's a full-time job. > and we're going to continue to field basic questions such > as the one I just replied to. Would it be possible to set up an > auto-reply for spambayes@python.org which provides an auto-reply to > non-subscribers' messages similar to the webmaster@python.org > auto-reply? I'll gladly work on the reply text. That would be great! Of course I don't have a clue how to go about doing it. Barry? From tim.one at comcast.net Tue May 20 21:20:57 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue May 20 20:21:55 2003 Subject: [Spambayes] Trouble w/ Outlook buttons In-Reply-To: Message-ID: The attached came from a private correspondent. I don't recognize the symptoms. I urged them to open a bug report on SF, as I don't see anything particularly relevant there now. I'm *assuming* they used the binary installer, and I don't know which version of Outlook or OS is in use. Have asked for clarification. > -----Original Message----- > > ... > > It installed just fine, and I was able to pull up the > manager and configure it. > > I had double buttons, so I did a quick Google search and > saw that this problem had already been fixed, and I was > supposed to just "quit and restart" Outlook. > > Indeed, when I quit and restarted, I had the buttons on the > right hand only, and I was happy (or so I thought :-). > > The spam manager button was no longer "clickable", but it > wasn't "greyed" out either.
I simply couldn't press it. > > I uninstalled SB. When I launched Outlook, the buttons > were still there, and still didn't work. I reinstalled, > and I still have one set of "unpressable" buttons... > > Anyway, it seems that somehow when I quit and restarted > it, it knows to display the icons for the buttons, but > doesn't hook them up to anything. > > Do you know how I go about fixing this? If not, do you > know how I get rid of the buttons altogether? They are > now on the "standard toolbar", not their own... From bkc at murkworks.com Tue May 20 21:41:08 2003 From: bkc at murkworks.com (Brad Clements) Date: Tue May 20 20:22:56 2003 Subject: [Spambayes] sharing split database In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB330C@its-xchg4.massey.ac.nz> Message-ID: <3ECA92E3.1720.526F07F@localhost> On 21 May 2003 at 11:50, Meyer, Tony wrote: > Are you only interested in our personal databases, or also ones that we use > for testing? (like the SpamAssassin corpus or Bruce G's collections). I think both would be interesting, as long as I know which is personal (hence, real world) and which are test (not real world). With respect to dbimpexp.py, folks are sending sha hashes so they can keep their privacy. ;-) -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From tim.one at comcast.net Tue May 20 21:38:24 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue May 20 20:39:46 2003 Subject: [Spambayes] Spambayes error In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB32F3@its-xchg4.massey.ac.nz> Message-ID: [Richard Sprague] > I'm currently running the spambayes filtering software and have been > running into a few problems. Here are some details on the problem. > [...] > assert hamcount <= nham > AssertionError [Meyer, Tony] > This is a really long running problem.
The database keeps track of the > total number of ham/spam you have trained, as well as the count for each > word. For some reason, sometimes the total count gets out of sync and > isn't as high as it should be (sometimes it even gets to zero). I'd > love to be able to fix this, but I have never had this problem myself > (which leads me to suspect that it doesn't affect the Outlook plugin, > since the majority of my mail goes through that), and so it's hard to > track down. I've never seen it either, but also use the Outlook client (and exclusively). I'm surprised we haven't gotten another clue in all this time! Who's cheating? > Anyway, to fix this: > * run -e -D hammie.db -f hammie.txt > * open hammie.txt and change the first two numbers to 81465 and 860909 > (these are the maximum nham and hspam in your db). Actually, you can > change them to anything larger than this. Hold on. If the maximum record spamcount is 860909, that means Richard trained on 860,909 distinct spam messages all of which contained a common word. That's very hard to believe. Sounds more like the database has gone out to lunch. > * run -I -D hammie.db -f hammie.txt > > I don't recall if this was answered before, but as a fallback for when > this happens, couldn't spambayes just (print a warning and) set the > hamcount/spamcount to the largest nham and hspam? I don't see that this > would have that detrimental an effect, but then I'm not a statistician. Let's not shoot the messenger (i.e., the failing assert isn't the problem, the failing assert is a symptom of a problem). When the database is hosed, you may as well plug in numbers at random.
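To make the invariant behind that assert concrete: no single word can have been seen in more ham (or spam) messages than were trained in total. A minimal sketch of checking a dumped database for violations follows; the function name and the plain-dict layout `word -> (hamcount, spamcount)` are assumptions for illustration, not the real storage format or any spambayes API.

```python
def find_inconsistent_words(nham, nspam, records):
    """Return words whose per-word counts exceed the global totals.

    records maps word -> (hamcount, spamcount); this layout is an
    assumption for illustration, not the real database format.
    """
    bad = []
    for word, (hamcount, spamcount) in records.items():
        # Each count is the number of trained messages containing the
        # word, so it can never legitimately exceed the message total.
        if hamcount > nham or spamcount > nspam:
            bad.append(word)
    return sorted(bad)


# Invented data: "python" claims 30 ham messages but only 20 were trained.
records = {"free": (1, 40), "python": (30, 0), "meeting": (12, 1)}
print(find_inconsistent_words(20, 50, records))
```

A report like this only shows that the totals and the records disagree; it cannot say which side is wrong, which is the point being made above about patching the totals by hand.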
From gmino at pcsltd.com Tue May 20 21:41:34 2003 From: gmino at pcsltd.com (Gabriel Mino) Date: Tue May 20 20:41:33 2003 Subject: [Spambayes] General heads up on errors from winders/outlook machines Message-ID: <3261E796E368954CB22963F2B63E81051EECE5@xmail.pcsltd.com> 99% of the time that I have seen errors with spambayes (lately) has been when either windoez or office had not been patched to the latest "patches", this would certainly without question be my 1st suggestion to those having problems "my windowz PC is SO full of patches....it's a quilt!" From skip at pobox.com Tue May 20 21:28:02 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 20 21:28:06 2003 Subject: [Spambayes] Auto-reply to spambayes non-subscribers? In-Reply-To: References: <16074.43767.165.397425@montanaro.dyndns.org> Message-ID: <16074.54818.633828.595206@montanaro.dyndns.org> >> I suspect Spambayes is going to attract more interest after Jon >> Udell's Infoworld article Tim> Indeed, I've been home sick the last two days, and whenever I'm Tim> logged on there's another post from a non-member waiting for Tim> moderator approval. This used to be twice-a-week thing, now it's a Tim> full-time job . Once we get an auto-reply item written (I'm trying to track down the text used for the webmaster address to use as a template), I suspect it would be okay to simply let anyone's post through to the list. 
Skip From noreply at sourceforge.net Tue May 20 19:39:28 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 20 21:39:33 2003 Subject: [Spambayes] [ spambayes-Bugs-740843 ] No Disk Error with Outlook 2000 on startup Message-ID: Bugs item #740843, was opened at 2003-05-20 19:39 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740843&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Sam Snow (snowsam) Assigned to: Mark Hammond (mhammond) Summary: No Disk Error with Outlook 2000 on startup Initial Comment: After installing SpamBayes-Outlook-Setup-002.exe I am now getting an error dialog on Outlook startup. The box says: (Header) Inbox - Microsoft Outlook:OUTLOOK.EXE - No Disk (Body) There is no disk in the drive. Please insert a disk into drive \Device\Harddisk0\DR0. (Buttons) Cancel, Try Again, Continue I am able to click Cancel or Continue several times and then Outlook goes ahead and opens up. I just installed this evening, so I am not sure if the filtering is still working correctly. I was able to train the program successfully. I am using Office 2000 SP3 on Win 2000. I will try to attach a jpg of the dialog box. My error log says the following: SpamAddin - Connecting to Outlook Loaded bayes database from 'C:\Documents and Settings\Snow1\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\Snow1\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.64165ms AntiSpam: Watching for new messages in folder Inbox AntiSpam: Watching for new messages in folder Spam Processing 0 missed spam in folder 'Inbox' took 31.9599ms pythoncom error: Python error invoking COM method.
Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 203, in OnItemAdd File "E:\src\spambayes\Outlook2000\addin.py", line 163, in ProcessMessage File "E:\src\spambayes\Outlook2000\filter.py", line 15, in filter_message File "E:\src\spambayes\Outlook2000\manager.py", line 440, in score File "e:\src\spambayes\spambayes\classifier.py", line 217, in chi2_spamprob File "e:\src\spambayes\spambayes\classifier.py", line 465, in _getclues File "e:\src\spambayes\spambayes\classifier.py", line 316, in probability exceptions.AssertionError: ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740843&group_id=61702 From mhammond at skippinet.com.au Wed May 21 12:34:03 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue May 20 21:40:23 2003 Subject: [Spambayes] Re: Auto-reply to spambayes non-subscribers? In-Reply-To: <1053469938.3260.26.camel@barry> Message-ID: <099201c31f39$10b5e960$530f8490@eden> Another alternative would be a couple of extra mailing lists. I could see: spambayes-dev: developers (ie, this group now). spambayes-users: users and general support queries. spambayes-announce: New versions - may be particularly useful for Outlook users that currently have a problem listed in one of the bugs and can't worm around it. One advantage I see is that we may attract some "power users" willing to help out on the -users list. 
Tony is doing an excellent job in this role at the moment, but everyone would prefer to see him spend more time in the code :) I'm not sure the current single list would attract such people, so it will always continue to fall to us. Mark. From tim.one at comcast.net Tue May 20 22:42:55 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue May 20 21:44:56 2003 Subject: [Spambayes] Heads up! Tokenizer changes In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB30EE@its-xchg4.massey.ac.nz> Message-ID: [Meyer, Tony] > I think we really do need to have some sort of solution for these. Is > there someone that knows enough about how messages can be > malformed/the errors that the email package throws that can put this > together? Nope, not even the author. Email messages have a defined syntax, and the number of ways to violate the rules is essentially unbounded. When Barry (and other contributors) knew of a sensible way to proceed in the face of errors, the email pkg is already trying to do so under strict=False parsing. Our needs are specific to what we do. In the patch I talked about, the insane MIME structure could very well be a showstopper for many applications. In our app, though, we don't really give a rat's ass about the original MIME, we only want to suck out the words. If what was intended to be a plain-text part and an HTML part get smushed together, we really don't care. So it's appropriate for us to catch the exception and rework the message a bit so that the email pkg can tolerate it. Overall, that's more a matter of seeing what breaks than of prior analysis; bad structure seems rare, even in spam. I count about 16 places message_from_string() is called now. It's used by mboxutils.get_message() with basic protections that used to live in tokenizer.py. That's probably the best version to build on. 
It catches MessageParseError, which is the base class for all parsing complaints; the BoundaryError subclass caught by the most recent patch is a special case it would be good to catch there too, before falling back to a more drastic hack. > The code should probably be added to message.py. This would fix > imapfilter and pop3proxy immediately; I gather that the Outlook plugin > will also use message.py at some point in the future, as will > hammiefilter. Just so there's *some* choke point for asking the email pkg to perform this vulnerable task. From tim.one at comcast.net Tue May 20 23:01:28 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue May 20 22:02:47 2003 Subject: [Spambayes] Auto-reply to spambayes non-subscribers? In-Reply-To: <16074.54818.633828.595206@montanaro.dyndns.org> Message-ID: [Skip Montanaro] > Once we get an auto-reply item written (I'm trying to track down the > text used for the webmaster address to use as a template), I suspect > it would be okay to simply let anyone's post through to the list. Probably not, I'm afraid. Because of the nature of this list, SpamAssassin is set to stop virtually nothing sent here. We attract about one spam per day here, which I discard. OTOH, since everyone reading this list doesn't have a spam problem, maybe I shouldn't worry about that . From barry at python.org Wed May 21 00:04:26 2003 From: barry at python.org (Barry Warsaw) Date: Tue May 20 23:05:02 2003 Subject: [Spambayes] Auto-reply to spambayes non-subscribers? In-Reply-To: <16074.54818.633828.595206@montanaro.dyndns.org> Message-ID: On Tuesday, May 20, 2003, at 09:28 PM, Skip Montanaro wrote: > Once we get an auto-reply item written (I'm trying to track down the > text > used for the webmaster address to use as a template), I suspect it > would be > okay to simply let anyone's post through to the list. Maybe we should just go ahead and give Skip admin privs on the spambayes list. 
Tim will have to send him the password though -- I don't remember it. -Barry From skip at pobox.com Tue May 20 23:07:41 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 20 23:07:49 2003 Subject: [Spambayes] Heads up! Tokenizer changes In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB30EE@its-xchg4.massey.ac.nz> Message-ID: <16074.60797.413588.813568@montanaro.dyndns.org> Tim> Nope, not even the author. Email messages have a defined syntax, Tim> and the number of ways to violate the rules is essentially Tim> unbounded. When Barry (and other contributors) knew of a sensible Tim> way to proceed in the face of errors, the email pkg is already Tim> trying to do so under strict=False parsing. It seems to me that the simplest way to punt and guarantee a parseable message would be to simply change the Content-Type of the message to text/plain. You obviously make some compromises (like multipart/alternative messages would make duplicate tokens), but it should be guaranteed to parse, shouldn't it? Skip From skip at pobox.com Tue May 20 23:09:52 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 20 23:09:59 2003 Subject: [Spambayes] Auto-reply to spambayes non-subscribers? In-Reply-To: References: <16074.54818.633828.595206@montanaro.dyndns.org> Message-ID: <16074.60928.81138.799881@montanaro.dyndns.org> >> it would be okay to simply let anyone's post through to the list. Tim> Probably not, I'm afraid. Because of the nature of this list, Tim> SpamAssassin is set to stop virtually nothing sent here. We Tim> attract about one spam per day here, which I discard. OTOH, since Tim> everyone reading this list doesn't have a spam problem, maybe I Tim> shouldn't worry about that . That's exactly what I was thinking. Skip From seant at iname.com Wed May 21 00:13:25 2003 From: seant at iname.com (Sean True) Date: Tue May 20 23:13:59 2003 Subject: [Spambayes] Announcements?
In-Reply-To: Message-ID: As noted before, I'm working on a commercial version of the Outlook add-in. I'm wondering whether I should announce new revisions and availability on this list? Opinions, please. -- Sean From mhammond at skippinet.com.au Wed May 21 14:17:06 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue May 20 23:18:28 2003 Subject: [Spambayes] Announcements? In-Reply-To: Message-ID: <09e901c31f47$75acbde0$530f8490@eden> > As noted before, I'm working on a commercial version of the > Outlook add-in. > I'm wondering whether > I should announce new revisions and availability on this > list? Opinions, > please. I think that is fine. Mark. From tim_one at email.msn.com Wed May 21 00:29:40 2003 From: tim_one at email.msn.com (Tim Peters) Date: Tue May 20 23:30:30 2003 Subject: [Spambayes] Heads up! Tokenizer changes In-Reply-To: <16074.60797.413588.813568@montanaro.dyndns.org> Message-ID: [Skip Montanaro] > It seems to me that the simplest way to punt and guarantee a parseable > message would be to simply change the Content-Type of the message to > text/plain. You obviously make some compromises (like > multipart-alternative messages would have make duplicate tokens), but > it should be guaranteed to parse, shouldn't it? The failures here come *from* message_from_string() -- there is no Message object to work with at this point. Sometimes the body can't be parsed, and sometimes not even the headers can be parsed. Workarounds are needed for both kinds of errors, and some are in place, but they're half-hearted and scattered around the codebase now. mboxutils.get_message() is the best current routine to build on. From tim_one at email.msn.com Wed May 21 00:29:42 2003 From: tim_one at email.msn.com (Tim Peters) Date: Tue May 20 23:30:37 2003 Subject: [Spambayes] Announcements? In-Reply-To: Message-ID: [Sean True] > As noted before, I'm working on a commercial version of the Outlook > add-in. 
I'm wondering whether I should announce new revisions and > availability on this list? > Opinions, please. Within reason, yes, they'd be welcome. Since I'm a list moderator, my definition of reason applies, and I think my definition is pretty reasonable. From bill at parducci.net Tue May 20 21:55:28 2003 From: bill at parducci.net (bill parducci) Date: Tue May 20 23:56:04 2003 Subject: [Spambayes] Training via IMAP Filter References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3324@its-xchg4.massey.ac.nz> Message-ID: <3ECAF8B0.8060702@parducci.net> Meyer, Tony wrote: > Let's face it; IMAP is a really ugly, poorly designed protocol. (Not > that I'm claiming I could do a better job ;). actually, you probably could ;-) this may be a dumb question, but having watched a number of you struggle with the IMAP protocol, i have wondered why IMAP needs to be addressed directly at all? in my little [shady and deranged] world, the IMAP server is part & parcel with the SMTP server (sendmail in this case). wouldn't it be easier to address this at the SMTP level (via procmail, etc.) than try to take on the IMAP black hole? it would seem that as long as users can be trained as to what the spam folder is and to leave ham in the INBOX for 24 hours (unless you can kick off retraining via e-mail), this would be conducive to spambayes operations (provided you had the disk space to handle the hammiedbs). am i missing something significant here? is the goal to move evaluation (and hammiedbs) to the client workstations? b From anthony at interlink.com.au Wed May 21 15:08:26 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Wed May 21 00:08:44 2003 Subject: [Spambayes] Re: Auto-reply to spambayes non-subscribers? In-Reply-To: <099201c31f39$10b5e960$530f8490@eden> Message-ID: <200305210408.h4L48Rs11521@localhost.localdomain> >>> "Mark Hammond" wrote > Another alternative would be a couple of extra mailing lists. I could see: > > spambayes-dev: developers (ie, this group now).
> spambayes-users: users and general support queries. > spambayes-announce: New versions - may be particularly useful for Outlook > users that currently have a problem listed in one of the bugs and can't worm > around it. +1. This is becoming a high volume list. Is anyone working on a new alpha release? Tim/Tony? Anthony -- Anthony Baxter It's never too late to have a happy childhood. From T.A.Meyer at massey.ac.nz Wed May 21 17:24:47 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 00:25:58 2003 Subject: [Spambayes] Re: Auto-reply to spambayes non-subscribers? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3493@its-xchg4.massey.ac.nz> > > spambayes-dev: developers (ie, this group now). > > spambayes-users: users and general support queries. > > spambayes-announce: New versions > +1. This is becoming a high volume list. +1 here, too. Although I think it would probably be a good idea to keep the "spambayes" name - instead of "spambayes-users", perhaps. (Unless we're all too lazy to move to -dev ;) > Is anyone working on a new alpha release? Tim/Tony? We both are, although we've both ended up getting busy with other things at the same time. I really think it is due, though. The tasks I'm working on to get there are: * A changelog (half done) * Checking the documentation to see that it's all current & understandable. * Checking if there is anything necessary in the way of instructions to upgrade from the last release (I can check pop3proxy & the Outlook plugin, but nothing else). In terms of code, now that the imapfilter seems to be working, and the options stuff also, I can't see anything that needs to be done before the next alpha, IMO. (plenty before a beta, though :( ). There might be some Outlook plugin issues, but people should really be grabbing the binaries, not the alpha releases, for that, anyway. If someone wants to do something, they could check the manifest/setup files; I don't really know what the story with those is. 
=Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 21 17:28:43 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 00:29:47 2003 Subject: [Spambayes] Re: Auto-reply to spambayes non-subscribers? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3497@its-xchg4.massey.ac.nz> > One advantage I see is that we may attract some "power users" > willing to help out on the -users list. Tony is doing an > excellent job in this role at the moment, but everyone would > prefer to see him spend more time in the code :) You mean it doesn't give me more time for my real work? . More importantly, such users would be great for quicker responses when we're all busy (or for those like me and Mark, responses when this half of the world is asleep). > I'm not sure the current single list would attract such > people, so it will always continue to fall to us. It makes it easier to filter through it all too - especially given the number of very similar 'help' messages recently. Even if the list isn't split into users/dev, an announce list would be good for those that use spambayes but don't have time for user *or* dev messages. =Tony Meyer From noreply at sourceforge.net Tue May 20 22:44:55 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 21 00:45:31 2003 Subject: [Spambayes] [ spambayes-Bugs-740893 ] failure to install Message-ID: Bugs item #740893, was opened at 2003-05-21 00:44 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jack Steiner (steinman1950) Assigned to: Mark Hammond (mhammond) Summary: failure to install Initial Comment: Attempted install of binary version on W98 machine running Outlook2000. Did install and uninstall twice. 
As directed by the troubleshooting guide, checked the COM Add-ins - Spambayes was listed but not checked. I checked it, closed the dialog box, reopened the dialog box and it was not checked again. I ran regsvr32.exe but got an error. This may be similar to ID #725449. Jack Steiner 502-228-8110 jack@steiner.win.net ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 From T.A.Meyer at massey.ac.nz Wed May 21 17:48:31 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 00:49:42 2003 Subject: [Spambayes] Spambayes error Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34C0@its-xchg4.massey.ac.nz> > I've never seen it either, but also use the Outlook client > (and exclusively). I'm surprised we haven't gotten another > clue in all this time! Me, too. I'm also surprised that it doesn't show up in testing - we don't use the Outlook client for that... > Who's cheating client is going thru Classifier's > learn() and unlearn() interfaces, it's easy to show that > hamcount <= nham is a global invariant>? It really is bizarre. It must surely be introduced when reading or writing the db. I also have my suspicions about the classifier.py lines that "account for string" ham/spam counts - they should always be ints, and never strings, and I think it's hiding a different problem. (I've changed this in my local copy, but since I don't come across the error, I don't think it will help). > Hold on. If the maximum record spamcount is 860909, that > means Richard trained on 860,909 distinct spam messages all > of which contained a common word. That's very hard to > believe. Sounds more like the database has gone out to lunch. You know, that's a good point... :) Perhaps I should have thought about the numbers instead of just copying them... Looking at the word list, there are a lot of words with the same count (1296, for example). 
That might indicate that messages have been trained a *lot* of times. The words at the top are those that you would expect to be there, however - "the", "you", "header:subject:1", "header:to:1", "header:from:1", "proto:http", and so on. And the count does go down to single figures (no hapaxes, however). So, Richard, if you have trained over 861,000 unique spam messages, things may be ok, but if not you might want to retrain your db. =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 21 17:58:31 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 00:59:41 2003 Subject: [Spambayes] Training via IMAP Filter Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34CF@its-xchg4.massey.ac.nz> > I have wondered why IMAP be addressed directly > at all? in my little [shady and deranged] world, the IMAP > server is part & parcel with the SMTP server [...] > am I missing something significant here? is the goal to move > evaluation (and (hammiedbs) to the client workstations? Evaluation (and hammiedbs) are already on the client workstations; or at least that was the aim (with imapfilter). Believe it or not :), some people actually chose to use IMAP instead of POP3 (or Exchange, or ...). So the filter is for those people using Eudora or OE or something with an IMAP server rather than a POP3 one (since they can't use pop3proxy). =Tony Meyer From skip at pobox.com Wed May 21 01:14:04 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 21 01:14:41 2003 Subject: [Spambayes] Spambayes error In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34C0@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34C0@its-xchg4.massey.ac.nz> Message-ID: <16075.2844.443633.561855@montanaro.dyndns.org> >> I've never seen it either, but also use the Outlook client (and >> exclusively). I'm surprised we haven't gotten another clue in all >> this time! Tony> Me, too. 
I'm also surprised that it doesn't show up in testing - Tony> we don't use the Outlook client for that... I don't use the Outlook client and I haven't encountered that problem. Maybe I retrain from scratch too often (every couple of weeks anyway). On the other hand, I never train (incremental or full) into the database I use for classification, so there's no chance (or only a very slight chance) that I'm updating my database while hammiefilter is trying to use it. I train into a copy of the database, then use it to update in two training copies, on my laptop via cp, and on the Mojam mail server via rsync. I'm not aware of a problem in either setup. Tony> So, Richard, if you have trained over 861,000 unique spam Tony> messages, things may be ok, but if not you might want to retrain Tony> your db. I suspect even with all the testing Tim has done, he hasn't trained on 861,000 unique spams. Skip From skip at pobox.com Wed May 21 01:35:46 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 21 01:36:18 2003 Subject: [Spambayes] How to change login name during install? Message-ID: <16075.4146.705423.131948@montanaro.dyndns.org> My SF username is "montanaro" not "skip", so by default updating the Spambayes website using "make install" doesn't work. I wormed around the problem for the moment by prepending "montanaro@" to LIVE_DEST in scripts/make.rules. Surely there's a better way to accomplish this with some sort of rsync or ssh rc file? Skip From skip at pobox.com Wed May 21 01:37:05 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 21 01:37:18 2003 Subject: [Spambayes] Platform page? Message-ID: <16075.4225.833542.16957@montanaro.dyndns.org> I suspect we should have a platform page on the website with major categories for Windows, Unix/Linux and Mac, and subcategories for various email clients. 
What's necessary to get Spambayes running on Linux using Mutt will be different than on Windows using Outlook, which will be different than on Mac OS 9 using Eudora. I don't even know, does Mark's installer also include Python or is the regular install required as a prerequisite? Skip From skip at pobox.com Wed May 21 01:52:23 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 21 01:53:03 2003 Subject: [Spambayes] auto reply text Message-ID: <16075.5143.924682.553369@montanaro.dyndns.org> I checked in a draft of an automatic reply for the Spambayes mailing list to the website CVS repository and modified the website Makefile so it gets installed. Those of you without website CVS access can view it here: http://spambayes.sf.net/reply.txt Skip From T.A.Meyer at massey.ac.nz Wed May 21 18:53:23 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 01:54:41 2003 Subject: [Spambayes] Platform page? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34F9@its-xchg4.massey.ac.nz> > I suspect we should have a platform page on the website with > major categories for Windows, Unix/Linux and Mac, and > subcategories for various email clients. +1. Perhaps even a page for each platform, with categories for various clients. > I don't even know, does Mark's > installer also include Python or is the regular install > required as a prerequisite? It includes everything you need. =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 21 18:57:51 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 01:59:02 2003 Subject: [Spambayes] How to change login name during install? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34FB@its-xchg4.massey.ac.nz> > My SF username is "montanaro" not "skip", so by default > updating the Spambayes website using "make install" doesn't > work. I wormed around the problem for the moment by > prepending "montanaro@" to LIVE_DEST in scripts/make.rules. 
> Surely there's a better way to accomplish this with some sort > of rsync or ssh rc file? I had the same problem ("anadelonbrin" vs "tameyer"), and did similar worming. I changed the line: RSYNC_RSH = ssh to RSYNC_RSH = "ssh -l anadelonbrin" which worked for me. Could the username be passed to the make install script? (defaulting to whatever it defaults to now). i.e. I would execute: make install -l anadelonbrin =Tony Meyer From T.A.Meyer at massey.ac.nz Wed May 21 19:05:16 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 02:06:37 2003 Subject: [Spambayes] auto reply text Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3501@its-xchg4.massey.ac.nz> Nice text :) What are people's thoughts about using something along these lines (maybe with a slightly longer explanation of what spambayes is) as the main "readme.txt" file? The current readme is really for developers or testers more than users (it could become readme-devel.txt or something like that). > There is a known problem installing the Outlook > plug-in on Windows XP. > (What are the symptoms? Is there a workaround?) The clearest symptom from what I have seen is that the "Anti-Spam" buttons don't appear and the log file will have a trace (or many) that has: "'OptionsClass' object has no attribute 'skip_max_word_size'", "'module' object has no attribute 'DBDictClassifier'" or "cannot import name tokenize" in it. The only workaround (AFAIK) is to install Python (if necessary) and use either the cvs or (I think) alpha2. > Note that you also need a > recent enough version of Python installed on your system. As noted in the previous message, this needs to note that this is not the case if using Mark's binary. =Tony Meyer From anthony at interlink.com.au Wed May 21 17:24:10 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Wed May 21 02:25:15 2003 Subject: [Spambayes] How to change login name during install? 
In-Reply-To: <16075.4146.705423.131948@montanaro.dyndns.org> Message-ID: <200305210624.h4L6OAl13124@localhost.localdomain> >>> Skip Montanaro wrote > My SF username is "montanaro" not "skip", so by default updating the > Spambayes website using "make install" doesn't work. I wormed around the > problem for the moment by prepending "montanaro@" to LIVE_DEST in > scripts/make.rules. Surely there's a better way to accomplish this with > some sort of rsync or ssh rc file? From my .ssh/config:
Host *.sourceforge.net
    User anthonybaxter
From skip at pobox.com Wed May 21 02:55:42 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 21 02:56:35 2003 Subject: [Spambayes] Announcements? In-Reply-To: References: Message-ID: <16075.8942.778664.858524@montanaro.dyndns.org> Sean> As noted before, I'm working on a commercial version of the Sean> Outlook add-in. I'm wondering whether I should announce new Sean> revisions and availability on this list? Opinions, please. Fine by me. Skip From lists at olivermaunder.co.uk Wed May 21 11:02:43 2003 From: lists at olivermaunder.co.uk (Oliver Maunder) Date: Wed May 21 05:03:35 2003 Subject: [Spambayes] Training via IMAP Filter In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34CF@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34CF@its-xchg4.massey.ac.nz> Message-ID: <3ECB40B3.7000308@olivermaunder.co.uk> Meyer, Tony wrote: >>I have wondered why IMAP be addressed directly >>at all? in my little [shady and deranged] world, the IMAP >>server is part & parcel with the SMTP server >> >> In my world it's a bit different :-) The IMAP server is on my ISP's system, not mine, so I haven't got access to sendmail, procmail, etc. >Evaluation (and hammiedbs) are already on the client workstations; or at >least that was the aim (with imapfilter). > > But the "client workstations" in this case are not necessarily the same computers that are being used to read the mail.
>Believe it or not :), some people actually chose to use IMAP instead of >POP3 (or Exchange, or ...). So the filter is for those people using >Eudora or OE or something with an IMAP server rather than a POP3 one >(since they can't use pop3proxy). > > Here's why I use IMAP. I read my mail from work, from home (dual booting Windows and Linux), through a web interface and from my mobile phone. With IMAP I get the same view of my mail from anywhere - all the messages are in the same folders, if I read a message in one location, it's marked as read everywhere. The alternative is to use POP3 and leave the messages on the server for a certain period of time. But then there's no way of telling which messages I've already read, and which are genuinely new. I would also need to be running pop3proxy in several locations. I can leave imapfilter running on my home PC, and when I check my email at work, it is already filtered. If I check it on my mobile (OK - that does use POP3) all the spam has already been moved from the inbox. So - that's why I use IMAP. I'm not saying it's the best mail protocol around, but it works for me :-) Olly From mph at explor.co.uk Wed May 21 09:51:04 2003 From: mph at explor.co.uk (Martin Hepworth) Date: Wed May 21 08:10:46 2003 Subject: [Spambayes] Beginners question Message-ID: Hi, I am using the binary distribution of the SpamBayes Outlook Plugin on Windows XP with Outlook 2002, SP2. Everything appears to work OK, but I have noticed the following behaviour. When new mail arrives in my inbox, if I read it immediately the message appears to be unread until I read it a second time. If I wait for a few minutes and then read the message, it immediately changes to a read message. Everything else appears to work OK. ?? Martin. 
From paulw at bluegrassgroup.com Wed May 21 13:03:34 2003 From: paulw at bluegrassgroup.com (Paul Watson) Date: Wed May 21 08:11:16 2003 Subject: [Spambayes] Spambayes: Bug: Outlook 2003 BETA 2 Message-ID: <9BCBFCE4A3B7D140BA5FEFA74E9E8D6C0DADFF@bgctnt01.bluegrass.capetown> Installed the Spambayes Outlook add-in but it does not work. I am running Outlook 2003 BETA 2 which I think is probably the problem. It is listed in the COM Add-ins list. Trying to remove it though actually crashes Outlook. No toolbar appears. OS: Windows XP SP1. regards, Paul Watson Analysis & Design Bluegrass Cape Town, South Africa w> bluegrassgroup.com c> +27 (0)83 302 8874 t> +27 (0)21 7979 755 f> +27 (0)21 761 9310 From wonderful_day at softhome.net Wed May 21 01:53:54 2003 From: wonderful_day at softhome.net (Wonderful Day) Date: Wed May 21 08:11:43 2003 Subject: [Spambayes] A problem installing spambayes Message-ID: <6b5401c31f5d$2a09c9d0$0500a8c0@office> I just downloaded the latest EXE from http://starship.python.net/crew/mhammond/spambayes/ and installed it on my Win 98 machine with Outlook 2K. Spambayes doesn't load. If I look at the COM add-in list, I see it there, unchecked. If I check it, click OK, and close those windows, then restart Outlook, it still doesn't start and remains unchecked! Am I doing something wrong?! THANKS!
SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding From seant at iname.com Wed May 21 09:13:48 2003 From: seant at iname.com (Sean True) Date: Wed May 21 08:14:22 2003 Subject: [Spambayes] SpamAtBay Beta candidate 6 In-Reply-To: Message-ID: After some positive feedback, I'm posting the changes list for the latest 
beta of SpamAtBay, a commercial version of the Spambayes Outlook add-in. We're focusing on Tim's "easy enough for my sister" metric.

To reiterate a previous note: free (permanent) license to any SpamBayes developer. Free permanent licenses to anyone who uses a beta version. Send me email if you want a beta.

Changes before candidate 4 are available with the download.

-- Sean

Release 1.0 Beta Candidate 6 (0.95)
1) Safer save of configuration file
2) Switching dialogs to normal-modal
3) Restructured and simplified menus
4) Sender list is now context sensitive
5) Survive PasteFace problems in OL 2002
6) Preparing string resources for localization
7) Check to make sure databases are valid before import
8) Install must be done by admin user, but all Outlook users have access to the add-in, with separate registration and personal data
9) Help button removed from Control Panel (use menu)
10) Quarantine Now button dropped from Quarantine Rules panel
11) Contact Support mail now goes directly to issues processing system
12) Time consuming operations (training, filtering) have warning
13) ProcessMessages after registration and setup wizard, to catch any that came in
14) Disable GUI after 14-day trial, but leave it in place, leave Contact Support and Register items active
15) Drop first (registration) page of Setup Wizard
16) Safer save of databases

Release 1.0 Beta Candidate 5 (0.94)
1) Addresses shutdown issues with lingering Outlook
2) Internal logging and profiling support
3) Initial detection logic for Outlook 2003 (not supported)
4) Addresses some potential issues in Outlook security model
5) Sets registration friendly name and description
6) Changes "Show spam clues" format to be user friendly tables
7) Add Senders menu item, whitelist/blacklist senders and domains

Release 1.0 Beta Candidate 4 (0.93)
1) Support saving and loading database
2) Setup wizard framework now can invoke dialogs
3) Detect (and work with) more versions of the Outlook/Office com interface
4) Save
database right after "Learn from scratch" training 5) Cleaner preflight results display From Jocelyn.Montjaux at microcell.ca Wed May 21 09:30:15 2003 From: Jocelyn.Montjaux at microcell.ca (Montjaux, Jocelyn) Date: Wed May 21 08:34:31 2003 Subject: [Spambayes] Auto-reply to spambayes non-subscribers? Message-ID: <938DB8735797D511A6CB0008C7A4D20002C2C6A5@SMTLPEXC03.microcell.ca> > > I suspect Spambayes is going to attract more interest after > > Jon Udell's Infoworld article and we're going to continue to > > field basic questions such as the one I just replied to. > > Would it be possible to set up an auto-reply for > > spambayes@python.org which provides an auto-reply to > > non-subscribers' messages similar to the webmaster@python.org > > auto-reply? I'll gladly work on the reply text. > > Definitely +1 on the idea. I think we need to update the webpage to > note the problems with installing the binary on XP, as well, > given that > the article said that it works fine, when it really doesn't I had no problem installing the plugin binary (002) on my Windows XP SP1 with Outlook 2002 SP2. This may be due to the fact that I am local admin but the typical (corporate) user may not. From Paul.Moore at atosorigin.com Wed May 21 14:33:20 2003 From: Paul.Moore at atosorigin.com (Moore, Paul) Date: Wed May 21 08:34:40 2003 Subject: [Spambayes] Beginners question Message-ID: <16E1010E4581B049ABC51D4975CEDB88619A88@UKDCX001.uk.int.atosorigin.com> From: Martin Hepworth [mailto:mph@explor.co.uk] > I am using the binary distribution of the SpamBayes Outlook Plugin on > Windows XP with Outlook 2002, SP2. Everything appears to work OK, but I > have noticed the following behaviour. Thanks for reporting this! This is a known issue with the current version. 
It's already logged at SourceForge as bug 716684 (you can track this at the URL http://sourceforge.net/tracker/index.php?func=detail&aid=716684&group_id=61702&atid=498103 if you wish - feel free to add additional information there). At the moment, no workaround is available, I'm afraid. Paul Moore From skip at pobox.com Wed May 21 08:57:51 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 21 09:02:07 2003 Subject: [Spambayes] Training via IMAP Filter In-Reply-To: <3ECB40B3.7000308@olivermaunder.co.uk> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34CF@its-xchg4.massey.ac.nz> <3ECB40B3.7000308@olivermaunder.co.uk> Message-ID: <16075.30671.26751.320023@montanaro.dyndns.org> Oliver> Here's why I use IMAP. I read my mail from work, from home (dual Oliver> booting Windows and Linux), through a web interface and from my Oliver> mobile phone. With IMAP I get the same view of my mail from Oliver> anywhere - all the messages are in the same folders, if I read a Oliver> message in one location, it's marked as read everywhere. That's why God invented the Ti Powerbook. ;-) Mine's beginning to get a bit scuffed up because I carry it essentially everywhere, but with the exception of the mobile phone case (why would you want to read mail on such a small screen?) I can get email essentially anywhere I want. Skip From gmino at pcsltd.com Mon May 19 19:03:25 2003 From: gmino at pcsltd.com (Gabriel Mino) Date: Wed May 21 09:41:31 2003 Subject: [Spambayes] Announcements? In-Reply-To: <3261E796E368954CB22963F2B63E8105333EE5@xmail.pcsltd.com> Message-ID: <3261E796E368954CB22963F2B63E810513772F@xmail.pcsltd.com> Commercial version? Of an open source project?......i think I'm gonna be sik -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Sean True Sent: Tuesday, May 20, 2003 11:13 PM To: spambayes@python.org Subject: [Spambayes] Announcements? 
As noted before, I'm working on a commercial version of the Outlook add-in. I'm wondering whether I should announce new revisions and availability on this list? Opinions, please. -- Sean _______________________________________________ Spambayes mailing list Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes From bill at parducci.net Wed May 21 07:59:10 2003 From: bill at parducci.net (bill parducci) Date: Wed May 21 09:59:28 2003 Subject: [Spambayes] Training via IMAP Filter References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34CF@its-xchg4.massey.ac.nz> <3ECB40B3.7000308@olivermaunder.co.uk> Message-ID: <3ECB862E.1010203@parducci.net> Oliver Maunder wrote: Meyer, Tony wrote: B> I have wondered why IMAP be addressed directly at all? in my little B> [shady and deranged] world, the IMAP server is part & parcel with the B> SMTP server O> In my world it's a bit different :-) The IMAP server is on my ISP's O> system, not mine, so I haven't got access to sendmail, procmail, etc. yes, this is the use case i was considering as the most likely to spawn this [sub]project. however... T> Evaluation (and hammiedbs) are already on the client workstations; or at T> least that was the aim (with imapfilter). O> But the "client workstations" in this case are not necessarily the same O> computers that are being used to read the mail. ...this means they are going to have to be for the client centric model to work... O> Here's why I use IMAP. I read my mail from work, from home (dual booting O> Windows and Linux), through a web interface and from my mobile phone. O> With IMAP I get the same view of my mail from anywhere - all the O> messages are in the same folders, if I read a message in one location, O> it's marked as read everywhere. ...and if you don't have access to the mail server (beyond retrieval) you are not going to be able to use this for you webmail client. 
O> The alternative is to use POP3 and leave the messages on the server for O> a certain period of time. But then there's no way of telling which O> messages I've already read, and which are genuinely new. I would also O> need to be running pop3proxy in several locations. O> I can leave imapfilter running on my home PC, and when I check my email O> at work, it is already filtered. If I check it on my mobile (OK - that O> does use POP3) all the spam has already been moved from the inbox. yikes! is a spambot a realistic design constraint? given spambayes need to rewrite stuff, i get this uncomfortable feeling that Bad Things may happen with multiple systems *manipulating* common mail folders via IMAP. > So - that's why I use IMAP. I'm not saying it's the best mail protocol > around, but it works for me :-) hey, i'm with you. i am one of those free spirits who have chosen IMAP so that i can get mail anywhere (even on my laptop ;-), but it was done with an acceptance of the blessing/curse of keeping all information on the server. i am not saying that the distributed/spambot won't work (although as the covers are pulled back on the IMAP protocol i have my doubts :o), but that the implementation of it kinda flies in the face of the spirit for which IMAP was designed. thanks for the clarification. it will be interesting to watch a little technical alligator wrestling! :) b From DAdams at extremelogic.com Wed May 21 09:54:45 2003 From: DAdams at extremelogic.com (David Adams) Date: Wed May 21 10:53:09 2003 Subject: [Spambayes] SpamBayes project Message-ID: I hear great things about this program. I have an e-mail account that I use Outlook Express to manage. Will your software work on this version of Outlook? 
Thanks, Dave From danm at bigmac.cns.brocku.ca Wed May 21 10:33:36 2003 From: danm at bigmac.cns.brocku.ca (Daniel Minniti) Date: Wed May 21 10:53:31 2003 Subject: [Spambayes] Reporting a bug Message-ID: <000801c31f9d$95136a70$0520398b@danielpzx3bnoy> Traceback (most recent call last): File "", line 10, in ? File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\addin.py", line 59, in ? File "E:\src\pythonex\com\win32com\client\gencache.py", line 367, in EnsureModule pywintypes.com_error: (-2147319779, 'Library not registered.', None, None) From keith at cyberverse.com Wed May 21 08:40:06 2003 From: keith at cyberverse.com (keith) Date: Wed May 21 10:54:55 2003 Subject: [Spambayes] "enable filtering" stays faded out. Message-ID: I have gone around and around with this program and its instructions several times and I can't get the "enable filtering" box to check. It stays faded out no matter what I do. And the program never moves any files to the SPAM folder. I'd appreciate your help. 
This my latest log file: SpamAddin - Connecting to Outlook Loaded bayes database from 'C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 157.283ms E:\src\python-cvs\lib\fcntl.py:7: DeprecationWarning: the FCNTL module is deprecated; please use fcntl Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File 
"E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module 
named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated Checked 70 in folder ebay/paypal - 70 new entries found. 
Checked 3 in folder Friends - 3 new entries found. Checked 315 in folder Inbox - 315 new entries found. Checked 55 in folder Sent Items - 55 new entries found. Checked 32 in folder zz Datek - 22 new entries found. Checked 0 in folder zz Health - 0 new entries found. Checked 0 in folder zz Misc - 0 new entries found. Checked 6 in folder zz Motivation123 - 6 new entries found. Checked 0 in folder zz Stocks - 0 new entries found. Checked 0 in folder Top of Personal Folders - 0 new entries found. Checked 8 in folder SPAM - 8 new entries found. AntiSpam: Watching for new messages in folder Inbox Spam filtering is disabled - ignoring new message Spam filtering is disabled - ignoring new message pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 342, in OnClick File "E:\src\spambayes\Outlook2000\msgstore.py", line 241, in GetFolder File "E:\src\spambayes\Outlook2000\msgstore.py", line 194, in NormalizeID exceptions.AssertionError: What kind of ID is 'None'? AntiSpam: Watching for new messages in folder Inbox Spam filtering is disabled - ignoring new message pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 342, in OnClick File "E:\src\spambayes\Outlook2000\msgstore.py", line 241, in GetFolder File "E:\src\spambayes\Outlook2000\msgstore.py", line 194, in NormalizeID exceptions.AssertionError: What kind of ID is 'None'? 
pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 342, in OnClick File "E:\src\spambayes\Outlook2000\msgstore.py", line 241, in GetFolder File "E:\src\spambayes\Outlook2000\msgstore.py", line 194, in NormalizeID exceptions.AssertionError: What kind of ID is 'None'? Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named 
timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated Checked 133 in folder ebay/paypal - 0 new entries found. Checked 3 in folder Friends - 0 new entries found. Checked 55 in folder Sent Items - 0 new entries found. Checked 32 in folder zz Datek - 0 new entries found. Checked 0 in folder zz Health - 0 new entries found. Checked 0 in folder zz Misc - 0 new entries found. Checked 6 in folder zz Motivation123 - 0 new entries found. Checked 0 in folder zz Stocks - 0 new entries found. Checked 0 in folder Top of Personal Folders - 0 new entries found. Checked 255 in folder Inbox - 255 new entries found. Checked 8 in folder SPAM - 0 new entries found. AntiSpam: Watching for new messages in folder Inbox AntiSpam: Watching for new messages in folder Inbox pythoncom error: Python error invoking COM method. 
Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 342, in OnClick File "E:\src\spambayes\Outlook2000\msgstore.py", line 241, in GetFolder File "E:\src\spambayes\Outlook2000\msgstore.py", line 194, in NormalizeID exceptions.AssertionError: What kind of ID is 'None'? pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 342, in OnClick File "E:\src\spambayes\Outlook2000\msgstore.py", line 241, in GetFolder File "E:\src\spambayes\Outlook2000\msgstore.py", line 194, in NormalizeID exceptions.AssertionError: What kind of ID is 'None'? pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 342, in OnClick File "E:\src\spambayes\Outlook2000\msgstore.py", line 241, in GetFolder File "E:\src\spambayes\Outlook2000\msgstore.py", line 194, in NormalizeID exceptions.AssertionError: What kind of ID is 'None'? 
Spam filtering is disabled - ignoring new message Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 372, 
in OnTreeItemClick File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated Checked 133 in folder ebay/paypal - 133 new entries found. Checked 3 in folder Friends - 3 new entries found. Checked 220 in folder Inbox - 220 new entries found. Checked 56 in folder Sent Items - 56 new entries found. Checked 32 in folder zz Datek - 22 new entries found. Checked 0 in folder zz Health - 0 new entries found. Checked 0 in folder zz Misc - 0 new entries found. Checked 6 in folder zz Motivation123 - 6 new entries found. Checked 0 in folder zz Stocks - 0 new entries found. Checked 0 in folder Top of Personal Folders - 0 new entries found. Checked 43 in folder SPAM - 43 new entries found. Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: OnInitDialog() virtual handler (>) raised an exception Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler warning: raising a string exception is deprecated AntiSpam: Watching for new messages in folder Inbox Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 318, in OnInitDialog File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook 
ImportError: No module named timer
win32ui: OnInitDialog() virtual handler (>) raised an exception
Traceback (most recent call last):
  File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 391, in OnTreeItemSelChanged
  File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus
  File "E:\src\Installer\iu.py", line 296, in importHook
ImportError: No module named timer
win32ui: Exception in OnNotify() handler
warning: raising a string exception is deprecated
AntiSpam: Watching for new messages in folder Inbox

From bkc at murkworks.com Wed May 21 12:14:23 2003
From: bkc at murkworks.com (Brad Clements)
Date: Wed May 21 10:56:29 2003
Subject: [Spambayes] Collecting word lists.. I have 4 contributions so far
Message-ID: <3ECB5F8E.18223.8466880@localhost>

Hi folks,

I'll wait another day or two to see if anyone else can upload their hashed wordlist.

Interesting sizes:

7,529,920 popiel-key-hash
7,843,220 key-hash.skip
485,900 key-hash-ark3
3,969,400 key-hash.wjp

I'll also add that I'm not planning on sharing the hashed data with anyone else, just so no one has to worry about me running 'penis' or 'bork bork' through sha to see if it's in your word list already.

And to save you from having to dig in the archives for what I'm looking for and how to upload it, here's a passage from a prior message.

Thanks!

> I just ran this:
>
> >>> import anydbm
> >>> import sha
> >>> db = anydbm.open("hammie.db")
> >>> f = open("key-hash", "w")
> >>> for k in db.keys():
> ...     o = sha.new(k)
> ...     f.write(o.digest())
> ...
> >>> f.close()
> >>> db.close()
>

Okay everyone who's willing to contribute a hashed version of their wordlist, please use sha as shown above and I'll compare the hashes.
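For anyone reading this on a current Python, where the `sha` and `anydbm` modules used in the quoted session no longer exist, here is a rough sketch of the same idea built on `hashlib`. The helper names (`hash_keys`, `split_digests`, `shared_count`) are made up for illustration, and the set intersection is only one guess at how the uploaded lists might be compared; the message does not say how the comparison is actually done.

```python
import hashlib

DIGEST_SIZE = 20  # a raw SHA-1 digest is always 20 bytes


def hash_keys(keys):
    """Concatenate the raw SHA-1 digest of every key, like the quoted session."""
    out = bytearray()
    for key in keys:
        if isinstance(key, str):
            # Assumption: text tokens are hashed as UTF-8 bytes.
            key = key.encode("utf-8")
        out += hashlib.sha1(key).digest()
    return bytes(out)


def split_digests(blob):
    """Split a key-hash blob back into a set of 20-byte digests."""
    if len(blob) % DIGEST_SIZE:
        raise ValueError("blob length is not a multiple of %d" % DIGEST_SIZE)
    return {blob[i:i + DIGEST_SIZE] for i in range(0, len(blob), DIGEST_SIZE)}


def shared_count(blob_a, blob_b):
    """Count digests (and hence tokens) present in both word lists."""
    return len(split_digests(blob_a) & split_digests(blob_b))


if __name__ == "__main__":
    # Hypothetical token lists standing in for two users' databases.
    a = hash_keys(["viagra", "python", "meeting"])
    b = hash_keys(["python", "meeting", "invoice"])
    print(len(a) // DIGEST_SIZE, "digests in list a")
    print(shared_count(a, b), "tokens in common")
```

Since every digest is 20 bytes, a file's size divided by 20 gives the number of tokens it covers; the 485,900-byte upload in the size list would hold 24,295 digests. When writing the digests to disk, opening the file in binary mode ("wb") avoids newline translation on Windows.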
You can upload via anonymous ftp if you wish to ftp://ftp.murkworks.com/uploads/spambayes Note that this is a drop-only location, you won't see the file after you upload it and you cannot overwrite a file with the same name, so if you choose to contribute your hash "list" via ftp upload, please select a unique name and also send me a message off- list indicating you've uploaded it. -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From Jocelyn.Montjaux at microcell.ca Wed May 21 11:58:51 2003 From: Jocelyn.Montjaux at microcell.ca (Montjaux, Jocelyn) Date: Wed May 21 11:03:11 2003 Subject: [Spambayes] SpamBayes project Message-ID: <938DB8735797D511A6CB0008C7A4D20002C2C6A8@SMTLPEXC03.microcell.ca> The binary Outlook Plugin is for Outlook 2000+ only. You can use the POP3 (or IMAP) proxy when using any mail agent (including Outlook Express). So you will have to do some setup, see http://spambayes.sourceforge.net/applications.html pop3proxy.py Dunno if there is a more detailed setup guide... > -----Original Message----- > From: David Adams [mailto:DAdams@extremelogic.com] > Sent: Wednesday, May 21, 2003 8:55 AM > To: SpamBayes@python.org > Subject: [Spambayes] SpamBayes project > > > I hear great things about this program. I have an e-mail > account that > I > > use Outlook Express to manage. Will your software work on > this version > > of Outlook? > > > > Thanks, > > > > Dave > > > > > > > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > > From ericbr at oz.net Wed May 21 10:42:13 2003 From: ericbr at oz.net (Eric Brown) Date: Wed May 21 12:42:52 2003 Subject: [Spambayes] Outlook addin cloning messages? Message-ID: <000801c31fb7$eed33080$5264a8c0@ghidorah> I've just installed the spambayes binary outlook plugin, and it works great at filtering spam. 
Unfortunately, it also apparently leaves extra copies of randomly selected messages in my inbox. Sometimes, the outlook rules send the duplicate messages to the appropriate folders; sometimes the extra copies stay in my inbox. I've checked the bug list & mail archives, and I haven't seen anything like this. Does this happen for anyone else? I'm running Outlook XP sp 2, with the version 002 outlook addin. From DDriver at covercraft.com Wed May 21 12:27:44 2003 From: DDriver at covercraft.com (Driver, David) Date: Wed May 21 13:03:28 2003 Subject: [Spambayes] Problems registering add in dll Message-ID: <5B880349CA70D4119B7500D0B74C62D9010F8119@EXCHANGE> Under the windows installer and running regsvr32 as a domain administrator I get the following return code 0xc0000005. I would like to try the add in so any advice would be appreciated. Thanks, David J. Driver Covercraft Industries Inc. 100 Enterprise Blvd Pauls Valley, OK 73075 Phone: 405-238-9651 ex 242 The information contained in this e-mail message or any associated attachments may be privileged, confidential, and protected from disclosure. If you are not the intended recipient, any dissemination, distribution, or copying is strictly prohibited. If you think you have received this e-mail message in error, please reply to the sender and delete this message from your system. Thank you. Not Just Enabling You To View Data, But Enabling You To Complete Tasks. From LArase at vrinc.com Wed May 21 10:52:36 2003 From: LArase at vrinc.com (Lisa Arase) Date: Wed May 21 13:03:57 2003 Subject: [Spambayes] I love your product! Message-ID: Hello, What are the chances that your product could work on a server level rather than client? We have Exchange 5.5 and use TrendMicro eManager (for NT4) and find it lacking. It clears about 1/3 of the spam we get and the spam filter itself is a manual job. I'm the one learning about spam and what is and is not spam. 
Unfortunately, those on the road, must download all spam and ham from the server and it takes forever. They would rather download only ham not spam and save some time. If we could catch the spam before it hits the inbox, that would really be great! Are there any plans for this?

Thanks!

Lisa Arase

From noreply at sourceforge.net Wed May 21 11:04:28 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed May 21 13:05:03 2003
Subject: [Spambayes] [ spambayes-Bugs-741250 ] mboxtrain can truncate your mailbox
Message-ID:

Bugs item #741250, was opened at 2003-05-21 12:04
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741250&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Skip Montanaro (montanaro)
Assigned to: Skip Montanaro (montanaro)
Summary: mboxtrain can truncate your mailbox

Initial Comment:
There is code in mboxtrain to rewrite the mailbox which looks like

    try:
        os.ftruncate(f.fileno(), 0)
        f.seek(0)
    except:
        # If anything goes wrong, don't try to write
        print "Problem truncating mbox--nothing written"
        raise

If the ftruncate() call succeeds but the seek() call fails, the user would be left with an empty mailbox. I think the code should write a temporary mailbox then rename it only if the complete write operation is successful. Furthermore, bare except clauses should be avoided unless you really can't anticipate all the exceptions which might be raised.

No patch yet. I'll try to come up with something.
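
A minimal sketch of the write-a-temporary-mailbox-then-rename idea described above, in current Python. This is not the patch that went into mboxtrain: the function name `rewrite_mailbox` and the choice of passing the messages as an iterable of already-formatted strings are assumptions made for illustration only.

```python
import os
import tempfile


def rewrite_mailbox(path, messages):
    """Rewrite the mbox file at `path` so that it contains `messages`.

    `messages` is any iterable of already-formatted message strings
    (a hypothetical input shape, chosen to keep the sketch small).
    The original file is only replaced once the new copy is complete.
    """
    directory = os.path.dirname(os.path.abspath(path))
    # Put the temporary file next to the mailbox so the final rename
    # stays on the same filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory, prefix=".mbox-tmp-")
    replaced = False
    try:
        with os.fdopen(fd, "w") as tmp:
            for message in messages:
                tmp.write(message)
            tmp.flush()
            os.fsync(tmp.fileno())
        # Swap the finished copy into place in one step.
        os.replace(tmp_path, path)
        replaced = True
    finally:
        # On any failure, drop the partial copy; the original mailbox
        # has not been touched.
        if not replaced and os.path.exists(tmp_path):
            os.unlink(tmp_path)
```

Using try/finally rather than an except clause means no exception is swallowed or guessed at: whatever went wrong propagates to the caller, and the only cleanup is removing the partial temporary file. One thing the sketch does not do is copy the original mailbox's permissions or ownership onto the new file, which a real fix would probably need.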
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741250&group_id=61702 From noreply at sourceforge.net Wed May 21 11:50:53 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 21 13:51:30 2003 Subject: [Spambayes] [ spambayes-Bugs-741271 ] No icons on toolbar Message-ID: Bugs item #741271, was opened at 2003-05-21 12:50 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741271&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tim Selleck (mousecat) Assigned to: Mark Hammond (mhammond) Summary: No icons on toolbar Initial Comment: After installing the binary version, I do not have the anti-spam dropdown button on my toolbar. I have gone to the com addin and checked the Spambayes option but each time I go back to that screen, that option is unchecked. The first log I have is included in this message. Thank you for the help. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741271&group_id=61702 From tim.one at comcast.net Wed May 21 15:14:23 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed May 21 14:16:32 2003 Subject: [Spambayes] auto reply text In-Reply-To: <16075.5143.924682.553369@montanaro.dyndns.org> Message-ID: Is it really the case that the binary installer is known to have systematic problems on XP? I thought it was Win98 where it routinely didn't work. Mark should be able to settle this one. From tim.one at comcast.net Wed May 21 15:21:29 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed May 21 14:24:11 2003 Subject: [Spambayes] Re: Auto-reply to spambayes non-subscribers? 
In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3493@its-xchg4.massey.ac.nz> Message-ID: [Mark Hammond, suggests spambayes-dev: developers (ie, this group now). spambayes-users: users and general support queries. spambayes-announce: New versions ] +1 here too. I'm neutral on the later suggestion of keeping the plain "spambayes" name for what's called sapbayes-users above. From bbands at yahoo.com Wed May 21 12:53:19 2003 From: bbands at yahoo.com (John Bollinger) Date: Wed May 21 14:53:54 2003 Subject: [Spambayes] SB in CGL Message-ID: <20030521185319.30177.qmail@web13907.mail.yahoo.com> The following appeared in volume 16, issue 12 of the Capital Growth Letter, 16 May 2003. "Computer Corner "Spam, how we hate it! “Make ‘em walk the plank.” “Hang ‘em high!” "Jail 'em!" "There ought to be a law." Frankly I don’t get what all the furor is about. Yeah it is inconvenient and all, but legislation that curtails free speech certainly isn't the answer. Nonetheless, it seems something must be done about it and many entities are coming to the fore with offerings of various types. Yikes, I even get spam about anti-spam software. Where will it all end? "In August of 2002 Paul Graham published “A Plan for SPAM” http://www.paulgraham.com/spam.html, an article that discussed using (pseudo) Bayesian logic to filter wanted mail, ham, from unwanted mail, spam. That article hit a nerve and a swirl of development got underway. Today several commercial products incorporate those ideas and more are on the way. However, the most interesting extension is an open-source project called SpamBayes, SB, www.spambayes.org. They took Paul Graham’s idea and ran with it, producing in the end an elegant logical application in Python--a powerful, high-level computer language, www.python.org. "SB can filter out spam with amazing accuracy. In order to get it to work one first has to train it, which is quite simple. 
Just accumulate a bunch of spam in one mailbox and a bunch of ham in the other then tell SB to train. A couple of hundred of each will get you started, but the more the merrier. SB uses three mailboxes, one for ham--most likely your inbox, one for spam and one for items that it is unsure about--you'll need to create these. After your finished training and the program is running if you get a spam in your inbox just drag it to the spam box and SB learns a bit. Items in the unsure box need to be dispatched to the inbox or the spambox; again SB learns when you do so. After a while you'll end up with a very efficient system for dealing with spam, without legislation! "Mark Hammond has created a nifty SB implementation for Outlook; you can check it out here, http://starship.python.net/crew/mhammond/spambayes/. "As usual on these projects someone put their shoulder to the wheel, Tim Peters seems to be the man, accompanied by a cast of coders, or hackers in the correct sense to the term. Credits here: http://sourceforge.net/project/memberlist.php?group_id=61702 Enjoy!" Any errors in the above are mine. Thanks for a great project. --jab ===== John Bollinger, CFA, CMT www.BollingerBands.com If you advance far enough, you arrive at the beginning. __________________________________ Do you Yahoo!? The New Yahoo! Search - Faster. Easier. Bingo. http://search.yahoo.com From bill at parducci.net Wed May 21 13:56:16 2003 From: bill at parducci.net (bill parducci) Date: Wed May 21 15:56:50 2003 Subject: [Spambayes] SB in CGL References: <20030521185319.30177.qmail@web13907.mail.yahoo.com> Message-ID: <3ECBD9E0.4070601@parducci.net> you know, i have had excellent results taking only binary actions from spambayes. if hammiefilter says it is spam, procmail shuffles it to the spam folder, otherwise it goes to the inbox. since i would have to sift through the unsure folder anyway i don't much care if unsures go to my inbox. 
besides, after a little training that became pretty much a non-issue. also, it made the whole process easier to explain to my less technical users (if it is in the wrong folder put it in the OTHER one :o)

elegance through versatility, it's a beautiful thing. ;-)

b

John Bollinger wrote:
[...]
> "SB can filter out spam with amazing accuracy. In
> order to get it to work one first has to train it,
> which is quite simple. Just accumulate a bunch of spam
> in one mailbox and a bunch of ham in the other then
> tell SB to train. A couple of hundred of each will get
> you started, but the more the merrier. SB uses three
> mailboxes, one for ham--most likely your inbox, one
> for spam and one for items that it is unsure
> about--you'll need to create these. After your
> finished training and the program is running if you
> get a spam in your inbox just drag it to the spam box
> and SB learns a bit. Items in the unsure box need to
> be dispatched to the inbox or the spambox; again SB
> learns when you do so. After a while you'll end up
> with a very efficient system for dealing with spam,
> without legislation!
[...]

From g.popa at att.net Wed May 21 16:48:51 2003
From: g.popa at att.net (Gary Popa)
Date: Wed May 21 16:16:54 2003
Subject: [Spambayes] install issues
Message-ID: <000001c31fd2$039fbfa0$e4d05d0c@green>

I've got Outlook 2002, running Win98SE. Can't get Spambayes to load.
Here's a log file:

SpamAddin - Connecting to Outlook
pythoncom error: Failed to call the universal dispatcher
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch
  File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection
  File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager
  File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__
  File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory
  File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir
exceptions.LookupError: no codec search functions registered: can't find encoding

From jan at usplus.net Wed May 21 17:14:37 2003
From: jan at usplus.net (Jan Eugenides)
Date: Wed May 21 16:17:24 2003
Subject: [Spambayes] Outlook Add-in Spam Column formatting
Message-ID: <000001c31fd5$9b68d620$0201a8c0@dad>

I'm trying out the Outlook addin, running under XP Pro, using Outlook 2002. The filtering seems to be working just fine. I tried to set up a "Spam" column to see the percentage scores, but no matter what I do, they won't show up as a percentage. They insist on showing up as numbers. This is not that big a deal, but it does seem funny. I tried deleting the Spam field from the User Defined Fields and recreating it as Percent, but it just switches back to Number.
From thx1138 at kiva.net Wed May 21 16:36:14 2003 From: thx1138 at kiva.net (nick kearney) Date: Wed May 21 16:41:05 2003 Subject: [Spambayes] Need help with outlook 2002 Message-ID: <000401c31fd8$a01a48e0$0101a8c0@theseventhsign> I used the outlook plugin installer and the Delete as spam isn't working Thanks N!K From tim at fourstonesexpressions.com Wed May 21 16:48:17 2003 From: tim at fourstonesexpressions.com (Four Stones Expressions) Date: Wed May 21 16:49:02 2003 Subject: [Spambayes] Platform page? In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34F9@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34F9@its-xchg4.massey.ac.nz> Message-ID: On Wed, 21 May 2003 17:53:23 +1200, Meyer, Tony wrote: >> I suspect we should have a platform page on the website with major >> categories for Windows, Unix/Linux and Mac, and subcategories for >> various email clients. > > +1. Perhaps even a page for each platform, with categories for various > clients. + 1 from me as well. c'est moi - TimS From skip at pobox.com Wed May 21 15:53:02 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 21 16:52:29 2003 Subject: [Spambayes] Re: Auto-reply to spambayes non-subscribers? In-Reply-To: References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3493@its-xchg4.massey.ac.nz> Message-ID: <16075.55582.379219.85454@montanaro.dyndns.org> Tim> [Mark Hammond, suggests Tim> spambayes-dev: developers (ie, this group now). Tim> spambayes-users: users and general support queries. Tim> spambayes-announce: New versions Tim> ] Tim> +1 here too. I'm neutral on the later suggestion of keeping the Tim> plain "spambayes" name for what's called sapbayes-users above. If we dump "spambayes" I think we should at least set it up to auto-reply with the possible choices (hopefully that's an easy thing to do). 
Skip From skip at pobox.com Wed May 21 15:55:32 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 21 16:52:36 2003 Subject: [Spambayes] auto reply text In-Reply-To: References: <16075.5143.924682.553369@montanaro.dyndns.org> Message-ID: <16075.55732.406409.205851@montanaro.dyndns.org> Tim> Is it really the case that the binary installer is known to have Tim> systematic problems on XP? I thought it was Win98 where it Tim> routinely didn't work. Mark should be able to settle this one. I don't know. I was under the impression from recent traffic that there was some XP-related problem with the installer. The main reason for placing the file on the website was so people could read and respond. Those with CVS access are welcome to make changes as well. Skip From popiel at wolfskeep.com Wed May 21 14:53:00 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Wed May 21 16:53:34 2003 Subject: [Spambayes] Platform page? In-Reply-To: Message from Four Stones Expressions of "Wed, 21 May 2003 15:48:17 CDT." References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34F9@its-xchg4.massey.ac.nz> Message-ID: <20030521205300.5A65B2DDF2@cashew.wolfskeep.com> In message: Four Stones Expressions writes: >On Wed, 21 May 2003 17:53:23 +1200, Meyer, Tony >wrote: > >>> I suspect we should have a platform page on the website with major >>> categories for Windows, Unix/Linux and Mac, and subcategories for >>> various email clients. +1 >> Perhaps even a page for each platform, with categories for various >> clients. +0 - Alex From zshepherd at alphadatasys.com Wed May 21 08:44:16 2003 From: zshepherd at alphadatasys.com (D. Zane Shepherd) Date: Wed May 21 17:25:37 2003 Subject: [Spambayes] Outlook Express In-Reply-To: <16066.31393.242465.935073@montanaro.dyndns.org> Message-ID: (Redirecting this to the list. Zane's question is beyond my ability to help. Can someone provide him with some feedback? The traceback suggests something amiss with the codecs system. 
-skip) Skip, I punted on the OE path and opted for Outlook. I have downloaded the .exe install from your website and have attempted to install it. The installation appears to complete normally but nothing appeared on the Outlook Standard Toolbar. I followed the instructions in the "Addin doesn't load" section of troubleshooting. The SpamBayes addin is listed but it will not retain the check mark in the box after closing the dialog box. I have completely uninstalled and reinstalled SpamBayes with the same result. Attached is the most recent log file. I don't know what to try next. Thanks for the help! Zane Shepherd Dallas, TX -----Original Message----- From: Skip Montanaro [mailto:skip@pobox.com] Sent: Wednesday, May 14, 2003 12:19 PM To: D. Zane Shepherd Cc: SpamBayes@python.org Subject: Re: [Spambayes] Outlook Express Zane> Hi! I learned of your product from an educational and consulting Zane> firm in LA known as K2 Enterprises. They speak very highly of Zane> this product. Is it compatible with Outlook Express? Thank you Zane> in advance for a response. Not directly. You should be able to install Python+Spambayes on your system and use the pop3proxy to train and classify incoming mail though. You would then point Outlook Express at the proxy instead of the real POP3 server. If you use IMAP to retrieve your mail there is an IMAP filter in the works, though I don't know how ready for prime time it is at this point. -- Skip Montanaro skip@pobox.com http://www.musi-cal.com/ -------------- next part -------------- A non-text attachment was scrubbed... 
Name: spambayes4.log Type: application/octet-stream Size: 886 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030521/eef8f4d9/spambayes4.obj From T.A.Meyer at massey.ac.nz Thu May 22 10:52:59 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 17:54:09 2003 Subject: [Spambayes] auto reply text Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB35AC@its-xchg4.massey.ac.nz> > Tim> Is it really the case that the binary installer is known to have > Tim> systematic problems on XP? I thought it was Win98 where it > Tim> routinely didn't work. Mark should be able to settle this one. > > I don't know. I was under the impression from recent traffic > that there was some XP-related problem with the installer. > The main reason for placing the file on the website was so > people could read and respond. Those with CVS access are > welcome to make changes as well. Hint hint :) I think (especially after reading the latest batch) that there are two problems, one with Win98 and one with WinXP. (But yes, Mark could probably settle this). The Win98 one seems to be the "invalid codec" trace, while the WinXP one is the "couldn't import" trace. 
I'll update things to reflect this (and if I'm wrong, Mark can fix it ;) =Tony Meyer From noreply at sourceforge.net Wed May 21 16:17:04 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed May 21 18:17:51 2003 Subject: [Spambayes] [ spambayes-Bugs-741271 ] No icons on toolbar Message-ID: Bugs item #741271, was opened at 2003-05-22 05:50 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741271&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Tim Selleck (mousecat) Assigned to: Mark Hammond (mhammond) Summary: No icons on toolbar Initial Comment: After installing the binary version, I do not have the anti-spam dropdown button on my toolbar. I have gone to the com addin and checked the Spambayes option but each time I go back to that screen, that option is unchecked. The first log I have is included in this message. Thank you for the help. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-22 10:17 Message: Logged In: YES user_id=552329 Are you using Windows98? This is a known problem (which I'm pretty sure is limited to Win98). I'm afraid that there isn't a workaround for it at the moment, other than using the CVS (or alpha2?) source (which would require installing Python). See also [ 725449 ] Binary plugin fails on Win9x (which is a duplicate of this). 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741271&group_id=61702 From T.A.Meyer at massey.ac.nz Thu May 22 11:32:14 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 18:33:26 2003 Subject: [Spambayes] Need help with outlook 2002 Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB35E1@its-xchg4.massey.ac.nz> Hi, > I used the outlook plugin installer and the Delete as spam > isn't working Thanks Please try going through the processes listed in the "troubleshooting.html" file. If you still can't resolve the problem, then you can submit a bug report (instructions are in that file). Note that there are two known problems (that will be solved with the next binary release): * With WindowsXP your log may have messages along the lines of "can't import" or "does not have attribute [skip_word_size]". * With Windows98 your log may have a message about an "invalid codec". Neither of these problems have a workaround at the moment, apart from installing Python and using the source (either the alpha2 release or CVS). =Tony Meyer From T.A.Meyer at massey.ac.nz Thu May 22 11:43:17 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 18:44:30 2003 Subject: [Spambayes] install issues Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB35F1@its-xchg4.massey.ac.nz> > I've got Outlook 2002, running Win98SE. Can't get Spambayes > to load. Here's a log file: [...] > File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir > exceptions.LookupError: no codec search functions registered: > can't find encoding Not good news, I'm afraid. This is a known problem with the installer that Mark used to create the binary. He notes in the SF bug that he's going to try and fix it for the next release; I'm not sure if he has or not. The only workaround is to install Python and use the source - alpha2 should work, or CVS definitely will. 
=Tony Meyer From T.A.Meyer at massey.ac.nz Thu May 22 11:52:31 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 18:53:44 2003 Subject: [Spambayes] SpamBayes project Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB360A@its-xchg4.massey.ac.nz> > I hear great things about this program. I have an e-mail > account that I use Outlook Express to manage. Will your software > work on this version of Outlook? The thing is that Outlook Express isn't a version of Outlook, it's a completely separate program (from the same company). Because they give it away for free, OE is a really stripped down program, and it's extremely difficult to create a plugin for it. As someone else said, you can use pop3proxy or imapfilter (depending on whether you use POP3 or IMAP). Check out the INTEGRATION.TXT file for instructions. Pop3proxy/imapfilter aren't quite as 'transparent' as the Outlook plugin, but they're still quite easy to use/setup, and they use the same core, so the results will be the same. =Tony Meyer From skip at pobox.com Wed May 21 19:00:49 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 21 19:01:23 2003 Subject: [Spambayes] /.? In-Reply-To: <20030521185319.30177.qmail@web13907.mail.yahoo.com> References: <20030521185319.30177.qmail@web13907.mail.yahoo.com> Message-ID: <16076.1313.850777.38242@montanaro.dyndns.org> I don't mean to look a gift horse in the mouth, but what would we do if we got slashdotted? I'll try to get the auto-reply text updated this evening. Please have a look at http://spambayes.sf.net/reply.txt and give me feedback. The only feedback I received so far was about the XP comment. I'd appreciate some help in the Windows section. That's clearly where we are going to get the most hits. I'm also working on some platform-specific web pages. I'll try to make initial versions available on the website this evening without links. I'll send URLs out when something's available for review. 
Skip From mhammond at skippinet.com.au Thu May 22 09:57:45 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed May 21 19:02:00 2003 Subject: [Spambayes] auto reply text In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB35AC@its-xchg4.massey.ac.nz> Message-ID: <0d7201c31fec$654e9ea0$530f8490@eden> > > I don't know. I was under the impression from recent traffic > > that there was some XP-related problem with the installer. > > The main reason for placing the file on the website was so > > people could read and respond. Those with CVS access are > > welcome to make changes as well. > > Hint hint :) > > I think (especially after reading the latest batch) that there are two > problems, one with Win98 and one with WinXP. (But yes, Mark could > probably settle this). I don't know either. The Win9x problem is consistent, and purely related to the installer. The XP problem is less so. I have heard of a number of XP related problems; many, but not all, are related to Outlook XP (as opposed to Outlook 2000 which most of us use). I have tested, and heard of a few success reports, of XP with Outlook 2000. At least one user with Outlook XP has a fully functioning SpamBayes, except for toolbar buttons. I just don't know :) Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2020 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030522/178191e0/winmail-0001.bin From mhammond at skippinet.com.au Thu May 22 10:03:52 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed May 21 19:05:20 2003 Subject: [Spambayes] Outlook addin cloning messages? In-Reply-To: <000801c31fb7$eed33080$5264a8c0@ghidorah> Message-ID: <0d8a01c31fed$416f4fb0$530f8490@eden> > Unfortunately, it also apparently leaves extra copies of > randomly selected > messages in my inbox. 
Sometimes, the outlook rules send the duplicate
> messages to the appropriate folders; sometimes the extra
> copies stay in my
> inbox.

I haven't seen anything like this either. Are your normal rules set to "copy" the item, as opposed to "move" it? This may be a factor, as I only ever personally use "move".

Feel free to create a bug report at SourceForge - that way any future details we can dig up will not get lost.

Mark.

From T.A.Meyer at massey.ac.nz Thu May 22 12:04:24 2003
From: T.A.Meyer at massey.ac.nz (Meyer, Tony)
Date: Wed May 21 19:05:42 2003
Subject: [Spambayes] "enable filtering" stays faded out.
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3624@its-xchg4.massey.ac.nz>

> I have gone around and around with this program and its
> instructions several times and I can't get the "enable
> filtering" box to check. It stays faded out no matter what I
> do. And the program never moves any files to the SPAM
> folder. I'd appreciate your help. This my latest log file:
[...]
> AntiSpam: Watching for new messages in folder Inbox
> Spam filtering is disabled - ignoring new message

You can ignore all of the "no module called timer" traces, which leaves the correct output and the "what kind of ID is None" traces.

No messages are moved because filtering is disabled. The "enable filtering" checkbox ought to enable as soon as you have more than 5 ham and 5 spam (which, from the trace, you did), and you specify a folder to watch (again, you did). You also need to specify folders to which unsures/spam should be moved (can't tell from the trace). Have you done that?

If it is faded out, however, a message should be displayed explaining why it is (you need more messages, or something like that). What does it display for you? (In normal use it would display the number of trained messages and the folder name(s) that it is watching).

(The problem could be the "ID is None" one, but from the ordering, it didn't look like it).
=Tony Meyer From T.A.Meyer at massey.ac.nz Thu May 22 12:09:28 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 19:10:38 2003 Subject: [Spambayes] auto reply text Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB362D@its-xchg4.massey.ac.nz> > I don't know either. How disappointing ;) > The Win9x problem is consistent, and purely related to the installer. > The XP problem is less so. I have heard of a number of XP > related problems; many, but not all, are related to Outlook > XP (as opposed to Outlook 2000 which most of us use). I have > tested, and heard of a few success reports, of XP with > Outlook 2000. At least one user with Outlook XP has a fully > functioning SpamBayes, except for toolbar buttons. Is Outlook XP the same as Outlook 2002? (Stupid Microsoft version names...) If it is, then either the problem was somehow fixed in cvs at some point, or it's also an installer problem. I'm using both WindowsXP and Outlook 2002 and have no problems with the cvs code, but the installer fails for me. This refers to the (most common, I think) "skip_word_size" trace, which is what I get. The other ones ("can't import" etc) look suspiciously similar, but that might not mean anything, of course. How easy is it to build a test copy of the installer from the stuff that you've put in cvs? (plus the McMillan installer on your website) If it's not that tricky I could give it a go and see if the latest cvs code will install on XP for me. As for the fully-functioning, but no buttons, go figure ;) =Tony Meyer From David.Bear at asu.edu Wed May 21 17:13:39 2003 From: David.Bear at asu.edu (David Bear) Date: Wed May 21 19:14:35 2003 Subject: [Spambayes] Device\Harddisk1\DR? error for me yeah Message-ID: <20030521161339.G8806@asu.edu> I've now installed spambayes precompiled outlook plugin on two machines, both windows 2000 using outlook 2000. One installed when flawlessly. The other... 
error ==============================

Event Type: Information
Event Source: Application Popup
Event Category: None
Event ID: 26
Date: 5/21/2003
Time: 3:06:40 PM
User: N/A
Computer: PP120181
Description:
Application popup: INSE.tmp - No Disk : There is no disk in the drive.
Please insert a disk into drive \Device\Harddisk1\DR1.

====================================
I tried as an admin on the system. same thing

error ==============================

Event Type: Information
Event Source: Application Popup
Event Category: None
Event ID: 26
Date: 5/21/2003
Time: 3:07:46 PM
User: N/A
Computer: PP120181
Description:
Application popup: INS11.tmp - No Disk : There is no disk in the drive.
Please insert a disk into drive \Device\Harddisk1\DR1.

This system has a zip drive in it. It was empty when I attempted to install, so I inserted a disk in the zip drive and tried again. This time the error popup just mentioned a failure to register ... (sorry didn't grab the exact error, AND I couldnt find it in the eventlogs postmortem)

I grabbed a FULL copy of the registry AND a sysInfo printout if Mark is interested...
--
David Bear
phone: 480-965-8257
fax: 480-965-9189
College of Public Programs/ASU
Wilson Hall 232
Tempe, AZ 85287-0803
"Beware the IP portfolio, everyone will be suspect of trespassing"

From mhammond at skippinet.com.au Thu May 22 10:17:36 2003
From: mhammond at skippinet.com.au (Mark Hammond)
Date: Wed May 21 19:19:02 2003
Subject: [Spambayes] Performance of CVS Outlook addin
Message-ID: <0da201c31fef$2b439500$530f8490@eden>

I recently changed the Outlook addin in CVS to save the database after *every* training operation, iff you are using a bsddb. I'm wondering if any people have actually started using this version, and what their impressions of the speed are?

Personally, it is slower and now I tend to do all my "Unsure" messages in one hit - ie, select all the ones I want to "Recover", then do it as one operation. This results in a single save.
If I select the "Recover" etc button once per message, we get a save per message with the resulting slowdown. Any other impressions? Should this save code remain in place? Mark. From mhammond at skippinet.com.au Thu May 22 10:35:52 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed May 21 19:37:18 2003 Subject: [Spambayes] Device\Harddisk1\DR? error for me yeah In-Reply-To: <20030521161339.G8806@asu.edu> Message-ID: <0dbe01c31ff1$b7f65760$530f8490@eden> Thanks David, We know the reason behind this now (my development HD has the same letter as your ZIP drive!) but not completely clear on a solution. If you are able to install Python, then not only will you be able to run the latest version of the plugin without this error, you will also be able to help us get to the bottom of it once and for all. Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of David Bear > Sent: Thursday, 22 May 2003 9:14 AM > To: spam bayes > Subject: [Spambayes] Device\Harddisk1\DR? error for me yeah > > > I've now installed spambayes precompiled outlook plugin on two > machines, both windows 2000 using outlook 2000. One installed when > flawlessly. The other... > > error ============================== > > Event Type: Information > Event Source: Application Popup > Event Category: None > Event ID: 26 > Date: 5/21/2003 > Time: 3:06:40 PM > User: N/A > Computer: PP120181 > Description: > Application popup: INSE.tmp - No Disk : There is no disk in the drive. > Please insert a disk into drive \Device\Harddisk1\DR1. > > ==================================== > I tried as an admin on the system. same thing > > error ============================== > > Event Type: Information > Event Source: Application Popup > Event Category: None > Event ID: 26 > Date: 5/21/2003 > Time: 3:07:46 PM > User: N/A > Computer: PP120181 > Description: > Application popup: INS11.tmp - No Disk : There is no disk in the > drive. 
Please insert a disk into drive \Device\Harddisk1\DR1. > > This system has a zip drive in it. It was empty when I attempted to > install, so I inserted a disk in the zip drive and tried again. This > time the error popup just mentioned a failure to register ... (sorry > didn't grab the exact error, AND I couldnt find it in the eventlogs > postmortem) > > I grabbed a FULL copy of the registry AND a sysInfo printout if Mark > is interested... > -- > David Bear > phone: 480-965-8257 > fax: 480-965-9189 > College of Public Programs/ASU > Wilson Hall 232 > Tempe, AZ 85287-0803 > "Beware the IP portfolio, everyone will be suspect of trespassing" > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes From jchasko at twii.com Wed May 21 15:28:18 2003 From: jchasko at twii.com (Joe Chasko) Date: Wed May 21 19:37:26 2003 Subject: [Spambayes] GREAT program! Message-ID: <002001c31fdf$e6330fe0$0c01a8c0@JOE> Finally, a spam solution that's not a pain in the a--! This works exactly the way I would have tried to make it work, and very well at that. Nice interface, nice function, no extra work. What can I say? This is fine programming! I'm using it in Outlook in Windows. It is nicely integrated. I have unplugged my McAfee Spam killer, which was not integrated, nor smart. (And I paid $30 for that!) Joe Chasko TreadWare International Inc.R From jchasko at twii.com Wed May 21 17:26:04 2003 From: jchasko at twii.com (Joe Chasko) Date: Wed May 21 19:37:34 2003 Subject: [Spambayes] Suggestion for offshoot of Spam Bayes Message-ID: <004501c31ff0$5ab2b860$0c01a8c0@JOE> After using your marvelously effective program, I was deleting some of my Outlook rules for spam. I pondered what a pain it is to use the "Rules wizard." It occurs to me that routing incoming mail into appropriate boxes is analogous to routing spam. 
Here's a future function: use the same technology to learn where you want your mail routed. Just pick a folder of mail, and say I want this kind of mail here. You'd have to make a whole array of databases like the spam one, but this seems like the same computation. Joe Chasko From mkl at wra.org Wed May 21 17:44:50 2003 From: mkl at wra.org (L'Herault, Mitch) Date: Wed May 21 19:40:51 2003 Subject: [Spambayes] Terrific stuff Message-ID: <21773946A370B14EA079D03E95F4EF1862B154@WRA_EXCH.wra.org> I installed on an XP machine using the Outlook plugin. A little confusing setting it up and getting the filtering running. No major problems so far, with the exception that the spam field reports the wrong percentage sometimes. When I open the 'show spam clues for current message' that data seems correct, but a message that is totally spam, and actually shows up in the spam folder with a 96 or 97 percent, will report 0% in the spaminess field. Other than that it's looking good. ********************************************************************** Mitch L'Herault Network Administrator Wisconsin REALTORS Association 4801 Forest Run Road Madison, WI 53704 (608) 241-2047 From anthony at interlink.com.au Thu May 22 11:43:10 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Wed May 21 20:44:00 2003 Subject: [Spambayes] Re: Auto-reply to spambayes non-subscribers? In-Reply-To: <16075.55582.379219.85454@montanaro.dyndns.org> Message-ID: <200305220043.h4M0hAL23739@localhost.localdomain> > Tim> +1 here too. I'm neutral on the later suggestion of keeping the > Tim> plain "spambayes" name for what's called spambayes-users above. > > If we dump "spambayes" I think we should at least set it up to auto-reply > with the possible choices (hopefully that's an easy thing to do). The alternative is that we keep it as an alias for "the maintainers", and it goes to spambayes-dev. That way we can keep the whole simple 'spambayes@python.org' alias...
Anthony From anthony at interlink.com.au Thu May 22 11:48:00 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Wed May 21 20:48:42 2003 Subject: [Spambayes] Suggestion for offshoot of Spam Bayes In-Reply-To: <004501c31ff0$5ab2b860$0c01a8c0@JOE> Message-ID: <200305220048.h4M0m0m23791@localhost.localdomain> >>> "Joe Chasko" wrote > It occurs to me that routing incoming mail into appropriate boxes is > analogous to routing spam. > > Here's a future function: uses the same technology to learn where you > want your mail routed. Just pick a folder of mail, and say I want this > kind of mail here. You'd have to make a whole array of databases like > the spam one, but this seems like the same computation. Look at ifile -- http://www.nongnu.org/ifile/ Anthony From tim.one at comcast.net Wed May 21 21:58:04 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed May 21 21:00:42 2003 Subject: [Spambayes] /.? In-Reply-To: <16076.1313.850777.38242@montanaro.dyndns.org> Message-ID: [Skip] > I don't mean to look a gift horse in the mouth, but what would we do > if we got slashdotted? Vanish into the night with our massive profits . > I'll try to get the auto-reply text updated this evening. Please have > a look at > > http://spambayes.sf.net/reply.txt > > and give me feedback. The only feedback I received so far was about > the XP comment. I'd appreciate some help in the Windows section. > That's clearly where we are going to get the most hits. First off, I think we need a "Windows section", labeled as such. Text with blanks for community fill-in: """ The Outlook add-in was developed mostly using Outlook 2000 on Windows 2000 and Win98SE, in Outlook's Internet Mail Only configuration, and not using Exchange Server. Those combinations are the best tested, and problems with other combinations are known. 
If you have a problem, the best way to help make progress is to look for a similar existing bug report and add any useful information you may have, or to open a new bug report if your problem seems to be a new one. XXX point to bug tracker Outlook Express: The add-in does not work with Outlook Express. Despite its name, Outlook Express has little in common with Outlook, and doesn't expose programming interfaces rich enough for the add-in to use. You can use the POP3 proxy approach with Outlook Express instead. XXX point to something Windows 95: XXX I haven't seen any reports. Win98SE: The binary installer doesn't currently work on Win98SE. The add-in works fine on Win98 with Outlook 2000 if installed manually, though. XXX point to something Windows ME: XXX I haven't seen any reports. Windows NT: XXX I haven't seen any reports. Outlook 98, Outlook 2002, and/or Windows XP: There are mixed reports about these. Some people have trouble installing, others report problems with the toolbar buttons, still others report problems with setting up a spam score column in a custom view. These are poorly understood by the developers so far. Because the source code for Outlook is secret, making progress on these is difficult. """ From T.A.Meyer at massey.ac.nz Thu May 22 14:15:24 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 21:16:45 2003 Subject: [Spambayes] Suggestion for offshoot of Spam Bayes Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB36D1@its-xchg4.massey.ac.nz> > Look at ifile -- http://www.nongnu.org/ifile/ And also popfile -- http://sourceforge.net/projects/popfile There was also a guy at the spam conference from France (?) that had some similar system, IIRC. I think the webcasts are still there (http://spamconference.org). 
=Tony Meyer From skip at pobox.com Wed May 21 21:18:30 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 21 21:19:10 2003 Subject: [Spambayes] Performance of CVS Outlook addin In-Reply-To: <0da201c31fef$2b439500$530f8490@eden> References: <0da201c31fef$2b439500$530f8490@eden> Message-ID: <16076.9574.808281.194611@montanaro.dyndns.org> Mark> I recently changed the Outlook addin in CVS to save the database Mark> after *every* training operation, iff you are using a bsddb. Why do you need to do this? Skip From T.A.Meyer at massey.ac.nz Thu May 22 14:20:11 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 21:21:21 2003 Subject: [Spambayes] Performance of CVS Outlook addin Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB36D7@its-xchg4.massey.ac.nz> > I recently changed the Outlook addin in CVS to save the database after > *every* training operation, iff you are using a bsddb. [...] > I'm wondering if any people have actually started using this > version, and what their impressions of the speed are? [...] > Any other impressions? Should this save code remain in place? I don't notice this at all. However, I don't do much training. Most of what ends up in my unsures is stuff I'm unsure about myself (e.g. things I signed up for but am not really interested in). When I do train, though, I don't notice any delay. So my vote would be for it to stay in place (my system does crash a lot, thanks to XP and some dodgy webcam drivers, so I probably did lose training previously; not that there was much of it). =Tony Meyer From seant at iname.com Wed May 21 22:21:41 2003 From: seant at iname.com (Sean True) Date: Wed May 21 21:22:43 2003 Subject: [Spambayes] OS vs Outlook version issues. In-Reply-To: Message-ID: Mark Hammond wrote: >The Win9x problem is consistent, and purely related to the installer. >The XP problem is less so.
I have heard of a number of XP related problems; >many, but not all, are related to Outlook XP (as opposed to Outlook 2000 >which most of us use). I have tested, and heard of a few success reports, >of XP with Outlook 2000. At least one user with Outlook XP has a fully >functioning SpamBayes, except for toolbar buttons. >I just don't know :) >Mark. We are doing VMWare testing for 98, ME, NT4, 2000, XP, and XP Pro against Outlook 2000/2002/XP. So far all the platforms (clean install of OS and Outlook) actually seem to work. We do heavy testing on Outlook 2000/2002 on Windows XP and XP Pro. They both work fine, but toolbar image buttons are problematic on 2002. There is a known bug described on MSDN and at some point we'll try the five listed workarounds and see if one works. We got past some install problems through endless tinkering with the build process. Wish I knew which tinker fixed what. I do know that moving the build env to stock Python 2.2.2, and forcing the binary to use a good bsddb3 was a good thing. All of our work is off a snapshot current at 4/23/2003 in CVS. -- Sean From mhammond at skippinet.com.au Thu May 22 12:22:31 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed May 21 21:23:58 2003 Subject: [Spambayes] Announcements? In-Reply-To: <3261E796E368954CB22963F2B63E810513772F@xmail.pcsltd.com> Message-ID: <0e1f01c32000$9f88fe30$530f8490@eden> [Gabriel] > Commercial version? Of an open source project?......i think > I'm gonna be sik Why? Obviously, if someone is going to make a commercial version of an open source product, they will need to add some value (otherwise, why would anyone use the commercial one over the free one?). So good luck to them! I see no difference between someone extending our Open Source code to produce a commercial product and someone creating a commercial product using an open source language (like Python).
Heck - if creating commercial apps out of Open Source products was not a good thing, I wouldn't be anywhere near Python, SpamBayes, or any other open source project - I need to get paid for at least some of my time, and at this stage, almost all of my income is derived indirectly from Open Source products. I hope you make a million, Sean ;) Mark. From T.A.Meyer at massey.ac.nz Thu May 22 14:23:54 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 21:25:09 2003 Subject: [Spambayes] Terrific stuff Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB36DB@its-xchg4.massey.ac.nz> > I installed on an XP machine using the Outlook plugin. Out of curiosity, what version of Outlook are you using? (many people have reported problems using the binary with Outlook XP/2002). > A little confusing setting it up and getting the filtering > running. No major problems so far with the exception of the > spam field reports the wrong percentage sometimes. This has been reported before. I recall someone mentioning that an additional save call fixes it, but I'm not sure if Mark ended up doing anything about it. If it bothers you, feel free to post a bug on SF about it (http://sf.net/projects/spambayes), (add to an existing one if there is one). =Tony Meyer From mhammond at skippinet.com.au Thu May 22 12:26:29 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed May 21 21:27:55 2003 Subject: [Spambayes] Performance of CVS Outlook addin In-Reply-To: <16076.9574.808281.194611@montanaro.dyndns.org> Message-ID: <0e2001c32001$2c0d9690$530f8490@eden> > Mark> I recently changed the Outlook addin in CVS to save the database > Mark> after *every* training operation, iff you are using a bsddb. > Why do you need to do this? I didn't *need* to, but when Outlook crashes, all training information since the last startup was lost. 
While this doesn't seem a big deal, it did cause some confusion - as mail that scores "correctly" this session (due to previous training in this session) will suddenly revert back to the pre-trained knowledge. The fact I don't need to is why I am able to revert it back, should there be complaints. Mark. From drha at pacbell.net Wed May 21 19:16:47 2003 From: drha at pacbell.net (David Howard) Date: Wed May 21 21:30:09 2003 Subject: [Spambayes] Installer fails on Outlook 2000 SR-1 Message-ID: windows ME Buttons fail to appear Spam bytes appears unchecked in com add-ins Will not accept checking "Failed to call the universal dispatcher" "no codec search funtions registered: can't find encoding" -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 36 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030521/3614b572/spambayes3.obj From T.A.Meyer at massey.ac.nz Thu May 22 14:30:59 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 21:32:10 2003 Subject: [Spambayes] Spambayes: Bug: Outlook 2003 BETA 2 Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB36EA@its-xchg4.massey.ac.nz> > Installed the Spambayes Outlook add-in but it does not work. > I am running Outlook 2003 BETA 2 which I think is probably > the problem. It does list in the COM Addin list. Trying to > remove it though actually crashes Outlook. No toolbar appears. I'm pretty sure that no-one has done any 2003 testing. If you want to assist, you could install Python, use the latest CVS source, and let us know what the traceback says; who knows, maybe it's something simple to solve :) I gather that Sean has done some 2003 testing for the commercial version, and that things aren't quite that simple (since it's listed as unsupported).
If you're using Outlook with POP/IMAP, you could still use pop3proxy or imapfilter (although you don't get the nice integration, and it wouldn't work with exchange). =Tony Meyer From T.A.Meyer at massey.ac.nz Thu May 22 14:33:06 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed May 21 21:34:26 2003 Subject: [Spambayes] Installer fails on Outlook 2000 SR-1 Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB36ED@its-xchg4.massey.ac.nz> > windows ME [...] > "Failed to call the universal dispatcher" > "no codec search funtions registered: can't find encoding" This looks like the same problem as with Windows98. Assuming that it is, it's caused by a problem with the installer, and there's only two options available: * Wait for the next binary release, which should fix this. (probably in the next couple of weeks) * Install Python and use a source release (alpha2 or CVS). =Tony Meyer From tim.one at comcast.net Wed May 21 23:03:45 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed May 21 22:05:32 2003 Subject: [Spambayes] Performance of CVS Outlook addin In-Reply-To: <0da201c31fef$2b439500$530f8490@eden> Message-ID: [Mark Hammond] > I recently changed the Outlook addin in CVS to save the database after > *every* training operation, iff you are using a bsddb. > > I'm wondering if any people have actually started using this version, > and what their impressions of the speed are? Personally, it is > slower and now I tend to do all my "Unsure" messages in one hit - ie, > select all the ones I want to "Recover", then do it as one operation. > This results in a single save. If I select the "Recover" etc button > once per message, we get a save per message with the resulting > slowdown. > > Any other impressions? Should this save code remain in place? I'm using current CVS but not bsddb, and my dict-based database isn't even near 2MB yet, so I'm not much help. What is the slowdown proportional to, roughly? 
To the number of tokens in the message just trained on, or (as it would be in the case of a dict) the number of tokens in the database? I was sure hoping that under "a real" database, "# of tokens in the msg" would be the answer, *and* that updating a few hundred token records would go too fast to notice. If the latter isn't so, Barry knows many tricks to tune bsddb3 for performance -- but I'm not sure any of them don't require exploiting multiple disk drives <0.8 wink>. From mhammond at skippinet.com.au Thu May 22 13:17:04 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed May 21 22:18:30 2003 Subject: [Spambayes] Performance of CVS Outlook addin In-Reply-To: Message-ID: <0e5a01c32008$3d20e8e0$530f8490@eden> [Tim] > near 2MB yet, so I'm not much help. What is the slowdown proportional to, > roughly? To the number of tokens in the message just trained on, or (as it > would be in the case of a dict) the number of tokens in the database? I was > sure hoping that under "a real" database, "# of tokens in the msg" would be > the answer, *and* that updating a few hundred token records would go too > fast to notice. Me too :) It certainly is proportional to the number of tokens, but I also suspect it depends on the "layout" of the DB. I have never seen an incremental save take longer than 1 second on my (fairly fast CPU, average disks) machine - but 1 second is borderline too slow. My DB is currently 5MB. Certainly saving the database after a complete retrain takes nearly a minute. I was going to look at removing all hapaxes after a complete retrain to try and speed that one up too (as I recall a report here that a huge number of the tokens were hapaxes - I am yet to confirm that with my database). Mark.
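The hapax removal Mark mentions could look roughly like the sketch below. It assumes the training data can be viewed as a plain mapping from token to a (spamcount, hamcount) pair; the function name `purge_hapaxes` and that mapping layout are illustrative assumptions, not the actual SpamBayes classifier API.

```python
def purge_hapaxes(wordinfo):
    """Return a new mapping with hapaxes removed.

    Assumption: `wordinfo` maps token -> (spamcount, hamcount).
    A hapax here is a token seen exactly once across all training.
    """
    return {
        token: (spamcount, hamcount)
        for token, (spamcount, hamcount) in wordinfo.items()
        if spamcount + hamcount > 1
    }


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    counts = {
        "viagra": (12, 0),
        "python": (0, 30),
        "xqzt17": (1, 0),   # hapax, seen once in spam
        "meeting": (0, 1),  # hapax, seen once in ham
    }
    pruned = purge_hapaxes(counts)
    print(sorted(pruned))  # prints ['python', 'viagra']
```

A smaller mapping means fewer records to write on a full save. The sketch does not show whether scoring holds up once the hapaxes are gone; that would need testing against real mail.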
From tim.one at comcast.net Thu May 22 00:22:11 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed May 21 23:23:17 2003 Subject: [Spambayes] Performance of CVS Outlook addin In-Reply-To: <0e5a01c32008$3d20e8e0$530f8490@eden> Message-ID: [Mark Hammond] > Me too :) It certainly is proportional to the number of tokens, but > I also suspect it depends on the "layout" of the DB. I have never > seen an incremental save take longer than 1 second on my (fairly fast > CPU, average disks) machine - but 1 second is borderline too slow. > My DB is currently 5MB. I'm sorry to say I don't have enough experience with bsddb3 to say. > Certainly saving the database after a complete retrain takes nearly a > minute. I was going to look at removing all hapaxes after a complete > retrain to try and speed that one up too (as I recall a report here > that a huge number of the tokens were hapaxes - I am yet to confirm > that with my database) Fraught with danger -- hapaxes don't seem important when training on a large random sample, but they Rule when falling into mistake-based training (which, I confess, is all I ever do anymore -- that's why my dict db barely cracks 1.5MB). More principled may be to purge tokens with spamprobs in (0.4, 0.6), since they're ignored during scoring anyway. Note that, since this week's tokenizer changes, crap like this no longer produces garbage hapaxes: With No Side Effects Of course this change makes all prior reports about hapax distribution suspect! From tim.one at comcast.net Thu May 22 00:36:40 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed May 21 23:37:47 2003 Subject: [Spambayes] Spambayes error In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB34C0@its-xchg4.massey.ac.nz> Message-ID: [Tim] >> Who's cheating > learn() and unlearn() interfaces, it's easy to show that >> hamcount <= nham is a global invariant>? [Meyer, Tony] > It really is bizarre. It must surely be introduced when reading or > writing the db. 
I also have my suspicions about the classifier.py > lines that "account for string" ham/spam counts - they should always > be ints, and never strings, and I think it's hiding a different > problem. Yes, I objected to that change at the time, but couldn't make time to pursue it then. That was another problem I had never seen, and I don't think anyone else did either before some non-dict database got hooked up. > (I've changed this in my local copy, but since I don't come > across the error, I don't think it will help). So let's rip it out -- someone for whom it does fail then can surely be shamed into fixing it for real, or at least into posting a traceback. Could be it's related to the assertion failing too. From seant at iname.com Thu May 22 00:54:56 2003 From: seant at iname.com (Sean True) Date: Wed May 21 23:55:58 2003 Subject: [Spambayes] Announcements? In-Reply-To: <0e1f01c32000$9f88fe30$530f8490@eden> Message-ID: > -----Original Message----- > From: Mark Hammond [mailto:mhammond@skippinet.com.au] .... > > Heck - if creating commercial apps out of Open Source products was not a > good thing, I wouldn't be anywhere near Python, SpamBayes, or any > other open > source project - I need to get paid for at least some of my time, and at > this stage, almost all of my income is derived indirectly from Open Source > products. > > I hope you make a million, Sean ;) > > Mark. > Thanks, Mark. If I do, I'm gonna throw one hell of a party, and you're invited! -- Sean From seant at iname.com Thu May 22 01:01:20 2003 From: seant at iname.com (Sean True) Date: Thu May 22 00:02:20 2003 Subject: [Spambayes] RE: Spambayes Digest, Vol 57, Issue 58 In-Reply-To: Message-ID: >I gather than Sean has done some 2003 testing for the commercial >version, and that things aren't quite that simple (since it's listed as >unsupported). I did, and it wasn't fun. Basically, no events are delivered to the event handlers. 
There are some really dire warnings about keeping all items you want events from in scope at all times. Sounds easy enough ... >> windows ME >[...] >> "Failed to call the universal dispatcher" >> "no codec search funtions registered: can't find encoding" >This looks like the same problem as with Windows98. Assuming that it >is, it's caused by a problem with the installer, and there's only two >options available: > * Wait for the next binary release, which should fix this. (probably >in the next couple of weeks) > * Install Python and use a source release (alpha2 or CVS). Or install the beta of the "commercial version", and switch back as soon as Mark releases the new binaries. We've tested ME, and the install seems to work fine. YMMV, that's why they call it beta testing! We're trying hard to keep the core data files (config and .db) compatible between the versions. You might lose sugar (UI, etc) when you switch back, but you should not lose training. -- Sean From tim_one at email.msn.com Thu May 22 01:09:47 2003 From: tim_one at email.msn.com (Tim Peters) Date: Thu May 22 00:10:56 2003 Subject: [Spambayes] Announcements? In-Reply-To: Message-ID: [Mark Hammond] >> ... >> I hope you make a million, Sean ;) [Sean True] > Thanks, Mark. > > If I do, I'm gonna throw one hell of a party, and you're invited! A million isn't enough for a good party if I'm invited too -- better to donate half of it to the PSF, and the other half to Microsoft . From mhammond at skippinet.com.au Thu May 22 15:14:10 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu May 22 00:15:36 2003 Subject: [Spambayes] Announcements? In-Reply-To: Message-ID: <0ebd01c32018$9946e510$530f8490@eden> [Tim] ... > [Mark Hammond] > >> ... > >> I hope you make a million, Sean ;) > > [Sean True] > > Thanks, Mark. > > > > If I do, I'm gonna throw one hell of a party, and you're invited! > > A million isn't enough for a good party if I'm invited too -- That is why you aren't! 
Even though we have never met personally, Sean knows me well enough to know that the first million should just be able to fund a party for me. As he knows you that much better, he also knows that it will have to be 10 million before you get an invite! Ciggies-aren't-cheap-these-days ly, Mark. From popiel at wolfskeep.com Wed May 21 22:20:19 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Thu May 22 00:20:53 2003 Subject: [Spambayes] Announcements? In-Reply-To: Message from "Mark Hammond" of "Thu, 22 May 2003 14:14:10 +1000." <0ebd01c32018$9946e510$530f8490@eden> References: <0ebd01c32018$9946e510$530f8490@eden> Message-ID: <20030522042019.966F32DDF2@cashew.wolfskeep.com> In message: <0ebd01c32018$9946e510$530f8490@eden> "Mark Hammond" writes: >[Tim] >... >> [Mark Hammond] >> >> ... >> >> I hope you make a million, Sean ;) Thirded. (I'm sure someone else seconded already.) >> [Sean True] >> > Thanks, Mark. >> > >> > If I do, I'm gonna throw one hell of a party, and you're invited! >> >> A million isn't enough for a good party if I'm invited too -- > >That is why you aren't! Even though we have never met personally, Sean >knows me well enough to know that the first million should just be able to >fund a party for me. As he knows you that much better, he also knows that >it will have to be 10 million before you get an invite! > >Ciggies-aren't-cheap-these-days ly, > >Mark. Geez, some people have expensive habits. I'm sure that my taste for single malt wouldn't run to more than a few tens of thousand... 
;-) - Alex From T.A.Meyer at massey.ac.nz Thu May 22 18:17:54 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 01:19:06 2003 Subject: [Spambayes] Spambayes error Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB381E@its-xchg4.massey.ac.nz> [Tony Meyer] > > I also have my suspicions about the classifier.py > > lines that "account for string" ham/spam counts [Tim Peters] > Yes, I objected to that change at the time, but couldn't make > time to pursue it then. That was another problem I had never > seen, and I don't think anyone else did either before some > non-dict database got hooked up. [...] > So let's rip it out -- someone for whom it does fail then can > surely be shamed into fixing it for real, or at least into > posting a traceback. Could be it's related to the assertion > failing too. Ok then, I'll commit this, and we can wait for the complaints from the people who have the problem and didn't even know :) =Tony Meyer From mylene.reiners at logicacmg.com Thu May 22 09:01:41 2003 From: mylene.reiners at logicacmg.com (Mylene Reiners) Date: Thu May 22 02:03:45 2003 Subject: [Spambayes] Unable to register the DLL/OCX - Win2000, Outlook2000 SR1 Message-ID: Hi, I had Spambayes Outlook Addin installed (using SpamBayes-Outlook-Setup-002.exe), but this morning the buttons didn't show up, and no log-file :( . So I tried to check the checkbox in the Tools etc. COM Add-in form, but that failed (when I looked again, it was unchecked). Restarted the notebook, tried again, but no help. Then I uninstalled Sp.OA, and reinstalled it (or tried to), but I always get the message C:\applications\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation.
None of those commands helps :( I also deleted the registry keys, but that too didn't help. Checked part of the mailing list and bugs, but found no clue. Any ideas? Mylène From T.A.Meyer at massey.ac.nz Thu May 22 20:25:11 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 03:26:20 2003 Subject: [Spambayes] Collecting word lists.. I have 4 contributions so far Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3834@its-xchg4.massey.ac.nz> > And to save you from having to dig in the archives for what > I'm looking for and how to > upload it, here's a passage from a prior message. [...] > > >>> import anydbm > > >>> import sha > > >>> db = anydbm.open("hammie.db") [...] This is probably obvious to many people, but it wasn't to me, so in case anyone else is having trouble: If you are using bsddb3 this won't work. Import bsddb3 instead, and use bsddb3.hashopen() instead of anydbm.open(). Presumably those with pickles should do the normal pickle load thing. =Tony Meyer From Paul.Moore at atosorigin.com Thu May 22 13:58:23 2003 From: Paul.Moore at atosorigin.com (Moore, Paul) Date: Thu May 22 07:58:46 2003 Subject: [Spambayes] Experimental Ham/Spam imbalance setting Message-ID: <16E1010E4581B049ABC51D4975CEDB88619A8B@UKDCX001.uk.int.atosorigin.com> I have a friend who is using the POP3 proxy for his mail. He has a 10:1 spam:ham imbalance, and he's found that he gets quite a high proportion of unsures (from 200 or so mails a day, over 75% of which are spam). His DB contains about 1300 spam and 150 ham. In addition to the unsure rate being high, he's finding that training on the unsures isn't helping. I suspect that this is because the Ham/Spam imbalance setting means that training one unsure as spam has little effect (10% of the effect it'd have on a balanced DB?). Am I right in thinking that pop3proxy has this parameter set to true? I know it is for the Outlook plugin (which I use, but I have a fairly balanced DB these days).
Is there any good view on whether the setting is a good thing yet? My feeling is that the higher proportion of unsures, plus the unresponsiveness to training, makes it an overall loss. I got the same qualitative results myself when my DB was badly unbalanced - that's why I made the effort to make and keep my DB balanced. But I have no corresponding feel for the real-life results with the parameter *not* set. My friend has now purged his database and is starting from scratch, to try to improve his results. I mentioned the setting, but as it's a config file edit, rather than a button in the UI, he didn't feel comfortable changing it (and AIUI, he'd need to retrain as well - is that right?) Maybe the option should be exposed in the UI (but that may not be sensible if changing it *does* require a retrain). If it is, then the help could explain that this option is only relevant if your database has unequal numbers of ham and spam, and what the disadvantages of each setting are (option set = more unsures, less responsive to training; option unset = ???) If the option isn't exposed, I'd vote for taking it out. We're not getting any useful new feedback that I'm aware of. Paul. From matos at attbi.com Thu May 22 09:35:48 2003 From: matos at attbi.com (David Matos) Date: Thu May 22 08:36:53 2003 Subject: [Spambayes] Confirming (mostly) painless setup with XP Pro and Outlook XP In-Reply-To: Message-ID: <000701c3205e$acb73b90$8d80b042@dexter> In a recent message I think someone mentioned that no one (or almost no one?) had reported successful use of the Outlook plugin with WinXP (Pro) and Outlook XP (2002). If so, I'd like to confirm that the plugin installed absolutely flawlessly on my PC running the above software. It's been working fine now for about 1-2 months. I wish all the software I actually paid for worked this well. Kudos for an outstanding piece of software! --Dave Matos From mhallerm at optonline.net Thu May 22 10:11:24 2003 From: mhallerm at optonline.net (Mark A. 
Hallerman) Date: Thu May 22 09:12:19 2003 Subject: [Spambayes] Confirming (mostly) painless setup with XP Pro and Outlook XP In-Reply-To: <000701c3205e$acb73b90$8d80b042@dexter> Message-ID: <012501c32063$a615e610$6701a8c0@terrier> I just installed yesterday on XP Pro, Outlook XP and all went fine - everything seems to be working flawlessly. Regards, Mark. -----Original Message----- From: spambayes-bounces+mhallerm=optonline.net@python.org [mailto:spambayes-bounces+mhallerm=optonline.net@python.org] On Behalf Of David Matos Sent: Thursday, May 22, 2003 8:36 AM To: spambayes@python.org Subject: [Spambayes] Confirming (mostly) painless setup with XP Pro and Outlook XP In a recent message I think someone mentioned that no one (or almost no one?) had reported successful use of the Outlook plugin with WinXP (Pro) and Outlook XP (2002). If so, I'd like to confirm that the plugin installed absolutely flawlessly on my PC running the above software. It's been working fine now for about 1-2 months. I wish all the software I actually paid for worked this well. Kudos for an outstanding piece of software! --Dave Matos _______________________________________________ Spambayes mailing list Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes From Ben.Marshall at swinc.com Thu May 22 10:09:06 2003 From: Ben.Marshall at swinc.com (Marshall, Ben) Date: Thu May 22 10:18:19 2003 Subject: [Spambayes] Quick Question Message-ID: <4836C7598D2C5F448B888393437FBC0601083A12@moose.austin.swinc.com> Will the spambayes work with outlook 11? Thanks, Ben From bill at parducci.net Thu May 22 08:30:49 2003 From: bill at parducci.net (bill parducci) Date: Thu May 22 10:47:04 2003 Subject: [Spambayes] /.? References: <20030521185319.30177.qmail@web13907.mail.yahoo.com> <16076.1313.850777.38242@montanaro.dyndns.org> Message-ID: <3ECCDF19.8030903@parducci.net> would something like this help? 
http://parducci.net/spambayesQA.html i can clean it up, but for now i just grabbed a few of the latest support questions/answers for posterity. i can continue to do so if others think it will help slow the postings. (i can e-mail updates to someone with access to the site if that works...) b Skip Montanaro wrote: > I don't mean to look a gift horse in the mouth, but what would we do if we > got slashdotted? I'll try to get the auto-reply text updated this evening. > Please have a look at > > http://spambayes.sf.net/reply.txt > > and give me feedback. The only feedback I received so far was about the XP > comment. I'd appreciate some help in the Windows section. That's clearly > where we are going to get the most hits. > > I'm also working on some platform-specific web pages. I'll try to make > initial versions available on the website this evening without links. I'll > send URLs out when something's available for review. > > Skip > > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes From skip at pobox.com Thu May 22 10:30:52 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 10:47:16 2003 Subject: [Spambayes] Performance of CVS Outlook addin In-Reply-To: References: <0e5a01c32008$3d20e8e0$530f8490@eden> Message-ID: <16076.57116.463778.84092@montanaro.dyndns.org> Tim> Note that, since this week's tokenizer changes, crap like this no Tim> longer produces garbage hapaxes: Tim> With No Side Effects Tim> Of course this change makes all prior reports about hapax Tim> distribution suspect! It may also make Brad's hash string collection suspect. I haven't cvs up'd in quite awhile, so I am not yet taking advantage of your recent tokenizer changes. Others who reported data to Brad may be. Brad, what sort of input do you want, pre-change or post-change? 
Skip From bkc at murkworks.com Thu May 22 12:14:12 2003 From: bkc at murkworks.com (Brad Clements) Date: Thu May 22 10:55:52 2003 Subject: [Spambayes] Performance of CVS Outlook addin In-Reply-To: <16076.57116.463778.84092@montanaro.dyndns.org> References: Message-ID: <3ECCB103.12627.D6C93BB@localhost> On 22 May 2003 at 9:30, Skip Montanaro wrote: > It may also make Brad's hash string collection suspect. I haven't cvs up'd > in quite awhile, so I am not yet taking advantage of your recent tokenizer > changes. Others who reported data to Brad may be. Brad, what sort of > input do you want, pre-change or post-change? I don't think you can 'post correct' the hapaxes you already have in your wordlist, so unless you're willing to start with an empty database I don't think there's anything you can do about this. -- I now have 8 "personal" contributions and 10 usable "test" contributions and two zero-length test contributions. I'm going to start playing with "the numbers" and see what falls out. First, I need to get out my "data analysis" book and try to determine what I'm looking for in an MxM matrix display.. -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From skip at pobox.com Thu May 22 11:01:13 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 11:01:25 2003 Subject: [Spambayes] Confirming (mostly) painless setup with XP Pro and Outlook XP In-Reply-To: <000701c3205e$acb73b90$8d80b042@dexter> References: <000701c3205e$acb73b90$8d80b042@dexter> Message-ID: <16076.58937.10903.54331@montanaro.dyndns.org> Dave> In a recent message I think someone mentioned that no one (or Dave> almost no one?) had reported successful use of the Outlook plugin Dave> with WinXP (Pro) and Outlook XP (2002). If so, I'd like to confirm Dave> that the plugin installed absolutely flawlessly on my PC running Dave> the above software. It's been working fine now for about 1-2 Dave> months. 
I wish all the software I actually paid for worked this Dave> well. Dave, Did you install the plugin using Mark's installer or install from source? Thx, Skip From skip at pobox.com Thu May 22 11:07:40 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 11:07:47 2003 Subject: [Spambayes] Performance of CVS Outlook addin In-Reply-To: <3ECCB103.12627.D6C93BB@localhost> References: <3ECCB103.12627.D6C93BB@localhost> Message-ID: <16076.59324.169936.406221@montanaro.dyndns.org> >> It may also make Brad's hash string collection suspect. I haven't >> cvs up'd in quite awhile, so I am not yet taking advantage of your >> recent tokenizer changes. Others who reported data to Brad may be. >> Brad, what sort of input do you want, pre-change or post-change? Brad> I don't think you can 'post correct' the hapaxes you already have Brad> in your wordlist, so unless you're willing to start with an empty Brad> database I don't think there's anything you can do about this. I can, but only because I keep all my training inputs. Skip From Jocelyn.Montjaux at microcell.ca Thu May 22 12:05:19 2003 From: Jocelyn.Montjaux at microcell.ca (Montjaux, Jocelyn) Date: Thu May 22 11:09:38 2003 Subject: [Spambayes] Confirming (mostly) painless setup with XP Pro an d Outlook XP Message-ID: <938DB8735797D511A6CB0008C7A4D20002C2C6AC@SMTLPEXC03.microcell.ca> > Dave> In a recent message I think someone mentioned that > no one (or > Dave> almost no one?) had reported successful use of the > Outlook plugin > Dave> with WinXP (Pro) and Outlook XP (2002). If so, I'd > like to confirm > Dave> that the plugin installed absolutely flawlessly on > my PC running > Dave> the above software. It's been working fine now for about 1-2 > Dave> months. I wish all the software I actually paid for > worked this > Dave> well. > > Dave, > > Did you install the plugin using Mark's installer or install > from source? 
As mentioned in a previous message, everything is OK using installer 002 on
my Windows XP Pro SP1 with Outlook 2002 SP2...

Jocelyn

From skip at pobox.com Thu May 22 11:05:49 2003
From: skip at pobox.com (Skip Montanaro)
Date: Thu May 22 11:13:33 2003
Subject: [Spambayes] /.?
In-Reply-To: <3ECCDF19.8030903@parducci.net>
References: <20030521185319.30177.qmail@web13907.mail.yahoo.com> <16076.1313.850777.38242@montanaro.dyndns.org> <3ECCDF19.8030903@parducci.net>
Message-ID: <16076.59213.701284.321611@montanaro.dyndns.org>

    bill> would something like this help?
    bill> http://parducci.net/spambayesQA.html

Thanks, how about a FAQ page on the site? We can point to it from the
auto-reply.

Skip

From noreply at sourceforge.net Thu May 22 09:22:58 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu May 22 11:23:03 2003
Subject: [Spambayes] [ spambayes-Bugs-741797 ] Does not filter incoming mail
Message-ID:

Bugs item #741797, was opened at 2003-05-22 10:22
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Kim Tracy (kimtracy)
Assigned to: Mark Hammond (mhammond)
Summary: Does not filter incoming mail

Initial Comment:
Does not actively filter incoming e-mails and leaves the Spam field empty.
Appears to properly filter mail when doing a "filter now", and "delete as
Spam" works fine. The log file is attached and it does contain several error
messages that may be of help. This is the binary version running on Win2K
using Outlook 2000.
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 From bill at parducci.net Thu May 22 09:35:40 2003 From: bill at parducci.net (bill parducci) Date: Thu May 22 11:36:15 2003 Subject: [Spambayes] /.? References: <20030521185319.30177.qmail@web13907.mail.yahoo.com> <16076.1313.850777.38242@montanaro.dyndns.org> <3ECCDF19.8030903@parducci.net> <16076.59213.701284.321611@montanaro.dyndns.org> Message-ID: <3ECCEE4C.70306@parducci.net> i can whip one up (does sf.net provide a template/style sheet they would like used?). it would help if those with archived messages/commentary tidbits send/point them out to me so i can capture historical content as well. b Skip Montanaro wrote: > bill> would something like this help? > bill> http://parducci.net/spambayesQA.html > > Thanks, how about a FAQ page on the site? We can point to it from the > auto-reply. > > Skip From skip at pobox.com Thu May 22 11:42:53 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 11:43:04 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix Message-ID: <16076.61437.103567.684114@montanaro.dyndns.org> If you use (or have tried unsuccessfully to use) the Outlook plugin with any version of Windows, please take a look at http://spambayes.sf.net/windows.html and help me fill in empty cells in the table. Thx, Skip From skip at pobox.com Thu May 22 11:46:10 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 11:46:18 2003 Subject: [Spambayes] /.? 
In-Reply-To: <3ECCEE4C.70306@parducci.net> References: <20030521185319.30177.qmail@web13907.mail.yahoo.com> <16076.1313.850777.38242@montanaro.dyndns.org> <3ECCDF19.8030903@parducci.net> <16076.59213.701284.321611@montanaro.dyndns.org> <3ECCEE4C.70306@parducci.net> Message-ID: <16076.61634.56014.327258@montanaro.dyndns.org> bill> i can whip one up (does sf.net provide a template/style sheet they bill> would like used?). it would help if those with archived bill> messages/commentary tidbits send/point them out to me so i can bill> capture historical content as well. There's no real template. Just create a file with the appropriate structure for the FAQ itself. For any file on the spambayes.sf.net website, you can simply strip the "ml" from the end of the URL to get at the ht2html source template. For example, http://spambayes.sf.net/windows.html was generated from http://spambayes.sf.net/windows.ht You can just mail me the result. I'll add it to the site and to the links file. Skip From tim at fourstonesexpressions.com Thu May 22 11:53:05 2003 From: tim at fourstonesexpressions.com (Four Stones Expressions) Date: Thu May 22 11:54:22 2003 Subject: [Spambayes] /.? In-Reply-To: <16076.61634.56014.327258@montanaro.dyndns.org> References: <20030521185319.30177.qmail@web13907.mail.yahoo.com> <16076.1313.850777.38242@montanaro.dyndns.org> <3ECCDF19.8030903@parducci.net> <16076.59213.701284.321611@montanaro.dyndns.org> <3ECCEE4C.70306@parducci.net> <16076.61634.56014.327258@montanaro.dyndns.org> Message-ID: On Thu, 22 May 2003 10:46:10 -0500, Skip Montanaro wrote: > > bill> i can whip one up (does sf.net provide a template/style sheet they > bill> would like used?). it would help if those with archived > bill> messages/commentary tidbits send/point them out to me so i can > bill> capture historical content as well. We do have a FAQ page on spambayes.sourceforge.net... 
c'est moi - TimS

From bill at parducci.net Thu May 22 10:06:47 2003
From: bill at parducci.net (bill parducci)
Date: Thu May 22 12:09:11 2003
Subject: [Spambayes] /.?
References: <20030521185319.30177.qmail@web13907.mail.yahoo.com> <16076.1313.850777.38242@montanaro.dyndns.org> <3ECCDF19.8030903@parducci.net> <16076.59213.701284.321611@montanaro.dyndns.org> <3ECCEE4C.70306@parducci.net> <16076.61634.56014.327258@montanaro.dyndns.org>
Message-ID: <3ECCF597.9080400@parducci.net>

Four Stones Expressions wrote:
> We do have a FAQ page on spambayes.sourceforge.net...

right. i am in the process of adding more recent questions/answers to it.
(i was thinking of something specific for outlook users, but realized that
extending the existing faq makes more sense ;-)

b

From popiel at wolfskeep.com Thu May 22 10:10:45 2003
From: popiel at wolfskeep.com (T. Alexander Popiel)
Date: Thu May 22 12:11:19 2003
Subject: [Spambayes] Performance of CVS Outlook addin
In-Reply-To: Message from Skip Montanaro of "Thu, 22 May 2003 10:07:40 CDT." <16076.59324.169936.406221@montanaro.dyndns.org>
References: <3ECCB103.12627.D6C93BB@localhost> <16076.59324.169936.406221@montanaro.dyndns.org>
Message-ID: <20030522161045.C59B72DE9C@cashew.wolfskeep.com>

In message: <16076.59324.169936.406221@montanaro.dyndns.org>
Skip Montanaro writes:
>
>    >> It may also make Brad's hash string collection suspect. I haven't
>    >> cvs up'd in quite awhile, so I am not yet taking advantage of your
>    >> recent tokenizer changes. Others who reported data to Brad may be.
>    >> Brad, what sort of input do you want, pre-change or post-change?
>
>    Brad> I don't think you can 'post correct' the hapaxes you already have
>    Brad> in your wordlist, so unless you're willing to start with an empty
>    Brad> database I don't think there's anything you can do about this.
>
>I can, but only because I keep all my training inputs.

As can I, for similar reasons.
FWIW, the database I sent is from a significantly old version of SB (about 3-4 months) which I'm actually using for my live filtering. I suppose I ought to update eventually... - Alex From rbell at hcdemocrat.com Thu May 22 13:16:19 2003 From: rbell at hcdemocrat.com (Bell, Robert) Date: Thu May 22 12:17:22 2003 Subject: [Spambayes] RE: Windows/Outlook compatibility matrix Message-ID: <409800B2B5630547824C923B202D086613F504@et-prod5.express-times.net> > If you use (or have tried unsuccessfully to use) the Outlook > plugin with any version of Windows, please take a look at > > http://spambayes.sf.net/windows.html > > and help me fill in empty cells in the table. I'm using version 1.0a2 with python 2.2.2 (not the binary installer) on Windows 98se with Outlook 2002 (XP) pulling mail from an Exchange 2000 server. No problems so far. I plan to test it on Windows 2000 with Outlook 2002 and Exchange in a day or so. From tim at fourstonesexpressions.com Thu May 22 12:31:43 2003 From: tim at fourstonesexpressions.com (Tim Stone) Date: Thu May 22 12:32:28 2003 Subject: [Spambayes] RE: Windows/Outlook compatibility matrix In-Reply-To: <409800B2B5630547824C923B202D086613F504@et-prod5.express-times.net> References: <409800B2B5630547824C923B202D086613F504@et-prod5.express-times.net> Message-ID: On Thu, 22 May 2003 12:16:19 -0400, Bell, Robert wrote: >> If you use (or have tried unsuccessfully to use) the Outlook plugin with >> any version of Windows, please take a look at >> >> http://spambayes.sf.net/windows.html >> >> and help me fill in empty cells in the table. > > I'm using version 1.0a2 with python 2.2.2 (not the binary installer) on > Windows 98se with Outlook 2002 (XP) pulling mail from an Exchange 2000 > server. No problems so far. I plan to test it on Windows 2000 with > Outlook 2002 and Exchange in a day or so. For other mail clients that use pop3 and imap, Tony Meyer did a cursory test of about 25 clients. 
I'm not sure what the scope of his testing was, or whether or not it exercised pop3proxy or smtpproxy, but his matrix appears somewhere in the archive... I've lost my mbox due to a hd crash, so I can't find it... Tony? c'est moi - TimS From KHart at BasicBS.Com Thu May 22 10:18:27 2003 From: KHart at BasicBS.Com (Kenneth Hart) Date: Thu May 22 12:37:07 2003 Subject: [Spambayes] Happy user Message-ID: I recently discovered your product & so far, like it very much. How can I be notified of, or obtain, updates to the program? Regards; Kenneth Hart From matos at attbi.com Thu May 22 13:39:02 2003 From: matos at attbi.com (David Matos) Date: Thu May 22 12:40:10 2003 Subject: [Spambayes] Confirming (mostly) painless setup with XP Pro and Outlook XP In-Reply-To: <16076.58937.10903.54331@montanaro.dyndns.org> Message-ID: <001501c32080$a93177c0$8d80b042@dexter> Whoops. I forgot to mention that I used Mark's installer, version 002, on an SP2 version of Outlook XP. Dave> In a recent message I think someone mentioned that no one (or Dave> almost no one?) had reported successful use of the Outlook plugin Dave> with WinXP (Pro) and Outlook XP (2002). If so, I'd like to confirm Dave> that the plugin installed absolutely flawlessly on my PC running Dave> the above software. It's been working fine now for about 1-2 Dave> months. I wish all the software I actually paid for worked this Dave> well. Dave, Did you install the plugin using Mark's installer or install from source? Thx, Skip From tim.one at comcast.net Thu May 22 13:39:52 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 22 12:42:58 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: <16076.61437.103567.684114@montanaro.dyndns.org> Message-ID: [skip@pobox.com] > If you use (or have tried unsuccessfully to use) the Outlook plugin > with any version of Windows, please take a look at > > http://spambayes.sf.net/windows.html > > and help me fill in empty cells in the table. Thanks! 
It's helpful. The current binary installer is known not to work on Win98 and WinME. Installing from CVS is known to work on Win98. From dwphillips at fwmurphy.com Thu May 22 12:38:24 2003 From: dwphillips at fwmurphy.com (David W. Phillips) Date: Thu May 22 12:47:37 2003 Subject: [Spambayes] Usage on XP version of Outlook Message-ID: <4358DC03B08CD31191A60006293844E20326FFE7@FWM-MAIL> I just wanted you to know that I have installed and it runs great. I do have a question about the filter button on the Xp version of outlook. After using the train mode and setting the filters. The enable filter button is grayed out, not available. Haven't had this problem on 2000 outlook. Is this a problem? Does it still run automatically? David W. Phillips Systems Administrator FW Murphy 5311 S. 122nd E. Ave. Tulsa OK 74146 Voice - (918)-317-4253 Cell - (918)-625-0385 Dwphillips@fwmurphy.com From skip at pobox.com Thu May 22 12:53:53 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 12:54:05 2003 Subject: [Spambayes] /.? In-Reply-To: References: <20030521185319.30177.qmail@web13907.mail.yahoo.com> <16076.1313.850777.38242@montanaro.dyndns.org> <3ECCDF19.8030903@parducci.net> <16076.59213.701284.321611@montanaro.dyndns.org> <3ECCEE4C.70306@parducci.net> <16076.61634.56014.327258@montanaro.dyndns.org> Message-ID: <16077.161.708464.294129@montanaro.dyndns.org> TimS> We do have a FAQ page on spambayes.sourceforge.net... Ah, yes, I didn't notice. I naively expected it to be in the left-hand margin. Bill, mind if I update the FAQ with any of your questions which it is missing? Skip From skip at pobox.com Thu May 22 12:55:52 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 12:56:01 2003 Subject: [Spambayes] /.? 
In-Reply-To: <3ECCF597.9080400@parducci.net> References: <20030521185319.30177.qmail@web13907.mail.yahoo.com> <16076.1313.850777.38242@montanaro.dyndns.org> <3ECCDF19.8030903@parducci.net> <16076.59213.701284.321611@montanaro.dyndns.org> <3ECCEE4C.70306@parducci.net> <16076.61634.56014.327258@montanaro.dyndns.org> <3ECCF597.9080400@parducci.net> Message-ID: <16077.280.984032.477048@montanaro.dyndns.org> bill> right. i am in the process of appending it with more recent bill> questions/answers. (i was thinking of something specific for bill> outlook users, but realized that extending the existing faq make bill> more sense ;-) Note that my plan is to have Windows, Mac and Unix/Linux links in the left-hand margin which link to http://spambayes.sf.net/windows.html http://spambayes.sf.net/mac.html http://spambayes.sf.net/unix.html There are hardly-more-than-stub pages there now. I welcome content for any of them. Skip From skip at pobox.com Thu May 22 13:05:02 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 13:05:11 2003 Subject: [Spambayes] RE: Windows/Outlook compatibility matrix In-Reply-To: <409800B2B5630547824C923B202D086613F504@et-prod5.express-times.net> References: <409800B2B5630547824C923B202D086613F504@et-prod5.express-times.net> Message-ID: <16077.830.856498.540462@montanaro.dyndns.org> >> http://spambayes.sf.net/windows.html >> >> and help me fill in empty cells in the table. Robert> I'm using version 1.0a2 with python 2.2.2 (not the binary Robert> installer) on Windows 98se with Outlook 2002 (XP) pulling mail Robert> from an Exchange 2000 server. No problems so far. Thanks. I know next to nothing about Windows. Are Windows 98 and Windows 98se different enough that I should have separate columns for them? Skip From bill at parducci.net Thu May 22 11:06:10 2003 From: bill at parducci.net (bill parducci) Date: Thu May 22 13:06:44 2003 Subject: [Spambayes] /.? 
References: <20030521185319.30177.qmail@web13907.mail.yahoo.com> <16076.1313.850777.38242@montanaro.dyndns.org> <3ECCDF19.8030903@parducci.net> <16076.59213.701284.321611@montanaro.dyndns.org> <3ECCEE4C.70306@parducci.net> <16076.61634.56014.327258@montanaro.dyndns.org> <3ECCF597.9080400@parducci.net> <16077.280.984032.477048@montanaro.dyndns.org> Message-ID: <3ECD0382.5050007@parducci.net> ok. for now i will just stitch together whatever i can come up with into the current FAQ and then we can slice and dice as needed. this way we will have as much a possible available for consumption quickly. that work? b Skip Montanaro wrote: > bill> right. i am in the process of appending it with more recent > bill> questions/answers. (i was thinking of something specific for > bill> outlook users, but realized that extending the existing faq make > bill> more sense ;-) > > Note that my plan is to have Windows, Mac and Unix/Linux links in the > left-hand margin which link to > > http://spambayes.sf.net/windows.html > http://spambayes.sf.net/mac.html > http://spambayes.sf.net/unix.html > > There are hardly-more-than-stub pages there now. I welcome content for any > of them. > > Skip From skip at pobox.com Thu May 22 13:09:05 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 13:09:20 2003 Subject: [Spambayes] Happy user In-Reply-To: References: Message-ID: <16077.1073.536902.761805@montanaro.dyndns.org> Kenneth> I recently discovered your product & so far, like it very much. Kenneth> How can I be notified of, or obtain, updates to the program? Kenneth, You can keep an eye on the Spambayes website: http://spambayes.sf.net/ or subscribe to this mailing list. We have plans to expand the number of lists to separate user, development and announcement lists. When that's in place (should happen in the next week or so), you can subscribe to the announcement list if you're not interested in the other lists. 
I'm working on a Windows/Outlook compatibility matrix for the Outlook plugin. Is that the way you're using the tool? If so, what versions of Windows and Outlook are you using? Thanks, Skip From tim.one at comcast.net Thu May 22 14:11:03 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 22 13:14:41 2003 Subject: [Spambayes] RE: Windows/Outlook compatibility matrix In-Reply-To: <16077.830.856498.540462@montanaro.dyndns.org> Message-ID: [skip@pobox.com] > Thanks. I know next to nothing about Windows. Are Windows 98 and > Windows 98se different enough that I should have separate columns for > them? 98, 98SE, and ME usually act the same, so let's not distinguish among them until there's a solid reason to (except that we have to list the names, because users generally don't know how similar they are). From tim at fourstonesexpressions.com Thu May 22 13:23:17 2003 From: tim at fourstonesexpressions.com (Tim Stone) Date: Thu May 22 13:24:05 2003 Subject: [Spambayes] RE: Windows/Outlook compatibility matrix In-Reply-To: References: Message-ID: On Thu, 22 May 2003 13:11:03 -0400, Tim Peters wrote: > 98, 98SE, and ME usually act the same, so let's not distinguish among > them > until there's a solid reason to (except that we have to list the names, > because users generally don't know how similar they are). We could call this cell in the matrix: "The Windows Versions that broke almost everything in the name of USB support" c'est moi - TimS From skip at pobox.com Thu May 22 13:27:47 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 13:27:54 2003 Subject: [Spambayes] RE: Windows/Outlook compatibility matrix In-Reply-To: References: <16077.830.856498.540462@montanaro.dyndns.org> Message-ID: <16077.2195.285241.64834@montanaro.dyndns.org> >> Are Windows 98 and Windows 98se different enough that I should have >> separate columns for them? 
Tim> 98, 98SE, and ME usually act the same, so let's not distinguish Tim> among them until there's a solid reason to (except that we have to Tim> list the names, because users generally don't know how similar they Tim> are). Done. Please review to make sure I didn't muff something. Skip From tim at fourstonesexpressions.com Thu May 22 13:44:16 2003 From: tim at fourstonesexpressions.com (Tim Stone) Date: Thu May 22 13:45:01 2003 Subject: [Spambayes] Suddenly a lot of errors In-Reply-To: <5.2.0.9.0.20030512151849.0223bd98@mail.grenna.net> References: <5.2.0.9.0.20030512151849.0223bd98@mail.grenna.net> Message-ID: On Mon, 12 May 2003 15:21:50 +0100, Peter Bengtsson wrote: > Any ideas what this might mean or what I have done to deserve this? Well that's an interesting one... did you install bsddb3, or are you using the default windows python 2.2 install? If so, you're likely using dumbdbm (the default dbm implementation) which is aptly named... I don't know if this would be the problem or not, I use dumbdbm all the time, but this clearly is a problem in the database side of things, and it's the only thing I can think of right off hand... c'est moi - TimS From rbell at hcdemocrat.com Thu May 22 14:55:39 2003 From: rbell at hcdemocrat.com (Bell, Robert) Date: Thu May 22 13:56:43 2003 Subject: [Spambayes] RE: Windows/Outlook compatibility matrix Message-ID: <409800B2B5630547824C923B202D086613F50D@et-prod5.express-times.net> > > >> http://spambayes.sf.net/windows.html > >> > >> and help me fill in empty cells in the table. > > Robert> I'm using version 1.0a2 with python 2.2.2 (not the binary > Robert> installer) on Windows 98se with Outlook 2002 (XP) > pulling mail > Robert> from an Exchange 2000 server. No problems so far. > > Thanks. I know next to nothing about Windows. Are Windows > 98 and Windows 98se different enough that I should have > separate columns for them? I don't think so, but I don't have a machine set up with Windows 98 to test. 
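On the dumbdbm-versus-bsddb3 question from the "Suddenly a lot of errors" reply above: one way to see which dbm implementation actually wrote a database file is to ask Python. What follows is only a sketch, not the project's own tooling. It assumes a modern Python 3, where the old `anydbm`, `whichdb` and `dumbdbm` modules live on as `dbm`, `dbm.whichdb` and `dbm.dumb`; the helper name `describe_backend` and the file name `demo_hammie_db` are made up for illustration.

```python
import dbm
import dbm.dumb
import os
import tempfile


def describe_backend(path):
    """Report which dbm module appears to have created the file at `path`.

    Hypothetical helper: a thin wrapper around the standard library's
    dbm.whichdb(), which returns a module name such as "dbm.dumb",
    an empty string if the format is not recognised, or None if the
    file cannot be read at all.
    """
    return dbm.whichdb(path)


# Build a throwaway database with the pure-Python "dumb" backend
# (the fallback Python uses when nothing better is installed),
# then ask which backend wrote it.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "demo_hammie_db")  # assumed name
    db = dbm.dumb.open(demo_path, "c")
    db[b"token"] = b"1"
    db.close()
    backend = describe_backend(demo_path)

print(backend)
```

Pointing `describe_backend` at a real database path would show whether the fallback backend is in use, which may help narrow down database-side errors like the ones discussed.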
From David.Bear at asu.edu Thu May 22 12:22:27 2003
From: David.Bear at asu.edu (David Bear)
Date: Thu May 22 14:25:33 2003
Subject: [Spambayes] maybe a procmail question...
Message-ID: <20030522112227.B15578@asu.edu>

Since I'm loving spambayes, I'm wondering about how soon its procmail rule
fires. Should the hammie lines

    # spambayes filter
    :0 fw:hamlock
    | /usr/bin/hammie.py -f -d -p $HOME/.hammiedb

be the FIRST rule in .procmailrc? Right now I have other rules that fire
and move messages to specific folders. I've noticed that sometimes spam
gets filed in these 'other' folders... so, this may be more of a procmail
question -- is rule order significant?

--
David Bear
phone: 480-965-8257
fax: 480-965-9189
College of Public Programs/ASU
Wilson Hall 232
Tempe, AZ 85287-0803
"Beware the IP portfolio, everyone will be suspect of trespassing"

From noreply at sourceforge.net Thu May 22 12:26:23 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu May 22 14:27:00 2003
Subject: [Spambayes] [ spambayes-Bugs-741914 ] Unable to Register DLL Error
Message-ID:

Bugs item #741914, was opened at 2003-05-22 13:26
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741914&group_id=61702

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Gary Thomas (garyt977)
Assigned to: Nobody/Anonymous (nobody)
Summary: Unable to Register DLL Error

Initial Comment:
I'm running WinXP and Office 2000 Professional. When I attempt to install
the binary version on my system I get the following error:

----
C:\Program files\Spambayes Outlook Addin\Spambayes_addin.dll
Unable to register the DLL/OCX:DllRegister failed:code 0x00000000
Abort, Retry, Fail
----

After searching my hard drive, I was unable to find a log file for this
program.
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741914&group_id=61702 From WalkerK2 at DNB.com Thu May 22 15:43:15 2003 From: WalkerK2 at DNB.com (Walker, Ken) Date: Thu May 22 14:53:26 2003 Subject: [Spambayes] Outlook XP/Win 2000 Message-ID: Hi! Great product! I found you via Udell's article in InfoWorld. I wanted to let you know that the plugin install did not work with my Outlook XP/Win 2000 configuration. Do you think the new version of the plugin installer will fix this problem? I got an error when it tried to register spambayes_addin.dll and an Abort/Retry/Ignore prompt. Thanks, Ken ___ Ken Walker D&B Application Specialist 800.526.0651 x6728 walkerk2@dnb.com From noreply at sourceforge.net Thu May 22 12:57:37 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu May 22 14:59:59 2003 Subject: [Spambayes] [ spambayes-Bugs-740893 ] failure to install Message-ID: Bugs item #740893, was opened at 2003-05-21 04:44 Message generated for change (Comment added) made by radesai You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jack Steiner (steinman1950) Assigned to: Mark Hammond (mhammond) Summary: failure to install Initial Comment: Attempted install of binary version on W98 machine running Outlook2000. Did install and uninstall twice. As directed by the troubleshooting guide, checked the COM Add-ins - Spambayes was listed but not checked. I checked it, closed the dialog box, reopened the dialog box and it was not checked again. I ran regsvr32.exe but got an error. This may be similar to ID #725449. Jack Steiner 502-228-8110 jack@steiner.win.net ---------------------------------------------------------------------- Comment By: Ravil A. 
Desai (radesai)
Date: 2003-05-22 18:57

Message:
Logged In: YES
user_id=753768

I am getting the same error. I had it installed and it worked fine for a
day or two. Now it has stopped working and I get the same errors. It
didn't make sense to create another bug. I removed and reinstalled, but
that did not fix the problem.

Thanks
radesai

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702

From noreply at sourceforge.net Thu May 22 13:20:47 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu May 22 15:21:40 2003
Subject: [Spambayes] [ spambayes-Bugs-712480 ] Outlook 2002 (XP) installation fails
Message-ID:

Bugs item #712480, was opened at 2003-03-31 00:47
Message generated for change (Comment added) made by ecvogel
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Paul Marrero (pmarrero)
Assigned to: Mark Hammond (mhammond)
Summary: Outlook 2002 (XP) installation fails

Initial Comment:
I use Office XP with the Outlook client. It appears that the registration
was successful but I cannot find any menu buttons. XP clipboard does
appear to have the icons. The command line train works. Not sure where to
go from here.

----------------------------------------------------------------------

Comment By: Eric C. Vogel (ecvogel)
Date: 2003-05-22 15:20

Message:
Logged In: YES
user_id=784818

Same here. Outlook XP all SP and patches, Win XP Pro SP1 all patches. The
file is registered and in the COM and has the check. Exchange Server 2000
all SP and patches on SBS 2kSP-1 all patches. 002 install

----------------------------------------------------------------------

Comment By: Steve McCullough (sm5669)
Date: 2003-05-19 18:24

Message:
Logged In: YES
user_id=782446

Same here.
Outlook 2002 on Win2000 with all the latest SPs and patches applied. Log file... warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "C:\PROGRA~1\PYTHON32\lib\site-packages\win32com\universal.py", line 170, in dispatch retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None) File "C:\PROGRA~1\PYTHON32\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_ return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_ return apply(func, args) File "E:\src\spambayes\Outlook2000\addin.py", line 654, in OnConnection File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\manager.py", line 15, in ? File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\msgstore.py", line 86, in ? File "E:\src\Installer\iu.py", line 296, in importHook exceptions.ImportError: No module named win32com.mapi ---------------------------------------------------------------------- Comment By: Tormod Hystad (thystad) Date: 2003-05-16 03:27 Message: Logged In: YES user_id=780089 Exactly the same as Nick here. Win2003 RC2, Outlook XP (Norwegian, version 10.2627.4219 SP2, configured with one Exchange (Standard) email account) and the 002 install package.
Thanks, Tormod ---------------------------------------------------------------------- Comment By: Nick Trautmann (nicktrautmann) Date: 2003-05-12 04:15 Message: Logged In: YES user_id=776857 Similar here: pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? 
exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\Nick\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize Thanks Nick ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-03-31 21:48 Message: Logged In: YES user_id=552329 Actually, I get this too. I've just switched to Outlook XP, so I'm not sure if this is the reason, or just that I'm doing a fresh install. 
The log includes the following traces: SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - 
creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-03-31 01:05 Message: Logged In: YES user_id=552329 Which version of the Outlook plugin are you using? (a) the latest CVS, (b) the 001 stand-alone installer, or (c) the 002 stand-alone installer? I know that the 001 installer has been known to have this problem (although it appeared to be fixed in 002). 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 From david.proske at dhs.state.tx.us Thu May 22 15:19:16 2003 From: david.proske at dhs.state.tx.us (Proske,David) Date: Thu May 22 15:22:17 2003 Subject: [Spambayes] bug in binary installer Message-ID: <615D36A41691D211BB0900A0C9D619990452EA5F@houmis03.dhs.state.tx.us> I have a Win2000 machine running Outlook 2000 SR1 and I get the error message shown in the attached error.jpeg. As for the logs: the first two times the log was empty, but the next two times I got the attached logs (spambayes3.log and spambayes4.log). David -------------- next part -------------- A non-text attachment was scrubbed... Name: error.jpeg Type: application/octet-stream Size: 49653 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030522/4c9b9f14/error-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 207 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030522/4c9b9f14/spambayes3-0001.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes4.log Type: application/octet-stream Size: 510 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030522/4c9b9f14/spambayes4-0001.obj From tim.one at comcast.net Thu May 22 16:20:10 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 22 15:23:11 2003 Subject: [Spambayes] List moeration In-Reply-To: Message-ID: The number of times per day a harmless msg to this list is sitting waiting for moderator approval has gotten out of hand, so I've turned off list moderation here. Maybe one spam per day will come through now. If/when we set up distinct lists for end users, developers, and announcements, they can have distinct policies.
Until then, if anyone is dying to get moderation back here, volunteer to be a moderator or stuff it . From bobcirrito at bellsouth.net Thu May 22 17:04:51 2003 From: bobcirrito at bellsouth.net (Bob Cirrito) Date: Thu May 22 22:19:28 2003 Subject: [Spambayes] Prerequisites Message-ID: <002701c3209d$69acea40$0100a8c0@Bobs3GHzDell> I'm running a small office network and using Outlook 2002 on Windows XP Pro. Spam is almost unmanageable. I don't have Python 2.x. Is it something I can download so I can download and install SpamBayes? Sounds like you're on to something decent to manage spam. Have you thought of a composite installation package to bundle all this for us little guys that really get socked with spam and who can least afford it. Bob Cirrito Protec Industries P.O. Box 17105 Plantation, FL 33318 USA (+1)(954)587-0894 phone (+1)(954)583-8490 fax bobcirrito@bellsouth.net email NEW www.ProtecIndustries.com www.PackagingForensics.com From T.A.Meyer at massey.ac.nz Fri May 23 14:15:14 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 22:20:51 2003 Subject: FAQ (Was RE: [Spambayes] /.?) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A00@its-xchg4.massey.ac.nz> > Four Stones Expressions wrote: > > We do have a FAQ page on spambayes.sourceforge.net... > > right. i am in the process of appending it with more recent > questions/answers. (i was thinking of something specific for outlook > users, but realized that extending the existing faq make more > sense ;-) BTW, I put up the FAQ that's there at the moment, but I'm not at all attached to it. My hope was that others would add to it if there was a start...I'm not attached to the layout either, if anyone wants to make it nicer. Note that it's based on (but a little behind) the FAQ.txt that's in CVS. At the time, I only had time to add questions/answers from mail I hadn't dealt with, plus there are a couple of extra ones I added. 
If someone is willing to trawl through the archives, or similarly come up with others, please do! =Tony Meyer From T.A.Meyer at massey.ac.nz Fri May 23 14:17:30 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 22:21:02 2003 Subject: [Spambayes] /.? Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A02@its-xchg4.massey.ac.nz> > i can clean it up, but for now i just grabbed a few of the latest > support questions/answers for posterity. i can continue to do so if > others think it will help slow the postings. (i can e-mail updates to > someone with access to the site if that works...) You can email me updates and I can add them in. You can probably email them to Skip also. The more 'cleaned up', the better, so that we can just paste them into the .ht file. (Or, if you really want to do all the work, then check out the faq.ht file from the website CVS and mail new versions of that...). =Tony Meyer From T.A.Meyer at massey.ac.nz Fri May 23 15:20:33 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 22:21:28 2003 Subject: [Spambayes] List moderation Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A56@its-xchg4.massey.ac.nz> [Tim Peters] > I've turned off list moderation here. Maybe one > spam per day will come through now. Am I right in thinking that the reason the Spambayes project started was to create a filter for mailman? (I'm sure I read that somewhere...) If so, does this mean that mail to the list is filtered through Spambayes? (Or is it filtered through anything else?) If it does get filtered, it would be interesting to know the stats. If it doesn't, then no-one on the list should have a problem, since our own copies should catch one measly spam per day. 
=Tony Meyer From kent at redbird.net Thu May 22 22:18:01 2003 From: kent at redbird.net (Kent Margraf) Date: Thu May 22 22:22:12 2003 Subject: [Spambayes] SpamBayes can't call universal dispatcher Message-ID: I'm seeing a problem with the binary distribution (002) in Outlook 2000 running on WinMe. Here's the log file: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding Any suggestions? From mikering at midwtech.com Thu May 22 15:56:48 2003 From: mikering at midwtech.com (Michael T. Ring) Date: Thu May 22 22:26:43 2003 Subject: [Spambayes] Comments Message-ID: I have 2 comments/questions about the Plug-In for Outlook. I love how it learns, but when I shut down Outlook when I log off and log back in the next day, it's forgotten everything and has to retrain. Also, why is the Advanced button deactivated? MIDWEST TECHNICAL CONSULTANTS, INC. Michael T. Ring, PLS Vice President 1832 Centre Point Drive Suite 106 Naperville IL 60563 630.505.0101 630.505.0318 fax -------------- next part -------------- A non-text attachment was scrubbed... 
Name: winmail.dat Type: application/ms-tnef Size: 1672 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030522/4be5152d/winmail.bin From T.A.Meyer at massey.ac.nz Fri May 23 15:25:53 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 22:27:13 2003 Subject: [Spambayes] Prerequisites Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A5D@its-xchg4.massey.ac.nz> > I'm running a small office network and using Outlook 2002 on > Windows XP Pro. Spam is almost unmanageable. I don't have > Python 2.x. Is it something I can download so I can download > and install SpamBayes? There are two things here: 1. You should be able to download the Outlook plugin binary and install that, and that's all you need (i.e. no need for Python). 2. There are known issues with the current installer and Windows/Outlook XP. Some people using Windows/Outlook XP have used the installer and everything is fine. For others, it's not. We haven't managed to figure out what the difference is yet - I think Mark suspected that it was Outlook 2002 SP-1, but that might not be the case. So you should download the installer and give it a go. Please let us know if it does install correctly, and which version (exactly) of Windows/Outlook you're using. If it doesn't, you have two options: 1. Wait for the next binary, which hopefully will fix this problem (and hopefully this is due in the next fortnight). 2. Install Python (from http://www.python.org), download the alpha2 release and use that. The Python installer is nice and simple to use, but the alpha2 isn't so much (which is why there is the binary). HTH. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri May 23 15:27:39 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 22:28:46 2003 Subject: [Spambayes] Quick Question Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A5F@its-xchg4.massey.ac.nz> > Will the spambayes work with outlook 11? What is Outlook 11? 
Is that the numeric version number for Outlook 2003? (My copy of Outlook 2002 has 10 in the version number, so that might make sense). If it is, then the answer is 'not yet'. We would welcome help in changing this, though. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri May 23 15:31:44 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 22:32:54 2003 Subject: [Spambayes] Outlook XP/Win 2000 Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A6B@its-xchg4.massey.ac.nz> > I wanted to let you know that the plugin > install did not work with my Outlook XP/Win 2000 > configuration. Do you think the new version of the plugin > installer will fix this problem? I got an error when it > tried to register spambayes_addin.dll and an > Abort/Retry/Ignore prompt. This error is being tracked here: [ 740198 ] Installation fails with regsvr error https://sourceforge.net/tracker/index.php?func=detail&aid=740198&group_id=61702&atid=498103 The new version of the installer *should* fix this problem. Given the number of times it's been reported recently, it needs to! Could you let us know (by attaching a comment to the above bug, preferably), exactly what version of Windows/Outlook you are using? (From the "About" menu item in the "Help" menu). =Tony Meyer From aaronlineberger at cox.net Thu May 22 23:32:16 2003 From: aaronlineberger at cox.net (Aaron Lineberger) Date: Thu May 22 22:33:26 2003 Subject: [Spambayes] (no subject) Message-ID: Hi, I just downloaded the .EXE installer and was running through the setup instructions when I ran into a problem when trying to set up the Spam Score Field (I'm running Outlook 2000). The instructions say that under "User Defined Fields" there will be a button with a "Spam" label. Sure enough it is there. :) However, when I drag the field to the Inbox header, the field Type is "Number" instead of "Percent" and the Format is "All Digits 1,234,567 -1,234,567" instead of "Rounded 65% -65%".
I have tried deleting this user defined field and recreating it with the Type and Format specified in the instructions to get it to show percentages, but to no avail; after recreating it, I drag it over to the Inbox header and the Type and Format revert back to "Number" and "All Digits ...". I know this seems minor, but it's kind of annoying having to look at the Spam Percentage as a long decimal number. Any ideas? Thanks in advance, and please reply to aaron@cs.odu.edu. Aaron From T.A.Meyer at massey.ac.nz Fri May 23 15:33:56 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 22:35:09 2003 Subject: [Spambayes] SpamBayes can't call universal dispatcher Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A6F@its-xchg4.massey.ac.nz> > I'm seeing a problem with the binary distribution (002) in > Outlook 2000 running on WinMe. Here's the log file: [...] > File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir > exceptions.LookupError: no codec search functions registered: > can't find encoding This is being tracked here: [ 725449 ] Binary plugin fails on Win9x https://sourceforge.net/tracker/index.php?func=detail&aid=725449&group_id=61702&atid=498103 At the moment the only solution is to install Python and use a source release (alpha2 or CVS) of Spambayes. This will definitely be fixed in the next version of the installer, which is due out in the next couple of weeks. =Tony Meyer From anthony at interlink.com.au Fri May 23 11:34:27 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Thu May 22 22:35:55 2003 Subject: [Spambayes] maybe a procmail question... In-Reply-To: <20030522112227.B15578@asu.edu> Message-ID: <200305230034.h4N0YSl03348@localhost.localdomain> >>> David Bear wrote > Since I'm loving spambayes, I'm wondering about how soon its procmail > rule fires. Should the hammie lines > > # spam beyes filter > :0 fw:hamlock > | /usr/bin/hammie.py -f -d -p $HOME/.hammiedb > > be the FIRST rule in .procmailrc?
Right now I have other rules that > fire and move messages to specific folders. I've notice that > sometimes spam get file in thes 'other' folders... > > so, this may be more of a procmail question -- is rule order > significant? Yes and yes. (Actually, my first procmailrc entry is actually a call to formail, to weed out duplicates) Anthony -- Anthony Baxter It's never too late to have a happy childhood. From joe at swelltech.com Thu May 22 17:15:33 2003 From: joe at swelltech.com (Joe Cooper) Date: Thu May 22 22:36:50 2003 Subject: [Spambayes] maybe a procmail question... In-Reply-To: <20030522112227.B15578@asu.edu> References: <20030522112227.B15578@asu.edu> Message-ID: <3ECD3DF5.10304@swelltech.com> David Bear wrote: > Since I'm loving spambayes, I'm wondering about how soon its procmail > rule fires. Should the hammie lines > > # spam beyes filter > :0 fw:hamlock > | /usr/bin/hammie.py -f -d -p $HOME/.hammiedb > > be the FIRST rule in .procmailrc? Right now I have other rules that > fire and move messages to specific folders. I've notice that > sometimes spam get file in thes 'other' folders... > > so, this may be more of a procmail question -- is rule order > significant? Yes. Rule order is significant in procmail. -- Joe Cooper Web caching appliances and support. http://www.swelltech.com From T.A.Meyer at massey.ac.nz Fri May 23 15:40:04 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 22:41:10 2003 Subject: [Spambayes] Moved my spambayes folder and killed the plugin Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A79@its-xchg4.massey.ac.nz> My turn to ask for help ;) I moved the folder that I keep the version of spambayes I'm using for the Outlook plugin yesterday. Now all sorts of things don't work... I noticed from the trace that it still referred to the old path, and recalling the invalid drive discussion recently, I figured I needed to delete my old .pyc files. 
So I deleted all of these, unregistered the plugin, and registered it again. Now when I open Outlook I get a "ERROR: There was an error initializing the SpamBayes addin\n\nPlease re-start Outlook and try again." message. (Restarting does not help). I've also tried deleting my config file, which also didn't help. I also tried manually removing the registry key, which also didn't help. The trace has lots of these: Loaded databases in 43.9123ms Error connecting to Outlook! And these: if self.manager.config.filter.enabled: AttributeError: 'NoneType' object has no attribute 'filter' Anyone got any advice? =Tony Meyer From bill at parducci.net Thu May 22 19:28:04 2003 From: bill at parducci.net (bill parducci) Date: Thu May 22 22:42:05 2003 Subject: [Spambayes] /.? References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A02@its-xchg4.massey.ac.nz> Message-ID: <3ECD7924.7020800@parducci.net> i just sourced the ht file via my browser and updated it. skip has my first cut of updates. i will start looking through the code for tidbits of info to salvage and stick them in the file. b Meyer, Tony wrote: >>i can clean it up, but for now i just grabbed a few of the latest >>support questions/answers for posterity. i can continue to do so if >>others think it will help slow the postings. (i can e-mail updates to >>someone with access to the site if that works...) > > > You can email me updates and I can add them in. You can probably email > them to Skip also. The more 'cleaned up', the better, so that we can > just paste them into the .ht file. (Or, if you really want to do all > the work, then check out the faq.ht file from the website CVS and mail > new versions of that...). > > =Tony Meyer From popiel at wolfskeep.com Thu May 22 14:12:31 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Thu May 22 22:42:35 2003 Subject: [Spambayes] maybe a procmail question... In-Reply-To: Message from David Bear of "Thu, 22 May 2003 11:22:27 PDT." 
<20030522112227.B15578@asu.edu> References: <20030522112227.B15578@asu.edu> Message-ID: <20030522201231.71E0E2DEA0@cashew.wolfskeep.com> In message: <20030522112227.B15578@asu.edu> David Bear writes: >Since I'm loving spambayes, I'm wondering about how soon its procmail >rule fires. Should the hammie lines > ># spam beyes filter >:0 fw:hamlock >| /usr/bin/hammie.py -f -d -p $HOME/.hammiedb > >be the FIRST rule in .procmailrc? Right now I have other rules that >fire and move messages to specific folders. I've notice that >sometimes spam get file in thes 'other' folders... > >so, this may be more of a procmail question -- is rule order >significant? Yes, rule order is potentially very significant. Personally, I have SB as the second rule in my procmail script; the first rule is for logging that yes, I got mail. I then have incremental training as my third and fourth rules (reinforcing SB's ham/spam decisions), then the memoing into myriad folders. Then I shuffle spam off into a spambox, and dump all remaining into my inbox. Corrections for mistakes are handled by my nightly retrain (which picks up of where I've moved messages that were misclassified). I actually _want_ spam to be memoed along with ham into my various folders; the folders are largely records of traffic on mailing lists, and I want complete records, not edited ones. If I wanted to keep spam out of the other folders, though, I'd just move the rule putting spam into my spambox so that it was rule number 5, above all the other folder foo. - Alex > >-- >David Bear >phone: 480-965-8257 >fax: 480-965-9189 >College of Public Programs/ASU >Wilson Hall 232 >Tempe, AZ 85287-0803 > "Beware the IP portfolio, everyone will be suspect of trespassing" > >_______________________________________________ >Spambayes mailing list >Spambayes@python.org >http://mail.python.org/mailman/listinfo/spambayes > > -------- From popiel at wolfskeep.com Thu May 22 14:04:11 2003 From: popiel at wolfskeep.com (T. 
Alexander Popiel) Date: Thu May 22 22:42:45 2003 Subject: [Spambayes] List moeration In-Reply-To: Message from Tim Peters of "Thu, 22 May 2003 15:20:10 EDT." References: Message-ID: <20030522200411.D9B832DE9C@cashew.wolfskeep.com> In message: Tim Peters writes: >Until then, if anyone is dying to get moderation back here, >volunteer to be a moderator or stuff it . I've got a trained SB instance that can be moderator! Oh, wait, we all do... ;-) - Alex From bill at parducci.net Thu May 22 14:53:31 2003 From: bill at parducci.net (bill parducci) Date: Thu May 22 22:42:58 2003 Subject: [Spambayes] maybe a procmail question... References: <20030522112227.B15578@asu.edu> Message-ID: <3ECD38CB.30808@parducci.net> here is my .procmailrc file. i call hammiefilter with my second recipe (not shown is the first recipe which allows me to kick off retraining via e-mail). next i have procmail throw identified spam into a bucket, then file what gets through.

MAILDIR=${HOME}/mail

:0fw
| /opt/spambayes/hammiefilter.py

:0
* ^X-Spambayes-Classification: spam
${MAILDIR}/spam

INCLUDERC=${HOME}/procmailrc.local (where i file stuff)

the only rule order that matters is that you run hammiefilter before checking for spambayes classification! ;-) b David Bear wrote: > Since I'm loving spambayes, I'm wondering about how soon its procmail > rule fires. Should the hammie lines > > # spam beyes filter > :0 fw:hamlock > | /usr/bin/hammie.py -f -d -p $HOME/.hammiedb > > be the FIRST rule in .procmailrc? Right now I have other rules that > fire and move messages to specific folders. I've notice that > sometimes spam get file in thes 'other' folders... > > so, this may be more of a procmail question -- is rule order > significant?
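The rule-order question quoted above can be illustrated with a toy model in Python. This is only a sketch of the behaviour the replies describe (recipes are tried top to bottom, a filter recipe rewrites the message and processing continues, the first delivering recipe that matches ends processing). It is not procmail itself; every function, recipe, and folder name here is made up for illustration, and the keyword check merely stands in for the real classifier.

```python
# Toy model of procmail recipe ordering. All names are hypothetical.

def classify(msg):
    """Stand-in for the hammie filter: returns a copy with a classification header."""
    tagged = dict(msg)
    looks_spammy = "viagra" in msg["body"].lower()  # placeholder for the real scoring
    tagged["X-Spambayes-Classification"] = "spam" if looks_spammy else "ham"
    return tagged

def is_spam(msg):
    return msg.get("X-Spambayes-Classification") == "spam"

def is_list_mail(msg):
    return "python.org" in msg.get("to", "")

# Each recipe is (kind, condition, filter function or destination folder).
FILTER = ("filter", lambda m: True, classify)
SPAMBOX = ("deliver", is_spam, "spam")
LISTS = ("deliver", is_list_mail, "lists")

def deliver(msg, recipes, default="inbox"):
    """Walk the recipes in order and return the folder the message ends up in."""
    for kind, condition, target in recipes:
        if not condition(msg):
            continue
        if kind == "filter":
            msg = target(msg)   # filter recipe: rewrite the message, keep going
        else:
            return target       # delivering recipe: processing stops here
    return default

spam_to_list = {"to": "spambayes@python.org", "body": "Cheap VIAGRA now"}

filter_first = [FILTER, SPAMBOX, LISTS]   # classify, then spambox, then filing
filing_first = [LISTS, FILTER, SPAMBOX]   # filing rule fires before the classifier

print(deliver(spam_to_list, filter_first))
print(deliver(spam_to_list, filing_first))
```

With the filter first, the spam lands in the spam folder; with the filing rule first, the same message is filed into the list folder before the classifier ever sees it, which matches the symptom David describes.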
> From T.A.Meyer at massey.ac.nz Fri May 23 15:43:14 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 22:44:24 2003 Subject: [Spambayes] Comments Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A7E@its-xchg4.massey.ac.nz> > I have 2 comments/questions about the Plug-In > for Outlook. I love how it learns, but when I shut down > Outlook when I log off and log back in the next day, it's > forgotten everything and has to retrain. It shouldn't do this. Can you open a bug on sourceforge (http://sf.net/projects/spambayes) and include a log file? (The troubleshooting.html file has instructions about how to do all of this). We can probably figure out why it's not saving/loading the database. > Also, why is the Advanced button deactivated? Because there aren't any advanced options that are currently exposed. This will probably change, but not for a wee while yet. =Tony Meyer From mhammond at skippinet.com.au Fri May 23 10:43:59 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu May 22 22:46:18 2003 Subject: [Spambayes] Outlook XP/Win 2000 In-Reply-To: Message-ID: <119a01c320bc$04f9dc10$530f8490@eden> We are tracking this bug in https://sourceforge.net/tracker/index.php?func=detail&aid=740198&group_id=61 702&atid=498103, and we certainly hope to have it fixed by the next release. Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Walker, Ken > Sent: Friday, 23 May 2003 4:43 AM > To: spambayes@python.org > Subject: [Spambayes] Outlook XP/Win 2000 > > > Hi! > > > > Great product! I found you via Udell's article > in InfoWorld. I > wanted to let you know that the plugin install did not work > with my Outlook > XP/Win 2000 configuration. Do you think the new version of the plugin > installer will fix this problem? I got an error when it > tried to register > spambayes_addin.dll and an Abort/Retry/Ignore prompt. 
> > > > Thanks, > > Ken > > ___ > > > Ken Walker > > D&B Application Specialist > > 800.526.0651 x6728 > > walkerk2@dnb.com > > > > > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > From T.A.Meyer at massey.ac.nz Fri May 23 15:46:49 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 22:48:00 2003 Subject: Incremental training (was RE: [Spambayes] maybe a procmail question... ) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A82@its-xchg4.massey.ac.nz> [Alex] > I then have incremental training as my third and fourth rules > (reinforcing SB's ham/spam decisions), then the memoing into > myriad folders. Is there any consensus about whether incremental training is a good thing or bad thing? Given that I mostly use the Outlook plugin, I've often wondered how things would go if it automatically trained on mail it filters (apart from the unsures). I recall that there was discussion about this a long way back, but I don't remember seeing anything conclusive. =Tony Meyer From skip at pobox.com Thu May 22 22:52:18 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 22:52:53 2003 Subject: [Spambayes] SpamBayes can't call universal dispatcher In-Reply-To: References: Message-ID: <16077.36066.723869.604917@montanaro.dyndns.org> Kent> I'm seeing a problem with the binary distribution (002) in Outlook Kent> 2000 running on WinMe. Here's the log file: ... Kent> exceptions.LookupError: no codec search functions registered: Kent> can't find encoding Kent> Any suggestions? This is a known problem with the installer and some versions of Windows (or I seem to recall that it is - I'm not a Windows person and there is so much Windows/Outlook traffic here these days I can't keep it all straight). 
I think you should be okay with the Spambayes source distribution, but that means you'll have to install Python 2.2.2 and maybe some of Mark Hammond's other goodies (win32all?). Maybe we need is a set of instructions - maybe we already have them somewhere - for people who aren't successful with the binary installer. Skip From tim.one at comcast.net Thu May 22 23:50:31 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 22 22:53:04 2003 Subject: [Spambayes] Experimental Ham/Spam imbalance setting In-Reply-To: <16E1010E4581B049ABC51D4975CEDB88619A8B@UKDCX001.uk.int.atosorigin.com> Message-ID: [Moore, Paul] > I have a friend who is using the POP3 proxy for his mail. He has a > 10:1 spam:ham imbalance, and he's found that he gets quite a high > proportion of unsures (from 200 or so mails a day, over 75% of which > are spam). His DB contains about 1300 spam and 150 spam. 150 ham, right (you said "spam" twice there)? That's not much of a ham sample regardless of option settings, and without knowing what he set his unsure range to, "quite a high proportion" may be astonishing or inescapable. > In addition to the unsure rate being high, he's finding that training > on the unsures isn't helping. I suspect that this is because the > Ham/Spam imbalance setting means that training one unsure as spam has > little effect If that option is enabled, yes, the system is already (relatively) glutted with spam, and won't pay much more attention to spam clues until it's taught more about ham. > (10% of the effect it'd have on a balanced DB?). Like anything is that simple . > Am I right in thinking that pop3proxy has this parameter set to true? I don't see anything to suggest that it is. The default is still False, and AFAICT only Outlook2000/default_bayes_customize.ini sets it True. > I know it is for the Outlook plugin (which I use, but I have a fairly > balanced DB these days). > > Is there any good view on whether the setting is a good thing yet? 
It's a pick-your-poison thing. If you have more spam than ham and keep this False, a higher false positive rate is the expected result (or a higher FN rate if you have more ham than spam). > My feeling is that the higher proportion of unsures, plus the > unresponsiveness to training, makes it an overall loss. The fellow you're talking about has a pathologically low number of ham; I wouldn't decide anything based on that, and I expect the option is actually False for him anyway. Sjoerd Mullender and Richie Hindle were the only ones here to report testing results at the time the option was introduced. Richie reported very good improvements with it True, Sjoerd reported mildly worse results with it True but did less testing than Richie did. It remains experimental because the evidence was/is spotty and mixed. > I got the same qualitative results myself when my DB was badly > unbalanced - that's why I made the effort to make and keep my DB > balanced. But I have no corresponding feel for the real-life results > with the parameter *not* set. I have about a 2-to-1 imbalance (favoring spam) in each of my three real-life databases, all Outlook-based and so all with this option True, and like it just fine. Now you've got all the evidence anyone has . > My friend has now purged his database and is starting from scratch, > to try to improve his results. He should have kept the 150 most-recent spam instead. > I mentioned the setting, but as it's a config file edit, rather than a > button in the UI, he didn't feel comfortable changing it (and AIUI, > he'd need to retrain as well - is that right?) Nope! We don't store spamprobs in a database, just word counts. experimental_ham_spam_imbalance_adjustment is used (only) in Classifier.probability() when a probability is (dynamically) computed. > Maybe the option should be exposed in the UI (but that may not be > sensible if changing it *does* require a retrain). 
For researchers that would be fine, but end users don't have a clue about what to do with exotic internal options. > If it is, then the help could explain that this option is only relevant > if your database has unequal numbers of ham and spam, and what the > disadvantages of each setting are (option set = more unsures, less > responsive to training; option unset = ???) Your friend could spend his time better by collecting more ham <0.9 wink>. > If the option isn't exposed, I'd vote for taking it out. We're not > getting any useful new feedback that I'm aware of. Since mass testing here stopped, we haven't got useful feedback on any of the non-default options. Since there wasn't enough info to decide about them when mass testing stopped, they still deserve a chance to survive. I hope mass testing resumes, but I can't drive it (no time). Until it does resume, the continued existence of these options seems appropriate. From mhammond at skippinet.com.au Fri May 23 13:49:04 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu May 22 23:02:55 2003 Subject: [Spambayes] Moved my spambayes folder and killed the plugin In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A79@its-xchg4.massey.ac.nz> Message-ID: <123201c320d5$dfe05610$530f8490@eden> > I noticed from the trace that it still referred to the old path, and > recalling the invalid drive discussion recently, I figured I needed to > delete my old .pyc files. So I deleted all of these, unregistered the > plugin, and registered it again. It shouldn't be necessary to delete the old .pyc files, but it couldn't hurt :) > And these: > if self.manager.config.filter.enabled: > AttributeError: 'NoneType' object has no attribute 'filter' It sounds like the config is hosed. Clearly it is a bug we don't handle that gracefully, but I am not sure how it gets so lost. Try deleting the config pickle maybe (actually, just move it away so we can try and recreate the error) Mark. 
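A minimal sketch of the kind of defensive load the 'NoneType' failure above suggests. Everything here is an assumption for illustration: the Config class and the load_config helper are hypothetical and are not the plugin's actual code. If the pickle does not hold the expected type, the file is moved aside (not deleted), so the bad state can still be examined later.

```python
import os
import pickle


class Config:
    """Hypothetical stand-in for the plugin's configuration object."""

    def __init__(self):
        self.filter_enabled = False


def load_config(path):
    """Load a pickled Config, falling back to defaults if it is unusable."""
    try:
        with open(path, "rb") as f:
            obj = pickle.load(f)
    except (OSError, EOFError, pickle.UnpicklingError):
        # Missing, empty, or unreadable file: start from defaults.
        return Config()
    if not isinstance(obj, Config):
        # For example, None was pickled by mistake. Keep the bad file
        # around for debugging instead of deleting it.
        os.replace(path, path + ".bad")
        return Config()
    return obj
```

With a check like this, a pickle containing None yields a fresh default configuration instead of an AttributeError further down the line.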
-------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1944 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030523/45fb33bd/winmail.bin From skip at pobox.com Thu May 22 23:06:05 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 23:06:46 2003 Subject: [Spambayes] Re: Compatibility matrix In-Reply-To: <119301c320ba$ae52e560$530f8490@eden> References: <001501c32080$a93177c0$8d80b042@dexter> <119301c320ba$ae52e560$530f8490@eden> Message-ID: <16077.36893.736314.215492@montanaro.dyndns.org> Mark> How about we say that Outlook XP SP2 is "known to work", and that Mark> earlier versions are "known to have some issues". We can then Mark> correct it when people correct us :) Done. Mark> I would drop the "XP Home" and "XP Professional" distinction - I Mark> seriously doubt that would have any impact, and we should have Mark> "2000 Pro/Server" distinctions if we make that one. Done. Should I drop the Win95 column as well? Nobody seems to be running that. Would Outlook 2000 or XP even run on it? You don't support any older versions of Outlook do you? Mark> Re the "Exchange Server" comments in your matrix: I would tone Mark> down the disclaimer, as it does now work fairly well in that Mark> environment (with one or 2 minor bugs specific to it). I even Mark> have a (very quiet) Exchange Server currently that I am able to Mark> run against. Done. Mark> I'm happy to hack something up on this if you like - just let me Mark> know when you have checked in your latest. It looks excellent Mark> though - thanks! Hack away. This stuff is beginning to make my head spin. Ignoring the fact that next to nothing has been written about the Mac-specific page, I'm going to go ahead and stitch the platform-specific pages into the left-hand margin. 
That should make it easier for people just stumbling onto the site to find the Windows-specific issues, which look like they are going to dominate for some time to come. Skip From skip at pobox.com Thu May 22 23:07:12 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 23:07:45 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB39C7@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB39C7@its-xchg4.massey.ac.nz> Message-ID: <16077.36960.486158.231011@montanaro.dyndns.org> Tony> This is with Outlook 2000 or 2002, right? The table currently has Tony> Outlook 2000 working, which I don't think is the case. If I was Tony> sure, I'd change it, but I'm not so... I thought Outlook 2000 was the version which got the most exercise. Tony> I think that the Win98(etc) rows should both be "1.0a2 source or Tony> CVS works, installer 002 fails". Done. I love rowspan=... ;-) Skip From skip at pobox.com Thu May 22 23:12:15 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 22 23:12:51 2003 Subject: FAQ (Was RE: [Spambayes] /.?) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A00@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A00@its-xchg4.massey.ac.nz> Message-ID: <16077.37263.719901.63141@montanaro.dyndns.org> Tony> BTW, I put up the FAQ that's there at the moment, but I'm not at Tony> all attached to it. My hope was that others would add to it if Tony> there was a start...I'm not attached to the layout either, if Tony> anyone wants to make it nicer. Bill sent me a new section today which I merged in. I also tweaked it a bit in other more cosmetic ways. Tony> Note that it's based on (but a little behind) the FAQ.txt that's Tony> in CVS. Would someone like to merge the two please? Maybe then the FAQ.txt file can just point to the website. It appears that most of our users are going to be quickly addicted to Mark's installer. 
S From popiel at wolfskeep.com Thu May 22 15:26:24 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Thu May 22 23:18:01 2003 Subject: [Spambayes] RE: Windows/Outlook compatibility matrix In-Reply-To: Message from Tim Stone of "Thu, 22 May 2003 11:31:43 CDT." References: <409800B2B5630547824C923B202D086613F504@et-prod5.express-times.net> Message-ID: <20030522212624.587032DEA7@cashew.wolfskeep.com> In message: Tim Stone writes: >On Thu, 22 May 2003 12:16:19 -0400, Bell, Robert >wrote: > >For other mail clients that use pop3 and imap, Tony Meyer did a cursory >test of about 25 clients. I'm not sure what the scope of his testing was, >or whether or not it exercised pop3proxy or smtpproxy, but his matrix >appears somewhere in the archive... I've lost my mbox due to a hd crash, so >I can't find it... Tony? Well, I'm not Tony, but here's his message: | (Message everything:28253) | From: "Meyer, Tony" | To: | Delivered-To: popiel@wolfskeep.com | X-MimeOLE: Produced By Microsoft Exchange V6.0.6334.0 | content-class: urn:content-classes:message | X-MS-Has-Attach: | X-MS-TNEF-Correlator: | Thread-Topic: [Spambayes] training WAS: aging information | Thread-Index: AcLXri68fD6CctHwRp+HG+RAEVwIxQAI8gmwAApz/sAAWteFwAACSPyw | X-OriginalArrivalTime: 21 Feb 2003 06:16:35.0734 (UTC) | FILETIME=[C957E360:01C2D970] | X-Spambayes-Classification: ham; 0.00 | Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1318D509@its-xchg4.massey.ac.nz> | Sender: spambayes-bounces+popiel=wolfskeep.com@python.org | | Subject: RE: [Spambayes] training | Date: Fri, 21 Feb 2003 19:14:50 +1300 | | Ok, I've done some testing, and here are the results from these clients. | They are all windows apps, as I don't have a linux or mac box handy to | test with. 
If anyone wants some other test done while I have all of | these installed, let me know :) | | [I did not do any Outlook testing, since the plugin does everything | anyone might possibly want] | | All clients tested, unsurprisingly, will include the message body in the | forwarded message. | | The following client/methods will forward all headers: | Eudora 5.2 Forward | Netscape Messenger (4.7) Forward (inline) | Netscape Messenger (4.7) Forward (attachment) Plain | Netscape Messenger (4.7) Forward (attachment) HTML | Netscape Messenger (4.7) Forward (attachment) Plain & HTML | M2 (Opera Mailer) Redirect | The Bat! Forward (RFC Headers visible) | The Bat! Alternative Forward | The Bat! Custom Template | Pegasus Mail Forward (all headers option set) | Calypso 3 Redirect | Becky! Redirect as attachment | | The following client/methods will *not* forward all headers: | Eudora 5.2 Redirect | Netscape Messenger (4.7) Forward (quoted) Plain | Netscape Messenger (4.7) Forward (quoted) HTML | Netscape Messenger (4.7) Forward (quoted) Plain & HTML | Outlook Express 6 Forward HTML (Base64) | Outlook Express 6 Forward HTML (None) | Outlook Express 6 Forward HTML (QP) | Outlook Express 6 Forward Plain (Base64) | Outlook Express 6 Forward Plain (None) | Outlook Express 6 Forward Plain (QP) | Outlook Express 6 Forward Plain (uuencoded) | http://www.endymion.com/products/mailman Forward | M2 (Opera Mailer) Forward | The Bat! Forward (RFC Headers not visible) | The Bat! Redirect | AllegroMail Forward | AllegroMail Redirect | PocoMail Forward | PocoMail Bounce | Pegasus Mail Forward (all headers option not set) | Calypso 3 Forward | Becky! Forward | Becky! Redirect | | So the only mailers (tested) that do not have a forwarding option that | will preserve headers are Outlook Express, AllegroMail, PocoMail and | (Endymion's) Mailman. Not too bad. 
| | =Tony Meyer | | _______________________________________________ | Spambayes mailing list | Spambayes@python.org | http://mail.python.org/mailman/listinfo/spambayes - Alex From tim.one at comcast.net Fri May 23 00:16:44 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 22 23:18:16 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: <16077.36960.486158.231011@montanaro.dyndns.org> Message-ID: [Skip] > I thought Outlook 2000 was the version which got the most exercise. I have three of those, and they all work fine (using CVS spambayes; I haven't tried the binary installer yet). From T.A.Meyer at massey.ac.nz Fri May 23 14:38:13 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 23:18:37 2003 Subject: [Spambayes] RE: Windows/Outlook compatibility matrix Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A21@its-xchg4.massey.ac.nz> > For other mail clients that use pop3 and imap, Tony Meyer did > a cursory test of about 25 clients. I'm not sure what the scope > of his testing was, or whether or not it exercised pop3proxy or > smtpproxy, but his matrix appears somewhere in the archive... The results appear in the docstring of smtpproxy.py. Basically I was testing whether a forwarded/bounced message would include the headers of the original message or not. This did mean that I was using pop3proxy and smtpproxy with all of them, but apart from a few test messages in each, I didn't do any other testing (although I did post an offer to the list to do other testing). A note could be constructed listing all the clients that have been tested, as long as it noted :) that testing has been limited. I've since changed machines, so I don't have them all installed anymore. I could run through them again, I suppose, if there was a reason to. From memory, none of them were any trouble to set up - the same old 'set to localhost' instruction applied.
I didn't test with multiple accounts, or anything like that, but again, it should work the same. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri May 23 16:09:34 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 23:18:49 2003 Subject: [Spambayes] Moved my spambayes folder and killed the plugin Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A9A@its-xchg4.massey.ac.nz> > > And these: > > if self.manager.config.filter.enabled: > > AttributeError: 'NoneType' object has no attribute 'filter' > > It sounds like the config is hosed. Clearly it is a bug we > don't handle that gracefully, but I am not sure how it gets > so lost. Try deleting the config pickle maybe (actually, > just move it away so we can try and recreate the error) Bah. I could have sworn that I did that already. But I obediently followed your instructions and lo and behold it fixed it. If I manually load the pickle in PythonWin the __class__ of the created object is "NoneType", so I gather that somehow I pickled None instead of the config object. I have no idea how things went so wrong! I suppose there could be a check when loading the config pickle to see if it is the correct type, but I'm not sure it's worth it (how many people are going to do what I did?) Thanks for the help! =Tony Meyer From tim.one at comcast.net Fri May 23 00:24:34 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 22 23:25:39 2003 Subject: Incremental training (was RE: [Spambayes] maybe a procmail question... ) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A82@its-xchg4.massey.ac.nz> Message-ID: [Alex] >> I then have incremental training as my third and fourth rules >> (reinforcing SB's ham/spam decisions), then the memoing into >> myriad folders. [Meyer, Tony] > Is there any consensus about whether incremental training is a good > thing or bad thing? I'm unclear on what incremental training means here. 
When I use the phrase, it means training on a few new msgs, adding to a trained database that already exists. > Given that I mostly use the Outlook plugin, I've often wondered how things > would go if it automatically trained on mail it filters (apart from the > unsures). > > I recall that there was discussion about this a long way back, but I > don't remember seeing anything conclusive. This is more like what I'd call self-training: letting the classifier train on its own decisions. Rob Hooft and I both experimented with this and got amazingly good results, contradicting the received wisdom in this area. From tim.one at comcast.net Thu May 22 19:25:19 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 22 23:30:27 2003 Subject: [Spambayes] List moeration In-Reply-To: <16077.12882.255940.607868@montanaro.dyndns.org> Message-ID: [Tim] > The number of times per day a harmless msg to this list is > sitting waiting for moderator approval has gotten out of > hand, so I've turned off list moderation here. Maybe one > spam per day will come through now. > [skip@pobox.com] > What's the term? We now eat our own dog food? Not yet -- we simply get an average of about one spam per day sent to this list. Nothing will stop it without moderation, because the SpamAssassin settings for this list have effectively disabled SA (in the early days, *most* of my posts to this list were blocked by SA, and simply because I have a large penis ). From tim.one at comcast.net Fri May 23 00:31:30 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 22 23:32:34 2003 Subject: [Spambayes] List moderation In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A56@its-xchg4.massey.ac.nz> Message-ID: [Tony Meyer] > Am I right in thinking that the reason the Spambayes project started > was to create a filter for mailman? (I'm sure I read that > somewhere...) That's right -- Barry Warsaw suckered me into this .
Since then, our mutual employer has found things for us to do that try not to involve Mailman or spambayes to the fullest extent of the law. > If so, does this mean that mail to the list is filtered through > Spambayes? No, the whole Mailman connection remains unexplored. > (Or is it filtered through anything else?) python.org's highly tuned mix of SpamAssassin and assorted scripts, hacks, and magical potions. SA is effectively castrated for this specific list, because in the early days it kept blocking the meat of the discussion. > If it does get filtered, it would be interesting to know the stats. In the dim past I ran large-scale tests of a single spambayes classifier applied to all of python.org's non-personal traffic (mostly tech mailing lists relating to Python and Zope, several of which are high-volume). It did great. Nobody had (or has) time to deploy it for real, though. > If it doesn't, then no-one on the list should have a problem, since > our own copies should catch one measly spam per day. At least after you train your classifier to stop thinking that the 500 header-line clues saying "this came from the spambayes list!" are strong ham clues, yes. From T.A.Meyer at massey.ac.nz Fri May 23 16:37:35 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 23:38:48 2003 Subject: Incremental training (was RE: [Spambayes] maybe a procmailquestion... ) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3AB8@its-xchg4.massey.ac.nz> > I'm unclear on what incremental training means here. When I > use the phrase, it means training on a few new msgs, adding > to a trained database that already exists. Good point. I mean "self-training" then :) > This is more like what I'd call self-training: letting the > classifier train on its own decisions. Rob Hooft and I both > experimented with this and got amazingly good results, > contradicting the received wisdom in this area.
I'm surprised you didn't push Mark to get the Outlook plugin to self train :) What method did you & Rob use to test this? Is it something that others could easily duplicate? =Tony Meyer From tim at fourstonesexpressions.com Thu May 22 23:40:11 2003 From: tim at fourstonesexpressions.com (Tim Stone) Date: Thu May 22 23:40:59 2003 Subject: Incremental training (was RE: [Spambayes] maybe a procmailquestion... ) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3AB8@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3AB8@its-xchg4.massey.ac.nz> Message-ID: On Fri, 23 May 2003 15:37:35 +1200, Meyer, Tony wrote: > I'm surprised you didn't push Mark to get the Outlook plugin to self > train :) What method did you & Rob use to test this? Is it something > that others could easily duplicate? It's a simple call to classifier.learn based upon the classification result.... c'est moi - TimS From T.A.Meyer at massey.ac.nz Fri May 23 15:11:10 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 23:44:22 2003 Subject: [Spambayes] bug in binary installer Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A51@its-xchg4.massey.ac.nz> > I have a win2000 running outlook 2000 sr1 and I get the > following message > <> and the logs are as follows the first two time > the log was empty but the next two time I got the following > logs. <> > <> This error is being tracked here: [ 740198 ] Installation fails with regsvr error https://sourceforge.net/tracker/index.php?func=detail&aid=740198&group_id=61702&atid=498103 For the moment, your only option is probably to install Python and use the source. Hopefully the binary due out in the next couple of weeks will resolve this (and other) install problem(s). =Tony Meyer From T.A.Meyer at massey.ac.nz Fri May 23 16:48:01 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 23:49:13 2003 Subject: Incremental training (was RE: [Spambayes] maybe a procmailquestion...
) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3ACA@its-xchg4.massey.ac.nz> > > I'm surprised you didn't push Mark to get the Outlook > plugin to self > > train :) What method did you & Rob use to test this? Is > it something > > that others could easily duplicate? > > It's a simple call to classifier.learn based upon the classification > result.... Poor wording on my part. My question wasn't how to do the "self-training", but how Tim & Rob did their testing (given that it's (the other) Tim, no doubt it was systematic and not just that it seemed to do better :) =Tony Meyer From popiel at wolfskeep.com Thu May 22 21:49:38 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Thu May 22 23:50:12 2003 Subject: Incremental training (was RE: [Spambayes] maybe a procmailquestion... ) In-Reply-To: Message from "Meyer, Tony" of "Fri, 23 May 2003 15:37:35 +1200." <1ED4ECF91CDED24C8D012BCF2B034F1301AB3AB8@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3AB8@its-xchg4.massey.ac.nz> Message-ID: <20030523034938.157372DE9C@cashew.wolfskeep.com> In message: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3AB8@its-xchg4.massey.ac.nz> "Meyer, Tony" writes: >> I'm unclear on what incremental training means here. When I >> use the phrase, it means training on a few new msgs, adding >> to a trained database that already exists. > >Good point. I mean "self-training" then :) By Tim's definition, I do self-training, with occasional manual corrections as I notice things and refile. >> This is more like what I'd call self-training: letting the >> classifier train on its own decisions. Rob Hooft and I both >> experimented with this and got amazingly good results, >> contradicting the received wisdom in this area. > >I'm surprised you didn't push Mark to get the Outlook plugin to self >train :) What method did you & Rob use to test this? Is it something >that others could easily duplicate?
If you look in CVS, under the testtools directory, there's incremental.HOWTO.txt detailing how I did some testing along the self-training lines. My scripts are there, too. Enjoy. - Alex From T.A.Meyer at massey.ac.nz Fri May 23 14:41:55 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu May 22 23:50:43 2003 Subject: [Spambayes] Performance of CVS Outlook addin Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A27@its-xchg4.massey.ac.nz> [Brad] > I don't think you can 'post correct' the hapaxes you already have > in your wordlist, so unless you're willing to start > with an empty database I don't think there's anything you can do > about this. [Skip] > I can, but only because I keep all my training inputs. I suspect most of those that contributed (or will) do keep mail. (Those people that would end up on a -devel list, basically). Of the lists I gave, the one of my personal mail would include such hapaxes, the training ones would not. Let me know if you want a list build with current cvs. =Tony Meyer From db3l at fitlinxx.com Fri May 23 04:52:26 2003 From: db3l at fitlinxx.com (David Bolen) Date: Thu May 22 23:52:27 2003 Subject: [Spambayes] Re: Trouble w/ Outlook buttons References: Message-ID: Tim Peters writes: > The attached came from a private correspondent. I don't recognize the > symptoms. I urged them to open a bug report on SF, as I don't see anything > particularly relevant there now. I'm *assuming* they used the binary > installer, and I don't know which version of Outlook or OS is in use. Have > asked for clarification. In the past I managed to get this sort of scenario (multiple sets of buttons while doing lots of installing/uninstalling of the add-in). You sort of end up with a stranded set of buttons that aren't tied to a current add-in installation. I found that just using the Tools->Customize options, then selecting the Standard toolbar under the Toolbars tab and choosing Reset cleared out all of the SpamBayes related buttons. 
The proper set of buttons (matching my currently registered add-in) were then re-created when I next started Outlook as the addin loaded. -- David From tim.one at comcast.net Fri May 23 00:56:27 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 22 23:57:32 2003 Subject: Incremental training (was RE: [Spambayes] maybe a procmailquestion... ) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3AB8@its-xchg4.massey.ac.nz> Message-ID: [Meyer, Tony] > I'm surprised you didn't push Mark to get the Outlook plugin to self > train :) What method did you & Rob use to test this? Is it something > that others could easily duplicate? It's in the archives ... somewhere . It's dead easy in principle, and not much harder in practice ... ah, TimS explained it more briefly than I could (thanks, Tim!). I think we lost interest in training strategies when real-life deployment showed excellent results in a few days, and several long-time users stopped paying any attention to training anymore. My 3 DBs each have about 1000 msgs, and that's all. I rarely train on anything anymore. Every now and again I blow a database away and start over, just to clear the boredom. Every training strategy I've tried works fine, *except* for purely mistake-based training from the very start. In the two+ weeks I stuck to that, I didn't get above 150 trained msgs total (against about 600 emails per day), and the Unsures started and remained maddening (mostly blatant spam). That was highly hapax-driven, and hapaxes are brittle (they catch near-duplicates of msgs you've trained on, but don't seem ever to generalize). So I can have a tiny database that's more than good enough to make me very happy, or push Mark toward training on 600 emails a day and have a gigantic database that wouldn't make me any happier. This wasn't a hard choice .
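The self-training idea discussed in this thread (train on the classifier's own confident decisions, leave the unsures for a human) can be sketched roughly as follows. This is a toy under stated assumptions: ToyClassifier is a deliberately naive stand-in and not the SpamBayes classifier, self_train is a hypothetical helper, and the 0.2 / 0.9 cutoffs are assumed values chosen for illustration.

```python
from collections import Counter


class ToyClassifier:
    """Naive token-count classifier; a stand-in, NOT the SpamBayes one."""

    def __init__(self):
        self.spam = Counter()   # token -> number of spam msgs containing it
        self.ham = Counter()    # token -> number of ham msgs containing it
        self.nspam = 0
        self.nham = 0

    def learn(self, tokens, is_spam):
        counts = self.spam if is_spam else self.ham
        counts.update(set(tokens))
        if is_spam:
            self.nspam += 1
        else:
            self.nham += 1

    def score(self, tokens):
        """Average per-token spam ratio; unseen tokens count as 0.5."""
        probs = []
        for tok in set(tokens):
            s = self.spam[tok] / max(self.nspam, 1)
            h = self.ham[tok] / max(self.nham, 1)
            probs.append(s / (s + h) if s + h else 0.5)
        return sum(probs) / len(probs) if probs else 0.5


def self_train(classifier, tokens, ham_cutoff=0.2, spam_cutoff=0.9):
    """Classify a message and train on the result unless it is unsure."""
    score = classifier.score(tokens)
    if score >= spam_cutoff:
        classifier.learn(tokens, True)
        return "spam"
    if score <= ham_cutoff:
        classifier.learn(tokens, False)
        return "ham"
    return "unsure"   # left untrained until someone classifies it by hand
```

Purely mistake-based training is the opposite policy: learn only when the classifier was wrong or unsure, which is why it yields the small, hapax-heavy database described above.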
From mail at peterbe.com Fri May 23 01:41:20 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Fri May 23 00:10:42 2003 Subject: [Spambayes] Suddenly a lot of errors In-Reply-To: References: <5.2.0.9.0.20030512151849.0223bd98@mail.grenna.net> <5.2.0.9.0.20030512151849.0223bd98@mail.grenna.net> Message-ID: <5.2.0.9.0.20030522222020.022b9d98@mail.grenna.net> At 12:44 2003-05-22 -0500, Tim Stone wrote: >On Mon, 12 May 2003 15:21:50 +0100, Peter Bengtsson wrote: > > >>Any ideas what this might mean or what I have done to deserve this? > >Well that's an interesting one... did you install bsddb3, or are you using >the default windows python 2.2 install? If so, you're likely using >dumbdbm (the default dbm implementation) which is aptly named... I don't >know if this would be the problem or not, I use dumbdbm all the time, but >this clearly is a problem in the database side of things, and it's the >only thing I can think of right off hand... Okey I will install this bsddb3 whatever it is. Is there anything else I need to do to get this going with my spambayes? ... Now I've done that. Plus that I checked out spambayes from CVS. Now the pop3_proxy works a little bit better. 
When I press the Save & Shutdown I get this error message: 500 Server error Traceback (most recent call last): File "spambayes\Dibbler.py", line 398, in found_terminator getattr(plugin, name)(**params) File "spambayes\UserInterface.py", line 328, in onSave self._doSave() File "spambayes\UserInterface.py", line 321, in _doSave classifier.store() File "spambayes\storage.py", line 183, in store self.db[key] = val.__getstate__() File "spambayes\classifier.py", line 66, in __getstate__ return (self.spamcount, AttributeError: 'WordInfo' object has no attribute 'spamcount' From noreply at sourceforge.net Thu May 22 22:05:22 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:10:57 2003 Subject: [Spambayes] [ spambayes-Bugs-742066 ] SpamBayes doesn't install properly Message-ID: Bugs item #742066, was opened at 2003-05-22 23:43 Message generated for change (Comment added) made by yetanotherrusty You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742066&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Stephen Green (sgreen0) Assigned to: Mark Hammond (mhammond) Summary: SpamBayes doesn't install properly Initial Comment: Intalling SpamBayes into Outlook 2000 running Win98 SE. Installation seems to go fine. But SpamBayes tools aren't available in Outlook. Tools/Options/Other/Advanced Options/Com Addins shows the SpamBayes addin unchecked. I check it, click OK, but when I return it is unchecked again. There are errors in the log files, which I'm attaching. Thanks for your help. Stephen Green Long Beach, CA ---------------------------------------------------------------------- Comment By: Rusty Carr (yetanotherrusty) Date: 2003-05-23 04:05 Message: Logged In: YES user_id=785104 I get the same thing using Windows 98. 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742066&group_id=61702 From noreply at sourceforge.net Thu May 22 21:55:20 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:11:18 2003 Subject: [Spambayes] [ spambayes-Bugs-741797 ] Does not filter incoming mail Message-ID: Bugs item #741797, was opened at 2003-05-22 10:22 Message generated for change (Comment added) made by kimtracy You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Kim Tracy (kimtracy) Assigned to: Mark Hammond (mhammond) Summary: Does not filter incoming mail Initial Comment: Does not actively filter incoming e-mails and leaves Spam field empty. Appears to properly filter mail when doing a "filter now" and "delete as Spam" works fine. The log file is attached and it does contain several error messages that may be of help. This is the binary version running on Win2K using outlook2000. ---------------------------------------------------------------------- >Comment By: Kim Tracy (kimtracy) Date: 2003-05-22 22:55 Message: Logged In: YES user_id=784661 Yes, I now have several hundred certain spam that have been trained into the DB and have the enable filtering box ticked. I also delete all certain spam by moving it to deleted_items. I don't do anything to the unsures (which is what I would prefer). I also have set up the folders to filter. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-22 21:17 Message: Logged In: YES user_id=552329 In the manager dialog, is the enable filtering box ticked? >From the log, when you first started you only had two spam - there needs to be at least 5 to enable filtering. 
Later on you trained more spam, so there would be enough, but it doesn't show whether you enabled it or not. (In addition, you need to set up which folder(s) to filter, which folder to deliver unsures to, and set the action to "move" or "copy" rather than "untouched"). Unfortunately the error messages in the log aren't of use; they are hangovers from the installer (that will be fixed in the next release). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 From T.A.Meyer at massey.ac.nz Fri May 23 15:00:28 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri May 23 00:12:12 2003 Subject: [Spambayes] Experimental Ham/Spam imbalance setting Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A3F@its-xchg4.massey.ac.nz> > Am I right in > thinking that pop3proxy has this parameter set to true? No. The Outlook plugin is the only app with a different set of defaults, everything else uses what's in Options.py. There it's set to False. (Unless he has a config file that sets it to True, of course). > Is there any good view on whether the setting is a good thing > yet? Mark tried to initiate a round of testing a little while back, but there wasn't much in the way of response. My fp went from 0% to 0.2% and fn from 4% to 1% (h:s 6.8:1), which I call a loss. Mark's fp went from 0.8% to 0.5% and fn from 0.4% to 0.6% (h:s 1:2.2). All this really says is that we don't have enough data to make a call ;) > (and AIUI, he'd need to retrain as well - is that right?) No. The database holds raw numbers of how many messages the token has been seen in. The adjustment only comes into effect when calculating the score. So the thing to do (although time consuming) would be to score a message with the adjustment off, then change the setting, rescore, and compare. 
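The point above, that the database keeps only raw counts and the imbalance adjustment is applied when a score is calculated, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the names `TokenCounts` and `token_spamprob`, the default prior values, and the exact form of the adjustment are hypothetical and simplified, not a copy of what classifier.py does. The only thing the sketch is meant to show is why toggling the option needs a rescore rather than a retrain.

```python
from dataclasses import dataclass


@dataclass
class TokenCounts:
    """Raw training counts for one token (hypothetical record layout)."""
    spamcount: int = 0  # number of spam messages the token was seen in
    hamcount: int = 0   # number of ham messages the token was seen in


def token_spamprob(counts, nspam, nham, adjust_imbalance,
                   unknown_prob=0.5, unknown_strength=0.45):
    """Return a spam probability for one token, computed at scoring time.

    nspam and nham are the numbers of trained spam and ham messages
    (both assumed to be greater than zero). The prior values and the
    adjustment formula are illustrative assumptions.
    """
    hamratio = counts.hamcount / nham
    spamratio = counts.spamcount / nspam
    if hamratio + spamratio == 0:
        # Token never seen in training: fall back to the prior.
        return unknown_prob
    prob = spamratio / (hamratio + spamratio)

    if adjust_imbalance:
        # Discount evidence from the over-represented class.
        spam2ham = min(nspam / nham, 1.0)
        ham2spam = min(nham / nspam, 1.0)
    else:
        spam2ham = ham2spam = 1.0

    # Effective amount of evidence; less evidence pulls the result
    # towards unknown_prob.
    n = counts.hamcount * spam2ham + counts.spamcount * ham2spam
    return (unknown_strength * unknown_prob + n * prob) / (unknown_strength + n)


if __name__ == "__main__":
    # Same stored counts, scored twice with the option off and then on.
    counts = TokenCounts(spamcount=3, hamcount=1)
    for flag in (False, True):
        print(flag, token_spamprob(counts, nspam=100, nham=680,
                                   adjust_imbalance=flag))
```

Because the flag is only read inside the scoring function, the stored counts are identical before and after the comparison, which is what makes the score, change the setting, rescore, and compare procedure possible on an existing database.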
> Maybe the option should be exposed in the UI (but that may > not be sensible if changing it *does* require a retrain). It would be a one line edit to add this to the pop3proxy ui (see, told you that the options makeover was a good thing ). Does everyone agree that it should be there? > If it is, then the help could explain that this option is only > relevant if your database has unequal numbers of ham and > spam, It does that. > and what the disadvantages of each setting are (option > set = more unsures, less responsive to training; option unset = ???) It doesn't do that. I don't know that I would be comfortable writing this; hopefully one of the stats-ier people would do it. > If the option isn't exposed, I'd vote for taking it out. > We're not getting any useful new feedback that I'm aware of. The trickiest thing here is that the Outlook users (who probably are a majority now?) do use the option, while by default, the others don't. We really need people to put in some time testing to see if it is worth keeping or not. If it is, I gather from (Tim P's?) comment in the code that it can just be enabled for everyone. =Tony Meyer From noreply at sourceforge.net Thu May 22 20:17:00 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:12:30 2003 Subject: [Spambayes] [ spambayes-Bugs-741797 ] Does not filter incoming mail Message-ID: Bugs item #741797, was opened at 2003-05-23 03:22 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Kim Tracy (kimtracy) Assigned to: Mark Hammond (mhammond) Summary: Does not filter incoming mail Initial Comment: Does not actively filter incoming e-mails and leaves Spam field empty. Appears to properly filter mail when doing a "filter now" and "delete as Spam" works fine. 
The log file is attached and it does contain several error messages that may be of help. This is the binary version running on Win2K using outlook2000. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 14:17 Message: Logged In: YES user_id=552329 In the manager dialog, is the enable filtering box ticked? From the log, when you first started you only had two spam - there needs to be at least 5 to enable filtering. Later on you trained more spam, so there would be enough, but it doesn't show whether you enabled it or not. (In addition, you need to set up which folder(s) to filter, which folder to deliver unsures to, and set the action to "move" or "copy" rather than "untouched"). Unfortunately the error messages in the log aren't of use; they are hangovers from the installer (that will be fixed in the next release). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 From noreply at sourceforge.net Thu May 22 20:09:47 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:12:58 2003 Subject: [Spambayes] [ spambayes-Bugs-740198 ] Installation fails with regsvr error Message-ID: Bugs item #740198, was opened at 2003-05-20 13:12 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740198&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Rob de Santos (rdesantos) Assigned to: Mark Hammond (mhammond) Summary: Installation fails with regsvr error Initial Comment: I'm trying to install SpamBayes into Outlook XP on Win98SE. The install fails with a regsvr error. Attempts at manually registering the file also fail. Here's the log: Traceback (most recent call last): File "", line 10, in ? 
File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\addin.py", line 59, in ? File "E:\src\pythonex\com\win32com\client\gencache.py", line 368, in EnsureModule pywintypes.com_error: (-2147312566, 'Error loading type library/DLL.', None, None) SystemError: PyErr_NormalizeException() called without exception Traceback (most recent call last): File "", line 11, in ? AttributeError: 'module' object has no attribute 'OutlookAddin' Any help? Cheers, Rob ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 14:09 Message: Logged In: YES user_id=552329 From the list: "I have a win2000 running outlook 2000 sr1 and I get the following message <> and the logs are as follows the first two times the log was empty but the next two times I got the following logs. <> <> David" The jpeg was just a screenshot of the regsvr error. Log3 is: "Traceback (most recent call last): File "", line 15, in DllRegisterServer File "E:\src\pythonex\com\win32com\server\register.py", line 388, in RegisterClasses IndexError: list index out of range" and log4 is: "warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) Unregistered: SpamBayes.OutlookAddin Traceback (most recent call last): File "", line 22, in DllUnregisterServer File "E:\src\spambayes\Outlook2000\addin.py", line 791, in UnregisterAddin File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named _winreg" ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-23 11:45 Message: Logged In: YES user_id=14198 Noting a dupe in https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741914&group_id=61702 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740198&group_id=61702 From noreply at sourceforge.net Thu May 22 19:27:43 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:13:30 2003 Subject: [Spambayes] [ spambayes-Bugs-742102 ] SpamBayes doesn't install properly Message-ID: Bugs item #742102, was opened at 2003-05-22 18:27 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742102&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Stephen Green (sgreen0) Assigned to: Mark Hammond (mhammond) Summary: SpamBayes doesn't install properly Initial Comment: Installing SpamBayes into Outlook 2000 running Win98 SE. Installation seems to go fine. But SpamBayes tools aren't available in Outlook. Tools/Options/Other/Advanced Options/Com Addins shows the SpamBayes addin unchecked. I check it, click OK, but when I return it is unchecked again. There are errors in the log files, which I'm attaching. Thanks for your help. 
Stephen Green Long Beach, CA ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742102&group_id=61702 From noreply at sourceforge.net Thu May 22 17:45:40 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:14:03 2003 Subject: [Spambayes] [ spambayes-Bugs-740198 ] Installation fails with regsvr error Message-ID: Bugs item #740198, was opened at 2003-05-20 11:12 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740198&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Rob de Santos (rdesantos) Assigned to: Mark Hammond (mhammond) Summary: Installation fails with regsvr error Initial Comment: I'm trying to install SpamBayes into Outlook XP on Win98SE. The install fails with a regsvr error. Attempts at manually registering the file also fail. Here's the log: Traceback (most recent call last): File "", line 10, in ? File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\addin.py", line 59, in ? File "E:\src\pythonex\com\win32com\client\gencache.py", line 368, in EnsureModule pywintypes.com_error: (-2147312566, 'Error loading type library/DLL.', None, None) SystemError: PyErr_NormalizeException() called without exception Traceback (most recent call last): File "", line 11, in ? AttributeError: 'module' object has no attribute 'OutlookAddin' Any help? 
Cheers, Rob ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-23 09:45 Message: Logged In: YES user_id=14198 Noting a dupe in https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741914&group_id=61702 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740198&group_id=61702 From noreply at sourceforge.net Thu May 22 17:45:15 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:14:30 2003 Subject: [Spambayes] [ spambayes-Bugs-741914 ] Unable to Register DLL Error Message-ID: Bugs item #741914, was opened at 2003-05-23 04:26 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741914&group_id=61702 Category: None Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Gary Thomas (garyt977) Assigned to: Nobody/Anonymous (nobody) Summary: Unable to Register DLL Error Initial Comment: I'm running WinXP and Office 2000 Professional. When I attempt to install the binary version on my system I get the following error: ---- C:\Program files\Spambayes Outlook Addin\Spambayes_addin.dll Unable to register the DLL/OCX:DllRegister failed:code 0x00000000 Abort, Retry, Fail ---- After searching my hard drive, I was unable to find a log file for this program. 
---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-23 09:45 Message: Logged In: YES user_id=14198 Dupe of https://sourceforge.net/tracker/index.php?func=detail&aid=740198&group_id=61702&atid=498103 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741914&group_id=61702 From noreply at sourceforge.net Thu May 22 17:43:55 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:15:02 2003 Subject: [Spambayes] [ spambayes-Bugs-742066 ] SpamBayes doesn't install properly Message-ID: Bugs item #742066, was opened at 2003-05-22 16:43 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742066&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Stephen Green (sgreen0) Assigned to: Mark Hammond (mhammond) Summary: SpamBayes doesn't install properly Initial Comment: Installing SpamBayes into Outlook 2000 running Win98 SE. Installation seems to go fine. But SpamBayes tools aren't available in Outlook. Tools/Options/Other/Advanced Options/Com Addins shows the SpamBayes addin unchecked. I check it, click OK, but when I return it is unchecked again. There are errors in the log files, which I'm attaching. Thanks for your help. 
Stephen Green Long Beach, CA ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742066&group_id=61702 From noreply at sourceforge.net Thu May 22 17:40:22 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:15:34 2003 Subject: [Spambayes] [ spambayes-Bugs-712480 ] Outlook 2002 (XP) installation fails Message-ID: Bugs item #712480, was opened at 2003-03-31 15:47 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul Marrero (pmarrero) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2002 (XP) installation fails Initial Comment: I use office XP with the Outlook client. It appears that the registration was successful but I cannot find any menu buttons. XP clipboard does appear to have the Icons. The command line train works. Not sure where to go from here. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-23 09:40 Message: Logged In: YES user_id=14198 *sob* - so it certainly fails with Outlook SP2 here? ecvogel - is your traceback identical to this one? ---------------------------------------------------------------------- Comment By: Eric C. Vogel (ecvogel) Date: 2003-05-23 05:20 Message: Logged In: YES user_id=784818 Same here. Outlook XP all SP and patches, Win XP Pro SP1 all patches. The file is registered and in the COM and has the check. Exchange Server 2000 all SP and patches on SBS 2kSP-1 all patches. 002 install ---------------------------------------------------------------------- Comment By: Steve McCullough (sm5669) Date: 2003-05-20 08:24 Message: Logged In: YES user_id=782446 Same here. 
Outlook 2002 on Win2000 with all the latest SPs and patches applied. Log file... warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "C:\PROGRA~1\PYTHON32\lib\site-packages\win32com\universal.py", line 170, in dispatch retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None) File "C:\PROGRA~1\PYTHON32\lib\site-packages\win32com\server\policy.py", line 322, in _InvokeEx_ return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site-packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site-packages\win32com\server\policy.py", line 541, in _invokeex_ return apply(func, args) File "E:\src\spambayes\Outlook2000\addin.py", line 654, in OnConnection File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\manager.py", line 15, in ? File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\msgstore.py", line 86, in ? File "E:\src\Installer\iu.py", line 296, in importHook exceptions.ImportError: No module named win32com.mapi ---------------------------------------------------------------------- Comment By: Tormod Hystad (thystad) Date: 2003-05-16 17:27 Message: Logged In: YES user_id=780089 Exactly same as Nick here. Win2003 RC2, Outlook XP (Norwegian, version 10.2627.4219 SP2, configured with one Exchange (Standard) email account) and the 002 install package. 
Thanks, Tormod ---------------------------------------------------------------------- Comment By: Nick Trautmann (nicktrautmann) Date: 2003-05-12 18:15 Message: Logged In: YES user_id=776857 Similar here: pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? 
exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\Nick\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize Thanks Nick ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-01 12:48 Message: Logged In: YES user_id=552329 Actually, I get this too. I've just switched to Outlook XP, so I'm not sure if this is the reason, or just that I'm doing a fresh install. 
The log includes the following traces: SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - 
creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-03-31 16:05 Message: Logged In: YES user_id=552329 Which version of the Outlook plugin are you using? (a) the latest CVS, (b) the 001 stand-alone installer, or (c) the 002 stand-alone installer? I know that the 001 installer has been known to have this problem (although it appeared to be fixed in 002). 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 From noreply at sourceforge.net Thu May 22 16:36:08 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:16:01 2003 Subject: [Spambayes] [ spambayes-Bugs-740893 ] failure to install Message-ID: Bugs item #740893, was opened at 2003-05-20 21:44 Message generated for change (Comment added) made by sgreen0 You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jack Steiner (steinman1950) Assigned to: Mark Hammond (mhammond) Summary: failure to install Initial Comment: Attempted install of binary version on W98 machine running Outlook2000. Did install and uninstall twice. As directed by the troubleshooting guide, checked the COM Add-ins - Spambayes was listed but not checked. I checked it, closed the dialog box, reopened the dialog box and it was not checked again. I ran regsvr32.exe but got an error. This may be similar to ID #725449. Jack Steiner 502-228-8110 jack@steiner.win.net ---------------------------------------------------------------------- Comment By: Stephen Green (sgreen0) Date: 2003-05-22 15:36 Message: Logged In: YES user_id=784968 I am also having the same problem. After install, the Spambayes tools aren't available in Outlook 2000 (with Win98 SE). In Tools/Options/Other/Advanced Options/Com Addins the Spambayes add-in appears but is unchecked. No effort on my part can cause the add-in to remain checked. As soon as I click OK, I can go back to Com Addins and it is unchecked again. 
Stephen ---------------------------------------------------------------------- Comment By: David Sloane (goofanasia) Date: 2003-05-22 15:26 Message: Logged In: YES user_id=240438 I experienced similar behavior on Windows XP SP1 with Outlook 2002 SP2. The install went fine. On first opening Outlook, the toolbar items displayed, and then Outlook crashed. When I opened Outlook a second time, same behavior. Third time opening Outlook - no SpamBayes buttons and the COM AddIn item was un-checked. Since then, it won't remain checked. I've uninstalled and reinstalled SpamBayes twice, but the buttons haven't appeared and checking the check-box doesn't seem to 'stick.' ---------------------------------------------------------------------- Comment By: George Heidenrich (gheidenr) Date: 2003-05-22 14:28 Message: Logged In: YES user_id=784929 Same problem and results as Jack and radesai. Removed the unchecked add-in at Outlook|Tools|Options|Other|Advanced Options|COM Add-Ins, then attempted to add it again. Received the following Outlook Error: OUTLOOK caused an invalid page fault in module PYTHON23.DLL at 0177:1e03ff78. Registers: EAX=505c3a43 CS=0177 EIP=1e03ff78 EFLGS=00010246 EBX=005a3b70 SS=017f ESP=0056b930 EBP=1e0b8ec0 ECX=005fdc70 DS=017f ESI=005fdc70 FS=6527 EDX=00735cf0 ES=017f EDI=1e03ff70 GS=0000 Bytes at CS:EIP: 8b 48 54 f6 c5 40 74 1f 8b 80 a4 00 00 00 85 c0 Stack dump: 00000000 1e0352fd 005fdc70 00000000 1e0b8ea8 005a3b60 00000002 005f4da0 00000020 1e03ff5d 005fdc70 1e03ff70 00000000 1e0b8ea8 00000001 1e03fd38 George Heidenrich gheidenr@bestweb.net ---------------------------------------------------------------------- Comment By: Ravil A. Desai (radesai) Date: 2003-05-22 11:57 Message: Logged In: YES user_id=753768 I am getting the same error. I had it installed and it worked fine for a day or two. Now, it has stopped working and I get the same errors. It didn't make sense to create another bug. I removed and reinstalled, but that did not fix the problem. 
Thanks radesai ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 From noreply at sourceforge.net Thu May 22 22:15:24 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:16:11 2003 Subject: [Spambayes] [ spambayes-Bugs-742102 ] SpamBayes doesn't install properly Message-ID: Bugs item #742102, was opened at 2003-05-23 13:27 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742102&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Stephen Green (sgreen0) Assigned to: Mark Hammond (mhammond) Summary: SpamBayes doesn't install properly Initial Comment: Installing SpamBayes into Outlook 2000 running Win98 SE. Installation seems to go fine. But SpamBayes tools aren't available in Outlook. Tools/Options/Other/Advanced Options/Com Addins shows the SpamBayes addin unchecked. I check it, click OK, but when I return it is unchecked again. There are errors in the log files, which I'm attaching. Thanks for your help. 
Stephen Green Long Beach, CA ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 16:15 Message: Logged In: YES user_id=552329 Duplicate of [ 725449 ] Binary plugin fails on Win9x ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742102&group_id=61702 From noreply at sourceforge.net Thu May 22 16:26:23 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:16:32 2003 Subject: [Spambayes] [ spambayes-Bugs-740893 ] failure to install Message-ID: Bugs item #740893, was opened at 2003-05-21 04:44 Message generated for change (Comment added) made by goofanasia You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jack Steiner (steinman1950) Assigned to: Mark Hammond (mhammond) Summary: failure to install Initial Comment: Attempted install of binary version on W98 machine running Outlook2000. Did install and uninstall twice. As directed by the troubleshooting guide, checked the COM Add-ins - Spambayes was listed but not checked. I checked it, closed the dialog box, reopened the dialog box and it was not checked again. I ran regsvr32.exe but got an error. This may be similar to ID #725449. Jack Steiner 502-228-8110 jack@steiner.win.net ---------------------------------------------------------------------- Comment By: David Sloane (goofanasia) Date: 2003-05-22 22:26 Message: Logged In: YES user_id=240438 I experienced similar behavior on Windows XP SP1 with Outlook 2002 SP2. The install went fine. On first opening Outlook, the toolbar items displayed, and then Outlook crashed. When I opened Outlook a second time, same behavior. Third time opening Outlook - no SpamBayes buttons and the COM AddIn item was un-checked. 
Since then, it won't remain checked. I've uninstalled and reinstalled SpamBayes twice, but the buttons haven't appeared and checking the check-box doesn't seem to 'stick.' ---------------------------------------------------------------------- Comment By: George Heidenrich (gheidenr) Date: 2003-05-22 21:28 Message: Logged In: YES user_id=784929 Same problem and results as Jack and radesai. Removed the unchecked add-in at Outlook|Tools|Options|Other|Advanced Options|COM Add-Ins, then attempted to add it again. Received the following Outlook Error: OUTLOOK caused an invalid page fault in module PYTHON23.DLL at 0177:1e03ff78. Registers: EAX=505c3a43 CS=0177 EIP=1e03ff78 EFLGS=00010246 EBX=005a3b70 SS=017f ESP=0056b930 EBP=1e0b8ec0 ECX=005fdc70 DS=017f ESI=005fdc70 FS=6527 EDX=00735cf0 ES=017f EDI=1e03ff70 GS=0000 Bytes at CS:EIP: 8b 48 54 f6 c5 40 74 1f 8b 80 a4 00 00 00 85 c0 Stack dump: 00000000 1e0352fd 005fdc70 00000000 1e0b8ea8 005a3b60 00000002 005f4da0 00000020 1e03ff5d 005fdc70 1e03ff70 00000000 1e0b8ea8 00000001 1e03fd38 George Heidenrich gheidenr@bestweb.net ---------------------------------------------------------------------- Comment By: Ravil A. Desai (radesai) Date: 2003-05-22 18:57 Message: Logged In: YES user_id=753768 I am getting the same error. I had it installed and it worked fine for a day or two. Now, it has stopped working and I get the same errors. It didn't make sense to create another bug. I removed and reinstalled, but that did not fix the problem. 
Thanks radesai ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 From noreply at sourceforge.net Thu May 22 22:18:34 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:19:03 2003 Subject: [Spambayes] [ spambayes-Bugs-742066 ] SpamBayes doesn't install properly Message-ID: Bugs item #742066, was opened at 2003-05-23 11:43 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742066&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Stephen Green (sgreen0) Assigned to: Mark Hammond (mhammond) Summary: SpamBayes doesn't install properly Initial Comment: Installing SpamBayes into Outlook 2000 running Win98 SE. Installation seems to go fine. But SpamBayes tools aren't available in Outlook. Tools/Options/Other/Advanced Options/Com Addins shows the SpamBayes addin unchecked. I check it, click OK, but when I return it is unchecked again. There are errors in the log files, which I'm attaching. Thanks for your help. Stephen Green Long Beach, CA ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 16:18 Message: Logged In: YES user_id=552329 (Yet another) Duplicate of [ 725449 ] Binary plugin fails on Win9x ---------------------------------------------------------------------- Comment By: Rusty Carr (yetanotherrusty) Date: 2003-05-23 16:05 Message: Logged In: YES user_id=785104 I get the same thing using Windows 98. 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742066&group_id=61702 From noreply at sourceforge.net Thu May 22 22:19:02 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:19:37 2003 Subject: [Spambayes] [ spambayes-Bugs-741797 ] Does not filter incoming mail Message-ID: Bugs item #741797, was opened at 2003-05-23 01:22 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Kim Tracy (kimtracy) Assigned to: Mark Hammond (mhammond) Summary: Does not filter incoming mail Initial Comment: Does not actively filter incoming e-mails and leaves Spam field empty. Appears to properly filter mail when doing a "filter now" and "delete as Spam" works fine. The log file is attached and it does contain several error messages that may be of help. This is the binary version running on Win2K using outlook2000. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-23 14:19 Message: Logged In: YES user_id=14198 Cute - I left "Delete" out as an option on purpose - I didn't think of setting it to move to "Deleted Items" :) Can you please find a newer log file, and attach it? I see strange exceptions, but not ones that would relate to filtering. ---------------------------------------------------------------------- Comment By: Kim Tracy (kimtracy) Date: 2003-05-23 13:55 Message: Logged In: YES user_id=784661 Yes, I now have several hundred certain spam that have been trained into the DB and have the enable filtering box ticked. I also delete all certain spam by moving it to deleted_items. I don't do anything to the unsures (which is what I would prefer). 
I also have set up the folders to filter. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 12:17 Message: Logged In: YES user_id=552329 In the manager dialog, is the enable filtering box ticked? >From the log, when you first started you only had two spam - there needs to be at least 5 to enable filtering. Later on you trained more spam, so there would be enough, but it doesn't show whether you enabled it or not. (In addition, you need to set up which folder(s) to filter, which folder to deliver unsures to, and set the action to "move" or "copy" rather than "untouched"). Unfortunately the error messages in the log aren't of use; they are hangovers from the installer (that will be fixed in the next release). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 From noreply at sourceforge.net Thu May 22 22:20:17 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:20:34 2003 Subject: [Spambayes] [ spambayes-Bugs-741797 ] Does not filter incoming mail Message-ID: Bugs item #741797, was opened at 2003-05-22 10:22 Message generated for change (Comment added) made by kimtracy You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Kim Tracy (kimtracy) Assigned to: Mark Hammond (mhammond) Summary: Does not filter incoming mail Initial Comment: Does not actively filter incoming e-mails and leaves Spam field empty. Appears to properly filter mail when doing a "filter now" and "delete as Spam" works fine. The log file is attached and it does contain several error messages that may be of help. This is the binary version running on Win2K using outlook2000. 
---------------------------------------------------------------------- >Comment By: Kim Tracy (kimtracy) Date: 2003-05-22 23:20 Message: Logged In: YES user_id=784661 It appears to be working at the moment--I sent the uncertain mail to a folder as well and made sure I was pointing to the correct inbox. Thanks for the help--it seems to categorize very well given a relatively small amount of training. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-22 23:19 Message: Logged In: YES user_id=14198 Cute - I left "Delete" out as an option on purpose - I didn't think of setting it to move to "Deleted Items" :) Can you please find a newer log file, and attach it? I see strange exceptions, but not ones that would relate to filtering. ---------------------------------------------------------------------- Comment By: Kim Tracy (kimtracy) Date: 2003-05-22 22:55 Message: Logged In: YES user_id=784661 Yes, I now have several hundred certain spam that have been trained into the DB and have the enable filtering box ticked. I also delete all certain spam by moving it to deleted_items. I don't do anything to the unsures (which is what I would prefer). I also have set up the folders to filter. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-22 21:17 Message: Logged In: YES user_id=552329 In the manager dialog, is the enable filtering box ticked? >From the log, when you first started you only had two spam - there needs to be at least 5 to enable filtering. Later on you trained more spam, so there would be enough, but it doesn't show whether you enabled it or not. (In addition, you need to set up which folder(s) to filter, which folder to deliver unsures to, and set the action to "move" or "copy" rather than "untouched"). 
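The prerequisites listed in the comment above can be sketched as a small checklist function. This is only an illustration of the conditions as described in this thread, not code from the Outlook plugin: the names `FilterSetup`, `missing_prerequisites` and `MIN_TRAINED` are hypothetical, the ham threshold follows what is said elsewhere on the list ("at least 5 ham and 5 spam"), and the sample numbers are made up.

```python
from dataclasses import dataclass, field
from typing import List, Optional

MIN_TRAINED = 5  # "at least 5", as quoted in the comment above


@dataclass
class FilterSetup:
    """Hypothetical snapshot of the settings the comment lists."""
    ham_trained: int = 0
    spam_trained: int = 0
    watch_folders: List[str] = field(default_factory=list)
    unsure_folder: Optional[str] = None
    action: str = "untouched"  # "move", "copy" or "untouched"


def missing_prerequisites(setup: FilterSetup) -> List[str]:
    """Return a readable list of what still blocks filtering (empty if nothing)."""
    problems = []
    if setup.ham_trained < MIN_TRAINED:
        problems.append(f"train at least {MIN_TRAINED} ham (have {setup.ham_trained})")
    if setup.spam_trained < MIN_TRAINED:
        problems.append(f"train at least {MIN_TRAINED} spam (have {setup.spam_trained})")
    if not setup.watch_folders:
        problems.append("choose at least one folder to filter")
    if setup.unsure_folder is None:
        problems.append("choose a folder for unsure messages")
    if setup.action not in ("move", "copy"):
        problems.append('set the action to "move" or "copy"')
    return problems


if __name__ == "__main__":
    # Roughly the situation read from the log: only two spam trained so far.
    early = FilterSetup(ham_trained=20, spam_trained=2,
                        watch_folders=["Inbox"], unsure_folder="Unsure",
                        action="move")
    for problem in missing_prerequisites(early):
        print("-", problem)
```

Returning the list of unmet conditions, rather than a bare yes/no, is a deliberate choice here: it mirrors the way the comment walks the reporter through each setting in turn.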
Unfortunately the error messages in the log aren't of use; they are hangovers from the installer (that will be fixed in the next release). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 From noreply at sourceforge.net Thu May 22 22:21:13 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:21:58 2003 Subject: [Spambayes] [ spambayes-Bugs-740893 ] failure to install Message-ID: Bugs item #740893, was opened at 2003-05-21 16:44 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jack Steiner (steinman1950) Assigned to: Mark Hammond (mhammond) Summary: failure to install Initial Comment: Attempted install of binary version on W98 machine running Outlook2000. Did install and uninstall twice. As directed by the troubleshooting guide, checked the COM Add-ins - Spambayes was listed but not checked. I checked it, closed the dialog box, reopened the dialog box and it was not checked again. I ran regsvr32.exe but got an error. This may be similar to ID #725449. Jack Steiner 502-228-8110 jack@steiner.win.net ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 16:21 Message: Logged In: YES user_id=552329 Note that for the most part this is a duplicate of [ 725449 ] Binary plugin fails on Win9x However, the two people who had it working and then had problems have (presumably) a different problem. Maybe their comments ought to be moved into a separate bug.
---------------------------------------------------------------------- Comment By: Stephen Green (sgreen0) Date: 2003-05-23 10:36 Message: Logged In: YES user_id=784968 I am also having the same problem. After install, the Spambayes tools aren't available in Outlook 2000 (with Win98 SE). In Tools/Options/Other/Advanced Options/Com Addins the Spambayes add-in appears but is unchecked. No effort on my part can cause the add-in to remain checked. As soon as I click OK, I can go back to Com Addins and it is unchecked again. Stephen ---------------------------------------------------------------------- Comment By: David Sloane (goofanasia) Date: 2003-05-23 10:26 Message: Logged In: YES user_id=240438 I experienced similar behavior on Windows XP SP1 with Outlook 2002 SP2. The install went fine. On first opening Outlook, the toolbar items displayed, and then Outlook crashed. When I opened Outlook a second time, same behavior. Third time opening Outlook - no SpamBayes buttons and the COM AddIn item was un-checked. Since then, it won't remain checked. I've uninstalled and reinstalled SpamBayes twice, but the buttons haven't appeared and checking the check-box doesn't seem to 'stick.' ---------------------------------------------------------------------- Comment By: George Heidenrich (gheidenr) Date: 2003-05-23 09:28 Message: Logged In: YES user_id=784929 Same problem and results as Jack and radesai. Removed the unchecked add-in at Outlook|Tools|Options|Other|Advanced Options|COM Add-Ins, then attempted to add it again. Received the following Outlook Error: OUTLOOK caused an invalid page fault in module PYTHON23.DLL at 0177:1e03ff78.
Registers: EAX=505c3a43 CS=0177 EIP=1e03ff78 EFLGS=00010246 EBX=005a3b70 SS=017f ESP=0056b930 EBP=1e0b8ec0 ECX=005fdc70 DS=017f ESI=005fdc70 FS=6527 EDX=00735cf0 ES=017f EDI=1e03ff70 GS=0000 Bytes at CS:EIP: 8b 48 54 f6 c5 40 74 1f 8b 80 a4 00 00 00 85 c0 Stack dump: 00000000 1e0352fd 005fdc70 00000000 1e0b8ea8 005a3b60 00000002 005f4da0 00000020 1e03ff5d 005fdc70 1e03ff70 00000000 1e0b8ea8 00000001 1e03fd38 George Heidenrich gheidenr@bestweb.net ---------------------------------------------------------------------- Comment By: Ravil A. Desai (radesai) Date: 2003-05-23 06:57 Message: Logged In: YES user_id=753768 I am getting the same error. I had it installed and it worked fine for a day or two. Now, it has stopped working and I get the same errors. It didn't make sense to create another bug. I removed and reinstalled, but that did not fix the problem. Thanks radesai ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 From noreply at sourceforge.net Thu May 22 22:26:08 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:26:44 2003 Subject: [Spambayes] [ spambayes-Bugs-741797 ] Does not filter incoming mail Message-ID: Bugs item #741797, was opened at 2003-05-23 03:22 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Kim Tracy (kimtracy) Assigned to: Mark Hammond (mhammond) Summary: Does not filter incoming mail Initial Comment: Does not actively filter incoming e-mails and leaves Spam field empty. Appears to properly filter mail when doing a "filter now" and "delete as Spam" works fine. The log file is attached and it does contain several error messages that may be of help.
This is the binary version running on Win2K using outlook2000. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 16:26 Message: Logged In: YES user_id=552329 Was it pointing at the wrong inbox? It's an interesting point that the inbox that is the default choice (Outlook's default, I think) might be the wrong one, and confuse people that have another folder called inbox. Not sure what could be done about that (apart from docs). If the solution was setting the uncertain folder, then there might be a problem. You should be able to have unsures be untouched, and still filter certains. I know I've never run it in this configuration, though (Mark might have). ---------------------------------------------------------------------- Comment By: Kim Tracy (kimtracy) Date: 2003-05-23 16:20 Message: Logged In: YES user_id=784661 It appears to be working at the moment--I sent the uncertain mail to a folder as well and made sure I was pointing to the correct inbox. Thanks for the help--it seems to categorize very well given a relatively small amount of training. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-23 16:19 Message: Logged In: YES user_id=14198 Cute - I left "Delete" out as an option on purpose - I didn't think of setting it to move to "Deleted Items" :) Can you please find a newer log file, and attach it? I see strange exceptions, but not ones that would relate to filtering. ---------------------------------------------------------------------- Comment By: Kim Tracy (kimtracy) Date: 2003-05-23 15:55 Message: Logged In: YES user_id=784661 Yes, I now have several hundred certain spam that have been trained into the DB and have the enable filtering box ticked. I also delete all certain spam by moving it to deleted_items. I don't do anything to the unsures (which is what I would prefer). 
I also have set up the folders to filter. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 14:17 Message: Logged In: YES user_id=552329 In the manager dialog, is the enable filtering box ticked? >From the log, when you first started you only had two spam - there needs to be at least 5 to enable filtering. Later on you trained more spam, so there would be enough, but it doesn't show whether you enabled it or not. (In addition, you need to set up which folder(s) to filter, which folder to deliver unsures to, and set the action to "move" or "copy" rather than "untouched"). Unfortunately the error messages in the log aren't of use; they are hangovers from the installer (that will be fixed in the next release). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 From director at bigguymedia.com Thu May 22 23:30:40 2003 From: director at bigguymedia.com (Director) Date: Fri May 23 00:30:30 2003 Subject: [Spambayes] Outlook 2000 on XP, installed Python and killed add-in Message-ID: <21631E523BEBD41197FC0020AFCF88F613735D@dsl-mr-207-34-113-i59-cgy.nucleus.com> Hello, If anyone has any thoughts on my situation I would appreciate the feedback. I am running WinXP with outlook 2000, Today in the course of setting up WINCVS I installed python. Subsequently my Spambayes add-in stopped working. I have corrected it by removing python and reinstalling the add-in but am curious why this should occur and how to remedy it so I can install python on this machine in future. 
Russ Cann, Director BigBlogTool www.bigblogtool.com mailto: director@bigblogtool.com "Kick Ass Web Publishing" From noreply at sourceforge.net Thu May 22 22:32:10 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 00:32:46 2003 Subject: [Spambayes] [ spambayes-Bugs-741797 ] Does not filter incoming mail Message-ID: Bugs item #741797, was opened at 2003-05-22 10:22 Message generated for change (Comment added) made by kimtracy You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Kim Tracy (kimtracy) Assigned to: Mark Hammond (mhammond) Summary: Does not filter incoming mail Initial Comment: Does not actively filter incoming e-mails and leaves Spam field empty. Appears to properly filter mail when doing a "filter now" and "delete as Spam" works fine. The log file is attached and it does contain several error messages that may be of help. This is the binary version running on Win2K using outlook2000. ---------------------------------------------------------------------- >Comment By: Kim Tracy (kimtracy) Date: 2003-05-22 23:32 Message: Logged In: YES user_id=784661 I did both (make sure I had the right inbox & put unsure in 'Deleted Items" as well). In my case, I needed to filter incoming mail from an IMAP inbox, which I would think is a fairly common case. I would have preferred no presets or having a full path name for the folder so that I could easily tell which it was pointing to. Having the wrong inbox, was likely the problem. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-22 23:26 Message: Logged In: YES user_id=552329 Was it pointing at the wrong inbox? 
It's an interesting point that the inbox that is the default choice (Outlook's default, I think) might be the wrong one, and confuse people that have another folder called inbox. Not sure what could be done about that (apart from docs). If the solution was setting the uncertain folder, then there might be a problem. You should be able to have unsures be untouched, and still filter certains. I know I've never run it in this configuration, though (Mark might have). ---------------------------------------------------------------------- Comment By: Kim Tracy (kimtracy) Date: 2003-05-22 23:20 Message: Logged In: YES user_id=784661 It appears to be working at the moment--I sent the uncertain mail to a folder as well and made sure I was pointing to the correct inbox. Thanks for the help--it seems to categorize very well given a relatively small amount of training. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-22 23:19 Message: Logged In: YES user_id=14198 Cute - I left "Delete" out as an option on purpose - I didn't think of setting it to move to "Deleted Items" :) Can you please find a newer log file, and attach it? I see strange exceptions, but not ones that would relate to filtering. ---------------------------------------------------------------------- Comment By: Kim Tracy (kimtracy) Date: 2003-05-22 22:55 Message: Logged In: YES user_id=784661 Yes, I now have several hundred certain spam that have been trained into the DB and have the enable filtering box ticked. I also delete all certain spam by moving it to deleted_items. I don't do anything to the unsures (which is what I would prefer). I also have set up the folders to filter. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-22 21:17 Message: Logged In: YES user_id=552329 In the manager dialog, is the enable filtering box ticked? 
From the log, when you first started you only had two spam - there needs to be at least 5 to enable filtering. Later on you trained more spam, so there would be enough, but it doesn't show whether you enabled it or not. (In addition, you need to set up which folder(s) to filter, which folder to deliver unsures to, and set the action to "move" or "copy" rather than "untouched"). Unfortunately the error messages in the log aren't of use; they are hangovers from the installer (that will be fixed in the next release). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702
From skip at pobox.com Thu May 22 16:01:25 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri May 23 00:58:20 2003 Subject: [Spambayes] maybe a procmail question... In-Reply-To: <20030522112227.B15578@asu.edu> References: <20030522112227.B15578@asu.edu> Message-ID: <16077.11413.132694.442158@montanaro.dyndns.org>

    David> Since I'm loving spambayes, I'm wondering about how soon its
    David> procmail rule fires. Should the hammie lines

    David> # spam beyes filter
    David> :0 fw:hamlock
    David> | /usr/bin/hammie.py -f -d -p $HOME/.hammiedb

    David> be the FIRST rule in .procmailrc?

Yes, as early as possible. The start of my .procmailrc file looks about like this. The only thing which appears ahead of my hammiefilter rule is a procmail rule to eliminate duplicate message ids:

    FORMAIL=/usr/bin/formail
    HAMMIE=$HOME/local/bin/hammiefilter.py
    BAYESCUSTOMIZE=$HOME/hammie.opt

    # make sure we don't get two copies of the same message
    :0 Wh: msgid.lock
    | $FORMAIL -D 16384 $HOME/tmp/msgid.cache

    :0 fw:ham.lock
    | $HAMMIE -d $HOME/hammie.db

At this point, I then check the spam classification header:

    :0
    * ^X-Spambayes-Classification: spam
    {
        ### segregate spam by score - need not do this, but false positives are
        ### much more likely to be at the bottom end of the range. Save 0.99
        ### and 1.00 scores into a separate spam file which is scanned less
        ### frequently
        :0:
        * ^X-Spambayes-Classification: spam; (0.99|1.00)
        spam1

        :0:
        spam
    }

    David> so, this may be more of a procmail question -- is rule order
    David> significant?

Oh yeah...

Skip
From T.A.Meyer at massey.ac.nz Fri May 23 14:32:40 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri May 23 00:58:31 2003 Subject: [Spambayes] Usage on XP version of Outlook Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A19@its-xchg4.massey.ac.nz>

> I just wanted you to know that I have installed and it runs
> great. I do have a question about the filter button on the
> Xp version of outlook. After using the train mode and
> setting the filters. The enable filter button is grayed out,
> not available. Haven't had this problem on 2000 outlook. Is
> this a problem? Does it still run automatically?

You need to have done these things to enable that button:
 * Trained at least 5 ham and 5 spam
 * Set at least one folder to watch
 * Set folders to move spam to, and to move unsures to
 * Changed the action to "copy" or "move", rather than "untouched"

If you've done all of these, and it's still greyed out, then this is a bug (or I missed something ;), and you should definitely report it.

=Tony Meyer
From skip at pobox.com Thu May 22 16:25:54 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri May 23 01:03:59 2003 Subject: [Spambayes] List moeration In-Reply-To: References: Message-ID: <16077.12882.255940.607868@montanaro.dyndns.org>

    Tim> The number of times per day a harmless msg to this list is sitting
    Tim> waiting for moderator approval has gotten out of hand, so I've
    Tim> turned off list moderation here. Maybe one spam per day will come
    Tim> through now.

What's the term? We now eat our own dog food?
Skip From bud.shepherd at countrysidefamily.com Thu May 22 16:12:31 2003 From: bud.shepherd at countrysidefamily.com (Bud Shepherd) Date: Fri May 23 01:04:38 2003 Subject: [Spambayes] Exchange Message-ID: <509D9723B069154EB5B353EA8A08F77D01243D@mail.countrysidefamily.com> We have Outlook 2000 connecting to an Exchange 2000 server. Will spambayes work for us? From mll at usa.net Thu May 22 14:46:16 2003 From: mll at usa.net (Mike Liveright) Date: Fri May 23 01:05:33 2003 Subject: [Spambayes] I am corrently using SAproxy Message-ID: <003301c320a3$3157c6a0$6501a8c0@mike> I am using Outlook Express and loaded, and am reasonably happy with SAproxy, though I assume that in a while I will need to get a more adaptable system. I think that both you and they are based on Bayes logic, and I hope that you can point me to some comparison on the logics. I suspect that I will want to pay for a more user friendly system in a while with lots of extra features as time proceeds. Can you also tell me when, if ever, I can look forward to a SpamBayes system that I can use with Outlook Express, or other mail programs. What I imagine is that there might be a way of Copy/Pasting (Spam or Ham) to SpamBayes if it is too difficult to read folder contents, or if I prefer to tell SpamBayes what I want to use to train it. If you want more suggestions, based on SAproxy, or if you are talking to a developer who is thinking about offering a commercial version and is interested in Beta testers, please tell me... Mike Liveright 260 Byron Street. 
mll@usa.net Palo Alto, CA 94301 http://www.WeMatter.com (650) 323-2028 From T.A.Meyer at massey.ac.nz Fri May 23 13:16:40 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri May 23 01:08:02 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB39C7@its-xchg4.massey.ac.nz> > [skip@pobox.com] > > If you use (or have tried unsuccessfully to use) the Outlook plugin > > with any version of Windows, please take a look at > > > > http://spambayes.sf.net/windows.html > > > > and help me fill in empty cells in the table. [Tim Peters] > The current binary installer is known not to work on Win98 > and WinME. Installing from CVS is known to work on Win98. This is with Outlook 2000 or 2002, right? The table currently has Outlook 2000 working, which I don't think is the case. If I was sure, I'd change it, but I'm not so... I think that the Win98(etc) rows should both be "1.0a2 source or CVS works, installer 002 fails". Note that this is only from observation of messages on the list; I haven't gone near Win98 for years. =Tony Meyer From T.A.Meyer at massey.ac.nz Fri May 23 18:08:13 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri May 23 01:10:53 2003 Subject: [Spambayes] Exchange Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3B29@its-xchg4.massey.ac.nz> > We have Outlook 2000 connecting to an Exchange 2000 server. > Will spambayes work for us? It should do, yes. There haven't been any problems reported using that combination. All you need is the binary installer, which you can get from: http://spambayes.sourceforge.net/download.html =Tony Meyer From T.A.Meyer at massey.ac.nz Fri May 23 18:06:53 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri May 23 01:11:32 2003 Subject: [Spambayes] Re: Compatibility matrix Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3B27@its-xchg4.massey.ac.nz> > Done. Should I drop the Win95 column as well? Nobody seems > to be running that. 
Would Outlook 2000 or XP even run on it? Outlook 2000 will; no idea about XP. You might as well keep it there so that any potential win95 users can see that it's untested. > You don't support any older versions of Outlook do you? No. What's the story with the auto-reply text? The barrage of identical Outlook plugin problems is starting to be overwhelming...(not that the auto-reply will necessarily solve these). =Tony Meyer From T.A.Meyer at massey.ac.nz Fri May 23 18:12:14 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri May 23 01:14:38 2003 Subject: [Spambayes] Outlook 2000 on XP, installed Python and killed add-in Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3B2B@its-xchg4.massey.ac.nz> > I am running WinXP with outlook 2000, Today in the course of > setting up WINCVS I installed python. Subsequently my > Spambayes add-in stopped working. Does this strike anyone else as amusing? :) Can you provide some more details? Which version of Python did you install? In the troubleshooting.html file that should have been installed with the plugin, there are instructions about how to find the log file. If there's anything in the log, it would be useful to see this, too. The list is pretty busy at the moment - you also might want to add this as a bug (http://sf.net/projects/spambayes), since it does seem to be something we haven't come across before. =Tony Meyer From sgeers at deltanet.com Thu May 22 23:15:28 2003 From: sgeers at deltanet.com (Steve and Suvan Geer) Date: Fri May 23 01:15:09 2003 Subject: [Spambayes] SpamBayes Will not Load Into Outlook 2000 Message-ID: <3ECDAE70.4000200@deltanet.com> I cannot get the COM Add-Ins to find SpamBayes. I have downloaded 3 times and uninstalled/installed 5 times. Attached is the log file.
Thanks, Steve Geer
-------------- next part --------------
SpamAddin - Connecting to Outlook
pythoncom error: Failed to call the universal dispatcher
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch
  File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection
  File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager
  File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__
  File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory
  File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir
exceptions.LookupError: no codec search functions registered: can't find encoding
SpamAddin - Connecting to Outlook
pythoncom error: Failed to call the universal dispatcher
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch
  File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection
  File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager
  File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__
  File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory
  File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir
exceptions.LookupError: no codec search functions registered: can't find encoding
SpamAddin - Connecting to Outlook
pythoncom error: Failed to call the universal dispatcher
Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch
  File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection
  File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager
  File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__
  File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory
  File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir
exceptions.LookupError: no codec search functions registered: can't find encoding
From T.A.Meyer at massey.ac.nz Fri May 23 18:18:01 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri May 23 01:19:28 2003 Subject: [Spambayes] SpamBayes Will not Load Into Outlook 2000 Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3B31@its-xchg4.massey.ac.nz>

> I cannot get the COM Add-Ins to find SpamBayes. I have downloaded 3
> times and uninstalled/installed 5 times. Attached is the log file.

This is being tracked here:

[ 725449 ] Binary plugin fails on Win9x
https://sourceforge.net/tracker/index.php?func=detail&aid=725449&group_id=61702&atid=498103

At the moment the only solution is to install Python and use a source release (alpha2 or CVS) of Spambayes. This will definitely be fixed in the next version of the installer, which is due out in the next couple of weeks.
=Tony Meyer From noreply at sourceforge.net Thu May 22 15:28:08 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 01:24:55 2003 Subject: [Spambayes] [ spambayes-Bugs-740893 ] failure to install Message-ID: Bugs item #740893, was opened at 2003-05-21 00:44 Message generated for change (Comment added) made by gheidenr You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jack Steiner (steinman1950) Assigned to: Mark Hammond (mhammond) Summary: failure to install Initial Comment: Attempted install of binary version on W98 machine running Outlook2000. Did install and uninstall twice. As directed by the troubleshooting guide, checked the COM Add-ins - Spambayes was listed but not checked. I checked it, closed the dialog box, reopened the dialog box and it was not checked again. I ran regsvr32.exe but got an error. This may be similar to ID #725449. Jack Steiner 502-228-8110 jack@steiner.win.net ---------------------------------------------------------------------- Comment By: George Heidenrich (gheidenr) Date: 2003-05-22 17:28 Message: Logged In: YES user_id=784929 Same problem and results as Jack and radesai. Removed the unchecked add-in at Outlook|Tools|Options|Other|Advanced Options|COM Add-Ins, then attempted to add it again. Received the following Outlook Error: OUTLOOK caused an invalid page fault in module PYTHON23.DLL at 0177:1e03ff78. 
Registers: EAX=505c3a43 CS=0177 EIP=1e03ff78 EFLGS=00010246 EBX=005a3b70 SS=017f ESP=0056b930 EBP=1e0b8ec0 ECX=005fdc70 DS=017f ESI=005fdc70 FS=6527 EDX=00735cf0 ES=017f EDI=1e03ff70 GS=0000 Bytes at CS:EIP: 8b 48 54 f6 c5 40 74 1f 8b 80 a4 00 00 00 85 c0 Stack dump: 00000000 1e0352fd 005fdc70 00000000 1e0b8ea8 005a3b60 00000002 005f4da0 00000020 1e03ff5d 005fdc70 1e03ff70 00000000 1e0b8ea8 00000001 1e03fd38 George Heidenrich gheidenr@bestweb.net ---------------------------------------------------------------------- Comment By: Ravil A. Desai (radesai) Date: 2003-05-22 14:57 Message: Logged In: YES user_id=753768 I am getting the same error. I had it installed and it worked fine for a day or two. Now, it has stopped working and I get the same errors. I didn't make sense to recreate another bug. I removed and reinstalled, but that did not fix the problem. Thanks radesai ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 From T.A.Meyer at massey.ac.nz Fri May 23 18:31:38 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri May 23 01:33:10 2003 Subject: Format of Spam Field in Plugin (was RE: [Spambayes] (no subject)) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3B33@its-xchg4.massey.ac.nz> > The instructions say that under "User Defined Fields" there > will be a button with a "Spam" label. Sure enough it is > there. :) However, when I drag the field to the Inbox header, > the field Type is "Number" instead of "Percent" and the > Format is "All Digits 1,234,567 -1,234,567" instead of > "Rounded 65% -65%". Hopefully Mark will notice this message (hence the changed subject) and answer it, because he might know. Interestingly, the Spam field in some of my folders is locked as a number, but in other folders it's locked as a percent. 
Either way, I can change the reported accuracy, but I'm using Outlook 2002, and I think I recall something similar when I was using Outlook 2000. This might be an Outlook issue (although the percent/number thing shouldn't be). Mark, did you change from number to percent since the binary release? Maybe my old folders are numbers, and new ones percents? =Tony Meyer From anthony at interlink.com.au Fri May 23 16:40:47 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Fri May 23 01:41:30 2003 Subject: [Spambayes] I am corrently using SAproxy In-Reply-To: <003301c320a3$3157c6a0$6501a8c0@mike> Message-ID: <200305230540.h4N5elp08662@localhost.localdomain> >>> "Mike Liveright" wrote > I think that both you and they are based on Bayes logic, and I hope > that you can point me to some comparison on the logics. See the website, http://spambayes.sourceforge.net/background.html > I suspect that I will want to pay for a more user friendly system in a > while with lots of extra features as time proceeds. Sean True is working on a commercial, supported version of SpamBayes called SpamAtBay. I'm not aware of a website for it. > Can you also tell me when, if ever, I can look forward to a SpamBayes > system that I can use with Outlook Express, or other mail programs. Outlook Express and Outlook are totally different beasts - OE has almost no hooks that SpamBayes can use. Unless someone steps forward with the secret knowledge of Outlook Express internals, it's extremely unlikely that the Outlook plugin will ever work with Outlook Express. You can, however, use spambayes' pop/imap proxy - see http://spambayes.sourceforge.net/applications.html Anthony From anthony at interlink.com.au Fri May 23 17:21:13 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Fri May 23 02:22:04 2003 Subject: [Spambayes] preamble on SF bug page. 
Message-ID: <200305230621.h4N6LDp08992@localhost.localdomain> Is it worth changing the SF project configuration to put a message at the top of the bug report page saying that the Win98 install issues are a known bug and that additional bug reports should not be filed for it? And maybe a Paypal button for MarkH, in case people want to encourage him to work on it sooner? ;) Anthony, happy MH user. -- Anthony Baxter It's never too late to have a happy childhood. From tdickenson at devmail.geminidataloggers.co.uk Fri May 23 09:02:11 2003 From: tdickenson at devmail.geminidataloggers.co.uk (Toby Dickenson) Date: Fri May 23 03:02:16 2003 Subject: [Spambayes] Experimental Ham/Spam imbalance setting In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A3F@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A3F@its-xchg4.massey.ac.nz> Message-ID: <200305230802.11768.tdickenson@devmail.geminidataloggers.co.uk> On Friday 23 May 2003 3:00 am, Meyer, Tony wrote: > The trickiest thing here is that the Outlook users (who probably are a > majority now?) do use the option, while by default, the others don't. > We really need people to put in some time testing to see if it is worth > keeping or not. If it is, I gather from (Tim P's?) comment in the code > that it can just be enabled for everyone. I tried this option for a few weeks, and it *seemed* to be a disadvantage for me. A certain type of spam became indistinguishable from ham. Those spams came often enough to be annoying, and the annoyance stopped when I turned the option off. This is with a 10:1 ham:spam ratio, and a training regime that retrains from scratch overnight with everything in my spam mailbox, and everything in every other mailbox. 
From director at bigguymedia.com Fri May 23 02:36:39 2003 From: director at bigguymedia.com (Director) Date: Fri May 23 03:36:26 2003 Subject: [Spambayes] Outlook 2000 on XP, installed Python and killed a dd-in Message-ID: <21631E523BEBD41197FC0020AFCF88F613735E@dsl-mr-207-34-113-i59-cgy.nucleus.com> Tony, Thanks for the reply. I didn't get a chance to look over all the messages so I hope I'm not duplicating things and causing additional work for your "busy" list. The only relevance I can see in the log files is: E:\src\python-cvs\lib\fcntl.py:7: DeprecationWarning: the FCNTL module is deprecated; please use fcntl which is intersting as I have no E:\src directory. That said, I installed Python 2.3b1 today. Everything was working great prior to that. After installing it I noticed the SB buttons in outlook were not able to do anything, as well the scoring had ceased. thanks. r. -----Original Message----- From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] Sent: Thursday, May 22, 2003 11:12 PM To: Director; spambayes@python.org Subject: RE: [Spambayes] Outlook 2000 on XP, installed Python and killed add-in > I am running WinXP with outlook 2000, Today in the course of > setting up WINCVS I installed python. Subsequently my > Spambayes add-in stopped working. Does this strike anyone else as amusing? :) Can you provide some more details? Which version of Python did you install? In the troubleshooting.html file that should have been installed with the plugin, there are instructions about how to find the log file. If there's anything in the log, it would be useful to see this, too. The list is pretty busy at the moment - you also might want to add this as a bug (http://sf.net/projects/spambayes), since it does seem to be something we haven't come across before. 
=Tony Meyer From david.stewart at shaw.ca Fri May 23 01:50:29 2003 From: david.stewart at shaw.ca (Dave Stewart) Date: Fri May 23 03:50:16 2003 Subject: [Spambayes] Strange message In-Reply-To: <16E1010E4581B049ABC51D4975CEDB880113DB04@UKDCX001.uk.int.atosorigin.com> Message-ID: <000001c320ff$fda627e0$03c0a8c0@grimjack> I have very definitely seen this on my XP box, regarding my DVD drive. Hasn't appeared in nearly a week now, though. -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] On Behalf Of Moore, Paul Sent: Monday, May 19, 2003 4:36 AM To: Mark Hammond Cc: spambayes@python.org Subject: RE: [Spambayes] Strange message From: Mark Hammond [mailto:mhammond@skippinet.com.au] > I can't recall all the details - have you (or anyone) seen > this on other than XP? Or on a CD/DVD drive? Just to clarify - I've never had this from Spambayes, but I have had similar looking messages, on both 2K and XP. They are sometimes when there *is* a disk present, but not recognised for some reason. And they sometimes involve problems ejecting the disk. Hence my comment that zip support is flakey. I've never seen anything like this with a CD/DVD drive. Not much help, I know. Sorry. Paul. _______________________________________________ Spambayes mailing list Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes From phobos at pobox.com Fri May 23 19:01:42 2003 From: phobos at pobox.com (Peets) Date: Fri May 23 04:02:42 2003 Subject: [Spambayes] SpamBayes installer improvement suggestions Message-ID: Wonderful job, simply wonderful. Firstly, Thanks 8-). This is code well done, simple, effective, just doing the job. Secondly, after installing it I have basically one remark to make: the installation sequence might improve if you change the order of events somewhat. At the moment you document Install - set up folders - train - set up filters - enable filters (not clearly, but that's what it boils down to). 
I'm leaving aside an option for creating the folders from within the installer (gets too complicated ;-). For end users, installation is probably more logical in the following order: 1) create folders 2) [start installer] Set source/spam/possible spam folders 3) Offer filtering settings, default to 'enabled' 4) Train 5) Filter 6) End with instructions (verify spam/unsure folders and act, synchronise with server etc) This leaves the installation with SpamBayes up and running. At the moment, post training there is a confusing gap before filtering comes into play. You can document that to a degree, but streamlining the process will probably make it easier to document (I'm happy to do it for you if that's of any help, but the last time I coded it was in raw machine code on a Psion Organiser II ;-). That's all. Did I say Thank You ? ;-) Regards, /// Peter /// From anthony at interlink.com.au Fri May 23 19:39:58 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Fri May 23 04:40:09 2003 Subject: [Spambayes] Outlook 2000 on XP, installed Python and killed a dd-in In-Reply-To: <21631E523BEBD41197FC0020AFCF88F613735E@dsl-mr-207-34-113-i59-cgy.nucleus.com> Message-ID: <200305230839.h4N8dw310774@localhost.localdomain> >>> Director wrote > Tony, > > Thanks for the reply. I didn't get a chance to look over all the messages so > I hope I'm not duplicating things and causing additional work for your > "busy" list. > > The only relevance I can see in the log files is: > > E:\src\python-cvs\lib\fcntl.py:7: DeprecationWarning: the FCNTL module is > deprecated; please use fcntl > > which is intersting as I have no E:\src directory. Note that python's byte compiled files contain pathnames of where they were originally compiled. This can be a little bit confusing, but it's harmless. Anthony -- Anthony Baxter It's never too late to have a happy childhood. 
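[Editor's note] Anthony's point above, that byte-compiled files carry the pathname they were compiled under, can be checked with a small sketch. It is only an illustration, not code from SpamBayes: the helper name `recorded_filename` is hypothetical, and the path is simply the one quoted in the log message. It compiles a snippet as if it lived at the build machine's path, round-trips the code object through `marshal` (the serialization used inside .pyc files), and shows that the original path is still what the code object reports.

```python
import marshal


def recorded_filename(source, build_path):
    """Return the filename stored in a code object after serialization.

    `recorded_filename` is a hypothetical helper for this illustration.
    The code object is compiled as if `source` lived at `build_path`,
    then dumped and reloaded with marshal, as happens for .pyc files.
    """
    code = compile(source, build_path, "exec")
    restored = marshal.loads(marshal.dumps(code))
    return restored.co_filename


if __name__ == "__main__":
    # The path need not exist on the machine running the code, which is
    # why a user without an E:\src directory can still see it in a log.
    print(recorded_filename("x = 1\n", r"E:\src\python-cvs\lib\fcntl.py"))
```

Warnings and tracebacks take their file names from this stored value, so a log can mention a directory that only ever existed on the machine that built the installer.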
From Paul.Moore at atosorigin.com Fri May 23 11:00:30 2003 From: Paul.Moore at atosorigin.com (Moore, Paul) Date: Fri May 23 05:00:53 2003 Subject: [Spambayes] Experimental Ham/Spam imbalance setting Message-ID: <16E1010E4581B049ABC51D4975CEDB88619A8E@UKDCX001.uk.int.atosorigin.com> From: Tim Peters [mailto:tim.one@comcast.net] > [Moore, Paul] > > I have a friend who is using the POP3 proxy for his mail. He has a > > 10:1 spam:ham imbalance, and he's found that he gets quite a high > > proportion of unsures (from 200 or so mails a day, over 75% of which > > are spam). His DB contains about 1300 spam and 150 spam. > > 150 ham, right (you said "spam" twice there)? That's not much of a ham > sample regardless of option settings, and without knowing what he set his > unsure range to, "quite a high proportion" may be astonishing or > inescapable. [...] > > My feeling is that the higher proportion of unsures, plus the > > unresponsiveness to training, makes it an overall loss. > > The fellow you're talking about has a pathologically low number of ham; Hmm. I was a little worried about that possibility. The trouble is, it's a very similar situation to the one I'm in. I get virtually *no* ham (excluding mailing lists, which are filtered off before the email program sees them), but ridiculous amounts of spam (hundreds per day). I'd ignore email totally, if it wasn't for the fact that the few ham I do get are fairly important. I don't have any way of training on more ham - I train on it all. My current approach (which is working reasonably well) is to train on ham and unsures only, until I get good results, then stop *totally*. This has left me with a database containing 40-odd ham, and 150 spam. My unsure rate is tolerable, so I accept that I'm not going to do any better. I'm close to going for the other option - get a new mail account :-( > > Am I right in thinking that pop3proxy has this parameter set to true? > > I don't see anything to suggest that it is. 
The default is still False, and > AFAICT only Outlook2000/default_bayes_customize.ini sets it True. You're right. I misremembered, and couldn't find the default value. > It's a pick-your-poison thing. If you have more spam than ham and keep this > False, a higher false positive rate is the expected result (or a higher FN > rate if you have more ham than spam). (Thinks) OK, I see this. > It remains experimental because the evidence was/is spotty and mixed. Yes, that was partly my point. As I understand things (I came into this after the extensive testing work had pretty much died down) it has become pretty much impossible to see significant test results now, thanks to the level of effectiveness which has been achieved. What I see now is much more of a "real life gut feel" type of effect, which is nearly impossible to either quantify, or to reproduce reliably. Whether such evidence is useful is a difficult judgement call :-( > > My friend has now purged his database and is starting from scratch, > > to try to improve his results. > > He should have kept the 150 most-recent spam instead. Good point. But getting 150 new spam isn't exactly a long-term job :-( > Nope! We don't store spamprobs in a database, just word counts. > experimental_ham_spam_imbalance_adjustment is used (only) in > Classifier.probability() when a probability is (dynamically) computed. Oh. That's good news. I could (and probably should) do some real tests, then. (It's much easier if I don't need to retrain). > > Maybe the option should be exposed in the UI (but that may not be > > sensible if changing it *does* require a retrain). > > For researchers that would be fine, but end users don't have a clue about > what to do with exotic internal options. Hmm. I think I could explain this in end-user language. 
How does this sound: Compensate for unequal numbers of spam and ham ---------------------------------------------- If your training database has significantly (5 times) more ham than spam, or vice versa, you may start seeing an increase in incorrect classifications (messages put in the wrong category, not just marked as unsure). If so, this option allows you to compensate for this, at the cost of increasing the number of messages classified as "unsure". Note that the effect is subtle, and you should experiment with both settings to choose the option that suits you best. This is always going to be an "advanced" option, so I don't see the longer explanation as a bad thing... > Your friend could spend his time better by collecting more ham <0.9 wink>. It's a shame, nobody sends him any. We're both sad, unloved people :-) > Since mass testing here stopped, we haven't got useful feedback on any of > the non-default options. Since there wasn't enough info to decide about > them when mass testing stopped, they still deserve a chance to survive. I > hope mass testing resumes, but I can't drive it (no time). Until it does > resume, the continued existence of these options seems appropriate. Fair enough. I agree about testing, but I also don't have the time to do a good job (or the understanding, or the large corpus of data...) Spambayes is a victim of its own success. Theoretically, it's still only alpha, but we're getting a real live user base, support issues, the lot. I'm not sure whether to blame Microsoft for getting people used to the idea that alpha is as good as it gets, or the Greeks for not having any letters before alpha :-) Thanks for taking the time to explain all this. Paul. 
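[Editor's note] Tim's remark in the thread above, that only word counts are stored and that the imbalance adjustment is applied when a probability is computed, can be sketched as follows. This is a reconstruction from memory of the general approach, not a copy of spambayes/classifier.py: the function name `word_spamprob`, its parameters, and the default values 0.45 and 0.5 are assumptions made for the illustration.

```python
def word_spamprob(spamcount, hamcount, nspam, nham,
                  adjust_imbalance=False,
                  unknown_word_strength=0.45,
                  unknown_word_prob=0.5):
    """Estimate a word's spam probability from stored counts.

    Hypothetical sketch: spamcount/hamcount are the numbers of trained
    spam/ham messages containing the word, nspam/nham the total numbers
    of trained spam/ham messages.
    """
    if nspam <= 0 or nham <= 0:
        raise ValueError("need at least one trained spam and one trained ham")
    if spamcount + hamcount == 0:
        return unknown_word_prob  # never-seen word: fall back to the prior

    hamratio = hamcount / nham
    spamratio = spamcount / nspam
    prob = spamratio / (hamratio + spamratio)

    if adjust_imbalance:
        # Scale down the evidence from the over-represented class.
        spam2ham = min(nspam / nham, 1.0)
        ham2spam = min(nham / nspam, 1.0)
    else:
        spam2ham = ham2spam = 1.0

    # Effective number of observations, then a Bayesian-style blend of
    # the raw estimate with the unknown-word prior.
    n = hamcount * spam2ham + spamcount * ham2spam
    s = unknown_word_strength
    return (s * unknown_word_prob + n * prob) / (s + n)


if __name__ == "__main__":
    # Counts like those of Paul's friend: 1300 spam and 150 ham trained.
    print(word_spamprob(3, 0, 1300, 150))
    print(word_spamprob(3, 0, 1300, 150, adjust_imbalance=True))
```

Because both calls read the same stored counts, switching the flag needs no retraining. In this sketch the adjusted value sits closer to 0.5, which is consistent with the trade-off described in the thread: fewer confident mistakes at the cost of more "unsure" results.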
From lists at olivermaunder.co.uk Fri May 23 12:19:13 2003 From: lists at olivermaunder.co.uk (Oliver Maunder) Date: Fri May 23 06:20:48 2003 Subject: [Spambayes] Experimental Ham/Spam imbalance setting In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A3F@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A3F@its-xchg4.massey.ac.nz> Message-ID: <3ECDF5A1.7010908@olivermaunder.co.uk> Meyer, Tony wrote: >The trickiest thing here is that the Outlook users (who probably are a >majority now?) do use the option, while by default, the others don't. >We really need people to put in some time testing to see if it is worth >keeping or not. If it is, I gather from (Tim P's?) comment in the code >that it can just be enabled for everyone. > > I'll turn the option on, and see what happens. I get a *lot* more spam than ham. At the moment, I'm only training on unsures - but all the unsures are clearly spam, so the spam db is growing much faster than ham. At the moment, about a third of my spam ends up classified as Unsure. There's never any ham in the Unsure folder. I'll try the ham-spam-imbalance option and see if there's any change. If not, I'll just need to review my threshold settings. Olly From noreply at sourceforge.net Fri May 23 05:14:37 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 07:14:40 2003 Subject: [Spambayes] [ spambayes-Support Requests-742258 ] Shutdown/restart Message-ID: Support Requests item #742258, was opened at 2003-05-23 06:14 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498104&aid=742258&group_id=61702 Category: None Group: None Status: Open Priority: 5 Submitted By: Mike Ring (mikering) Assigned to: Nobody/Anonymous (nobody) Summary: Shutdown/restart Initial Comment: Whenever I log off and restart Outlook, SpamBayes doesn't filter messages on startup.
I have to retrain it and then filter the Inbox. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498104&aid=742258&group_id=61702 From mail at peterbe.com Fri May 23 14:05:56 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Fri May 23 08:13:17 2003 Subject: [Spambayes] Suddenly a lot of errors In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB328D@its-xchg4.massey. ac.nz> Message-ID: <5.2.0.9.0.20030523130045.021d9d98@mail.grenna.net> At 10:55 2003-05-21 +1200, Meyer, Tony wrote: > > "pickle or a dbm"??? I have no idea. What should it be and > > how do you change? > >Sorry, I should have been clearer. Unless you have changed the >"persistent_use_database" option or are calling pop3proxy/imapfilter >with the "-d" option, you will be using a database and not a pickle. To be honest, I don't understand the -d and -D switches to the commandline. I've never used them and maybe this is a reminder to the Spambayes homepage webmasters to explain it a little bit for dummies. >There seems to be a general consensus that your database was pretty >large for the number of messages trained. This might be as a result of >the messageinfo db problems (before there was an explicit path to it) - >some messages may have been trained multiple times. I bow my head in shame. I was wrong. What I reported as so big was hammie.db.dat or hammie.dat hammie.db was less than a fifth in size. Thank you for your support. Spambayes seems to have a problem the second time you run it. Yesterday I installed the latest CVS without problem (except that Wordcount) didn't work after you had trained an mbox. But then (and this happened last time too) when I the second time run it I get these error messages: C:\Python22\spambayes>C:\Python22\python.exe pop3proxy.py Loading database... Done. 
Listener on port 110 is proxying mail.grenna.net:110 User interface url is http://localhost:8880/ error: uncaptured python exception, closing channel <__main__.ServerLineReader connected at 0xb83140> (exceptions.AssertionError: [C:\Python22\lib\asyncore.py|poll|99] [C:\Python22\lib\asyncore.py|handle_read_event|396] [C:\Python22\lib\asynchat.py|handle_read|130] [pop3proxy.py|found_terminator|146] [pop3proxy.py|onServerLine|214] [pop3proxy.py|onResponse|288] [pop3proxy.py|onTransaction|389] [pop3proxy.py|onRetr|440] [spambayes\classifier.py|chi2_spamprob|217] [spambayes\classifier.py|_getclues|445] [spambayes\classifier.py|probability|301]) Cheers, Peter From bill at parducci.net Fri May 23 07:13:52 2003 From: bill at parducci.net (bill parducci) Date: Fri May 23 09:13:57 2003 Subject: [Spambayes] List moeration References: <16077.12882.255940.607868@montanaro.dyndns.org> Message-ID: <3ECE1E90.6060307@parducci.net> Skip Montanaro wrote: > What's the term? We now eat our own dog food? you are eating it, it's just being served up in individual portions! :) b From bill at parducci.net Fri May 23 07:24:00 2003 From: bill at parducci.net (bill parducci) Date: Fri May 23 09:24:04 2003 Subject: FAQ (Was RE: [Spambayes] /.?) References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A00@its-xchg4.massey.ac.nz> <16077.37263.719901.63141@montanaro.dyndns.org> Message-ID: <3ECE20F0.2050804@parducci.net> Skip Montanaro wrote: > Bill sent me a new section today which I merged in. I also tweaked it a bit > in other more cosmetic ways. yikes! can you send me the updated version? i am still creating FAQs. > Would someone like to merge the two please? Maybe then the FAQ.txt file can > just point to the website. It appears that most of our users are going to > be quickly addicted to Mark's installer. working on it... 
b From mhammond at skippinet.com.au Fri May 23 10:42:51 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri May 23 09:35:33 2003 Subject: [Spambayes] bug in binary installer In-Reply-To: <615D36A41691D211BB0900A0C9D619990452EA5F@houmis03.dhs.state.tx.us> Message-ID: <119901c320bb$dc555960$530f8490@eden> The good news is that you are not the first to report this. The bad news is we don't really know what is causing it. It appears it is the same as the bug in https://sourceforge.net/tracker/index.php?func=detail&aid=740198&group_id=61702&atid=498103, and hopefully we will sort it out ready for the new release. Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Proske,David > Sent: Friday, 23 May 2003 5:19 AM > To: spambayes@python.org > Subject: [Spambayes] bug in binary installer > > > I have a win2000 running outlook 2000 sr1 and I get the > following message > <> and the logs are as follows the first two time > the log was > empty but the next two time I got the following logs. > <> > <> > David > From mhammond at skippinet.com.au Fri May 23 10:37:29 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri May 23 09:35:44 2003 Subject: [Spambayes] Usage on XP version of Outlook In-Reply-To: <4358DC03B08CD31191A60006293844E20326FFE7@FWM-MAIL> Message-ID: <119401c320bb$1cb6a190$530f8490@eden> > I just wanted you to know that I have installed and it runs > great. Great - although it doesn't sound like it > I do > have a question about the filter button on the Xp version of > outlook. After > using the train mode and setting the filters. The enable > filter button is > grayed out, not available. Haven't had this problem on 2000 > outlook. Is > this a problem? Does it still run automatically? If the "Enable filtering" button is disabled, then I doubt it is running great - it is probably not filtering anything.
The main dialog should indicate why filtering cannot be enabled - generally because there are fewer than 5 good and 5 bad messages in the training database. Otherwise, please see Troubleshooting.html, and attach a log file. Regards, Mark. From Ben.Marshall at swinc.com Fri May 23 09:39:18 2003 From: Ben.Marshall at swinc.com (Marshall, Ben) Date: Fri May 23 09:39:53 2003 Subject: [Spambayes] Quick Question Message-ID: <4836C7598D2C5F448B888393437FBC0601083A32@moose.austin.swinc.com> Yes, Outlook 11 is Outlook 2003. How can I help? Thanks, Ben -----Original Message----- From: Meyer, Tony [mailto:T.A.Meyer@massey.ac.nz] Sent: Thursday, May 22, 2003 9:28 PM To: Marshall, Ben; spambayes@python.org > Will the spambayes work with outlook 11? What is Outlook 11? Is that the numeric version number for Outlook 2003? (My copy of Outlook 2002 has 10 in the version number, so that might make sense). If it is, then the answer is 'not yet'. We would welcome help in changing this, though. =Tony Meyer From skip at pobox.com Fri May 23 09:46:34 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri May 23 09:46:41 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: References: <16077.36960.486158.231011@montanaro.dyndns.org> Message-ID: <16078.9786.637017.288318@montanaro.dyndns.org> >>>>> "Tim" == Tim Peters writes: Tim> [Skip] >> I thought Outlook 2000 was the version which got the most exercise. Tim> I have three of those, and they all work fine (using CVS spambayes; Tim> I haven't tried the binary installer yet). I just tried the binary installer on Win2k Pro (SP3) w/ Outlook 2k (SR1). Seems to work fine once set up properly. The only change I'd make to the initial interface is to either grey out the "Delete as Spam" button until the user has defined a spam folder or pop up a dialog box if they click it without having a spam folder defined. I spun my wheels for a few minutes until I realized I had put the cart before the horse.
Skip From tim.one at comcast.net Fri May 23 11:12:17 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri May 23 10:14:48 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: <16078.9786.637017.288318@montanaro.dyndns.org> Message-ID: [Skip Montanaro] > I just tried the binary installer on Win2k Pro (SP3) w/ Outlook 2k > (SR1). Seems to work fine once set up properly. Yup, that's the combo I use at work, and others where I work have used the binary installer with that combo too. I also use two Win98SE + OL2K (SR1) combos, and the binary installer doesn't work on Win98SE (or presumably any flavor of Win94 or ME either). > The only change I'd make to the initial interface is to either grey out > the "Delete as Spam" button until the users has defined a spam folder or > pop up a dialog box if they click it without having a spam folder > defined. I spun my wheels for a few minutes until I realized I had put > the cart before the horse. I'm sure Mark will appreciate your patch . From tim at fourstonesexpressions.com Fri May 23 10:16:53 2003 From: tim at fourstonesexpressions.com (Tim Stone) Date: Fri May 23 10:17:12 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: References: Message-ID: On Fri, 23 May 2003 10:12:17 -0400, Tim Peters wrote: > Yup, that's the combo I use at work, and others where I work have used > the > binary installer with that combo too. I also use two Win98SE + OL2K > (SR1) > combos, and the binary installer doesn't work on Win98SE (or presumably > any > flavor of Win94 or ME either). I'm reasonably sure there has been NO testing on Win94. 
c'est moi - TimS From skip at pobox.com Fri May 23 10:23:46 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri May 23 10:24:00 2003 Subject: [Spambayes] Re: Compatibility matrix In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3B27@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3B27@its-xchg4.massey.ac.nz> Message-ID: <16078.12018.785964.97356@montanaro.dyndns.org> Tony> What's the story with the auto-reply text? The barrage of Tony> identical Outlook plugin problems is starting to be Tony> overwhelming...(not that the auto-reply will necessarily solve Tony> these). It's essentially ready to go. I decided to hold off on asking Barry to set things up yesterday because the mail server was having problems and I knew Barry was working on the Python 2.2.3c1 release. So, Barry, would now be a good time to ask you to make Mailman return http://spambayes.sf.net/reply.txt to new posters (or people who haven't seen it in awhile)? Thanks, Skip From tim.one at comcast.net Fri May 23 11:26:43 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri May 23 10:27:48 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: Message-ID: >> Yup, that's the combo I use at work, and others where I work have >> used the binary installer with that combo too. I also use two >> Win98SE + OL2K (SR1) combos, and the binary installer doesn't work >> on Win98SE (or presumably any flavor of Win94 or ME either). [Tim Stone] > I'm reasonably sure there has been NO testing on Win94. Sorry, my five key is worn out from responding to people about bugs in Win9five. So I type Win94 instead. This is much easier to believe than that I had a typo . 
or-mabye-it-was-a-network-error!-terrorism?-sars-ly y'rs - tim From skip at pobox.com Fri May 23 10:35:33 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri May 23 10:35:39 2003 Subject: [Spambayes] Alternative to binary installer Message-ID: <16078.12725.91836.254995@montanaro.dyndns.org> A lot of people are running into the "no codec search functions registered: can't find encoding" install problem. To get them going I think we will need a dead simple set of instructions for installing from source. Is this documented somewhere on the website already? Not having tried it, my guess would be: 1. Download and run the Python 2.2.2 (or 2.2.3 or 2.3 when available) installer. 2. Install Mark's win32all package. 3. Install the 1.0a2 (or CVS) source package. 4. From a DOS shell positioned in the Outlook2000 directory, run "python addin.py". Is that roughly correct? Skip From skip at pobox.com Fri May 23 10:43:27 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri May 23 10:44:34 2003 Subject: [Spambayes] Outlook 2000 on XP, installed Python and killed add-in In-Reply-To: <21631E523BEBD41197FC0020AFCF88F613735D@dsl-mr-207-34-113-i59-cgy.nucleus.com> References: <21631E523BEBD41197FC0020AFCF88F613735D@dsl-mr-207-34-113-i59-cgy.nucleus.com> Message-ID: <16078.13199.251565.853662@montanaro.dyndns.org> Russ> I am running WinXP with outlook 2000, Today in the course of Russ> setting up WINCVS I installed python. Subsequently my Spambayes Russ> add-in stopped working. I installed the plugin today (Win2k, Outlook2k) and verified that it seemed to be doing the right thing. Later on I installed the just-released 2.2.3c1 binary. The plugin seems to still be humming right along. Russ> I have corrected it by removing python and reinstalling the add-in Russ> but am curious why this should occur and how to remedy it so I can Russ> install python on this machine in future.
The only thing I can think of is that perhaps the version of Python installed with the plugin is different than the version you installed. The two Python versions might have a difference in the byte code they generate and interpret. After the installation, perhaps you got the new Python interpreter trying to interpret old byte codes or vice versa. What version of Python did you install? Skip From barry at python.org Fri May 23 11:46:56 2003 From: barry at python.org (Barry Warsaw) Date: Fri May 23 10:47:30 2003 Subject: [Spambayes] Re: Compatibility matrix In-Reply-To: <16078.12018.785964.97356@montanaro.dyndns.org> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3B27@its-xchg4.massey.ac.nz> <16078.12018.785964.97356@montanaro.dyndns.org> Message-ID: <3ECE3460.50507@python.org> Skip Montanaro wrote: > It's essentially ready to go. I decided to hold off on asking Barry to set > things up yesterday because the mail server was having problems and I knew > Barry was working on the Python 2.2.3c1 release. > > So, Barry, would now be a good time to ask you to make Mailman return > > http://spambayes.sf.net/reply.txt > > to new posters (or people who haven't seen it in awhile)? It is the best of all possible times you could have picked Skip! So I've uploaded the text and turned on autoreply for posters. Only. I didn't set up the autoreply for -owners or -request. This message should provoke the autoresponder for me, so we'll see if it worked. -Barry From skip at pobox.com Fri May 23 10:59:42 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri May 23 10:59:49 2003 Subject: FAQ (Was RE: [Spambayes] /.?) In-Reply-To: <3ECE20F0.2050804@parducci.net> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3A00@its-xchg4.massey.ac.nz> <16077.37263.719901.63141@montanaro.dyndns.org> <3ECE20F0.2050804@parducci.net> Message-ID: <16078.14174.295978.673153@montanaro.dyndns.org> >> Bill sent me a new section today which I merged in. 
I also tweaked >> it a bit in other more cosmetic ways. bill> yikes! can you send me the updated version? i am still creating bill> FAQs. It's what's on the web site: http://spambayes.sourceforge.net/faq.html or source: http://spambayes.sourceforge.net/faq.ht Skip From skip at pobox.com Fri May 23 11:35:05 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri May 23 11:35:13 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: References: <16078.9786.637017.288318@montanaro.dyndns.org> Message-ID: <16078.16297.755648.756038@montanaro.dyndns.org> >> The only change I'd make to the initial interface is to either grey >> out the "Delete as Spam" button until the users has defined a spam >> folder ... Tim> I'm sure Mark will appreciate your patch . No doubt. Looking at the source I sort of suspect this might already be in place for the CVS code. How do I go about seeing if that's the case? Can I just download the source and run Outlook2000/addin.py or do I have to do some sort of uninstall of the existing code? Thx, Skip From skip at pobox.com Fri May 23 11:38:03 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri May 23 11:38:11 2003 Subject: [Spambayes] Re: Compatibility matrix In-Reply-To: <3ECE3460.50507@python.org> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3B27@its-xchg4.massey.ac.nz> <16078.12018.785964.97356@montanaro.dyndns.org> <3ECE3460.50507@python.org> Message-ID: <16078.16475.944474.797715@montanaro.dyndns.org> Barry> This message should provoke the autoresponder for me, so we'll Barry> see if it worked. Thanks, Barry. It replied to me. Looks like we're set in that regard. If the reply.txt file gets tweaked do we need to tell you or do you open it with urllib.urlopen periodically? 
;-) Skip From barry at python.org Fri May 23 12:55:59 2003 From: barry at python.org (Barry Warsaw) Date: Fri May 23 11:56:33 2003 Subject: [Spambayes] Re: Compatibility matrix In-Reply-To: <16078.16475.944474.797715@montanaro.dyndns.org> References: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3B27@its-xchg4.massey.ac.nz> <16078.12018.785964.97356@montanaro.dyndns.org> <3ECE3460.50507@python.org> <16078.16475.944474.797715@montanaro.dyndns.org> Message-ID: <3ECE448F.9010002@python.org> Skip Montanaro wrote: > Barry> This message should provoke the autoresponder for me, so we'll > Barry> see if it worked. > > Thanks, Barry. It replied to me. Looks like we're set in that regard. If > the reply.txt file gets tweaked do we need to tell you or do you open it > with urllib.urlopen periodically? ;-) Heh. Actually Skip, you know the list admin password for spambayes now, so you can always update the text yourself! :) -Barry From tim.one at comcast.net Fri May 23 12:54:00 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri May 23 11:56:45 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: <16078.16297.755648.756038@montanaro.dyndns.org> Message-ID: [Skip] > Looking at the source I sort of suspect this might already > be in place for the CVS code. How do I go about seeing if that's the > case? Can I just download the source and run Outlook2000/addin.py or > do I have to do some sort of uninstall of the existing code? Because I haven't used the binary installer, I don't know what wrinkles that adds to the equation. I simply update from CVS, and that's all -- I don't even run addin.py (well, not since the first time). If I were you I'd certainly uninstall first, though -- why risk shortcuts? 
From jhadzima at alphas.com Fri May 23 08:43:15 2003 From: jhadzima at alphas.com (John Hadzima) Date: Fri May 23 12:16:32 2003 Subject: [Spambayes] possible bug Message-ID: Just installed SpamBayes and I think it breaks the auto-complete popup list for email addresses used in the To field in Outlook. The popup still appears but only with the first letter of each email address - the rest of the email address doesn't appear. I've uninstalled and reinstalled SpamBayes several times to confirm this problem. I'm running Outlook 2002 (SP2) on Windows 2000 (SP3). Attached are a couple screen shots that will hopefully give you a clearer idea of what I'm talking about. John <> <> -------------- next part -------------- A non-text attachment was scrubbed... Name: Outlook_AutoComplete_Working1.jpg Type: image/jpeg Size: 17323 bytes Desc: Outlook_AutoComplete_Working1.jpg Url : http://mail.python.org/pipermail/spambayes/attachments/20030523/5bc4c6c2/Outlook_AutoComplete_Working1-0001.jpg -------------- next part -------------- A non-text attachment was scrubbed... 
Name: Outlook_AutoComplete_Broken1.jpg Type: image/jpeg Size: 13937 bytes Desc: Outlook_AutoComplete_Broken1.jpg Url : http://mail.python.org/pipermail/spambayes/attachments/20030523/5bc4c6c2/Outlook_AutoComplete_Broken1-0001.jpg From noreply at sourceforge.net Fri May 23 10:21:02 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 12:21:08 2003 Subject: [Spambayes] [ spambayes-Bugs-742440 ] Outlook 2003(beta) install fails Message-ID: Bugs item #742440, was opened at 2003-05-23 09:21 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742440&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Kenneth Haynes (kh4yn3s) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2003(beta) install fails Initial Comment: I am running Windows XP, SP1. I have the Beta 2 version of Office that includes Outlook 2003. When I tried to install using the windows installer, I got no error messages, but neither did I get the configuration buttons on the menu. spambayes1.log: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\ken\Application Data\SpamBayes\default_configuration.pck' Loaded bayes database from 'C:\Documents and Settings\ken\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\ken\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.68803ms SpamAddin - Disconnecting from Outlook spambayes2.log and spambayes3.log all look the same as spambayes1.log. 
----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742440&group_id=61702

From noreply at sourceforge.net Fri May 23 10:26:28 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri May 23 12:26:34 2003
Subject: [Spambayes] [ spambayes-Bugs-740198 ] Installation fails with regsvr error
Message-ID:

Bugs item #740198, was opened at 2003-05-20 01:12
Message generated for change (Comment added) made by kwalker411
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740198&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Rob de Santos (rdesantos)
Assigned to: Mark Hammond (mhammond)
Summary: Installation fails with regsvr error

Initial Comment:
I'm trying to install SpamBayes into Outlook XP on Win98SE. The install
fails with a regsvr error. Attempts at manually registering the file also
fail. Here's the log:

Traceback (most recent call last):
  File "", line 10, in ?
  File "E:\src\Installer\iu.py", line 274, in importHook
  File "E:\src\Installer\iu.py", line 353, in doimport
  File "E:\src\spambayes\Outlook2000\addin.py", line 59, in ?
  File "E:\src\pythonex\com\win32com\client\gencache.py", line 368, in EnsureModule
pywintypes.com_error: (-2147312566, 'Error loading type library/DLL.', None, None)
SystemError: PyErr_NormalizeException() called without exception
Traceback (most recent call last):
  File "", line 11, in ?
AttributeError: 'module' object has no attribute 'OutlookAddin'

Any help?

Cheers,
Rob

----------------------------------------------------------------------

Comment By: Ken Walker (kwalker411)
Date: 2003-05-23 16:26

Message:
Logged In: YES
user_id=785557

Hi Fellas. As per Tony Meyer's request, I'm appending my version
information here. The plugin wouldn't install on my platform due to a
regsvr error.
System: Microsoft Windows 2000 5.00.2195 Service Pack 3 Mail Client: Microsoft Outlook 2002 10.4219.5025 Service Pack 2 Thanks, K ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 02:09 Message: Logged In: YES user_id=552329 >From the list: "I have a win2000 running outlook 2000 sr1 and I get the following message <> and the logs are as follows the first two time the log was empty but the next two time I got the following logs. <> <> David" The jpeg was just a screenshot of the regsvr error. Log3 is: "Traceback (most recent call last): File "", line 15, in DllRegisterServer File "E:\src\pythonex\com\win32com\server\register.py", line 388, in RegisterClasses IndexError: list index out of range" and log4 is: "warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) Unregistered: SpamBayes.OutlookAddin Traceback (most recent call last): File "", line 22, in DllUnregisterServer File "E:\src\spambayes\Outlook2000\addin.py", line 791, in UnregisterAddin File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named _winreg" ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-22 23:45 Message: Logged In: YES user_id=14198 Noting a dupe in https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741914&group_id=61702 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740198&group_id=61702 From mhammond at skippinet.com.au Fri May 23 10:52:22 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri May 23 12:37:16 2003 Subject: [Spambayes] Quick Question In-Reply-To: <4836C7598D2C5F448B888393437FBC0601083A12@moose.austin.swinc.com> 
Message-ID: <11a001c320bd$307d4650$530f8490@eden> > Will the spambayes work with outlook 11? It will eventually. We have no idea if it does now. I may grab a beta of this to test, but until I do, or until someone reports something conclusive, we just don't know. Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1564 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030523/db7943bd/winmail.bin From tim.one at comcast.net Fri May 23 14:08:25 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri May 23 13:09:53 2003 Subject: [Spambayes] preamble on SF bug page. In-Reply-To: <200305230621.h4N6LDp08992@localhost.localdomain> Message-ID: [Anthony Baxter] > Is it worth changing the SF project configuration to put a message at > the top of the bug report page saying that the Win98 install issues > are a known bug and that additional bug reports should not be filed > for it? And maybe a Paypal button for MarkH, in case people want to > encourage him to work on it sooner? ;) I added some text. Someone remind me to delete it when it becomes obsolete! From matos at attbi.com Fri May 23 15:49:11 2003 From: matos at attbi.com (David Matos) Date: Fri May 23 14:49:43 2003 Subject: [Spambayes] Autocomplete bug In-Reply-To: Message-ID: <000b01c3215b$fffce450$8d80b042@dexter> FWIW, I've experienced this same problem. ---------------------------------------------------------------------- Message: 1 Date: Fri, 23 May 2003 07:43:15 -0400 From: "John Hadzima" Subject: [Spambayes] possible bug To: Message-ID: Content-Type: text/plain; charset="us-ascii" Just installed SpamBayes and I think it breaks the auto-complete popup list for email addresses used in the To field in Outlook. The popup still appears but only with the first letter of each email address - the rest of the email address doesn't appear. 
I've uninstalled and reinstalled SpamBayes several times to confirm this problem. I'm running Outlook 2002 (SP2) on Windows 2000 (SP3). Attached are a couple screen shots that will hopefully give you a clearer idea of what I'm talking about. John From skip at pobox.com Fri May 23 14:57:42 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri May 23 14:57:47 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: References: <16078.16297.755648.756038@montanaro.dyndns.org> Message-ID: <16078.28454.179381.468975@montanaro.dyndns.org> >> Can I just download the source and run Outlook2000/addin.py or do I >> have to do some sort of uninstall of the existing code? Tim> Because I haven't used the binary installer, I don't know what Tim> wrinkles that adds to the equation. I simply update from CVS, and Tim> that's all -- I don't even run addin.py (well, not since the first Tim> time). If I were you I'd certainly uninstall first, though -- why Tim> risk shortcuts? Fair enough, but remember, you're dealing with a real Windows idiot here. Took me awhile to even find out how to uninstall the package. I did so, then verified that Outlook didn't show any smileys. I then checked out the spambayes CVS and installed (after installing win32all). Everything works okay except what I actually wanted to test. The uninstall step appears not to have zeroed out all my configuration stuff. It seemed to remember basic config stuff, like where I wanted to place spams and unsures, which defeated the purpose of this exercise, to see if the "Delete as spam" button would be made inactive. I guess the question is still open. How do I get back all the way to the pre-plugin state: no registration, no configuration, etc.? There must be some files or registry entries laying about even after the uninstall step. 
Skip From tim.one at comcast.net Fri May 23 16:12:00 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri May 23 15:14:14 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: <16078.28454.179381.468975@montanaro.dyndns.org> Message-ID: [Skip Montanaro] > Fair enough, but remember, you're dealing with a real Windows idiot > here. The irony is that MS believe they designed Windows for idiots . > Took me awhile to even find out how to uninstall the package. > I did so, then verified that Outlook didn't show any smileys. I then > checked out the spambayes CVS and installed (after installing > win32all). > > Everything works okay except what I actually wanted to test. The > uninstall step appears not to have zeroed out all my configuration > stuff. It seemed to remember basic config stuff, like where I wanted > to place spams and unsures, which defeated the purpose of this > exercise, to see if the "Delete as spam" button would be made > inactive. > > I guess the question is still open. How do I get back all the way to > the pre-plugin state: no registration, no configuration, etc.? There > must be some files or registry entries laying about even after the > uninstall step. I haven't used the binary installer so I don't know what all it does. A definitive answer here will have to come from Mark, or someone else Windows-savvy who has actually used it. If you have a directory somewhere (assuming Win2K) like \Documents and Settings\ YOUR_LOGIN_NAME\ Application Data\ SpamBayes\ then it's hiding config and database info in there, and you should blow that away. From kson001 at pacbell.net Fri May 23 13:33:28 2003 From: kson001 at pacbell.net (kson001) Date: Fri May 23 15:33:37 2003 Subject: [Spambayes] configuration help Message-ID: I can't find the configuration file that I saw during the installation. I need help in setting up the spam score column in Outlook Inbox, as well as confirming use of filtering, etc. 
I set up Spam and Possible Spam folders and trained the program on the Spam folder. There is no help on the Spam Manager. Thanks, Kyung Son From tim.one at comcast.net Fri May 23 16:39:53 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri May 23 15:41:58 2003 Subject: [Spambayes] configuration help In-Reply-To: Message-ID: [Kyung Son] > I can't find the configuration file that I saw during the > installation. I need help in setting up the spam score column in > Outlook Inbox, as well as confirming use of filtering, etc. I set up > Spam and Possible Spam folders and trained the program on the Spam > folder. There is no help on the Spam Manager. Click the "About" button in the bottom center of the Anti-Spam Manager dialog for help. From Jerry.Weygint at tmcaz.com Fri May 23 13:41:48 2003 From: Jerry.Weygint at tmcaz.com (Jerry Weygint) Date: Fri May 23 15:45:00 2003 Subject: [Spambayes] SpamBayes problem? Message-ID: I installed the SpamBayes plugin into Outlook 2002 (xp) and I'm noticing now that when I address messages, I'm not getting the options to auto-complete the email address anymore. I get a box that pops up, but it just contains the first letter of whatever name I'm entering. Have you heard of this before? Jerry Weygint Technical Systems Support Manager TMC HealthCare, Information Services 520.324.4232 From leisureman4 at lycos.com Fri May 23 15:56:07 2003 From: leisureman4 at lycos.com (leisureman4@lycos.com) Date: Fri May 23 15:56:09 2003 Subject: [Spambayes] CONDO for sale Message-ID: <4128-22003552319567908@home> If you ever wanted to get a timeshare, now is the time. I have one for sale in North Myrtle Beach, SC. I paid over $5,000 for it. It is up for sale to the first caller for only $2,999. It is week 46 (around Thanksgiving time). The unit has all kinds of amenities including full kitchen, washer/dryer, ocean beach, covered outdoor pool, whirlpool, hot tub, you name it and it is all here. 
And not far away, you will find boating, fishing, golf, live entertainment, (another Nashville) medical facilities, restaurants, tennis courts and everything a fabulous resort community has to offer. To get detailed information hit “reply” and send "CONDO" with your name and phone number. I will get back with you. xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx Under Bill s.1618 Title III passed by the 105th US Congress, this letter cannot be considered Spam as long as the sender includes contact information and a method of "removal." If at any time you no longer wish to receive email from me, you may send a “REPLY” with the word REMOVE in the subject line. From tim.one at comcast.net Fri May 23 16:56:16 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri May 23 15:57:10 2003 Subject: [Spambayes] Experimental Ham/Spam imbalance setting In-Reply-To: <200305230802.11768.tdickenson@devmail.geminidataloggers.co.uk> Message-ID: [Toby Dickenson] > I tried this option for a few weeks, and it *seemed* to be a > disadvantage for me. A certain type of spam became indistinguishable > from ham. Those spams came often enough to be annoying, and the > annoyance stopped when I turned the option off. > > This is with a 10:1 ham:spam ratio, and a training regime that > retrains from scratch overnight with everything in my spam mailbox, > and everything in every other mailbox. That's most peculiar. The effect of enabling the option when you have more ham than spam is to make scores "less hammy". Something else may be at work with the "certain type of spam" in question. 
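
Tim's description of the option (with more ham than spam, enabling it makes scores "less hammy") can be illustrated with a small sketch. This is not copied from the project's classifier.py. It assumes the adjustment works by discounting the evidence count of the over-represented class inside Robinson's smoothing formula, and it assumes a default unknown_word_strength of 0.45; the 0.5 default for unknown_word_prob is the one mentioned on this list. The function name smoothed_word_prob and its argument names are made up for the example.

```python
def smoothed_word_prob(spamcount, hamcount, nspam, nham,
                       imbalance_adjustment=False,
                       unknown_word_strength=0.45,  # assumed default
                       unknown_word_prob=0.5):
    """Hypothetical sketch of a smoothed per-word spam probability.

    spamcount/hamcount: training messages of each class containing the word.
    nspam/nham: total training messages of each class.
    """
    s, x = unknown_word_strength, unknown_word_prob
    if spamcount == 0 and hamcount == 0:
        # A word never seen in training carries no evidence at all.
        return x

    # Raw estimate from per-class frequencies, so class sizes cancel out.
    hamratio = hamcount / nham
    spamratio = spamcount / nspam
    prob = spamratio / (hamratio + spamratio)

    if imbalance_adjustment:
        # Assumption: evidence from the larger class is scaled down,
        # evidence from the smaller class is left alone.
        spam2ham = min(nspam / nham, 1.0)
        ham2spam = min(nham / nspam, 1.0)
    else:
        spam2ham = ham2spam = 1.0

    # Effective number of observations backing the raw estimate.
    n = hamcount * spam2ham + spamcount * ham2spam

    # Shrink the raw estimate toward x; small n means strong shrinkage.
    return (s * x + n * prob) / (s + n)


# Toby's situation: roughly 10:1 ham:spam (here 1000 ham, 100 spam).
# A word seen in 2 ham and no spam:
print(round(smoothed_word_prob(0, 2, 100, 1000), 3))        # 0.092
print(round(smoothed_word_prob(0, 2, 100, 1000, True), 3))  # 0.346
```

Under these assumptions a ham clue is pulled toward 0.5 when ham dominates the training data, while a spam clue (say, a word seen in 2 spam and no ham) scores the same with or without the option. That is the direction Tim describes, which is why Toby's report of spam becoming harder to tell from ham is surprising.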
From noreply at sourceforge.net Fri May 23 14:36:39 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri May 23 16:36:48 2003 Subject: [Spambayes] [ spambayes-Bugs-740311 ] Fails on XP Message-ID: Bugs item #740311, was opened at 2003-05-20 08:09 Message generated for change (Comment added) made by hmeyer666 You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Rony Steelandt (bucodi) Assigned to: Nobody/Anonymous (nobody) Summary: Fails on XP Initial Comment: Installed the binary installer 002 on XP Pro No spamb buttons added Here is the logfile SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\rony.BUCODI\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? 
exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size'

----------------------------------------------------------------------

Comment By: Harald Meyer (hmeyer666)
Date: 2003-05-23 20:36

Message:
Logged In: YES
user_id=785713

Same here with Win2000 and Outlook XP

----------------------------------------------------------------------

Comment By: Stefan Franke (sfranke)
Date: 2003-05-20 22:28

Message:
Logged In: YES
user_id=431616

I have the same one with Outlook2000 (SR-1) on Win2k.

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702

From bill at parducci.net Fri May 23 16:05:39 2003
From: bill at parducci.net (bill parducci)
Date: Fri May 23 18:05:42 2003
Subject: [Spambayes] FAQ
Message-ID: <3ECE9B33.3010808@parducci.net>

here is the latest version of the FAQ. i have merged the previous FAQ with
INTEGRATION.txt (and a few other files) and added some recent stuff from the
list. the reason i pulled in all this stuff was because there is some overlap
with the FAQ and i think that a lot of the issues discussed elsewhere can be
more helpful if served up right from the beginning. (i for one have been
guilty of asking questions that had answers written down *somewhere* in the
code rep. ;-)

http://parducci.net/faq2.ht

anyway, for good or bad here it is for review.
(the internal links won't work until the file is moved to the main website and the first two tags are removed) b From seant at iname.com Fri May 23 19:12:11 2003 From: seant at iname.com (Sean True) Date: Fri May 23 18:12:48 2003 Subject: [Spambayes] Outlook 2003 In-Reply-To: Message-ID: >Category: Outlook >Group: None >Status: Open >Resolution: None >Priority: 5 >Submitted By: Kenneth Haynes (kh4yn3s) >Assigned to: Mark Hammond (mhammond) >Summary: Outlook 2003(beta) install fails We've tested SpamAtBay (the imminent commercial version of SpamBayes for Outlook) on 2003 quite a bit. Both SpamBayes and SpamAtBay fail to find the right COM interface, and then when they are fixed to, fail to get _any_ events delivered by Outlook 2003. There are a lot of issues floating around on the microsoft newsgroups about events in Outlook 2003. I suspect it's aggravated by the MS emphasis on .NET based addins. Basically, it doesn't work. When we figure out how (assuming we beat Mark to it, a dubious assumption), we'll share how with him! -- Sean Still looking for beta users, feel free to contact me. From hank at vantconsulting.com Fri May 23 17:05:21 2003 From: hank at vantconsulting.com (Henry Krzciuk) Date: Fri May 23 19:05:25 2003 Subject: [Spambayes] Remove SpamBayes Message-ID: <000101c3217f$c9cb6860$0300a8c0@LIFEBOOK> I tried loading to Outlook 2002 on XP. Now I get an error every time I open Outlook asking for Drive e:. Plus it didn't create spam folder. How can I remove SpamBayes from outlook. Thanks, Hank From skip at pobox.com Fri May 23 19:20:36 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri May 23 19:20:41 2003 Subject: [Spambayes] Remove SpamBayes In-Reply-To: <000101c3217f$c9cb6860$0300a8c0@LIFEBOOK> References: <000101c3217f$c9cb6860$0300a8c0@LIFEBOOK> Message-ID: <16078.44228.599056.729671@montanaro.dyndns.org> Henry> I tried loading to Outlook 2002 on XP. Now I get an error every Henry> time I open Outlook asking for Drive e:. 
Plus it didn't create Henry> spam folder. The drive e: problem is one some other people have reported. Do you have a removable device attached to drive e:? If so, try inserting the appropriate media. As for the missing spam folder, I believe you need to create it yourself from Outlook, then tell the plugin to put messages it classifies as spam in it. You should also create an Uncertain folder and tell it to put messages marked unsure there. Henry> How can I remove SpamBayes from outlook. Try searching for the plugin in the Add/Remove Software app in the Control Panel. Skip From jotdot at covad.net Fri May 23 17:24:50 2003 From: jotdot at covad.net (Gene and Jo Ann) Date: Fri May 23 19:26:24 2003 Subject: [Spambayes] Spambayes Message-ID: <000c01c32182$8287aa60$1686fea9@sony1> I was wondering if you have this available for outlook express yet? If not, I will bide my time... Thanks From tim_one at email.msn.com Fri May 23 21:46:28 2003 From: tim_one at email.msn.com (Tim Peters) Date: Fri May 23 20:47:06 2003 Subject: [Spambayes] Spambayes In-Reply-To: <000c01c32182$8287aa60$1686fea9@sony1> Message-ID: [Gene and Jo Ann] > I was wondering if you have this available for outlook express yet? > If not, I will bide my time... It's unlikely there will ever be an add-in for OE -- OE is a stripped-down product that doesn't interact with other programs. If you're comfortable with playing with your computer, you could set up OE to use this project's POP3 proxy (although if the phrase "POP3 proxy" gives you chills, you may want to get a computer-nerd friend to help set it up). From tim_one at email.msn.com Fri May 23 22:46:53 2003 From: tim_one at email.msn.com (Tim Peters) Date: Fri May 23 21:47:34 2003 Subject: [Spambayes] Experimental Ham/Spam imbalance setting In-Reply-To: <16E1010E4581B049ABC51D4975CEDB88619A8E@UKDCX001.uk.int.atosorigin.com> Message-ID: [Tim] >> The fellow you're talking about has a pathologically low number of >> ham; [Moore, Paul] > Hmm. 
I was a little worried about that possibility. The trouble is, > it's a very similar situation to the one I'm in. I get virtually *no* > ham (excluding mailing lists, which are filtered off before the email > program sees them), but ridiculous amounts of spam (hundreds per > day). I'd ignore email totally, if it wasn't for the fact that the > few ham I do get are fairly important. > > I don't have any way of training on more ham - I train on it all. > > My current approach (which is working reasonably well) is to train on > ham and unsures only, until I get good results, then stop *totally*. > This has left me with a database containing 40-odd ham, and 150 spam. > My unsure rate is tolerable, so I accept that I'm not going to do > any better. I don't know. This project never did research on sub-200 msg databases, or on highly skewed databases until late in the game. My gut feeling is that the sample size is indeed too small for a statistical approach to do a great job. Still, you *can* fiddle the ham and spam cutoffs to try to compensate, and there a lot of other options to fiddle too. You'd be doing new research, of course. In the archives you'll find reports of experiments on fancier schemes using word n-grams (for n > 1), and I expect they could help a lot: word n-gram schemes certainly learn faster (fewer training msgs are needed to get comparable results). They didn't get pursued here because they ran slower, needed more memory and bigger databases, and in head-to-head tests on our typically much-larger-than-200-msg test sets didn't do better than the highly tuned unigram scheme this project is still using. > I'm close to going for the other option - get a new mail account :-( It's hard to imagine what good you're getting out of this one . ... >> It remains experimental because the evidence was/is spotty and mixed. > Yes, that was partly my point. 
As I understand things (I came into > this after the extensive testing work had pretty much died down) it > has become pretty much impossible to see significant test results > now, thanks to the level of effectiveness which has been achieved. It became flatly impossible to make any improvements on my main 50,000+ msg test database -- there were no false negatives remaining, and the 6-or-so FP remaining were hopeless. Those tests were still geared toward my original purpose, though, seeing whether this technology would work for high-volume Mailman mailing lists. All evidence said it would work superbly (but that still hasn't been done). What kinds of tweaks may work better for individual, lower-volume inboxes didn't get nearly as much attention. > What I see now is much more of a "real life gut feel" type of effect, > which is nearly impossible to either quantify, or to reproduce > reliably. Whether such evidence is useful is a difficult judgement > call :-( Objective results require a large variety of testers using their real life inboxes. > ... > Hmm. I think I could explain this in end-user language. How does this > sound: Made up . > Compensate for unequal numbers of spam and ham > ---------------------------------------------- > > If your training database has significantly (5 times) more ham > than spam, or vice versa, you may start seeing an increase in > incorrect classifications I've seen that a factor of 2 imbalance is enough to trigger surprises. > (messages put in the wrong category, not just marked as unsure). If > so, this option allows you to compensate for this, at the cost of > increasing the number of messages classified as "unsure". Also at the cost of misclassifying msgs in the *other* direction. 
So, e.g., setting the option True is most appropriate if you both (1) have more spam than ham, and (2) have a deeper fear of false positives than false negatives; or, (1a) have more ham than spam, and (2a) have a deeper fear of false negatives than false positives. Enabling the option is expected to increase the Unsure rate in either case. It's probably not the best way to deal with imbalance either, it's just the best I could dream up at the time. > Note that the effect is subtle, and you should experiment with > both settings to choose the option that suits you best. Why just this option? There are *many* options under the covers, and their effects on inboxes unlike the large and relatively balanced ones most people tested on simply isn't known. For example, decreasing unknown_word_strength may help a lot on small and/or lopsided databases -- or may hurt a lot. We simply don't know, since it wasn't tested, and it's easy enough to make up a *plausibility* argument either way. Decreasing it will almost certainly reduce the # of unsures, BTW -- but *probably* at the cost of increasing misclassification rates. When there's not much data to go on, it's likely hard to get a pure win. BTW, as you boost unknown_word_strength toward infinity, every msg will tend toward a score of unknown_word_prob (which defaults to 0.5). ... >> Since mass testing here stopped, we haven't got useful feedback on >> any of the non-default options. Since there wasn't enough info to >> decide about them when mass testing stopped, they still deserve a >> chance to survive. I hope mass testing resumes, but I can't drive >> it (no time). Until it does resume, the continued existence of >> these options seems appropriate. > Fair enough. I agree about testing, but I also don't have the time to > do a good job (or the understanding, or the large corpus of data...) > > Spambayes is a victim of its own success. 
Theoretically, it's still > only alpha, but we're getting a real live user base, support issues, > the lot. I'm not sure whether to blame Microsoft for getting people > used to the idea that alpha is as good as it gets, or the Greeks for > not having any letters before alpha :-) It's also a victim of economics: the people who did most of the theoretical "heavy lifting" (Gary Robinson, Rob Hooft, and me) aren't active here anymore, and nobody has filled that void yet. The things that were being tested when I got yanked from this haven't made any progress, and MarkH's attempt to get another test round started fizzled out (bless his heart for trying, though!). The protocols under which we developed this stuff (see TESTING.txt) are solid, and when testing stopped there were still more questions open than had been answered. Dealing with small and/or lopsided and/or real-life individual inboxes could be approached the same way, given someone ruthless enough to drive it, and enough volunteer testers to feed it. It Would Work. It's tedious and time-consuming work, though. That's something I learned from my life in commercial speech recognition: there's plenty of clever theory to be exploited, but making it work in real life requires an enormous investment in data collection, cleaning, tagging and analysis, and ruthlessness (lack of ego attachment) in letting the data tell you what is and isn't working. I suspect that's why no particularly good open source speech recog program has appeared: the huge mass of unglamorous grunt work required doesn't attract volunteers, and most clever ideas get shot down by the data (it's rarely an ego booster). We got a good start on playing that game here because my employer paid my salary to work on it at first, Gary Robinson was borderline obsessed with dreaming up theoretically clean foundations, and a wonderful group of testers was attracted enough by the novelty and promise of it all to play along. That was great while it lasted. 
Deployment is more important over the long run, but I regret that there's nothing driving the theoretical underpinnings anymore. BTW, I have to confess that it works so well on my personal 1,000-msg databases that I've got no incentive to try to make "spare time" for it anymore -- to the extent spam was chafing me, my itch is thoroughly scratched. > Thanks for taking the time to explain all this. Thanks for listening -- I bet you didn't expect the Spanish Inquisition . From bwims at yahoo.com Sat May 24 12:42:17 2003 From: bwims at yahoo.com (=?iso-8859-1?q?Brian=20Williams?=) Date: Sat May 24 06:42:52 2003 Subject: [Spambayes] Improvement request Message-ID: <20030524104217.40732.qmail@web41011.mail.yahoo.com> Hi, many thanks for a marvelous product! The only problem I have is that I use multiple outlook profiles. At work, I use Exchange and at home I use a personal folder for my mail delivery, which contains its own inbox, junk and maybe folders. When I initiate Outlook with my work folder, it really screws up the buttons. The manager button doesn't work, and I get a duplicate pair of buttons appearing at the start of the standard toolbar. Fix is to disable the COM object, exit, enter, re-enable COM object, (exit and re-enter?). The only thing I can do right now is to disable the COM object before moving profiles. What would be great would be either to have a button that toggles the functionality on/off, or (ideally) get it to recognise the different profiles. Please don't think me ungrateful... this is a great product! Thanks, Brian Williams --------------------------------- It's Samaritans' Week. Help Samaritans help others. Call 08709 000032 to give or donate online now at http://www.samaritans.org/support/donations.shtm From ba at arrivalprojects.com Sat May 24 10:46:40 2003 From: ba at arrivalprojects.com (Brent Ahsmuhs, Arrival Projects Group) Date: Sat May 24 10:46:35 2003 Subject: [Spambayes] Outlook Express Support? 
Message-ID: <003501c32203$4921bd20$94761e41@kc.rr.com>

Does SpamBayes support Outlook Express? I'm on V 5

Please advise, thanks!

Brent Ahsmuhs

From bill at parducci.net Sat May 24 11:15:29 2003
From: bill at parducci.net (bill parducci)
Date: Sat May 24 13:15:34 2003
Subject: [Spambayes] Outlook Express Support?
References: <003501c32203$4921bd20$94761e41@kc.rr.com>
Message-ID: <3ECFA8B1.1000106@parducci.net>

from an earlier post:

/*
Does Spambayes work with Outlook Express?

Outlook Express isn't a version of Outlook, it's a completely separate program (from the same company). Because they give it away for free, OE is a really stripped down program, and it's extremely difficult to create a plugin for it.

As someone else said, you can use pop3proxy or imapfilter (depending on whether you use POP3 or IMAP). Check out the INTEGRATION.TXT file for instructions. Pop3proxy/imapfilter aren't quite as 'transparent' as the Outlook plugin, but they're still quite easy to use/setup, and they use the same core, so the results will be the same
*/

b

Brent Ahsmuhs, Arrival Projects Group wrote:
> Does SpamBayes support Outlook Express? I'm on V 5
>
> Please advise, thanks!
>
> Brent Ahsmuhs
>
> _______________________________________________
> Spambayes mailing list
> Spambayes@python.org
> http://mail.python.org/mailman/listinfo/spambayes

From noreply at sourceforge.net Sat May 24 11:33:49 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat May 24 13:33:56 2003
Subject: [Spambayes] [ spambayes-Bugs-742791 ] Install fails with TypeError
Message-ID:

Bugs item #742791, was opened at 2003-05-24 17:33
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742791&group_id=61702

Category: Outlook
Group: v1.0 (example)
Status: Open
Resolution: None
Priority: 5
Submitted By: Rajendra Wall (dirtynuke)
Assigned to: Mark Hammond (mhammond)
Summary: Install fails with TypeError

Initial Comment:
on WXP/Outlook 2K, install failed with TypeError:

Traceback (most recent call last):
  File "", line 10, in ?
  File "E:\src\Installer\iu.py", line 274, in importHook
  File "E:\src\Installer\iu.py", line 353, in doimport
  File "E:\src\spambayes\Outlook2000\addin.py", line 59, in ?
  File "E:\src\pythonex\com\win32com\client\gencache.py", line 341, in EnsureModule
  File "E:\src\pythonex\com\win32com\client\gencache.py", line 213, in GetModuleForTypelib
  File "E:\src\pythonex\com\win32com\client\gencache.py", line 473, in _GetModule
  File "E:\src\Installer\iu.py", line 274, in importHook
  File "E:\src\Installer\iu.py", line 335, in doimport
  File "E:\src\Installer\iu.py", line 181, in getmod
  File "E:\src\Installer\archive.py", line 375, in getmod
TypeError: unbound method getmod() must be called with DirOwner instance as first argument (got ExtInPkgImporter instance instead)

Thanks!
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742791&group_id=61702 From w1az at arrl.net Sat May 24 20:15:41 2003 From: w1az at arrl.net (Bernie Gardner) Date: Sat May 24 15:15:42 2003 Subject: [Spambayes] Problem with pop3proxy Message-ID: <1053803470.4315.9.camel@linux.local> I'm just starting to use spambayes and need a little assistance. I am using SuSE linux 8.2 and Evolution as a mail program. When I set up the localhost:port address in Evolution and start pop3proxy with the same port number, I get a connection refused error when I try to check mail on the isp server. Thanks for any suggestions! -- Bernie Gardner From skip at pobox.com Sat May 24 16:28:10 2003 From: skip at pobox.com (Skip Montanaro) Date: Sat May 24 16:28:15 2003 Subject: [Spambayes] Problem with pop3proxy In-Reply-To: <1053803470.4315.9.camel@linux.local> References: <1053803470.4315.9.camel@linux.local> Message-ID: <16079.54746.974946.960554@montanaro.dyndns.org> Bernie> I'm just starting to use spambayes and need a little assistance. Bernie> I am using SuSE linux 8.2 and Evolution as a mail program. When Bernie> I set up the localhost:port address in Evolution and start Bernie> pop3proxy with the same port number, I get a connection refused Bernie> error when I try to check mail on the isp server. What app gets the connection refused, Evolution or pop3proxy? If it's pop3proxy, can you send us your configuration info? If it's Evolution, make sure you have pop3proxy started first. (I know that's probably obvious, but it's not clear that's what you did from your wording.) 
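One quick way to tell the two cases apart is to check whether anything is accepting connections on the local port at all. What follows is only a minimal sketch, assuming a current Python 3 interpreter; the function name `can_connect` and the port number 8110 are placeholders made up for illustration, not anything pop3proxy itself defines.

```python
import socket


def can_connect(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port is accepted."""
    try:
        # create_connection raises OSError (for example
        # ConnectionRefusedError) when nothing is listening on the port.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Placeholder port: use the same number that was typed into Evolution.
    proxy_port = 8110
    if can_connect("localhost", proxy_port):
        print("Something is listening on port", proxy_port)
    else:
        print("Nothing is accepting connections on port", proxy_port)
```

If this reports that nothing is listening while pop3proxy is running, the proxy is probably bound to a different port than the one the mail client is using; if it does connect, the refusal is more likely happening between pop3proxy and the ISP's server.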
Skip From rallan at helical.com Sat May 24 17:56:08 2003 From: rallan at helical.com (Robb Aley Allan) Date: Sat May 24 16:55:53 2003 Subject: [Spambayes] applescript to control pop3proxy in OS X Message-ID: <248C5B76-8E2A-11D7-B133-000393796B40@helical.com> Just for fun, I've written an applescript that launches and monitors the spambayes pop3proxy from a double-clickable icon, which can be stored on the desktop or the dock to stop and start the proxy. If you all have any interest in looking this over for inclusion as a utility, I'd be happy to send it along. -- Robb Aley Allan robb@helical.com robballan@minskoff.com From richard at varnes.com Sat May 24 17:28:46 2003 From: richard at varnes.com (Richard Varnes) Date: Sat May 24 17:29:37 2003 Subject: [Spambayes] Spambayes Installation Message-ID: <001401c3223b$790ee520$0200a8c0@Portable> I have installed Spambayes and it works perfectly. I was wondering if it remembers the characteristics of junk mail if the mail in the junk mail folder is deleted. Richard Varnes 225 Trails End Rd Elkhorn NE, 68022 richard@varnes.com tel: 402-289-5825 fax: 805-293-8545 cell: 402-690-4825 _____ Upgrade Your Email - Click here! From w1az at arrl.net Sat May 24 23:26:37 2003 From: w1az at arrl.net (Bernie Gardner) Date: Sat May 24 18:26:38 2003 Subject: [Spambayes] Problem with pop3proxy In-Reply-To: <16079.54746.974946.960554@montanaro.dyndns.org> References: <1053803470.4315.9.camel@linux.local> <16079.54746.974946.960554@montanaro.dyndns.org> Message-ID: <1053814925.4316.14.camel@linux.local> On Sat, 2003-05-24 at 16:28, Skip Montanaro wrote: > Bernie> I'm just starting to use spambayes and need a little assistance. > Bernie> I am using SuSE linux 8.2 and Evolution as a mail program. When > Bernie> I set up the localhost:port address in Evolution and start > Bernie> pop3proxy with the same port number, I get a connection refused > Bernie> error when I try to check mail on the isp server. 
>
> What app gets the connection refused, Evolution or pop3proxy? If it's
> pop3proxy, can you send us your configuration info? If it's Evolution, make
> sure you have pop3proxy started first. (I know that's probably obvious, but
> it's not clear that's what you did from your wording.)
>
> Skip

Thanks for the reply and reasonable suggestions. I did start pop3proxy first, and in fact I can access the setup pages in my browser with no problem. I think it is Evolution which is refused by my local computer, not pop3proxy by the ISP. The error message definitely comes from Evolution and comes up immediately after I press send/receive.
--
Bernie Gardner

From sgrigsby at worldnet.att.net Sat May 24 20:50:14 2003
From: sgrigsby at worldnet.att.net (Shirley Grigsby)
Date: Sat May 24 20:45:55 2003
Subject: [Spambayes] Integration with Outlook Express - Tim's fix not appearing
Message-ID: <003201c32257$a5703800$2413570c@compaqcomputer>

Today I downloaded and installed version 1.0a2 and got it working as far as adding the "X-SpamBayes-Classification" header to the email. I then realized that Outlook Express would not sort on headers, nor, apparently, could my Outlook 97. I saw where Tim Stone addressed this problem on the website at http://mail.python.org/pipermail/spambayes/2003-February/003200.html:

>Comment By: Tim Stone (timstone4)
Date: 2003-02-08 20:36

Message:
Logged In: YES
user_id=645698

Added a configuration option to add classification to recipient list, which can be tested by Outlook Express mail rules. See Option Configuration page in the pop3proxy user interface.

However, when I open my newly downloaded pop3proxy.py interface, I do not find this option on the main page or on the linked Configurations page. Was it dropped in the newest version?

I would love to have a solution, as I spent my entire day getting this up and going . . . to no avail. Please help if you can; I'm desperate to stop the awful spam.
Many thanks,
Shirley Grigsby

From noreply at sourceforge.net Sat May 24 19:30:41 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat May 24 21:30:48 2003
Subject: [Spambayes] [ spambayes-Bugs-740843 ] No Disk Error with Outlook 2000 on startup
Message-ID:

Bugs item #740843, was opened at 2003-05-21 01:39
Message generated for change (Comment added) made by fgb
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740843&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Sam Snow (snowsam)
Assigned to: Mark Hammond (mhammond)
Summary: No Disk Error with Outlook 2000 on startup

Initial Comment:
After installing SpamBayes-Outlook-Setup-002.exe I am now getting an error dialog on Outlook startup. The box says:

(Header) Inbox - Microsoft Outlook:OUTLOOK.EXE - No Disk
(Body) There is no disk in the drive. Please insert a disk into drive \Device\Harddisk0\DR0.
(Buttons) Cancel, Try Again, Continue

I am able to click Cancel or Continue several times, and then Outlook goes ahead and opens up. I just installed this evening, so I am not sure if the filtering is still working correctly. I was able to train the program successfully.

I am using Office 2000 SP3 on Win 2000.

I will try to attach a jpg of the dialog box.

My error log says the following:

SpamAddin - Connecting to Outlook
Loaded bayes database from 'C:\Documents and Settings\Snow1\Application Data\SpamBayes\default_bayes_database.db'
Loaded message database from 'C:\Documents and Settings\Snow1\Application Data\SpamBayes\default_message_database.db'
Bayes database initialized with 0 spam and 0 good messages
Loaded databases in 4.64165ms
AntiSpam: Watching for new messages in folder Inbox
AntiSpam: Watching for new messages in folder Spam
Processing 0 missed spam in folder 'Inbox' took 31.9599ms
pythoncom error: Python error invoking COM method.
Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 203, in OnItemAdd File "E:\src\spambayes\Outlook2000\addin.py", line 163, in ProcessMessage File "E:\src\spambayes\Outlook2000\filter.py", line 15, in filter_message File "E:\src\spambayes\Outlook2000\manager.py", line 440, in score File "e:\src\spambayes\spambayes\classifier.py", line 217, in chi2_spamprob File "e:\src\spambayes\spambayes\classifier.py", line 465, in _getclues File "e:\src\spambayes\spambayes\classifier.py", line 316, in probability exceptions.AssertionError: ---------------------------------------------------------------------- Comment By: Ferruccio Barletta (fgb) Date: 2003-05-25 01:30 Message: Logged In: YES user_id=786210 I get the same error with Office 2002 SP1 on Windows XP SP1 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740843&group_id=61702 From tim_one at email.msn.com Sat May 24 23:01:27 2003 From: tim_one at email.msn.com (Tim Peters) Date: Sat May 24 22:02:11 2003 Subject: [Spambayes] Spambayes Installation In-Reply-To: <001401c3223b$790ee520$0200a8c0@Portable> Message-ID: [Richard Varnes] > I have installed Spambayes and it works perfectly. Glad to hear it! Please keep in mind that this is "alpha" software: the developers don't consider it to be production-quality yet, and you will find bugs. You're a pioneer if you're using it now (which is a good thing -- I'm not trying to dissuade you ). > I was wondering if it remembers the characteristics of junk mail if the > mail in the junk mail folder is deleted. 
That depends on the exact details of how you're using spambayes. In general, yes, it does remember, until you retrain it from scratch. Because it is alpha software, most of the developers keep their junk mail around, in order to recover and retrain after a bug damages the database.

From mail at peterbe.com Sun May 25 04:17:29 2003
From: mail at peterbe.com (Peter Bengtsson)
Date: Sat May 24 22:59:43 2003
Subject: [Spambayes] applescript to control pop3proxy in OS X
In-Reply-To: <248C5B76-8E2A-11D7-B133-000393796B40@helical.com>
Message-ID: <5.2.0.9.0.20030525031706.0220dea0@mail.grenna.net>

At 16:56 2003-05-24 -0400, Robb Aley Allan wrote:
>Just for fun, I've written an applescript that launches and monitors the
>spambayes pop3proxy from a double-clickable icon, which can be stored on
>the desktop or the dock to stop and start the proxy. If you all have any
>interest in looking this over for inclusion as a utility, I'd be happy to
>send it along.

Can this please be uploaded to the sourceforge files page?

> --
>Robb Aley Allan
>
>robb@helical.com
>robballan@minskoff.com
>
>
>_______________________________________________
>Spambayes mailing list
>Spambayes@python.org
>http://mail.python.org/mailman/listinfo/spambayes

From mhammond at skippinet.com.au Fri May 23 10:34:24 2003
From: mhammond at skippinet.com.au (Mark Hammond)
Date: Sun May 25 02:52:06 2003
Subject: [Spambayes] Compatibility matrix
In-Reply-To: <001501c32080$a93177c0$8d80b042@dexter>
Message-ID: <119301c320ba$ae52e560$530f8490@eden>

> Whoops. I forgot to mention that I used Mark's installer,
> version 002, on an
> SP2 version of Outlook XP.

I suspect that the specific version of Outlook XP is the real issue here. It also makes a lot of sense that SP2 would fix customer complaints about all kinds of plugins breaking, not just ours.

How about we say that Outlook XP SP2 is "known to work", and that earlier versions are "known to have some issues".
We can then correct it when people correct us :)

I would drop the "XP Home" and "XP Professional" distinction - I seriously doubt that would have any impact, and we should have "2000 Pro/Server" distinctions if we make that one.

Re the "Exchange Server" comments in your matrix: I would tone down the disclaimer, as it does now work fairly well in that environment (with one or two minor bugs specific to it). I even have a (very quiet) Exchange Server currently that I am able to run against.

I'm happy to hack something up on this if you like - just let me know when you have checked in your latest. It looks excellent though - thanks!

Mark.

From noreply at sourceforge.net Sun May 25 05:41:00 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun May 25 07:41:15 2003
Subject: [Spambayes] [ spambayes-Bugs-712480 ] Outlook 2002 (XP) installation fails
Message-ID:

Bugs item #712480, was opened at 2003-03-31 15:47
Message generated for change (Comment added) made by mhammond
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Paul Marrero (pmarrero)
Assigned to: Mark Hammond (mhammond)
Summary: Outlook 2002 (XP) installation fails

Initial Comment:
I use Office XP with the Outlook client. It appears that the registration was successful but I cannot find any menu buttons. XP clipboard does appear to have the icons. The command line train works. Not sure where to go from here.

----------------------------------------------------------------------

>Comment By: Mark Hammond (mhammond)
Date: 2003-05-25 21:41

Message:
Logged In: YES
user_id=14198

Fails with Outlook XP ver.10.4712.4219 sp-2

----------------------------------------------------------------------

Comment By: Mark Hammond (mhammond)
Date: 2003-05-23 09:40

Message:
Logged In: YES
user_id=14198

*sob* - so it certainly fails with Outlook SP2 here?
ecvogel - is your traceback identical to this one? ---------------------------------------------------------------------- Comment By: Eric C. Vogel (ecvogel) Date: 2003-05-23 05:20 Message: Logged In: YES user_id=784818 Same here. Outlook XP all SP and patches, WIn XP Pro SP1 all patches. The file is registered and in the COM and has the check. Exchange Server 2000 all SP and patches on SBS 2kSP-1 all patches. 002 install ---------------------------------------------------------------------- Comment By: Steve McCullough (sm5669) Date: 2003-05-20 08:24 Message: Logged In: YES user_id=782446 Same here. Outlook 2002 on Win2000 with all the latest SPs and patches applied. Log file... warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\universal.py", line 170, in dispatch retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 322, in _InvokeEx_ return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 541, in _invokeex_ return apply(func, args) File "E:\src\spambayes\Outlook2000\addin.py", line 654, in OnConnection File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\manager.py", line 15, in ? 
File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\msgstore.py", line 86, in ? File "E:\src\Installer\iu.py", line 296, in importHook exceptions.ImportError: No module named win32com.mapi ---------------------------------------------------------------------- Comment By: Tormod Hystad (thystad) Date: 2003-05-16 17:27 Message: Logged In: YES user_id=780089 Exactly same as Nick here. Win2003 RC2, Outlook XP (Norwegian, version 10.2627.4219 SP2, configured with one Exchange (Standard) email acount) and the 002 install package. Thanks, Tormod ---------------------------------------------------------------------- Comment By: Nick Trautmann (nicktrautmann) Date: 2003-05-12 18:15 Message: Logged In: YES user_id=776857 Similar here: pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? 
exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\Nick\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize Thanks Nick ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-01 12:48 Message: Logged In: YES user_id=552329 Actually, I get this too. I've just switched to Outlook XP, so I'm not sure if this is the reason, or just that I'm doing a fresh install. 
The log includes the following traces: SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - 
creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-03-31 16:05 Message: Logged In: YES user_id=552329 Which version of the Outlook plugin are you using? (a) the latest CVS, (b) the 001 stand-alone installer, or (c) the 002 stand-alone installer? I know that the 001 installer has been known to have this problem (although it appeared to be fixed in 002). 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 From noreply at sourceforge.net Sun May 25 05:47:35 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 25 07:47:38 2003 Subject: [Spambayes] [ spambayes-Bugs-743113 ] Multiple profile issues Message-ID: Bugs item #743113, was opened at 2003-05-25 21:47 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743113&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Multiple profile issues Initial Comment: The only problem I have is that I use multiple outlook profiles. At work, I use Exchange and at home I use a personal folder for my mail delivery, which contains its own inbox, junk and maybe folders. When I initiate Outlook with my work folder, it really screws up the buttons. The manager button doesn't work, and I get a duplicate pair of buttons appearing at the start of the standard toolbar. Fix is to disable the COM object, exit, enter, re-enable COM object, (exit and re-enter?). The only thing I can do right now is to disable the COM object before moving profiles. What would be great would be either to have a button that toggles the functionality on/off, or (ideally) get it to recognise the different profiles. 
----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743113&group_id=61702

From mhammond at skippinet.com.au Sun May 25 22:47:58 2003
From: mhammond at skippinet.com.au (Mark Hammond)
Date: Sun May 25 07:48:53 2003
Subject: [Spambayes] Improvement request
In-Reply-To: <20030524104217.40732.qmail@web41011.mail.yahoo.com>
Message-ID: <18f101c322b3$7db4d800$530f8490@eden>

Thanks - that sounds like a real bug :) I've added
http://sourceforge.net/tracker/index.php?func=detail&aid=743113&group_id=61702&atid=498103

Mark.

> -----Original Message-----
> From: spambayes-bounces@python.org
> [mailto:spambayes-bounces@python.org]On Behalf Of Brian Williams
> Sent: Saturday, 24 May 2003 8:42 PM
> To: SpamBayes@python.org
> Subject: [Spambayes] Improvement request
>
>
> Hi,
>
> many thanks for a marvelous product!
>
> The only problem I have is that I use multiple outlook
> profiles. At work, I use Exchange and at home I use a
> personal folder for my mail delivery, which contains its own
> inbox, junk and maybe folders. When I initiate Outlook with
> my work folder, it really screws up the buttons. The manager
> button doesn't work, and I get a duplicate pair of buttons
> appearing at the start of the standard toolbar. Fix is to
> disable the COM object, exit, enter, re-enable COM object,
> (exit and re-enter?).
>
> The only thing I can do right now is to disable the COM
> object before moving profiles.
>
> What would be great would be either to have a button that
> toggles the functionality on/off, or (ideally) get it to
> recognise the different profiles.
>
> Please don't think me ungrateful... this is a great product!
>
> Thanks,
>
> Brian Williams
>
>
>
>
> ---------------------------------
> It's Samaritans' Week. Help Samaritans help others.
> Call 08709 000032 to give or donate online now at > http://www.samaritans.org/support/donations.shtm > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > From mhammond at skippinet.com.au Sun May 25 23:00:20 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun May 25 08:01:16 2003 Subject: Format of Spam Field in Plugin (was RE: [Spambayes] (no subject)) In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301AB3B33@its-xchg4.massey.ac.nz> Message-ID: <18f201c322b5$37b639a0$530f8490@eden> [Tony] > > The instructions say that under "User Defined Fields" there > > will be a button with a "Spam" label. Sure enough it is > > there. :) However, when I drag the field to the Inbox header, > > the field Type is "Number" instead of "Percent" and the > > Format is "All Digits 1,234,567 -1,234,567" instead of > > "Rounded 65% -65%". > > Hopefully Mark will notice this message (hence the changed > subject) and > answer it, because he might know. Mark noticed it just fine - unfortunately, he still doesn't know > Mark, did you change from number to percent since the binary release? > Maybe my old folders are numbers, and new ones percents? I haven't touched that for ages, since, we thought, we got it right (or more accurately, got it as close enough to how we wanted that it actually seemed to work) Mark. -------------- next part -------------- A non-text attachment was scrubbed... 
Name: winmail.dat Type: application/ms-tnef Size: 2072 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030525/9d3f423f/winmail.bin From noreply at sourceforge.net Sun May 25 06:04:08 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 25 08:04:18 2003 Subject: [Spambayes] [ spambayes-Bugs-740311 ] 'skip_max_word_size' error on XP Message-ID: Bugs item #740311, was opened at 2003-05-20 18:09 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 >Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Rony Steelandt (bucodi) >Assigned to: Mark Hammond (mhammond) >Summary: 'skip_max_word_size' error on XP Initial Comment: Installed the binary installer 002 on XP Pro No spamb buttons added Here is the logfile SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\rony.BUCODI\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? 
exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- Comment By: Harald Meyer (hmeyer666) Date: 2003-05-24 06:36 Message: Logged In: YES user_id=785713 Same here with Win2000 and Outlook XP ---------------------------------------------------------------------- Comment By: Stefan Franke (sfranke) Date: 2003-05-21 08:28 Message: Logged In: YES user_id=431616 I have the same one with Outlook2000 (SR-1) on Win2k. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 From noreply at sourceforge.net Sun May 25 06:04:49 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 25 08:04:53 2003 Subject: [Spambayes] [ spambayes-Bugs-731538 ] SpamBayes not filtering messages. Message-ID: Bugs item #731538, was opened at 2003-05-03 05:05 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 >Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Brian Takita (btakita) >Assigned to: Mark Hammond (mhammond) Summary: SpamBayes not filtering messages. Initial Comment: The Delete as Spam button does not work either. It gives email spam a score if I request it. ---------------------------------------------------------------------- Comment By: Brian Takita (btakita) Date: 2003-05-04 08:21 Message: Logged In: YES user_id=620484 Windows 2000 Outlook XP Python 2.2 SpamBayes 1.0a2 ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-03 21:41 Message: Logged In: YES user_id=14198 Sorry - my mistake - you already did that - thanks! 
---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-03 21:40 Message: Logged In: YES user_id=14198 Can you please see the about.html file that comes with the package (click "About" on the main Window), see how to find the log for your sesssion, and attach it to this bug? Thanks ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-05-03 05:50 Message: Logged In: YES user_id=645698 Can you give us a bit more info here? What platform are you running on, what version of python, what version of Outlook, what version of spambayes (alpha1, alpha2, current cvs)? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 From noreply at sourceforge.net Sun May 25 06:06:45 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 25 08:06:49 2003 Subject: [Spambayes] [ spambayes-Bugs-743122 ] Outlook 2002 auto-complete breaks Message-ID: Bugs item #743122, was opened at 2003-05-25 22:06 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743122&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2002 auto-complete breaks Initial Comment: Just installed SpamBayes and I think it breaks the auto-complete popup list for email addresses used in the To field in Outlook. The popup still appears but only with the first letter of each email address - the rest of the email address doesn't appear. I've uninstalled and reinstalled SpamBayes several times to confirm this problem. I'm running Outlook 2002 (SP2) on Windows 2000 (SP3). 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743122&group_id=61702 From mhammond at skippinet.com.au Sun May 25 23:06:56 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun May 25 08:07:52 2003 Subject: [Spambayes] possible bug In-Reply-To: Message-ID: <190101c322b6$238cfb20$530f8490@eden> Thanks - I added http://sourceforge.net/tracker/index.php?func=detail&aid=743122&group_id=61702&atid=498103 Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of John Hadzima > Sent: Friday, 23 May 2003 9:43 PM > To: spambayes@python.org > Subject: [Spambayes] possible bug > > > Just installed SpamBayes and I think it breaks the auto-complete popup > list for email addresses used in the To field in Outlook. The popup > still appears but only with the first letter of each email > address - the > rest of the email address doesn't appear. I've uninstalled and > reinstalled SpamBayes several times to confirm this problem. I'm > running Outlook 2002 (SP2) on Windows 2000 (SP3). Attached > are a couple > screen shots that will hopefully give you a clearer idea of what I'm > talking about. > > John > > > -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2096 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030525/9df5ae66/winmail.bin From mhammond at skippinet.com.au Sun May 25 23:17:03 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun May 25 08:17:59 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix In-Reply-To: <16078.28454.179381.468975@montanaro.dyndns.org> Message-ID: <191401c322b7$8ecb6600$530f8490@eden> > I guess the question is still open.
How do I get back all > the way to the > pre-plugin state: no registration, no configuration, etc.? > There must be > some files or registry entries laying about even after the > uninstall step. The Outlook toolbar doesn't always behave exactly as documented either :) Unfortunately, you will need to right-click the toolbar, select "Customise", and Reset the standard toolbar. Mark. From noreply at sourceforge.net Sun May 25 06:31:08 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 25 08:31:10 2003 Subject: [Spambayes] [ spambayes-Bugs-743131 ] connection refused with pop3proxy Message-ID: Bugs item #743131, was opened at 2003-05-25 08:31 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743131&group_id=61702 Category: pop3proxy Group: None Status: Open Resolution: None Priority: 5 Submitted By: Bernie Gardner (berniegardner) Assigned to: Nobody/Anonymous (nobody) Summary: connection refused with pop3proxy Initial Comment: I'm just starting to use spambayes and need a little assistance. I am using SuSE linux 8.2 and Evolution as a mail program. When I set up the localhost:port address in Evolution and start pop3proxy with the same port number, I get a connection refused error when I try to check mail on the isp server. I started pop3proxy first,and in fact I can access the setup pages in my browser from localhost:8880 with no problem. I think it is Evolution which is refused by my local computer, not pop3proxy by the isp. The error message definitely comes from Evolution and comes up immediately after I press send/recieve. 
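One way to test the local half of this setup is to ask whether anything accepts TCP connections on the port Evolution was pointed at. The sketch below is written for a current Python 3 (not the Python 2.x of the period); `port_is_listening` is a made-up helper name, and the port to test is whatever was configured, which the report does not state.

```python
import socket

def port_is_listening(host, port, timeout=2.0):
    """Return True if a TCP connection to (host, port) is accepted.

    Hypothetical helper for illustration; not part of pop3proxy.
    """
    try:
        # create_connection raises OSError (e.g. ConnectionRefusedError)
        # when nothing is listening or the host cannot be reached.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling `port_is_listening("127.0.0.1", 8880)` should return True while the setup pages are reachable in the browser; a False for the POP3 port entered in Evolution would suggest that pop3proxy is not listening where Evolution expects it.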
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743131&group_id=61702 From noreply at sourceforge.net Sun May 25 08:40:41 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 25 10:40:47 2003 Subject: [Spambayes] [ spambayes-Bugs-740843 ] No Disk Error with Outlook 2000 on startup Message-ID: Bugs item #740843, was opened at 2003-05-21 01:39 Message generated for change (Comment added) made by fgb You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740843&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Sam Snow (snowsam) Assigned to: Mark Hammond (mhammond) Summary: No Disk Error with Outlook 2000 on startup Initial Comment: After installing SpamBayes-Outlook-Setup-002.exe I am now getting an error dialog on Outlook startup. The box says: (Header) Inbox - Microsoft Outlook:OUTLOOK.EXE - No Disk (Body) There is no disk in the drive. Please insert a disk into drive \Device\Harddisk0\DR0. (Buttons) Cancel, Try Again, Continue I am able to click cancel or continue several times and then outlook goes ahead and opens up. I just installed this evening, so I am not sure if the filtering is still working correctly. I was able to train the program sucessfully. I am using Office 2000 SP3 on Win 2000. I will try to attach a jpg of the dialog box. 
My error log says the following: SpamAddin - Connecting to Outlook Loaded bayes database from 'C:\Documents and Settings\Snow1\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\Snow1\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.64165ms AntiSpam: Watching for new messages in folder Inbox AntiSpam: Watching for new messages in folder Spam Processing 0 missed spam in folder 'Inbox' took 31.9599ms pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 203, in OnItemAdd File "E:\src\spambayes\Outlook2000\addin.py", line 163, in ProcessMessage File "E:\src\spambayes\Outlook2000\filter.py", line 15, in filter_message File "E:\src\spambayes\Outlook2000\manager.py", line 440, in score File "e:\src\spambayes\spambayes\classifier.py", line 217, in chi2_spamprob File "e:\src\spambayes\spambayes\classifier.py", line 465, in _getclues File "e:\src\spambayes\spambayes\classifier.py", line 316, in probability exceptions.AssertionError: ---------------------------------------------------------------------- Comment By: Ferruccio Barletta (fgb) Date: 2003-05-25 14:40 Message: Logged In: YES user_id=786210 I may have found the root cause of this problem. When I brought up disk management on my notebook I noticed that my hard drive was Disk1 and the SD media drive was Disk0. When I disabled the SD drive and rebooted, the hard drive became Disk0 and the problem disappeared. 
---------------------------------------------------------------------- Comment By: Ferruccio Barletta (fgb) Date: 2003-05-25 01:30 Message: Logged In: YES user_id=786210 I get the same error with Office 2002 SP1 on Windows XP SP1 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740843&group_id=61702 From skip at pobox.com Sun May 25 11:26:04 2003 From: skip at pobox.com (Skip Montanaro) Date: Sun May 25 11:26:07 2003 Subject: [Spambayes] Problem with pop3proxy In-Reply-To: <1053814925.4316.14.camel@linux.local> References: <1053803470.4315.9.camel@linux.local> <16079.54746.974946.960554@montanaro.dyndns.org> <1053814925.4316.14.camel@linux.local> Message-ID: <16080.57484.886961.802578@montanaro.dyndns.org> Bernie> I did start pop3proxy first, and in fact I can access the setup Bernie> pages in my browser with no problem. I think it is Evolution Bernie> which is refused by my local computer, not pop3proxy by the isp. Bernie> The error message definitely comes from Evolution and comes up Bernie> immediately after I press send/recieve. Can you tell that pop3proxy is listening to the port you think it should be listening to (localhost:110 for example)? If you're not running as root, make sure you selected a port > 1024. If you execute "netstat -an" you should see something like "127.0.0.1:NNN" in the Local Address column with "LISTEN" in the State column. You should also be able to telnet to port NNN of your machine while pop3proxy is running. Also try a "netstat -an" without running pop3proxy. Perhaps your system already has something listening to port NNN and you'll have to pick another port. Skip From bkc at murkworks.com Sun May 25 17:13:26 2003 From: bkc at murkworks.com (Brad Clements) Date: Sun May 25 15:54:23 2003 Subject: [Spambayes] Re: Collecting word lists.. 
- BUMMER In-Reply-To: <20030521164223.GA16321@calvin> References: <3ECB5F8E.18223.8466880@localhost> Message-ID: <3ED0EBA5.15635.1DF18942@localhost>

Ok, everyone say "I told you so".

Something seems amiss with my analysis, I just find it hard to believe that users have so few words in common.

I've collected wordlists from about 8 or 9 folks and want to see how many words are common. Either my code is bad, the sha collection process is flawed or there really isn't much in common, but I didn't expect it to be this bad.

First comparing personal corpuses that all claim to be english.

[bkc@strader data]$ python2.3 ../countwords.py key-hash.*
loading key-hash.ark3 24295
loading key-hash.markh 79488
loading key-hash.papadoc 19260
loading key-hash.popiel 376496
loading key-hash.skip 392161
loading key-hash.tameyer 45776
loading key-hash.wjp 198470
Loaded 7 wordlists with 947127 distinct words out of 1135946 total words
Word counts by number of collections each word is seen in
Col.    # Words   % of unique words
1        823450   86.9%
2         71131    7.5%
3         40919    4.3%
4         10690    1.1%
5           905    0.1%
6            32    0.0%
7             0    0.0%

And now throw in another one that claims to have "lots of french words"

[bkc@strader data]$ python2.3 ../countwords.py key-hash.* french-key-hash.alf
loading key-hash.ark3 24295
loading key-hash.markh 79488
loading key-hash.papadoc 19260
loading key-hash.popiel 376496
loading key-hash.skip 392161
loading key-hash.tameyer 45776
loading key-hash.wjp 198470
loading french-key-hash.alf 186599
Loaded 8 wordlists with 1059601 distinct words out of 1322545 total words
Word counts by number of collections each word is seen in
Col.    # Words   % of unique words
1        911816   86.1%
2         79202    7.5%
3         33255    3.1%
4         25001    2.4%
5          9437    0.9%
6           859    0.1%
7            31    0.0%
8             0    0.0%

And now, here's a collection of "test" data, which I assume came from the same test dataset, but seems strangely low common word count too.
[bkc@strader data]$ python2.3 ../countwords.py key-hash[0-9]*
loading key-hash12 17784
loading key-hash161 28584
loading key-hash162 18167
loading key-hash168 30469
loading key-hash533 16806
loading key-hash548 27110
loading key-hash578 30389
loading key-hash586 32707
loading key-hash600 16659
loading key-hash887 16463
Loaded 10 wordlists with 232531 distinct words out of 235138 total words
Word counts by number of collections each word is seen in
Col.    # Words   % of unique words
1        230098   99.0%
2          2276    1.0%
3           141    0.1%
4            15    0.0%
5             1    0.0%
6             0    0.0%
7             0    0.0%
8             0    0.0%
9             0    0.0%
10            0    0.0%

Finally, here's my code. Note that I use a dictionary to map each sha into a number, then do the analysis on the number. I also put the numbers into a set. I was planning on using the sets for further analysis but the "global numbers" are so bad, I don't see any point in playing around with the sets. (so, they're not used in this code)

Anyone see anything wrong with this code?

"""countwords.py"""
import sets
import os, sys

class WordList(object):
    def __init__(self,path):
        self.path = path
        self.name = os.path.basename(path)
        self.set = sets.Set()

    def load(self,globaldict,countdict):
        f = file(self.path,'rb')
        set = self.set
        nextID = len(globaldict) + 1
        while 1:
            w = f.read(20)
            if not w or len(w) != 20:
                break
            idnumber = globaldict.get(w,None)
            if not idnumber:
                globaldict[w] = idnumber = nextID
                nextID += 1
            set.add(idnumber)
            countdict[idnumber] = countdict.get(idnumber,0) + 1

class WordCollection(object):
    def __init__(self,filepathlist):
        """Load up the words in each file and cross check them"""
        self.globaldict = globaldict = dict()
        self.countdict = countdict = dict()
        self.collection = []
        self.totalwordcount = 0
        for filepath in filepathlist:
            w = WordList(filepath)
            self.collection.append(w)
            print "loading ",w.name,
            w.load(globaldict,countdict)
            l = len(w.set)
            print l
            self.totalwordcount += l

    def printstats(self):
        """Figure out some stats"""
        uniqueWordCount = len(self.globaldict)
        print "Loaded %d wordlists with %d distinct words out of %d total words" % \
              (len(self.collection), uniqueWordCount,self.totalwordcount)
        # now get counts for each word and bin them
        bins = [0]*(len(self.collection)+1)
        # should never be more then total number of WordLists
        for cnt in self.countdict.values():
            try:
                bins[cnt] += 1
            except IndexError:
                print "hey, cnt is ",cnt
        print "Word counts by number of collections each word is seen in"
        print "Col.",'',"# Words","% of unique words"
        for cnt,bin in enumerate(bins):
            if not cnt: continue
            print cnt,'',"% 9d" % bin,"%3.1f%%" % (100.0 * bin/uniqueWordCount)

if __name__ == "__main__":
    if len(sys.argv) > 1:
        wc = WordCollection(sys.argv[1:])
        wc.printstats()
    else:
        print "Usage: %s files ... " % sys.argv[0]

The above code is supposed to match this extraction code:

>>> import anydbm
>>> import sha
>>> db = anydbm.open("hammie.db")
>>> f = open("key-hash", "w")
>>> for k in db.keys():
...     o = sha.new(k)
...     f.write(o.digest())
...
>>> f.close()
>>> db.close()

-- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements

From bkc at murkworks.com Sun May 25 17:15:59 2003 From: bkc at murkworks.com (Brad Clements) Date: Sun May 25 15:56:54 2003 Subject: [Spambayes] The bounce message.. Message-ID: <3ED0EC3E.15965.1DF3E022@localhost>

Should indicate that the post went through to the list anyway. That is, if it did go through.. Did my post go through? did this one?

-- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements

From popiel at wolfskeep.com Sun May 25 14:18:23 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Sun May 25 16:18:28 2003 Subject: [Spambayes] The bounce message.. In-Reply-To: Message from "Brad Clements" of "Sun, 25 May 2003 16:15:59 EDT."
<3ED0EC3E.15965.1DF3E022@localhost> References: <3ED0EC3E.15965.1DF3E022@localhost> Message-ID: <20030525201823.A53132DE36@cashew.wolfskeep.com> In message: <3ED0EC3E.15965.1DF3E022@localhost> "Brad Clements" writes: >Should indicate that the post went through to the list anyway. > >That is, if it did go through.. Did my post go through? did this one? I saw it. - Alex From nas at python.ca Sun May 25 14:26:17 2003 From: nas at python.ca (Neil Schemenauer) Date: Sun May 25 16:23:30 2003 Subject: [Spambayes] Re: Collecting word lists.. - BUMMER In-Reply-To: <3ED0EBA5.15635.1DF18942@localhost> References: <3ECB5F8E.18223.8466880@localhost> <3ED0EBA5.15635.1DF18942@localhost> Message-ID: <20030525202617.GA22013@glacier.arctrix.com> Brad Clements wrote: > Ok, everyone say "I told you so". > > Something seems amiss with my analysis, I just find it hard to believe > that users have so few words in common. I think you are missing a piece of the puzzle. Of the words in common, how often do they appear in messages? IOW, your results could mean there are a lot of "noise" words bloating everyone's DB and that they don't significantly affect classification. The common words could be the ones that really matter. Neil From popiel at wolfskeep.com Sun May 25 14:44:30 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Sun May 25 16:44:36 2003 Subject: [Spambayes] Re: Collecting word lists.. - BUMMER In-Reply-To: Message from "Brad Clements" of "Sun, 25 May 2003 16:13:26 EDT." <3ED0EBA5.15635.1DF18942@localhost> References: <3ECB5F8E.18223.8466880@localhost> <3ED0EBA5.15635.1DF18942@localhost> Message-ID: <20030525204430.EC84B2DE36@cashew.wolfskeep.com> In message: <3ED0EBA5.15635.1DF18942@localhost> "Brad Clements" writes: >Ok, everyone say "I told you so". "I told you so", even though I didn't expect this sort of result. (I thought it would be pointless, not impossible.) 
>Something seems amiss with my analysis, I just find it hard to believe >that users have so few words in common. I also think this is extremely peculiar. >Either my code is bad, the sha collection process is flawed or there >really isn't much in common, but I didn't expect it to be this bad. I don't see anything wrong with your analysis code, so I'm beginning to suspect the collection process is flawed. I'm perfectly willing to make my non-SHA'd wordlist available for verification. If there's more than 71131 words in common between my wordlist and your wordlist, then we _know_ there's something wrong with the collection process. >First comparing personal corpuses that all claim to be english. > >Loaded 7 wordlists with 947127 distinct words out of 1135946 total words >Word counts by number of collections each word is seen in >Col. # Words % of unique words >1 823450 86.9% >2 71131 7.5% >3 40919 4.3% >4 10690 1.1% >5 905 0.1% >6 32 0.0% >7 0 0.0% Now this is just too strange to believe... for English mail, there should be at least about 5000 words in common among everybody; that's the size of the everyday usage English vocabulary. This number should even be bloated a bit by aliasing due to punctuation. - Alex From skip at pobox.com Sun May 25 18:27:14 2003 From: skip at pobox.com (Skip Montanaro) Date: Sun May 25 18:27:15 2003 Subject: [Spambayes] Re: Collecting word lists.. - BUMMER In-Reply-To: <3ED0EBA5.15635.1DF18942@localhost> References: <3ECB5F8E.18223.8466880@localhost> <3ED0EBA5.15635.1DF18942@localhost> Message-ID: <16081.17218.457732.286578@montanaro.dyndns.org> Brad> Something seems amiss with my analysis, I just find it hard to Brad> believe that users have so few words in common. "word" has to be defined. There are tons of synthetic tokens in the mix. I suspect many of those synthetic tokens will be unique among such a small population. Many (most? all?) synthetic words will contain a colon (some non-synthetic words will as well). 
I just quickly looked through my current database and found of 392,849 tokens, 138,855 contain a colon. Maybe the comparison should be restricted to non-synthetic tokens. Another quick scan shows that of the 392,849 tokens, 233,961 are hapaxes. Maybe another source of uniqueness. Skip

From mhammond at skippinet.com.au Mon May 26 09:50:01 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun May 25 18:50:27 2003 Subject: [Spambayes] Alternative to binary installer In-Reply-To: <16078.12725.91836.254995@montanaro.dyndns.org> Message-ID: <1a3c01c3230f$f99a33a0$530f8490@eden>

> Is that roughly correct?

Sounds completely correct to me. Mark.

From tim_one at email.msn.com Sun May 25 20:08:33 2003 From: tim_one at email.msn.com (Tim Peters) Date: Sun May 25 19:09:09 2003 Subject: [Spambayes] Re: Collecting word lists.. - BUMMER In-Reply-To: <3ED0EBA5.15635.1DF18942@localhost> Message-ID:

[Brad Clements, with odd results]

It's not credible that *no* word appears in all 7 wordlists. For example, everyone must have at least one msg with a sentence starting with "the".

> ...
> First comparing personal corpuses that all claim to be english.

There you go: MarkH's is Australian .

> ...
> 7 0 0.0%

Not a chance.

> def load(self,globaldict,countdict):
>     f = file(self.path,'rb')
>     set = self.set
>     nextID = len(globaldict) + 1
>     while 1:
>         w = f.read(20)
>         if not w or len(w) != 20:

Should have

    assert len(w) in (0, 20)

here to guard against corrupt input files.

>             break
>         idnumber = globaldict.get(w,None)
>         if not idnumber:
>             globaldict[w] = idnumber = nextID
>             nextID += 1
>         set.add(idnumber)
>         countdict[idnumber] = countdict.get(idnumber,0) + 1

It's unclear why you're mucking with globaldict. That is, countdict could be more easily indexed directly by SHA digest than by another layer of artificial id associated with SHA digests. I don't see anything obviously wrong, but the idnumber indirection interferes with clarity.
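The two suggestions above (assert on the record length, and index the counts directly by SHA digest) could be sketched roughly as follows. This is an illustration in current Python 3, not Brad's script and not code from the SpamBayes tree; the names `read_digests` and `bin_by_list_count` are made up for the example.

```python
from collections import Counter

DIGEST_SIZE = 20  # a SHA-1 digest is 20 bytes

def read_digests(path):
    """Return the set of 20-byte digests stored back to back in a file."""
    digests = set()
    with open(path, "rb") as f:  # binary mode: digests are raw bytes
        while True:
            w = f.read(DIGEST_SIZE)
            # A short, non-empty read means a truncated or corrupt file.
            assert len(w) in (0, DIGEST_SIZE), "corrupt input: %r" % path
            if not w:
                break
            digests.add(w)
    return digests

def bin_by_list_count(paths):
    """Map n -> number of distinct digests found in exactly n files."""
    seen_in = Counter()  # digest -> number of files that contain it
    for path in paths:
        # Each file contributes a set, so a digest counts once per file.
        seen_in.update(read_digests(path))
    return Counter(seen_in.values())
```

With the counts keyed by digest there is no separate id table to keep in step, and a file whose length is not a multiple of 20 bytes fails loudly instead of silently ending the read loop.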
> >>> import anydbm > >>> import sha > >>> db = anydbm.open("hammie.db") > >>> f = open("key-hash", "w") SHA digests are binary data, so it's necessary to open the output file in "wb" mode (and "w" mode is silently deadly on Windows). From T.A.Meyer at massey.ac.nz Mon May 26 12:15:54 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 25 19:16:36 2003 Subject: [Spambayes] Integration with Outlook Express - Tim's fix notappearing Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F563@its-xchg4.massey.ac.nz> [notate-to / notate-subject] > However, when I open my newly downloaded pop3proxy.py > interface, I do not find this option on the main page nor the > linked Configurations page. Was it dropped in the newest > version? I would love to have a solution, as I spent my > entire day getting this up and going . . . to no avail. > Please help if you can; I'm desperate to stop the awful spam. By "newest version", I presume you mean the alpha2 that you downloaded? [1]. The problem is that this was added _after_ alpha2 was released (alpha2 really is getting long in the tooth). (I didn't realise this! We need to be careful about saying that OE will work, and perhaps hurry even more with getting alpha3 out). There isn't a solution at the moment, unfortunately. However, alpha3 is due out *really* soon (I hope the end of this week), and that will include this feature. =Tony Meyer [1] AFAICT, neither notate-to or notate-subject have worked in CVS for quite a while now (they were killed when the message.py module was added). No-one seems to have noticed, though ;) They will be back before alpha3. From T.A.Meyer at massey.ac.nz Mon May 26 12:28:08 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 25 19:29:26 2003 Subject: [Spambayes] Re: Collecting word lists.. 
- BUMMER Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F56F@its-xchg4.massey.ac.nz> > And now, here's a collection of "test" data, which I assume > came from the same test > dataset, but seems strangely low common word count too. They came from two datasets, although one was much bigger than the other. As I said indicated (in the offlist mail I sent), I realised after I did this that I should have only chosen one from each dataset, since there should be a pretty big overlap here. > Loaded 10 wordlists with 232531 distinct words out of 235138 Given that half of these were from the same dataset, there should be a *lot* less distinct words than this. I can't say what the problem is, but the result is definitely not correct. =Tony Meyer From T.A.Meyer at massey.ac.nz Mon May 26 12:35:03 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 25 19:35:57 2003 Subject: [Spambayes] Outlook Express (yikes!) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F57C@its-xchg4.massey.ac.nz> I said this in a previous post, but it was kinda hidden, and this is fairly important, so I'm going to say it again (apologies to those that read the other one ;) Alpha2 *does not* support Outlook Express. OE can't filter on arbitrary headers, so although the messages get classified, OE can't do anything with them. TimS created a fix for this way back in February (adding information to the To: and/or Subject: headers), but this was *after* alpha2 was released. I didn't realise this (nor, does it seem, did anyone else), and so assumed that alpha2 would be ok for OE. Additionally, at the moment, the CVS code *does not* support OE. I'll try and find time today to put the notate-to and notate-subject code back in, but it's been gone for a while. So the only way to use Spambayes with OE would be to grab a CVS snapshot from a month or so ago (not exactly a process that's easy for OE users). If Skip/Bill etc could perhaps update the docs that would be great. 
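To make the idea of the fix concrete: a mail client that cannot filter on arbitrary headers can still filter on Subject, so the classification is copied there. The sketch below shows the general shape using the standard `email` package in current Python 3. It is not the SpamBayes implementation; the function name `notate_subject` and the `classification,subject` tag format are assumptions made for illustration.

```python
from email import message_from_string

def notate_subject(raw_message, classification):
    """Return the message text with the classification prefixed to Subject.

    Hypothetical helper; the tag format is an assumption, not taken
    from the SpamBayes source.
    """
    msg = message_from_string(raw_message)
    subject = msg.get("Subject", "")
    tagged = "%s,%s" % (classification, subject) if subject else classification
    if "Subject" in msg:
        # replace_header keeps the header's position in the message.
        msg.replace_header("Subject", tagged)
    else:
        msg["Subject"] = tagged
    return msg.as_string()
```

A client rule such as "Subject starts with spam," could then move the message to a junk folder without needing to see any extra headers.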
For the moment, probably a good idea to stop telling people that OE is ok :) (Although OE with IMAP is ok with CVS...) =Tony Meyer From mhammond at skippinet.com.au Mon May 26 10:58:18 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun May 25 19:59:12 2003 Subject: [Spambayes] New mailing lists Message-ID: <1a4f01c32319$841466f0$530f8490@eden> There seemed to be consensus that new mailing lists would be a good idea, specifically spambayes-users, spambayes-dev and spambayes-announce, with "spambayes" being an alias to either users or dev. I would love the see the "announce" one setup - simply so we can tell people with problems to subscribe so they know when to try our next big mess . How about we "just do it". I propose that -users, like -dev, be unmoderated and not restricted to member posting, for the same reasons discussed re this list. -announce should be restricted and moderated, and I volunteer to be it. Any objections? If not, then I suggest we just as Barry to wave his wand. That-huge-wand-with-4-strings ly, Mark. From mhammond at skippinet.com.au Mon May 26 11:04:15 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun May 25 20:05:11 2003 Subject: [Spambayes] Outlook 2000 on XP, installed Python and killed add-in In-Reply-To: <21631E523BEBD41197FC0020AFCF88F613735D@dsl-mr-207-34-113-i59-cgy.nucleus.com> Message-ID: <1a5201c3231a$591c3ad0$530f8490@eden> > I am running WinXP with outlook 2000, Today in the course of > setting up > WINCVS I installed python. Subsequently my Spambayes add-in > stopped working. > > > I have corrected it by removing python and reinstalling the > add-in but am > curious why this should occur and how to remedy it so I can > install python > on this machine in future. I'm very surprised to hear that. This *should* work fine, and indeed does for me where I have many different versions of Python hanging around. Skip's reply seems to indicate it works for him too. So unfortunately we will need more details. 
Please feel free to open a bug at source-forge. Mark. From noreply at sourceforge.net Sun May 25 18:07:58 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 25 20:08:03 2003 Subject: [Spambayes] [ spambayes-Bugs-743384 ] Show Spam Clues Marks Message as Read Message-ID: Bugs item #743384, was opened at 2003-05-26 12:07 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743384&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul C Coyle (pccoyle) Assigned to: Mark Hammond (mhammond) Summary: Show Spam Clues Marks Message as Read Initial Comment: After using "Show Spam Clues" on an unread message in the Spam folder, the message status changed form Read to Unread. My concern here is that an perhaps an acknowledge has been sent to the spammer showing the account as active? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743384&group_id=61702 From noreply at sourceforge.net Sun May 25 18:13:31 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 25 20:13:36 2003 Subject: [Spambayes] [ spambayes-Bugs-743387 ] Using Show Spam Clues Marks Message as Unread Message-ID: Bugs item #743387, was opened at 2003-05-26 12:13 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743387&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul C Coyle (pccoyle) Assigned to: Mark Hammond (mhammond) Summary: Using Show Spam Clues Marks Message as Unread Initial Comment: Using outlook 2000 and exchange 5.5. 
When I used the "Show Spam Clues" feature on an unread message in the Spam folder, the message status changed from Unread to read. My concern is that maybe and acknowledge was sent to the sender thereby showing my address as active? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743387&group_id=61702 From noreply at sourceforge.net Sun May 25 18:13:30 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 25 20:13:48 2003 Subject: [Spambayes] [ spambayes-Bugs-743384 ] Show Spam Clues Marks Message as Read Message-ID: Bugs item #743384, was opened at 2003-05-26 10:07 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743384&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul C Coyle (pccoyle) Assigned to: Mark Hammond (mhammond) Summary: Show Spam Clues Marks Message as Read Initial Comment: After using "Show Spam Clues" on an unread message in the Spam folder, the message status changed form Read to Unread. My concern here is that an perhaps an acknowledge has been sent to the spammer showing the account as active? ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-26 10:13 Message: Logged In: YES user_id=14198 I can reproduce this, but I am not at all sure I will be able to do anything about it. If you are concerned about auto-responses, then I suggest you customize outlook accordingly. Note however that spammers can check account details in other ways - especially if the body of the HTML message references anything that is downloaded from a server (such as an image). 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743384&group_id=61702 From T.A.Meyer at massey.ac.nz Mon May 26 13:14:44 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun May 25 20:15:29 2003 Subject: [Spambayes] Windows/Outlook compatibility matrix Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F5D6@its-xchg4.massey.ac.nz> > I haven't used the binary installer so I don't know what all > it does. A definitive answer here will have to come from > Mark, or someone else Windows-savvy who has actually used it. It doesn't really do all that much that's different from CVS, apart from the registration of the plugin itself, and creating log files rather than just throwing things into the trace. > If you have a directory somewhere (assuming Win2K) like > > \Documents and Settings\ > YOUR_LOGIN_NAME\ > Application Data\ > SpamBayes\ > > then it's hiding config and database info in there, and you > should blow that away. This will be where it is (whether binary or source). You'll have a .ini file, a config pickle and the message and word count databases. It's probably safest to get rid of all of them (apart from the ini) if you really want to go from scratch. IMO, it is correct behaviour to leave these there, although to be really nice the (un)installer should *offer* to remove *all* created files. Certainly shouldn't be the default, though, since the user might be uninstalling in order to install a newer version, and would then want to keep the config, and definitely the databases. It's also useful if switching between source and binary, since they can share these files. 
=Tony Meyer From noreply at sourceforge.net Sun May 25 19:13:43 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 25 21:13:54 2003 Subject: [Spambayes] [ spambayes-Bugs-743387 ] Using Show Spam Clues Marks Message as Unread Message-ID: Bugs item #743387, was opened at 2003-05-26 10:13 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743387&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Paul C Coyle (pccoyle) Assigned to: Mark Hammond (mhammond) Summary: Using Show Spam Clues Marks Message as Unread Initial Comment: Using outlook 2000 and exchange 5.5. When I used the "Show Spam Clues" feature on an unread message in the Spam folder, the message status changed from Unread to read. My concern is that maybe and acknowledge was sent to the sender thereby showing my address as active? ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-26 11:13 Message: Logged In: YES user_id=14198 Dupe of [ 743384 ] Show Spam Clues Marks Message as Read ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743387&group_id=61702 From w1az at arrl.net Mon May 26 02:49:48 2003 From: w1az at arrl.net (Bernie Gardner) Date: Sun May 25 21:49:49 2003 Subject: [Spambayes] Problem with pop3proxy In-Reply-To: <16080.57484.886961.802578@montanaro.dyndns.org> References: <1053803470.4315.9.camel@linux.local> <16079.54746.974946.960554@montanaro.dyndns.org> <1053814925.4316.14.camel@linux.local> <16080.57484.886961.802578@montanaro.dyndns.org> Message-ID: <1053913511.4322.18.camel@linux.local> On Sun, 2003-05-25 at 11:26, Skip Montanaro wrote: > Bernie> I did start pop3proxy first, and in fact I can access the setup > Bernie> pages in my 
browser with no problem. I think it is Evolution > Bernie> which is refused by my local computer, not pop3proxy by the isp. > Bernie> The error message definitely comes from Evolution and comes up > Bernie> immediately after I press send/recieve. > > Can you tell that pop3proxy is listening to the port you think it should be > listening to (localhost:110 for example)? If you're not running as root, > make sure you selected a port > 1024. If you execute "netstat -an" you > should see something like "127.0.0.1:NNN" in the Local Address column with > "LISTEN" in the State column. You should also be able to telnet to port NNN > of your machine while pop3proxy is running. > > Also try a "netstat -an" without running pop3proxy. Perhaps your system > already has something listening to port NNN and you'll have to pick another > port. > > Skip Thanks for the hint. I checked with netstat and it turned out it was listening on 0.0.0.0:NNN instead of localhost. It works if I set Evolution to that address, but I'm trying to figure out why pop3proxy is using the wrong address. 
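A small sketch of what the netstat output above may be showing. It assumes (this is not confirmed anywhere in the thread) that the proxy binds its listener with an empty host string, which asks the OS for all interfaces and is reported as 0.0.0.0:NNN. The helper name listening_address is made up for illustration and is not part of pop3proxy.

```python
import socket

def listening_address(bind_host):
    """Bind a TCP listener on an OS-chosen port and return the
    local address the OS reports for it (what netstat would show)."""
    s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        s.bind((bind_host, 0))   # port 0: let the OS pick a free port
        s.listen(1)
        return s.getsockname()[0]
    finally:
        s.close()

# An empty host means "all interfaces"; a literal address restricts it.
print("bound with '':          ", listening_address(""))
print("bound with '127.0.0.1': ", listening_address("127.0.0.1"))
```

If that assumption holds, a listener on all interfaces would normally also accept connections made to 127.0.0.1, so the refusal seen with "localhost" may come from how that name resolves on the machine rather than from the proxy itself.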
Bernie -- Bernie Gardner From noreply at sourceforge.net Sun May 25 19:54:35 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun May 25 21:54:38 2003 Subject: [Spambayes] [ spambayes-Bugs-743408 ] Install fails Outlook 2000, WinXP Message-ID: Bugs item #743408, was opened at 2003-05-25 20:54 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tim Scott (kaleid) Assigned to: Nobody/Anonymous (nobody) Summary: Install fails Outlook 2000, WinXP Initial Comment: During install of version 002 for Outlook 2000 in Win Xp Pro, I get this error: --------------------------- Error --------------------------- C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation. --------------------------- Abort Retry Ignore --------------------------- There is a spambayes1.log file in my %TEMP%, but it is empty. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 From bkc at murkworks.com Sun May 25 23:14:24 2003 From: bkc at murkworks.com (Brad Clements) Date: Sun May 25 21:55:29 2003 Subject: [Spambayes] Re: Collecting word lists.. - BUMMER In-Reply-To: References: <3ED0EBA5.15635.1DF18942@localhost> Message-ID: <3ED1403F.7393.1F3C035E@localhost> On 25 May 2003 at 19:08, Tim Peters wrote: > It's unclear why you're mucking with globaldict. This is only the first pass at 'analysis'. I had thought that I would be saving data and making multiple passes. 
So, to save on RAM I felt converting unique sha hashes to an int would, in later passes (loading sets from pickles, etc) would use less memory. Anyway, I didn't get that far due to the lousy turnout. > > >>> import anydbm > > >>> import sha > > >>> db = anydbm.open("hammie.db") > > >>> f = open("key-hash", "w") > > SHA digests are binary data, so it's necessary to open the output file in > "wb" mode (and "w" mode is silently deadly on Windows). > Is the default 'b' on Linux? Can we get new contributions using "wb"? I've cleaned out the upload directory, you can use the same names. -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From tim_one at email.msn.com Sun May 25 23:33:36 2003 From: tim_one at email.msn.com (Tim Peters) Date: Sun May 25 22:34:12 2003 Subject: [Spambayes] Re: Collecting word lists.. - BUMMER In-Reply-To: <3ED1403F.7393.1F3C035E@localhost> Message-ID: [Brad] > This is only the first pass at 'analysis'. I had thought that I would > be saving data and making multiple passes. So, to save on RAM I felt > converting unique sha hashes to an int would, in later passes > (loading sets from pickles, etc) would use less memory. Ah, premature optimization <0.5 wink>. Get it working correctly first. I'll emphasize again that it's simply impossible for 7 databases not to have any words in common. For example, just from this sentence, the tokens "for", "just", "from", "this", "the" and "should" should be in everyone's database. It's even more so for spambayes databases, due to what should be universally present synthesized tokens like 'url:www' and 'proto:http' Those should be present in every database even if the source is entirely non-English, and a lot of synthesized header-line tokens should be present everywhere too. When you get beyond this hurdle, Neil Schemenauer's msg will be important too. 
>>> >>> f = open("key-hash", "w") >> SHA digests are binary data, so it's necessary to open the output >> file in "wb" mode (and "w" mode is silently deadly on Windows). > Is the default 'b' on Linux? No, but there's no difference between "w" and "wb" on Unix systems. There is on Windows and Macs, and more so on Windows. > Can we get new contributions using "wb"? > > I've cleaned out the upload directory, you can use the same names. Could you post the instructions again, please? Uploaders should be careful to ensure binary-mode transfers, too (ftp command "binary" before uploading). From tim_one at email.msn.com Sun May 25 23:47:19 2003 From: tim_one at email.msn.com (Tim Peters) Date: Sun May 25 22:48:17 2003 Subject: [Spambayes] New mailing lists In-Reply-To: <1a4f01c32319$841466f0$530f8490@eden> Message-ID: [Mark Hammond] > There seemed to be consensus that new mailing lists would be a good > idea, specifically spambayes-users, spambayes-dev and > spambayes-announce, with "spambayes" being an alias to either users > or dev. Make spambayes an alias for spambayes-users; developers should be savvy enough not to get confused by the hyphen <0.5 wink>. > I would love the see the "announce" one setup - simply so we can tell > people with problems to subscribe so they know when to try our next > big mess . > > How about we "just do it". I propose that -users, like -dev, be > unmoderated and not restricted to member posting, for the same > reasons discussed re this list. -announce should be restricted and > moderated, and I volunteer to be it. > > Any objections? If not, then I suggest we just ask Barry to wave his > wand. It's a holiday weekend on this side of the pond, and I don't think Barry is around. I'm addressing this directly to him too in the hopes that it will stick out more in the 10,000 pieces of email awaiting his return. I'm +1 on all you said, but lack the magical powers to make it so. 
From noreply at sourceforge.net Sun May 25 22:47:00 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 00:47:10 2003 Subject: [Spambayes] [ spambayes-Bugs-742440 ] Outlook 2003(beta) install fails Message-ID: Bugs item #742440, was opened at 2003-05-23 10:21 Message generated for change (Comment added) made by corts You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742440&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Kenneth Haynes (kh4yn3s) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2003(beta) install fails Initial Comment: I am running Windows XP, SP1. I have the Beta 2 version of Office that includes Outlook 2003.
When I tried to install using the windows installer, I got no error messages, but neither did I get the configuration buttons on the menu. spambayes1.log: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\ken\Application Data\SpamBayes\default_configuration.pck' Loaded bayes database from 'C:\Documents and Settings\ken\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\ken\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.68803ms SpamAddin - Disconnecting from Outlook spambayes2.log and spambayes3.log all look the same as spambayes1.log. ---------------------------------------------------------------------- Comment By: Cort Schaefer (corts) Date: 2003-05-25 22:47 Message: Logged In: YES user_id=541173 This looks exactly like what I am getting running the same configuration. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742440&group_id=61702 From noreply at sourceforge.net Sun May 25 22:51:38 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 00:51:44 2003 Subject: [Spambayes] [ spambayes-Bugs-743408 ] Install fails Outlook 2000, WinXP Message-ID: Bugs item #743408, was opened at 2003-05-26 01:54 Message generated for change (Comment added) made by mylene You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tim Scott (kaleid) Assigned to: Nobody/Anonymous (nobody) Summary: Install fails Outlook 2000, WinXP Initial Comment: During install of version 002 for Outlook 2000 in Win Xp Pro, I get this error: --------------------------- Error --------------------------- C:\Program Files\Spambayes Outlook 
Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation. --------------------------- Abort Retry Ignore --------------------------- There is a spambayes1.log file in my %TEMP%, but it is empty. ---------------------------------------------------------------------- Comment By: Mylene Reiners (mylene) Date: 2003-05-26 04:51 Message: Logged In: YES user_id=20281 This problem seems to occur when not only Spambayes is installed, but also Python (2.3b1 ?) Mylene ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 From tim_one at email.msn.com Mon May 26 02:07:26 2003 From: tim_one at email.msn.com (Tim Peters) Date: Mon May 26 01:10:14 2003 Subject: [Spambayes] FW: [Spambayes-checkins] spambayes/spambayes classifier.py, 1.5, 1.6 Message-ID: I've been using my relatively feeble laptop most of the weekend, and was curious about why my relatively small 1,000-msg dict database was consuming so much memory while Outlook was running (I don't have RAM to spare on this box). That lead to the following checkin, restoring some original code. I *think* old pickles should continue to load without complaint, and know that mine do. But if this creates a problem for someone, let me know. > Update of /cvsroot/spambayes/spambayes/spambayes > In directory sc8-pr-cvs1:/tmp/cvs-serv13387/spambayes > > Modified Files: > classifier.py > Log Message: > Somewhere along the way, WordInfo lost its __slots__ decoration. > Restoring it cuts the memory burden of my dict-based classifier by > more than a factor of 2. It should also speed it a bit. Also > added/restored comments about the purpose of this class, and removed > extraneous punctuation from the method implementations. 
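To illustrate the checkin described above, here is a minimal sketch of why __slots__ shrinks a small per-word record. The class and attribute names (WordInfoPlain, WordInfoSlots, spamcount, hamcount) are assumptions made for the example and are not copied from classifier.py; the sizes printed depend on the Python version and platform.

```python
import sys

class WordInfoPlain:
    """Per-word record with an ordinary per-instance __dict__."""
    def __init__(self):
        self.spamcount = 0
        self.hamcount = 0

class WordInfoSlots:
    """Same record; __slots__ suppresses the per-instance __dict__."""
    __slots__ = ("spamcount", "hamcount")
    def __init__(self):
        self.spamcount = 0
        self.hamcount = 0

def per_instance_bytes(obj):
    """Shallow size of the object plus its __dict__, if it has one."""
    size = sys.getsizeof(obj)
    if hasattr(obj, "__dict__"):
        size += sys.getsizeof(obj.__dict__)
    return size

plain = WordInfoPlain()
slotted = WordInfoSlots()
print("plain:  ", per_instance_bytes(plain), "bytes")
print("slotted:", per_instance_bytes(slotted), "bytes")
```

With one such record per token, the per-instance saving is multiplied by the size of the vocabulary, which is consistent with the reduction reported in the log message.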
From noreply at sourceforge.net Sun May 25 23:22:18 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 01:22:25 2003 Subject: [Spambayes] [ spambayes-Bugs-743408 ] Install fails Outlook 2000, WinXP Message-ID: Bugs item #743408, was opened at 2003-05-25 20:54 Message generated for change (Comment added) made by kaleid You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tim Scott (kaleid) Assigned to: Nobody/Anonymous (nobody) Summary: Install fails Outlook 2000, WinXP Initial Comment: During install of version 002 for Outlook 2000 in Win Xp Pro, I get this error: --------------------------- Error --------------------------- C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation. --------------------------- Abort Retry Ignore --------------------------- There is a spambayes1.log file in my %TEMP%, but it is empty. ---------------------------------------------------------------------- >Comment By: Tim Scott (kaleid) Date: 2003-05-26 00:22 Message: Logged In: YES user_id=20330 I tried re-installing (no other actions)....and it worked. Weird. ---------------------------------------------------------------------- Comment By: Mylene Reiners (mylene) Date: 2003-05-25 23:51 Message: Logged In: YES user_id=20281 This problem seems to occur when not only Spambayes is installed, but also Python (2.3b1 ?) 
Mylene ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 From noreply at sourceforge.net Mon May 26 00:18:57 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 02:19:05 2003 Subject: [Spambayes] [ spambayes-Bugs-737986 ] Message.as_string() fails. Message-ID: Bugs item #737986, was opened at 2003-05-15 13:09 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737986&group_id=61702 Category: None Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Nobody/Anonymous (nobody) Summary: Message.as_string() fails. Initial Comment: Reported on the list and discussed via email. Sadly, this one has me beaten; I really can't figure out why it's going wrong. --- I've attached two plain text files. One is from using evolution, the other mozilla. Mozilla tries to access two accounts. But it only accesses the second account after I've cancelled the session because it's hung from the first. It looks to me ( great email expert that I am ) that it does all the negotiations, dl's the first mesage and then just freezes. Note that in the evolution file, I had it get mail from an empty server. All went fine. --- When evol gets the mail, it says getting pop summary, then hangs waiting for the first message. 
at the terminal window where I ran pop3proxy I get: error: uncaptured python exception, closing channel <__main__.ServerLineReader connected at 0x8353a4c> (exceptions.TypeError:len() of unsized object [/usr/lib/python2.2/asyncore.py|poll|99] [/usr/lib/python2.2/asyncore.py|handle_read_event|396] [/usr/lib/python2.2/asynchat.py|handle_read|130] [/usr/bin/pop3proxy.py|found_terminator|147] [/usr/bin/pop3proxy.py|onServerLine|215] [/usr/bin/pop3proxy.py|onResponse|289] [/usr/bin/pop3proxy.py|onTransaction|390] [/usr/bin/pop3proxy.py|onRetr|459] [/usr/lib/python2.2/site- packages/spambayes/message.py|as_string|193] [/usr/lib/python2.2/site- packages/email/Message.py|as_string|109] [/usr/lib/python2.2/site- packages/email/Generator.py|flatten|102] [/usr/lib/python2.2/site- packages/email/Generator.py|_write|137] [/usr/lib/python2.2/site- packages/email/Generator.py|_write_headers|183] [/usr/lib/python2.2/site- packages/email/Header.py|encode|412] [/usr/lib/python2.2/site- packages/email/Header.py|_split|297] [/usr/lib/python2.2/site- packages/email/Charset.py|encoded_header_len|341]) --- ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-26 18:18 Message: Logged In: YES user_id=552329 Ha! Beaten no longer. I should have checked that I was using the same option set as he was. We were setting a header to a float, which caused the problem - but only if include_prob was True. Fixed in message.py v1.28 ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-15 13:10 Message: Logged In: YES user_id=552329 This is the second file. 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=737986&group_id=61702 From tim_one at email.msn.com Mon May 26 03:29:55 2003 From: tim_one at email.msn.com (Tim Peters) Date: Mon May 26 02:31:09 2003 Subject: [Spambayes] Speeding DBDictClassifier In-Reply-To: Message-ID: I haven't looked at this code before. It appears that DBDictClassifier.store() writes to the DB for every word in the wordinfo dict, whether or not the info associated with the word has changed; and doesn't clear the wordinfo dict at the end, so that the next time .store() is called it will write every word all over again, and store() becomes more expensive every time it's called. Maybe that's essential. Sketch of a different approach; the thrust is to change store() so that it only touches the database records that actually changed since the last time store() was called.

load():
    set new instance vars
        self.changed_words = {}
        self.deleted_words = {}

store():
    don't mutate wordinfo at all; don't iterate over wordinfo at all
    delete from the DB: only the words in self.deleted_words
    update in the DB: only the words in self.changed_words
    clear changed_words and deleted_words before returning

_wordinfoget():
    remove the comment about None (it's no longer special)
    after "ret = None", do
        if word in self.deleted_words:
            return ret

_wordinfoset():
    define this
        def _wordinfoset(self, word, record):
            self.wordinfo[word] = record
            if word in self.deleted_words:
                del self.deleted_words[word]
            self.changed_words[word] = 1

_wordinfodel():
    change to:
        def _wordinfodel(self, word):
            del self.wordinfo[word]
            if word in self.changed_words:
                del self.changed_words[word]
            self.deleted_words[word] = 1

From T.A.Meyer at massey.ac.nz Mon May 26 19:49:20 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 26 02:49:59 2003 Subject: [Spambayes] Outlook Express (notate to and notate subject) Message-ID:
<1ED4ECF91CDED24C8D012BCF2B034F1301C1F7CE@its-xchg4.massey.ac.nz> For those that don't read the check-ins list. Further to my earlier email - I've restored the functionality of the notate_to and notate_subject options, so Outlook Express'ers should once again be ok with the latest CVS. It may not work if a message has multiple To or Subject headers (and I'm not 100% sure what happens if it has "to" instead of "To"), but then neither did the old version, IIRC. If someone complains, we can fix it ;) So, for those who want to use OE (with pop3), you can with the latest CVS, but not with alpha2. Along the way I finally found a bug I tried to fix a while back, and fixed it, so if anyone found that the include_prob or include_score option wasn't working, it should now. =Tony Meyer From noreply at sourceforge.net Mon May 26 01:58:54 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 03:58:56 2003 Subject: [Spambayes] [ spambayes-Bugs-743515 ] Unable to expand folders in folder selection dialog. Message-ID: Bugs item #743515, was opened at 2003-05-26 00:58 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743515&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul Pedriana (ppedriana) Assigned to: Nobody/Anonymous (nobody) Summary: Unable to expand folders in folder selection dialog. Initial Comment: On my computer I do the following steps:
1 - Install the software with Outlook 2002. This succeeds.
2 - Call forth the Anti-Spam manager dialog.
3 - Click the "Train Now..." button. This calls forth the "Training" dialog.
4 - Click either of the two "Browse" buttons. This calls forth the "Select Folders..." dialog, which has a tree control with two top level nodes: "Public Folders" and "Personal Folders."
5 - Clicking on the "Public Folders" node expands it, as expected.
6 - Clicking on the "Personal Folders" node does nothing; it does not expand as expected. --------------------------------------- Looking at the log file, the following lines are present: Traceback (most recent call last): "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 384, in OnTreeItemExpanding "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 74, in _BuildFoldersMAPI pywintypes.com_error: (-2147221246, 'Invalid window handle', None, None) win32ui: Exception in OnNotify() handler The exception appears to happen in a call to FolderSpec() from the _BuildFoldersMAPI function. Here is a snippet of _BuildFoldersMAPI containing the call to FolderSpec: try: child_folder = manager.message_store.GetFolder(temp_id) except pythoncom.com_error: # Bad folder for some reason - ignore it. child_folder = None if child_folder is not None: spec = FolderSpec(child_folder.GetID(), name) # If we have no children at all, indicate # the item is not expandable. table = child_folder.OpenEntry().GetHierarchyTable(0) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743515&group_id=61702 From spambayes at rodland.no Mon May 26 11:12:49 2003 From: spambayes at rodland.no (Fredrik Rodland) Date: Mon May 26 04:13:41 2003 Subject: [Spambayes] RE: [Spambayes-checkins] spambayes CHANGELOG.txt, NONE, 1.1 WHAT_IS_NEW.txt, NONE, 1.1 In-Reply-To: Message-ID: > -----Original Message----- > From: spambayes-checkins-bounces@python.org > [mailto:spambayes-checkins-bounces@python.org]On Behalf Of Tony Meyer > Sent: 26. mai 2003 10:05 > To: spambayes-checkins@python.org > Subject: [Spambayes-checkins] spambayes CHANGELOG.txt, NONE,1.1 > WHAT_IS_NEW.txt, NONE, 1.1 > > > Feel free to critique/change the style, and fix any mistakes. ok: > Mark Hammond 03/02/03 Fix SF#677804. > Mark Hammond 04/02/03 Fix SF#642740. 
isn't it better to add a descriptive line - rather than just the bug-#? i.e. - replace: Fix SF#677804. with: Fix: Untouched filter command error (SF#677804) F -- Fredrik Rødland Technical Architect, Stocknet, Oslo, Norway Stocknet: http://www.stocknet.com phone: +47 23 28 40 17 Private: http://rodland.no phone: +47 99 21 98 17 From T.A.Meyer at massey.ac.nz Mon May 26 22:14:13 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 26 05:14:52 2003 Subject: [Spambayes] Outlook 2000 on XP, installed Python and killed add-in Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F7D7@its-xchg4.massey.ac.nz> > The only thing I can think of is that perhaps the version of > Python installed with the plugin is different than the > version you installed. The two Python versions might have a > difference in the byte code they generate and interpret. > After the installation, perhaps you got the new Python > interpreter trying to interpret old byte codes or vice versa. > What version of Python did you install? It was 2.3b1. Oddly, though, I think Mark used 2.3 (not b1, obviously) to build the plugin. Going by this, anyway: http://mail.python.org/pipermail/spambayes/2003-February/003611.html Sean did mention something about his group finding things better building the plugin with 2.2, though. It would be interesting to see if this is easily duplicable (i.e. install the plugin on a machine without Python, then install 2.3b1 and see if all is still good). I don't really want to uninstall Python here, though, especially since I'd have to uninstall 2.3b1, CVS, 2.2.2, and whatever version cygwin has, to be thorough. Maybe someone that doesn't do Python coding could give it a go? =Tony Meyer From T.A.Meyer at massey.ac.nz Mon May 26 22:16:48 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 26 05:17:27 2003 Subject: [Spambayes] Quick Question Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F7D8@its-xchg4.massey.ac.nz> [Tony] > If it is, then the answer is 'not yet'.
We would welcome > help in changing this, though. [Ben] > Yes, Outlook 11 is Outlook 2003. How can I help? Hmm. I might have spoken too soon. From what I've (since) read, the plugin will install, but has trouble with the COM/event interface. This is very tricky stuff that should probably be left to Sean/Mark. On the positive side, I think that Sean's group is actively working on this, so a solution might be at hand sooner rather than later (maybe even before Outlook 2003 is non-beta). =Tony Meyer From noreply at sourceforge.net Mon May 26 03:23:16 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 05:23:30 2003 Subject: [Spambayes] [ spambayes-Support Requests-742258 ] Shutdown/restart Message-ID: Support Requests item #742258, was opened at 2003-05-23 23:14 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498104&aid=742258&group_id=61702 Category: None Group: None Status: Open Priority: 5 Submitted By: Mike Ring (mikering) Assigned to: Nobody/Anonymous (nobody) Summary: Shutdown/restart Initial Comment: Whenever I logoff and restart Outlook, SpamBayes doesn't filter messages on startup. I have to retrain it and then filter the Inbox. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-26 21:23 Message: Logged In: YES user_id=552329 According to the log the databases are not being saved - in fact there isn't even the "exiting outlook" message. Are you exiting Outlook normally? (i.e. it's not being forced to quit or crashing?) Are you exiting Outlook and then logging off, or getting Windows to exit Outlook after a log off command? In any case, the next alpha release (later this week, hopefully) will fix this as training is saved after each operation. (Alpha2 should still work, however). 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498104&aid=742258&group_id=61702 From T.A.Meyer at massey.ac.nz Mon May 26 22:30:23 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 26 05:31:05 2003 Subject: [Spambayes] Suddenly a lot of errors Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F7D9@its-xchg4.massey.ac.nz> > To be honest, I don't understand the -d and -D switches to > the commandline. > I've never used them and maybe this is a reminder to the > Spambayes homepage webmasters to explain it a little bit for dummies. Basically if you use "-d", then you want to use a pickle to save the wordinfo database. If you use "-D" then you want to use something else (like bsddb or anydbm). In even simpler terms, it allows you to specify the way that the database is stored. I believe that the idea is that people don't have to use them - it defaults to "-D", and that should be fine for most people. (In fact, AFAIK, few people are still using a pickle, although TimP is with the Outlook plugin). > Spambayes seems to have a problem the second time you run it. [...] > But then (and this happened last time too) when I the second > time run it I get these error messages: [...] > (exceptions.AssertionError: [C:\Python22\lib\asyncore.py|poll|99] [...] > [spambayes\classifier.py|probability|301]) This is basically saying that the database is stuffed - there are words that have appeared in more emails that you have trained, which is impossible. The only real solution is to do a full retrain, although you can doctor the numbers by hand. It seems that there is still a bug *somewhere* that causes the database counts to die like this, but we haven't been able to track it down. If you can consistently reproduce this error, that would be a great help in doing so. 
If not, then at least things work for you ;) =Tony Meyer From T.A.Meyer at massey.ac.nz Mon May 26 22:31:27 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 26 05:32:05 2003 Subject: [Spambayes] Suddenly a lot of errors Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F7DA@its-xchg4.massey.ac.nz> > When I press the Save & Shutdown I get this error message: [...] > AttributeError: 'WordInfo' object has no attribute 'spamcount' Your next message seemed to indicate that you managed to fix this problem. Is that the case? If so, could you let me/the list know what it was, for future reference? =Tony Meyer From T.A.Meyer at massey.ac.nz Mon May 26 22:41:04 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 26 05:41:44 2003 Subject: Format of Spam Field in Plugin (was RE: [Spambayes] (no subject)) Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F7DB@its-xchg4.massey.ac.nz> > > Mark, did you change from number to percent since the binary release? > > Maybe my old folders are numbers, and new ones percents? > > I haven't touched that for ages, since, we thought, we got it > right (or more accurately, got it as close enough to how we > wanted that it actually seemed to work) I just played around with it and it's very strange behaviour. I'm willing to blame all of that on Outlook, and none on the plugin, though ;) (Using Outlook XP SP1) If I create a new folder, it defaults to having the spam field, as a number (one d.p.). If I remove that column, add a message, and filter the folder, I have a column with a (one d.p.) *percentage*. If I then go into the properties to try and change it to give more accuracy, it says that it is a number (and if I change it to 2 d.p., for example, it is then stuck as a number and I can't get it back to a percent). Go figure. If the hordes of Outlook users are reading these messages, what do they think? Is the current behaviour good enough? (It is for me). 
=Tony Meyer From mhammond at skippinet.com.au Mon May 26 22:09:01 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Mon May 26 07:09:58 2003 Subject: [Spambayes] RE: [Spambayes-checkins] spambayes CHANGELOG.txt, NONE, 1.1 WHAT_IS_NEW.txt, NONE, 1.1 In-Reply-To: Message-ID: <1b6c01c32377$38149820$530f8490@eden> > > Feel free to critique/change the style, and fix any mistakes. > > ok: > > > Mark Hammond 03/02/03 Fix SF#677804. > > Mark Hammond 04/02/03 Fix SF#642740. > > isn't it better to add a descriptive line - rather than just the bug-# > > i.e. - replace: > Fix SF#677804. > with: > Fix: Untouched filter command error (SF#677804) Yes, I would have to agree! In fact, I often (almost usually) do that now, and will strive for always :) Mark. From tim_one at email.msn.com Mon May 26 09:35:26 2003 From: tim_one at email.msn.com (Tim Peters) Date: Mon May 26 08:36:02 2003 Subject: [Spambayes] Speeding DBDictClassifier In-Reply-To: Message-ID: [TimP] > ... > Sketch of a different approach; the thrust is to change store() so > that it only touches the database records that actually changed since > the last time store() was called. If someone pursues this (afraid I can't), it occurs to me it would be better to introduce only one new dict, say self.changed_word_kind, mapping changed and deleted words to whether they were last changed or last deleted. Like mapping to 'C' for changed or 'D' for deleted. It would simplify the code I sketched; e.g., instead of

    def _wordinfodel(self, word):
        del self.wordinfo[word]
        if word in self.changed_words:
            del self.changed_words[word]
        self.deleted_words[word] = 1

it would be

    def _wordinfodel(self, word):
        del self.wordinfo[word]
        self.changed_word_kind[word] = 'D'

store() could then march over self.changed_word_kind.iteritems(), and use the 'C'/'D' values to update/delete_from the DB appropriately.
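The single-dict idea in the message above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual DBDictClassifier code: the class name ChangeTrackingStore and the method _wordinfoset are hypothetical, the database is stood in for by any dict-like mapping, and modern items() is used where the message mentions iteritems().

```python
class ChangeTrackingStore:
    """Toy stand-in for a DB-backed word table (hypothetical, for illustration)."""

    def __init__(self, db):
        self.db = db                  # any dict-like persistent mapping
        self.wordinfo = dict(db)      # in-memory working copy
        self.changed_word_kind = {}   # word -> 'C' (changed) or 'D' (deleted)

    def _wordinfoset(self, word, record):
        # Hypothetical setter: remember that this word was last changed.
        self.wordinfo[word] = record
        self.changed_word_kind[word] = 'C'

    def _wordinfodel(self, word):
        # As in the message: one dict records that the word was last deleted.
        del self.wordinfo[word]
        self.changed_word_kind[word] = 'D'

    def store(self):
        """Write only the records touched since the last store()."""
        for word, kind in self.changed_word_kind.items():
            if kind == 'C':
                self.db[word] = self.wordinfo[word]
            else:
                # 'D': the word may never have reached the DB, so tolerate that.
                self.db.pop(word, None)
        touched = len(self.changed_word_kind)
        self.changed_word_kind.clear()
        return touched


# Usage: one changed word, one deleted word, one untouched word.
db = {"cheap": (12, 0), "meeting": (0, 7), "lunch": (1, 9)}
store = ChangeTrackingStore(db)
store._wordinfoset("cheap", (13, 0))
store._wordinfodel("meeting")
touched = store.store()
print(touched, sorted(db))
```

Because a later change or deletion simply overwrites the earlier 'C' or 'D' entry for the same word, there is no need to keep two dicts in step, which appears to be the simplification the message is after.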
From bill at parducci.net Mon May 26 09:37:10 2003 From: bill at parducci.net (Bill Parducci) Date: Mon May 26 11:37:14 2003 Subject: [Spambayes] FAQ In-Reply-To: <3ED203BB.1030201@videotron.ca> References: <3ECE9B33.3010808@parducci.net> <3ED203BB.1030201@videotron.ca> Message-ID: <1071.65.138.137.207.1053963430.squirrel@mail.parducci.net> > I think it can be a good idea to put number In front of the text. i am not sure that i understand your request. the current format is: 1. Overview a. So what is Spambayes? b. What do I need to install Spambayes? are you suggesting: 1. Overview 1a. So what is Spambayes? 1b. What do I need to install Spambayes? ? > P.S. The link to the name tag are not working the links don't work on this page because my server doesn't handle .ht files like html files. they will work once moved to the project server. thanks for the feedback. b From mark at enynsp.org Mon May 26 13:08:59 2003 From: mark at enynsp.org (Mark S. Bergman) Date: Mon May 26 12:09:36 2003 Subject: [Spambayes] Install error Message-ID: Upon installation of SpamBayes into Outlook 2000 running on a fully updated Win2K, I get the following log error: Traceback (most recent call last): File "", line 15, in DllRegisterServer MemoryError Help! Mark S. Bergman 518.251.5930 From papaDoc at videotron.ca Mon May 26 13:19:53 2003 From: papaDoc at videotron.ca (papaDoc) Date: Mon May 26 12:21:03 2003 Subject: [Spambayes] FAQ In-Reply-To: <1070.65.138.137.207.1053963406.squirrel@mail.parducci.net> References: <3ECE9B33.3010808@parducci.net> <3ED203BB.1030201@videotron.ca> <1070.65.138.137.207.1053963406.squirrel@mail.parducci.net> Message-ID: <3ED23EA9.60809@videotron.ca> Hi Bill, >I am not sure that i understand your request. the current format is: > 1. Overview > a. So what is Spambayes? > b. What do I need to install Spambayes? > >are you suggesting: > 1. Overview > 1a. So what is Spambayes? > 1b. What do I need to install Spambayes? > No, This is not what I tried to explain.....
In the text of the FAQ, You have in bold the "title of the item" I suggest to add to this the number. Ex 1. Overview a. So what is Spambayes? b. What do I need to install Spambayes? 2. Meat a. What is ham b. What is spam. 3. etc 1 Overview Some text here 1.a- So what is Spambayes? Again some text here. ..... ..... 1.b- What do I need to install Spambayes? .... ..... Usually when I look at a FAQ, I take note of the questions I want to look at (If there is many question) So when I'm looking at an answer of the FAQ, I know where to go for the next one if there is number in front. For Example I'm at answer 23.c- and the next I want to look is 107.d- I can scroll rapidly to 100 If I stop at the question 121 by mistake I know I have to scroll up since there is the number in front. >the links don't work on this page because my server doesn't handle .ht >files like html files. they will work once moved to the project server. > I think the problem what that you where including a # in front of the name tag instead of . The needs the # >thanks for the feedback. > No problem Remi From seandarcy at hotmail.com Mon May 26 13:25:20 2003 From: seandarcy at hotmail.com (sean darcy) Date: Mon May 26 12:25:54 2003 Subject: [Spambayes] web interface: 500 Server error when training Message-ID: Using the web interface for training sb. Choose ham and spam from the list, click Train. Get this error message: Training... Trained on 84 messages. Saving... Done. 
500 Server error
Traceback (most recent call last):
  File "/usr/lib/python2.2/site-packages/spambayes/Dibbler.py", line 398, in found_terminator
    getattr(plugin, name)(**params)
  File "/usr/lib/python2.2/site-packages/spambayes/ProxyUI.py", line 361, in onReview
    message = spambayes.mboxutils.get_message(cachedMessage.getSubstance())
  File "/usr/lib/python2.2/site-packages/spambayes/Corpus.py", line 349, in getSubstance
    return self.hdrtxt + self.payload
  File "/usr/lib/python2.2/site-packages/spambayes/Corpus.py", line 296, in __getattr__
    raise AttributeError, attributeName
AttributeError: hdrtxt

From rich at richlyon.net Mon May 26 10:53:33 2003 From: rich at richlyon.net (Richard Lyon) Date: Mon May 26 12:45:41 2003 Subject: [Spambayes] BUG: spambayes crashes when filtering moves mail Message-ID: <22918219.1053968013938.JavaMail.www-data@four.oddpost.com> Hi, I'm running the binary version of spambayes on Outlook 2000 SR-1/Win2000 Professional SP2 Build 2195. Outlook is configured in Exchange mode. My computer virus scanner is labelled "Vshield" - it is a corporate laptop and I can't disable it. Spambayes is installed and classifying OK. But if I configure the filterer to do anything other than leave files untouched, Outlook crashes on receipt of the first message after enabling the filter. I can't tell whether the messages that cause the crash are ham (and therefore spambayes is crashing on anything) or whether it is a spam and it is something to do with the transfer. There is nothing untoward in the log files. Last entry before the last crash is "Message 'Its here! The weight loss patch! vrhd' had a Spam classification of 'Yes'" but I don't know if that refers to the message that was arriving when it crashed or not. I've removed all of my inbox assistant rules. My security setting is "Medium-Low".
Help would be appreciated as I'm drowning in spam and this looks great. Regards Richard Lyon _____________________ mail:rich@richlyon.net web:www.richlyon.net/ From noreply at sourceforge.net Mon May 26 11:16:24 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 13:19:57 2003 Subject: [Spambayes] [ spambayes-Bugs-743408 ] Install fails Outlook 2000, WinXP Message-ID: Bugs item #743408, was opened at 2003-05-25 21:54 Message generated for change (Comment added) made by rdesantos You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tim Scott (kaleid) Assigned to: Nobody/Anonymous (nobody) Summary: Install fails Outlook 2000, WinXP Initial Comment: During install of version 002 for Outlook 2000 in Win Xp Pro, I get this error: --------------------------- Error --------------------------- C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation. --------------------------- Abort Retry Ignore --------------------------- There is a spambayes1.log file in my %TEMP%, but it is empty. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-26 13:16 Message: Logged In: YES user_id=782486 This is the same as my submitted item #740198. -Rob (rdesantos) ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-26 01:22 Message: Logged In: YES user_id=20330 I tried re-installing (no other actions)....and it worked. Weird. 
---------------------------------------------------------------------- Comment By: Mylene Reiners (mylene) Date: 2003-05-26 00:51 Message: Logged In: YES user_id=20281 This problem seems to occur when not only Spambayes is installed, but also Python (2.3b1 ?) Mylene ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 From noreply at sourceforge.net Mon May 26 11:27:56 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 13:28:02 2003 Subject: [Spambayes] [ spambayes-Bugs-743791 ] BUG: spambayes crashes when filtering moves mail Message-ID: Bugs item #743791, was opened at 2003-05-26 17:27 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743791&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Richard Lyon (lyonrj) Assigned to: Mark Hammond (mhammond) Summary: BUG: spambayes crashes when filtering moves mail Initial Comment: Hi, I'm running the binary version of spambayes on Outlook 2000 SR-1/Win2000 Professional SP2 Build 2195. Outlook is configured in Exchange mode. My computer virus scanner is labelled "Vshield" - it is a corporate laptop and I can't disable it. Spambayes is installed and classifying OK. But if I configure the filterer to do anything other than leave files untouched, Outlook crashes on receipt of the first message after enabling the filter. I can't tell whether the messages that cause the crash are ham (and therefore spambayes is crashing on anything) or whether it is a spam and it is something to do with the transfer. There is nothing untoward in the log files. Last entry before the last crash is "Message 'Its here! The weight loss patch! 
vrhd' had a Spam classification of 'Yes'" but I don't know if that refers to the message that was arriving when it crashed or not. I've removed all of my inbox assistant rules. My security setting is "Medium-Low". Help would be appreciated as I'm drowning in spam and this looks great. Regards, Richard Lyon ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743791&group_id=61702 From noreply at sourceforge.net Mon May 26 11:30:11 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 13:30:21 2003 Subject: [Spambayes] [ spambayes-Bugs-743408 ] Install fails Outlook 2000, WinXP Message-ID: Bugs item #743408, was opened at 2003-05-25 20:54 Message generated for change (Comment added) made by kaleid You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tim Scott (kaleid) Assigned to: Nobody/Anonymous (nobody) Summary: Install fails Outlook 2000, WinXP Initial Comment: During install of version 002 for Outlook 2000 in Win Xp Pro, I get this error: --------------------------- Error --------------------------- C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation. --------------------------- Abort Retry Ignore --------------------------- There is a spambayes1.log file in my %TEMP%, but it is empty. ---------------------------------------------------------------------- >Comment By: Tim Scott (kaleid) Date: 2003-05-26 12:30 Message: Logged In: YES user_id=20330 I saw item #740198, but the symptoms reported were different than this one, so I didn't consider this a dupe. Causes may be the same, however. 
---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-26 12:16 Message: Logged In: YES user_id=782486 This is the same as my submitted item #740198. -Rob (rdesantos) ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-26 00:22 Message: Logged In: YES user_id=20330 I tried re-installing (no other actions)....and it worked. Weird. ---------------------------------------------------------------------- Comment By: Mylene Reiners (mylene) Date: 2003-05-25 23:51 Message: Logged In: YES user_id=20281 This problem seems to occur when not only Spambayes is installed, but also Python (2.3b1 ?) Mylene ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 From noreply at sourceforge.net Mon May 26 11:37:33 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 13:37:39 2003 Subject: [Spambayes] [ spambayes-Bugs-743408 ] Install fails Outlook 2000, WinXP Message-ID: Bugs item #743408, was opened at 2003-05-25 21:54 Message generated for change (Comment added) made by rdesantos You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tim Scott (kaleid) Assigned to: Nobody/Anonymous (nobody) Summary: Install fails Outlook 2000, WinXP Initial Comment: During install of version 002 for Outlook 2000 in Win Xp Pro, I get this error: --------------------------- Error --------------------------- C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation. 
--------------------------- Abort Retry Ignore --------------------------- There is a spambayes1.log file in my %TEMP%, but it is empty. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-26 13:37 Message: Logged In: YES user_id=782486 My fault I suppose for failing to detail it but I saw this same error message when my problem occurred. Also see posting #741914. I suspect all three bug reports are the same regsvr error. -Rob ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-26 13:30 Message: Logged In: YES user_id=20330 I saw item #740198, but the symptoms reported were different than this one, so I didn't consider this a dupe. Causes may be the same, however. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-26 13:16 Message: Logged In: YES user_id=782486 This is the same as my submitted item #740198. -Rob (rdesantos) ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-26 01:22 Message: Logged In: YES user_id=20330 I tried re-installing (no other actions)....and it worked. Weird. ---------------------------------------------------------------------- Comment By: Mylene Reiners (mylene) Date: 2003-05-26 00:51 Message: Logged In: YES user_id=20281 This problem seems to occur when not only Spambayes is installed, but also Python (2.3b1 ?) Mylene ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 From skip at pobox.com Mon May 26 14:50:22 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon May 26 14:50:22 2003 Subject: [Spambayes] Re: Collecting word lists.. 
- BUMMER In-Reply-To: <3ED1403F.7393.1F3C035E@localhost> References: <3ED0EBA5.15635.1DF18942@localhost> <3ED1403F.7393.1F3C035E@localhost> Message-ID: <16082.25070.219095.680606@montanaro.dyndns.org> Brad> I've cleaned out the upload directory, you can use the same names. I see you anticipated that we would have bookmarked it. For those of us who didn't, can you repeat the ftp server info? BTW, on my Mac (OS X), "w" and "wb" yield the same result. Skip From skip at pobox.com Mon May 26 14:54:19 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon May 26 14:54:24 2003 Subject: [Spambayes] Re: Collecting word lists.. - BUMMER In-Reply-To: References: <3ED1403F.7393.1F3C035E@localhost> Message-ID: <16082.25307.692172.854125@montanaro.dyndns.org> >> Is the default 'b' on Linux? Tim> No, but there's no difference between "w" and "wb" on Unix systems. Tim> There is on Windows and Macs, and more so on Windows. I tried both "w" and "wb" on my Mac OS X system and got precisely the same output. Perhaps the "and Macs" part only refers to versions < OS X. Skip From dadro at clickcapecod.com Mon May 26 16:13:38 2003 From: dadro at clickcapecod.com (Nick Maloney) Date: Mon May 26 15:05:55 2003 Subject: [Spambayes] Love your product! Message-ID: Hello, I just wanted to express my appreciation for SpamBayes. I just configured it and trained it and cannot believe how well it works even based on the few spam I had left in my deleted items box. Great Job! Nick Maloney From bkc at murkworks.com Mon May 26 16:25:22 2003 From: bkc at murkworks.com (Brad Clements) Date: Mon May 26 15:06:25 2003 Subject: [Spambayes] Re: Collecting word lists.. - BUMMER In-Reply-To: <16082.25070.219095.680606@montanaro.dyndns.org> References: <3ED1403F.7393.1F3C035E@localhost> Message-ID: <3ED231E1.14460.22EBDE72@localhost> On 26 May 2003 at 13:50, Skip Montanaro wrote: > Brad> I've cleaned out the upload directory, you can use the same > names. 
> > I see you anticipated that we would have bookmarked it. For those of us > who didn't, can you repeat the ftp server info? > > BTW, on my Mac (OS X), "w" and "wb" yield the same result. > > Skip The path is ftp://ftp.murkworks.com/uploads/spambayes Reminder, this is a drop only box. After upload you won't see your file there, and you cannot overwrite files. If you get a conflict, try a different name. -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From skip at pobox.com Mon May 26 15:13:42 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon May 26 15:21:43 2003 Subject: [Spambayes] FAQ In-Reply-To: <1071.65.138.137.207.1053963430.squirrel@mail.parducci.net> References: <3ECE9B33.3010808@parducci.net> <3ED203BB.1030201@videotron.ca> <1071.65.138.137.207.1053963430.squirrel@mail.parducci.net> Message-ID: <16082.26470.784633.295713@montanaro.dyndns.org> >> I thinl it can be a good idea to put number In front of the text. Bill> are you suggesting: Bill> 1. Overview Bill> 1a. So what is Spambayes? Bill> 1b. What do I need to install Spambayes? I don't know if the
<ol> tag can do that. >> P.S. The link to the name tag are not working I fixed this at one point, but it looks like a recent update I made to the faq undid a lot of stuff (like wrapping to 80 columns using tidy and fixing the name attributes). I'll do another checkin a bit later today. Folks, before you check in a file, make sure you've done a "cvs up" very recently, otherwise you may unwittingly obliterate recent updates. Skip From vanhorn at whidbey.com Mon May 26 13:29:27 2003 From: vanhorn at whidbey.com (G. Armour Van Horn) Date: Mon May 26 15:29:32 2003 Subject: [Spambayes] Compatibility matrix References: <119301c320ba$ae52e560$530f8490@eden> Message-ID: <3ED26B17.61C4BE82@whidbey.com> Mark Hammond wrote: > I would drop the "XP Home" and "XP Professional" distinction - I seriously > doubt that would have any impact, and we should have "2000 Pro/Server" > distinctions if we make that one. There are specific network distinctions between XP Home and XP Pro, while there are no such differences between XP Pro and XP Server, as there were none between Win2K Pro and Win2K Server. I have seen, but cannot now find, a Microsoft page that delineated the differences between Home and the rest of the line. I don't remember details, I do remember it led me to avoid any use of XP Home. In keeping with SpamBayes tradition, it should be testing that tells us there is no distinction of value. I suggest leaving it in there until collected data tell us we don't need two columns. Obviously, if the columns are collapsed into one we can't readily split it back out later. Van -- ---------------------------------------------------------- Sign up now for Quotes of the Day, a handful of quotations on a theme delivered every morning. Enlightenment! Daily, for free!
mailto:twisted@whidbey.com?subject=Subscribe_QOTD For web hosting and maintenance, visit Van's home page: http://www.domainvanhorn.com/van/ ---------------------------------------------------------- From skip at pobox.com Mon May 26 15:29:07 2003 From: skip at pobox.com (Skip Montanaro) Date: Mon May 26 15:30:12 2003 Subject: [Spambayes] FAQ In-Reply-To: <3ED23EA9.60809@videotron.ca> References: <3ECE9B33.3010808@parducci.net> <3ED203BB.1030201@videotron.ca> <1070.65.138.137.207.1053963406.squirrel@mail.parducci.net> <3ED23EA9.60809@videotron.ca> Message-ID: <16082.27395.738037.189951@montanaro.dyndns.org> >> are you suggesting: >> 1. Overview >> 1a. So what is Spambayes? >> 1b. What do I need to install Spambayes? >> papaDoc> No, This is not what I tried to explain..... papaDoc> In the text of the FAQ, You have in bold the "title of the papaDoc> item" I suggest to add to this the number. I will give it a try, however, what we will wind up with is a table of contents which is automatically numbered and a body which is manually numbered. If people start reordering things or adding new questions without adding numbers it may be better to simply return to the current situation. >> the links don't work on this page because my server doesn't handle >> .ht files like html files. they will work once moved to the project >> server. papaDoc> I think the problem what that you where including a # in front papaDoc> of the name tag instead of name="whatisit">. The needs the # Yes, I corrected these a few days ago, but they got wiped out by a more recent "cvs commit". I just corrected them again. Skip From tim.one at comcast.net Mon May 26 17:24:57 2003 From: tim.one at comcast.net (Tim Peters) Date: Mon May 26 16:25:29 2003 Subject: [Spambayes] Re: Collecting word lists.. - BUMMER In-Reply-To: <16082.25307.692172.854125@montanaro.dyndns.org> Message-ID: [Skip Montanaro] > I tried both "w" and "wb" on my Mac OS X system and got precisely the > same output. 
Perhaps the "and Macs" part only refers to versions < > OS X. To those versions of Mac OS that use \r as the native line end. Part of text-mode writing is converting \n to \r then (and part of text-mode reading is converting \r to \n; if you don't have any ord('\n') bytes in your binary data when writing in text mode on such a Mac, then it won't make any difference; if you do have any ord('\n') bytes in your binary data when writing in text mode on such a Mac, then all such bytes will be corrupted in the output). From mail at peterbe.com Tue May 27 00:23:38 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Mon May 26 18:24:47 2003 Subject: [Spambayes] Suddenly a lot of errors In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F7DA@its-xchg4.massey. ac.nz> Message-ID: <5.2.0.9.0.20030526231807.0221f1f0@localhost> At 21:31 2003-05-26 +1200, Meyer, Tony wrote: > > When I press the Save & Shutdown I get this error message: >[...] > > AttributeError: 'WordInfo' object has no attribute 'spamcount' > >Your next message seemed to indicate that you managed to fix this >problem. Is that the case? If so, could you let me/the list know what >it was, for future reference? Well, the problem is that even though I've trained thousands of messages it says that I have trained 0/0 spam/ham. My guess is that the training works, but when it after tries to count em something goes wrong. Is there a way to force a recount? Or some other equivalent cleanup function? 
>=Tony Meyer From noreply at sourceforge.net Mon May 26 16:48:34 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 18:49:01 2003 Subject: [Spambayes] [ spambayes-Bugs-743791 ] BUG: spambayes crashes when filtering moves mail Message-ID: Bugs item #743791, was opened at 2003-05-27 03:27 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743791&group_id=61702 Category: Outlook Group: None >Status: Pending Resolution: None Priority: 5 Submitted By: Richard Lyon (lyonrj) Assigned to: Mark Hammond (mhammond) Summary: BUG: spambayes crashes when filtering moves mail Initial Comment: Hi, I'm running the binary version of spambayes on Outlook 2000 SR-1/Win2000 Professional SP2 Build 2195. Outlook is configured in Exchange mode. My computer virus scanner is labelled "Vshield" - it is a corporate laptop and I can't disable it. Spambayes is installed and classifying OK. But if I configure the filterer to do anything other than leave files untouched, Outlook crashes on receipt of the first message after enabling the filter. I can't tell whether the messages that cause the crash are ham (and therefore spambayes is crashing on anything) or whether it is a spam and it is something to do with the transfer. There is nothing untoward in the log files. Last entry before the last crash is "Message 'Its here! The weight loss patch! vrhd' had a Spam classification of 'Yes'" but I don't know if that refers to the message that was arriving when it crashed or not. I've removed all of my inbox assistant rules. My security setting is "Medium-Low". Help would be appreciated as I'm drowning in spam and this looks great. Regards, Richard Lyon ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-05-27 08:48 Message: Logged In: YES user_id=14198 I'm afraid I am going to have to blame vshield. 
Is there any way you can get your IT department to disable it just long enough for you to verify that it is or isn't the problem? Certainly this kind of report is not common. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743791&group_id=61702 From mhammond at skippinet.com.au Tue May 27 09:54:10 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Mon May 26 18:55:04 2003 Subject: [Spambayes] Version information Message-ID: <1cf401c323d9$b963d9b0$530f8490@eden> I'm wondering how we can store version information that applications can read to give some idea of what is running. I propose a single file, spambayes\Version.py. This would be a file in the spirit of Options.py - ie, an "ini" file embedded in a Python script. The embedded section would look like: """ # Version information for spambayes [Default] # Non app specific - changed when "spambayes\*" changes significantly Version=0.2 # Simple major.minor Description=SpamBayes Beta1 Date=May 2003 Full Description=%(Description)s, version %(Version)s (%(Date)s) # Apps can define their own sections [Outlook] Binary=003 Description=SpamBayes Outlook Plugin [pop3proxy] # Whatever makes sense, etc """ And there would be one or two function to extract a useful version string. Other options include a file per directory, or a simple file with a single line, or no centralised verison information at all. I'm not bothered, but this seems the best compromise to me. Binaries etc that are clever can use this info at build time, while CVS versions of the apps could do whatever they like. Thoughts? Mark. From noreply at sourceforge.net Mon May 26 16:56:06 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon May 26 18:56:08 2003 Subject: [Spambayes] [ spambayes-Bugs-743515 ] Unable to expand folders in folder selection dialog. 
Message-ID: Bugs item #743515, was opened at 2003-05-26 17:58 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743515&group_id=61702 >Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul Pedriana (ppedriana) >Assigned to: Mark Hammond (mhammond) Summary: Unable to expand folders in folder selection dialog. Initial Comment: On my computer I do the following steps: 1 - Install the software with Outlook 2002. This succeeds 2 - Call forth the Anti-Spam manager dialog. 3 - Click the "Train Now..." button. This calls forth the "Training" dialog. 4 - Click either of the two "Browse" buttons. This calls forth the "Select Folders..." dialog, which has a tree control with two top level nodes: "Public Folders" and "Personal Folders." 5 - Clicking on the "Public Folders" node expands it, as expected. 6 - Clicking on the "Personal Folders" node does nothing; it does not expand as expected. --------------------------------------- Looking at the log file, the following lines are present: Traceback (most recent call last): "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 384, in OnTreeItemExpanding "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 74, in _BuildFoldersMAPI pywintypes.com_error: (-2147221246, 'Invalid window handle', None, None) win32ui: Exception in OnNotify() handler The exception appears to happen in a call to FolderSpec() from the _BuildFoldersMAPI function. Here is a snippet of _BuildFoldersMAPI containing the call to FolderSpec: try: child_folder = manager.message_store.GetFolder(temp_id) except pythoncom.com_error: # Bad folder for some reason - ignore it. child_folder = None if child_folder is not None: spec = FolderSpec(child_folder.GetID(), name) # If we have no children at all, indicate # the item is not expandable. 
table = child_folder.OpenEntry().GetHierarchyTable(0) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743515&group_id=61702 From mail at peterbe.com Tue May 27 00:28:07 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Mon May 26 19:14:47 2003 Subject: [Spambayes] Suddenly a lot of errors In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F7D9@its-xchg4.massey. ac.nz> Message-ID: <5.2.0.9.0.20030526232341.02219178@localhost> At 21:30 2003-05-26 +1200, Meyer, Tony wrote: > > To be honest, I don't understand the -d and -D switches to > > the commandline. > > I've never used them and maybe this is a reminder to the > > Spambayes homepage webmasters to explain it a little bit for dummies. > >Basically if you use "-d", then you want to use a pickle to save the >wordinfo database. If you use "-D" then you want to use something else >(like bsddb or anydbm). In even simpler terms, it allows you to specify >the way that the database is stored. I believe that the idea is that >people don't have to use them - it defaults to "-D", and that should be >fine for most people. (In fact, AFAIK, few people are still using a >pickle, although TimP is with the Outlook plugin). That's good that those two options exist. But which one should I use (assuming I'm not a senior unix server admin)? > > Spambayes seems to have a problem the second time you run it. >[...] > > But then (and this happened last time too) when I the second > > time run it I get these error messages: >[...] > > (exceptions.AssertionError: [C:\Python22\lib\asyncore.py|poll|99] >[...] > > [spambayes\classifier.py|probability|301]) > >This is basically saying that the database is stuffed - there are words >that have appeared in more emails that you have trained, which is >impossible. The only real solution is to do a full retrain, although >you can doctor the numbers by hand. 
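The consistency condition described in the quoted text (no word can have appeared in more messages than were trained) can be sketched as below. This is only an illustration: it assumes the word database can be viewed as a plain dict mapping each word to a (spamcount, hamcount) pair, and the function name and data layout are hypothetical, not the SpamBayes storage API.

```python
def find_inconsistent_words(wordinfo, nspam, nham):
    """Return the words whose counts exceed the number of trained messages.

    wordinfo is assumed (for this sketch only) to map
    word -> (spamcount, hamcount); nspam and nham are the totals of
    trained spam and ham messages.
    """
    bad = []
    for word, (spamcount, hamcount) in wordinfo.items():
        # A word cannot have been seen in more spam (or ham) messages
        # than were ever trained as spam (or ham).
        if spamcount > nspam or hamcount > nham:
            bad.append(word)
    return sorted(bad)


if __name__ == "__main__":
    sample = {"viagra": (5, 0), "python": (0, 3), "free": (12, 1)}
    print(find_inconsistent_words(sample, nspam=10, nham=3))
```

With totals of 0/0, as in the "trained 0/0 spam/ham" report, every word with a non-zero count would be flagged, which would be consistent with the assertion failing on the first lookup.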
I sent a separate email about this. Basically, can I just retrain the _pop3proxyham.mbox and _pop3proxyspam.mbox files? Is the count that important? My guess would be that the count is more for the [statistical] interface. >It seems that there is still a bug *somewhere* that causes the database >counts to die like this, but we haven't been able to track it down. If >you can consistently reproduce this error, that would be a great help in >doing so. If not, then at least things work for you ;) If it happens again (it works now) I'll "send" you my whole spambayes folder minus the big mbox and db files. Or do you need them too? What other info do you need? Peter From mail at peterbe.com Tue May 27 00:42:39 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Mon May 26 19:14:58 2003 Subject: [Spambayes] Impressed by SAProxy Message-ID: <5.2.0.9.0.20030526233614.0221f330@localhost> Installed SAproxy (http://saproxy.bloomba.com/) today and was quite impressed by their little windows gui. You download, doubleclick and all of a sudden you have a little icon in the taskbar which starts when windows starts. You can right-click it and select "Configure", "View manual" or "Exit". It didn't work very well (often timeouts) but I thought it could be a nice completement to spambayes since SAproxy uses SpamAssassin which is more concrete and different from Spambayes. That's not my point. My point is that the gui was very usable and simple. I would love to contribute with some code but my wxPython (or Tkinter) skills are very poor. From T.A.Meyer at massey.ac.nz Tue May 27 12:38:18 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 26 19:39:01 2003 Subject: [Spambayes] Impressed by SAProxy Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F931@its-xchg4.massey.ac.nz> > Installed SAproxy (http://saproxy.bloomba.com/) today and was quite > impressed by their little windows gui. 
> You download, doubleclick and all of a sudden you have a > little icon in the taskbar which starts when windows starts. > You can right-click it and select "Configure", "View manual" > or "Exit". I think a little applet like this would be especially good if it could do a little more than this. For example, if the mail app was in the foreground, it would offer "train as spam" and "train as ham" options, and be able to take the appropriate action. Rather a difficult thing to program, though :) For the moment, (to me) it doesn't seem worth creating an applet like this. Clicking "configure" is no different than launching a shortcut to http://localhost:8880/config. We're a bit lacking on the manual side, but at some point the web interface (for pop3proxy, imapfilter et al) will have a link to documentation. When that happens clicking "View manual" will be the same as launching a shortcut to http://localhost:8880/help (or whatever the link will be). On Windows it's simple enough to add those to the Quick Launch bar, for example. Would you agree with this? Or am I missing something that makes the applet better? (I don't have SpamAssassin, so I presume SAProxy wouldn't work for me). =Tony Meyer From T.A.Meyer at massey.ac.nz Tue May 27 12:43:56 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 26 19:44:39 2003 Subject: [Spambayes] Version information Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F93B@its-xchg4.massey.ac.nz> > I propose a single file, spambayes\Version.py. +1 > This would be > a file in the spirit of Options.py - ie, an "ini" file embedded > in a Python script. The embedded section would look like: Well, the old Options.py :) > Other options include a file per directory, +0 > or a simple file with a single line, -1 > or no centralised version information at all. -1 > Thoughts? If this gets general approval, I'm happy to throw together a first version :) of this if you'd rather spend time on the plugin/binary issues. 
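For illustration, a minimal sketch of what reading such an embedded "ini" section might look like. It assumes Python 3's configparser (the 2003-era module was named ConfigParser), and the helper name get_version_string is hypothetical. The ini text is adapted from the proposal quoted above with the comments dropped; note that "[Default]" is treated here as an ordinary section, not configparser's special DEFAULT section.

```python
from configparser import ConfigParser

# Adapted from the proposed Version.py contents; comments omitted.
VERSION_INI = """
[Default]
Version=0.2
Description=SpamBayes Beta1
Date=May 2003
Full Description=%(Description)s, version %(Version)s (%(Date)s)

[Outlook]
Binary=003
Description=SpamBayes Outlook Plugin
"""


def get_version_string(app=None):
    """Build the full description, letting an app section override defaults.

    Hypothetical helper: values from the [app] section (if it exists)
    take priority over [Default] when the template is interpolated.
    """
    parser = ConfigParser()
    parser.read_string(VERSION_INI)
    overrides = {}
    if app is not None and parser.has_section(app):
        # raw=True returns the stored values without interpolating them.
        overrides = dict(parser.items(app, raw=True))
    return parser.get("Default", "Full Description", vars=overrides)


if __name__ == "__main__":
    print(get_version_string())
    print(get_version_string("Outlook"))
```

Passing the app section through the vars argument keeps the single "Full Description" template in one place, which seems to match the intent of letting apps define their own sections.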
It shouldn't take much time anyway since ConfigParser can do all the real work. =Tony Meyer From mhammond at skippinet.com.au Tue May 27 10:45:00 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Mon May 26 19:45:53 2003 Subject: [Spambayes] Impressed by SAProxy In-Reply-To: <5.2.0.9.0.20030526233614.0221f330@localhost> Message-ID: <1d0a01c323e0$d3313610$530f8490@eden> > I would love to contribute with some code but my wxPython (or > Tkinter) > skills are very poor. Grab win32all, and check out win32/Demos/win32gui_taskbar.py - this has a fairly simple skeleton for such an app. Mark. From T.A.Meyer at massey.ac.nz Tue May 27 12:49:42 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 26 19:50:27 2003 Subject: [Spambayes] FAQ Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F941@its-xchg4.massey.ac.nz> > I fixed this at one point, but it looks like a recent update > I made to the faq undid a lot of stuff (like wrapping to 80 > columns using tidy and fixing the name attributes). I'll do > another checkin a bit later today. > > Folks, before you check in a file, make sure you've done a > "cvs up" very recently, otherwise you may unwittingly > obliterate recent updates. My apologies; this was me. I did try and cvs up, but the faq that I grabbed from bill's post was so different to the one that was there it just ended up full of conflicts. I looked at it by hand and it didn't seem to be missing anything except for the messiness (I missed the name attributes). I should have done this more carefully, sorry. (I tried to use tidy to made it, um, tidy, but I couldn't get it to work.) =Tony Meyer From mail at peterbe.com Tue May 27 02:32:05 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Mon May 26 20:54:47 2003 Subject: [Spambayes] Impressed by SAProxy In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F931@its-xchg4.massey. 
ac.nz> Message-ID: <5.2.0.9.0.20030527013031.021ebe20@localhost> At 11:38 2003-05-27 +1200, Meyer, Tony wrote: > > Installed SAproxy (http://saproxy.bloomba.com/) today and was quite > > impressed by their little windows gui. > > You download, doubleclick and all of a sudden you have a > > little icon in the taskbar which starts when windows starts. > > You can right-click it and select "Configure", "View manual" > > or "Exit". > >I think a little applet like this would be especially good if it could >do a little more than this. For example, if the mail app was in the >foreground, it would offer "train as spam" and "train as ham" options, >and be able to take the appropriate action. Rather a difficult thing to >program, though :) > >For the moment, (to me) it doesn't seem worth creating an applet like >this. Clicking "configure" is no different than launching a shortcut to >http://localhost:8880/config. We're a bit lacking on the manual side, >but at some point the web interface (for pop3proxy, imapfilter et al) >will have a link to documentation. When that happens clicking "View >manual" will be the same as launching a shortcut to >http://localhost:8880/help (or whatever the link will be). On Windows >it's simple enough to add those to the Quick Launch bar, for example. I basically like the double-clickability. As opposed to running the console and typing commands (hate win services). I will personally have a look at maybe writing an app that does this. >Would you agree with this? Or am I missing something that makes the >applet better? (I don't have SpamAssassin, so I presume SAProxy >wouldn't work for me). > >=Tony Meyer From T.A.Meyer at massey.ac.nz Tue May 27 13:58:53 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon May 26 20:59:34 2003 Subject: [Spambayes] Impressed by SAProxy Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F9A9@its-xchg4.massey.ac.nz> > I basically like the double-clickability. 
As opposed to > running the console and typing commands (hate win services). But you don't have to type any commands. Just launch a browser and go to a bookmark, or (to cut out a step), open a shortcut to a URL. This is double-clicking (possibly only one click). > I will personally have a look at maybe writing an app that does this. If you use Mark's toolbar demo as a starter, then all you have to do is set one menu item ("Configure") to open the web browser at the appropriate page (pop3proxy.py has code that does this that you can steal), and one to launch whatever documentation you're going to refer to. (And keep the example exit). And that's it. I still don't see the utility of this application, but if you do make it, please submit it as a patch so that others can use it also. =Tony Meyer From bill at parducci.net Mon May 26 20:14:42 2003 From: bill at parducci.net (bill parducci) Date: Mon May 26 22:14:46 2003 Subject: [Spambayes] FAQ References: <3ECE9B33.3010808@parducci.net> <3ED203BB.1030201@videotron.ca> <1071.65.138.137.207.1053963430.squirrel@mail.parducci.net> <16082.26470.784633.295713@montanaro.dyndns.org> Message-ID: <3ED2CA12.3020800@parducci.net> Skip Montanaro wrote: > Folks, before you check in a file, make sure you've done a "cvs up" very > recently, otherwise you may unwittingly obliterated recent updates. mea culpa, i was working from my original copy so i didn't get your changes into the update. i will operate from the cvs version in the future. > I will give it a try, however, what we will wind up with is a table of > contents which is automatically numbered and a body which is manually > numbered. If people start reordering things or adding new questions without > adding numbers it may be better to simply return to the current situation. there may be a way to get this to work automagically, although it may violate some wc3 guidelines. 
i'll doink around with it to see if the body can't have autonumbered text as well (using the latest cvs version, of course ;-) b From T.A.Meyer at massey.ac.nz Tue May 27 18:15:28 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue May 27 01:16:08 2003 Subject: [Spambayes] Classifying on headers only Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1FB27@its-xchg4.massey.ac.nz> A little while back (before the plethora of Outlook problems ;) there was discussion about a "leave spam on server temporarily" flag for pop3proxy. The idea is that when travelling and using an expensive/slow connection (a mobile phone is one example) you don't want to download any mail classified as spam; you'll do that at a later point. This is relatively simple to do, and I'm going to play about with it. It might end up being a patch that people can apply rather than an addition to the main code; I'm not sure. Anyway, the most obvious problem is that this won't work unless the proxy is on the server, otherwise the whole message must be downloaded to classify anyway. The other option, of course, is to download *part* of the message and classify with that. The easiest part to get is the headers. There was a little bit of discussion about header-only classification here: http://mail.python.org/pipermail/spambayes/2002-October/001215.html I can't find anything else, though. I tried it out (disabling the tokenise_body call) and got the following results. This uses the SpamAssassin corpus, except for the my* ones, which uses my own corpus. These are pretty terrible results for general use (6% unsure!), but strike me as not too bad for this purpose. Using the last column, I would download .49% fn and 5.72% unsures, but I still win out, because I've blocked most of the spam. I even win with fp's (presumably they move out into the unsures), so I can be *less* concerned about leaving fp's on the server. Thoughts? 
=Tony Meyer

filename:       bodys   nobodys   mybodys mynobodys balancednobodys balancedbodys
ham:spam:   4140:7580 4700:1980 7580:7580 4140:7580       4700:1980     7580:7580
fp total:          80        15         0        20              44            19
fp %:            1.93      0.36      0.00      0.43            0.58          0.25
fn total:          11        27        75        12              16            37
fn %:            0.15      0.36      3.79      0.61            0.21          0.49
unsure t:         286       728       190       418             357           867
unsure %:        2.44      6.21      2.84      6.26            2.35          5.72
real cost:    $868.20   $322.60   $113.00   $295.60         $527.40       $400.40
best cost:   $1020.60   $821.00   $197.00   $312.60         $592.60       $875.80
h mean:          8.72      6.39      0.64      7.25            3.40          4.50
h sdev:         22.63     16.67      4.50     18.06           14.19         14.12
s mean:         98.99     92.75     83.83     89.67           97.94         91.61
s sdev:          6.58     14.36     29.98     17.54            9.43         15.76
mean diff:      90.27     86.36     83.19     82.42           94.54         87.11
k:               3.09      2.78      2.41      2.32            4.00          2.92

From gpip at seeprescott.com Mon May 26 23:20:42 2003
From: gpip at seeprescott.com (Ron Campbell)
Date: Tue May 27 01:20:45 2003
Subject: [Spambayes] Kill The Spam!
Message-ID:

Sir;
Please keep us on the list for your next release!

Thanks
Ron

From T.A.Meyer at massey.ac.nz Tue May 27 18:28:24 2003
From: T.A.Meyer at massey.ac.nz (Meyer, Tony)
Date: Tue May 27 01:29:05 2003
Subject: [Spambayes] Kill The Spam!
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1FB30@its-xchg4.massey.ac.nz>

> Sir;
> Please keep us on the list for your next release!
>
> Thanks
> Ron

What's the current story with the announce/users/dev lists? Was there enough agreement to convince the powers that be (Tim? Barry?) that they would be a good thing? The announce list, certainly, seems to be called for.

=Tony Meyer

From T.A.Meyer at massey.ac.nz Tue May 27 18:32:49 2003
From: T.A.Meyer at massey.ac.nz (Meyer, Tony)
Date: Tue May 27 01:33:24 2003
Subject: [Spambayes] Kill The Spam!
Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301C1FB31@its-xchg4.massey.ac.nz>

> What's the current story with the announce/users/dev lists?
> Was there enough agreement to convince the powers that be
> (Tim? Barry?) that they would be a good thing?
^^^^ Poor wording :) By they, I mean the new lists, not the powers themselves. Of course they're a good thing . =Tony Meyer From tim.one at comcast.net Tue May 27 03:03:42 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue May 27 02:04:52 2003 Subject: [Spambayes] Classifying on headers only In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301C1FB27@its-xchg4.massey.ac.nz> Message-ID: [Meyer, Tony] > ... > There was a little bit of discussion about header-only classification > here: > http://mail.python.org/pipermail/spambayes/2002-October/001215.html > > I can't find anything else, though. There's a bit more here: http://mail.python.org/pipermail/spambayes/2002-November/001525.html From lists at olivermaunder.co.uk Tue May 27 10:59:31 2003 From: lists at olivermaunder.co.uk (Oliver Maunder) Date: Tue May 27 05:00:56 2003 Subject: [Spambayes] Not a header, not a continuation Message-ID: <3ED328F3.8080201@olivermaunder.co.uk> Hi all I've got a malformed spam in my inbox that causes the email package to choke with the error: email.Errors.HeaderParseError: Not a header, not a continuation: ``PCEtLTI5NDU5L S0+PGJvZHk+CjxkaXYgYWxpZ249ImNlbnRlciI+PGZvbnQgZmFjZT0iQXJp'' I remember seeing some mention of this on the list before, but a quick google of the archives hasn't thrown up any obvious solutions. I'm using version 2.5 of the email package, fresh from Sourceforge, along with Python 2.2 on Windows XP. We could always amend the tokenizer to produce a token for "crappy malformed base64 encoded MIME message that breaks the email lib". I'm sure that would turn out to be a strong spam clue ;-) The full stack trace and offending email are included below. Olly C:\Development\SpamBayes\spambayes>imapfilter.py -c -v Loading database hammie.db... Loading state from hammie.db database hammie.db is an existing database, with 285 spam and 172 ham Done. Classifying Traceback (most recent call last): File "C:\Development\SpamBayes\spambayes\imapfilter.py", line 739, in ? 
run() File "C:\Development\SpamBayes\spambayes\imapfilter.py", line 729, in run imap_filter.Filter() File "C:\Development\SpamBayes\spambayes\imapfilter.py", line 604, in Filter self.unsure_folder) File "C:\Development\SpamBayes\spambayes\imapfilter.py", line 525, in Filter for msg in self: File "C:\Development\SpamBayes\spambayes\imapfilter.py", line 434, in __iter__ yield self[key] File "C:\Development\SpamBayes\spambayes\imapfilter.py", line 481, in __getite m__ msg.get_substance() File "C:\Development\SpamBayes\spambayes\imapfilter.py", line 331, in get_subs tance new_msg = email.Parser.Parser().parsestr(data["RFC822"]) File "C:\PROGRA~1\Python\lib\email\Parser.py", line 75, in parsestr return self.parse(StringIO(text), headersonly=headersonly) File "C:\PROGRA~1\Python\lib\email\Parser.py", line 64, in parse self._parsebody(root, fp) File "C:\PROGRA~1\Python\lib\email\Parser.py", line 228, in _parsebody msgobj = self.parsestr(part) File "C:\PROGRA~1\Python\lib\email\Parser.py", line 75, in parsestr return self.parse(StringIO(text), headersonly=headersonly) File "C:\PROGRA~1\Python\lib\email\Parser.py", line 62, in parse self._parseheaders(root, fp) File "C:\PROGRA~1\Python\lib\email\Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( email.Errors.HeaderParseError: Not a header, not a continuation: ``PCEtLTI5NDU5L S0+PGJvZHk+CjxkaXYgYWxpZ249ImNlbnRlciI+PGZvbnQgZmFjZT0iQXJp'' X-Sieve: cmu-sieve 2.0 Return-Path: Received: from xxxxxxx.xxxxxxx.net (xxxxxxx.xxxxxxx.net [111.111.111.111]) by xxxxxxx.xxxxxxx.net (8.12.3/8.12.3/Debian-5) with ESMTP id h4NKBYwc006866 for ; Fri, 23 May 2003 21:11:34 +0100 Received: from bigfoot.com (mail.bigfoot.com [64.15.239.131]) by xxxxxxx.xxxxxxx.net (8.9.3/8.9.3/Debian/GNU/f2s) with SMTP id UAA24003 for ; Fri, 23 May 2003 20:11:33 GMT Received: from CE142150.user.veloxzone.com.br ([200.164.142.150]) by BFLITEMAIL1A.bigfoot.com (LiteMail v3.03(BFLITEMAIL1A)) with SMTP id 23May2003_BFLITEMAIL1A_71152_48970636; Fri, 
23 May 2003 16:12:49 -0400 EST Received: from email.com (4742 [145.42.195.235]) by cableaz.com (8.12.1/8.12.1) with ESMTP id 21433 for ; Fri, 23 May 2003 14:09:11 -0700 Received: from haggs-road.freeserve.co.uk ([131.90.178.57]) by ericcowie.freeserve.co.uk (8.9.3/8.9.3) with SMTP id 30055 for ; Fri, 23 May 2003 14:09:06 -0700 Message-ID: <1043319519roo|pCeljirrw1frp@saintly.com> From: "lothario desiree" To: "xxxxxxx@bigfoot.com" Date: Fri, 23 May 2003 14:09:01 -0700 Subject: Explicit cock sucking roo|pCeljirrw1frp MIME-Version: 1.0 Content-Type: multipart/related; boundary="----=_NextPart_000_0009_13056254.28517131" ------=_NextPart_000_0009_13056254.28517131 Content-Type: text/html; Content-Transfer-Encoding: base64 PCEtLTI5NDU5LS0+PGJvZHk+CjxkaXYgYWxpZ249ImNlbnRlciI+PGZvbnQgZmFjZT0iQXJp YWwsIEhlbHZldGljYSwgc2Fucy1zZXJpZiI+U2hlIGlzIGFuIG9yYWwgYWRkaWN0LiAKICA8 L2ZvbnQ+PGZvbnQgZmFjZT0iQXJpYWwsIEhlbHZldGljYSwgc2Fucy1zZXJpZiI+VGhlc2Ug d2hvcmVzIGxvdmUgY3VtLiA8L2ZvbnQ+PGZvbnQgZmFjZT0iQXJpYWwsIEhlbHZldGljYSwg c2Fucy1zZXJpZiI+QW1hemUgeW91cnNlbGYgYXQgaG93IG1hbnkgY29ja3Mgc2hlIGNhbiBz dHVmZiBpbiBoZXIgbW91dGguIAogIDwvZm9udD48Zm9udCBmYWNlPSJBcmlhbCwgSGVsdmV0 aWNhLCBzYW5zLXNlcmlmIj5DdW1taW5nIG92ZXIgdGhpcyB0ZWVuJ3MgZmFjZS4gPC9mb250 Pjxicj4KICA8Zm9udCBmYWNlPSJBcmlhbCwgSGVsdmV0aWNhLCBzYW5zLXNlcmlmIj5Zb3Ug d29uJ3QgYmVsaWV2ZSBob3cgbWFueSBndXlzIHRoaXMgZnJlc2htYW4gY2hpY2sgYmVkcy4g PC9mb250Pjxmb250IGZhY2U9IkFyaWFsLCBIZWx2ZXRpY2EsIHNhbnMtc2VyaWYiPjxicj4K ICBSb2NrIGhhcmQgUm9kbmV5IHBvdW5kZWQgaGVyIHRpZ2h0IHB1c3N5LiA8L2ZvbnQ+PGZv bnQgZmFjZT0iQXJpYWwsIEhlbHZldGljYSwgc2Fucy1zZXJpZiI+PGJyPgogIEhvdyBtYW55 IG9mIGhlciBicm90aGVyJ3MgaW5jaGVzIGNhbiB0aGlzIHRlZW4gYml0Y2ggdGFrZS4gPC9m b250Pjxmb250IGZhY2U9IkFyaWFsLCBIZWx2ZXRpY2EsIHNhbnMtc2VyaWYiPjwvZm9udD48 Zm9udCBmYWNlPSJBcmlhbCwgSGVsdmV0aWNhLCBzYW5zLXNlcmlmIj4gCiAgPC9mb250Pjxm b250IGZhY2U9IkFyaWFsLCBIZWx2ZXRpY2EsIHNhbnMtc2VyaWYiPiA8YnI+CiAgPGEgaHJl Zj0iaHR0cDovL3d3dy5hbGxhYm91dG1laG9zdGluZy5jb20vYWMtbmV3L2NzZy5odG1sP3Jv 
b3xwQ2VsamlycncxZnJwIj5IZXIgYm95ZnJpZW5kIGlzIHNvIGhvcnNlLWh1bmcgc2hlIGlu dml0ZWQgaGVyIHBsZWRnZSBzaXN0ZXIgb3ZlciA8L2E+IDxicj4KICBZb3Ugc2hvdWxkIGNo ZWNrIHRoaXMgY3Vtc2hvdHMgb3V0LiBUaGUgSG90dGVzdCAuIExpdHRsZSB0ZWVuYWdlIGdp cmxmcmllbmQgd2FudHMgdG8gbGVhcm4gaG93IHRvIGxvc2UgdGhlIGdhZyByZWZsZXguIDxi cj4KICA8YSBocmVmPSJodHRwOi8vd3d3Lmhvc3RlZHNpdGVzNHUuY29tL2FjLW5ldy9zZi5o dG1sP3Jvb3xwQ2VsamlycncxZnJwIj5UaGVzZSBnaXJscyBhcmUgYmVpbmcgaHVtaWxpYXRl ZDwvYT4gPC9mb250PiAKPC9kaXY+CjxwPiZuYnNwOzwvcD4KPHA+Jm5ic3A7PC9wPgo8cD4m bmJzcDs8L3A+CjxwPiZuYnNwOzwvcD4KPHA+Jm5ic3A7PC9wPgo8cD4mbmJzcDs8L3A+Cjxw Pjxmb250IGZhY2U9IkFyaWFsLCBIZWx2ZXRpY2EsIHNhbnMtc2VyaWYiPjxhIGhyZWY9Imh0 dHA6Ly93d3cudG9wOTlob3N0LmNvbS9jZ2ktYmluL2J5ZS9ieWUuY2dpP3Jvb3xwQ2Vsamly cncxZnJwIj5ubyBtb3JlIGVtYWlsPC9hPjwvZm9udD48L3A+CjwvYm9keT4= From yaki at ParadigmGeo.com Tue May 27 15:31:55 2003 From: yaki at ParadigmGeo.com (Yaki Rubin) Date: Tue May 27 07:30:50 2003 Subject: [Spambayes] Errors in LOG file Message-ID: Hi, I just installed your software and I checked the log file. Drive "E:" is my CDROM I don't know anything about folder named: #### File "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 355, in _UpdateStatus File "E:\src\Installer\iu.py", line 296, in importHook ImportError: No module named timer win32ui: Exception in OnNotify() handler #### Please advise... Thanks, Yaki Rubin I'm running outlook xp sp2 on winxp pro working with exchange 5.5 -------------- next part -------------- A non-text attachment was scrubbed... 
Name: spambayes1.log Type: application/octet-stream Size: 1891 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030527/743fba99/spambayes1.obj From noreply at sourceforge.net Tue May 27 06:42:34 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 27 08:42:37 2003 Subject: [Spambayes] [ spambayes-Bugs-738501 ] AttributeError second time around with pop3proxy Message-ID: Bugs item #738501, was opened at 2003-05-15 21:27 Message generated for change (Comment added) made by papadoc You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=738501&group_id=61702 Category: pop3proxy Group: None Status: Open Resolution: None Priority: 5 Submitted By: Peter Bengtsson (peterbe) Assigned to: Nobody/Anonymous (nobody) Summary: AttributeError second time around with pop3proxy Initial Comment: Yesterday (14/05) I downloaded the latest CVS version and starting it; worked fine. Then I turned that off when I went to bed and tried to start it again today. This time I get the following error:: C:\Python22\spambayes>C:\python22\python.exe pop3proxy.py Loading database... Traceback (most recent call last): File "pop3proxy.py", line 738, in ? run() File "pop3proxy.py", line 713, in run state.createWorkers() File "pop3proxy.py", line 559, in createWorkers self.bayes = storage.DBDictClassifier(filename) File "spambayes\storage.py", line 139, in __init__ self.load() File "spambayes\storage.py", line 151, in load t = self.db[self.statekey] File "C:\Python22\lib\shelve.py", line 71, in __getitem__ return Unpickler(f).load() EOFError Exception exceptions.AttributeError: "'NoneType' object has no attribute 'error' " in > ignored Any ideas what this might be? Is it a bug? Solutions? ---------------------------------------------------------------------- Comment By: Remi Ricard (papadoc) Date: 2003-05-27 12:42 Message: Logged In: YES user_id=145206 Hi, I'm getting the same error message. 
This is what happened. I was training pop3proxy with the web interface. Classifying a unsure to be spam. I got this error message in the web interface raceback (most recent call last): File "C:\Devtools\SPAMBA~1\SPAMBA~1.21\spambayes\Dibbler.py", line 398, in found_terminator getattr(plugin, name)(**params) File "C:\Devtools\SPAMBA~1\SPAMBA~1.21\spambayes\ProxyUI.py", line 359, in onReview message = spambayes.mboxutils.get_message(cachedMessage.getSubstance()) File "C:\Devtools\SPAMBA~1\SPAMBA~1.21\spambayes\Corpus.py", line 349, in getSubstance return self.hdrtxt + self.payload File "C:\Devtools\SPAMBA~1\SPAMBA~1.21\spambayes\Corpus.py", line 296, in __getattr__ raise AttributeError, attributeName AttributeError: hdrtxt Then I stop pop3proxy.py because I wanted to enable the logging with [globals] verbose = True Then I restarted the pop3proxy and I got the same error message as in bug 738501 Remi (papaDoc@videotron.ca) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=738501&group_id=61702 From mail at peterbe.com Tue May 27 14:55:35 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Tue May 27 09:04:09 2003 Subject: [Spambayes] Impressed by SAProxy In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301C1F9A9@its-xchg4.massey. ac.nz> Message-ID: <5.2.0.9.0.20030527135031.02271420@localhost> At 12:58 2003-05-27 +1200, Meyer, Tony wrote: > > I basically like the double-clickability. As opposed to > > running the console and typing commands (hate win services). > >But you don't have to type any commands. Just launch a browser and go >to a bookmark, or (to cut out a step), open a shortcut to a URL. This >is double-clicking (possibly only one click). > > > I will personally have a look at maybe writing an app that does this. 
>
>If you use Mark's toolbar demo as a starter, then all you have to do is
>set one menu item ("Configure") to open the web browser at the
>appropriate page (pop3proxy.py has code that does this that you can
>steal), and one to launch whatever documentation you're going to refer
>to. (And keep the example exit). And that's it.
>
>I still don't see the utility of this application, but if you do make
>it, please submit it as a patch so that others can use it also.

No, it's not just the documentation or the configuration web interface.
It's the actual application running. At the moment I have a command
console open on my windows. What other ways are there to start the
pop3proxy?
What I do is:
Run -> "cmd" Enter -> "cd C:\Python22\spambayes\" ->
"C:\Python22\python.exe pop3proxy.py"

Alternatively I could write some little batch file that I stick in the
Autostart.

I will see what I can do with the taskbar gui demo app.

Peter

From bkc at murkworks.com Tue May 27 10:32:21 2003
From: bkc at murkworks.com (Brad Clements)
Date: Tue May 27 09:13:08 2003
Subject: [Spambayes] sharing wordlists - better numbers
Message-ID: <3ED330A5.24185.26CF06E1@localhost>

Here are all the 2nd-pass word lists I have this morning:

[bkc@strader data2]$ python2.3 ../countwords.py key-hash.*
loading key-hash.popiel 376496
loading key-hash.skip 392849
loading key-hash.tameyer 45914
loading key-hash.timp 45426
loading key-hash.who 28251
Loaded 5 wordlists with 717487 distinct words out of 888936 total words
Word counts by number of collections each word is seen in
Col. #     Words   % of unique words
     1    597436   83.3%
     2     83279   11.6%
     3     22983    3.2%
     4     12952    1.8%
     5       837    0.1%

And picking two that are nearly the same size

[bkc@strader data2]$ python2.3 ../countwords.py key-hash.popiel key-hash.skip
loading key-hash.popiel 376496
loading key-hash.skip 392849
Loaded 2 wordlists with 684190 distinct words out of 769345 total words
Word counts by number of collections each word is seen in
Col. #     Words   % of unique words
     1    599035   87.6%
     2     85155   12.4%

And another two nearly the same size

loading key-hash.tameyer 45914
loading key-hash.timp 45426
Loaded 2 wordlists with 78218 distinct words out of 91340 total words
Word counts by number of collections each word is seen in
Col. #     Words   % of unique words
     1     65096   83.2%
     2     13122   16.8%

So .. it seems that at least 12000 words appear in nearly everyone's
wordlist. (i.e., I ran the test with four lists, leaving out
key-hash.who, which was uploaded as just key-hash.)

loading key-hash.tameyer 45914
loading key-hash.timp 45426
loading key-hash.popiel 376496
loading key-hash.skip 392849
Loaded 4 wordlists with 717487 distinct words out of 860685 total words
Word counts by number of collections each word is seen in
Col. #     Words   % of unique words
     1    618967   86.3%
     2     66186    9.2%
     3     19990    2.8%
     4     12344    1.7%

What can we say about these 12000 words? Is there some way to improve
the system by knowing that others have these same words, even if we
don't know how others have classified those words in their own
databases?

--
Brad Clements,                bkc@murkworks.com   (315)268-1000
http://www.murkworks.com                          (315)268-9812 Fax
http://www.wecanstopspam.org/                     AOL-IM: BKClements

From lists at olivermaunder.co.uk Tue May 27 15:24:41 2003
From: lists at olivermaunder.co.uk (Oliver Maunder)
Date: Tue May 27 09:26:06 2003
Subject: [Spambayes] Impressed by SAProxy
In-Reply-To: <5.2.0.9.0.20030527135031.02271420@localhost>
References: <5.2.0.9.0.20030527135031.02271420@localhost>
Message-ID: <3ED36719.4080700@olivermaunder.co.uk>

Peter Bengtsson wrote:

> What other ways are there to start the pop3proxy?
> What I do is:
> Run -> "cmd" Enter -> "cd C:\Python22\spambayes\" ->
> "C:\Python22\python.exe pop3proxy.py"

Create a shortcut with the command line:

C:\Python22\python.exe C:\python22\spambayes\pop3proxy.py

In the shortcut properties set "Start In" to c:\python22\spambayes.
Then, you can put this shortcut in your start menu, on the desktop, or in your Startup group to make sure it's run when Windows boots. You can get rid of the console window by running the script with pythonw.exe instead of python.exe. This does mean that you won't see the console window output. I don't run pop3proxy, so I don't know if that is important. Olly > > Alternativly I could write some little batch file that I stick in the > Autostart. > > I will see what I can do with the taskbar gui demo app. > > Peter > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes From Pei-shan at anthology.com Tue May 27 10:59:34 2003 From: Pei-shan at anthology.com (Pei-shan) Date: Tue May 27 09:56:47 2003 Subject: [Spambayes] folder disappeared Message-ID: I installed your anti-spam program last week. It worked wonderfully for a few days. This morning, as I was admiring the fine job it did over the weekend (caught 250 spam emails), the 'Spam' folder I setup disappeared without a trace. I tried to re-'train' the program and it will not recognize the folder I newly setup. i.e. When I tried to 'delete as Spam', I get a message "You must configure the spam folder". Any suggestions? Pei-shan From skip at pobox.com Tue May 27 09:57:10 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 27 09:57:17 2003 Subject: [Spambayes] sharing wordlists - better numbers In-Reply-To: <3ED330A5.24185.26CF06E1@localhost> References: <3ED330A5.24185.26CF06E1@localhost> Message-ID: <16083.28342.289866.251818@montanaro.dyndns.org> Brad> What can we say about these 12000 words? That they are common? ;-) How about the next step in the exercise? 
I propose that Alex and I (assuming Alex is amenable) each extract a non-hapax version of our word databases (real keys, real counts) then from that further extract a database from that of the most hammy and spammy words (how about <0.2 and >0.8?), run the usual tests against them and see how they do. If that looks promising, we can merge the common words from the two, test again, see how big the result is, then decide whether to include it in a later distribution. What's the formula again for computing the ham/spam probability for a single word given its counts in spam and ham messages? I can never remember it and can't locate it in the source. Is it just the usual 0...1 sort of thing: 1 - nham/(nham+nspam) ? Also, what's the key in the database which stores the total spam and ham counts? Thx, Skip From a at cisco.com Tue May 27 16:57:40 2003 From: a at cisco.com (Alister Forbes) Date: Tue May 27 09:58:19 2003 Subject: [Spambayes] muttrc file in the source distro Message-ID: <20030527135740.GA23883@cisco.com> All, First, thanks for spambayes... has cut my daily spam from 30 - 40 a day. Just one teeny tiny thing, the colour tagging in the supplied muttrc file doesn't work. Or at least, not in the latest versions of mutt. in contrib/muttrc you show: color index red black "~h 'X-Hammie-Disposition: spam' ~F" and what actually works is: color index red black "~h 'X-Spambayes-Classification: spam' " I just ran up a quick page showing how to install it locally (our sysadmins weren't too keen on giving me global access) and in the first half hour there's already been nearly 100 hits. Thanks for all the great work. Alister -- Alister Forbes Work: +32 2 704 5762 Internal: 322 5762 a@cisco.com Euro-CATS _.|._.|._ Cisco Systems Please avoid sending me Word or PowerPoint attachments. See http://www.fsf.org/philosophy/no-word-attachments.html -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: application/pgp-signature Size: 185 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030527/6a36c543/attachment.bin From bkc at murkworks.com Tue May 27 11:28:04 2003 From: bkc at murkworks.com (Brad Clements) Date: Tue May 27 10:08:57 2003 Subject: [Spambayes] sharing wordlists - better numbers In-Reply-To: <16083.28342.289866.251818@montanaro.dyndns.org> References: <3ED330A5.24185.26CF06E1@localhost> Message-ID: <3ED33DB4.30083.270206F9@localhost> On 27 May 2003 at 8:57, Skip Montanaro wrote: > How about the next step in the exercise? I propose that Alex and I > (assuming Alex is amenable) each extract a non-hapax version of our word > databases (real keys, real counts) then from that further extract a > database from that of the most hammy and spammy words (how about <0.2 and > >0.8?), run the usual tests against them and see how they do. If that > looks promising, we can merge the common words from the two, test again, > see how big the result is, then decide whether to include it in a later > distribution. Are you suggesting including a "starter database" in the spambayes distribution, by noting which words are common to more than one person? I wonder .. if we only used words which were "common" when determining spaminess, how well would that work? Lets suppose in a "semi-shared database" mode, there was a mechanism for 'upscaling' hapaxes into the "common word list", so that long term the collective wordlist would continue to evolve. Individuals keep only their personal weights for "common words", so the database is split.. word-list is shared, weights are private. Some users have gigantic lists of words, how many of them are hapaxes? Is it possible that we could get good results by only using "common words", even if non-hapaxes for some users were not in the common words list? I suppose that depends on how close an individuals preferences are to the median of their group's preferences. 
Do Skip and Alex have a small std deviation in their virtual group? ;-) -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From skip at pobox.com Tue May 27 10:36:31 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 27 10:36:38 2003 Subject: [Spambayes] sharing wordlists - better numbers In-Reply-To: <3ED33DB4.30083.270206F9@localhost> References: <3ED330A5.24185.26CF06E1@localhost> <3ED33DB4.30083.270206F9@localhost> Message-ID: <16083.30703.462826.37792@montanaro.dyndns.org> Brad> Are you suggesting including a "starter database" in the spambayes Brad> distribution, by noting which words are common to more than one Brad> person? I thought that was the direction you were headed with this exercise. I guess I misunderstood. Brad> I wonder .. if we only used words which were "common" when Brad> determining spaminess, how well would that work? Should work pretty well if we include "common" words which turn out to be strong spam or ham indicators for a suitable cross-section of the group. Brad> Lets suppose in a "semi-shared database" mode, there was a Brad> mechanism for 'upscaling' hapaxes into the "common word list", so Brad> that long term the collective wordlist would continue to evolve. Alex and I both have fairly large word databases. I suspect hapaxes will remain hapaxes. I'm thinking of just a starter database of a reasonable size. It could be shipped in plain text form then installed using Tim Stone's (I believe) database importer/exporter tool. Brad> Do Skip and Alex have a small std deviation in their virtual Brad> group? ;-) I realize there's a smiley, but what do you mean by "virtual group"? Skip From popiel at wolfskeep.com Tue May 27 08:44:58 2003 From: popiel at wolfskeep.com (T. 
Alexander Popiel) Date: Tue May 27 10:45:04 2003 Subject: [Spambayes] sharing wordlists - better numbers In-Reply-To: Message from Skip Montanaro of "Tue, 27 May 2003 09:36:31 CDT." <16083.30703.462826.37792@montanaro.dyndns.org> References: <3ED330A5.24185.26CF06E1@localhost> <3ED33DB4.30083.270206F9@localhost> <16083.30703.462826.37792@montanaro.dyndns.org> Message-ID: <20030527144458.0E3032DDEF@cashew.wolfskeep.com> In message: <16083.30703.462826.37792@montanaro.dyndns.org> Skip Montanaro writes: > > Brad> Are you suggesting including a "starter database" in the spambayes > Brad> distribution, by noting which words are common to more than one > Brad> person? > >I thought that was the direction you were headed with this exercise. >I guess I misunderstood. That would be one direction to take it, though I'd really be careful with it, for all the reasons already discussed on the list. Another direction is pure research; look at the pretty numbers and see what falls out. Perhaps we'll discover something novel. > Brad> I wonder .. if we only used words which were "common" when > Brad> determining spaminess, how well would that work? > >Should work pretty well if we include "common" words which turn out to be >strong spam or ham indicators for a suitable cross-section of the group. It's the identification of groups and common interests that's most likely to fall out of this line of study, I suspect. > Brad> Lets suppose in a "semi-shared database" mode, there was a > Brad> mechanism for 'upscaling' hapaxes into the "common word list", so > Brad> that long term the collective wordlist would continue to evolve. > >Alex and I both have fairly large word databases. I suspect hapaxes will >remain hapaxes. I'm thinking of just a starter database of a reasonable >size. It could be shipped in plain text form then installed using Tim >Stone's (I believe) database importer/exporter tool. I can make a larger database, too. 
The one I gave is just the last four
months of data; I have nearly double that, now, which I could
relatively easily load into a database for perusal.

> Brad> Do Skip and Alex have a small std deviation in their virtual
> Brad> group? ;-)
>
>I realize there's a smiley, but what do you mean by "virtual group"?

Well, I can't speak for Skip, but my group of virtual friends are
a pretty deviant lot. Not prone to being standard. ;-)

- Alex

From bkc at murkworks.com Tue May 27 12:10:59 2003
From: bkc at murkworks.com (Brad Clements)
Date: Tue May 27 10:51:47 2003
Subject: [Spambayes] sharing wordlists - better numbers
In-Reply-To: <16083.30703.462826.37792@montanaro.dyndns.org>
References: <3ED33DB4.30083.270206F9@localhost>
Message-ID: <3ED347C3.11751.27295172@localhost>

On 27 May 2003 at 9:36, Skip Montanaro wrote:

> I thought that was the direction you were headed with this exercise.
> I guess I misunderstood.

The original question came up when someone asked about how to run
spambayes on a server with 7000 accounts.

My thought was, if we split "the database" into two parts, the word
list portion could be shared and the weights would still be personal.

So picture a school, or a business where you want to run spambayes on
the email server, still offer personal weighting but cut down on the
amount of data that needs to be stored.

Is it possible/practical to target this problem by using a split
database? That's the question I'm trying to answer.

> Alex and I both have fairly large word databases. I suspect hapaxes will
> remain hapaxes.

By this I meant that a hapax is only a hapax until it appears again in
another message. So imagine the first appearance of 'p0rn'. First time
it's trained, it's a hapax. But 50 other people in the company got the
same email and they all mark it as spam. Somehow that word has to move
from being a personal hapax into the collective word list.
That mechanism isn't important right now, I just mentioned hapaxes
because they have to be able to be "promoted" into the collective list.

OTOH, personal hapaxes will have to time out quickly too, otherwise
splitting the db is pointless if everyone has 380K hapaxes too.

> Brad> Do Skip and Alex have a small std deviation in their virtual
> Brad> group? ;-)
>
> I realize there's a smiley, but what do you mean by "virtual group"?
>

As in, all the students in the school, all the employees in the same
division of the company. Obviously one database won't work for all of
IBM, but might work for everyone in Endicott.

The presumption is that members of a group, by sharing the same
language and similar social settings will have a higher percentage of
their words in common. So if 95% of your words are also in Alex's list,
and 92% of his words are in your list, that's a lot better than only
50-50.

Note that this doesn't mean members of the group weight their words the
same, only that they see the same words.

--
Brad Clements,                bkc@murkworks.com   (315)268-1000
http://www.murkworks.com                          (315)268-9812 Fax
http://www.wecanstopspam.org/                     AOL-IM: BKClements

From popiel at wolfskeep.com Tue May 27 08:57:02 2003
From: popiel at wolfskeep.com (T. Alexander Popiel)
Date: Tue May 27 10:57:07 2003
Subject: [Spambayes] sharing wordlists - better numbers
In-Reply-To: Message from Skip Montanaro of "Tue, 27 May 2003 08:57:10 CDT." <16083.28342.289866.251818@montanaro.dyndns.org>
References: <3ED330A5.24185.26CF06E1@localhost> <16083.28342.289866.251818@montanaro.dyndns.org>
Message-ID: <20030527145702.71BBF2DDEF@cashew.wolfskeep.com>

In message: <16083.28342.289866.251818@montanaro.dyndns.org>
Skip Montanaro writes:
>
> Brad> What can we say about these 12000 words?
>
>That they are common? ;-)
>
>How about the next step in the exercise?
I propose that Alex and I
>(assuming Alex is amenable) each extract a non-hapax version of our word
>databases (real keys, real counts) then from that further extract a database
>from that of the most hammy and spammy words (how about <0.2 and >0.8?), run
>the usual tests against them and see how they do. If that looks promising,
>we can merge the common words from the two, test again, see how big the
>result is, then decide whether to include it in a later distribution.

I'm amenable, but I'm a bit short on time at the moment. If you
have the time, I can give you my entire database... otherwise
it'll likely wait until sometime this coming weekend.

>What's the formula again for computing the ham/spam probability for a single
>word given its counts in spam and ham messages? I can never remember it and
>can't locate it in the source. Is it just the usual 0...1 sort of thing:
>
> 1 - nham/(nham+nspam)

Nope. Method 'probability' in 'classifier.py':

    spamcount = record.spamcount
    hamcount = record.hamcount

    nham = float(self.nham or 1)
    nspam = float(self.nspam or 1)

    assert hamcount <= nham
    hamratio = hamcount / nham

    assert spamcount <= nspam
    spamratio = spamcount / nspam

    prob = spamratio / (hamratio + spamratio)

    if options.experimental_ham_spam_imbalance_adjustment:
        spam2ham = min(nspam / nham, 1.0)
        ham2spam = min(nham / nspam, 1.0)
    else:
        spam2ham = ham2spam = 1.0

    S = options.unknown_word_strength
    StimesX = S * options.unknown_word_prob

    n = hamcount * spam2ham + spamcount * ham2spam
    prob = (StimesX + n * prob) / (S + n)

>? Also, what's the key in the database which stores the total spam and ham
>counts?

Urgh... I'm forgetting this one. In the pickle, they're stored
outside the wordlist, but I'm having trouble finding the non-pickle
version of the storage... ah, here it is: it's stored under the
"saved state" entry.
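
To make the excerpt above easier to try outside the classifier, here is
a minimal standalone sketch of the same arithmetic. It is not the
spambayes method itself: the function name word_spamprob, its
plain-argument signature, and the values used for the unknown-word
strength and probability (0.45 and 0.5) are assumptions for
illustration, and the zero-count guard is an addition the excerpt does
not have.

```python
def word_spamprob(spamcount, hamcount, nspam, nham,
                  strength=0.45, unknown_prob=0.5,
                  imbalance_adjustment=False):
    """Spam probability for one word, following the quoted excerpt.

    spamcount/hamcount: messages of each kind containing the word.
    nspam/nham: total trained spam/ham messages.
    strength/unknown_prob: illustrative stand-ins for the two
    unknown-word options read in the excerpt (assumed values).
    """
    nham = float(nham or 1)
    nspam = float(nspam or 1)

    assert hamcount <= nham
    assert spamcount <= nspam
    hamratio = hamcount / nham
    spamratio = spamcount / nspam

    # Guard added for this sketch: a word never seen in training
    # simply gets the unknown-word probability.
    if hamratio + spamratio == 0:
        return unknown_prob

    prob = spamratio / (hamratio + spamratio)

    if imbalance_adjustment:
        spam2ham = min(nspam / nham, 1.0)
        ham2spam = min(nham / nspam, 1.0)
    else:
        spam2ham = ham2spam = 1.0

    # Blend the observed ratio with the prior; more evidence (n)
    # means the prior matters less.
    n = hamcount * spam2ham + spamcount * ham2spam
    return (strength * unknown_prob + n * prob) / (strength + n)


if __name__ == "__main__":
    print(word_spamprob(10, 0, 100, 100))   # spam-only word
    print(word_spamprob(5, 5, 100, 100))    # evenly split word
```

With equal totals, a word seen 10 times in spam and never in ham scores
close to, but not exactly, 1.0, because the unknown-word prior pulls
small counts toward 0.5.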
- Alex From bill at parducci.net Tue May 27 09:02:47 2003 From: bill at parducci.net (bill parducci) Date: Tue May 27 11:09:40 2003 Subject: [Spambayes] sharing wordlists - better numbers References: <3ED33DB4.30083.270206F9@localhost> <3ED347C3.11751.27295172@localhost> Message-ID: <3ED37E17.9020307@parducci.net> Brad Clements wrote: > The presumption is that members of a group, by sharing the same language and > similar social settings will have a higher percentage of their words in common. So if > 95% of your words are also in Alex's list, and 92% of his words are in your list, thats > a lot better than only 50-50. even if this were 100%... > Note that this doesn't mean members of the group weight their words the same, only > that they see the same words. ...you would still have to have some sort of local index to refer the common token (as pointed out earlier in the discussion) so the size savings would be limited. > Is it possible/practical to target this problem by using a > split database? the answers would seem to be yes/no respectively once you consider the additional overhead of referring to external data, merging tokens and increased fragility (single point of failure). this is a very interesting idea, but after working it through in my head, it doesn't seem to offer an architectural improvement over the existing system (even for 7000 users). this is not to say that the pursuit of commonality, etc. won't bear fruit down the road, but that i personally don't think the original intent will be served. just my two cents... 
b From skip at pobox.com Tue May 27 11:40:31 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 27 11:40:37 2003 Subject: [Spambayes] sharing wordlists - better numbers In-Reply-To: <20030527145702.71BBF2DDEF@cashew.wolfskeep.com> References: <3ED330A5.24185.26CF06E1@localhost> <16083.28342.289866.251818@montanaro.dyndns.org> <20030527145702.71BBF2DDEF@cashew.wolfskeep.com> Message-ID: <16083.34543.487263.763560@montanaro.dyndns.org> Alex> I'm amenable, but I'm a bit short on time at the moment. If you Alex> have the time, I can give you my entire database... otherwise Alex> it'll likely wait until sometime this coming weekend. Sure, I have a script which dumps the words which fit my criteria. I can run both our databases through them and then just use comm(1) to get their intersection. Alex> Nope. Method 'probability' in 'classifier.py': Thanks, got it now. Skip From skip at pobox.com Tue May 27 11:21:39 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 27 11:42:07 2003 Subject: [Spambayes] sharing wordlists - better numbers In-Reply-To: <20030527144458.0E3032DDEF@cashew.wolfskeep.com> References: <3ED330A5.24185.26CF06E1@localhost> <3ED33DB4.30083.270206F9@localhost> <16083.30703.462826.37792@montanaro.dyndns.org> <20030527144458.0E3032DDEF@cashew.wolfskeep.com> Message-ID: <16083.33411.454483.16008@montanaro.dyndns.org> Brad> Do Skip and Alex have a small std deviation in their virtual Brad> group? ;-) >> >> I realize there's a smiley, but what do you mean by "virtual group"? Alex> Well, I can't speak for Skip, but my group of virtual friends are Alex> a pretty deviant lot. Not prone to being standard. ;-) Oh, I see. I doubt my virtual group of online friends is all that typical either. By taking an intersection of what we consider to be hammy and spammy words, I think we'll converge on a fairly innocuous set of words. In any case, this would just be for a starter database. 
People could presumably do one of three things:

    * ignore it altogether - start from scratch
    * start with it and build from there
    * start with it, then subtract it off after a while

In the last item "subtract" might be either a gradual or quick thing.

Skip

From tim.one at comcast.net Tue May 27 12:45:08 2003
From: tim.one at comcast.net (Tim Peters)
Date: Tue May 27 11:46:02 2003
Subject: [Spambayes] New mailing lists
In-Reply-To: <1054045679.2927.1.camel@barry>
Message-ID:

[Barry Warsaw, in pvt email]
> It's currently too hard to alias a mailing list on python.org, so I'm
> giving up on that. For now, you've got spambayes, spambayes-dev, and
> spambayes-announce. All have been assigned to you Tim, and you should
> have the password notifications.

Thanks, Barry!

All, I'm out sick and can't deal with this now. The new mailing lists
are at:

    http://python.org/mailman/admin/spambayes-dev
    http://python.org/mailman/admin/spambayes-announce

spambayes-users doesn't exist; spambayes still does.

If someone else can move this along, please do. The admin password can
be gotten from me or Barry; Skip should also know it.

From bkc at murkworks.com Tue May 27 13:15:30 2003
From: bkc at murkworks.com (Brad Clements)
Date: Tue May 27 11:56:34 2003
Subject: [Spambayes] sharing wordlists - better numbers
In-Reply-To: <3ED37E17.9020307@parducci.net>
Message-ID: <3ED356E2.5229.2764644F@localhost>

On 27 May 2003 at 8:02, bill parducci wrote:

> this is a very interesting idea, but after working it through in my
> head, it doesn't seem to offer an architectural improvement over the
> existing system (even for 7000 users). this is not to say that the
> pursuit of commonality, etc. won't bear fruit down the road, but that i
> personally don't think the original intent will be served.
>
> just my two cents...

You expressed this sentiment last week, so I think you're up to 4 cents now.
;-)

My excuse continues to be, let's pass the first stage before worrying
about the technical issues of deployment. We may never get that far
anyway.

Another thought.. In the case of 7000 users, how many are really going
to bother to train? We know that a single person's weights probably
don't speak for the whole community, but does an average of weights of
a few members of the community represent the average of the weights of
the entire community?

In other words, for those orgs who want some control over their spam,
could the average weighting of 10 members out of 1000 reasonably
represent the average of all 1000 members?

Heh, I know there's a technical name for this.. the mean of a
sub-sample approaches the mean of the entire sample .. something like
that.

So I'm thinking .. suppose you allow people to keep their private
weights, but for those who just want "good enough" filtering, they use
a "synthesized database" which represents the "average" of the private
database weights.

Do you average the word weights across private databases before
scoring, or do you average the scores?

Just musing..
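
The two options in that last question can give different answers, which
a toy example makes concrete. The sketch below is an illustration only:
the per-user word probabilities are invented, and combine() is a simple
naive-Bayes style product rule standing in for the real SpamBayes
scoring, which it does not reproduce. All names here are hypothetical.

```python
def combine(probs):
    """Stand-in scorer: naive-Bayes style product rule.

    NOT the SpamBayes combiner; it only serves to turn a list of
    per-word spam probabilities into one message score. It assumes
    the probabilities are strictly between 0 and 1.
    """
    spam = 1.0
    ham = 1.0
    for p in probs:
        spam *= p
        ham *= 1.0 - p
    return spam / (spam + ham)


def average_weights_then_score(databases, words, unknown=0.5):
    """Average each word's probability across users, then score once."""
    averaged = [
        sum(db.get(word, unknown) for db in databases) / len(databases)
        for word in words
    ]
    return combine(averaged)


def score_then_average(databases, words, unknown=0.5):
    """Score the message against each user's database, then average."""
    scores = [
        combine([db.get(word, unknown) for word in words])
        for db in databases
    ]
    return sum(scores) / len(scores)


# Invented private databases for two users (word -> spam probability).
user_a = {"offer": 0.9, "meeting": 0.2}
user_b = {"offer": 0.6, "meeting": 0.4}
message = ["offer", "meeting"]

by_weights = average_weights_then_score([user_a, user_b], message)
by_scores = score_then_average([user_a, user_b], message)

if __name__ == "__main__":
    print("average weights, then score:", by_weights)
    print("score, then average:        ", by_scores)
```

Here the two results differ (roughly 0.56 versus 0.60), so which one to
use is something only testing against real mail could settle.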
-- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From noreply at sourceforge.net Tue May 27 10:01:30 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 27 12:01:45 2003 Subject: [Spambayes] [ spambayes-Bugs-740311 ] 'skip_max_word_size' error on XP Message-ID: Bugs item #740311, was opened at 2003-05-20 08:09 Message generated for change (Comment added) made by sjuerges You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Rony Steelandt (bucodi) Assigned to: Mark Hammond (mhammond) Summary: 'skip_max_word_size' error on XP Initial Comment: Installed the binary installer 002 on XP Pro No spamb buttons added Here is the logfile SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\rony.BUCODI\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? 
exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- Comment By: Sebastian Jürges (sjuerges) Date: 2003-05-27 16:01 Message: Logged In: YES user_id=740980 Same here. WinXP Service Pack 1 / OutlookXP ---------------------------------------------------------------------- Comment By: Harald Meyer (hmeyer666) Date: 2003-05-23 20:36 Message: Logged In: YES user_id=785713 Same here with Win2000 and Outlook XP ---------------------------------------------------------------------- Comment By: Stefan Franke (sfranke) Date: 2003-05-20 22:28 Message: Logged In: YES user_id=431616 I have the same one with Outlook2000 (SR-1) on Win2k. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 From bill at parducci.net Tue May 27 10:32:29 2003 From: bill at parducci.net (bill parducci) Date: Tue May 27 12:39:24 2003 Subject: [Spambayes] sharing wordlists - better numbers References: <3ED356E2.5229.2764644F@localhost> Message-ID: <3ED3931D.5040506@parducci.net> Brad Clements wrote: > You expressed this sentiment last week, so I think you're up to 4 cents now. ;-) ok, i'll give you three :) > My excuse continues to be, lets pass the first stage before worrying about the > technical issues of deployment. We may never get that far anyway. hey, in terms of problem definition i am with you all the way. there are many that would love to see a centralized solution that provides acceptable results. however, can you determine this via word count direct redundancy comparison? it would seem that the only way to effectively test for this would be to test an 'averaged' db against numerous spam/ham profiles. and see if it works well enough. 
the reason that i brought this up is that it seems like there is an increasing amount of work now going on to determine commonality, yet i cannot fathom an outcome that will solve your original question (a shallow thought pool to be sure ;-) > Another thought.. In the case of 7000 users, how many are really going to bother to > train? We know that a single person's weights probably don't speak for the whole > community, but does an average of weights of a few members of the community > represent the average of the weights of the entire community? > > In other words, for those orgs who want some control over their spam, could the > average weighting of 10 members out of 1000 reasonably represent the average of > all 1000 members? > > Heh, I know there's a technical name for this.. the mean of a sub-sample approaches > the mean of the entire sample .. something like that. > > So I'm thinking .. suppose you allow people to keep their private weights, but for > those who just want "good enough" filtering, they use a "synthesized database" which > represents the "average" of the private database weights. > > Do you average the word weights across private databases before scoring, or do you > average the scores? i am a stats numbskull i think you are going to find some unexpected results when you start using averages as the basis for decision making (average = dilution). i'd be willing to bet a peanut butter sandwich that as your 'sub sample' grows your results deteriorate, and that one person may actually be able to offer the best representation. :-P taken one step further, it may make sense to have a 'profile db' for a variety of user types (student, teacher, staff, IS, guest, etc.) whereby a single user db is used to make decisions for those who are not inclined to train but are of similar interests. then again maybe not :) as has been pointed out to me on a number of occasions 'only testing will tell'. 
;-)

bottom line: i am not trying to dissuade anyone from pursuing the
solution, just openly 'musing' as well.

b

From noreply at sourceforge.net Tue May 27 10:52:01 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue May 27 12:52:09 2003
Subject: [Spambayes] [ spambayes-Bugs-744380 ] W982E/Outlook 2000: exception on loading
Message-ID:

Bugs item #744380, was opened at 2003-05-27 09:51
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=744380&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Steve Clift (sclift)
Assigned to: Mark Hammond (mhammond)
Summary: W982E/Outlook 2000: exception on loading

Initial Comment:
Windows 98 2nd Edition
Outlook 2000 SR-1 - Corporate or Workgroup

SpamBayes throws an exception when loading. From the log file:

SpamAddin - Connecting to Outlook
pythoncom error: Failed to call the universal dispatcher

Traceback (most recent call last):
  File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch
  File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_
  File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_
  File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection
  File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager
  File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__
  File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory
  File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir
exceptions.LookupError: no codec search functions registered: can't find encoding

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=744380&group_id=61702

From noreply at sourceforge.net Tue May 27 10:50:38 2003
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue May 27 13:02:56 2003
Subject: [Spambayes] [ spambayes-Bugs-743113 ] Multiple profile issues
Message-ID:

Bugs item #743113, was opened at 2003-05-25 11:47
Message generated for change (Comment added) made by bwims
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743113&group_id=61702

Category: Outlook
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Mark Hammond (mhammond)
Assigned to: Mark Hammond (mhammond)
Summary: Multiple profile issues

Initial Comment:
The only problem I have is that I use multiple Outlook profiles. At
work, I use Exchange and at home I use a personal folder for my mail
delivery, which contains its own inbox, junk and maybe folders.

When I initiate Outlook with my work folder, it really screws up the
buttons. The manager button doesn't work, and I get a duplicate pair of
buttons appearing at the start of the standard toolbar. Fix is to
disable the COM object, exit, enter, re-enable COM object, (exit and
re-enter?).

The only thing I can do right now is to disable the COM object before
moving profiles. What would be great would be either to have a button
that toggles the functionality on/off, or (ideally) get it to recognise
the different profiles.

----------------------------------------------------------------------

Comment By: Brian Williams (bwims)
Date: 2003-05-27 16:50

Message:
Logged In: YES
user_id=626762

I had a look at addin.py and it seems like it ought to be easy enough
to specify a separate toolbar for the buttons. This might be the way
forward since the layout of the standard toolbar varies considerably
depending on which folder you are looking at, whereas special toolbars
do not, and it seems to be that part of the problem occurs when you
start off in the top level folder.
Best regards, Brian Williams ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743113&group_id=61702 From noreply at sourceforge.net Tue May 27 11:16:17 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 27 13:16:37 2003 Subject: [Spambayes] [ spambayes-Bugs-740843 ] No Disk Error with Outlook 2000 on startup Message-ID: Bugs item #740843, was opened at 2003-05-20 18:39 Message generated for change (Comment added) made by portola You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740843&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Sam Snow (snowsam) Assigned to: Mark Hammond (mhammond) Summary: No Disk Error with Outlook 2000 on startup Initial Comment: After installing SpamBayes-Outlook-Setup-002.exe I am now getting an error dialog on Outlook startup. The box says: (Header) Inbox - Microsoft Outlook:OUTLOOK.EXE - No Disk (Body) There is no disk in the drive. Please insert a disk into drive \Device\Harddisk0\DR0. (Buttons) Cancel, Try Again, Continue I am able to click cancel or continue several times and then outlook goes ahead and opens up. I just installed this evening, so I am not sure if the filtering is still working correctly. I was able to train the program successfully. I am using Office 2000 SP3 on Win 2000. I will try to attach a jpg of the dialog box.
My error log says the following: SpamAddin - Connecting to Outlook Loaded bayes database from 'C:\Documents and Settings\Snow1\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\Snow1\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.64165ms AntiSpam: Watching for new messages in folder Inbox AntiSpam: Watching for new messages in folder Spam Processing 0 missed spam in folder 'Inbox' took 31.9599ms pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 203, in OnItemAdd File "E:\src\spambayes\Outlook2000\addin.py", line 163, in ProcessMessage File "E:\src\spambayes\Outlook2000\filter.py", line 15, in filter_message File "E:\src\spambayes\Outlook2000\manager.py", line 440, in score File "e:\src\spambayes\spambayes\classifier.py", line 217, in chi2_spamprob File "e:\src\spambayes\spambayes\classifier.py", line 465, in _getclues File "e:\src\spambayes\spambayes\classifier.py", line 316, in probability exceptions.AssertionError: ---------------------------------------------------------------------- Comment By: Dennis Austin (portola) Date: 2003-05-27 10:16 Message: Logged In: YES user_id=787905 I also usually see this error when I start Outlook, although not every time. I also see it at the end of running the installer. In my configuration it shows up as "No disk in drive E:". E: is CD-ROM 1 on this machine. I can get past the error either by clicking Cancel several times, or by putting any old CD in the drive and clicking Try Again. 
The error does not seem to affect any function of the add-on. ---------------------------------------------------------------------- Comment By: Ferruccio Barletta (fgb) Date: 2003-05-25 07:40 Message: Logged In: YES user_id=786210 I may have found the root cause of this problem. When I brought up disk management on my notebook I noticed that my hard drive was Disk1 and the SD media drive was Disk0. When I disabled the SD drive and rebooted, the hard drive became Disk0 and the problem disappeared. ---------------------------------------------------------------------- Comment By: Ferruccio Barletta (fgb) Date: 2003-05-24 18:30 Message: Logged In: YES user_id=786210 I get the same error with Office 2002 SP1 on Windows XP SP1 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740843&group_id=61702 From david.scavo at kmaone.com Tue May 27 15:26:55 2003 From: david.scavo at kmaone.com (David Scavo) Date: Tue May 27 14:27:00 2003 Subject: [Spambayes] SpamBayes and Unread Flag Message-ID: <5834614BE3281C459CE02E8B5D46A6A0655C57@msmail.kmaone.com> Hi. I read about SpamBayes in an article in Infoworld. Previously I was using SpamAssassin, but I wanted to try out Bayesian filtering to see how it would compare. So far today, SPAMBayes has only let 1 of 18 SPAM messages through (pretty darn good). The real test is a weekend, when I get 250-350 SPAMs.. However, I do have a minor problem with SPAMBayes. I am running Windows XP SP1 with Outlook 2000 SR-1 (9.0.0.3821). I installed 002 using the binary installer. I also have ActiveSync 3.6 installed (for my IPAQ 3765 Pocket PC), but my IPAQ has not been connected today. I did a complete uninstall of SpamAssassin before installing SPAMBayes. The backend server is Exchange 2000 SP2. 
What is happening is that random messages near the top of my Inbox keep getting marked as unread again (so they go from normal type back to BOLD type). Sometimes the message does not change to read when you double-click and open it (ie it stays BOLD). Sometimes it changes to read when you open it, but then while the message is open it turns back to unread (ie BOLD). It seems to happen mostly when a new SPAM message comes in, but I have seen it happen even when no new SPAM arrives. I am unable to reproduce the problem at will, but it has happened at least 15-20 times today. If I go into Tools / Options / Other / Advanced Options / Com Addins and uncheck SpamBayes (and restart Outlook), I don't see the problem anymore.... I would be glad to supply whatever information would help you track down this problem. Other than this minor annoyance, SPAMBayes is working great..... Thanks. David Scavo KMA One Director of Technology From KLEMKE at clgf.com Tue May 27 14:49:28 2003 From: KLEMKE at clgf.com (KLEMKE@clgf.com) Date: Tue May 27 14:53:03 2003 Subject: [Spambayes] Any ideas? Message-ID: <8E0DF15288B8D311BC830008C74C1103F2A088@POSTMAN> I was using SpamAssassin and after reading the great article on SpamBayes in InfoWorld wanted to try the program. I uninstalled SpamAssassin and retrieved the binary from http://starship.python.net/crew/mhammond/spambayes/ and it appeared to install successfully. I am running Outlook 2000 SP-3 (9.0.0.6627) on a Windows 98 machine. When I start Outlook, there is no AntiSpam dropdown button that I can see. Is there anything else I should try? Thanks.
KL Kevin Lemke (701)787-6328 before 12:00 CT (701)772-0355 after 2:00 CT (701)787-7028 Fax klemke@clgf.com From Ken.Cornetet at kimball.com Tue May 27 15:35:54 2003 From: Ken.Cornetet at kimball.com (Ken Cornetet) Date: Tue May 27 15:36:28 2003 Subject: [Spambayes] Compatibility Message-ID: I installed on outlook 2002 (build 10.4705.4219 SP-2) running on Windows 2000 SP3 and it seems to work fine. From tim at fourstonesexpressions.com Tue May 27 15:52:55 2003 From: tim at fourstonesexpressions.com (Tim Stone) Date: Tue May 27 15:53:42 2003 Subject: [Spambayes] That great Infoworld article... Message-ID: Here's the url, in case anyone missed it... dated May 16. http://www.infoworld.com/article/03/05/16/20TCspam_1.html?s=tc -- c'est moi - TimS From zander at zan.com Tue May 27 14:01:48 2003 From: zander at zan.com (Zander) Date: Tue May 27 16:02:20 2003 Subject: [Spambayes] email parse_headers error References: Message-ID: <027701c3248a$d5668b60$a100a8c0@zlichstein> So - the fact that the HeaderParseError() exception isn't handled gracefully by pop3proxy.py has essentially rendered it useless for me. It simply throws an error and halts proxying the e-mail communication. I'm not sure what the plan is for this, but I wanted to see where current thinking had gone: 1. Is there a newer version of email which allows for more tolerance of these non-RFC-2822 compliant e-mails? I doubt it, because I would think that the intent of the package is to handle conformant messages. 2. Is there any thought to a simple pre-processing step which would make sure that the last line of headers is appropriately formed *and would fudge it* if not? (In my experience, this is 99% of the malformed headers: garbage/beginning of the message on the last line of headers without an extra newline) 3. How about an exception handler that tries to fix the header?
Possibly as simple as catching the parse error and iteratively inserting a newline at the last line of headers until it no longer throws the exception. 4. If not an exception handler that fixes - is there a simple mechanism that could work like a "pass" to allow this message through? As it is now, I need to manually RETR the message so that the rest can go through the proxy which is extremely annoying, so I wish I could revert back to the version I was using a few months ago (which worked ok, but simply left as "unsure" the ones where the parser was confused) - Z From tim at fourstonesexpressions.com Tue May 27 16:34:47 2003 From: tim at fourstonesexpressions.com (Tim Stone) Date: Tue May 27 16:36:42 2003 Subject: [Spambayes] email parse_headers error In-Reply-To: <027701c3248a$d5668b60$a100a8c0@zlichstein> References: <027701c3248a$d5668b60$a100a8c0@zlichstein> Message-ID: On Tue, 27 May 2003 13:01:48 -0700, Zander wrote: > 3. How about an exception handler that tries to fix the header? Possibly > as > simple as catching the parse error and iteratively inserting a newline at > the last line of headers until it no longer throws the exception. This has been examined, and is not practical. > > 4. If not an exception handler that fixes - is there a simple mechanism > that > could work like a "pass" to allow this message through. I had thought this was already the case. I'll make this correction...
c'est moi - TimS From noreply at sourceforge.net Tue May 27 15:19:29 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 27 17:19:39 2003 Subject: [Spambayes] [ spambayes-Bugs-744550 ] Outlook 2002 Tools/Customize dialog locks up Message-ID: Bugs item #744550, was opened at 2003-05-27 16:19 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=744550&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Raabe (mraabe) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2002 Tools/Customize dialog locks up Initial Comment: During any given Outlook session, once I display the Anti-Spam Manager dialog, I can no longer use Outlook's Tools/Customize command (for modifying menus, toolbars, etc.). The Tools/Customize dialog displays, but the dialog's window title is corrupted (instead of "Customize" it just says "C") and all clicks anywhere on the dialog or its buttons are ignored (they produce the Windows "Default Beep" sound but nothing else happens). This means there's no way to dismiss the dialog, and since it's modal, there's no way to do anything else with Outlook, including closing it. In these circumstances the Outlook process has to be terminated. When Outlook is restarted, Tools/Customize is once again fully functional as long as I don't visit the Anti-Spam Manager. I'm running the 002 plugin + Outlook 2002 SP2 under Win2000 SP3. I attach a logfile from a session where I start Outlook, invoke the problem, and then terminate the process after 60 seconds or so -- but the log doesn't look very informative in this case. 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=744550&group_id=61702 From noreply at sourceforge.net Tue May 27 15:29:54 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue May 27 17:29:59 2003 Subject: [Spambayes] [ spambayes-Feature Requests-744558 ] Add "Delete As Spam" button to Outlook inspector window Message-ID: Feature Requests item #744558, was opened at 2003-05-27 16:29 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=744558&group_id=61702 Category: Outlook Group: None Status: Open Priority: 5 Submitted By: Mark Raabe (mraabe) Assigned to: Mark Hammond (mhammond) Summary: Add "Delete As Spam" button to Outlook inspector window Initial Comment: The "Delete As Spam" button that the addin adds to the toolbar in Outlook's main explorer window should also be added to the toolbar in each email inspector window as well. I.e., when you double-click on an email and are viewing it in its own window, you should have the option right there to delete it as spam, instead of having to return to the explorer window to do so. It's of lesser importance, but it may make sense to add the "Anti-Spam" dropdown button to the inspector window as well. I tried adding the buttons myself, which caused me to discover bug #744550 (where Outlook's Tools/Customize dialog is broken). 
---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=744558&group_id=61702 From skip at pobox.com Tue May 27 17:32:12 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 27 17:32:19 2003 Subject: [Spambayes] Intersection of two databases Message-ID: <16083.55644.605271.500891@montanaro.dyndns.org> If you followed the thread about sharing wordlists in which I misunderstood why Brad Clements was interested in counting words in common, and are interested in the notion of a starter database, read on. I took two databases, mine and Alex Popiel's, and for each database extracted those keys with the following properties: * the key was not a hapax * the ham/spam probability landed it in the tails (p <= 0.2 or p >= 0.8) The decision to skip hapaxes was simply because since Alex and I each have fairly large word databases (mine has 8165 spam, 12315 ham and 392848 tokens, Alex's has 16384 spam, 6251 ham and 376495 tokens) I figured they wouldn't add much. The decision on the individual ham/spam thresholds to consider was largely arbitrary. After extracting the words which fit my two criteria, I used the Unix comm(1) command to intersect those two (sorted) lists. That left me with a common.words file containing 18725 tokens. The tokens included such obvious (for us) words like '__version__', 'acronym' and 'url:python' and more general words like 'synonym', 'switch' and 'acute'. Using that list, I then merged the corresponding entries from the two source databases. This resulted in a database listing 24999 spam, 18566 ham and containing (obviously) 18725 tokens. This database is of course also much smaller than either of the inputs (1.3MB vs. 21.0MB for each of the input databases). I then ran hammiefilter.py over a couple subsets of my old training data using each of the three databases. Data/Ham/Set1 contains 822 messages. Data/Spam/Set5 contains 365 messages. 
The table below compares the results of running those two sets through hammiefilter with each of the databases:

                 word database   ham   unsure   spam
Data/Ham/Set1    common          779       35      8
                 skip            810        4      8
                 alex            763       50      9

Data/Spam/Set5   common            1       26    338
                 skip              0        7    358
                 alex             24       66    275

I suspect Alex's database didn't do as well as either my database or the common database because the mail in my training collection is all mail sent to me. (Perhaps of greater concern should be that my database marked 8 hams as spam. ;-) If Alex is okay with the possibility, I'm willing to make the common database available in plain text form for others to try on their email collections. I'll run it through dbExpImp.py first so everyone should be able to use it. Skip From popiel at wolfskeep.com Tue May 27 15:42:56 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Tue May 27 17:43:01 2003 Subject: [Spambayes] Intersection of two databases In-Reply-To: Message from Skip Montanaro of "Tue, 27 May 2003 16:32:12 CDT." <16083.55644.605271.500891@montanaro.dyndns.org> References: <16083.55644.605271.500891@montanaro.dyndns.org> Message-ID: <20030527214257.F09542DDEF@cashew.wolfskeep.com> In message: <16083.55644.605271.500891@montanaro.dyndns.org> Skip Montanaro writes:
>
>I then ran hammiefilter.py over a couple subsets of my old training data
>using each of the three databases. Data/Ham/Set1 contains 822 messages.
>Data/Spam/Set5 contains 365 messages. The table below compares the results
>of running those two sets through hammiefilter with each of the databases:
>
>                 word database   ham   unsure   spam
>Data/Ham/Set1    common          779       35      8
>                 skip            810        4      8
>                 alex            763       50      9
>
>Data/Spam/Set5   common            1       26    338
>                 skip              0        7    358
>                 alex             24       66    275
>
>I suspect Alex's database didn't do as well as either my database or the
>common database because the mail in my training collection is all mail sent
>to me. (Perhaps of greater concern should be that my database marked 8 hams
>as spam.
;-) I also suspect that my database didn't do well on your mail because, well, it was your mail. If you're interested, I could run similar tests of your database against my mail, and see if a similar problem occurs. >If Alex is okay with the possibility, I'm willing to make the common >database available in plain text form for others to try on their email >collections. I'll run it through dbExpImp.py first so everyone should be >able to use it. That's fine with me. :-) If you do, I'll be one of the ones to grab it and test it. - Alex From seant at webreply.com Tue May 27 19:14:01 2003 From: seant at webreply.com (Sean True) Date: Tue May 27 18:14:24 2003 Subject: [Spambayes] Announcement: SpamAtBay Outlook Add-in Beta Candidate 7 Message-ID: For the new people on the list: SpamAtBay is a soon to be available commercial version of the Spambayes addin for Outlook. Any reader of this list who would like to beta test for us is welcome to a permanent license -- and anyone who is a SpamBayes developer can have the license without anything other than just asking. In either case, just email me. Brief history: I wrote the first, really, really bad integration with Outlook, using VBA and a Python COM server. Mark Hammond thought the idea was neat, but that the execution was, err, flawed, and pretty much rewrote it from scratch. Over the last few weeks, I've had a team of several people working on polishing, layout, docs, QA, and (gag) marketing oriented features. We're going to release a supported version sometime soon. We're focusing on Tim's "easy enough for my sister" metric.
Supported systems: Win98SE, ME, NT 4, Win2K, XP; Outlook 2000/2002/XP Not supported: Win95; Outlook Express Change list for Beta Candidate 7: 1) Fixed installer bug on Win98 2) Add version and timestamps to log and "Contact Support" 3) More useful progress bars for learning and quarantining 4) Performance tuning on database 5) Memory usage optimizations 6) Whitelist code no longer causes crashes during selections of large numbers of messages 7) Whitelist code survives missing domain 8) Databases are always stored after training 9) Moved/Copied/Untouched drop down restored to fullsize drop down 10) Store after every incremental message train 11) Add new tokenizer changes from Tim Peters 12) Clean up dialogs (not system menu style any more) 13) Fix 10 in 25,000 message parsing problem 14) Add system information to log and Contact Support mail 15) Fixed _obj.close() error after changing Spam folder name 16) Save config at end of setup wizard -- Sean From skip at pobox.com Tue May 27 18:16:34 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 27 18:16:39 2003 Subject: [Spambayes] Intersection of two databases In-Reply-To: <20030527214257.F09542DDEF@cashew.wolfskeep.com> References: <16083.55644.605271.500891@montanaro.dyndns.org> <20030527214257.F09542DDEF@cashew.wolfskeep.com> Message-ID: <16083.58306.496945.306171@montanaro.dyndns.org> >> If Alex is okay with the possibility, I'm willing to make the common >> database available in plain text form for others to try on their >> email collections. I'll run it through dbExpImp.py first so everyone >> should be able to use it. Alex> That's fine with me. :-) If you do, I'll be one of the ones to Alex> grab it and test it. The file common.export is available at http://www.musi-cal.com/~skip/common.export It's in the format exported by dbExpImp.py. You should be able to execute something like dbExpImp.py -i -D common.db -f common.export to generate a common.db file suitable for your particular environment. 
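For anyone wanting to build a similar starter database from their own data, the selection and merge steps described in this thread (skip hapaxes, keep tokens with p <= 0.2 or p >= 0.8, intersect, then add the counts) can be sketched as follows. This is a minimal sketch under stated assumptions, not the script that was actually used: each database is assumed to be a plain dict mapping token to (spam_count, ham_count), the function names are made up for illustration, and the probability is a simple frequency ratio rather than the adjusted estimate the SpamBayes classifier computes.

```python
def spam_probability(spam_count, ham_count, nspam, nham):
    """Simplified per-token spam probability from relative frequencies.

    Assumption: this omits the Bayesian adjustment the real classifier uses.
    """
    spam_ratio = spam_count / nspam if nspam else 0.0
    ham_ratio = ham_count / nham if nham else 0.0
    total = spam_ratio + ham_ratio
    return spam_ratio / total if total else 0.5


def strong_tokens(db, nspam, nham, low=0.2, high=0.8):
    """Tokens that are not hapaxes and whose probability lies in the tails."""
    keep = set()
    for token, (spam_count, ham_count) in db.items():
        if spam_count + ham_count <= 1:
            continue  # hapax: seen only once, so skipped
        p = spam_probability(spam_count, ham_count, nspam, nham)
        if p <= low or p >= high:
            keep.add(token)
    return keep


def merge_common(db_a, totals_a, db_b, totals_b):
    """Intersect the strong tokens of two databases and sum their counts.

    `totals_a` and `totals_b` are (nspam, nham) message counts.
    """
    common = strong_tokens(db_a, *totals_a) & strong_tokens(db_b, *totals_b)
    merged = {
        token: (db_a[token][0] + db_b[token][0],
                db_a[token][1] + db_b[token][1])
        for token in common
    }
    totals = (totals_a[0] + totals_b[0], totals_a[1] + totals_b[1])
    return merged, totals


if __name__ == "__main__":
    a = {"viagra": (10, 0), "python": (0, 12), "once": (1, 0), "meh": (5, 5)}
    b = {"viagra": (7, 1), "python": (1, 9), "once": (3, 0), "meh": (4, 4)}
    print(merge_common(a, (20, 20), b, (10, 10)))
```

Using a set intersection plays the role of comm(1) on the two sorted word lists; tokens that are strong in only one database are dropped, which is why the merged database ends up so much smaller than either input.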
If you can post a simple table like my previous post which compares the performance of this small database to your current database over some reasonable sample of ham and spam, that would be great. Skip From sgreen0 at charter.net Tue May 27 16:35:25 2003 From: sgreen0 at charter.net (Stephen Green) Date: Tue May 27 19:05:34 2003 Subject: [Spambayes] Installing SpamBayes Message-ID: I have been trying to install SpamBayes on my computer (running Windows 98SE and Outlook 2000). As you already know, the latest binary version won't install properly. I tried to install Python, then the SpamBayes package. I have only had partial success. In particular, I haven't seen any effect on Outlook - no SpamBayes buttons or menus. I got so far as to run the configuration via http://localhost:8880. The dialogues on that page imply that all is working. I've tried to run addin.py from the Outlook2000 folder of the SpamBayes program, but I can't get it installed. BTW, running these python programs opens DOS windows that shut when they're done, so I can't see if any error messages show up... Is there a particular order of programs I'm to run to get everything installed properly? Am I supposed to move any of the .py files to other locations (other than the Spambayes-1 folder created when it was unzipped)? Help!! Thanks. Stephen Green Long Beach, CA From mhammond at skippinet.com.au Wed May 28 10:11:57 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue May 27 19:12:50 2003 Subject: [Spambayes] Installing SpamBayes In-Reply-To: Message-ID: <1fe401c324a5$5f7c95f0$530f8490@eden> > I got so far as to run the configuration via > http://localhost:8880. The > dialogues on that page imply that all is working. Unfortunately, that is for a different product - that URL is for configuring the pop3proxy app, which is generally not used by Outlook users. > I've tried to run > addin.py from the Outlook2000 folder of the SpamBayes > program, but I can't > get it installed.
> BTW, running these python programs opens DOS > windows that > shut when they're done, so I can't see if any error messages show up... Open a "Command Prompt" window, and run "python addin.py" from there. That way the console will not close, and you will see any messages. > > Is there a particular order of programs I'm to run to get everything > installed properly? Am I supposed to move any of the .py > files to other > locations (other than the Spambayes-1 folder created when it > was unzipped)? Nope - just running "addin.py" is all you need. Mark. From kalin at dkdigital.com Tue May 27 17:42:04 2003 From: kalin at dkdigital.com (Dave Kalin) Date: Tue May 27 19:42:15 2003 Subject: [Spambayes] Thanks! Message-ID: Just a quick note from a fan to say that I just installed SpamBayes and it immediately got rid of all my junk. I had another Bayesian Spam Filter which only caught 26% of my spam (hardly worth it!) so I gave yours a try and it works perfectly.. Keep up the good work!!! -Dave Kalin, Los Angeles, CA From mhammond at skippinet.com.au Wed May 28 11:25:17 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue May 27 20:26:13 2003 Subject: [Spambayes] New mailing lists In-Reply-To: Message-ID: <201301c324af$9e0b6710$530f8490@eden> > Thanks, Barry! Thanks, Tim! > All, I'm out sick and can't deal with this now. The new > mailing lists are > at: > > http://python.org/mailman/admin/spambayes-dev > http://python.org/mailman/admin/spambayes-announce > > spambayes-users doesn't exist; spambayes still does. If > someone else can > move this along, please do. The admin password can be gotten > from me or > Barry; Skip should also know it. I propose we start using these immediately (well, within the next couple of days). spambayes@ becomes the user list, and starting now, all development issues should be taken to spambayes-dev. Bug reports should go to -dev (and I think I can change that myself at sourceforge).
ASAP, and certainly before any next releases, we can advertise and start using the -announce list. Starting immediately, we can casually tell users with problems to subscribe to this -announce list for the next version? Any objections? Mark. From seandarcy at hotmail.com Tue May 27 21:53:53 2003 From: seandarcy at hotmail.com (sean darcy) Date: Tue May 27 20:54:26 2003 Subject: [Spambayes] no classification in header; where is it? Message-ID: I'm trying sb in linux redhat 9 on evolution. It seems to be working great. When I go to the web interface training page it's done a remarkable job. But.... there's no classification info in evolution itself. I was expecting a HAM or SPAM in the message header, together with a probability, so evolution could sort it into folders. What'm I doing wrong? sean Here's bayescustomize.ini:

cat bayescustomize.ini
[pop3proxy]
add_mailid_to:header
listen_ports:110,120
remote_servers:mail.xxxxxxxxx.net,mail.yyyyyyyyyyyyyy.com
strip_incoming_mailids:True
notate_subject:True

[smtpproxy]
ham_address:ham@spam
spam_address:spam@spam
listen_ports:25
remote_servers:mail.xxxxxxxx.net

[globals]
verbose:True

[Headers]
include_score:True
include_thermostat:True

[Storage]
persistent_storage_file:/opt/spam/spambayes/data/hammie.db

[html_ui]
display_to:True

_________________________________________________________________ Add photos to your e-mail with MSN 8. Get 2 months FREE*. http://join.msn.com/?page=features/featuredemail From skip at pobox.com Tue May 27 21:01:05 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue May 27 21:01:31 2003 Subject: [Spambayes] New mailing lists In-Reply-To: <201301c324af$9e0b6710$530f8490@eden> References: <201301c324af$9e0b6710$530f8490@eden> Message-ID: <16084.2641.653244.797380@montanaro.dyndns.org> >> Barry; Skip should also know it. I believe I do. Mark> I propose we start using these immediately (well, within the next Mark> couple of days).
spambayes@ becomes the user list, and starting Mark> now, all development issues should be taken to spambayes-dev. Bug Mark> reports should go to -dev (and I think I can change that myself at Mark> sourceforge). Sounds like a good idea. Mark> ASAP, and certainly before any next releases, we can advertise and Mark> start using the -announce list. Starting immediately, we can Mark> casually tell users with problems to subscribe to this -announce Mark> list for the next version? Mark> Any objections? Nope. Skip From danimoor at hotpop.com Tue May 27 19:13:06 2003 From: danimoor at hotpop.com (Dan) Date: Tue May 27 21:05:37 2003 Subject: [Spambayes] Spambayes working win2k server, outlook2002, bin install Message-ID: Windows 2000 Server, SP3 Outlook 2002 (xp)- I just realized I don't have a clue if it's been service packed, duh... Version # is 10.2627.2625 Binary Installation Works! Filters set up and tested. Thanks! From mhammond at skippinet.com.au Wed May 28 12:06:54 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue May 27 21:07:50 2003 Subject: [Spambayes] New mailing lists In-Reply-To: <16084.2641.653244.797380@montanaro.dyndns.org> Message-ID: <002001c324b5$6e451ed0$530f8490@eden> > Mark> I propose we start using these immediately (well, within the next > Mark> couple of days). spambayes@ becomes the user list, and starting > Mark> now, all development issues should be taken to spambayes-dev. Bug > Mark> reports should go to -dev (and I think I can change that myself at > Mark> sourceforge). OK, I have taken that first step - all bug reports now go to spambayes-dev. Mark. From seandarcy at hotmail.com Tue May 27 22:15:56 2003 From: seandarcy at hotmail.com (sean darcy) Date: Tue May 27 21:16:32 2003 Subject: [Spambayes] web interface: 500 Server error when training Message-ID: It appears that this error only appears the first time you train sb. I've now trained it twice more, no problems.
sean From mjweaver at pacbell.net Tue May 27 19:17:43 2003 From: mjweaver at pacbell.net (Mike Weaver) Date: Tue May 27 21:17:54 2003 Subject: [Spambayes] Status with Outlook 2003 Message-ID: <0HFK00B45OWWP0@mta6.snfc21.pbi.net> Just curious about the status of support for Outlook 2003, I'm running beta2 and would certainly be willing to help test. Sincerely, Mike Weaver Director, Business Development ProxyLogic http://www.proxylogic.com DEVELOPMENT ON DEMAND Office - 818.865.8145 Mobile - 805.405.3898 From mhammond at skippinet.com.au Wed May 28 12:27:23 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue May 27 21:28:16 2003 Subject: [Spambayes] Status with Outlook 2003 In-Reply-To: <0HFK00B45OWWP0@mta6.snfc21.pbi.net> Message-ID: <002801c324b8$4b71d6c0$530f8490@eden> > Just curious about the status of support for Outlook 2003, > I'm running beta2 > and would certainly be willing to help test. Excellent. The best thing you can do is to grab Python, win32all and the source-code version (preferably via CVS so you have the latest), then "suck it and see". See the main SpamBayes pages for info on this. Report problems to SourceForge, and I will hopefully be able to nut through most of them. Mark. From anthony at interlink.com.au Wed May 28 12:46:16 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue May 27 21:47:03 2003 Subject: [Spambayes] New mailing lists In-Reply-To: <002001c324b5$6e451ed0$530f8490@eden> Message-ID: <200305280146.h4S1kHU15030@localhost.localdomain> >>> "Mark Hammond" wrote > OK, I have taken that first step - all bug reports now go to spambayes-dev. And the website now publicises all 4 lists. I also spell-checked a number of pages - probably a good habit to get into when putting large chunks of text up.
From mark at enynsp.org Wed May 28 00:05:01 2003 From: mark at enynsp.org (Mark S. Bergman) Date: Tue May 27 23:05:58 2003 Subject: [Spambayes] Updates? In-Reply-To: Message-ID: Is this going to be the best list for getting announcements of updates to the SpamBayes installer? I'm running it on Win2K and Outlook 2K with great results. I'd like to be able to stay up to date and offer feedback. Another friend installed it without creating a probable spam folder. He just lets it go to his inbox. This seems like a good approach to avoid misclassified spam. To date I have no false spams but plenty of mistaken probable spam classifications. Mark -----Original Message----- From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org]On Behalf Of spambayes-request@python.org Sent: Tuesday, May 27, 2003 8:26 PM To: spambayes@python.org Subject: Spambayes Digest, Vol 57, Issue 94 Send Spambayes mailing list submissions to spambayes@python.org To subscribe or unsubscribe via the World Wide Web, visit http://mail.python.org/mailman/listinfo/spambayes or, via email, send a message with subject or body 'help' to spambayes-request@python.org You can reach the person managing the list at spambayes-owner@python.org When replying, please edit your Subject line so it is more specific than "Re: Contents of Spambayes digest..." Today's Topics: 1. Re: email parse_headers error (Tim Stone) 2. [ spambayes-Bugs-744550 ] Outlook 2002 Tools/Customize dialog locks up (SourceForge.net) 3. [ spambayes-Feature Requests-744558 ] Add "Delete As Spam" button to Outlook inspector window (SourceForge.net) 4. Intersection of two databases (Skip Montanaro) 5. Re: Intersection of two databases (T. Alexander Popiel) 6. Announcement: SpamAtBay Outlook Add-in Beta Candidate 7 (Sean True) 7. Re: Intersection of two databases (Skip Montanaro) 8. Installing SpamBayes (Stephen Green) 9. RE: Installing SpamBayes (Mark Hammond) 10. Thanks! (Dave Kalin) 11.
RE: New mailing lists (Mark Hammond) ---------------------------------------------------------------------- Message: 1 Date: Tue, 27 May 2003 15:34:47 -0500 From: Tim Stone Subject: Re: [Spambayes] email parse_headers error To: Zander , spambayes@python.org Message-ID: Content-Type: text/plain; charset=iso-8859-15; format=flowed On Tue, 27 May 2003 13:01:48 -0700, Zander wrote: > 3. How about an exception handler that tries to fix the header? Possibly > as > simple as catching the parse error and iteratively inserting a newline at > the last line of headers until it no longer throws the exception. This has been examined, and is not practical. > > 4. If not a exception handler that fixes - is there a simple mechanism > that > could work like a "pass" to allow this message through. I had thought this was already the case. I'll make this correction... c'est moi - TimS ------------------------------ Message: 2 Date: Tue, 27 May 2003 14:19:29 -0700 From: "SourceForge.net" Subject: [Spambayes] [ spambayes-Bugs-744550 ] Outlook 2002 Tools/Customize dialog locks up To: noreply@sourceforge.net Message-ID: Bugs item #744550, was opened at 2003-05-27 16:19 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=744550&group_id =61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Raabe (mraabe) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2002 Tools/Customize dialog locks up Initial Comment: During any given Outlook session, once I display the Anti-Spam Manager dialog, I can no longer use Outlook's Tools/Customize command (for modifying menus, toolbars, etc.). 
The Tools/Customize dialog displays, but the dialog's window title is corrupted (instead of "Customize" it just says "C") and all clicks anywhere on the dialog or its buttons are ignored (they produce the Windows "Default Beep" sound but nothing else happens). This means there's no way to dismiss the dialog, and since it's modal, there's no way to do anything else with Outlook, including closing it. In these circumstances the Outlook process has to be terminated. When Outlook is restarted, Tools/Customize is once again fully functional as long as I don't visit the Anti-Spam Manager. I'm running the 002 plugin + Outlook 2002 SP2 under Win2000 SP3. I attach a logfile from a session where I start Outlook, invoke the problem, and then terminate the process after 60 seconds or so -- but the log doesn't look very informative in this case. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=744550&group_id =61702 ------------------------------ Message: 3 Date: Tue, 27 May 2003 14:29:54 -0700 From: "SourceForge.net" Subject: [Spambayes] [ spambayes-Feature Requests-744558 ] Add "Delete As Spam" button to Outlook inspector window To: noreply@sourceforge.net Message-ID: Feature Requests item #744558, was opened at 2003-05-27 16:29 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=744558&group_id =61702 Category: Outlook Group: None Status: Open Priority: 5 Submitted By: Mark Raabe (mraabe) Assigned to: Mark Hammond (mhammond) Summary: Add "Delete As Spam" button to Outlook inspector window Initial Comment: The "Delete As Spam" button that the addin adds to the toolbar in Outlook's main explorer window should also be added to the toolbar in each email inspector window as well. 
I.e., when you double-click on an email and are viewing it in its own window, you should have the option right there to delete it as spam, instead of having to return to the explorer window to do so. It's of lesser importance, but it may make sense to add the "Anti-Spam" dropdown button to the inspector window as well. I tried adding the buttons myself, which caused me to discover bug #744550 (where Outlook's Tools/Customize dialog is broken). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=744558&group_id =61702 ------------------------------ Message: 4 Date: Tue, 27 May 2003 16:32:12 -0500 From: Skip Montanaro Subject: [Spambayes] Intersection of two databases To: spambayes@python.org Message-ID: <16083.55644.605271.500891@montanaro.dyndns.org> Content-Type: text/plain; charset=us-ascii If you followed the thread about sharing wordlists in which I misunderstood why Brad Clements was interested in counting words in common, and are interested in the notion of a starter database, read on. I took two databases, mine and Alex Popiel's, and for each database extracted those keys with the following properties: * the key was not a hapax * the ham/spam probability landed it in the tails (p <= 0.2 or p >= 0.8) The decision to skip hapaxes was simply because since Alex and I each have fairly large word databases (mine has 8165 spam, 12315 ham and 392848 tokens, Alex's has 16384 spam, 6251 ham and 376495 tokens) I figured they wouldn't add much. The decision on the individual ham/spam thresholds to consider was largely arbitrary. After extracting the words which fit my two criteria, I used the Unix comm(1) command to intersect those two (sorted) lists. That left me with a common.words file containing 18725 tokens. 
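The extraction and intersection steps described above can be sketched in Python roughly as follows. This is only an illustration, not the script used for the experiment: the token -> (spam count, ham count) dictionary layout, the name tail_tokens, the toy data, and the simplified probability estimate are all assumptions, and the real SpamBayes classifier applies further adjustments when it computes word probabilities.

```python
def tail_tokens(db, nspam, nham, low=0.2, high=0.8):
    """Return non-hapax tokens whose estimated spam probability is in a tail.

    db maps token -> (spam_count, ham_count). This layout is an assumption
    made for the sketch, not the on-disk SpamBayes database format.
    """
    selected = set()
    for token, (spam_count, ham_count) in db.items():
        if spam_count + ham_count <= 1:
            continue  # hapax: the token was seen in only one trained message
        # Normalise by corpus size so unequal ham/spam totals do not skew p.
        spam_ratio = spam_count / nspam
        ham_ratio = ham_count / nham
        prob = spam_ratio / (spam_ratio + ham_ratio)
        if prob <= low or prob >= high:
            selected.add(token)
    return selected


# Toy databases, invented for illustration only.
skip_db = {"viagra": (40, 0), "python": (0, 50), "the": (30, 30), "once": (1, 0)}
alex_db = {"viagra": (25, 1), "python": (2, 10), "switch": (0, 5)}

# Set intersection plays the role of comm(1) on the two sorted word lists.
common = tail_tokens(skip_db, nspam=100, nham=100) & tail_tokens(alex_db, nspam=50, nham=50)
print(sorted(common))  # ['python', 'viagra']
```

In the toy data, "the" is dropped because its probability sits at 0.5, "once" is dropped as a hapax, and "switch" is dropped because it appears in only one of the two databases.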
The tokens included such obvious (for us) words like '__version__', 'acronym' and 'url:python' and more general words like 'synonym', 'switch' and 'acute'. Using that list, I then merged the corresponding entries from the two source databases. This resulted in a database listing 24999 spam, 18566 ham and containing (obviously) 18725 tokens. This database is of course also much smaller than either of the inputs (1.3MB vs. 21.0MB for each of the input databases). I then ran hammiefilter.py over a couple subsets of my old training data using each of the three databases. Data/Ham/Set1 contains 822 messages. Data/Spam/Set5 contains 365 messages. The table below compares the results of running those two sets through hammiefilter with each of the databases:

                word database   ham  unsure  spam
 Data/Ham/Set1  common          779      35     8
                skip            810       4     8
                alex            763      50     9

 Data/Spam/Set5 common            1      26   338
                skip              0       7   358
                alex             24      66   275

I suspect Alex's database didn't do as well as either my database or the common database because the mail in my training collection is all mail sent to me. (Perhaps of greater concern should be that my database marked 8 hams as spam. ;-) If Alex is okay with the possibility, I'm willing to make the common database available in plain text form for others to try on their email collections. I'll run it through dbExpImp.py first so everyone should be able to use it. Skip ------------------------------ Message: 5 Date: Tue, 27 May 2003 14:42:56 -0700 From: "T. Alexander Popiel" Subject: Re: [Spambayes] Intersection of two databases To: skip@pobox.com Cc: popiel@wolfskeep.com, spambayes@python.org Message-ID: <20030527214257.F09542DDEF@cashew.wolfskeep.com> In message: <16083.55644.605271.500891@montanaro.dyndns.org> Skip Montanaro writes: > >I then ran hammiefilter.py over a couple subsets of my old training data >using each of the three databases. Data/Ham/Set1 contains 822 messages. >Data/Spam/Set5 contains 365 messages.
The table below compares the results >of running those two sets through hammiefilter with each of the databases:
>
>                word database   ham  unsure  spam
> Data/Ham/Set1  common          779      35     8
>                skip            810       4     8
>                alex            763      50     9
>
> Data/Spam/Set5 common            1      26   338
>                skip              0       7   358
>                alex             24      66   275
>
>I suspect Alex's database didn't do as well as either my database or the >common database because the mail in my training collection is all mail sent >to me. (Perhaps of greater concern should be that my database marked 8 hams >as spam. ;-) I also suspect that my database didn't do well on your mail because, well, it was your mail. If you're interested, I could run similar tests of your database against my mail, and see if a similar problem occurs. >If Alex is okay with the possibility, I'm willing to make the common >database available in plain text form for others to try on their email >collections. I'll run it through dbExpImp.py first so everyone should be >able to use it. That's fine with me. :-) If you do, I'll be one of the ones to grab it and test it. - Alex ------------------------------ Message: 6 Date: Tue, 27 May 2003 18:14:01 -0400 From: "Sean True" Subject: [Spambayes] Announcement: SpamAtBay Outllok Add-in Beta Candidate 7 To: Message-ID: Content-Type: text/plain; charset="iso-8859-1" For the new people on the list: SpamAtBay is a soon to be available commercial version of the Spambayes addin for Outlook. Any reader of this list who would like to beta test for us is welcome to a permanent license -- and anyone who is a SpamBayes developer can have the license without anything other than just asking. In either case, just email me. Brief history: I wrote the first, really, really bad integration with Outlook, using VBA and a Python COM server. Mark Hammond thought the idea was neat, but that the execution was, err, flawed, and pretty much rewrote it from scratch.
Over the last few weeks, I've had a team of several people working on polishing, layout, docs, QA, and (gag) marketing oriented features. We're going to release a supported version sometime soon. We're focusing on Tim's "easy enough for my sister" metric. Supported systems: Win98SE, ME, NT 4, Win2K, XP; Outlook 2000/2002/XP Not supported: Win95; Outlook Express Change list for Beta Candidate 7: 1) Fixed installer bug on Win98 2) Add version and timestamps to log and "Contact Support" 3) More useful progress bars for learning and quarantining 4) Performance tuning on database 5) Memory usage optimizations 6) Whitelist code no longer causes crashes during selections of large numbers of messages 7) Whitelist code survives missing domain 8) Databases are always stored after training 9) Moved/Copied/Untouched drop down restored to fullsize drop down 10) Store after every incremental message train 11) Add new tokenizer changes from Tim Peters 12) Clean up dialogs (not system menu style any more) 13) Fix 10 in 25,000 message parsing problem 14) Add system information to log and Contact Support mail 15) Fixed _obj.close() error after changing Spam folder name 16) Save config at end of setup wizard -- Sean ------------------------------ Message: 7 Date: Tue, 27 May 2003 17:16:34 -0500 From: Skip Montanaro Subject: Re: [Spambayes] Intersection of two databases To: "T. Alexander Popiel" Cc: spambayes@python.org Message-ID: <16083.58306.496945.306171@montanaro.dyndns.org> Content-Type: text/plain; charset=us-ascii >> If Alex is okay with the possibility, I'm willing to make the common >> database available in plain text form for others to try on their >> email collections. I'll run it through dbExpImp.py first so everyone >> should be able to use it. Alex> That's fine with me. :-) If you do, I'll be one of the ones to Alex> grab it and test it. The file common.export is available at http://www.musi-cal.com/~skip/common.export It's in the format exported by dbExpImp.py. 
You should be able to execute something like dbExpImp.py -i -D common.db -f common.export to generate a common.db file suitable for your particular environment. If you can post a simple table like my previous post which compares the performance of this small database to your current database over some reasonable sample of ham and spam, that would be great. Skip ------------------------------ Message: 8 Date: Tue, 27 May 2003 15:35:25 -0700 From: "Stephen Green" Subject: [Spambayes] Installing SpamBayes To: Message-ID: Content-Type: text/plain; charset="iso-8859-1" I have been trying to install SpamBayes on my computer (running Windows 98SE and Outlook 2000). As you already know, the latest binary version won't install properly. I tried to install Python, then the SpamBayes package. I have only had partial success. In particular, I haven't seen any effect on Outlook - no SpamBayes buttons or menus. I got so far as to run the configuration via http:\\localhost:8880. The dialogues on that page imply that all is working. I've tried to run addin.py from the Outlook2000 folder of the SpamBayes program, but I can't get it installed. BTW, running these python programs open DOS windows that shut when they're done, so I can't see if any error message show up... Is there a particular order of programs I'm to run to get everything installed properly? Am I supposed to move any of the .py files to other locations (other that the Spambayes-1 folder created when it was unzipped)? Help!! Thanks. Stephen Green Long Beach, CA ------------------------------ Message: 9 Date: Wed, 28 May 2003 09:11:57 +1000 From: "Mark Hammond" Subject: RE: [Spambayes] Installing SpamBayes To: "'Stephen Green'" , Message-ID: <1fe401c324a5$5f7c95f0$530f8490@eden> Content-Type: text/plain; charset="iso-8859-1" > I got so far as to run the configuration via > http:\\localhost:8880. The > dialogues on that page imply that all is working. 
Unfortunately, that is for a different product - that URL is for configuring the pop3proxy app, which is generally not used by Outlook users. I've tried to run > addin.py from the Outlook2000 folder of the SpamBayes > program, but I can't > get it installed. BTW, running these python programs open DOS > windows that > shut when they're done, so I can't see if any error message show up... Open a "Command Prompt" window, and run "python addin.py" from there. That way the console will not close, and you will see any messages. > > Is there a particular order of programs I'm to run to get everything > installed properly? Am I supposed to move any of the .py > files to other > locations (other that the Spambayes-1 folder created when it > was unzipped)? Nope - just running "addin.py" is all you need. Mark. ------------------------------ Message: 10 Date: Tue, 27 May 2003 16:42:04 -0700 From: "Dave Kalin" Subject: [Spambayes] Thanks! To: Message-ID: Content-Type: text/plain; charset="iso-8859-1" Just a quick note from a fan to say that I just installed SpamBayes and it immediately got rid of all my junk. I had another Bayesiam Spam Filter which only caught 26% of my spam (hardly worth it!) so I gave yours a try and it works perfectly.. Keep up the good work!!! -Dave Kalin, Los Angeles, CA ------------------------------ Message: 11 Date: Wed, 28 May 2003 10:25:17 +1000 From: "Mark Hammond" Subject: RE: [Spambayes] New mailing lists To: "'Tim Peters'" , Message-ID: <201301c324af$9e0b6710$530f8490@eden> Content-Type: text/plain; charset="iso-8859-1" > Thanks, Barry! Thanks, Tim! > All, I'm out sick and can't deal with this now. The new > mailing lists are > at: > > http://python.org/mailman/admin/spambayes-dev > http://python.org/mailman/admin/spambayes-announce > > spambayes-users doesn't exist; spambayes still does. If > someone else can > move this along, please do. The admin password can be gotten > from me or > Barry; Skip should also know it. 
I propose we start using these immediately (well, within the next couple of days). spambayes@ becomes the user list, and starting now, all development issues should be taken to spambayes-dev. Bug reports should go to -dev (and I think I can change that myself at sourceforge). ASAP, and certainly before any next releases, we can advertise and start using the -announce list. Starting immediately, we can casually tell users with problems to subscribe to this -announce list for the next version? Any objections? Mark. ------------------------------ _______________________________________________ Spambayes mailing list Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes End of Spambayes Digest, Vol 57, Issue 94 ***************************************** From mail at peterbe.com Wed May 28 11:57:07 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Wed May 28 06:26:13 2003 Subject: [Spambayes] Bug again Message-ID: <5.2.0.9.0.20030528104445.01f06790@localhost> Now I get errors again. If you want to see the content of my spambayes folder, go to http://home.peterbe.com:8080/spambayes/ C:\Python22\spambayes>C:\Python22\python.exe pop3proxy.py Loading database... Done. 
Listener on port 110 is proxying mail.grenna.net:110 User interface url is http://localhost:8880/ error: uncaptured python exception, closing channel <__main__.ServerLineReader connected at 0xbdcb10> (exceptions.TypeError:len() of unsized object [C:\Python22\lib\asyncore.py|poll|99] [C:\Python22\lib\asyncore.py|handle_read_event|396] [C:\Python22\lib\asynchat.py|handle_read|130] [pop3proxy.py|found_terminator|146] [pop3proxy.py|onServerLine|214] [pop3proxy.py|onResponse|288] [pop3proxy.py|onTransaction|389] [pop3proxy.py|onRetr|458] [spambayes\message.py|as_string|199] [C:\Python22\Lib\site-packages\email\Message.py|as_string|109] [C:\Python22\Lib\site-packages\email\Generator.py|flatten|102] [C:\Python22\Lib\site-packages\email \Generator.py|_write|137] [C:\Python22\Lib\site-packages\email\Generator.py|_write_headers|183] [C:\Python22\Lib\site-packages\email\Header.py|encode|412] [C:\Python22\Lib\site-packages\email\Header.py|_split|297] [C:\Python22\Lib\site-packages\email\Charset.py|encoded_header_len|341]) From skip at pobox.com Wed May 28 08:36:28 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 28 08:36:40 2003 Subject: [Spambayes] Updates? In-Reply-To: References: Message-ID: <16084.44364.231578.307723@montanaro.dyndns.org> Mark> Is this going to be the best list for getting announcements of Mark> updates to the SpamBayes installer? Mark, The number of Spambayes-related lists just grew. There are now four: spambayes (this list) - the list for users of the package - get help, answer questions, etc. spambayes-dev - the list for developers of the package or those who want to see where it's going. spambayes-announce - the list for those who just want to know about releases and other earth-shattering announcements. spambayes-checkins - the list for those who want to see how the code is changing at the file level. Links to the subscription forms for each list are available on the home page: http://spambayes.sf.net/ about 3/4ths of the way down. 
Mark> to date I have no false spam's but plenty of mistaken probable Mark> spam classifications. By 'mistaken probable spam classifications' do you mean spam that was mistakenly classified as ham or unsure, or good mail that was classified as unsure? It helps to learn and use the lingo, mostly because those of us working on the package (usually) know what the terms mean. There is a short glossary at http://spambayes.sf.net/docs.html If you encounter terms you don't understand which aren't included there, just ask the list. If you mention that it's missing from the glossary (or better yet, file a bug report at SourceForge), someone will add it. Skip From mail at peterbe.com Wed May 28 14:38:36 2003 From: mail at peterbe.com (Peter Bengtsson) Date: Wed May 28 08:56:16 2003 Subject: [Spambayes] Impressed by SAProxy In-Reply-To: <3ED36719.4080700@olivermaunder.co.uk> References: <5.2.0.9.0.20030527135031.02271420@localhost> <5.2.0.9.0.20030527135031.02271420@localhost> Message-ID: <5.2.0.9.0.20030528133758.01f58590@mail.grenna.net> At 14:24 2003-05-27 +0100, Oliver Maunder wrote: >Peter Bengtsson wrote: > >>What other ways are there to start the pop3proxy? >>What I do is: >>Run -> "cmd" Enter -> "cd C:\Python22\spambayes\" -> >>"C:\Python22\python.exe pop3proxy.py" > >Create a shortcut with the command line: > >C:\Python22\python.exe C:\python22\spambayes\pop3proxy.py > >In the shortcut properties set "Start In" to c:\python22\spambayes. Then, >you can put this shortcut in your start menu, on the desktop, or in your >Startup group to make sure it's run when Windows boots. > >You can get rid of the console window by running the script with >pythonw.exe instead of python.exe. This does mean that you won't see the >console window output. I don't run pop3proxy, so I don't know if that is >important. Cool! I've never understood the difference between python.exe and pythonw.exe That will solve my problem. 
Peter >Olly > > > >> >>Alternativly I could write some little batch file that I stick in the >>Autostart. >> >>I will see what I can do with the taskbar gui demo app. >> >>Peter >> >>_______________________________________________ >>Spambayes mailing list >>Spambayes@python.org >>http://mail.python.org/mailman/listinfo/spambayes > > > > >_______________________________________________ >Spambayes mailing list >Spambayes@python.org >http://mail.python.org/mailman/listinfo/spambayes From dan.ball at retek.com Wed May 28 09:53:39 2003 From: dan.ball at retek.com (Ball, Dan) Date: Wed May 28 09:54:13 2003 Subject: [Spambayes] spambayes... Message-ID: Ya know one feature that I think might be cool for your project...adding the ability to bounce spam emails as undeliverable back to the sender. Just a thought. Dan Ball - Retek Application Support Web Developer/NT System Administrator Phone - 612 587-2143 Cell - 612 720-3409 dan.ball@retek.com From popiel at wolfskeep.com Wed May 28 08:39:32 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Wed May 28 10:39:35 2003 Subject: [Spambayes] spambayes... In-Reply-To: Message from "Ball, Dan" of "Wed, 28 May 2003 08:53:39 CDT." References: Message-ID: <20030528143932.A8F1C2DE3E@cashew.wolfskeep.com> In message: "Ball, Dan" writes: >Ya know one feature that I think might be cool for your project...adding = >the ability to bounce spam emails as undeliverable back to the sender. > >Just a thought. We've thought it a couple times, too... but given that most spammers these days don't accept incoming email, or (worse) forge the from and sender addresses, it's unlikely that it would do any good, and may well do some innocent much harm. One of my friends in college ended up on the receiving end of a huge amount of hate mail (the sheer volume of which incapacitated his system) because of spammer forgeries. 
- Alex From dan.ball at retek.com Wed May 28 10:43:52 2003 From: dan.ball at retek.com (Ball, Dan) Date: Wed May 28 10:56:41 2003 Subject: [Spambayes] spambayes... Message-ID: Ya, I thought of that possible outcome, fortunately, I don't know anyone that was ever one the recieving end of that situation, if I did, I may be much more sensitive to it as well. Good points, thanks for the response. Dan Ball - Retek Application Support Web Developer/NT System Administrator Phone - 612 587-2143 Cell - 612 720-3409 dan.ball@retek.com -----Original Message----- From: T. Alexander Popiel [mailto:popiel@wolfskeep.com] Sent: Wednesday, May 28, 2003 9:40 AM To: Ball, Dan Cc: spambayes@python.org; popiel@wolfskeep.com Subject: Re: [Spambayes] spambayes... In message: "Ball, Dan" writes: >Ya know one feature that I think might be cool for your project...adding = >the ability to bounce spam emails as undeliverable back to the sender. > >Just a thought. We've thought it a couple times, too... but given that most spammers these days don't accept incoming email, or (worse) forge the from and sender addresses, it's unlikely that it would do any good, and may well do some innocent much harm. One of my friends in college ended up on the receiving end of a huge amount of hate mail (the sheer volume of which incapacitated his system) because of spammer forgeries. - Alex From python-spambayes at discworld.dyndns.org Wed May 28 10:17:47 2003 From: python-spambayes at discworld.dyndns.org (Charles Cazabon) Date: Wed May 28 11:13:29 2003 Subject: [Spambayes] spambayes... In-Reply-To: ; from dan.ball@retek.com on Wed, May 28, 2003 at 09:43:52AM -0500 References: Message-ID: <20030528091747.A3679@discworld.dyndns.org> Ball, Dan wrote: > Ya, I thought of that possible outcome, fortunately, I don't know anyone > that was ever one the recieving end of that situation, if I did, I may be > much more sensitive to it as well. It's very common; if it's done maliciously, it's known as a joe-job. 
The last time it happened to us, we received forty million bounces in three days. Yes, please be sensitive to it -- and never try to "bounce" spam, it just adds to the problem. Charles -- ----------------------------------------------------------------------- Charles Cazabon GPL'ed software available at: http://www.qcc.ca/~charlesc/software/ -----------------------------------------------------------------------
From param at cs.wisc.edu Wed May 28 12:15:16 2003 From: param at cs.wisc.edu (Paramjit Oberoi) Date: Wed May 28 12:15:20 2003 Subject: [Spambayes] hammiebulk.py: Untrain bug Message-ID: <20030528111516.A27284@cs.wisc.edu> Untraining does not work since when the "-U" option is detected, the "untrain" variable is set to "1", overriding the function definition...

patch:

145c145
< untrain = 0
---
> untrain_mode = 0
169c169
< untrain = 1
---
> untrain_mode = 1
182c182
< if not untrain:
---
> if not untrain_mode:

From bkc at murkworks.com Wed May 28 15:06:26 2003 From: bkc at murkworks.com (Brad Clements) Date: Wed May 28 13:47:01 2003 Subject: [Spambayes] spambayes... - NDRs In-Reply-To: Message-ID: <3ED4C261.14820.2CF04840@localhost> On 28 May 2003 at 9:43, Ball, Dan wrote: > Ya, I thought of that possible outcome, fortunately, I don't know anyone > that was ever one the recieving end of that situation, if I did, I may be > much more sensitive to it as well. > Here's another thought. Suppose the MTA had spambayes support 'built-in'. It could reject the message after DATA by returning a 500 series response. That should generate an NDR at the sending MTA. If the spammer is running is own originating MTA (how many of them do) then they might be interested in getting truly invalid addresses off their lists. Musing .. -- Brad Clements, bkc@murkworks.com (315)268-1000 http://www.murkworks.com (315)268-9812 Fax http://www.wecanstopspam.org/ AOL-IM: BKClements From popiel at wolfskeep.com Wed May 28 12:08:19 2003 From: popiel at wolfskeep.com (T.
Alexander Popiel) Date: Wed May 28 14:08:23 2003 Subject: [Spambayes] spambayes... - NDRs In-Reply-To: Message from "Brad Clements" of "Wed, 28 May 2003 14:06:26 EDT." <3ED4C261.14820.2CF04840@localhost> References: <3ED4C261.14820.2CF04840@localhost> Message-ID: <20030528180819.CD1DE2DE3E@cashew.wolfskeep.com> In message: <3ED4C261.14820.2CF04840@localhost> "Brad Clements" writes: > >Suppose the MTA had spambayes support 'built-in'. > >It could reject the message after DATA by returning a 500 series response. >That should generate an NDR at the sending MTA. > >If the spammer is running is own originating MTA (how many of them do) then >they might be interested in getting truly invalid addresses off their lists. > >Musing .. Others have reported that spammers sophisticated enough to cull based on MTA returns ignore all results after confirmation of RCPT... paying attention to returns after that point just slows them down. - Alex From tys at cs.ualberta.ca Wed May 28 13:15:18 2003 From: tys at cs.ualberta.ca (Tys von Gaza) Date: Wed May 28 14:15:52 2003 Subject: [Spambayes] spambayes... - NDRs In-Reply-To: <3ED4C261.14820.2CF04840@localhost> Message-ID: <003301c32545$1856aad0$b5f2b38e@gaza> Bouncing spam was heavily discussed on this list starting on 26/11/2002. Worth taking a look at what was discussed, pretty sure MTA's were somewhere in the mix too. Thread 1: http://mail.python.org/pipermail/spambayes/2002-November/002307.html Thread 2: http://mail.python.org/pipermail/spambayes/2002-November/002329.html Thread 3: http://mail.python.org/pipermail/spambayes/2002-November/002331.html Tys von Gaza tys@cs.ualberta.ca > -----Original Message----- > From: spambayes-bounces@python.org [mailto:spambayes-bounces@python.org] > On Behalf Of Brad Clements > Sent: May 28, 2003 12:06 PM > To: spambayes@python.org > Subject: RE: [Spambayes] spambayes... 
- NDRs > > On 28 May 2003 at 9:43, Ball, Dan wrote: > > > Ya, I thought of that possible outcome, fortunately, I don't know anyone > > that was ever one the recieving end of that situation, if I did, I may > be > > much more sensitive to it as well. > > > > Here's another thought. > > Suppose the MTA had spambayes support 'built-in'. > > It could reject the message after DATA by returning a 500 series response. > That > should generate an NDR at the sending MTA. > > If the spammer is running is own originating MTA (how many of them do) > then they > might be interested in getting truly invalid addresses off their lists. > > Musing .. > > > > -- > Brad Clements, bkc@murkworks.com (315)268-1000 > http://www.murkworks.com (315)268-9812 Fax > http://www.wecanstopspam.org/ AOL-IM: BKClements > > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes From Arnie at 1568.com Wed May 28 12:41:32 2003 From: Arnie at 1568.com (Arnie Rowland) Date: Wed May 28 14:43:55 2003 Subject: [Spambayes] Enhancement Suggestion Message-ID: <41A6707794EAD311A5D6204C4F4F5020132E59@WESTWOOD01> It would be quite nice if the derived database could be saved to disk upon closing Outlook -and re-read upon opening. It would save the time and effort to re-train and rebuild the database every time the computer is powered off. Is there any movement toward an Exchange hosted version? Regards, Arnie Rowland, MCDBA, MCSE, MCT WestWood Consulting, Inc 1568 SW Westwood Dr. Portland, OR 97201-2757 503-246-6172 503-246-9972 (Fax) arnie@1568.com "They that can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety." -Ben Franklin *************************************** This message (including any attachments) contains confidential information intended for a specific individual and purpose, and is protected by law. 
If you are not the intended recipient, you should delete this message. Any disclosure, copying, or distribution of this message, or the taking of any action based on it, is strictly prohibited. From wsy at merl.com Wed May 28 15:43:25 2003 From: wsy at merl.com (wsy@merl.com) Date: Wed May 28 14:46:31 2003 Subject: [Spambayes] spambayes... In-Reply-To: (dan.ball@retek.com) References: Message-ID: <200305281843.h4SIhPA03702@localhost.localdomain> From: "Ball, Dan" Ya know one feature that I think might be cool for your project...adding the ability to bounce spam emails as undeliverable back to the sender. I've often wanted to do this as well. The problem is this: the "from" address on spam is almost always forged. That doesn't make the task impossible, just harder. You need to execute the bounce at SMTP transfer time, not at MDA execution time (which is where filters usually go). Basically, one needs to write an SMTP front end that does filtering and 400's the message during the SMTP protocol whenever it thinks it's spam. Good idea, hard to implement. - Bill Yerazunis From wsy at merl.com Wed May 28 15:51:10 2003 From: wsy at merl.com (wsy@merl.com) Date: Wed May 28 14:54:16 2003 Subject: [Spambayes] spambayes... - NDRs In-Reply-To: <20030528180819.CD1DE2DE3E@cashew.wolfskeep.com> (popiel@wolfskeep.com) References: <3ED4C261.14820.2CF04840@localhost> <20030528180819.CD1DE2DE3E@cashew.wolfskeep.com> Message-ID: <200305281851.h4SIpAI03725@localhost.localdomain> From: "T. Alexander Popiel" > >If the spammer is running is own originating MTA (how many of them do) then >they might be interested in getting truly invalid addresses off their lists. > >Musing .. Others have reported that spammers sophisticated enough to cull based on MTA returns ignore all results after confirmation of RCPT... paying attention to returns after that point just slows them down. Yes, it's called "pipelining" where they just stick the whole stream in without waiting for the proper responses. 
However, deeveeing the stream at SMTP time still works "right", as legitimate senders who are falsely rejected will still get an error message, and the spammers are still denied resources and viewer eyes. It's still a hard-to-do-but-may-be-worthwhile thing. -Bill Y. From zander at zan.com Wed May 28 13:40:28 2003 From: zander at zan.com (Zander) Date: Wed May 28 15:40:38 2003 Subject: [Spambayes] email parse_headers error References: <027701c3248a$d5668b60$a100a8c0@zlichstein> Message-ID: <016e01c32551$0137aaf0$a100a8c0@zlichstein> Not sure if you were under the impression that this was fixed, but I'm still getting errors that cause POP proxying to completely stop until I manually DELE the message: error: uncaptured python exception, closing channel <__main__.ServerLineReader connected at 0x86b069c> (email.Errors.HeaderParseError:Not a header, not a continuation: ``'' [//usr/lib/python2.2/asyncore.py|poll|99] [//usr/lib/python2.2/asyncore.py|handle_read_event|396] [//usr/lib/python2.2/asynchat.py|handle_read|130] [pop3proxy.py|found_terminator|146] [pop3proxy.py|onServerLine|214] [pop3proxy.py|onResponse|288] [pop3proxy.py|onTransaction|389] [pop3proxy.py|onRetr|436] [spambayes/message.py|setPayload|165] [/usr/lib/python2.2/site-packages/email/Parser.py|_parseheaders|128]) ------- snip offending message header -------- Return-Path: Delivered-To: zander@zan.com Received: (qmail 29472 invoked from network); 28 May 2003 17:23:36 -0000 Received: from unknown (HELO ca-arcdca-cuda1-c1m-69.arcdca.adelphia.net) (24.54.204.69) by swift.phpwebhosting.com with SMTP; 28 May 2003 17:23:36 -0000 From: "Lenore" To: "Lee" Subject: Recommended Content-Type: text/html; charset="windows-1251" .... ----------- /snip ----------------- Should I just shut up and file a bug report (or fix it myself?) - Z ----- Original Message ----- .... > I had thought this was already the case. I'll make this correction...
> > > c'est moi - TimS > From skip at pobox.com Wed May 28 17:18:23 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 28 17:18:32 2003 Subject: [Spambayes] FAQ updated Message-ID: <16085.10143.186656.280842@montanaro.dyndns.org> (Sending to both spambayes and spambayes-dev to catch all interested parties.) Folks, With a little assistance from Anthony Baxter, I updated the faq.ht file to automatically number both the table of contents and the main section with the answers. I also ran it through ispell and added a comment near the top to help people figure out how to add new content. If you see anything amiss, feel free to send me a correction or check it in yourself if you're so enabled. Skip From nas at python.ca Wed May 28 15:38:07 2003 From: nas at python.ca (Neil Schemenauer) Date: Wed May 28 17:35:11 2003 Subject: [Spambayes] spambayes... In-Reply-To: <200305281843.h4SIhPA03702@localhost.localdomain> References: <200305281843.h4SIhPA03702@localhost.localdomain> Message-ID: <20030528213807.GA27541@glacier.arctrix.com> wsy@merl.com wrote: > The problem is this: the "from" address on spam is almost always forged. > > That doesn't make the task impossible, just harder. You need to execute > the bounce at SMTP transfer time, not at MDA execution time (which is > where filters usually go). Right. Bouncing the message after the SMTP transaction does more harm than good. > Basically, one needs to write an SMTP front end that does filtering and > 400's the message during the SMTP protocol whenever it thinks it's spam. I have something for qmail: http://arctrix.com/nas/qmail/spambayes/ Greg Wand and I also have something for Exim (using elspy). I've placed a sanitized version here: http://arctrix.com/nas/exim/local_scan.py It should be generally usable with a little editing. Note that both of these implementations use my "spam delay" idea. I.e., if the message looks like spam, return a temporary 4xx error for a certain amount of time. 
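A rough Python sketch of the "spam delay" idea described above may help make it concrete. It is not taken from the qmail or Exim code linked above: the cutoff, the length of the delay, the message key, the 451 reply text, and the choice to accept the message once the delay has passed are all assumptions made for illustration.

```python
# Sketch only: temp-fail spammy-looking mail for a while, as in the
# "spam delay" idea. Every name and value here is hypothetical.
import time

SPAM_CUTOFF = 0.90          # assumed score at which a message "looks like spam"
DELAY_SECONDS = 4 * 3600    # assumed length of the delay window

first_seen = {}             # message key -> time of first delivery attempt


def smtp_response(key, score, now=None):
    """Return a (code, text) SMTP reply for a message with the given spam score.

    key is any hashable that identifies a retried message, for example a
    (sender, recipient) tuple; that choice is an assumption of this sketch.
    """
    now = time.time() if now is None else now
    if score < SPAM_CUTOFF:
        return 250, "OK"
    # Remember when this spammy-looking message was first offered.
    started = first_seen.setdefault(key, now)
    if now - started < DELAY_SECONDS:
        # Temporary failure: a real MTA will queue and retry later.
        return 451, "Temporary failure, please try again later"
    # Assumed behaviour: accept once the sender has kept retrying long enough.
    return 250, "OK (accepted after delay)"
```

A sender that retries properly gets through after the window, while software that never retries simply gives up; how long the window should be is a policy choice this sketch does not try to settle.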
The Exim version is more lenient because it is being used in a corporate setting. I've toyed with the idea of writing an SMTP proxy that does filtering using spambayes and then passes the message to a real MTA. Would people find this more useful than a MTA specific solution? Neil From Steven.Yarnell at Yarnell-Research.com Wed May 28 15:55:30 2003 From: Steven.Yarnell at Yarnell-Research.com (Steven Yarnell) Date: Wed May 28 17:55:43 2003 Subject: [Spambayes] Install failure Message-ID: <1273FD5DCFB2D211A64E00805F6515093C58FE@MARCH> Just an fyi: I have Outlook XP, SP2 and Spambayes will not install, though it is listed as a checked Com add-in. Thank you Steven Yarnell Inc Steven M. Yarnell, PhD | Yarnell Inc t. 407.876.8549 c. 321.689.2853 steven.yarnell@yarnell-research.com www.yarnell-research.com -------------- next part -------------- A non-text attachment was scrubbed... Name: emailSig_logo.gif Type: image/gif Size: 1107 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030528/ea7ebf00/emailSig_logo.gif From mhammond at skippinet.com.au Thu May 29 10:22:01 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed May 28 19:24:34 2003 Subject: [Spambayes] Enhancement Suggestion In-Reply-To: <41A6707794EAD311A5D6204C4F4F5020132E59@WESTWOOD01> Message-ID: <02cb01c3256f$f1b03380$530f8490@eden> > It would be quite nice if the derived database could be saved > to disk upon > closing Outlook -and re-read upon opening. It certainly is supposed to work that way - and indeed does for most people. Are you sure that Outlook is being completely shut down before you power off the machine? Either way, the latest versions in CVS actually save the database after a training operation rather than at shutdown, meaning this should be even less likely to happen again. > Is there any movement toward an Exchange hosted version? Nope. I'm not even sure what you mean by this. 
You mean a system where the per-user database is stored in exchange thereby giving the ability to "roam", or you mean a single database shared between all users and administered centrally? The former is more likely than the latter :) Mark. From buckwheat at videotron.ca Wed May 28 22:19:06 2003 From: buckwheat at videotron.ca (Michel Therien) Date: Wed May 28 21:20:01 2003 Subject: [Spambayes] (no subject) Message-ID: hi! I just installed the Spambayes plugin for outlook, it looks really good and I am looking forward to having more spam to see it work. However, I have a question, since I dont have much to do with spam,I would like to set it up so my spam folder is emptied without me having to do it, what I wonder is if the spam folder is emptied will I lose the "training" from all those messages? keep up the good work! and Thank You From skip at pobox.com Wed May 28 21:33:10 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed May 28 21:34:18 2003 Subject: [Spambayes] (no subject) In-Reply-To: References: Message-ID: <16085.25430.412240.563438@montanaro.dyndns.org> Michel> However, I have a question, since I dont have much to do with Michel> spam,I would like to set it up so my spam folder is emptied Michel> without me having to do it, what I wonder is if the spam folder Michel> is emptied will I lose the "training" from all those messages? Michel> keep up the good work! No, I don't believe you can, and it's a bad idea anyway. You'd eventually wind up deleting a message you wanted to see. You're probably better off sorting your Spam folder by Subject or author and skimming it quickly. Skip From popiel at wolfskeep.com Wed May 28 20:00:04 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Wed May 28 22:00:08 2003 Subject: [Spambayes] spambayes... In-Reply-To: Message from Neil Schemenauer of "Wed, 28 May 2003 14:38:07 PDT." 
<20030528213807.GA27541@glacier.arctrix.com> References: <200305281843.h4SIhPA03702@localhost.localdomain> <20030528213807.GA27541@glacier.arctrix.com> Message-ID: <20030529020004.EDBF72DE3E@cashew.wolfskeep.com> In message: <20030528213807.GA27541@glacier.arctrix.com> Neil Schemenauer writes: > >I've toyed with the idea of writing an SMTP proxy that does filtering >using spambayes and then passes the message to a real MTA. Would people >find this more useful than a MTA specific solution? I would, if I were interested in running such a thing. However, I'm not interested in running such, so I'm not sure if this counts as a vote or not... - Alex From tshumway-sf at jdiworks.net Wed May 28 20:03:11 2003 From: tshumway-sf at jdiworks.net (Terrel Shumway) Date: Wed May 28 22:03:19 2003 Subject: [Spambayes] spambayes... - NDRs In-Reply-To: <200305281851.h4SIpAI03725@localhost.localdomain> References: <3ED4C261.14820.2CF04840@localhost> <20030528180819.CD1DE2DE3E@cashew.wolfskeep.com> <200305281851.h4SIpAI03725@localhost.localdomain> Message-ID: <20030529020311.B5E2E63C25@smtp.us2.messagingengine.com> On Wed, 28 May 2003 14:51:10 -0400, wsy@merl.com said: > > It's still a hard-to-do-but-may-be-worthwhile thing. I do think it is worth trying. Combine it with some TMDA-like confirmation, and the risk of losing good mail to a FP goes even lower than it already is. 553 I am rejecting this because it looks like spam. If you think it isn't, follow the instructions at http://.... Keep it around for a week, then if no human confirms it .... No, it won't matter to spammers, but it will matter to people. From parnold at CuraGen.com Thu May 29 00:46:17 2003 From: parnold at CuraGen.com (Arnold, Paul) Date: Wed May 28 23:46:22 2003 Subject: [Spambayes] spambayes... Message-ID: <5A1D8FAF546576439E5E0BEE5E4E772AD246CF@ENTERPRISEA.CURAGEN.COM> Actually, this is what the commercial server-based products do. The most attractive option I have seen is to "tag" suspected spam. 
For example, I like the option where they insert "[SPAM]" or whatever string you like into the beginning of the subject line. We have a This way, the end client can set (easy) rules to move messages with this subject line to a folder, etc. Naturally, as a mail admin, the difficulty would be finding the "sweet" threshold for all users... Paul -----Original Message----- From: T. Alexander Popiel [mailto:popiel@wolfskeep.com] Sent: Wednesday, May 28, 2003 10:00 PM To: spambayes@python.org Subject: Re: [Spambayes] spambayes... In message: <20030528213807.GA27541@glacier.arctrix.com> Neil Schemenauer writes: > >I've toyed with the idea of writing an SMTP proxy that does filtering >using spambayes and then passes the message to a real MTA. Would people >find this more useful than a MTA specific solution? I would, if I were interested in running such a thing. However, I'm not interested in running such, so I'm not sure if this counts as a vote or not... - Alex _______________________________________________ Spambayes mailing list Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes LEGAL NOTICE: Unless expressly stated otherwise, this message is confidential and may be privileged. It is intended for the addressee(s) only. Access to this e-mail by anyone else is unauthorized. If you are not an addressee, any disclosure or copying of the contents or any action taken (or not taken) in reliance on it is unauthorized and may be unlawful. If you are not an addressee, please inform the sender immediately. From parnold at CuraGen.com Thu May 29 00:50:04 2003 From: parnold at CuraGen.com (Arnold, Paul) Date: Wed May 28 23:50:09 2003 Subject: [Spambayes] (no subject) Message-ID: <5A1D8FAF546576439E5E0BEE5E4E772AD0220D@ENTERPRISEA.CURAGEN.COM> Yes, you can. In Outlook, right click the folder and choose Properties. On the AutoArchive tab you can set a time limit (in days/weeks/months) and have the messages deleted permanently. 
The next thing you have to do is under Tools/Options set the AutoArchive to run every x days. (14 by default) HTH, Paul -----Original Message----- From: Michel Therien [mailto:buckwheat@videotron.ca] Sent: Wednesday, May 28, 2003 9:19 PM To: spambayes@python.org Subject: [Spambayes] (no subject) hi! I just installed the Spambayes plugin for outlook, it looks really good and I am looking forward to having more spam to see it work. However, I have a question, since I dont have much to do with spam,I would like to set it up so my spam folder is emptied without me having to do it, what I wonder is if the spam folder is emptied will I lose the "training" from all those messages? keep up the good work! and Thank You _______________________________________________ Spambayes mailing list Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes LEGAL NOTICE: Unless expressly stated otherwise, this message is confidential and may be privileged. It is intended for the addressee(s) only. Access to this e-mail by anyone else is unauthorized. If you are not an addressee, any disclosure or copying of the contents or any action taken (or not taken) in reliance on it is unauthorized and may be unlawful. If you are not an addressee, please inform the sender immediately. From hartmut.otting at topaz.ws Thu May 29 17:12:13 2003 From: hartmut.otting at topaz.ws (Hartmut Otting) Date: Thu May 29 00:12:21 2003 Subject: [Spambayes] Spambayes Compatibility Message-ID: Hi there, I've installed (with the Windows Installer) on W2K Service Pack 3 and Outlook XP Service Pack 2. There appear to be no problems with this configuration and it does the job perfectly. Terrific product. Hartmut Otting Managing Director Topaz Consulting Ltd. 
L11, BDS House, 86-90 Lambton Quay, PO Box 2319 Wellington, New Zealand Phone +64-4-472-1308 Mobile +64-21-620-038 Email mailto:hartmut.otting@topaz.ws Web www.topaz.ws This email and any attachments transmitted with it may contain information that is confidential and subject to legal professional privilege. If you are not the intended recipient you are notified that any use, dissemination, distribution or copying of this message or data is prohibited. All content is to be treated as confidential, unless otherwise specified, and is not to be forwarded to third parties without the prior permission of the author. To do so may breach the New Zealand Privacy Act 1993. If you have received this email in error, please delete it immediately and inform us of the mistake by return email. Thank you. From matthewd at cisco.com Thu May 29 15:16:19 2003 From: matthewd at cisco.com (Dickinson, Matthew) Date: Thu May 29 00:18:13 2003 Subject: [Spambayes] Spambayes problem: it stopped working - com ad in is disabled? wo n't let me re-enable... Message-ID: <11D2868F916DD411982D00508B694F430FC5E494@syd-xch2.cisco.com> Hi, Spambayes was working perfectly for me - it's the best spam filterer that I've ever used! So I've been very inundated with spam over the past week because it hasn't been working... I'm running the binary version. About a week ago Outlook pretty much hung and was hanging the whole PC, eventually it reported that an error had occured with an Add In (I'm assuming Spambayes as I have no other adins) and that it would disable it... so that it could start Outlook (it kept on crashing each time I tried to start it until I did this)... So once Outlook was back up I went into the com add in section and saw that it was there (just without the checkbox) so I checked the box hit ok etc... but no icon's appeared on the menu and my spam wasn't being filtered (ie it wasn't running)... so I went back to the com ad in area and noticed that it was unchecked again... 
I tried checking it again etc... and basically its as if it won't stay enabled... So I tried uninstalling it, then reinstalling it - same problem. I even tried upgrading to Outlook 2002, then installing it - same problem There is no log file generated from the session (the last log file says that Spambayes was registered successfully) Any ideas on how I can get it working? Thanks Matt From matthewd at cisco.com Thu May 29 15:22:55 2003 From: matthewd at cisco.com (Dickinson, Matthew) Date: Thu May 29 00:23:32 2003 Subject: [Spambayes] RE: Spambayes problem: it stopped working - com ad in is disabled ? won't let me re-enable... Message-ID: <11D2868F916DD411982D00508B694F430FC5E495@syd-xch2.cisco.com> PS: I'm running Windows 2000 Pro -----Original Message----- From: Dickinson, Matthew Sent: Thursday, 29 May 2003 2:16 PM To: 'spambayes@python.org' Subject: Spambayes problem: it stopped working - com ad in is disabled? won't let me re-enable... Hi, Spambayes was working perfectly for me - it's the best spam filterer that I've ever used! So I've been very inundated with spam over the past week because it hasn't been working... I'm running the binary version. About a week ago Outlook pretty much hung and was hanging the whole PC, eventually it reported that an error had occured with an Add In (I'm assuming Spambayes as I have no other adins) and that it would disable it... so that it could start Outlook (it kept on crashing each time I tried to start it until I did this)... So once Outlook was back up I went into the com add in section and saw that it was there (just without the checkbox) so I checked the box hit ok etc... but no icon's appeared on the menu and my spam wasn't being filtered (ie it wasn't running)... so I went back to the com ad in area and noticed that it was unchecked again... I tried checking it again etc... and basically its as if it won't stay enabled... So I tried uninstalling it, then reinstalling it - same problem. 
I even tried upgrading to Outlook 2002, then installing it - same problem There is no log file generated from the session (the last log file says that Spambayes was registered successfully) Any ideas on how I can get it working? Thanks Matt From mhammond at skippinet.com.au Thu May 29 15:33:21 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu May 29 00:34:12 2003 Subject: [Spambayes] Spambayes problem: it stopped working - com ad in isdisabled? wo n't let me re-enable... In-Reply-To: <11D2868F916DD411982D00508B694F430FC5E494@syd-xch2.cisco.com> Message-ID: <037d01c3259b$701b45e0$530f8490@eden> Unfortunately, I have no idea what could cause this. Certainly, re-installing the addin should work, and it should at least generate a log with the failure reason in this case. You could try running "regsvr32 SpamBayesAddin.dll" (or whatever it was I named that DLL ;) - hopefully you will get a "success" message. Then try re-starting Outlook and check for a log. About the only thing I can think of is that SpamBayes is failing when *creating* the log file - hence we see no errors. The only way I can see that happening is if your Windows temp directory doesn't exist - and I imagine much more than SpamBayes would get upset in that situation. Another option would be that the log files are somehow read-only - can you try deleting all old log files first? Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Dickinson, Matthew > Sent: Thursday, 29 May 2003 2:16 PM > To: spambayes@python.org > Subject: [Spambayes] Spambayes problem: it stopped working - com ad in > isdisabled? wo n't let me re-enable... > > > Hi, > > Spambayes was working perfectly for me - it's the best spam > filterer that > I've ever used! > > So I've been very inundated with spam over the past week > because it hasn't > been working... > > I'm running the binary version. 
> > About a week ago Outlook pretty much hung and was hanging the > whole PC, > eventually it reported that an error had occured with an Add In (I'm > assuming Spambayes as I have no other adins) and that it > would disable it... > so that it could start Outlook (it kept on crashing each time > I tried to > start it until I did this)... So once Outlook was back up I > went into the > com add in section and saw that it was there (just without > the checkbox) so > I checked the box hit ok etc... but no icon's appeared on the > menu and my > spam wasn't being filtered (ie it wasn't running)... so I > went back to the > com ad in area and noticed that it was unchecked again... I > tried checking > it again etc... and basically its as if it won't stay enabled... > So I tried uninstalling it, then reinstalling it - same problem. > > I even tried upgrading to Outlook 2002, then installing it - > same problem > > There is no log file generated from the session (the last log > file says that > Spambayes was registered successfully) > > Any ideas on how I can get it working? > > Thanks > > Matt > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > From m2 at plusseven.com Thu May 29 14:14:20 2003 From: m2 at plusseven.com (Alex Polite) Date: Thu May 29 07:14:27 2003 Subject: [Spambayes] multiple languages Message-ID: <20030529111420.GA584@matijek> I've been using spambayes in conjunction with procmail, mutt and fetchmail for some time. I'm very happy with this setup. There's one glitch though. 99% of my ham is in Swedish. 99% of my spam is in English. Because of this I get quite a number of false negatives written in Swedish and false positives written in English. Has this problem already been brought to your attention? I assume that it affects many European users.
alex -- Alex Polite http://plusseven.com/gpg From ballant at bnl.gov Thu May 29 08:24:37 2003 From: ballant at bnl.gov (Ballantyne, Ian) Date: Thu May 29 07:25:14 2003 Subject: [Spambayes] possible bug Message-ID: <28A2E0D6A920954ABBF13AF712CEBDB6024584A9@exchange05.bnl.gov> Using the latest binary for Windows 1.02a. If I select multiple messages in the possible spam folder and drag to the spam folder, the statistics displayed next to the "Train Now" button are not updated. I am unsure if the database is actually updated. Doing each message separately does update the statistics. My OS is w2k sp3 with all patches and using Outlook 2000. Other than that, is working great for me in the week that I have been using it. Zero false positives for me and getting better with possible spam as time goes on. Excellent piece of software. Let me know if you need me to supply any more information. Ian From rcharbon at mitre.org Thu May 29 08:58:52 2003 From: rcharbon at mitre.org (Ray Charbonneau) Date: Thu May 29 07:59:40 2003 Subject: [Spambayes] Documentation notes for IMAP Message-ID: <003e01c325d9$ac8600e0$2d025381@MITRE.ORG> The "about.html" files asks for documentation help, but doesn't supply any contact info. Anyhow... It would be useful to remind IMAP users of two things: -Be sure to purge your Inbox and Spam folders before training, so you don't inadvertantly train on mail that you've moved out of the folders. -The default ham folder is the local Inbox, not your IMAP Inbox. You'll have to select your IMAP Inbox before proceeding. It would be a good design choice to show the entire folder path, rather than just the folder "common name". This would help avoid the second issue. -- Ray Charbonneau R107 - Enterprise Desktop Solutions The MITRE Corporation From conway at pjm.com Thu May 29 10:22:54 2003 From: conway at pjm.com (conway@pjm.com) Date: Thu May 29 09:23:41 2003 Subject: [Spambayes] RE: Spambayes problem: it stopped working - com a d in is disabled ? 
won't let me re-enable... Message-ID: Matthew, I had a similar problem. Win2K, Outlook 2000, the add-in would not stay "checked". Reinstalling did not help. Try this: Go into the com add-ins screen, remove the add-in (remember where it is located), and then add it back in manually. In my case, Outlook crashed, but the add-in started working the next time I started Outlook. Since then, I've also had Outlook start up and complain about the add-in affecting "stability", but I'm able to tell Outlook to load it anyway, and it continues to work. Good Luck! Chuck Conway conway@pjm.com -----Original Message----- From: Dickinson, Matthew [mailto:matthewd@cisco.com] Sent: Thursday, May 29, 2003 12:23 AM To: spambayes@python.org Subject: [Spambayes] RE: Spambayes problem: it stopped working - com ad in is disabled ? won't let me re-enable... PS: I'm running Windows 2000 Pro -----Original Message----- From: Dickinson, Matthew Sent: Thursday, 29 May 2003 2:16 PM To: 'spambayes@python.org' Subject: Spambayes problem: it stopped working - com ad in is disabled? won't let me re-enable... Hi, Spambayes was working perfectly for me - it's the best spam filterer that I've ever used! So I've been very inundated with spam over the past week because it hasn't been working... I'm running the binary version. About a week ago Outlook pretty much hung and was hanging the whole PC, eventually it reported that an error had occured with an Add In (I'm assuming Spambayes as I have no other adins) and that it would disable it... so that it could start Outlook (it kept on crashing each time I tried to start it until I did this)... So once Outlook was back up I went into the com add in section and saw that it was there (just without the checkbox) so I checked the box hit ok etc... but no icon's appeared on the menu and my spam wasn't being filtered (ie it wasn't running)... so I went back to the com ad in area and noticed that it was unchecked again... I tried checking it again etc... 
and basically its as if it won't stay enabled... So I tried uninstalling it, then reinstalling it - same problem. I even tried upgrading to Outlook 2002, then installing it - same problem There is no log file generated from the session (the last log file says that Spambayes was registered successfully) Any ideas on how I can get it working? Thanks Matt _______________________________________________ Spambayes mailing list Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes From mhammond at skippinet.com.au Fri May 30 01:04:39 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu May 29 10:05:26 2003 Subject: [Spambayes] Documentation notes for IMAP In-Reply-To: <003e01c325d9$ac8600e0$2d025381@MITRE.ORG> Message-ID: <048201c325eb$3fa0f040$530f8490@eden> > The "about.html" files asks for documentation help, but doesn't supply > any contact info. Anyhow... The best thing to do would be to make any changes you feel necessary, and open a bug (or patch) on SourceForge with the modified/new files. Assign it directly to me for Outlook changes. Indeed, adding this tidbit would be a good first start. If you could do that, (a) I would really appreciate it, and (b) it is more likely to actually happen :) > It would be useful to remind IMAP users of two things: If you could make such a change, it would be great. > It would be a good design choice to show the entire folder > path, rather > than just the folder "common name". This would help avoid the second > issue. Yes, but the design of the dialog doesn't really allow for that. Sean has given back some code making it easier for me to work on new dialogs - which I hope to do after I get a new binary out. Please open a bug at sourceforge asking for the full folder path to be shown in the dialogs - again, that way it wont rely on my fuzzy memory. Mark. 
From mhammond at skippinet.com.au Fri May 30 01:07:57 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu May 29 10:08:25 2003 Subject: [Spambayes] possible bug In-Reply-To: <28A2E0D6A920954ABBF13AF712CEBDB6024584A9@exchange05.bnl.gov> Message-ID: <048501c325eb$b551e010$530f8490@eden> Thanks - that certainly does sound strange! I have added http://sourceforge.net/tracker/index.php?func=detail&aid=745518&group_id=617 02&atid=498103 Mark. > -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org]On Behalf Of Ballantyne, Ian > Sent: Thursday, 29 May 2003 9:25 PM > To: 'SpamBayes@python.org' > Subject: [Spambayes] possible bug > > > Using the latest binary for Windows 1.02a. If I select > multiple messages in > the possible spam folder and drag to the spam folder, the statistics > displayed next to the "Train Now" button are not updated. I > am unsure if the > database is actually updated. Doing each message separately > does update the > statistics. > > My OS is w2k sp3 with all patches and using Outlook 2000. > > Other than that, is working great for me in the week that I > have been using > it. Zero false positives for me and getting better with > possible spam as > time goes on. Excellent piece of software. > > Let me know if you need me to supply any more information. > > Ian > > > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes From joseph.warner at siemens.com Thu May 29 09:39:22 2003 From: joseph.warner at siemens.com (joseph.warner@siemens.com) Date: Thu May 29 10:41:05 2003 Subject: [Spambayes] Spambayes Install Error Message-ID: Hi, I'm running Windows 2000 SP3 & Outlook 2000 SP-3 and keep getting the error: C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL\OCX: DllRegisterServer failed; code 0x00000000. 
Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel the installation. This has appeared in a couple of places in the archives and I found a couple bug reports were filed but the bug references seem to be in a "dupe" loop: https://sourceforge.net/tracker/index.php?func=detail&aid=740198&group_id=61702&atid=498103 If there's an existing resolution/fix, can someone help me find the correct url? Thanks -- Joe ------------------------------------------------------------------------------- This message and any included attachments are from Siemens Medical Solutions USA, Inc. and are intended only for the addressee(s). The information contained herein may include trade secrets or privileged or otherwise confidential information. Unauthorized review, forwarding, printing, copying, distributing, or using such information is strictly prohibited and may be unlawful. If you received this message in error, or have reason to believe you are not authorized to receive it, please promptly delete this message and notify the sender by e-mail with a copy to CSOffice@smed.com. Thank you From parnold at CuraGen.com Thu May 29 11:48:45 2003 From: parnold at CuraGen.com (Arnold, Paul) Date: Thu May 29 10:48:51 2003 Subject: [Spambayes] Spambayes Install Error Message-ID: <5A1D8FAF546576439E5E0BEE5E4E772AD02211@ENTERPRISEA.CURAGEN.COM> 1) You have to grant the user local admin rights. 2) go to http://office.microsoft.com/ProductUpdates and make sure you are up to the latest SP for Outlook. After that, install SpamBayes again. 
If it fails again register the DLL manually with (if you used the defaults) regsvr32 "c:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll" HTH, Paul -----Original Message----- From: joseph.warner@siemens.com [mailto:joseph.warner@siemens.com] Sent: Thursday, May 29, 2003 10:39 AM To: spambayes@python.org Subject: [Spambayes] Spambayes Install Error Hi, I'm running Windows 2000 SP3 & Outlook 2000 SP-3 and keep getting the error: C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL\OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel the installation. This has appeared in a couple of places in the archives and I found a couple bug reports were filed but the bug references seem to be in a "dupe" loop: https://sourceforge.net/tracker/index.php?func=detail&aid=740198&group_id=61702&atid=498103 If there's an existing resolution/fix, can someone help me find the correct url? Thanks -- Joe ------------------------------------------------------------------------------- This message and any included attachments are from Siemens Medical Solutions USA, Inc. and are intended only for the addressee(s). The information contained herein may include trade secrets or privileged or otherwise confidential information. Unauthorized review, forwarding, printing, copying, distributing, or using such information is strictly prohibited and may be unlawful. If you received this message in error, or have reason to believe you are not authorized to receive it, please promptly delete this message and notify the sender by e-mail with a copy to CSOffice@smed.com. Thank you _______________________________________________ Spambayes mailing list Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes LEGAL NOTICE: Unless expressly stated otherwise, this message is confidential and may be privileged. It is intended for the addressee(s) only. 
Access to this e-mail by anyone else is unauthorized. If you are not an addressee, any disclosure or copying of the contents or any action taken (or not taken) in reliance on it is unauthorized and may be unlawful. If you are not an addressee, please inform the sender immediately. From skip at pobox.com Thu May 29 10:49:53 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 29 10:50:03 2003 Subject: [Spambayes] multiple languages In-Reply-To: <20030529111420.GA584@matijek> References: <20030529111420.GA584@matijek> Message-ID: <16086.7697.994833.828282@montanaro.dyndns.org> Alex> 99% of my ham is in Swedish. Alex> 99% of my spam is in English. Alex> Because of this I get quite a number of false negatives written in Alex> Swedish and false positives written in English. I believe it's been discussed a bit here, though not recently. In theory, if you train on a sufficient number of Swedish spam and English ham it should gather enough information to significantly reduce the FP/FN problem. Since the preponderance of your ham is Swedish it's going to take a fair bit of Swedish spam to offset that. For instance, in reality most of the words in your Swedish emails (all the common stuff) shouldn't be considered hammy or spammy, but since you get very little Swedish spam essentially every Swedish word is considered hammy. As you train on more and more Swedish spam, the common Swedish words will become much less strong ham indicators, leaving the uncommon words used in Swedish ham and Swedish spam as classifiers. The inverse will be true for English. Let me see if I can demonstrate using words from my own database. "information" is neither strongly hammy nor spammy:
>>> db["saved state"]
(5, 8165, 12315) # 8165 spam, 12315 ham
>>> db["information"]
(1335, 1030) # appears in 1335 spams, 1030 hams
while "viagra" clearly is spammy:
>>> db["viagra"]
(63, 2)
For you, "information" is probably a fairly spammy word (unless it's also a Swedish word).
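[Editorial sketch, not part of Skip's message.] To make the raw counts above easier to read, here is a minimal Python sketch that turns a (spam count, ham count) pair into a per-word spam probability. It assumes a Graham-style ratio with a Robinson-style adjustment toward a prior for rarely seen words; the function name and the parameter values 0.45 and 0.5 are assumptions chosen for illustration, not something stated in this thread.

```python
def word_spam_prob(spam_count, ham_count, nspam, nham,
                   unknown_word_strength=0.45, unknown_word_prob=0.5):
    """Estimate how spammy a word is from training counts.

    spam_count / ham_count: number of trained spams / hams containing the word.
    nspam / nham: total number of trained spams / hams.
    The two keyword parameters are assumed values for this sketch.
    """
    if nspam <= 0 or nham <= 0:
        raise ValueError("need at least one trained spam and one trained ham")
    n = spam_count + ham_count
    if n == 0:
        # Word never seen in training: fall back to the prior.
        return unknown_word_prob
    # Normalise by corpus size so an unbalanced corpus does not dominate.
    spam_ratio = spam_count / nspam
    ham_ratio = ham_count / nham
    raw = spam_ratio / (spam_ratio + ham_ratio)
    # Pull the estimate toward the prior; the pull fades as n grows.
    s, x = unknown_word_strength, unknown_word_prob
    return (s * x + n * raw) / (s + n)


# Counts quoted in the message above: 8165 spams and 12315 hams trained.
NSPAM, NHAM = 8165, 12315

if __name__ == "__main__":
    for word, (spam_count, ham_count) in {"information": (1335, 1030),
                                          "viagra": (63, 2)}.items():
        prob = word_spam_prob(spam_count, ham_count, NSPAM, NHAM)
        print(f"{word}: {prob:.3f}")
```

Under these assumptions "information" lands well short of either extreme while "viagra" comes out very close to 1, which matches the qualitative point being made. A word seen only a handful of times stays nearer the prior than one seen thousands of times, even when it has appeared exclusively in spam.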
Essentially all the Spanish email I receive is spam, so common Spanish words are relatively strong spam indicators for me:
>>> db["todos"]
(35, 0)
>>> db["nosotros"]
(5, 0)
Still, because I don't get very much Spanish spam, the raw numbers are rather small. This is what's happening to your English ham. Almost all common English words look spammy. I'm not sure there's an easy way out of this. If you've saved all your training messages you can try deleting a bunch (maybe 75%) of the Swedish ham and English spam from your database and retraining on the remaining messages. Then starting from that point, only train on the mistakes (messages which are completely misclassified or wind up marked "unsure"). This probably won't improve things immediately, but it should make it easier for Swedish spam or English ham to begin to tip the scales. It would be quite helpful if you could try this (or other schemes) out and let us know what - if anything - works for you. As you pointed out, as Spambayes gains more bilingual users this will probably become more of a problem. Having some semi-proven (or at least tried) techniques will be helpful. (I'm sure Tim Peters can state this all much more eloquently. Perhaps he'll step in and clear the air.) Skip From ja at sesi.org Thu May 29 11:53:02 2003 From: ja at sesi.org (jay adam) Date: Thu May 29 10:52:32 2003 Subject: [Spambayes] Outlook Express? Message-ID: <006301c325f2$0143f4d0$1500a8c0@tecra8100> Will there be an add-in for Outlook Express? Jay Adam From popiel at wolfskeep.com Thu May 29 08:56:15 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Thu May 29 10:56:19 2003 Subject: [Spambayes] multiple languages In-Reply-To: Message from Alex Polite of "Thu, 29 May 2003 13:14:20 +0200."
<20030529111420.GA584@matijek> References: <20030529111420.GA584@matijek> Message-ID: <20030529145615.7F3BC2DDC4@cashew.wolfskeep.com> In message: <20030529111420.GA584@matijek> Alex Polite writes: >I've been using spambayes in conjunction with procmail, mutt and >fetchmail for some time. I'm very happy with this setup. > >There's one glitch though. > >99% of my ham is in Swedish. >99% of my spam is in English. > >Because of this I get quite a number of false negatives written in >Swedish and false positives written in English. > >Has this problem already been brought to your attention? I assume that >it affects many European users. The problem was mentioned briefly in theory, but not really actively discussed (to the best of my knowledge). I think the best way to handle it from the algorithm point of view is to have three classifiers: one to distinguish Swedish from English, then one each to distinguish ham from spam in each language. This may help the false conclusion rates... but at the cost of much more complicated training and maintenance. Actually, the more that I think about it, the less I like the above suggestion. It's got too much of a burden on the user for maintaining multiple classification types... and a major goal of spambayes is to make stuff simple. Blah. Heck, if the amount of ham/spam in each language is very out of balance, the above trick might not help anyway, just due to that imbalance. Ugly. Nasty problem. Yuck. - (another) Alex From skip at pobox.com Thu May 29 10:57:36 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu May 29 10:57:52 2003 Subject: [Spambayes] Outlook Express? In-Reply-To: <006301c325f2$0143f4d0$1500a8c0@tecra8100> References: <006301c325f2$0143f4d0$1500a8c0@tecra8100> Message-ID: <16086.8160.900585.924609@montanaro.dyndns.org> jay> Will there be an add-in for Outlook Express? Probably not.
See: http://spambayes.sourceforge.net/faq.html#outlookexpress Skip From rfredericks at nssolutions.com Thu May 29 12:09:35 2003 From: rfredericks at nssolutions.com (Rob Fredericks) Date: Thu May 29 11:20:39 2003 Subject: [Spambayes] Bug 725449 Message-ID: <000101c325f4$53cb82c0$8e01000a@nssolutions.com> I'm experiencing the same behavior as this reported bug. I read in the bug file about what appears to be a workaround. I have Win98 SE 4.10.2222 A and Outlook 2000 SR-1. Is there a workaround and how can I use it? Thanks Rob Robert W. Fredericks Vice President, COO/CFO Network Storage Solutions, Inc. 14020 Thunderbolt Place Suite 500 Chantilly, VA 20151 (703)834-2254 (703)834-2221 Fax From francois.granger at free.fr Thu May 29 18:34:39 2003 From: francois.granger at free.fr (Francois Granger) Date: Thu May 29 11:34:46 2003 Subject: [Spambayes] multiple languages Message-ID: (Sorry, I mishandled the To: field of this list!) At 09:49 -0500 on 29/05/2003, in message Re: [Spambayes] multiple languages, Skip Montanaro wrote: > Alex> 99% of my ham is in Swedish. > Alex> 99% of my spam is in English. > > Alex> Because of this I get quite a number of false negatives written in > Alex> Swedish and false positives written in english. > >I believe it's been discussed a bit here, though not recently. I raised the issue here a long time ago and did not get a really good answer from Tim. >I'm not sure there's an easy way out of this. If you've saved all your >training messages you can try deleting a bunch (maybe 75%) of the Swedish >ham and English spam from your database and retrain on the remaining >messages. Then starting from that point, only train on the mistakes >(messages which are completely misclassified or wind up marked "unsure"). >This probably won't improve things immediately, but it should make it easier >for Swedish spam or English ham to begin to tip the scales. I am French. I have a similar problem to the one stated here.
I get some occasional Spanish and Portuguese spam in addition. I am using the pop3proxy version. I have been using various versions of SpamBayes since Sept 2002. My current database was created on 1 Feb 2003. I trained on some (100) messages to start with, then trained mostly on unsure and misclassified messages. I kept an eye on the balance of ham/spam, and tried to put some English ham in the training set whenever I trained an English unsure message as spam, and the same for the other combination. I have now trained on Spam: 639 Ham: 486. The success rate has been astonishing for a long time. I get only a few unsure and no misclassified messages in either language. -- Hofstadter's Law : It always takes longer than you expect, even when you take into account Hofstadter's Law. From joseph.warner at siemens.com Thu May 29 11:33:06 2003 From: joseph.warner at siemens.com (joseph.warner@siemens.com) Date: Thu May 29 12:34:01 2003 Subject: [Spambayes] Re: Spambayes Install Error Message-ID: >1) You have to grant the user local admin rights. That was the problem: though my account had administrative rights, I still had to sign on with the admin account, run the install, log out, log in with my account and rerun the install. After doing that the "Anti-Spam dropdown button" appeared in my Outlook toolbar and I was able to configure it. Thanks for your help and very quick response!
-- Joe ----- Forwarded by Joe Warner/SMS on 05/29/2003 10:28 AM ----- Joe Warner 05/29/2003 08:39 AM To: spambayes@python.org cc: Subject: Spambayes Install Error
From bill at parducci.net Thu May 29 10:41:29 2003 From: bill at parducci.net (bill parducci) Date: Thu May 29 12:41:34 2003 Subject: [Spambayes] website/list suggestion References: <006301c325f2$0143f4d0$1500a8c0@tecra8100> <16086.8160.900585.924609@montanaro.dyndns.org> Message-ID: <3ED63839.30404@parducci.net> wondering if the 'Email Us' link on the left-hand nav bar would be more effective if it were named 'Contact Us' and it linked to a page that had the new mailing lists? in addition to showing each of the lists and the corresponding purpose descriptors, it could be laden with the usual directions to 'read the docs, think hard and consult psychics before posting a question' kinda stuff. something like the attached for example... b Skip wrote: > Probably not. See: > > http://spambayes.sourceforge.net/faq.html#outlookexpress > From papaDoc at videotron.ca Thu May 29 15:09:08 2003 From: papaDoc at videotron.ca (papaDoc) Date: Thu May 29 14:09:14 2003 Subject: [Spambayes] Pop3proxy and mime Message-ID: <3ED64CC4.1010207@videotron.ca> Hi Alex, In bug [ 737967 ] Malformed messages break pop3proxy (et al) You were saying that the problem was due to the malformed mime part. Do you know what is malformed? Because now I can't use SB, since I'm receiving a virus (the same mail many times) that causes pop3proxy to hang. I might try to solve the problem if I know where to look. Remi P.S. Do you know how to send only one message to pop3proxy so I can test it on the command line? From tim.one at comcast.net Thu May 29 15:22:50 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 29 14:28:23 2003 Subject: [Spambayes] multiple languages In-Reply-To: <16086.7697.994833.828282@montanaro.dyndns.org> Message-ID: [Skip Montanaro] > ... > (I'm sure Tim Peters can state this all much more eloquently. Perhaps > he'll step in and clear the air.) Nope! It's a statistical classifier with no semantic knowledge, and you already explained the consequences of that.
We did no research on non-English languages here, or on mixtures, and nobody has volunteered work in those areas (although people whose primary language isn't English have tested and used it, and most seem to be happy despite our benign neglect). When I get a msg in, e.g., Polish, I instantly think "hey, that must be spam!". So does spambayes. Even if it *were* ham, spambayes would have no way to know that, because I don't know that, and so I've never trained any Polish msg as ham. It only knows what you tell it, which is sometimes even less than you know . From popiel at wolfskeep.com Thu May 29 12:43:27 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Thu May 29 14:43:32 2003 Subject: [Spambayes] Re: Pop3proxy and mime In-Reply-To: Message from papaDoc of "Thu, 29 May 2003 14:09:08 EDT." <3ED64CC4.1010207@videotron.ca> References: <3ED64CC4.1010207@videotron.ca> Message-ID: <20030529184327.64E3A2DDC4@cashew.wolfskeep.com> In message: <3ED64CC4.1010207@videotron.ca> papaDoc writes: >Hi Alex, > >In bug > > > [ 737967 ] Malformed messages break pop3proxy (et al) > > >You were saying that the problem was due to the malformed mime part. > >Do you know what is malformed ? The way I got it to break was by having a mime section where either there were no headers and no blank line before the section body, or there were headers and there was no blank line between the headers and the body. >Because now I can't use SB since I'm receiving virus (many time the >same mail) that cause pop3proxy to hang. > >I might try to solve the problem if I know where to look. > > >Remi > >P.S. Do you know how to send only one message to pop3proxy so I can test >it on the command line ? The only easy way I can think of offhand is to create a new mailbox and send only one mail message to it. 
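[Editorial sketch, not part of Alex's message.] The malformed case described above (a MIME part with headers but no blank line before its body) can be reproduced in isolation without a mailbox. The Python sketch below builds such a message by hand and parses it defensively, returning None as a "skip this message" signal if parsing raises. It uses the standard-library email package of current Python 3, which is an assumption on my part: it is not the parser version pop3proxy was running at the time, and the helper name parse_or_skip is hypothetical.

```python
import email
from email.message import Message
from typing import Optional

# A hand-built multipart message whose only part has a header line
# followed directly by body text, with no blank separator line.
MALFORMED = (
    "From: sender@example.com\n"
    "Subject: malformed MIME test\n"
    "MIME-Version: 1.0\n"
    'Content-Type: multipart/mixed; boundary="XYZ"\n'
    "\n"
    "--XYZ\n"
    "Content-Type: text/plain\n"
    "this part has headers but no blank line before its body\n"
    "--XYZ--\n"
)


def parse_or_skip(raw: str) -> Optional[Message]:
    """Parse raw message text; return None instead of propagating an error."""
    try:
        return email.message_from_string(raw)
    except Exception as exc:  # deliberately broad: any parse failure means skip
        print(f"skipping unparseable message: {exc!r}")
        return None


if __name__ == "__main__":
    msg = parse_or_skip(MALFORMED)
    if msg is None:
        print("message skipped")
    else:
        for part in msg.walk():
            # Problems the parser noticed are recorded per part.
            print(part.get_content_type(), part.defects)
```

A caller that gets None back can pass the original bytes through unclassified rather than hanging, which is one possible answer to the "skip this message" question raised elsewhere in this thread.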
- Alex From mrhodes at dc.com Thu May 29 16:26:35 2003 From: mrhodes at dc.com (Rhodes, Michael E (US - North Canton)) Date: Thu May 29 15:30:22 2003 Subject: [Spambayes] Tested on Outlook 2002 with Win2k Message-ID: <6FA22478F39C3640910C711FCDBD2BCA481C3E@amew2exmb06.global.braxton.com> FYI, I attempted to install SpamBayes and got the following error. This was a configuration that http://spambayes.sourceforge.net/windows.html reported as "Unknown". Per your request, I've sent the details of the error and config. <> OS: Win2000 5.00.2195 Service Pack 2 Client: Outlook 2002 (10.2627.2625) Installer: Binary - SpamBayes-Outlook-Setup-002.exe BTW, sweet concept and product! I *am* using it on my personal, home PC. Regards, Michael Rhodes Deloitte Consulting | North Canton Voice 330.490.5412 Fax 330.490.5420 Email mrhodes@dc.com ----------------------------------------------------------------------- This message (including any attachments) contains confidential information intended for a specific individual and purpose, and is protected by law. If you are not the intended recipient, you should delete this message. Any disclosure, copying, or distribution of this message, or the taking of any action based on it, is strictly prohibited. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/bmp Size: 87146 bytes Desc: Picture (Metafile) Url : http://mail.python.org/pipermail/spambayes/attachments/20030529/84d29ba4/attachment-0001.bin From parnold at CuraGen.com Thu May 29 16:54:07 2003 From: parnold at CuraGen.com (Arnold, Paul) Date: Thu May 29 15:54:13 2003 Subject: [Spambayes] Tested on Outlook 2002 with Win2k Message-ID: <5A1D8FAF546576439E5E0BEE5E4E772AD02216@ENTERPRISEA.CURAGEN.COM> You probably do not have local admin rights on the account you are installing with. 
From zander at zan.com Thu May 29 13:56:40 2003 From: zander at zan.com (Zander) Date: Thu May 29 15:56:50 2003 Subject: [Spambayes] Re: Spambayes Digest, Vol 57, Issue 101 References: Message-ID: <00e801c3261c$6f36ada0$a100a8c0@zlichstein> > >Because now I can't use SB since I'm receiving virus (many time the > >same mail) that cause pop3proxy to hang. > > > >I might try to solve the problem if I know where to look. > > > > > >Remi The problem is definitely that there are no exception handlers around the lines: prs._parseheaders(self, fp) and prs._parsebody(self, fp) But I'm just not sure how to pass a "skip this message" back up the chain properly... - Z
From mrhodes at dc.com Thu May 29 17:05:19 2003 From: mrhodes at dc.com (Rhodes, Michael E (US - North Canton)) Date: Thu May 29 16:06:38 2003 Subject: [Spambayes] Tested on Outlook 2002 with Win2k Message-ID: <6FA22478F39C3640910C711FCDBD2BCA481C3F@amew2exmb06.global.braxton.com> I've double-checked and I am in the Administrators group. Thanks.
From mrhodes at dc.com Thu May 29 17:07:36 2003 From: mrhodes at dc.com (Rhodes, Michael E (US - North Canton)) Date: Thu May 29 16:08:36 2003 Subject: [Spambayes] Tested on Outlook 2002 with Win2k Message-ID: <6FA22478F39C3640910C711FCDBD2BCA481C41@amew2exmb06.global.braxton.com> FYI, I attempted this again but to my c: drive instead of d: and it worked. No other changes to the install parms or my system were made. Thanks again!
From prabhu at aero.iitm.ernet.in Fri May 30 03:45:36 2003 From: prabhu at aero.iitm.ernet.in (Prabhu Ramachandran) Date: Thu May 29 17:16:34 2003 Subject: [Spambayes] Newbie experiences (Emacs + VM) and questions. Message-ID: <16086.30840.634464.946319@monster.linux.in> Hi, I use XEmacs and VM as my mailer. I installed Spambayes-1.0a2 a few days back and bungled my way getting things working to my satisfaction. Everything seems to work great now and I'm really happy with the results. Many thanks for the software! The instructions for VM users in INTEGRATION.txt didn't cover all I wanted. I first tried the pop3proxy and liked the web interface. However I did not like a few things (personal preference): 1. All my incoming mails are cached. How many days are these messages cached for? I read somewhere that it was 7 days, is that right and is this configurable? 2. Reviewing messages proved to be a little bit of a pain since it required the browser and an extra set of things to be done.
I preferred a solution from within the mailer. BTW can the proxy be configured so that all reviewed messages are deleted?

I then adapted the instructions and code from the contrib directory to work with VM directly. Here are instructions for future VM users.

0. Collect a few hundred HAM and SPAM messages each for a while and then do this:

    /usr/local/bin/hammie.py -d -p ~/vmmail/hammie.db -g HAM -s SPAM

   I guess this is optional since Spambayes will anyway be retrained later.

1. Create a ~/.spambayesrc that looks like this:

    [Categorization]
    ham_cutoff: 0.2
    spam_cutoff: 0.90

    [hammiefilter]
    hammiefilter_persistent_storage_file: ~/vmmail/hammie.db

2. Create a .procmailrc like so:

    :0fw:
    | /usr/local/bin/hammiefilter.py -d $HOME/vmmail/hammie.db -t

   The -d $HOME/vmmail/hammie.db is unnecessary I guess but safe.

3. Add the following to your .vm (adapted from contrib/spambayes.el):

    ;;;;;;;; .vm ;;;;;;;;;;;
    (defvar spambayes-hammiefilter "/usr/local/bin/hammiefilter.py"
      "Path to the hammiefilter.py program")

    (defun spambayes-retrain (is-spam)
      "Retrain on all processable articles, or the one under the cursor.

    is-spam is a boolean--true if you want to retrain the message as spam,
    false if you want to retrain as ham."
      (interactive)
      (message (concat "Retraining" (if is-spam " as SPAM" " as HAM") " ..."))
      (vm-pipe-message-to-command
       (concat spambayes-hammiefilter (if is-spam " -s" " -g") " -f") nil)
      (message (concat "Done retraining messages"
                       (if is-spam " as SPAM" " as HAM") ".")))

    (defun spambayes-retrain-as-spam ()
      "Retrain and refilter messages as spam"
      (interactive)
      (spambayes-retrain 't))

    (defun spambayes-retrain-as-ham ()
      "Retrain and refilter messages as ham"
      (interactive)
      (spambayes-retrain nil))

    (define-key vm-mode-map "ls" 'spambayes-retrain-as-spam)
    (define-key vm-summary-mode-map "ls" 'spambayes-retrain-as-spam)
    (define-key vm-mode-map "lh" 'spambayes-retrain-as-ham)
    (define-key vm-summary-mode-map "lh" 'spambayes-retrain-as-ham)

    (setq vm-auto-folder-alist
          '(("X-Spambayes-Classification:" ("spam" . "~/vmmail/SPAM"))
            ("X-Spambayes-Classification:" ("unsure" . "~/vmmail/UNSURE"))))
    ;;;;;;;; .vm ;;;;;;;;;;;

That's it. Now, any false positives or negatives are simply retrained using "ls" or "lh". The commands also work with marked messages. Of course, I might be doing something wrong here, so please correct me if I am. Please also feel free to use this as part of the documentation for Spambayes.

A few more questions/feature requests/whatnot:

1. Is the approach I use above a decent approach, i.e. training on messages as they are received and then correcting errors using retraining? It works fairly well for me but I've not tested it for very long.

2. Is it a good idea to train on every incoming message or are there better approaches?

3. At the moment I receive more HAM than SPAM. Does this matter or make no difference?

4. I see that some folks retrain the whole database every so often. Is this necessary? If so why?

5. It would be nice if hammie.py would retrain on any messages that had an X-Spambayes-Trained header, i.e. if Hammie.train(...) would check for the header and, if found, retrain. If this is intentionally not done it needs to be documented somewhere.
It's a little confusing (for a newbie) if there are different scripts with similar options. :) I understand that many things are still being experimented on but it would be nice to have a few rules of thumb for new users with respect to some of the questions I've asked. Thanks again for the excellent software! cheers, prabhu From jwillett at afgweb.com Thu May 29 16:12:22 2003 From: jwillett at afgweb.com (Jerry Willett) Date: Thu May 29 18:13:10 2003 Subject: [Spambayes] Spam Software Message-ID: I've only been using your software for one day and it seems to have nailed all of my spam right away. You've done a fabulous job with this. Had one question. After the software has been trained on all the messages in the Spam folder, is it ok to delete the contents of that folder each day? Thanks again for a great product. Jerry Willett Chief Technology Officer Associated Financial Group jwillett@afgweb.com From bill at parducci.net Thu May 29 17:04:56 2003 From: bill at parducci.net (bill parducci) Date: Thu May 29 19:11:57 2003 Subject: [Spambayes] Spam Software References: Message-ID: <3ED69218.3000607@parducci.net> > I've only been using your software for one day and it seems to have nailed > all of my spam right away. You've done a fabulous job with this. Had one > question. After the software has been trained on all the messages in the > Spam folder, is it ok to delete the contents of that folder each day? > Thanks again for great product. once a message has been [correctly] trained there is no need to keep it around. however, spambayes' accuracy is dependent upon having a 'sufficient' sample from which to make its decisions. therefore, most users retain a fair amount of spam in the event that they may wish to rebuild the corpus from scratch. of course, this begs the question: 'how much is enough?' that is where the 'art' of spambayes meets the science :) personally, i keep a couple thousand spam--two months' worth--as well as a similar number of ham.
that is not to say that you won't have excellent results with a tenth (or less) of that number; since everyone's e-mail profile is different, the requirements for training are as well. b From Greg_Thompson at jdedwards.com Thu May 29 18:48:59 2003 From: Greg_Thompson at jdedwards.com (Thompson, Greg) Date: Thu May 29 19:49:33 2003 Subject: [Spambayes] Addin will not load Message-ID: <20BAE62F0232D249A4A3B65440244B971AD9A0@denmails2.jdedwards.com> I have been using spambayes successfully for a while now and it rocks. Unfortunately I installed something today that broke it. The addin will no longer start. I am running Outlook 2002 SP2 on XP w/SP1. I followed the troubleshooting guide and noticed that the COM addin was not checked. I tried to check the box, exit Outlook but still no go. If I examine the Com addin box again it is unchecked. I tried to reinstall spambayes, just cause I could, but still no go. The spambayes log has a single entry that says: Registered: SpamBayes.OutlookAddin. Any help you can provide to get me back online would be greatly appreciated. Greg Thompson From mhammond at skippinet.com.au Fri May 30 10:57:45 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu May 29 19:59:17 2003 Subject: [Spambayes] Addin will not load In-Reply-To: <20BAE62F0232D249A4A3B65440244B971AD9A0@denmails2.jdedwards.com> Message-ID: <061a01c3263e$19f350b0$530f8490@eden> > I have been using spambayes successfully for a while now and it rocks. > Unfortunately I installed something today that broke it. The > addin will > no longer start. I am running Outlook 2002 SP2 on XP w/SP1. I followed > the troubleshooting guide and noticed that the COM addin was not > checked. I tried to check the box, exit Outlook but still no go. If I > examine the Com addin box again it is unchecked. I tried to reinstall > spambayes, just cause I could, but still no go. The > spambayes log has a > single entry that says: Registered: SpamBayes.OutlookAddin. 
> Any help you > can provide to get me back online would be greatly appreciated. You are not alone :( Just yesterday, a google search found this tidbit related to Outlook 2002: -- Outlook 2002 has probably disabled SpamBayes. Just do the following to get it back: Start Outlook, go to Help, About Microsoft Outlook. Click the Disabled Items button. Select SpamBayes. Click Enable. Restart Outlook 2002. -- (It didn't say SpamBayes before though :) I have no idea if this will solve your problem, or indeed if it even makes any sense. Please let us know either way. Thanks, Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2240 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20030530/1902f695/winmail-0001.bin From gward at python.net Thu May 29 21:48:13 2003 From: gward at python.net (Greg Ward) Date: Thu May 29 20:48:17 2003 Subject: [Spambayes] Spambayes for python.org Message-ID: <20030530004813.GA28375@cthulhu.gerg.ca> Hi all -- after months of procrastinating, I'm finally starting to think about how to protect python.org's mail server with spambayes. From poking around the archive, I see I'm not the first to think about server-side spambayes, handling many recipient addresses at SMTP time. But it also looks like not much code exists. (Actually, a bit of code exists, and I wrote it: I've been using spambayes for my gward@python.net address, running at SMTP time on the python.net mail server, for several weeks now, and it works like a champ. But it's only handling one recipient address -- mine -- now.) Right now, my training corpus consists of all non-personal mail received by mail.python.org from 2002-09-11 to 2002-09-18 and 2002-10-19 to 2002-10-24. (Yes, I'm going to start another harvesting run to bring the corpus more up-to-date.) 
Anyways, there are around 15700 messages all told: 2709 bounces (a subset of ham), 6614 ham, 4736 spam, and 1632 viruses (which I'm going to leave out). That sounds like a big enough corpus, but when I break it up by recipient address, it's not so hot. There are several hundred distinct addresses; my first approach to slimming things down was to combine all Mailman "*-admin" addresses into LISTS-admin, "*-request" into LISTS-request, etc. Now I have 134 recipients or recipient combinations to worry about, ie. 134 distinct training corpi, and most of them don't have very many messages. Here's a subset; first number is spam count, second is ham count (including bounces): patches@python.org 37 86 peps@python.org 2 1 persistence-sig@python.org 2 7 postmaster@python.org 0 80 psc@python.org 3 0 psf-board@python.org 0 12 psf-members@python.org 1 15 psf@python.org 63 2 psf-webmasters@python.org 29 0 pydotorg@python.org 6 32 python-announce-list@python.org 9 6 python-announce@python.org 123 25 So, given enough time, I could probably assemble a decent training corpus for patches@python.org or python-announce@python.org. (Hmmm, I guess if I trained on the psf-webmasters@python.org collection, spambayes would conclude that 100% of mail for that address is spam -- how sad!) But there are a lot of addresses that just don't get enough mail to build a meaningful corpus. I'm pretty sure part of the solution is to combine the smaller training corpuses. Common sense dictates that topically similar addresses should be combined, but that's a judgment call that only human eyeballs and grey matter can make. What I'm wondering about is when to stop, ie. when is a training corpus "big enough"? Does anyone have a feel for that? ISTR some research into this many months ago, but I haven't followed this list in about six months, so I'm out of date. And what about the spam:ham ratio in a training corpus -- should it reflect reality, or is a fixed ratio (1:1, 37:22, whatever) better? 
Actual experience would be great, but even vague hand-wavey speculative explanations are welcome. Thanks! Greg -- Greg Ward http://www.gerg.ca/ The box said "Requires Windows 95 or better" -- so I installed Linux! From matthewd at cisco.com Fri May 30 11:48:08 2003 From: matthewd at cisco.com (Dickinson, Matthew) Date: Thu May 29 20:48:47 2003 Subject: [Spambayes] Spambayes problem: it stopped working - com ad in isdisabled? wo n't let me re-enable... Message-ID: <11D2868F916DD411982D00508B694F431E70B9D6@syd-xch2.cisco.com> Awesome that worked! Thanks heaps :-) -----Original Message----- From: Mark Hammond [mailto:mhammond@skippinet.com.au] Sent: Thursday, 29 May 2003 4:54 PM To: 'Dickinson, Matthew' Subject: RE: [Spambayes] Spambayes problem: it stopped working - com ad in isdisabled? wo n't let me re-enable... A quick google found the following for Outlook 2002: Outlook 2002 has probably disabled Outclass. Just do the following to get it back: Start Outlook, go to Help, About Microsoft Outlook. Click the Disabled Items button. Select SpamBayes. Click Enable. Restart Outlook 2002. Please let me know if that works or not (or even makes sense :) Mark. -----Original Message----- From: Dickinson, Matthew [mailto:matthewd@cisco.com] Sent: Thursday, 29 May 2003 4:08 PM To: Mark Hammond Subject: RE: [Spambayes] Spambayes problem: it stopped working - com ad in isdisabled? wo n't let me re-enable... I've got a feeling it's a setting in Outlook that was changed when it was having problems loading... do you know of any "disable all ad-ins" area anywhere in outlook, or something else that would have that effect? -----Original Message----- From: Mark Hammond [mailto:mhammond@skippinet.com.au] Sent: Thursday, 29 May 2003 3:58 PM To: 'Dickinson, Matthew' Subject: RE: [Spambayes] Spambayes problem: it stopped working - com ad in isdisabled? wo n't let me re-enable... I am afraid I have no idea. 
My only suggestion is to run the source code version - info is available on the main spambayes pages about that. Mark. -----Original Message----- From: Dickinson, Matthew [mailto:matthewd@cisco.com] Sent: Thursday, 29 May 2003 3:49 PM To: Mark Hammond Subject: RE: [Spambayes] Spambayes problem: it stopped working - com ad in isdisabled? wo n't let me re-enable... Hi, thanks for responding so quickly! My temp folder is there... I tried deleting all the Spambayes log files... Then I re registered the dll... This is what was in the new log file: Registered: SpamBayes.OutlookAddin I started up outlook, still no spambayes... same problem as before... I've attached a screen shot so you can see whats happening... when I check that box and hit ok, it goes back to outlook... when I go back in it's unchecked! Any other ideas? Matt -----Original Message----- From: Mark Hammond [mailto:mhammond@skippinet.com.au ] Sent: Thursday, 29 May 2003 2:33 PM To: 'Dickinson, Matthew'; spambayes@python.org Subject: RE: [Spambayes] Spambayes problem: it stopped working - com ad in isdisabled? wo n't let me re-enable... Unfortunately, I have no idea what could cause this. Certainly, re-installing the addin should work, and it should at least generate a log with the failure reason in this case. You could try running "regsvr32 SpamBayesAddin.dll" (or whatever it was I named that DLL ;) - hopefully you will get a "success" message. Then try re-starting Outlook and check for a log. About the only thing I can think of is that SpamBayes is failing when *creating* the log file - hence we see no errors. The only way I can see that happening is if your Windows temp directory doesn't exist - and I imagine much more than SpamBayes would get upset in that situation. Another option would be that the log files are somehow read-only - can you try deleting all old log files first? Mark. 
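A minimal sketch of the temp-directory check Mark suggests above. It assumes only what the message states, namely that the addin creates its log file in the Windows temp directory. The helper name `temp_dir_is_writable` is hypothetical and is not part of SpamBayes.

```python
import os
import tempfile


def temp_dir_is_writable(path=None):
    """Return True if `path` (default: the system temp dir) exists
    and a file can actually be created and deleted inside it."""
    path = path or tempfile.gettempdir()
    if not os.path.isdir(path):
        # A missing temp directory is the failure case Mark describes.
        return False
    try:
        fd, name = tempfile.mkstemp(dir=path)
    except OSError:
        # Directory exists but is read-only or otherwise unusable.
        return False
    os.close(fd)
    os.remove(name)
    return True


if __name__ == "__main__":
    print("temp dir:", tempfile.gettempdir())
    print("writable:", temp_dir_is_writable())
```

If this reports False, the missing log is probably explained by the temp directory rather than by the addin itself. If it reports True, stale read-only log files are the next thing to rule out.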
> -----Original Message----- > From: spambayes-bounces@python.org > [mailto:spambayes-bounces@python.org ]On Behalf Of Dickinson, Matthew > Sent: Thursday, 29 May 2003 2:16 PM > To: spambayes@python.org > Subject: [Spambayes] Spambayes problem: it stopped working - com ad in > isdisabled? wo n't let me re-enable... > > > Hi, > > Spambayes was working perfectly for me - it's the best spam filterer > that I've ever used! > > So I've been very inundated with spam over the past week because it > hasn't been working... > > I'm running the binary version. > > About a week ago Outlook pretty much hung and was hanging the whole > PC, eventually it reported that an error had occured with an Add In > (I'm assuming Spambayes as I have no other adins) and that it > would disable it... > so that it could start Outlook (it kept on crashing each time > I tried to > start it until I did this)... So once Outlook was back up I > went into the > com add in section and saw that it was there (just without > the checkbox) so > I checked the box hit ok etc... but no icon's appeared on the > menu and my > spam wasn't being filtered (ie it wasn't running)... so I > went back to the > com ad in area and noticed that it was unchecked again... I > tried checking > it again etc... and basically its as if it won't stay enabled... > So I tried uninstalling it, then reinstalling it - same problem. > > I even tried upgrading to Outlook 2002, then installing it - same > problem > > There is no log file generated from the session (the last log file > says that Spambayes was registered successfully) > > Any ideas on how I can get it working? 
> > Thanks > > Matt > _______________________________________________ > Spambayes mailing list > Spambayes@python.org http://mail.python.org/mailman/listinfo/spambayes > From tim.one at comcast.net Thu May 29 22:11:06 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 29 21:13:54 2003 Subject: [Spambayes] Spambayes for python.org In-Reply-To: <20030530004813.GA28375@cthulhu.gerg.ca> Message-ID: [Greg Ward] > Hi all -- after months of procrastinating, I'm finally starting to > think about how to protect python.org's mail server with spambayes. Cool! > ... > So, given enough time, I could probably assemble a decent training > corpus for patches@python.org or python-announce@python.org. (Hmmm, I > guess if I trained on the psf-webmasters@python.org collection, > spambayes would conclude that 100% of mail for that address is spam -- > how sad!) But there are a lot of addresses that just don't get enough > mail to build a meaningful corpus. In the tests we ran before, I didn't try to segregate recipients at all. So long as personal email is left out of the mix, I think it's fine to lump *all* the lists together. Note that the *-request addresses accounted for almost all of the few false positives, though, due to crap like one-word "unsubscribe" followed by 20KB of HTML employer-generated Portuguese disclaimer nonsense. > ... > What I'm wondering about is when to stop, ie. when is a training > corpus "big enough"? Does anyone have a feel for that? Quantify the error rates you're willing to accept. That's the only way to get a meaningful answer. Note that unlike my dict-based tests, there's now a good way to use a Berkeley database, so the memory burden should be a fraction of what it was when I ran python.org tests (and the memory burden even then seemed quite livable). > ISTR some research into this many months ago, but I haven't followed > this list in about six months, so I'm out of date. 
And what about the
> spam:ham ratio in a training corpus -- should it reflect reality, or is
> a fixed ratio (1:1, 37:22, whatever) better?  Actual experience would
> be great, but even vague hand-wavey speculative explanations are
> welcome.

My belief is that a 1:1 ratio works best with the current code, regardless of the real-life ratio.

I also recommend these non-default options for python.org use, with possible exceptions noted later:

"""
[Tokenizer]
replace_nonascii_chars: True
record_header_absence: True
address_headers: from to cc sender reply-to

[Classifier]
experimental_ham_spam_imbalance_adjustment: True
"""

If python.org is carrying any lists with predominantly non-English traffic, I'd advise segregating those, and leaving replace_nonascii_chars out of their options.

experimental_ham_spam_imbalance_adjustment has no effect if the spam:ham training ratio is 1:1, so I recommend setting it True, because the training ratio won't stay in balance.

If you have any questions, ask Barry.

From gward at python.net Thu May 29 22:31:57 2003
From: gward at python.net (Greg Ward)
Date: Thu May 29 21:32:00 2003
Subject: [Spambayes] Spambayes for python.org
In-Reply-To:
References: <20030530004813.GA28375@cthulhu.gerg.ca>
Message-ID: <20030530013157.GC28065@cthulhu.gerg.ca>

On 29 May 2003, Tim Peters said:
> In the tests we ran before, I didn't try to segregate recipients at all. So
> long as personal email is left out of the mix, I think it's fine to lump
> *all* the lists together. Note that the *-request addresses accounted for
> almost all of the few false positives, though, due to crap like one-word
> "unsubscribe" followed by 20KB of HTML employer-generated Portuguese
> disclaimer nonsense.

Shortly before leaving my job at the MEMS Exchange, I replaced SpamAssassin on the mail server with Spambayes [1], and I took a chance on using a single corpus for the whole organization -- a dozen people and 3 or 4 mailing lists. It was pretty painful for the first few days, with >50% FP rate.
(My initial corpus was a bunch of my spam, some stuff sent by clueless users to our webmaster address, and a healthy chunk of our one big, high-profile mailing list.) After some frantic retraining on real mail, things settled down and after a week or so, spambayes was pretty good -- noticeably better than SpamAssassin, but still hardly perfect. As I mentioned, I've also set SB up on my python.net address, where the corpus is my mail and only my mail. (And a large chunk of the corpus [maybe all of it, can't remember] was captured by the python.net SMTP server, so is very very close to what spambayes is being asked to evaluate day-by-day.) In this scenario, especially after a retraining session one month in, spambayes operates with terrifying laser-like precision. It's so good it's spooky. On python.org, I'd like to see something closer to "terrifying laser-like precision" than "pretty good", so I'm willing to go to the trouble of building and maintaining multiple training corpi. But not 134 of them! (Especially not at 1-10 MB each.) Well, I'm still in playing-around mode. Will write back when I have interesting numbers or more meaningful questions. Thanks! Greg [1] thereby taking the Barry Warsaw approach to mail/web server maintenance -- do everything after 5pm on a Friday just before going on vacation -- to its logical extreme. ;-) -- Greg Ward http://www.gerg.ca/ In order to understand recursion, you really have to understand recursion. From tim.one at comcast.net Thu May 29 22:58:42 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu May 29 22:00:34 2003 Subject: [Spambayes] Spambayes for python.org In-Reply-To: <20030530013157.GC28065@cthulhu.gerg.ca> Message-ID: [Greg Ward] > Shortly before leaving my job at the MEMS Exchange, I replaced > SpamAssassin on the mail server with Spambayes [1], and I took a > chance on using a single corpus for the whole organization -- a dozen > people and 3 or 4 mailing lists. 
It was pretty painful for the first > few days, with >50% FP rate. (My initial corpus was a bunch of my > spam, some stuff sent by clueless users to our webmaster address, and > a healthy chunk of our one big, high-profile mailing list.) After > some frantic retraining on real mail, things settled down and after a > week or so, spambayes was pretty good -- noticeably better than > SpamAssassin, but still hardly perfect. As I said, you have to leave personal email out of it. The tests we ran before excluded personal email carried by python.org, and did great. > As I mentioned, I've also set SB up on my python.net address, where > the corpus is my mail and only my mail. (And a large chunk of the > corpus [maybe all of it, can't remember] was captured by the > python.net SMTP server, so is very very close to what spambayes is > being asked to evaluate day-by-day.) In this scenario, especially > after a retraining session one month in, spambayes operates with > terrifying laser-like precision. It's so good it's spooky. OK, you can leave *your* personal email in the mix, but for the love of God don't put Barry's in there too . > On python.org, I'd like to see something closer to "terrifying > laser-like precision" than "pretty good", so I'm willing to go to the > trouble of building and maintaining multiple training corpi. But not > 134 of them! (Especially not at 1-10 MB each.) We ran tests before on python.org mailing-list traffic lumped into one big ball. The software is even better now. > Well, I'm still in playing-around mode. Will write back when I have > interesting numbers or more meaningful questions. Thanks! You're welcome. Now believe the data and stop trying to out-think it. 
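A rough sketch of the two suggestions in this thread: lump the list addresses into one corpus, then train at a 1:1 spam:ham ratio. The counts are copied from the subset in Greg's first message; the function names `lump` and `balance` are made up for illustration, and down-sampling is only one assumed way of reaching 1:1, not something the thread prescribes.

```python
import random

# (spam, ham) message counts per recipient, copied from Greg's table.
COUNTS = {
    "patches@python.org": (37, 86),
    "peps@python.org": (2, 1),
    "persistence-sig@python.org": (2, 7),
    "postmaster@python.org": (0, 80),
    "psc@python.org": (3, 0),
    "psf-board@python.org": (0, 12),
    "psf-members@python.org": (1, 15),
    "psf@python.org": (63, 2),
    "psf-webmasters@python.org": (29, 0),
    "pydotorg@python.org": (6, 32),
    "python-announce-list@python.org": (9, 6),
    "python-announce@python.org": (123, 25),
}


def lump(counts):
    """Total (spam, ham) when every recipient shares one corpus."""
    spam = sum(s for s, _ in counts.values())
    ham = sum(h for _, h in counts.values())
    return spam, ham


def balance(spam_msgs, ham_msgs, seed=0):
    """Down-sample the larger class so the training ratio is 1:1."""
    n = min(len(spam_msgs), len(ham_msgs))
    rng = random.Random(seed)
    return rng.sample(spam_msgs, n), rng.sample(ham_msgs, n)


spam_total, ham_total = lump(COUNTS)
print(spam_total, ham_total)  # 275 266
```

Lumped together, this subset lands close to 1:1 on its own, even though single addresses such as psf-webmasters@python.org are completely lopsided.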
From joe at swelltech.com Thu May 29 22:39:44 2003 From: joe at swelltech.com (Joe Cooper) Date: Thu May 29 22:39:49 2003 Subject: [Spambayes] Spambayes for python.org In-Reply-To: <20030530013157.GC28065@cthulhu.gerg.ca> References: <20030530004813.GA28375@cthulhu.gerg.ca> <20030530013157.GC28065@cthulhu.gerg.ca> Message-ID: <3ED6C470.4020304@swelltech.com> Greg Ward wrote: > On 29 May 2003, Tim Peters said: > >>In the tests we ran before, I didn't try to segregate recipients at all. So >>long as personal email is left out of the mix, I think it's fine to lump >>*all* the lists together. Note that the *-request addresses accounted for >>almost all of the few false positives, though, due to crap like one-word >>"unsubscribe" followed by 20KB of HTML employer-generated Portuguese >>disclaimer nonsense. > > > Shortly before leaving my job at the MEMS Exchange, I replaced > SpamAssassin on the mail server with Spambayes [1], and I took a chance > on using a single corpus for the whole organization -- a dozen people > and 3 or 4 mailing lists. It was pretty painful for the first few days, > with >50% FP rate. (My initial corpus was a bunch of my spam, some > stuff sent by clueless users to our webmaster address, and a healthy > chunk of our one big, high-profile mailing list.) After some frantic > retraining on real mail, things settled down and after a week or so, > spambayes was pretty good -- noticeably better than SpamAssassin, but > still hardly perfect. As an added data point, I've installed it on the SciPy mailing lists about 10 days ago, using a corpus that I built from my own mailbox, old legitimate SciPy traffic, and a big box of ~2000 spams that I've received over the past year. So far it has had zero false positives, and a couple of false negatives. The effectiveness is about 75%, compared to ~90% on my personal mailbox with the same corpus. It works pretty well...hopefully with a bit more training on current scipy list spam, it will improve some. 
-- Joe Cooper Web caching appliances and support. http://www.swelltech.com From leob at mailcom.com Thu May 29 23:59:27 2003 From: leob at mailcom.com (Leonid Broukhis) Date: Fri May 30 02:00:14 2003 Subject: [Spambayes] Database cleaning? Message-ID: <3ED6F33F.9050000@mailcom.com> Dear Developers, many thanks for the cool program, it helps already, just a few hours after installation. I have a question, though: the spammers actively employ subject line chaffing (inserting nonsensical words or random letter sequences): does this cause database pollution and is there a way to weed those (usually unique) entries out? I only have 200 Mb in my "virtual server" ISP account, and the database is closing up to 3 Mb after one day. Thanks, Leo From DF at damartin.com Fri May 30 12:49:09 2003 From: DF at damartin.com (David Fox) Date: Fri May 30 06:29:05 2003 Subject: [Spambayes] Windows NT Message-ID: <2475B0099742D611A87A0090273D9E7E0FFC7E@NTSERVER> Hi, The majority of clients on my network run Outlook 2000 on NT workstations. Is Spambayes compatible with this combination? Yours David Fox DDI: 020 7375 0174 FAX: 020 7375 2450 DA Martin & Co. 13 Holywell Row London EC2A 4JF Tel: 020 7377 5995 Fax: 020 7247 6100 DX 33864 Finsbury Square **************************************************************************** *********** A list of partners may be inspected at our office. The partners are solicitors. This email and the information it contains are confidential and may be privileged, if you have received this email in error please notify us immediately. You should not copy it for any purpose, or disclose its contents to any other person. Internet communications are not secure and therefore D A Martin & Co does not accept legal responsibility for the contents of this message as it has been transmitted over a public network. If you suspect the message may have been intercepted or amended, please call the sender. 
From anthony at interlink.com.au Fri May 30 21:58:43 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Fri May 30 06:58:50 2003 Subject: [Spambayes] Windows NT In-Reply-To: <2475B0099742D611A87A0090273D9E7E0FFC7E@NTSERVER> Message-ID: <200305301058.h4UAwhS05455@localhost.localdomain> >>> David Fox wrote > Hi, > > The majority of clients on my network run Outlook 2000 on NT workstations. > Is Spambayes compatible with this combination? The page http://spambayes.sourceforge.net/windows.html lists what's known at the moment. NT isn't specifically listed there, but assuming it's like win2k, it should work. Give it a go and let us know the results, and we can add it to the list! Anthony From johnsone at titanlink.com Fri May 30 06:00:26 2003 From: johnsone at titanlink.com (Eldon Johnson) Date: Fri May 30 07:07:21 2003 Subject: [Spambayes] Works it MS Messager Message-ID: <01C32669.C0593840@ERJSERVER> Please tell me if Spambayes will work with Microsoft's Messager 4.0 from the oldies in Win98. thank you for your time...............ERJ From Giulio.Spinelli at italtel.it Fri May 30 14:52:40 2003 From: Giulio.Spinelli at italtel.it (Spinelli Giulio) Date: Fri May 30 07:52:50 2003 Subject: [Spambayes] Problem with outlook2000 Message-ID: Goodmornig, I've installed a binary version but When I start Outlook, there is no Anti-Spam item in the toolbar. To resolve this: I've performed the following steps: Start Outlook, and select Tools->Options to display the main Options dialog. Select the tab labelled Other, then click on the Advanced button. Click on the COM Add-Ins button. The SpamBayes addin is listed but not checked, then simply check it and close the dialog. Up to here is fine but reopening the dialog with the COM Add-Ins button the SpamBayes addin is listed but not checked again !!!! I've tried many and many times with different operation-sequences but still it is not possible to have the addin checked. I'm using NT o.s. 
and SpamBayes is the only pligin in the list. I think that is a problem of OutlookSetup, can you help me to manage it ? Regards Giulio. P.S. Sorry for the English but it is not my mothertongue and I live in Italy. From burdick at research.ge.com Fri May 30 11:59:20 2003 From: burdick at research.ge.com (Burdick, William E (Research)) Date: Fri May 30 11:00:09 2003 Subject: [Spambayes] (no subject) Message-ID: Upon arriving at work today I found, for the first time this year, no SPAM in my Inbox. SPAM Bayes, after training on only 9 known SPAMs and 40 known good emails performed wonderfully ... so far it has correctly identified 4 SPAMs as SPAM, 8 SPAMs as possible SPAM, and only one good email as possible SPAM. I expect improvements in accuracy as the system has additional opportunity to learn. Even if accuracy does not improve I am delighted with this tool's performance. MANY THANKS for your efforts. Bill From drw at dennisuniform.com Fri May 30 09:02:02 2003 From: drw at dennisuniform.com (Don Wyman) Date: Fri May 30 11:02:16 2003 Subject: [Spambayes] Will not register Message-ID: <12BEB02D0E22D411894300A0C9FC083EF40278@dunt1.dennisuniform.com> The spambayes_addin.dll will not register at installation or manually on my Windows 2000 pro pc.....I had it installed once, had to make a change to Outlook and now after uninstalling spambayes and trying to reinstall it again...get: dllregisterserver in spambayes_admin.dll failed....return code 0xc0000005 removed all occurrences of spam out of the registry, did a regclean, tried to reinstall...still get errors! I really would like to use this utility, it was working fine at one time...now I can not get it to install...Help!! 
Thanks...Don Wyman From tim at fourstonesexpressions.com Fri May 30 11:11:54 2003 From: tim at fourstonesexpressions.com (Tim Stone) Date: Fri May 30 11:12:21 2003 Subject: [Spambayes] Re: Spambayes Digest, Vol 57, Issue 101 In-Reply-To: <00e801c3261c$6f36ada0$a100a8c0@zlichstein> References: <00e801c3261c$6f36ada0$a100a8c0@zlichstein> Message-ID: On Thu, 29 May 2003 12:56:40 -0700, Zander wrote: > The problem is definitely that there are no exception handlers around the > lines: > > prs._parseheaders(self, fp) > > and > > prs._parsebody(self, fp) I'm working on this one, dudes. > > But I'm just not sure how to pass a "skip this message" back up the chain > properly... This is THE problem. c'est moi - TimS From jhinkle at autobaseinc.com Fri May 30 11:54:59 2003 From: jhinkle at autobaseinc.com (Jon Hinkle) Date: Fri May 30 11:53:48 2003 Subject: [Spambayes] I have a resolution for an install problem Message-ID: Hey Guys Great so far! I had a problem with the installation in that the spambayes_addin.dll wouldn't register on my system. Well, I'm a software developer and I know a little bit about this, so I ignored the error on your install and ran regsvr32 on that dll and it installed just fine then. Maybe you've packaged a different regsvr32 with your install. I don't know, but it registered fine once I did that. Anyway I opened Outlook and looked for the buttons your about page was speaking about but didn't see those so I followed the troubleshooting guide by going to the COM-Addins button under the Advanced button on the "Other" tab. I noticed that the spambayes addin was there and it was checked, but there were no buttons and the troubleshooting guide said that you were stumped. Well, all I did was remove that add-in and then I add it back in through the add-in manager there and just pointed back to the original installation folder. I'm making this sound more difficult than it is. 
Basically, just remove and add the spambayes_addin.dll file from the COM-Addin manager within Outlook and it should appear there. At least that's all I did and it works now. That could be another alternative for your troubleshooting guide besides, "I'm stumped". I've heard good things about this software so I'm hoping it does better than what I've had before. Thanks Jon Hinkle Software Developer Autobase, Inc. (317) 806-7911 (317) 842-4242 From drw at dennisuniform.com Fri May 30 10:42:57 2003 From: drw at dennisuniform.com (Don Wyman) Date: Fri May 30 12:43:05 2003 Subject: [Spambayes] Can't register sapmbayes on my PC????? Message-ID: <12BEB02D0E22D411894300A0C9FC083EF40279@dunt1.dennisuniform.com> The spambayes_addin.dll will not register at installation or manually on my Windows 2000 pro pc.....I had it installed once, had to make a change to Outlook and now after uninstalling spambayes and trying to reinstall it again...I get a return code 0 error, will not register dll. I then tried the regsvr32 spambayes_addin.dll command in the sambayes directory and get the following error: dllregisterserver in spambayes_admin.dll failed....return code 0xc0000005 removed all occurrences of spam out of the registry, did a regclean, tried to reinstall...still get the errors. I really would like to use this utility, it was working fine at one time...now I can not get it to install...Help!! Thanks...Don Wyman From neale at woozle.org Fri May 30 10:46:37 2003 From: neale at woozle.org (Neale Pickett) Date: Fri May 30 12:46:43 2003 Subject: [Spambayes] Newbie experiences (Emacs + VM) and questions. In-Reply-To: <16086.30840.634464.946319@monster.linux.in> (Prabhu Ramachandran's message of "Fri, 30 May 2003 02:45:36 +0530") References: <16086.30840.634464.946319@monster.linux.in> Message-ID: Hi Prabhu. This is excellent! I've added it to spambayes.el (included at the end of this message). Please try this out and let me know if it works for you. 
If it does, I'll check it in to CVS.

> A few more questions/feature requests/whatnot:
>
> 1. Is the approach I use above a decent approach, i.e. training
> messages as they are received and then correcting errors using
> retraining. It works fairly well for me but I've not tested it for
> very long.
>
> 2. Is it a good idea to train on every incoming message or are there
> better approaches?

I did this (train on everything) for a while, but I've found that mistake- and unsure-based training has been working very well for me. IIRC, this is even how Tim Peters does it. So you don't train on incoming messages, you only train on the ones that spambayes got wrong or was unsure about.

> 3. At the moment I receive more HAM than SPAM. Does this matter or
> make no difference?

It doesn't seem to matter for me.

> 4. I see that some folks retrain the whole database every so often.
> Is this necessary? If so why?

I think people do this as a way of pruning their database.

> 5. It would be nice if hammie.py would retrain on any messages that
> had an X-Spambayes-Trained header i.e. if Hammie.train(...) would
> check for the header and if found retrain. If this is intentionally
> not done it needs to be documented somewhere. Its a little
> confusing (for a newbie) if there are different scripts with similar
> options. :)

I know :( I may be able to get back in and fix this soon, as $FIRM may let me start working on the project again soon. But in the meantime, feel free to hurl invectives my way about the myriad confusing options to the myriad similar command-line utilities :)

Here's a new spambayes.el:

---8<---
;; spambayes.el -- integrate spambayes into Gnus
;; Copyright (C) 2003 Neale Pickett
;; Time-stamp: <2003-05-30 09:40:00 neale>

;; This is free software; you can redistribute it and/or modify it under
;; the terms of the GNU General Public License as published by the Free
;; Software Foundation; either version 2, or (at your option) any later
;; version.

;; This program is distributed in the hope that it will be useful, but
;; WITHOUT ANY WARRANTY; without even the implied warranty of
;; MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
;; General Public License for more details.

;; You should have received a copy of the GNU General Public License
;; along with GNU Emacs; see the file COPYING. If not, write to the
;; Free Software Foundation, 675 Mass Ave, Cambridge, MA 02139, USA.

;; Purpose:
;;
;; Functions to put spambayes into Gnus.
;;
;; This assumes you are already filtering with a procmail recipe
;; similar to:
;;
;;   :0fw
;;   | hammiefilter.py -t
;;
;; If you can't run procmail on all your incoming messages, you should
;; have a look at spam.el, which is included with Oort Gnus.
;;
;; This binds "B s" to "refile as spam", and "B h" to "refile as ham".
;; After refiling, the message is rescored and respooled. I haven't yet
;; run across a case where refiling doesn't change a message's score
;; well into the ham or spam range. If this happens to you, please let
;; me know.

;; Installation:
;;
;;
;; GNUS
;; ----
;; To install, just drop this file in your load path, and insert the
;; following lines in ~/.gnus:
;;
;; (load-library "spambayes")
;; (add-hook
;;  'gnus-sum-load-hook
;;  (lambda nil
;;    (define-key gnus-summary-mode-map [(B) (s)] 'spambayes-gnus-refile-as-spam)
;;    (define-key gnus-summary-mode-map [(B) (h)] 'spambayes-gnus-refile-as-ham)))
;;
;;
;; VM (Courtesy of Prabhu Ramachandran)
;; ----
;; Put the following in ~/.vm:
;;
;; (define-key vm-mode-map "ls" 'spambayes-vm-retrain-as-spam)
;; (define-key vm-summary-mode-map "ls" 'spambayes-vm-retrain-as-spam)
;; (define-key vm-mode-map "lh" 'spambayes-vm-retrain-as-ham)
;; (define-key vm-summary-mode-map "lh" 'spambayes-vm-retrain-as-ham)
;;
;; (setq vm-auto-folder-alist
;;       '(("X-Spambayes-Classification:" ("spam" . "~/vmmail/SPAM"))
;;         ("X-Spambayes-Classification:" ("unsure" . "~/vmmail/UNSURE"))
;;         )
;;       )

;; `labels' below comes from the cl package.
(require 'cl)

(defvar spambayes-spam-group "spam"
  "Group name for spam messages")

(defvar spambayes-hammiefilter "~/src/spambayes/hammiefilter.py"
  "Path to the hammiefilter program")

;; Gnus

(defun spambayes-gnus-retrain (is-spam)
  "Retrain on all processable articles, or the one under the cursor.

This will replace the buffer contents with command output. You can then
respool the article.

is-spam is a boolean--true if you want to retrain the message as spam,
false if you want to retrain as ham.
"
  (labels ((do-exec (n group is-spam)
                    (message "Retraining...")
                    (with-temp-buffer
                      (gnus-request-article-this-buffer n group)
                      ;; Pipe the article through hammiefilter and replace
                      ;; the buffer contents with its output.
                      (shell-command-on-region (point-min) (point-max)
                                               (concat spambayes-hammiefilter
                                                       (if is-spam " -s" " -g")
                                                       " -f")
                                               (current-buffer)
                                               t)
                      (gnus-request-replace-article n group (current-buffer)))
                    (message "Retrained article.")))
    (let ((group gnus-newsgroup-name)
          (list gnus-newsgroup-processable))
      (if (>= (length list) 1)
          (while list
            (let ((n (car list)))
              (do-exec n group is-spam))
            (setq list (cdr list)))
        (let ((n (gnus-summary-article-number)))
          (do-exec n group is-spam))))))

(defun spambayes-gnus-refile-as-spam ()
  "Retrain and refilter all process-marked messages as spam, then respool them"
  (interactive)
  (spambayes-gnus-retrain 't)
  (gnus-summary-respool-article nil (gnus-group-method gnus-newsgroup-name)))

(defun spambayes-gnus-refile-as-ham ()
  "Retrain and refilter all process-marked messages as ham, then respool them"
  (interactive)
  (spambayes-gnus-retrain nil)
  (gnus-summary-respool-article nil (gnus-group-method gnus-newsgroup-name)))

;;; VM

(defun spambayes-vm-retrain (is-spam)
  "Retrain on all processable articles, or the one under the cursor.

is-spam is a boolean--true if you want to retrain the message as spam,
false if you want to retrain as ham.
"
  (interactive)
  (message (concat "Retraining" (if is-spam " as SPAM" " as HAM") " ..."))
  (vm-pipe-message-to-command
   (concat spambayes-hammiefilter (if is-spam " -s" " -g") " -f") nil)
  (message (concat "Done retraining messages"
                   (if is-spam " as SPAM" " as HAM") ".")))

(defun spambayes-vm-retrain-as-spam ()
  "Retrain and refilter messages as spam"
  (interactive)
  (spambayes-vm-retrain 't))

(defun spambayes-vm-retrain-as-ham ()
  "Retrain and refilter messages as ham"
  (interactive)
  (spambayes-vm-retrain nil))

From popiel at wolfskeep.com Fri May 30 11:14:47 2003
From: popiel at wolfskeep.com (T. Alexander Popiel)
Date: Fri May 30 13:14:51 2003
Subject: [Spambayes] Newbie experiences (Emacs + VM) and questions.
In-Reply-To: Message from Neale Pickett of "Fri, 30 May 2003 09:46:37 PDT."
References: <16086.30840.634464.946319@monster.linux.in>
Message-ID: <20030530171447.8C6AE2DE9C@cashew.wolfskeep.com>

In message: Neale Pickett writes:
>
>> 4. I see that some folks retrain the whole database every so often.
>> Is this necessary? If so why?
>
>I think people do this as a way of pruning their database.

That's one of the reasons for me; the other is so that stuff expires from my classifier (and thus the mutation of spam doesn't dilute the effectiveness of the classifier - not that such dilution has been empirically shown to exist. ;-) ).

Call back in another year and a half and I might have real numbers about spam mutation affecting the accuracy rates.

- Alex

From Randy.Barth at gsfc.nasa.gov Fri May 30 14:44:17 2003
From: Randy.Barth at gsfc.nasa.gov (Randy Barth)
Date: Fri May 30 13:44:24 2003
Subject: [Spambayes] Feedback
Message-ID: <000d01c326d3$17e144e0$cda2b780@focus.local>

I just installed the Outlook plug-in on my Windows 2000/Outlook 2002 system. Two comments for your edification:

-It wasn't clear whether to install as administrator of the machine, or as the non-administration account I normally run my email under.
I did the former and, while it appeared when I ran Outlook as administrator, it didn't appear on my normal email account. So I installed a second time (without uninstalling) as the normal user id and everything looks fine. If it is really unnecessary to install as administrator first, you may want to include that in your install info. If it is necessary to install twice, knowing that rather than guessing would also be useful. -While setting up for training, I had an Outlook reminder go off and the popup screen froze Outlook. I could neither clear the reminder popup nor get a response from the AntiSpam Manager. I think this occurred while I was picking the folders for training. Later when I was running the training I also got a reminder but I was able to clear that one in the normal fashion and the training continued without problems. Good luck on your continued work. (And here's hoping that Hormel doesn't come after you for the "ham is what spam isn't" implications!) -Randy Barth, Goddard/NASA From LPeck at etisc.com Fri May 30 14:56:36 2003 From: LPeck at etisc.com (Libby Peck) Date: Fri May 30 13:57:07 2003 Subject: [Spambayes] No Anti-Spam Item in Outlook Toolbar Message-ID: I have been testing SpamBayes Add-In for Outlook. I have installed the software (Python 2.2 http://www.python.org , Win32-152.exe http://starship.python.net/crew/mhammond/win32/Downloads.html) on five systems (two Win XP Pro, two Win2k, and one Win98). I have installed the add-in using the installer (http://spambayes.sourceforge.net/applications.html ) and also by manually running addin.py. Neither installation method has worked on this system. There is no anti-spam item in Outlook toolbar. All of the systems are running Outlook 2002 SP2. The second Win XP Pro system is identical and it is working normally. I do not receive any errors during installation, which completes normally. There is no spambayes1.log file created. I reviewed the troubleshooting.html document and found ...................
* If you are running from source code, please re-register the addin, as per the README.txt file. I have reviewed all readme.txt files but have not located instructions on re-registering the add-in. I enjoyed the technician's last response in the troubleshooting document (under binary code troubleshooting) .................. 6. If the SpamBayes addin is listed and checked, but still not working and still not creating log files, then I am stumped! ............ but I hope someone has this figured out by now *S* Thanks, Libby From gumzilla at voicenet.com Fri May 30 20:20:07 2003 From: gumzilla at voicenet.com (Henry) Date: Fri May 30 19:21:55 2003 Subject: [Spambayes] New feaure request Message-ID: First of all- well done! I'm very happy that I found out about SpamBayes. So far, it looks like the system is correctly identifying every item on which it is trained. Even so (you knew this was coming, right?) I'd like to see an option added- upon filtering, I'd love to have spam messages permanently deleted instead of just moving them to the "deleted" folder. I think that this would add another level of flexibility that would be most welcome. Thanks! Henry Hayden From david.stewart at shaw.ca Fri May 30 18:53:49 2003 From: david.stewart at shaw.ca (david.stewart@shaw.ca) Date: Fri May 30 20:53:52 2003 Subject: [Spambayes] New feaure request Message-ID: <1b7fbb1b9729.1b97291b7fbb@shaw.ca> Why not just set Outlook to automatically purge the deleted items on exit? ----- Original Message ----- From: Henry Date: Friday, May 30, 2003 4:20 pm Subject: [Spambayes] New feaure request > First of all- well done! I'm very happy that I found out about > SpamBayes.So far, it looks like the system is correctly > identifying every item on > which it is trained. Even so (you knew this was coming, right?) > I'd like to > see an option added- upon filtering, I'd love to have spam messages > permanently deleted instead of just moving them to the "deleted" > folder. 
I > think that this would add another level of flexibility that would > be most > welcome. > > Thanks! > > > Henry Hayden > > > > > _______________________________________________ > Spambayes mailing list > Spambayes@python.org > http://mail.python.org/mailman/listinfo/spambayes > From tim.one at comcast.net Fri May 30 23:30:10 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri May 30 22:32:49 2003 Subject: [Spambayes] Feedback In-Reply-To: <000d01c326d3$17e144e0$cda2b780@focus.local> Message-ID: [Randy Barth] > ... > Good luck on your continued work. (And here's hoping that Hormel > doesn't come after you for the ham is what spam isn't implications!) Here's Hormel's official statement; their lawyer apparently told them they had as much chance of coming down on geeks for using "spam" as DuPont had of coming down on journalists for calling Ronald Reagan "teflon" : http://www.spam.com/ci/ci_in.htm From tim.one at comcast.net Fri May 30 23:41:55 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri May 30 22:42:55 2003 Subject: [Spambayes] (no subject) In-Reply-To: Message-ID: [Burdick, William E (Research)] > Upon arriving at work today I found, for the first time this year, no > SPAM in my Inbox. SPAM (all uppercase) is a registered trademark of Hormel Foods Corporation. The crap in your inbox is just spam . > SPAM Bayes, after training on only 9 known SPAMs and 40 known good > emails performed wonderfully ... so far it has correctly identified 4 > SPAMs as SPAM, 8 SPAMs as possible SPAM, and only one good email as > possible SPAM. I expect improvements in accuracy as the system has > additional opportunity to learn. Even if accuracy does not improve > I am delighted with this tool's performance. Accuracy should improve enormously over this. "Possible spam" is a phrase I hate: I call it "unsure", and that's what it is -- the system has conflicting evidence about the nature of a message, and simply doesn't know what to call it. 
For *some* people Unsures are usually spam, for others they're usually ham, and for people like me it varies from week to week. In any case, training on Unsures is important -- except when they're messages you have a hard time deciding about yourself! The system is remarkable in finding those too. As a general tip, you'll probably get best results if you strive to balance the number of ham and spam you train on. If you need more spam in order to do this, just put your email address somewhere on the web . > MANY THANKS for your efforts. Thank you! It's a labor of love by a large cast of characters, and everyone appreciates being appreciated. From cirkus at communityservices.com Sat May 31 08:56:01 2003 From: cirkus at communityservices.com (Bill Cirkus) Date: Sat May 31 07:43:36 2003 Subject: [Spambayes] Training Message-ID: <81867992CD17D5119103000102D0FB7936A673@CIRKUS-NT2> I have installed your product on my home PC, from which I retrieve my email via a VPN, connected to my office exchange server. Since I log off my office PC when I'm not there, and my home PC when I'm not there, I think I need your product installed on both PC's. Is there a way to export the file that was created in the training process on my home PC, so I can import it into my office PC? Bill Cirkus From slinthic at nycap.rr.com Sat May 31 10:18:15 2003 From: slinthic at nycap.rr.com (Steve Linthicum) Date: Sat May 31 09:18:18 2003 Subject: [Spambayes] outlook express Message-ID: <000801c32777$182ed620$6501a8c0@vaio> Should spambayes be compatible with outlook express??? I've done all installation steps, no errors reported but outlook express does not have spam drop down menu item when I run it. Any help appreciated. Thanks, Steve L. 
From skip at pobox.com Sat May 31 09:39:59 2003 From: skip at pobox.com (Skip Montanaro) Date: Sat May 31 09:40:05 2003 Subject: [Spambayes] Training In-Reply-To: <81867992CD17D5119103000102D0FB7936A673@CIRKUS-NT2> References: <81867992CD17D5119103000102D0FB7936A673@CIRKUS-NT2> Message-ID: <16088.45231.820713.96987@montanaro.dyndns.org> Bill> via a VPN, connected to my office exchange server. Since I log off Bill> my office PC when I'm not there, and my home PC when I'm not Bill> there, I think I need your product installed on both PC's. Is Bill> there a way to export the file that was created in the training Bill> process on my home PC, so I can import it into my office PC? You should be able to just toss the training database on a floppy and transport it between PCs. I'm not in my office at the moment so I can't check where it's located on the PC. I pulled this out of a previous message from Tim Peters: If you have a directory somewhere (assuming Win2K) like \Documents and Settings\ YOUR_LOGIN_NAME\ Application Data\ SpamBayes\ then it's hiding config and database info in there... The contents of that folder will probably be what you're looking for, though you should experiment without blowing stuff away. And you definitely *don't* want to be replacing those files while Outlook is running. Skip From skip at pobox.com Sat May 31 09:47:51 2003 From: skip at pobox.com (Skip Montanaro) Date: Sat May 31 09:47:54 2003 Subject: [Spambayes] outlook express In-Reply-To: <000801c32777$182ed620$6501a8c0@vaio> References: <000801c32777$182ed620$6501a8c0@vaio> Message-ID: <16088.45703.265351.458991@montanaro.dyndns.org> Steve> Should spambayes be compatible with outlook express??? I've done Steve> all installation steps, no errors reported but outlook express Steve> does not have spam drop down menu item when I run it. The Outlook plugin doesn't work with Outlook Express, though you can use the pop3 or imap proxies between Outlook Express and your POP3 or IMAP server.
Check the FAQ for details (question 2.2): http://spambayes.sourceforge.net/faq.html Skip (P.S. Your name seems familiar to me. I used to live in the Albany area and see you're posting from there.) From seant at webreply.com Sat May 31 10:54:43 2003 From: seant at webreply.com (Sean True) Date: Sat May 31 09:55:03 2003 Subject: [Spambayes] Announcement: SpamAtBay Outllok Add-in Beta Candidate 8 Message-ID: For the new people on the list: SpamAtBay is a soon-to-be-available commercial version of the Spambayes addin for Outlook. Any reader of this list who would like to beta test for us is welcome to a permanent license -- and anyone who is a SpamBayes developer can have the license for nothing more than asking. In either case, just e-mail me. Brief history: I wrote the first, really, really bad integration with Outlook, using VBA and a Python COM server. Mark Hammond thought the idea was neat, but that the execution was, err, flawed, and pretty much rewrote it from scratch. Over the last few weeks, I've had a team of several people working on polishing, layout, docs, QA, and (gag) marketing-oriented features. We're going to release a supported version sometime soon. We're focusing on Tim's "easy enough for my sister" metric. Supported systems: Win98SE, ME, NT 4, Win2K, XP; Outlook 2000/2002/XP. Not supported: Win95; Outlook Express. Release 1.0 Beta Candidate 8 (0.97) Reliability and bug fixes. If you are an Exchange user having problems, please install this release.
1) Hitting the It's Spam button reliably moves the message to the Spam folder
2) Process missed messages after changing rules
3) Numerous changes to folder navigation, to survive Exchange and Hotmail idiosyncrasies
4) Hotmail folders are not watchable (to be fixed)
5) Log folder that message arrived in
6) Fix bug in getting folder name
7) Bitmaps for installer
8) Menu tree changes (Advanced split into Advanced and Support)
9) Spelling and punctuation fixes
10) Use SpamBayes application data directory if present, but create a new SpamAtBay directory if not. (If you switch to SpamBayes, you'll need to manually copy your data files, but they _will_ work.)
11) Support/Troubleshooting brings up correct HTML file
12) Added Purchase button to registration dialog, goes to web site. (No purchase required for any beta user, but someday this will do something).
13) Fix for RedThumb on clipboard problem (text only)
14) Senderlist left and down from mouse.
15) Detect Outlook running during install
16) Support mail now goes to spamatbay.com
17) Uninstall now cleans out registration keys
From skip at pobox.com Sat May 31 12:15:57 2003 From: skip at pobox.com (Skip Montanaro) Date: Sat May 31 12:15:57 2003 Subject: [Spambayes] Training In-Reply-To: <81867992CD17D5119103000102D0FB7936A674@CIRKUS-NT2> References: <81867992CD17D5119103000102D0FB7936A674@CIRKUS-NT2> Message-ID: <16088.54589.409025.270474@montanaro.dyndns.org> (Make sure you cc the list when asking for help.) >>>>> "Bill" == Bill Cirkus writes: Bill> I can find the directory that Spambayes is in, but can't identify Bill> the specific files(s) that contain the config and database Bill> info. Can you identify them?
Bill> The only files that have a date stamp of 5/29/03, when I installed Bill> the program, are located either in the folder: Bill> support\gen_py\00062FFF-0000-0000-C000-000000000046x0x9x0 or Bill> support\gen_py\2DF8D04C-5BFA-101B-BDE5-00AA0044DE52x0x2x1 Bill> The rest have a date stamp of sometime in March 2003 or earlier. I can't identify them without poking around the actual computer at work. Perhaps someone else on the list with more Windows experience (Tim's sisters have more Windows experience than I do) knows. Skip From popiel at wolfskeep.com Sat May 31 11:00:37 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Sat May 31 13:00:41 2003 Subject: [Spambayes] Database cleaning? In-Reply-To: Message from Leonid Broukhis of "Thu, 29 May 2003 22:59:27 PDT." <3ED6F33F.9050000@mailcom.com> References: <3ED6F33F.9050000@mailcom.com> Message-ID: <20030531170037.10DB82DDF2@cashew.wolfskeep.com> In message: <3ED6F33F.9050000@mailcom.com> Leonid Broukhis writes: >many thanks for the cool program, it helps already, just a few hours after >installation. I have a question, though: the spammers actively employ >subject line chaffing (inserting nonsensical words or random letter >sequences): does this cause database pollution and is there a way to weed >those (usually unique) entries out? Yes, those words cause database pollution, and yes, they can be weeded out with just a handful of lines of code... but it's hard to tell which hapax legomena will be useless, and which will soon get reinforced by other occurrences, so it's (IMNSHO) generally not worth the hassle. >I only have 200 Mb in my "virtual server" ISP account, and the database is >closing up to 3 Mb after one day. Don't worry too hard about it. With 4 months of data, my database is only 21 meg. The DB growth slows down a lot after the first couple of days.
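[For illustration only: a minimal sketch of the kind of weeding described in the reply above. It assumes the training data is a plain Python dict mapping each token to a (spam_count, ham_count) pair. That layout and the name prune_hapaxes are hypothetical and are not SpamBayes's actual storage format or API. The sketch drops hapax legomena, i.e. tokens seen exactly once across all training, and it has the drawback already noted: a dropped token might have been about to be reinforced.]

```python
def prune_hapaxes(counts):
    """Return a new dict without tokens that were seen exactly once.

    counts maps token -> (spam_count, ham_count).  This layout is an
    assumption made for the example, not SpamBayes's real database.
    """
    return {
        token: (spam, ham)
        for token, (spam, ham) in counts.items()
        if spam + ham > 1  # keep only tokens with more than one sighting
    }


# Hypothetical training counts: two useful tokens and two hapaxes.
counts = {
    "viagra": (12, 0),
    "xqzvbn": (1, 0),   # random chaff seen once in one spam
    "meeting": (0, 7),
    "python": (0, 1),   # seen once in ham; would also be dropped
}
pruned = prune_hapaxes(counts)
```

[Note that the ham-side hapax "python" is dropped along with the spam chaff, which is exactly the risk that makes this kind of pruning a judgment call.]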
- Alex From piersh at friskit.com Sat May 31 15:21:08 2003 From: piersh at friskit.com (Piers Haken) Date: Sat May 31 17:17:30 2003 Subject: [Spambayes] Non-pickle database with outlook plugin Message-ID: <9891913C5BFE87429D71E37F08210CB92C75F6@zeus.sfhq.friskit.com> I was wondering if some kind soul could tell me how to set up the Outlook plugin so that it uses bsddb3 (or similar) instead of pickles for storing its database. I'm getting bored of waiting 5 minutes for Outlook to start/stop and having to retrain every time Outlook crashes. I know there must be instructions somewhere, but I can't find any. Many thanks. Piers.