From tim.one at comcast.net Sun Jun 1 01:39:45 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Jun 1 00:40:19 2003 Subject: [spambayes-dev] Re: [Spambayes] Database cleaning? In-Reply-To: <1054430548.31.1335@sake.mondoinfo.com> Message-ID: [Matthew Dixon Cowles] > ... > With an eye toward reducing the size of the database, I instrumented > the classifier a while ago and found a very strong indication that > that's true. Indeed, hapaxes often figured in scoring. I didn't > bother to calculate exact numbers because the results were strong > enough to persuade me that removing hapaxes wasn't a useful strategy. The original spambayes code saved a time-of-last-access stamp in each WordInfo record. That was to support research into database cleaning strategies. The research never happened, though, and several WordInfo members got tossed to reduce the database size. If people want to start research on this again, an official patch set to maintain this kind of info in researchers' databases would be a real help. Earlier experiments showed that removing hapaxes was fine *if* you had trained carefully on many thousands of messages at random. It also showed that removing hapaxes was a disaster if you engaged in mistake-based training alone (that is, never train on anything except misclassifed msgs, and possibly also unsures -- then you end up with a very small, and also a very brittle (prone to major ongoing surprises), database). In hindsight, I'd rephrase this to say that hapax-driven databases need their hapaxes . From noreply at sourceforge.net Sun Jun 1 01:39:47 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 1 08:56:54 2003 Subject: [spambayes-dev] [ spambayes-Bugs-746909 ] Conflicting with Message Rules Message-ID: Bugs item #746909, was opened at 2003-06-01 00:39 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=746909&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Henry Ho (cornflakeboy) Assigned to: Mark Hammond (mhammond) Summary: Conflicting with Message Rules Initial Comment: If you have Outlook message rules defined (i.e. moving messages automatically into different folders), and a message comes in, Spambayes will move it into spam or possible spam folders as it should. But Outlook will move that message to the folder defined by the rule, which subsequently makes Spambayes think it's good and will train itself accordingly. An example rule is: Apply this rule after the message arrives through the myemail@mydomain.com account, move it to the My Personal folder The above rule was created to sort multiple email accounts into proper folders. I have Spambayes configured to watch my Inbox and my "My Personal" folder. Perhaps we can make Spambayes process messages after Outlook's done running its rules? I am using Outlook 2002 SP-1 Thanks! Great program! I have been recommending it to everyone who owns a PC! :) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=746909&group_id=61702 From matt at mondoinfo.com Sun Jun 1 14:48:19 2003 From: matt at mondoinfo.com (Matthew Dixon Cowles) Date: Sun Jun 1 14:51:10 2003 Subject: [spambayes-dev] Re: [Spambayes] Database cleaning? In-Reply-To: References: <1054430548.31.1335@sake.mondoinfo.com> Message-ID: <1054491712.69.2336@sake.mondoinfo.com> [Tim] > The original spambayes code saved a time-of-last-access stamp in > each WordInfo record. That was to support research into database > cleaning strategies. The research never happened, though, and > several WordInfo members got tossed to reduce the database size. > If people want to start research on this again, an official patch > set to maintain this kind of info in researchers' databases would > be a real help. I patched my classifier to record when a token is used in scoring at the same time that I patched it to record the other statistics. My thought is to have my classifier calculate several scores, some ignoring tokens that haven't been used in scoring for a while. I haven't gotten to that part but if anyone is interested in the (trivial) changes so far, I'd be glad to upload the patch to SourceForge. Regards, Matt From noreply at sourceforge.net Sun Jun 1 12:28:35 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 1 15:21:34 2003 Subject: [spambayes-dev] [ spambayes-Bugs-747107 ] SpamBayes leaves duplicate messages Message-ID: Bugs item #747107, was opened at 2003-06-01 11:28 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747107&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Eric Brown (ericbr) Assigned to: Mark Hammond (mhammond) Summary: SpamBayes leaves duplicate messages Initial Comment: I am subscribed to a number of different mailing lists, and I have a number of Outlook rules (set up via Rules Wizard) to route the different mailing lists into different folders. When the rules are of the form: Apply this rule after the message arrives with in the subject move it to the folder and a matching message comes in, SpamBayes seems to leave multiple copies of that message in the inbox. This is very frustrating. If the rule is of the form: Apply this rule after the message arrives sent to move it to the folder and a matching message comes in, duplicate messages do not occur. So it seems to be some sort of odd interaction with Outlook Rules. I'm running Outlook 2002 SP 2 on Windows XP SP 1a (with current hotfixes). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747107&group_id=61702 From noreply at sourceforge.net Sun Jun 1 14:54:33 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 1 17:02:23 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743122 ] Outlook 2002 auto-complete breaks Message-ID: Bugs item #743122, was opened at 2003-05-26 00:06 Message generated for change (Comment added) made by spidawebs You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743122&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2002 auto-complete breaks Initial Comment: Just installed SpamBayes and I think it breaks the auto-complete popup list for email addresses used in the To field in Outlook. The popup still appears but only with the first letter of each email address - the rest of the email address doesn't appear. I've uninstalled and reinstalled SpamBayes several times to confirm this problem. I'm running Outlook 2002 (SP2) on Windows 2000 (SP3). ---------------------------------------------------------------------- Comment By: Jonathan Evatt (spidawebs) Date: 2003-06-02 08:54 Message: Logged In: YES user_id=48654 I am also getting this problem. Have installed the app on Outlook XP (2002) SP2 over Win XP SP1. If I start up Outlook the problem only seems to occur AFTER I actually run the spambayes configuration routine. If I do not access that then the problem does not (so far - limited testing) seem to occur. Jonathan ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743122&group_id=61702 From noreply at sourceforge.net Sun Jun 1 18:08:33 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 1 21:34:48 2003 Subject: [spambayes-dev] [ spambayes-Bugs-746909 ] Conflicting with Message Rules Message-ID: Bugs item #746909, was opened at 2003-06-01 17:39 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=746909&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Wont Fix Priority: 5 Submitted By: Henry Ho (cornflakeboy) Assigned to: Mark Hammond (mhammond) Summary: Conflicting with Message Rules Initial Comment: If you have Outlook message rules defined (i.e. moving messages automatically into different folders), and a message comes in, Spambayes will move it into spam or possible spam folders as it should. But Outlook will move that message to the folder defined by the rule, which subsequently makes Spambayes think it's good and will train itself accordingly. An example rule is: Apply this rule after the message arrives through the myemail@mydomain.com account, move it to the My Personal folder The above rule was created to sort multiple email accounts into proper folders. I have Spambayes configured to watch my Inbox and my "My Personal" folder. Perhaps we can make Spambayes process messages after Outlook's done running its rules? I am using Outlook 2002 SP-1 Thanks! Great program! I have been recommending it to everyone who owns a PC! :) ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-02 10:08 Message: Logged In: YES user_id=14198 What version of Outlook? I use 2000, and have never seen *both* builtin rules and SpamBayes to get a single message - only one gets it and the other misses. I'm not sure what we can do here though - the problem is with the built in rule - can't you tell it to ignore the Spam folder? Either way, SpamBayes is reacting by design - it presumes the message was mis-classified and is being moved by you. I suggest you disable the 2 checkboxes that relate to watching folders and training. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=746909&group_id=61702 From popiel at wolfskeep.com Sun Jun 1 21:46:21 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Sun Jun 1 23:46:25 2003 Subject: [spambayes-dev] Re: [Spambayes] Database cleaning? In-Reply-To: Message from Matthew Dixon Cowles of "Sat, 31 May 2003 21:19:41 CDT." <1054430548.31.1335@sake.mondoinfo.com> References: <3ED6F33F.9050000@mailcom.com> <20030531170037.10DB82DDF2@cashew.wolfskeep.com> <1054430548.31.1335@sake.mondoinfo.com> Message-ID: <20030602034621.1148D2DE9A@cashew.wolfskeep.com> In message: <1054430548.31.1335@sake.mondoinfo.com> Matthew Dixon Cowles writes: > >I tore that code out and instead hacked the classifier so that I >could determine how soon after a word figures in scoring that it's >used again. I think that the results are at least slightly >interesting. Note that the histogram below is log scaled. [ snip of histogram showing an apparent exponential dropoff in usage frequency ] Yes, this is a very interesting result. I'm not sure it's actually useful, but it is pretty. Another thing that would be interesting to plot would be a histogram of the average frequency each token gets used at... which might give us some idea of how large a DB is actually useful. - Alex From noreply at sourceforge.net Mon Jun 2 03:55:16 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 2 09:50:01 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743122 ] Outlook 2002 auto-complete breaks Message-ID: Bugs item #743122, was opened at 2003-05-25 12:06 Message generated for change (Comment added) made by worger You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743122&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2002 auto-complete breaks Initial Comment: Just installed SpamBayes and I think it breaks the auto-complete popup list for email addresses used in the To field in Outlook. The popup still appears but only with the first letter of each email address - the rest of the email address doesn't appear. I've uninstalled and reinstalled SpamBayes several times to confirm this problem. I'm running Outlook 2002 (SP2) on Windows 2000 (SP3). ---------------------------------------------------------------------- Comment By: Martin Worger (worger) Date: 2003-06-02 09:55 Message: Logged In: YES user_id=751487 I am running the same - W2K SP3 & OL2K and get the same problem. ---------------------------------------------------------------------- Comment By: Jonathan Evatt (spidawebs) Date: 2003-06-01 20:54 Message: Logged In: YES user_id=48654 I am also getting this problem. Have installed the app on Outlook XP (2002) SP2 over Win XP SP1. If I start up Outlook the problem only seems to occur AFTER I actually run the spambayes configuration routine. If I do not access that then the problem does not (so far - limited testing) seem to occur. Jonathan ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743122&group_id=61702 From noreply at sourceforge.net Mon Jun 2 04:42:30 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 2 09:50:01 2003 Subject: [spambayes-dev] [ spambayes-Bugs-747458 ] messages not processed Message-ID: Bugs item #747458, was opened at 2003-06-02 06:42 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747458&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Russell S Blue (bluer) Assigned to: Mark Hammond (mhammond) Summary: messages not processed Initial Comment: AFter installing SpamBayes about a week ago, and coming to count on its performance, midday yesterday it stopped processing messages. I noticed it originally not because some spam got through, but because messages were coming in wiithout a spam scare. Anyway, now when I run "Filter Now", it comes up with all errors... log attached. I running a binary version, which for the heck of it I tried reinstalling several times last night (no effect). Windows XP Outlook 2002 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747458&group_id=61702 From noreply at sourceforge.net Mon Jun 2 04:46:01 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 2 09:50:02 2003 Subject: [spambayes-dev] [ spambayes-Bugs-747458 ] messages not processed Message-ID: Bugs item #747458, was opened at 2003-06-02 06:42 Message generated for change (Comment added) made by bluer You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747458&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Russell S Blue (bluer) Assigned to: Mark Hammond (mhammond) Summary: messages not processed Initial Comment: AFter installing SpamBayes about a week ago, and coming to count on its performance, midday yesterday it stopped processing messages. I noticed it originally not because some spam got through, but because messages were coming in wiithout a spam scare. Anyway, now when I run "Filter Now", it comes up with all errors... log attached. I running a binary version, which for the heck of it I tried reinstalling several times last night (no effect). Windows XP Outlook 2002 ---------------------------------------------------------------------- >Comment By: Russell S Blue (bluer) Date: 2003-06-02 06:46 Message: Logged In: YES user_id=791991 Cut out big chunk of the errors (in the log file)\ to get it to upload... ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747458&group_id=61702 From noreply at sourceforge.net Mon Jun 2 05:25:09 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 2 09:50:03 2003 Subject: [spambayes-dev] [ spambayes-Bugs-747472 ] Install fails on W2K, OL2K... Message-ID: Bugs item #747472, was opened at 2003-06-02 13:25 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747472&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: rainer scherg (rasc) Assigned to: Mark Hammond (mhammond) Summary: Install fails on W2K, OL2K... Initial Comment: Trying to install spambayes-Outlook-Setup-002.exe fails: System: Win2K Sp1 (German) Outlook 2K SR1 (German) - McAfee Virusscanner - Exchange Connector In fact it failed on 3 computers and succeded on 1 computer (all same basic installation) ... I tried also to switch off Virusscan plugin. --Log (installation): Registered: SpamBayes.OutlookAddin ----------------------------- Buttons are not visible in Outlook. Troubleshoot: SpamBayes Com-Addin in Outlook is "un-checked". Trying to activate the add-in fails (after reentering the add-in applet, the SpamBayes add-in entry again is unchecked). -- Log: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Dokumente und Einstellungen\rainsche\Anwendungsdaten\SpamBayes\d efault_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747472&group_id=61702 From noreply at sourceforge.net Mon Jun 2 10:17:21 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 2 13:36:46 2003 Subject: [spambayes-dev] [ spambayes-Bugs-747472 ] Install fails on W2K, OL2K... Message-ID: Bugs item #747472, was opened at 2003-06-02 13:25 Message generated for change (Comment added) made by rasc You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747472&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: rainer scherg (rasc) Assigned to: Mark Hammond (mhammond) Summary: Install fails on W2K, OL2K... Initial Comment: Trying to install spambayes-Outlook-Setup-002.exe fails: System: Win2K Sp1 (German) Outlook 2K SR1 (German) - McAfee Virusscanner - Exchange Connector In fact it failed on 3 computers and succeded on 1 computer (all same basic installation) ... I tried also to switch off Virusscan plugin. --Log (installation): Registered: SpamBayes.OutlookAddin ----------------------------- Buttons are not visible in Outlook. Troubleshoot: SpamBayes Com-Addin in Outlook is "un-checked". Trying to activate the add-in fails (after reentering the add-in applet, the SpamBayes add-in entry again is unchecked). -- Log: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Dokumente und Einstellungen\rainsche\Anwendungsdaten\SpamBayes\d efault_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- >Comment By: rainer scherg (rasc) Date: 2003-06-02 18:17 Message: Logged In: YES user_id=115734 Somethbing I tried after reading in the maillist archives: I removed SpamBayes COM-Add-in from Outlook an re-added it. I this case it was "selected" (checked), but Outlook crashed with an exception. BTW: I have administrator rights on the computers I tried... ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747472&group_id=61702 From matt at mondoinfo.com Mon Jun 2 15:59:57 2003 From: matt at mondoinfo.com (Matthew Dixon Cowles) Date: Mon Jun 2 16:35:18 2003 Subject: [spambayes-dev] Re: [Spambayes] Database cleaning? In-Reply-To: <20030602034621.1148D2DE9A@cashew.wolfskeep.com> References: <3ED6F33F.9050000@mailcom.com> <20030531170037.10DB82DDF2@cashew.wolfskeep.com> <1054430548.31.1335@sake.mondoinfo.com> <20030602034621.1148D2DE9A@cashew.wolfskeep.com> Message-ID: <1054582419.53.613@sake.mondoinfo.com> [Alex Popiel] > [ snip of histogram showing an apparent exponential > dropoff in usage frequency ] > > Yes, this is a very interesting result. I'm not sure it's actually > useful, but it is pretty. I'm not sure it is either but I'm hopeful that it may be. For example, it says that (with my mail) if a token is used in scoring, there's a 90% chance that it will be used again within one day, a 95% chance it will be used again within four days, and a 98% chance that it will be used again within two weeks. That suggests to me that a relatively simple mechanism for database pruning may be useful. When I have a few minutes, I plan to do some more work to see if that's true. > Another thing that would be interesting to plot would be a > histogram of the average frequency each token gets used at... which > might give us some idea of how large a DB is actually useful. I'd be glad to poke at the data in a different way, but it's not clear to me how that's different from what I've done. Can you tell me a little more specifically what you mean? Regards, Matt From popiel at wolfskeep.com Mon Jun 2 15:05:10 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Mon Jun 2 17:05:15 2003 Subject: [spambayes-dev] Re: [Spambayes] Database cleaning? In-Reply-To: Message from Matthew Dixon Cowles of "Mon, 02 Jun 2003 14:59:57 CDT." <1054582419.53.613@sake.mondoinfo.com> References: <3ED6F33F.9050000@mailcom.com> <20030531170037.10DB82DDF2@cashew.wolfskeep.com> <1054430548.31.1335@sake.mondoinfo.com> <20030602034621.1148D2DE9A@cashew.wolfskeep.com> <1054582419.53.613@sake.mondoinfo.com> Message-ID: <20030602210510.600EB2DE98@cashew.wolfskeep.com> In message: <1054582419.53.613@sake.mondoinfo.com> Matthew Dixon Cowles writes: > >> Another thing that would be interesting to plot would be a >> histogram of the average frequency each token gets used at... which >> might give us some idea of how large a DB is actually useful. > >I'd be glad to poke at the data in a different way, but it's not >clear to me how that's different from what I've done. Can you tell me >a little more specifically what you mean? If I'm reading your histogram right, then you're plotting for each usage, how long ago it was since that token was last used. Thus, a single token that gets used frequently will contribute multiple times to the histogram. What I'm suggesting is having each token keep track of its usage frequency, and then building a histogram of token vs. frequency, with each token only contributing once to the chart. This would give an idea of what percentage of tokens are used a lot, as opposed to what you've got now (which says that for tokens that are used, most will be used again soon). - Alex From matt at mondoinfo.com Mon Jun 2 21:36:02 2003 From: matt at mondoinfo.com (Matthew Dixon Cowles) Date: Mon Jun 2 21:36:51 2003 Subject: [spambayes-dev] Re: [Spambayes] Database cleaning? In-Reply-To: <20030602210510.600EB2DE98@cashew.wolfskeep.com> References: <3ED6F33F.9050000@mailcom.com> <20030531170037.10DB82DDF2@cashew.wolfskeep.com> <1054430548.31.1335@sake.mondoinfo.com> <20030602034621.1148D2DE9A@cashew.wolfskeep.com> <1054582419.53.613@sake.mondoinfo.com> <20030602210510.600EB2DE98@cashew.wolfskeep.com> Message-ID: <1054603627.46.805@sake.mondoinfo.com> > What I'm suggesting is having each token keep track of its usage > frequency, and then building a histogram of token vs. frequency, > with each token only contributing once to the chart. This would > give an idea of what percentage of tokens are used a lot, as opposed > to what you've got now (which says that for tokens that are used, > most will be used again soon). Here you go. Though this one doesn't seem to be worth a histogram: Over 30.0 days, 63209 tokens were used in scoring a total of 1107800 times Largest number of uses 11144, smallest 1 0-500 uses 62929 500-1000 uses 145 1000-1500 uses 36 1500-2000 uses 26 2000-2500 uses 27 2500-3000 uses 10 3000-3500 uses 3 3500-4000 uses 3 4000-4500 uses 18 4500-5000 uses 4 5000-5500 uses 2 5500-6000 uses 1 6000-6500 uses 2 6500-7000 uses 1 7000-7500 uses 0 7500-8000 uses 1 8000-8500 uses 0 8500-9000 uses 0 9000-9500 uses 0 9500-10000 uses 0 10000-10500 uses 0 10500-11000 uses 0 11000-11500 uses 1 That token that was used 11144 times was "content-type:text/plain" and the next most commonly-used one was "subject:: ". Regards, Matt From popiel at wolfskeep.com Mon Jun 2 20:50:06 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Mon Jun 2 22:50:09 2003 Subject: [spambayes-dev] Re: [Spambayes] Database cleaning? In-Reply-To: Message from Matthew Dixon Cowles of "Mon, 02 Jun 2003 20:36:02 CDT." <1054603627.46.805@sake.mondoinfo.com> References: <3ED6F33F.9050000@mailcom.com> <20030531170037.10DB82DDF2@cashew.wolfskeep.com> <1054430548.31.1335@sake.mondoinfo.com> <20030602034621.1148D2DE9A@cashew.wolfskeep.com> <1054582419.53.613@sake.mondoinfo.com> <20030602210510.600EB2DE98@cashew.wolfskeep.com> <1054603627.46.805@sake.mondoinfo.com> Message-ID: <20030603025006.162D72DE98@cashew.wolfskeep.com> In message: <1054603627.46.805@sake.mondoinfo.com> Matthew Dixon Cowles writes: >> What I'm suggesting is having each token keep track of its usage >> frequency, and then building a histogram of token vs. frequency, >> with each token only contributing once to the chart. This would >> give an idea of what percentage of tokens are used a lot, as opposed >> to what you've got now (which says that for tokens that are used, >> most will be used again soon). > >Here you go. Though this one doesn't seem to be worth a histogram: > >Over 30.0 days, 63209 tokens were used in scoring a total of 1107800 >times >Largest number of uses 11144, smallest 1 > > 0-500 uses 62929 Could you give more detail on this bucket? Over 99% of your tokens are here. - Alex From matt at mondoinfo.com Mon Jun 2 23:07:46 2003 From: matt at mondoinfo.com (Matthew Dixon Cowles) Date: Mon Jun 2 23:08:03 2003 Subject: [spambayes-dev] Re: [Spambayes] Database cleaning? In-Reply-To: <20030603025006.162D72DE98@cashew.wolfskeep.com> References: <3ED6F33F.9050000@mailcom.com> <20030531170037.10DB82DDF2@cashew.wolfskeep.com> <1054430548.31.1335@sake.mondoinfo.com> <20030602034621.1148D2DE9A@cashew.wolfskeep.com> <1054582419.53.613@sake.mondoinfo.com> <20030602210510.600EB2DE98@cashew.wolfskeep.com> <1054603627.46.805@sake.mondoinfo.com> <20030603025006.162D72DE98@cashew.wolfskeep.com> Message-ID: <1054609440.76.1032@sake.mondoinfo.com> [me] > 0-500 uses 62929 [Alex Popiel] > Could you give more detail on this bucket? Over 99% of your > tokens are here. Sure. It really should have said 0-499 but I'm sure that everyone figured that out. Here it is by 50s. The total is slightly larger since I've gotten some mail since the last count. 0-49 uses 60444 50-99 uses 1403 100-149 uses 466 150-199 uses 217 200-249 uses 141 250-299 uses 78 300-349 uses 78 350-399 uses 50 400-449 uses 38 450-499 uses 28 Regards, matt From popiel at wolfskeep.com Mon Jun 2 23:12:19 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Tue Jun 3 01:12:23 2003 Subject: [spambayes-dev] Re: [Spambayes] Database cleaning? In-Reply-To: Message from Matthew Dixon Cowles of "Mon, 02 Jun 2003 22:07:46 CDT." <1054609440.76.1032@sake.mondoinfo.com> References: <3ED6F33F.9050000@mailcom.com> <20030531170037.10DB82DDF2@cashew.wolfskeep.com> <1054430548.31.1335@sake.mondoinfo.com> <20030602034621.1148D2DE9A@cashew.wolfskeep.com> <1054582419.53.613@sake.mondoinfo.com> <20030602210510.600EB2DE98@cashew.wolfskeep.com> <1054603627.46.805@sake.mondoinfo.com> <20030603025006.162D72DE98@cashew.wolfskeep.com> <1054609440.76.1032@sake.mondoinfo.com> Message-ID: <20030603051219.513772DE98@cashew.wolfskeep.com> In message: <1054609440.76.1032@sake.mondoinfo.com> Matthew Dixon Cowles writes: >[me] >> 0-500 uses 62929 > >[Alex Popiel] >> Could you give more detail on this bucket? Over 99% of your >> tokens are here. > >Sure. It really should have said 0-499 but I'm sure that everyone >figured that out. Here it is by 50s. The total is slightly larger >since I've gotten some mail since the last count. > > 0-49 uses 60444 > 50-99 uses 1403 >100-149 uses 466 >150-199 uses 217 >200-249 uses 141 >250-299 uses 78 >300-349 uses 78 >350-399 uses 50 >400-449 uses 38 >450-499 uses 28 Interesting. If you plot this (and your other data, scaled suitably to reflect the different bucket sizes) on log-log axes, then you get a straight line (up to the point that the data becomes too sparse to be useful). I really ought to instrument my own test db; the next question I have is "What do the numbers become if you only count uses where the word prob was outside .4-.6?" Hrm. I think I'm trying to narrow in on a pruning criterion along the lines of "If it hasn't contributed to classification more than once every N days (on average), then it's safe to drop it." - Alex From noreply at sourceforge.net Tue Jun 3 00:09:40 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 3 10:28:50 2003 Subject: [spambayes-dev] [ spambayes-Bugs-747945 ] apparent corruption of Received lines Message-ID: Bugs item #747945, was opened at 2003-06-03 06:09 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747945&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Raphael Manfredi (rmanfredi) Assigned to: Nobody/Anonymous (nobody) Summary: apparent corruption of Received lines Initial Comment: It looks like the message header is corrupted by hammiefilter.py. Here's my evidence: I'm running the 1.0a2 release. I use "mailagent" to filter my mail, which is like procmail (with a much cleaner code base and readable rules) and I've set up the rules like that: ### Begin SPAM filtering section ### { FEED ~/bin/spambayes/hammiefilter.py; RESYNC; REJECT; }; X-Spambayes-Classification: /^spam/ { REJECT SPAM } /^ham/ { REJECT INITIAL } /^unsure/ { REJECT LOST }; # Dispatch to spam/lost, keep going if ham... { VACATION off; BIFF off; REJECT }; { SAVE lost }; { SAVE spam }; ### End of SPAM filtering section ### The "FEED" command pipes the message to the specified scripts and gets the output back as the new message. The RESYNC command reparses the headers, since they might have been changed by the command (and indeed, they are since X-Spambayes-Classification is added!). However, here are the logs I see: ------------------------------ 03/06/03 07:44:37 mailagent[277851]: FROM 03/06/03 07:44:37 mailagent[277851]: RELAYED hbs43-138.hbs.edu, orb.pobox.com, pop.free.fr, lyon.ram.loc 03/06/03 07:44:37 mailagent[277851]: ABOUT Approved 03/06/03 07:44:37 mailagent[277851]: MATCH on rule #68 in mode INITIAL 03/06/03 07:44:39 mailagent[277851]: FED [qm16938] through '~/bin/spambayes/hammiefilter.py' 03/06/03 07:44:40 mailagent[277851]: WARNING no by in first Received: line 'from localhost (mail@localhost [127.0.0.1])h535iTik016935 for ; Tue, 3 Jun 2003 07:44:30 +0200' 03/06/03 07:44:40 mailagent[277851]: weird Received: line 'from localhost (mail@localhost [127.0.0.1])h535iTik016935 for ; Tue, 3 Jun 2003 07:44:30 +0200' 03/06/03 07:44:40 mailagent[277851]: RESYNCED [qm16938] 03/06/03 07:44:40 mailagent[277851]: REJECTED [qm16938] in state INITIAL 03/06/03 07:44:40 mailagent[277851]: MATCH on rule #69 in mode INITIAL 03/06/03 07:44:40 mailagent[277851]: entering new state SPAM --------------------------------- Look at the WARNING about the absence of "by" in Received. The problem is that no such warning is originally emitted, but it appears during RESYNC when the headers are reparsed after the message went through hammiefilter.py. This indicates a corruption problem, since mailagent uses the same code to reparse the headers during RESYNC as it used originally, and no warning were emitted before FEED was run. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747945&group_id=61702 From noreply at sourceforge.net Tue Jun 3 08:13:45 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 3 10:28:50 2003 Subject: [spambayes-dev] [ spambayes-Bugs-748199 ] Spam score now shows as decimal Message-ID: Bugs item #748199, was opened at 2003-06-03 09:13 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=748199&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Peter Zalinski (pezalinski) Assigned to: Mark Hammond (mhammond) Summary: Spam score now shows as decimal Initial Comment: When I created a "Spam" column in Outlook 2000, and scored the folder using your product, everything was grand -- the scores showed up as percentages. Then I moved the "Spam" column by dragging it over a few columns, and the scores converted to decimals, and I can't seem to get them to convert back to a percentage, no matter what I try. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=748199&group_id=61702 From noreply at sourceforge.net Tue Jun 3 08:10:54 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 3 10:28:51 2003 Subject: [spambayes-dev] [ spambayes-Bugs-721664 ] mboxtrain.py doesn't find Maildir tmp/ directory properly Message-ID: Bugs item #721664, was opened at 2003-04-15 10:10 Message generated for change (Comment added) made by avalon You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=721664&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: David M. Cooke (dmcooke) Assigned to: Nobody/Anonymous (nobody) Summary: mboxtrain.py doesn't find Maildir tmp/ directory properly Initial Comment: The Maildir handler for mboxtrain.py tries to use a tmp/ under the directory passed to it. It should use a tmp/ directory at the same level (so given a ~/Maildir/cur, it should use ~/Maildir/tmp). ---------------------------------------------------------------------- Comment By: Daniel Buchmann (avalon) Date: 2003-06-03 16:10 Message: Logged In: YES user_id=184577 Hi, just wanted to confirm this bug as of CVS 2003-06-03. The patch attached by dmcooke fixes the bug for me too, so I agree with dripton, it would be nice if the patch could be applied. Thanks. :) ---------------------------------------------------------------------- Comment By: David Ripton (dripton) Date: 2003-05-17 14:26 Message: Logged In: YES user_id=9425 This is still broken as of latest CVS on 2003-05-17 Example output from an mboxtrain cron job very similar to the one in HAMMIE.txt: Training ham (/home/dripton/Maildir/): Reading as Maildir 1052838601.29078_0.ludo:2,RS 1052472674.1616_0.ludo:2,RS 1049307199.16131_1.dydimus,S=829:2,S 1047587487.1367_2.hoggle:2,S 1051558961.21629_2.willum:2,RS 1053034631.15484_1.willum:2,S Traceback (most recent call last): File "/home/dripton/src/spambayes/mboxtrain.py", line 294, in ? main() File "/home/dripton/src/spambayes/mboxtrain.py", line 281, in main train(h, g, False, force, trainnew) File "/home/dripton/src/spambayes/mboxtrain.py", line 216, in train maildir_train(h, os.path.join(path, "cur"), is_spam, force) File "/home/dripton/src/spambayes/mboxtrain.py", line 113, in maildir_train f = file(tfn, "wb") IOError: [Errno 2] No such file or directory: '/home/dripton/Maildir/cur/tmp/1053163264.21320_6.vidar' The one-line patch already attached to this bug works for me. Please apply it to CVS. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=721664&group_id=61702 From matt at mondoinfo.com Tue Jun 3 16:01:56 2003 From: matt at mondoinfo.com (Matthew Dixon Cowles) Date: Tue Jun 3 16:08:51 2003 Subject: [spambayes-dev] Re: [Spambayes] Database cleaning? In-Reply-To: <20030603051219.513772DE98@cashew.wolfskeep.com> References: <3ED6F33F.9050000@mailcom.com> <20030531170037.10DB82DDF2@cashew.wolfskeep.com> <1054430548.31.1335@sake.mondoinfo.com> <20030602034621.1148D2DE9A@cashew.wolfskeep.com> <1054582419.53.613@sake.mondoinfo.com> <20030602210510.600EB2DE98@cashew.wolfskeep.com> <1054603627.46.805@sake.mondoinfo.com> <20030603025006.162D72DE98@cashew.wolfskeep.com> <1054609440.76.1032@sake.mondoinfo.com> <20030603051219.513772DE98@cashew.wolfskeep.com> Message-ID: <1054669602.93.1607@sake.mondoinfo.com> [Alex Popiel] > Interesting. If you plot this (and your other data, scaled > suitably to reflect the different bucket sizes) on log-log > axes, then you get a straight line (up to the point that the > data becomes too sparse to be useful). The data seems remarkably uniform to me. > I really ought to instrument my own test db; the next question > I have is "What do the numbers become if you only count uses > where the word prob was outside .4-.6?" My data is a record of tokens that were actually used in scoring so I think they were outside [.4-.6]. At least, I haven't fiddled with options.minimum_prob_strength. > I think I'm trying to narrow in on a pruning criterion along the > lines of "If it hasn't contributed to classification more than once > every N days (on average), then it's safe to drop it." I fiddled my classifier last night to compute several scores and log them when it computes one. The other scores ignore words that haven't figured in scoring in one day, one week, two weeks, and thirty days. A random scroll through the results suggests that they look pretty promising so far. Of course, given that people report good results even with minimal training, I guess that's not too surprising. Regards, Matt From noreply at sourceforge.net Tue Jun 3 15:36:00 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 3 20:31:14 2003 Subject: [spambayes-dev] [ spambayes-Bugs-748472 ] Fails to load Message-ID: Bugs item #748472, was opened at 2003-06-03 14:36 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=748472&group_id=61702 Category: Outlook Group: v1.0 (example) Status: Open Resolution: None Priority: 5 Submitted By: Benson Hougland (bensonh) Assigned to: Mark Hammond (mhammond) Summary: Fails to load Initial Comment: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 152, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 355, in LoadConfig exceptions.EOFError: ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=748472&group_id=61702 From noreply at sourceforge.net Tue Jun 3 20:00:04 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 3 22:06:46 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743408 ] Install fails Outlook 2000, WinXP Message-ID: Bugs item #743408, was opened at 2003-05-25 21:54 Message generated for change (Comment added) made by shokk You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tim Scott (kaleid) Assigned to: Nobody/Anonymous (nobody) Summary: Install fails Outlook 2000, WinXP Initial Comment: During install of version 002 for Outlook 2000 in Win Xp Pro, I get this error: --------------------------- Error --------------------------- C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation. --------------------------- Abort Retry Ignore --------------------------- There is a spambayes1.log file in my %TEMP%, but it is empty. ---------------------------------------------------------------------- Comment By: Ernie Oporto (shokk) Date: 2003-06-03 22:00 Message: Logged In: YES user_id=3668 I am also seeing this for Outlook 2002 on WinXPPro Sp1. I have no log files available anywhere. I have Python 2.3b1 installed. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-26 13:37 Message: Logged In: YES user_id=782486 My fault I suppose for failing to detail it but I saw this same error message when my problem occurred. Also see posting #741914. I suspect all three bug reports are the same regsvr error. -Rob ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-26 13:30 Message: Logged In: YES user_id=20330 I saw item #740198, but the symptoms reported were different than this one, so I didn't consider this a dupe. Causes may be the same, however. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-26 13:16 Message: Logged In: YES user_id=782486 This is the same as my submitted item #740198. -Rob (rdesantos) ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-26 01:22 Message: Logged In: YES user_id=20330 I tried re-installing (no other actions)....and it worked. Weird. ---------------------------------------------------------------------- Comment By: Mylene Reiners (mylene) Date: 2003-05-26 00:51 Message: Logged In: YES user_id=20281 This problem seems to occur when not only Spambayes is installed, but also Python (2.3b1 ?) Mylene ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 From mhammond at skippinet.com.au Wed Jun 4 13:25:33 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue Jun 3 22:26:26 2003 Subject: [spambayes-dev] More message munging Message-ID: <007301c32a40$95d2ce80$530f8490@eden> For those who don't follow the checkins. This is following from Tim P's message about hacks to get the email package to recognise the message. I made the following patch to Outlook, and should we decide on a "choke point", the patch below should be included in some form. After checking it in, I noticed: * If the first header is "Content-Type", my patch will fail to fix the problem. * A better solution may be to fall back to the "rfc822" module. It is pretty dumb about mime, and should be easy to use to perform better header translations before feeding into the email package (or indeed, the rfc822.Message object may be able to be used in place of the email package object. But I'm not going to bother with either of them for Outlook. Mark -----Original Message----- From: spambayes-checkins-bounces@python.org [mailto:spambayes-checkins-bounces@python.org]On Behalf Of Mark Hammond Sent: Wednesday, 4 June 2003 11:47 AM To: spambayes-checkins@python.org Subject: [Spambayes-checkins] spambayes/Outlook2000 msgstore.py,1.44,1.45 Update of /cvsroot/spambayes/spambayes/Outlook2000 In directory sc8-pr-cvs1:/tmp/cvs-serv20925 Modified Files: msgstore.py Log Message: Get even more brutal with the message when the email package still barfs. Index: msgstore.py =================================================================== RCS file: /cvsroot/spambayes/spambayes/Outlook2000/msgstore.py,v retrieving revision 1.44 retrieving revision 1.45 diff -C2 -d -r1.44 -r1.45 *** msgstore.py 20 May 2003 00:30:45 -0000 1.44 --- msgstore.py 4 Jun 2003 01:47:13 -0000 1.45 *************** *** 666,670 **** # structure, just the words it contains, so no harm and # much good in trying to suppress this error. ! msg = email.message_from_string(text + "\n\n") except: print "FAILED to create email.message from: ", `text` --- 666,687 ---- # structure, just the words it contains, so no harm and # much good in trying to suppress this error. ! try: ! msg = email.message_from_string(text + "\n\n") ! except email.Errors.BoundaryError: ! # But even this doesn't get *everything*. We can still see: ! # "multipart message with no defined boundary" ! # so now it is time to turn into a butcher - hack out ! # the Content-Type header, so we see it as plain text. ! butcher_pos = text.lower().find("\ncontent-type: ") ! if butcher_pos < 0: ! # This error just just gunna get caught below anyway ! raise RuntimeError( ! "email package croaked with boundary error, but " ! "there appears to be no 'Content-Type' header") ! # Put it back together, skipping the original "\n" but ! # leaving the header leaving "\nSpamBayes-Content-Type: " ! butchered = text[:butcher_pos] + "\nSpamBayes-" + \ ! text[butcher_pos+1:] + "\n\n" ! msg = email.message_from_string(butchered) except: print "FAILED to create email.message from: ", `text` _______________________________________________ Spambayes-checkins mailing list Spambayes-checkins@python.org http://mail.python.org/mailman/listinfo/spambayes-checkins From mhammond at skippinet.com.au Wed Jun 4 14:03:06 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue Jun 3 23:03:54 2003 Subject: [spambayes-dev] New Outlook toolbars Message-ID: <001e01c32a45$d2e419a0$530f8490@eden> Hi all, I just checked into CVS code that changes the way Toolbars are managed. Specifically, we create our own toolbar, and all items are created permanent. I hope that the only visible change you will see is a new toolbar, and our items appearing on that instead of the standard one. You can drag the new toolbar anywhere you like, (including floating) and it should persist. I am worried you will see various things apart from that, so please let me know! I am hoping this will fix the problems with multiple buttons, and if we are really lucky, fix Outlook 2003 problems. Mark. From noreply at sourceforge.net Tue Jun 3 21:23:57 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 3 23:25:10 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743408 ] Install fails Outlook 2000, WinXP Message-ID: Bugs item #743408, was opened at 2003-05-25 21:54 Message generated for change (Comment added) made by shokk You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tim Scott (kaleid) Assigned to: Nobody/Anonymous (nobody) Summary: Install fails Outlook 2000, WinXP Initial Comment: During install of version 002 for Outlook 2000 in Win Xp Pro, I get this error: --------------------------- Error --------------------------- C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation. --------------------------- Abort Retry Ignore --------------------------- There is a spambayes1.log file in my %TEMP%, but it is empty. ---------------------------------------------------------------------- Comment By: Ernie Oporto (shokk) Date: 2003-06-03 23:23 Message: Logged In: YES user_id=3668 I fixed this by removing Python 2.3b1. After that it immediately installed correctly. ---------------------------------------------------------------------- Comment By: Ernie Oporto (shokk) Date: 2003-06-03 22:00 Message: Logged In: YES user_id=3668 I am also seeing this for Outlook 2002 on WinXPPro Sp1. I have no log files available anywhere. I have Python 2.3b1 installed. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-26 13:37 Message: Logged In: YES user_id=782486 My fault I suppose for failing to detail it but I saw this same error message when my problem occurred. Also see posting #741914. I suspect all three bug reports are the same regsvr error. -Rob ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-26 13:30 Message: Logged In: YES user_id=20330 I saw item #740198, but the symptoms reported were different than this one, so I didn't consider this a dupe. Causes may be the same, however. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-26 13:16 Message: Logged In: YES user_id=782486 This is the same as my submitted item #740198. -Rob (rdesantos) ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-26 01:22 Message: Logged In: YES user_id=20330 I tried re-installing (no other actions)....and it worked. Weird. ---------------------------------------------------------------------- Comment By: Mylene Reiners (mylene) Date: 2003-05-26 00:51 Message: Logged In: YES user_id=20281 This problem seems to occur when not only Spambayes is installed, but also Python (2.3b1 ?) Mylene ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 From noreply at sourceforge.net Tue Jun 3 21:34:37 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:19 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743408 ] Install fails Outlook 2000, WinXP Message-ID: Bugs item #743408, was opened at 2003-05-25 21:54 Message generated for change (Comment added) made by rdesantos You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tim Scott (kaleid) Assigned to: Nobody/Anonymous (nobody) Summary: Install fails Outlook 2000, WinXP Initial Comment: During install of version 002 for Outlook 2000 in Win Xp Pro, I get this error: --------------------------- Error --------------------------- C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation. --------------------------- Abort Retry Ignore --------------------------- There is a spambayes1.log file in my %TEMP%, but it is empty. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-06-03 23:34 Message: Logged In: YES user_id=782486 Hmm. I've never had Python installed so that doesn't explain why I have this problem or how to work around it. -Rob ---------------------------------------------------------------------- Comment By: Ernie Oporto (shokk) Date: 2003-06-03 23:23 Message: Logged In: YES user_id=3668 I fixed this by removing Python 2.3b1. After that it immediately installed correctly. ---------------------------------------------------------------------- Comment By: Ernie Oporto (shokk) Date: 2003-06-03 22:00 Message: Logged In: YES user_id=3668 I am also seeing this for Outlook 2002 on WinXPPro Sp1. I have no log files available anywhere. I have Python 2.3b1 installed. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-26 13:37 Message: Logged In: YES user_id=782486 My fault I suppose for failing to detail it but I saw this same error message when my problem occurred. Also see posting #741914. I suspect all three bug reports are the same regsvr error. -Rob ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-26 13:30 Message: Logged In: YES user_id=20330 I saw item #740198, but the symptoms reported were different than this one, so I didn't consider this a dupe. Causes may be the same, however. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-26 13:16 Message: Logged In: YES user_id=782486 This is the same as my submitted item #740198. -Rob (rdesantos) ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-26 01:22 Message: Logged In: YES user_id=20330 I tried re-installing (no other actions)....and it worked. Weird. ---------------------------------------------------------------------- Comment By: Mylene Reiners (mylene) Date: 2003-05-26 00:51 Message: Logged In: YES user_id=20281 This problem seems to occur when not only Spambayes is installed, but also Python (2.3b1 ?) Mylene ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 From noreply at sourceforge.net Wed Jun 4 01:51:35 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:20 2003 Subject: [spambayes-dev] [ spambayes-Bugs-748472 ] Fails to load Message-ID: Bugs item #748472, was opened at 2003-06-04 07:36 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=748472&group_id=61702 Category: Outlook Group: v1.0 (example) >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Benson Hougland (bensonh) Assigned to: Mark Hammond (mhammond) Summary: Fails to load Initial Comment: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 152, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 355, in LoadConfig exceptions.EOFError: ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-04 17:51 Message: Logged In: YES user_id=14198 Thanks for your report. This bug has previously been reported, and has already been fixed in the source-code - so you can expect to see the fix in the next binary release. You may wish to subscribe to the spambayes-announce list to see when each version is released - details at http://mail.python.org/mailman-21/listinfo/spambayes-announce If you are technically minded, you may like to run from the CVS version of the source-code, and you should find the fix immediately - see the main SpamBayes pages for details. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=748472&group_id=61702 From noreply at sourceforge.net Wed Jun 4 01:53:18 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:21 2003 Subject: [spambayes-dev] [ spambayes-Bugs-747945 ] apparent corruption of Received lines Message-ID: Bugs item #747945, was opened at 2003-06-03 16:09 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747945&group_id=61702 >Category: hammie Group: None Status: Open Resolution: None Priority: 5 Submitted By: Raphael Manfredi (rmanfredi) Assigned to: Nobody/Anonymous (nobody) Summary: apparent corruption of Received lines Initial Comment: It looks like the message header is corrupted by hammiefilter.py. Here's my evidence: I'm running the 1.0a2 release. I use "mailagent" to filter my mail, which is like procmail (with a much cleaner code base and readable rules) and I've set up the rules like that: ### Begin SPAM filtering section ### { FEED ~/bin/spambayes/hammiefilter.py; RESYNC; REJECT; }; X-Spambayes-Classification: /^spam/ { REJECT SPAM } /^ham/ { REJECT INITIAL } /^unsure/ { REJECT LOST }; # Dispatch to spam/lost, keep going if ham... { VACATION off; BIFF off; REJECT }; { SAVE lost }; { SAVE spam }; ### End of SPAM filtering section ### The "FEED" command pipes the message to the specified scripts and gets the output back as the new message. The RESYNC command reparses the headers, since they might have been changed by the command (and indeed, they are since X-Spambayes-Classification is added!). However, here are the logs I see: ------------------------------ 03/06/03 07:44:37 mailagent[277851]: FROM 03/06/03 07:44:37 mailagent[277851]: RELAYED hbs43-138.hbs.edu, orb.pobox.com, pop.free.fr, lyon.ram.loc 03/06/03 07:44:37 mailagent[277851]: ABOUT Approved 03/06/03 07:44:37 mailagent[277851]: MATCH on rule #68 in mode INITIAL 03/06/03 07:44:39 mailagent[277851]: FED [qm16938] through '~/bin/spambayes/hammiefilter.py' 03/06/03 07:44:40 mailagent[277851]: WARNING no by in first Received: line 'from localhost (mail@localhost [127.0.0.1])h535iTik016935 for ; Tue, 3 Jun 2003 07:44:30 +0200' 03/06/03 07:44:40 mailagent[277851]: weird Received: line 'from localhost (mail@localhost [127.0.0.1])h535iTik016935 for ; Tue, 3 Jun 2003 07:44:30 +0200' 03/06/03 07:44:40 mailagent[277851]: RESYNCED [qm16938] 03/06/03 07:44:40 mailagent[277851]: REJECTED [qm16938] in state INITIAL 03/06/03 07:44:40 mailagent[277851]: MATCH on rule #69 in mode INITIAL 03/06/03 07:44:40 mailagent[277851]: entering new state SPAM --------------------------------- Look at the WARNING about the absence of "by" in Received. The problem is that no such warning is originally emitted, but it appears during RESYNC when the headers are reparsed after the message went through hammiefilter.py. This indicates a corruption problem, since mailagent uses the same code to reparse the headers during RESYNC as it used originally, and no warning were emitted before FEED was run. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747945&group_id=61702 From noreply at sourceforge.net Wed Jun 4 01:56:11 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:22 2003 Subject: [spambayes-dev] [ spambayes-Bugs-712480 ] Install error: bizarre import errors Message-ID: Bugs item #712480, was opened at 2003-03-31 15:47 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul Marrero (pmarrero) Assigned to: Mark Hammond (mhammond) >Summary: Install error: bizarre import errors Initial Comment: I use office XP with the Outlook client. It appears that the registration was successfull but I cannnot find any menu buttons. XP clipboard does appear to have the Icons. The command line train works. Not sure where to go from here. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-25 21:41 Message: Logged In: YES user_id=14198 Failes with Outlook XP ver.10.4712.4219 sp-2 ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-23 09:40 Message: Logged In: YES user_id=14198 *sob* - so it certainly fails with Outlook SP2 here? ecvogel - is your traceback identical to this one? ---------------------------------------------------------------------- Comment By: Eric C. Vogel (ecvogel) Date: 2003-05-23 05:20 Message: Logged In: YES user_id=784818 Same here. Outlook XP all SP and patches, WIn XP Pro SP1 all patches. The file is registered and in the COM and has the check. Exchange Server 2000 all SP and patches on SBS 2kSP-1 all patches. 002 install ---------------------------------------------------------------------- Comment By: Steve McCullough (sm5669) Date: 2003-05-20 08:24 Message: Logged In: YES user_id=782446 Same here. Outlook 2002 on Win2000 with all the latest SPs and patches applied. Log file... warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\universal.py", line 170, in dispatch retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 322, in _InvokeEx_ return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 541, in _invokeex_ return apply(func, args) File "E:\src\spambayes\Outlook2000\addin.py", line 654, in OnConnection File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\manager.py", line 15, in ? File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\msgstore.py", line 86, in ? File "E:\src\Installer\iu.py", line 296, in importHook exceptions.ImportError: No module named win32com.mapi ---------------------------------------------------------------------- Comment By: Tormod Hystad (thystad) Date: 2003-05-16 17:27 Message: Logged In: YES user_id=780089 Exactly same as Nick here. Win2003 RC2, Outlook XP (Norwegian, version 10.2627.4219 SP2, configured with one Exchange (Standard) email acount) and the 002 install package. Thanks, Tormod ---------------------------------------------------------------------- Comment By: Nick Trautmann (nicktrautmann) Date: 2003-05-12 18:15 Message: Logged In: YES user_id=776857 Similar here: pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\Nick\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize Thanks Nick ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-01 12:48 Message: Logged In: YES user_id=552329 Actually, I get this too. I've just switched to Outlook XP, so I'm not sure if this is the reason, or just that I'm doing a fresh install. The log includes the following traces: SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-03-31 16:05 Message: Logged In: YES user_id=552329 Which version of the Outlook plugin are you using? (a) the latest CVS, (b) the 001 stand-alone installer, or (c) the 002 stand-alone installer? I know that the 001 installer has been known to have this problem (although it appeared to be fixed in 002). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 From noreply at sourceforge.net Wed Jun 4 01:59:12 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:23 2003 Subject: [spambayes-dev] [ spambayes-Bugs-705379 ] Outlook dialogs sometimes become unresponsive Message-ID: Bugs item #705379, was opened at 2003-03-18 14:43 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=705379&group_id=61702 Category: Outlook Group: None Status: Open >Resolution: Fixed Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Outlook dialogs sometimes become unresponsive Initial Comment: The training and filtering dialogs sometimes become unresponsive during filtering/training. They shouldn't, as hoops are jumped through to keep the UI and worker in separate threads. Further, it only seems to happen on "large" folders - eg, I can provoke it on my Inbox, but not on smaller folders. I'm guessing some bullshit COM/Outlook thread rule I am breaking. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 17:59 Message: Logged In: YES user_id=14198 Dropping the background thread priority seemed to fix this - haven't seen it for ages. Now the dialog is no less responsive than the machine itself. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=705379&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:00:42 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:24 2003 Subject: [spambayes-dev] [ spambayes-Bugs-725449 ] Binary plugin fails on Win9x: encoding exception Message-ID: Bugs item #725449, was opened at 2003-04-22 14:41 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725449&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: dan maer (dmara) Assigned to: Mark Hammond (mhammond) >Summary: Binary plugin fails on Win9x: encoding exception Initial Comment: Running pure Windows (98SE) and Outlook 2000. No Python installed. I can't get the plugin to initialize. I go to Tools/options/other/advanced options/Com Addins and check the SpamBayes plug-in box, but it won't stayed checked, and I've uninstalled/reinstalled Outlook and the plugin but no go. I've got this working great, really great, on Winnt 4.0 and Xp. Logfile being attached for upload... Dan ---------------------------------------------------------------------- Comment By: Helge Stenstr?m (helge) Date: 2003-05-02 19:27 Message: Logged In: YES user_id=19115 Installation fails on Windows 2000. I have: * http://starship.python.net/crew/mhammond/downloads/SpamBayes-Outlook-Setup-002.exe (2968028 bytes) * ActivePython-2.2.2-224-win32-ix86.msi The log file talks about files on E:\, which isn't mounted. An earlier log file (which seems to be overwritten, I can't find it) claimed to have written stuff to c:/Documents and Settings/erahege/Application Data/SpamBayes/.pck, but that file is nonexistent. I have NOT installed http://starship.python.net/crew/mhammond/downloads/win32all-152.exe, beacuse the instructions on http://starship.python.net/crew/mhammond/spambayes/ doesn't say so. My log file is very similar to the one already uploaded. I will upload my log file if I find out how to do it. ---------------------------------------------------------------------- Comment By: J (usertgo) Date: 2003-04-26 15:03 Message: Logged In: YES user_id=763953 ok, since i liked it so much on win2000 i did the python install & manual install of the spambayes outlook addin & its working good now, so i guess it was the installer. thanks ---------------------------------------------------------------------- Comment By: J (usertgo) Date: 2003-04-24 15:10 Message: Logged In: YES user_id=763953 I also get the same errors as in the logfile on a Win98SE installation w/Outlook 2000 SP1, but it works on Win2000 w/same Outlook. If you have any workarounds (w/regedit?) please let us know... ---------------------------------------------------------------------- Comment By: dan maer (dmara) Date: 2003-04-23 13:06 Message: Logged In: YES user_id=759684 Ok Mark... Anyway to bypass the installer issue and get it working by manual means? Dan ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-04-23 09:19 Message: Logged In: YES user_id=14198 This is an issue with the "Installer" tool I use. I will try and sus it out before the next binary release. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725449&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:02:07 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:26 2003 Subject: [spambayes-dev] [ spambayes-Bugs-747472 ] Install fails on W2K, OL2K... Message-ID: Bugs item #747472, was opened at 2003-06-02 21:25 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747472&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: rainer scherg (rasc) Assigned to: Mark Hammond (mhammond) Summary: Install fails on W2K, OL2K... Initial Comment: Trying to install spambayes-Outlook-Setup-002.exe fails: System: Win2K Sp1 (German) Outlook 2K SR1 (German) - McAfee Virusscanner - Exchange Connector In fact it failed on 3 computers and succeded on 1 computer (all same basic installation) ... I tried also to switch off Virusscan plugin. --Log (installation): Registered: SpamBayes.OutlookAddin ----------------------------- Buttons are not visible in Outlook. Troubleshoot: SpamBayes Com-Addin in Outlook is "un-checked". Trying to activate the add-in fails (after reentering the add-in applet, the SpamBayes add-in entry again is unchecked). -- Log: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Dokumente und Einstellungen\rainsche\Anwendungsdaten\SpamBayes\d efault_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:02 Message: Logged In: YES user_id=14198 Dupe of 740311 ---------------------------------------------------------------------- Comment By: rainer scherg (rasc) Date: 2003-06-03 02:17 Message: Logged In: YES user_id=115734 Somethbing I tried after reading in the maillist archives: I removed SpamBayes COM-Add-in from Outlook an re-added it. I this case it was "selected" (checked), but Outlook crashed with an exception. BTW: I have administrator rights on the computers I tried... ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747472&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:07:27 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:27 2003 Subject: [spambayes-dev] [ spambayes-Bugs-747458 ] messages not processed Message-ID: Bugs item #747458, was opened at 2003-06-02 20:42 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747458&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Russell S Blue (bluer) Assigned to: Mark Hammond (mhammond) Summary: messages not processed Initial Comment: AFter installing SpamBayes about a week ago, and coming to count on its performance, midday yesterday it stopped processing messages. I noticed it originally not because some spam got through, but because messages were coming in wiithout a spam scare. Anyway, now when I run "Filter Now", it comes up with all errors... log attached. I running a binary version, which for the heck of it I tried reinstalling several times last night (no effect). Windows XP Outlook 2002 ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:07 Message: Logged In: YES user_id=14198 This is a dupe of 706520 - but never before seen on Windows. The solution is to do a "full retrain" - that is, go to "Anti-Spam Manager->Train Now, and select the "Rebuild Entire Database" option. ---------------------------------------------------------------------- Comment By: Russell S Blue (bluer) Date: 2003-06-02 20:46 Message: Logged In: YES user_id=791991 Cut out big chunk of the errors (in the log file)\ to get it to upload... ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=747458&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:07:55 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:28 2003 Subject: [spambayes-dev] [ spambayes-Bugs-706520 ] assert fails in classifier Message-ID: Bugs item #706520, was opened at 2003-03-20 07:46 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706520&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Adam Glass (adamglass) Assigned to: Nobody/Anonymous (nobody) Summary: assert fails in classifier Initial Comment: This morning, I noticed that my emails no longer had a X-Spambayes-Classification header, so I looked through my procmail logs, and sure enough, hammiefilter.py is giving a traceback when an assertion fails. This happens on all messages now; it is not specific to a single message, or intermittent. Therefore, I suspect my .hammiedb is corrupted... I can supply it to anyone who would like to investigate it for debugging purposes. I am using Spambayes 1.0a2, installed on a system with Python 2.2.1, with the new version of the email library (as per the install docs.) Please contact me if you require any further details. Example of how to generate the error follows, along with traceback: adam$ /usr/local/bin/hammiefilter.py -f -d $HOME/.hammiedb < example Traceback (most recent call last): File "/usr/local/bin/hammiefilter.py", line 179, in ? main() File "/usr/local/bin/hammiefilter.py", line 175, in main action(msg) File "/usr/local/bin/hammiefilter.py", line 113, in filter return h.filter(msg) File "/usr/local/lib/python2.2/site-packages/spambayes/hammie.py", line 108, in filter prob, clues = self._scoremsg(msg, True) File "/usr/local/lib/python2.2/site-packages/spambayes/hammie.py", line 38, in _scoremsg return self.bayes.spamprob(tokenize(msg), evidence) File "/usr/local/lib/python2.2/site-packages/spambayes/classifier.py", line 217, in chi2_spamprob clues = self._getclues(wordstream) File "/usr/local/lib/python2.2/site-packages/spambayes/classifier.py", line 441, in _getclues prob = self.probability(record) File "/usr/local/lib/python2.2/site-packages/spambayes/classifier.py", line 304, in probability assert spamcount <= nspam AssertionError ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:07 Message: Logged In: YES user_id=14198 Noting we have an Outlook user with this now too: [ 747458 ] messages not processed (which I closed as a dupe) ---------------------------------------------------------------------- Comment By: Leonid (leobru) Date: 2003-05-31 10:36 Message: Logged In: YES user_id=790676 This happens, e.g., if a forced re-training was performed on a non-empty database, thus screwing up the message counts - this is for sure, I was bitten by it myself; or, potentially, if hammiefilter.py -t and mboxtrain.py were running at the same time ??? To avoid: do not do it (I do not use hammiefilter.py -t to be on the safe side). To fix, once it happens: start from scratch. Good to have in the next version: a database validator and corrector. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706520&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:10:30 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:29 2003 Subject: [spambayes-dev] [ spambayes-Bugs-744380 ] W982E/Outlook 2000: exception on loading Message-ID: Bugs item #744380, was opened at 2003-05-28 02:51 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=744380&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Steve Clift (sclift) Assigned to: Mark Hammond (mhammond) Summary: W982E/Outlook 2000: exception on loading Initial Comment: Windows 98 2nd Edition Outlook 2000 SR-1 - Corporate or Workgroup SpamBayes throws an execption when loading. From the log file: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:10 Message: Logged In: YES user_id=14198 Dupe of: 725449 Binary plugin fails on Win9x: encoding exception ---------------------------------------------------------------------- Comment By: Larry Jobbins (jobbins) Date: 2003-05-30 02:28 Message: Logged In: YES user_id=788287 Looks similar to 725449 and 740893. ---------------------------------------------------------------------- Comment By: Larry Jobbins (jobbins) Date: 2003-05-28 14:26 Message: Logged In: YES user_id=788287 Same error. Installed Setup-002.exe from http://starship.python.net/crew/mhammond/spambayes/ Using Win98SE, Outlook 2000, all MS updates. Shows add-in, but won't stay checked, no icon appears. Install log looks same - pythoncom error: Failed to call the universal dispatcher, etc ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=744380&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:11:51 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:30 2003 Subject: [spambayes-dev] [ spambayes-Bugs-740893 ] failure to install Message-ID: Bugs item #740893, was opened at 2003-05-21 14:44 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Jack Steiner (steinman1950) Assigned to: Mark Hammond (mhammond) Summary: failure to install Initial Comment: Attempted install of binary version on W98 machine running Outlook2000. Did install and uninstall twice. As directed by the troubleshooting guide, checked the COM Add-ins - Spambayes was listed but not checked. I checked it, closed the dialog box, reopened the dialog box and it was not checked again. I ran regsvr32.exe but got an error. This may be similar to ID #725449. Jack Steiner 502-228-8110 jack@steiner.win.net ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:11 Message: Logged In: YES user_id=14198 Dupe of #740198 ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 14:21 Message: Logged In: YES user_id=552329 Note that for the most part this is a duplicate of [ 725449 ] Binary plugin fails on Win9x However, the two people that had it working and then has problems have (presumably) a different problem. Maybe their comments ought to be moved into a separate bug. ---------------------------------------------------------------------- Comment By: Stephen Green (sgreen0) Date: 2003-05-23 08:36 Message: Logged In: YES user_id=784968 I am also having the same problem. After install, the Spambayes tools aren't available in Outlook 2000 (with Win98 SE). In Tools/Options/Other/Advanced Options/Com Addins the Spmbayes add-in appears but in unchecked. No effort on my part can cause the add-in to remain checked. As soon as I click OK, I can go back to Com Addins and it is unchecked again. Stephen ---------------------------------------------------------------------- Comment By: David Sloane (goofanasia) Date: 2003-05-23 08:26 Message: Logged In: YES user_id=240438 I experienced similar behavior on Windows XP SP1 with Outlook 2002 SP2. The install went fine. On first opening Outlook, the toolbar items displayed, and then Outlook crashed. When I opened Outlook a second time, same behavior. Third time opening Outlook - no SpamBayes buttons and the COM AddIn item was un-checked. Since then, it won't remain checked. I've uninstalled and reinstalled SpamBayes twice, but the buttons haven't appeared and the checking the check-box doesn't seem to 'stick.' ---------------------------------------------------------------------- Comment By: George Heidenrich (gheidenr) Date: 2003-05-23 07:28 Message: Logged In: YES user_id=784929 Same problem and results as Jack and radesai. Removed the unchecked add-in at Outlook|Tools|Options|Other|Advanced Options|COM Add-Ins, then attempted to add it again. Received the following Outlook Error: OUTLOOK caused an invalid page fault in module PYTHON23.DLL at 0177:1e03ff78. Registers: EAX=505c3a43 CS=0177 EIP=1e03ff78 EFLGS=00010246 EBX=005a3b70 SS=017f ESP=0056b930 EBP=1e0b8ec0 ECX=005fdc70 DS=017f ESI=005fdc70 FS=6527 EDX=00735cf0 ES=017f EDI=1e03ff70 GS=0000 Bytes at CS:EIP: 8b 48 54 f6 c5 40 74 1f 8b 80 a4 00 00 00 85 c0 Stack dump: 00000000 1e0352fd 005fdc70 00000000 1e0b8ea8 005a3b60 00000002 005f4da0 00000020 1e03ff5d 005fdc70 1e03ff70 00000000 1e0b8ea8 00000001 1e03fd38 George Heidenrich gheidenr@bestweb.net ---------------------------------------------------------------------- Comment By: Ravil A. Desai (radesai) Date: 2003-05-23 04:57 Message: Logged In: YES user_id=753768 I am getting the same error. I had it installed and it worked fine for a day or two. Now, it has stopped working and I get the same errors. I didn't make sense to recreate another bug. I removed and reinstalled, but that did not fix the problem. Thanks radesai ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740893&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:12:44 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:15:31 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743408 ] Install fails Outlook 2000, WinXP Message-ID: Bugs item #743408, was opened at 2003-05-26 11:54 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 Category: None Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Tim Scott (kaleid) Assigned to: Nobody/Anonymous (nobody) Summary: Install fails Outlook 2000, WinXP Initial Comment: During install of version 002 for Outlook 2000 in Win Xp Pro, I get this error: --------------------------- Error --------------------------- C:\Program Files\Spambayes Outlook Addin\spambayes_addin.dll Unable to register the DLL/OCX: DllRegisterServer failed; code 0x00000000. Click Retry to try again, Ignore to proceed anyway (not recommended), or Abort to cancel installation. --------------------------- Abort Retry Ignore --------------------------- There is a spambayes1.log file in my %TEMP%, but it is empty. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:12 Message: Logged In: YES user_id=14198 Dupe of #740198 ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-06-04 13:34 Message: Logged In: YES user_id=782486 Hmm. I've never had Python installed so that doesn't explain why I have this problem or how to work around it. -Rob ---------------------------------------------------------------------- Comment By: Ernie Oporto (shokk) Date: 2003-06-04 13:23 Message: Logged In: YES user_id=3668 I fixed this by removing Python 2.3b1. After that it immediately installed correctly. ---------------------------------------------------------------------- Comment By: Ernie Oporto (shokk) Date: 2003-06-04 12:00 Message: Logged In: YES user_id=3668 I am also seeing this for Outlook 2002 on WinXPPro Sp1. I have no log files available anywhere. I have Python 2.3b1 installed. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-27 03:37 Message: Logged In: YES user_id=782486 My fault I suppose for failing to detail it but I saw this same error message when my problem occurred. Also see posting #741914. I suspect all three bug reports are the same regsvr error. -Rob ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-27 03:30 Message: Logged In: YES user_id=20330 I saw item #740198, but the symptoms reported were different than this one, so I didn't consider this a dupe. Causes may be the same, however. ---------------------------------------------------------------------- Comment By: Rob de Santos (rdesantos) Date: 2003-05-27 03:16 Message: Logged In: YES user_id=782486 This is the same as my submitted item #740198. -Rob (rdesantos) ---------------------------------------------------------------------- Comment By: Tim Scott (kaleid) Date: 2003-05-26 15:22 Message: Logged In: YES user_id=20330 I tried re-installing (no other actions)....and it worked. Weird. ---------------------------------------------------------------------- Comment By: Mylene Reiners (mylene) Date: 2003-05-26 14:51 Message: Logged In: YES user_id=20281 This problem seems to occur when not only Spambayes is installed, but also Python (2.3b1 ?) Mylene ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743408&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:16:08 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:39:10 2003 Subject: [spambayes-dev] [ spambayes-Bugs-745292 ] Logs Show COM error Message-ID: Bugs item #745292, was opened at 2003-05-29 12:23 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=745292&group_id=61702 Category: Outlook Group: v1.0 (example) >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Bryan Hunt (brhunt) Assigned to: Mark Hammond (mhammond) Summary: Logs Show COM error Initial Comment: I installed, configured and trained one day. Everything worked great. Next day, it says that I no longer have any items in the database. The "delete as spam" and "filter now" buttons no longer work. The log files show that there are COM errors. This looks similar to bug 689298. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:16 Message: Logged In: YES user_id=14198 Dupe of [ 706520 ] assert fails in classifier. The solution is to do a "full retrain" - that is, go to "Anti-Spam Manager->Train Now, and select the "Rebuild Entire Database" option. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=745292&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:16:27 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:39:11 2003 Subject: [spambayes-dev] [ spambayes-Bugs-706520 ] assert fails in classifier Message-ID: Bugs item #706520, was opened at 2003-03-20 07:46 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706520&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Adam Glass (adamglass) Assigned to: Nobody/Anonymous (nobody) Summary: assert fails in classifier Initial Comment: This morning, I noticed that my emails no longer had a X-Spambayes-Classification header, so I looked through my procmail logs, and sure enough, hammiefilter.py is giving a traceback when an assertion fails. This happens on all messages now; it is not specific to a single message, or intermittent. Therefore, I suspect my .hammiedb is corrupted... I can supply it to anyone who would like to investigate it for debugging purposes. I am using Spambayes 1.0a2, installed on a system with Python 2.2.1, with the new version of the email library (as per the install docs.) Please contact me if you require any further details. Example of how to generate the error follows, along with traceback: adam$ /usr/local/bin/hammiefilter.py -f -d $HOME/.hammiedb < example Traceback (most recent call last): File "/usr/local/bin/hammiefilter.py", line 179, in ? main() File "/usr/local/bin/hammiefilter.py", line 175, in main action(msg) File "/usr/local/bin/hammiefilter.py", line 113, in filter return h.filter(msg) File "/usr/local/lib/python2.2/site-packages/spambayes/hammie.py", line 108, in filter prob, clues = self._scoremsg(msg, True) File "/usr/local/lib/python2.2/site-packages/spambayes/hammie.py", line 38, in _scoremsg return self.bayes.spamprob(tokenize(msg), evidence) File "/usr/local/lib/python2.2/site-packages/spambayes/classifier.py", line 217, in chi2_spamprob clues = self._getclues(wordstream) File "/usr/local/lib/python2.2/site-packages/spambayes/classifier.py", line 441, in _getclues prob = self.probability(record) File "/usr/local/lib/python2.2/site-packages/spambayes/classifier.py", line 304, in probability assert spamcount <= nspam AssertionError ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:16 Message: Logged In: YES user_id=14198 Another Outlook report in [ 745292 ] Logs Show COM error ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:07 Message: Logged In: YES user_id=14198 Noting we have an Outlook user with this now too: [ 747458 ] messages not processed (which I closed as a dupe) ---------------------------------------------------------------------- Comment By: Leonid (leobru) Date: 2003-05-31 10:36 Message: Logged In: YES user_id=790676 This happens, e.g., if a forced re-training was performed on a non-empty database, thus screwing up the message counts - this is for sure, I was bitten by it myself; or, potentially, if hammiefilter.py -t and mboxtrain.py were running at the same time ??? To avoid: do not do it (I do not use hammiefilter.py -t to be on the safe side). To fix, once it happens: start from scratch. Good to have in the next version: a database validator and corrector. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706520&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:17:26 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:39:13 2003 Subject: [spambayes-dev] [ spambayes-Bugs-742791 ] Install fails with TypeError: bizarre "installer" exception Message-ID: Bugs item #742791, was opened at 2003-05-25 03:33 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742791&group_id=61702 Category: Outlook Group: v1.0 (example) Status: Open Resolution: None Priority: 5 Submitted By: Rajendra Wall (dirtynuke) Assigned to: Mark Hammond (mhammond) >Summary: Install fails with TypeError: bizarre "installer" exception Initial Comment: on WXP/Outlook 2K, install failed with TypeError: Traceback (most recent call last): File "", line 10, in ? File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\addin.py", line 59, in ? File "E:\src\pythonex\com\win32com\client\gencache.py ", line 341, in EnsureModule File "E:\src\pythonex\com\win32com\client\gencache.py ", line 213, in GetModuleForTypelib File "E:\src\pythonex\com\win32com\client\gencache.py ", line 473, in _GetModule File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 335, in doimport File "E:\src\Installer\iu.py", line 181, in getmod File "E:\src\Installer\archive.py", line 375, in getmod TypeError: unbound method getmod() must be called with DirOwner instance as first argument (got ExtInPkgImporter instance instead) Thanks! ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742791&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:26:47 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:39:14 2003 Subject: [spambayes-dev] [ spambayes-Bugs-741797 ] Wrong inbox is selected by default Message-ID: Bugs item #741797, was opened at 2003-05-23 01:22 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Kim Tracy (kimtracy) Assigned to: Mark Hammond (mhammond) >Summary: Wrong inbox is selected by default Initial Comment: Does not actively filter incoming e-mails and leaves Spam field empty. Appears to properly filter mail when doing a "filter now" and "delete as Spam" works fine. The log file is attached and it does contain several error messages that may be of help. This is the binary version running on Win2K using outlook2000. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:26 Message: Logged In: YES user_id=14198 Removed any default at all - the user must select their Inbox now. Checking in FilterDialog.py; new revision: 1.18; previous revision: 1.17 ---------------------------------------------------------------------- Comment By: Kim Tracy (kimtracy) Date: 2003-05-23 14:32 Message: Logged In: YES user_id=784661 I did both (make sure I had the right inbox & put unsure in 'Deleted Items" as well). In my case, I needed to filter incoming mail from an IMAP inbox, which I would think is a fairly common case. I would have preferred no presets or having a full path name for the folder so that I could easily tell which it was pointing to. Having the wrong inbox, was likely the problem. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 14:26 Message: Logged In: YES user_id=552329 Was it pointing at the wrong inbox? It's an interesting point that the inbox that is the default choice (Outlook's default, I think) might be the wrong one, and confuse people that have another folder called inbox. Not sure what could be done about that (apart from docs). If the solution was setting the uncertain folder, then there might be a problem. You should be able to have unsures be untouched, and still filter certains. I know I've never run it in this configuration, though (Mark might have). ---------------------------------------------------------------------- Comment By: Kim Tracy (kimtracy) Date: 2003-05-23 14:20 Message: Logged In: YES user_id=784661 It appears to be working at the moment--I sent the uncertain mail to a folder as well and made sure I was pointing to the correct inbox. Thanks for the help--it seems to categorize very well given a relatively small amount of training. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-23 14:19 Message: Logged In: YES user_id=14198 Cute - I left "Delete" out as an option on purpose - I didn't think of setting it to move to "Deleted Items" :) Can you please find a newer log file, and attach it? I see strange exceptions, but not ones that would relate to filtering. ---------------------------------------------------------------------- Comment By: Kim Tracy (kimtracy) Date: 2003-05-23 13:55 Message: Logged In: YES user_id=784661 Yes, I now have several hundred certain spam that have been trained into the DB and have the enable filtering box ticked. I also delete all certain spam by moving it to deleted_items. I don't do anything to the unsures (which is what I would prefer). I also have set up the folders to filter. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-05-23 12:17 Message: Logged In: YES user_id=552329 In the manager dialog, is the enable filtering box ticked? >From the log, when you first started you only had two spam - there needs to be at least 5 to enable filtering. Later on you trained more spam, so there would be enough, but it doesn't show whether you enabled it or not. (In addition, you need to set up which folder(s) to filter, which folder to deliver unsures to, and set the action to "move" or "copy" rather than "untouched"). Unfortunately the error messages in the log aren't of use; they are hangovers from the installer (that will be fixed in the next release). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=741797&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:29:21 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:39:15 2003 Subject: [spambayes-dev] [ spambayes-Bugs-739989 ] Corrupt Database wont train. Message-ID: Bugs item #739989, was opened at 2003-05-20 03:50 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=739989&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Invalid Priority: 5 Submitted By: Benjamin J. Judson (astrogen) Assigned to: Mark Hammond (mhammond) Summary: Corrupt Database wont train. Initial Comment: So after updating to the latest CVS release I have not been able to run a manual train on the database. I checked out the message for why and got a stacktrace indicating that the database was corrupt. So I tried to do a rebuild database, and the same thing happened. So I deleted the database, and all the .pck files, and started from scratch. This is the stack trace Im getting now. C:\spambayes-1.0a2\Outlook2000>c:\python22\python.exe manager.py Loaded bayes database from 'C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_bayes_database.pck' Loaded message database from 'C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_message_database.pck' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 3.5133ms Exception in thread Thread-1: Traceback (most recent call last): File "C:\Python22\Lib\threading.py", line 408, in __bootstrap self.run() File "C:\Python22\Lib\threading.py", line 396, in run apply(self.__target, self.__args, self.__kwargs) File "C:\spambayes-1.0a2\Outlook2000\dialogs\AsyncDialog.py", line 115, in thread_target self._DoProcess() File "C:\spambayes-1.0a2\Outlook2000\dialogs\TrainingDialog.py", line 120, in _DoProcess self.trainer(self.mgr, self.progress, self.rebuild) File "C:\spambayes-1.0a2\Outlook2000\train.py", line 118, in trainer progress.set_stages(stages) AttributeError: _Progress instance has no attribute 'set_stages' Saving bayes database with 0 spam and 0 good messages -> C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_bayes_database.pck -> C:\Documents and Settings\Administrator\Application Data\SpamBayes\default_message_database.pck Saved databases in 5.97925ms ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:29 Message: Logged In: YES user_id=14198 I think we established in mail that a further cvs update worked - please re-open if this is not the case. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-20 09:13 Message: Logged In: YES user_id=14198 It sounds to me like your CVS update didn't finish the update of Outlook2000\dialogs. Revision 1.4 of AsyncDialog grew that method. Please check you have the latest files ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=739989&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:32:36 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:39:16 2003 Subject: [spambayes-dev] [ spambayes-Bugs-693371 ] Invalid configuration causes us to silently start disabled Message-ID: Bugs item #693371, was opened at 2003-02-26 13:44 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=693371&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) >Summary: Invalid configuration causes us to silently start disabled Initial Comment: If you "Reconfigure Mail Support" and switch between "corporate" and "internet only" mode (in either direction), the EntryIDs of our folders all change. SpamBayes then starts in an enabled state, but is silently doing nothing. Even if we can't re-locate the folders, we should report failure to start somehow. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:32 Message: Logged In: YES user_id=14198 As Outlook itself can't even gracefully recover from this, I think I will water this bug down to the fact we don't notify the user that we are screwed and disabled. Just deleting the config file has the same basic effect. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-02-26 14:12 Message: Logged In: YES user_id=14198 I should note that all the builtin rules also break in this case, and need the folders re-specified. The builtin rules however display a dialog as the rule fails (ie, as a mail matches the condition). We could maybe take the same approach: * If filtering is enabled when we start, but we can not locate the "watch" folder, silently assume the Inbox. * Watch these messages. As soon as a Spam or Unsure message is received, display the message indicating why we couldn't move it. * The "silent inbox" assumption will then hopefully be noticed by the user as they re-configure the dialogs. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=693371&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:33:48 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:39:18 2003 Subject: [spambayes-dev] [ spambayes-Bugs-705379 ] Outlook dialogs sometimes become unresponsive Message-ID: Bugs item #705379, was opened at 2003-03-18 14:43 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=705379&group_id=61702 Category: Outlook Group: None >Status: Closed Resolution: Fixed Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Outlook dialogs sometimes become unresponsive Initial Comment: The training and filtering dialogs sometimes become unresponsive during filtering/training. They shouldn't, as hoops are jumped through to keep the UI and worker in separate threads. Further, it only seems to happen on "large" folders - eg, I can provoke it on my Inbox, but not on smaller folders. I'm guessing some bullshit COM/Outlook thread rule I am breaking. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-04 17:59 Message: Logged In: YES user_id=14198 Dropping the background thread priority seemed to fix this - haven't seen it for ages. Now the dialog is no less responsive than the machine itself. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=705379&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:35:18 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:39:19 2003 Subject: [spambayes-dev] [ spambayes-Bugs-730151 ] Outlook fails to classify Message-ID: Bugs item #730151, was opened at 2003-04-30 21:55 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=730151&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Fredrik Rodland (fmmr) Assigned to: Mark Hammond (mhammond) Summary: Outlook fails to classify Initial Comment: After updating to the latest CVS version today, I get the following tracebacks whenever a mails arrives: pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 275, in _Invoke_ return self._invoke_(dispid, lcid, wFlags, args) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 280, in _invoke_ return S_OK, -1, self._invokeex_(dispid, lcid, wFlags, args, None, None) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 541, in _invokeex_ return apply(func, args) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\addin.py", line 210, in OnItemAdd ProcessMessage(msgstore_message, self.manager) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\addin.py", line 170, in ProcessMessage disposition = filter.filter_message (msgstore_message, manager) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\filter.py", line 15, in filter_message prob = mgr.score(msg) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\ Outlook2000\manager.py", line 440, in score return self.bayes.spamprob(bayes_tokenize(email), evidence) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\s pambayes\classifier.py", line 217, in chi2_spamprob clues = self._getclues(wordstream) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\s pambayes\classifier.py", line 441, in _getclues prob = self.probability(record) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\s pambayes\classifier.py", line 301, in probability assert hamcount <= nham exceptions.AssertionError: ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:35 Message: Logged In: YES user_id=14198 Tracking in [ 706520 ] assert fails in classifier ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-01 07:55 Message: Logged In: YES user_id=14198 This has come up before - I am afraid you really must re-train ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=730151&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:35:32 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:39:21 2003 Subject: [spambayes-dev] [ spambayes-Bugs-706520 ] assert fails in classifier Message-ID: Bugs item #706520, was opened at 2003-03-20 07:46 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706520&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Adam Glass (adamglass) Assigned to: Nobody/Anonymous (nobody) Summary: assert fails in classifier Initial Comment: This morning, I noticed that my emails no longer had a X-Spambayes-Classification header, so I looked through my procmail logs, and sure enough, hammiefilter.py is giving a traceback when an assertion fails. This happens on all messages now; it is not specific to a single message, or intermittent. Therefore, I suspect my .hammiedb is corrupted... I can supply it to anyone who would like to investigate it for debugging purposes. I am using Spambayes 1.0a2, installed on a system with Python 2.2.1, with the new version of the email library (as per the install docs.) Please contact me if you require any further details. Example of how to generate the error follows, along with traceback: adam$ /usr/local/bin/hammiefilter.py -f -d $HOME/.hammiedb < example Traceback (most recent call last): File "/usr/local/bin/hammiefilter.py", line 179, in ? main() File "/usr/local/bin/hammiefilter.py", line 175, in main action(msg) File "/usr/local/bin/hammiefilter.py", line 113, in filter return h.filter(msg) File "/usr/local/lib/python2.2/site-packages/spambayes/hammie.py", line 108, in filter prob, clues = self._scoremsg(msg, True) File "/usr/local/lib/python2.2/site-packages/spambayes/hammie.py", line 38, in _scoremsg return self.bayes.spamprob(tokenize(msg), evidence) File "/usr/local/lib/python2.2/site-packages/spambayes/classifier.py", line 217, in chi2_spamprob clues = self._getclues(wordstream) File "/usr/local/lib/python2.2/site-packages/spambayes/classifier.py", line 441, in _getclues prob = self.probability(record) File "/usr/local/lib/python2.2/site-packages/spambayes/classifier.py", line 304, in probability assert spamcount <= nspam AssertionError ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:35 Message: Logged In: YES user_id=14198 Outlook again: [ 730151 ] Outlook fails to classify ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:16 Message: Logged In: YES user_id=14198 Another Outlook report in [ 745292 ] Logs Show COM error ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-04 18:07 Message: Logged In: YES user_id=14198 Noting we have an Outlook user with this now too: [ 747458 ] messages not processed (which I closed as a dupe) ---------------------------------------------------------------------- Comment By: Leonid (leobru) Date: 2003-05-31 10:36 Message: Logged In: YES user_id=790676 This happens, e.g., if a forced re-training was performed on a non-empty database, thus screwing up the message counts - this is for sure, I was bitten by it myself; or, potentially, if hammiefilter.py -t and mboxtrain.py were running at the same time ??? To avoid: do not do it (I do not use hammiefilter.py -t to be on the safe side). To fix, once it happens: start from scratch. Good to have in the next version: a database validator and corrector. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=706520&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:36:43 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:39:22 2003 Subject: [spambayes-dev] [ spambayes-Bugs-731538 ] exceptions.AssertionError: What kind of ID is 'None Message-ID: Bugs item #731538, was opened at 2003-05-03 05:05 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Brian Takita (btakita) Assigned to: Mark Hammond (mhammond) >Summary: exceptions.AssertionError: What kind of ID is 'None Initial Comment: The Delete as Spam button does not work either. It gives email spam a score if I request it. ---------------------------------------------------------------------- Comment By: Brian Takita (btakita) Date: 2003-05-04 08:21 Message: Logged In: YES user_id=620484 Windows 2000 Outlook XP Python 2.2 SpamBayes 1.0a2 ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-03 21:41 Message: Logged In: YES user_id=14198 Sorry - my mistake - you already did that - thanks! ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-03 21:40 Message: Logged In: YES user_id=14198 Can you please see the about.html file that comes with the package (click "About" on the main Window), see how to find the log for your sesssion, and attach it to this bug? Thanks ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-05-03 05:50 Message: Logged In: YES user_id=645698 Can you give us a bit more info here? What platform are you running on, what version of python, what version of Outlook, what version of spambayes (alpha1, alpha2, current cvs)? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 From noreply at sourceforge.net Wed Jun 4 02:38:26 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 04:39:23 2003 Subject: [spambayes-dev] [ spambayes-Bugs-689298 ] email.Errors.HeaderParseError: Not a header, not a Message-ID: Bugs item #689298, was opened at 2003-02-20 01:43 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=689298&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jacques Benoit (jbenoit) Assigned to: Mark Hammond (mhammond) >Summary: email.Errors.HeaderParseError: Not a header, not a Initial Comment: Using Spambayes 1.0a1 and Outlook Plug-in Using Python 2.2.2 Some email messages are not processed correctly. The Outlook buttons "Delete as Spam" and "Recover from Spam" have no effect. A Pythonwin trace follows. An email message is provided. ===== Deleting and spam training message 'Lose 22.5lbs in 3 weeks for FREE! ' - FAILED to create email.message from: 'Received: from maili41.mxdat.org (ms5.mxdat.com [209.236.58.41]) by cybqc07.cybectec.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)\r\n\tid 1B84HXQH; Wed, 19 Feb 2003 07:13:59 -0500\r\nTo: jbenoit@cybectec.com\r\nDate: Wed, 19 Feb 2003 07:19:15 -0500\r\nMessage-ID: <1045657155.5456@green5>\r\nX-Mailer: Pine.GSO.4.31\r\nFrom: "Get Serious" \r\nReturn-Path: \r\nReply-To: \r\nSubject: Lose 22.5lbs in 3 weeks for FREE!\r\n \r\n************************************************** ********************\r\nPLEASE DO NOT REPLY TO THIS EMAIL - To unsubscribe, please see the\r\nsubscription management section at the bottom of this newsletter.\r\n*************************************** *******************************\r\n\n\n\r\n"I Couldn\'t Face Another Holiday Being Called the \'FAT ONE\'... \r\nThank God I Found Apple Cider Vinegar Enhanced!" \r\n\r\nGet Your Free Bottle & SEE FOR YOURSELF! \r\n\r\nhttp://209.236.60.3/lc1/go.php?10 \r\n\r\nCLICK HERE FOR DETAILS! \r\nNo Crash Diets! No Painful Excercise! \r\n\r\nhttp://209.236.60.3/lc1/go.php?10 \r\n\r\n=========================== ========================== \r\n\r\n\r\nEnjoy your day,\r\n\r\n\r\nDaily Max Deal Chopper\r\n\r\n\r\n######################## ######################################### #####\r\nIf you no longer wish to receive your edition of the Daily Max Deal Chop \r\nNewsletter, please follow the link below and follow the simple \r\nunsubscribe instructions.\r\n\r\nhttp://209.236.60.3/unsub.ht m\r\n\r\nThe use and unauthorized reproduction of this message and delivery header \r\ninformation is strictly prohibited. This e-mail is meant for informational \r\npurposes only. JudoMonkey makes no guarantees in connection with the \r\nproduct(s) or service(s) presented.\r\n############################# ######################################### \r\n\r\nworabvg^plorpgrp(pbz\r\n' pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "D:\PROGRA~1\Python22\lib\site- packages\win32com\server\policy.py", line 275, in _Invoke_ return self._invoke_(dispid, lcid, wFlags, args) File "D:\PROGRA~1\Python22\lib\site- packages\win32com\server\policy.py", line 280, in _invoke_ return S_OK, -1, self._invokeex_(dispid, lcid, wFlags, args, None, None) File "D:\PROGRA~1\Python22\lib\site- packages\win32com\server\policy.py", line 510, in _invokeex_ return apply(func, args) File "D:\Program Files\Spambayes\spambayes- 1.0a1\Outlook2000\addin.py", line 315, in OnClick if train.train_message(msgstore_message, True, self.manager, rescore = True): File "D:\Program Files\Spambayes\spambayes- 1.0a1\Outlook2000\train.py", line 43, in train_message stream = msg.GetEmailPackageObject() File "D:\Program Files\Spambayes\spambayes- 1.0a1\Outlook2000\msgstore.py", line 565, in GetEmailPackageObject msg = email.message_from_string(text) File "D:\PROGRA~1\Python22 \Lib\email\__init__.py", line 52, in message_from_string return Parser(_class, strict=strict).parsestr(s) File "D:\PROGRA~1\Python22 \Lib\email\Parser.py", line 75, in parsestr return self.parse(StringIO(text), headersonly=headersonly) File "D:\PROGRA~1\Python22 \Lib\email\Parser.py", line 62, in parse self._parseheaders(root, fp) File "D:\PROGRA~1\Python22 \Lib\email\Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( email.Errors.HeaderParseError: Not a header, not a continuation: ``**************************************************** ******************'' ===== ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-02-20 02:47 Message: Logged In: YES user_id=645698 Great. Thanks for helping out here. ---------------------------------------------------------------------- Comment By: Jacques Benoit (jbenoit) Date: 2003-02-20 02:46 Message: Logged In: YES user_id=715810 Installed alpha 2. Same error in the Python... File "D:\PROGRA~1\Python22\Lib\email\Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( email.Errors.HeaderParseError: Not a header, not a continuation: ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-02-20 02:08 Message: Logged In: YES user_id=645698 Might this have been fixed in alpha 2? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=689298&group_id=61702 From noreply at sourceforge.net Wed Jun 4 19:49:53 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 4 22:10:10 2003 Subject: [spambayes-dev] [ spambayes-Bugs-749277 ] Should prevent "Deleted Items" being target folder Message-ID: Bugs item #749277, was opened at 2003-06-05 11:49 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749277&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Should prevent "Deleted Items" being target folder Initial Comment: As a user pointed out: I set up the filter to automatically move spam to ?deleted items?. Then I thought about it, and when I re-read the documentation, it seems SpamBayes will automatically assume anything in my deleted items is spam and add it to the database. If this is the case, then is it fair to say that after I?ve read non-spam (ham) email, and delete it as normal, Spambayes would flag the ham as spam? He is correct, and this would be evil. We either must display a severe warning, or simply prevent it. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749277&group_id=61702 From T.A.Meyer at massey.ac.nz Thu Jun 5 16:15:59 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Jun 4 23:17:27 2003 Subject: [spambayes-dev] New Outlook toolbars Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301D6CDB1@its-xchg4.massey.ac.nz> Nice toolbar :) All works fine for me, except that the toolbar is still there if I unregister spambayes (via addin.py). Is this just a quirk of running via CVS and won't happen with (un)installer 003? (either way, maybe worth documenting/fixing) =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Jun 5 16:30:52 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Jun 4 23:31:32 2003 Subject: [spambayes-dev] Next to no feedback on the trial faq Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301D6CDC3@its-xchg4.massey.ac.nz> > So far, the only person who responded was Bill Parducci. [...] > Still, I thought one or two other people would have responded. My apologies. My computer (well, WindowsXP) died last Tuesday and the support people around here only managed to get to fixing it today. Otherwise I would have :) > Before I make an executive decision and simply adopt > this new stuff I gather this has been done, but my comments would have been positive anyway. I've managed to look over the FAQ and it certainly looks much better than it was. Sometime I'll get round to reading it properly, but not now! Anyway, good work! =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Jun 5 16:34:46 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Jun 4 23:35:24 2003 Subject: [spambayes-dev] Alpha Release 3 Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301D6CDCC@its-xchg4.massey.ac.nz> Hey all :) I had rather hoped that we could get alpha3 out at the end of last week, but then I had a meltdown here and that all went out the window . It seems to me that the only things necessary/desirable before alpha3 are: 1. Mark's version information file. 2. Checking that the documentation is up-to-date. 3. Checking that the setup.py (etc) stuff is up-to-date. I'm not sure what the status of (1) is (comments?), but I'm happy to do (2) (today/tomorrow). Anyone want to volunteer to do (3)? (I'm not really up with all that stuff). Anyone have anything to add to the list? Once these are done, what's the process for building the release? (Or, to rephrase: who wants to take care of this? :) =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Jun 5 17:16:52 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Jun 5 00:17:33 2003 Subject: [spambayes-dev] RE: [Spambayes] multiple languages Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301D6CE1B@its-xchg4.massey.ac.nz> [Moving to spambayes-dev since it seems more in place there] > Nope! It's a statistical classifier with no semantic > knowledge, and you already explained the consequences of > that. It would be interested (IMO) to see what sort of effect doing a token-by-token translation (into a common language, whichever one) would have (i.e. give it a bit of semantic knowledge). Those tokens that weren't in the translation dictionary could be left alone (which would include the garbage that is often added). This would then (in theory) mean that if I get email offering me "pornograf?a", Spambayes would give it the same score as one offering me "pornography". Hopefully the email itself would say that it was in Spanish, but otherwise a range of dictionaries could be consulted. It would be easy enough to implement (I presume) - either using some (f/oss) translation tool or even babelfish or translate.google.com. OTOH, Francois's message indicated that he gets good results without all this bother, so maybe it isn't worth it... Thoughts? =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Jun 5 17:27:12 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Jun 5 00:27:52 2003 Subject: [spambayes-dev] RE: [Spambayes] website/list suggestion Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301D6CE29@its-xchg4.massey.ac.nz> > wondering if the 'Email Us' link on left hand nav bar would be more > effective if it were named 'Contact Us' and it linked to a > page that had the new mailing lists? +1. I definitely think that the mailing list information should be more prominent, and moving them onto a "contact" page would do that (if there was a link on the left). > in addition to showing each of the > lists and the > corresponding purpose descriptors, it could be laden with the usual > directions to 'read the docs, think hard and consult psychics before > posting a question' kinda stuff. +0. =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Jun 5 17:41:19 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Jun 5 00:41:55 2003 Subject: [spambayes-dev] [ spambayes-Bugs-721664 ] mboxtrain.py doesn't findMaildir tmp/ directory properly Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301D6CE37@its-xchg4.massey.ac.nz> Can someone that uses mboxtrain confirm that this should be applied? If it does, it could be good to apply & close this. If not, it would be good to note & close this :) =Tony Meyer From noreply at sourceforge.net Wed Jun 4 21:26:13 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 5 01:25:34 2003 Subject: [spambayes-dev] [ spambayes-Bugs-740311 ] 'skip_max_word_size' error on XP Message-ID: Bugs item #740311, was opened at 2003-05-20 20:09 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Rony Steelandt (bucodi) Assigned to: Mark Hammond (mhammond) Summary: 'skip_max_word_size' error on XP Initial Comment: Installed the binary installer 002 on XP Pro No spamb buttons added Here is the logfile SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\rony.BUCODI\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-05 15:26 Message: Logged In: YES user_id=552329 I'm sure this wasn't the bug that I originally posted too, but never mind :) FWIW: I tried to install 002 with Outlook XP SP1 (WinXP SP1) and it failed with the skip_max_word_size thing. I then upgraded to Outlook XP SP2 and tried again. I then got the no disk error (e:, which is my CD drive). I tried putting a disk, in the drive and then got the could not register error. CVS still works fine :) ---------------------------------------------------------------------- Comment By: Sebastian J?rges (sjuerges) Date: 2003-05-28 04:01 Message: Logged In: YES user_id=740980 Same here. WinXP Service Pack 1 / OutlookXP ---------------------------------------------------------------------- Comment By: Harald Meyer (hmeyer666) Date: 2003-05-24 08:36 Message: Logged In: YES user_id=785713 Same here with Win2000 and Outlook XP ---------------------------------------------------------------------- Comment By: Stefan Franke (sfranke) Date: 2003-05-21 10:28 Message: Logged In: YES user_id=431616 I have the same one with Outlook2000 (SR-1) on Win2k. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 From noreply at sourceforge.net Wed Jun 4 21:40:24 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 5 01:25:35 2003 Subject: [spambayes-dev] [ spambayes-Bugs-712480 ] Install error: bizarre import errors Message-ID: Bugs item #712480, was opened at 2003-03-31 17:47 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Paul Marrero (pmarrero) Assigned to: Mark Hammond (mhammond) Summary: Install error: bizarre import errors Initial Comment: I use office XP with the Outlook client. It appears that the registration was successfull but I cannnot find any menu buttons. XP clipboard does appear to have the Icons. The command line train works. Not sure where to go from here. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-05 15:40 Message: Logged In: YES user_id=552329 Ah - here's the tracer I was looking for! I posted this in another trace, but it belonged here. In the aims of testing the SP2 theory, I made no other changes, upgraded to SP2 (same version as below) and tried to install the binary. I have progressed ;) to the 'no disk' error. I can't really be bothered (at the moment) trying to get further since CVS still works :) In case this is of use... ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-25 23:41 Message: Logged In: YES user_id=14198 Failes with Outlook XP ver.10.4712.4219 sp-2 ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-23 11:40 Message: Logged In: YES user_id=14198 *sob* - so it certainly fails with Outlook SP2 here? ecvogel - is your traceback identical to this one? ---------------------------------------------------------------------- Comment By: Eric C. Vogel (ecvogel) Date: 2003-05-23 07:20 Message: Logged In: YES user_id=784818 Same here. Outlook XP all SP and patches, WIn XP Pro SP1 all patches. The file is registered and in the COM and has the check. Exchange Server 2000 all SP and patches on SBS 2kSP-1 all patches. 002 install ---------------------------------------------------------------------- Comment By: Steve McCullough (sm5669) Date: 2003-05-20 10:24 Message: Logged In: YES user_id=782446 Same here. Outlook 2002 on Win2000 with all the latest SPs and patches applied. Log file... warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) warning: use func(*args, **kwargs) instead of apply(func, args, kwargs) SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\universal.py", line 170, in dispatch retVal = ob._InvokeEx_(meth.dispid, 0, meth.invkind, args, None, None) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 322, in _InvokeEx_ return self._invokeex_(dispid, lcid, wFlags, args, kwargs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "C:\PROGRA~1\PYTHON32\lib\site- packages\win32com\server\policy.py", line 541, in _invokeex_ return apply(func, args) File "E:\src\spambayes\Outlook2000\addin.py", line 654, in OnConnection File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\manager.py", line 15, in ? File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\Outlook2000\msgstore.py", line 86, in ? File "E:\src\Installer\iu.py", line 296, in importHook exceptions.ImportError: No module named win32com.mapi ---------------------------------------------------------------------- Comment By: Tormod Hystad (thystad) Date: 2003-05-16 19:27 Message: Logged In: YES user_id=780089 Exactly same as Nick here. Win2003 RC2, Outlook XP (Norwegian, version 10.2627.4219 SP2, configured with one Exchange (Standard) email acount) and the 002 install package. Thanks, Tormod ---------------------------------------------------------------------- Comment By: Nick Trautmann (nicktrautmann) Date: 2003-05-12 20:15 Message: Logged In: YES user_id=776857 Similar here: pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\Nick\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize Thanks Nick ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-01 14:48 Message: Logged In: YES user_id=552329 Actually, I get this too. I've just switched to Outlook XP, so I'm not sure if this is the reason, or just that I'm doing a fresh install. The log includes the following traces: SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' SpamAddin - Connecting to Outlook Failed to load bayes database Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\manager.py", line 310, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes AttributeError: 'module' object has no attribute 'DBDictClassifier' Loaded message database from 'C:\Documents and Settings\tameyer\Application Data\SpamBayes\default_message_database.db' Either bayes database or message database is missing - creating new pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 165, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 329, in LoadBayes File "E:\src\spambayes\Outlook2000\manager.py", line 378, in InitNewBayes File "E:\src\spambayes\Outlook2000\manager.py", line 94, in new_bayes File "E:\src\spambayes\Outlook2000\manager.py", line 118, in open_bayes exceptions.AttributeError: 'module' object has no attribute 'DBDictClassifier' ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-03-31 18:05 Message: Logged In: YES user_id=552329 Which version of the Outlook plugin are you using? (a) the latest CVS, (b) the 001 stand-alone installer, or (c) the 002 stand-alone installer? I know that the 001 installer has been known to have this problem (although it appeared to be fixed in 002). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=712480&group_id=61702 From noreply at sourceforge.net Wed Jun 4 21:57:59 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 5 01:25:37 2003 Subject: [spambayes-dev] [ spambayes-Bugs-749309 ] Spambayes button does not load Message-ID: Bugs item #749309, was opened at 2003-06-04 23:57 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749309&group_id=61702 Category: Outlook Group: v1.0 (example) Status: Open Resolution: None Priority: 5 Submitted By: Harvey Schoenman (hschoenman) Assigned to: Mark Hammond (mhammond) Summary: Spambayes button does not load Initial Comment: Spambayes (binary version) installed with no errors but after starting Outlook 2000 with SR-1 there was no Spambayes button on the toolbar. I checked for a log file (one is attached). I also looked in Tools, Options, Other, Advanced, Com Add-ins. Spambayes.outlookaddin was present but unchecked. I was able to check it but the checkmark did not "stick." Whenever I went back in, it remained unchecked. The log file contains the following error: pythoncom error: Failed to call the universal dispatcher -Harvey Schoenman ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749309&group_id=61702 From mhammond at skippinet.com.au Thu Jun 5 20:05:04 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Thu Jun 5 05:16:47 2003 Subject: [spambayes-dev] New Outlook toolbars In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301D6CDB1@its-xchg4.massey.ac.nz> Message-ID: <01a601c32b41$8e092b20$530f8490@eden> > All works fine for me, except that the toolbar is still there if I > unregister spambayes (via addin.py). > > Is this just a quirk of running via CVS and won't happen with > (un)installer 003? The only way I can uninstall is to start outlook, and programatically remove the toolbar. Depending on your configuration, this may require a dialog. If you leave Outlook running for the uninstall, then the DLL will not be removed (hopefully it will be flagged for remove next reboot, but then the uninstall is likely to offer to reboot, etc. So then I thought maybe the uninstall could ask if it really *wants* to start Outlook to remove the toolbar, or telling the user how to remove it manually. So it all got too hard :) I'm more than willing to take opinions on the "right thing" though! Just documentation on the current behaviour is my favourite option . Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 2032 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes-dev/attachments/20030605/42d48dda/winmail.bin From T.A.Meyer at massey.ac.nz Fri Jun 6 08:46:35 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Jun 5 18:39:48 2003 Subject: [spambayes-dev] New Outlook toolbars Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301D6CEA8@its-xchg4.massey.ac.nz> > The only way I can uninstall is to start outlook, and > programmatically remove the toolbar. Depending on your > configuration, this may require a dialog. If you leave > Outlook running for the uninstall, then the DLL will not be > removed (hopefully it will be flagged for remove next reboot, > but then the uninstall is likely to offer to reboot, etc. So > then I thought maybe the uninstall could ask if it really > *wants* to start Outlook to remove the toolbar, or telling > the user how to remove it manually. So it all got too hard :) > > I'm more than willing to take opinions on the "right thing" > though! Just documentation on the current behaviour is my > favourite option . IMO, the uninstaller should definitely programmatically remove the toolbar (giving the user instructions about what to do with the potential dialog). Can the uninstaller not force the user to quit Outlook before removing the dll? And then programmatically start Outlook :), remove the toolbar and then programmatically quit Outlook? Pretty roundabout, but I can see binary users getting pretty confused if they have to remove it manually. For CVS, just a note in the docs somewhere about doing the Customize->Spambayes->Delete sequence should be enough. Probably not a priority, though :) =Tony Meyer From noreply at sourceforge.net Thu Jun 5 15:30:17 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 5 18:48:09 2003 Subject: [spambayes-dev] [ spambayes-Bugs-749814 ] Add-In Load Failure Message-ID: Bugs item #749814, was opened at 2003-06-05 16:30 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749814&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Robert Garber (rgarber) Assigned to: Mark Hammond (mhammond) Summary: Add-In Load Failure Initial Comment: The Add-In appears in list of Outlook add-ins but with no check mark. When I check the box and click OK it does not load or save the check mark. Environment Win 98 Outlook 2000 Executable install Log file is attached with an error. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749814&group_id=61702 From mhammond at skippinet.com.au Fri Jun 6 15:00:53 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri Jun 6 00:01:29 2003 Subject: [spambayes-dev] Version information repository Message-ID: <002701c32be0$3a5714c0$530f8490@eden> I have just checked in spambayes/Version.py, as a simple repository for version information. It is a trivial little file, mainly consisting of a single dictionary: versions = { # Non app specific - changed when "spambayes\*" changes significantly "Version": 0.1, "Description": "SpamBayes Beta1", "Date": "June 2003", "Full Description": "%(Description)s, version %(Version)s (%(Date)s)", # Sub-dict for application specific version strings. "Apps": { "Outlook" : { "Version": 0.3, "BinaryVersion": 003, "Description": "SpamBayes Outlook Addin Beta1", "Date": "June 2003", "Full Description": "%(Description)s, version %(Version)s (%(Date)s)", "Full Description Binary": "%(Description)s, Binary version %(BinaryVersion)s (%(Date)s)", }, }, } There are really 2 reasons for this mail: * To let you know it exists, and should be updated before releases. See the module for a couple of handy functions you may like to use to print the version of your app. * To get feedback on the specific version numbers. We are moving towards Beta1, and a version of 0.1 seems reasonable (whereas presumably 0.02 would have been used for alpha 2. However, I am open to anything, really. I have the core engine dated as "June 2003" which isn't really correct either (as I recall no significant changes there for at least a month) but I think it is good enough to get started. Mark. From T.A.Meyer at massey.ac.nz Fri Jun 6 17:16:45 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Jun 6 00:22:51 2003 Subject: [spambayes-dev] Version information repository Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301D6D17D@its-xchg4.massey.ac.nz> > I have just checked in spambayes/Version.py, as a simple > repository for version information. Don't forget to close the feature request for this :) I notice that at the same time you're moving away from the "Anti-Spam" name to Spambayes. Are you also planning to change the name of the toolbar button & menuitem? > * To get feedback on the specific version numbers. We are > moving towards Beta1, and a version of 0.1 seems reasonable > (whereas presumably 0.02 would have been used for alpha 2. > However, I am open to anything, really. I've said this to Mark via email, but to get the list started: I think that this makes sense as a current state, assuming that we are about to put up a new package for download: Main: 0.10, Spambayes Beta1, May 2003 Hammie: 0.10, ? Beta1, Jan 2003 POP3Proxy: 0.03, Spambayes POP3 Proxy Alpha3, May 2003 SMTPProxy: 0.01, Spambayes SMTP Proxy Alpha1, May 2003 IMAPFilter: 0.01, Spambayes IMAP Filter Alpha1, May 2003 NotesFilter: 0.01, Spambayes Lotus Notes Filter Alpha1, Mar 2003 I don't really know what to call the package. Spambayes-1.0b1? =Tony Meyer From skip at pobox.com Fri Jun 6 08:09:34 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Jun 6 08:09:43 2003 Subject: [spambayes-dev] Version information repository In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301D6D17D@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301D6D17D@its-xchg4.massey.ac.nz> Message-ID: <16096.33918.575111.340130@montanaro.dyndns.org> Tony> I think that this makes sense as a current state, assuming that we Tony> are about to put up a new package for download: Tony> Main: 0.10, Spambayes Beta1, May 2003 Tony> Hammie: 0.10, ? Beta1, Jan 2003 Tony> POP3Proxy: 0.03, Spambayes POP3 Proxy Alpha3, May 2003 Tony> SMTPProxy: 0.01, Spambayes SMTP Proxy Alpha1, May 2003 Tony> IMAPFilter: 0.01, Spambayes IMAP Filter Alpha1, May 2003 Tony> NotesFilter: 0.01, Spambayes Lotus Notes Filter Alpha1, Mar 2003 I think the POP3Proxy is sufficiently well-tested to be considered beta unless you have some significant known issues which are unresolved. Skip From noreply at sourceforge.net Fri Jun 6 07:10:12 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 6 09:44:13 2003 Subject: [spambayes-dev] [ spambayes-Bugs-750069 ] Spambayes Enabled, No Decimal Values Reported Message-ID: Bugs item #750069, was opened at 2003-06-06 09:10 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=750069&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: George Bertoli (gbertoli) Assigned to: Mark Hammond (mhammond) Summary: Spambayes Enabled, No Decimal Values Reported Initial Comment: Running Windows 2000 professional, Outlook 2000 SP3 Log files attached. Think this might be result of Ad-Aware or Spybot removing or disabling Python? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=750069&group_id=61702 From papaDoc at videotron.ca Fri Jun 6 13:35:49 2003 From: papaDoc at videotron.ca (papaDoc) Date: Fri Jun 6 12:37:24 2003 Subject: [spambayes-dev] Version information repository In-Reply-To: <16096.33918.575111.340130@montanaro.dyndns.org> References: <1ED4ECF91CDED24C8D012BCF2B034F1301D6D17D@its-xchg4.massey.ac.nz> <16096.33918.575111.340130@montanaro.dyndns.org> Message-ID: <3EE0C2E5.3080503@videotron.ca> Hi, >I think the POP3Proxy is sufficiently well-tested to be considered beta >unless you have some significant known issues which are unresolved. > There is still the bug [ 737967 ] Malformed messages break pop3proxy (et al) . This is a big problem since you can retrieve any message after that. You can't use pop3proxy if this message in not removed by another mean. Remi From skip at pobox.com Fri Jun 6 15:45:58 2003 From: skip at pobox.com (Skip Montanaro) Date: Fri Jun 6 15:46:12 2003 Subject: [spambayes-dev] Trigraphs as indicators of invalid subject words In-Reply-To: <3ee0e0ba$1@nntp0.pdx.net> References: <3edfd266$1@nntp0.pdx.net> <3EDFEAD0.B50662FA@alcyone.com> <3EDFFC1F.247ED469@alcyone.com> <3ee0e0ba$1@nntp0.pdx.net> Message-ID: <16096.61302.749033.920447@montanaro.dyndns.org> [ ... on using trigraphs as clues to identify bogus words in message subjects ... ] >> Now you could turn things around and say the subject contained an >> invalid word. That might be a useful clue for Spambayes. Scott> That was my idea. Find a way to use the non-wordness to Scott> penalize, rather than favor a message. I tried it and found it had essentially no effect. That doesn't mean it isn't a good idea. It's just that Spambayes is already so good that there isn't much room for improvement. I just ran a 10x10 cross validation test using 500 spams and 500 hams in each test set. It trained on 9 sets each (4500 messages) of hams and spams then tested against the remaining one set of each, then repeated choosing a different set to be the test. Over all runs it scored 16 hams incorrectly (false positives - 0.32%), scored 40 spams incorrectly (false negatives - 0.80%) and was unsure about 573 messages (5.73%). When I added in Scott's idea implemented as a synthetic "subject:invalid word" token, the false positives and false negatives didn't change. The unsures crept up to 574. This was run on a new training database (12700+ hams and 8600+ spams) which I haven't exhaustively combed for errors, so it's possible there are still some mistakes of mine in there (placing a ham message in the spam training set for example), but it is essentially the same data which I use to train Spambayes and classify messages on a daily basis, so I think it's fairly clean. Skip From noreply at sourceforge.net Fri Jun 6 14:22:31 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 6 16:26:41 2003 Subject: [spambayes-dev] [ spambayes-Bugs-689298 ] email.Errors.HeaderParseError: Not a header, not a Message-ID: Bugs item #689298, was opened at 2003-02-19 09:43 Message generated for change (Comment added) made by bwarsaw You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=689298&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jacques Benoit (jbenoit) Assigned to: Mark Hammond (mhammond) Summary: email.Errors.HeaderParseError: Not a header, not a Initial Comment: Using Spambayes 1.0a1 and Outlook Plug-in Using Python 2.2.2 Some email messages are not processed correctly. The Outlook buttons "Delete as Spam" and "Recover from Spam" have no effect. A Pythonwin trace follows. An email message is provided. ===== Deleting and spam training message 'Lose 22.5lbs in 3 weeks for FREE! ' - FAILED to create email.message from: 'Received: from maili41.mxdat.org (ms5.mxdat.com [209.236.58.41]) by cybqc07.cybectec.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)\r\n\tid 1B84HXQH; Wed, 19 Feb 2003 07:13:59 -0500\r\nTo: jbenoit@cybectec.com\r\nDate: Wed, 19 Feb 2003 07:19:15 -0500\r\nMessage-ID: <1045657155.5456@green5>\r\nX-Mailer: Pine.GSO.4.31\r\nFrom: "Get Serious" \r\nReturn-Path: \r\nReply-To: \r\nSubject: Lose 22.5lbs in 3 weeks for FREE!\r\n \r\n************************************************** ********************\r\nPLEASE DO NOT REPLY TO THIS EMAIL - To unsubscribe, please see the\r\nsubscription management section at the bottom of this newsletter.\r\n*************************************** *******************************\r\n\n\n\r\n"I Couldn\'t Face Another Holiday Being Called the \'FAT ONE\'... \r\nThank God I Found Apple Cider Vinegar Enhanced!" \r\n\r\nGet Your Free Bottle & SEE FOR YOURSELF! \r\n\r\nhttp://209.236.60.3/lc1/go.php?10 \r\n\r\nCLICK HERE FOR DETAILS! \r\nNo Crash Diets! No Painful Excercise! \r\n\r\nhttp://209.236.60.3/lc1/go.php?10 \r\n\r\n=========================== ========================== \r\n\r\n\r\nEnjoy your day,\r\n\r\n\r\nDaily Max Deal Chopper\r\n\r\n\r\n######################## ######################################### #####\r\nIf you no longer wish to receive your edition of the Daily Max Deal Chop \r\nNewsletter, please follow the link below and follow the simple \r\nunsubscribe instructions.\r\n\r\nhttp://209.236.60.3/unsub.ht m\r\n\r\nThe use and unauthorized reproduction of this message and delivery header \r\ninformation is strictly prohibited. This e-mail is meant for informational \r\npurposes only. JudoMonkey makes no guarantees in connection with the \r\nproduct(s) or service(s) presented.\r\n############################# ######################################### \r\n\r\nworabvg^plorpgrp(pbz\r\n' pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "D:\PROGRA~1\Python22\lib\site- packages\win32com\server\policy.py", line 275, in _Invoke_ return self._invoke_(dispid, lcid, wFlags, args) File "D:\PROGRA~1\Python22\lib\site- packages\win32com\server\policy.py", line 280, in _invoke_ return S_OK, -1, self._invokeex_(dispid, lcid, wFlags, args, None, None) File "D:\PROGRA~1\Python22\lib\site- packages\win32com\server\policy.py", line 510, in _invokeex_ return apply(func, args) File "D:\Program Files\Spambayes\spambayes- 1.0a1\Outlook2000\addin.py", line 315, in OnClick if train.train_message(msgstore_message, True, self.manager, rescore = True): File "D:\Program Files\Spambayes\spambayes- 1.0a1\Outlook2000\train.py", line 43, in train_message stream = msg.GetEmailPackageObject() File "D:\Program Files\Spambayes\spambayes- 1.0a1\Outlook2000\msgstore.py", line 565, in GetEmailPackageObject msg = email.message_from_string(text) File "D:\PROGRA~1\Python22 \Lib\email\__init__.py", line 52, in message_from_string return Parser(_class, strict=strict).parsestr(s) File "D:\PROGRA~1\Python22 \Lib\email\Parser.py", line 75, in parsestr return self.parse(StringIO(text), headersonly=headersonly) File "D:\PROGRA~1\Python22 \Lib\email\Parser.py", line 62, in parse self._parseheaders(root, fp) File "D:\PROGRA~1\Python22 \Lib\email\Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( email.Errors.HeaderParseError: Not a header, not a continuation: ``**************************************************** ******************'' ===== ---------------------------------------------------------------------- >Comment By: Barry A. Warsaw (bwarsaw) Date: 2003-06-06 16:22 Message: Logged In: YES user_id=12800 wut shud i feed my 6" burmeez phyton, monkeyboy? (ignore this -- i'm testing the mailing list :) ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-02-19 10:47 Message: Logged In: YES user_id=645698 Great. Thanks for helping out here. ---------------------------------------------------------------------- Comment By: Jacques Benoit (jbenoit) Date: 2003-02-19 10:46 Message: Logged In: YES user_id=715810 Installed alpha 2. Same error in the Python... File "D:\PROGRA~1\Python22\Lib\email\Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( email.Errors.HeaderParseError: Not a header, not a continuation: ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-02-19 10:08 Message: Logged In: YES user_id=645698 Might this have been fixed in alpha 2? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=689298&group_id=61702 From noreply at sourceforge.net Fri Jun 6 14:27:44 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 6 16:28:04 2003 Subject: [spambayes-dev] [ spambayes-Bugs-689298 ] email.Errors.HeaderParseError: Not a header, not a Message-ID: Bugs item #689298, was opened at 2003-02-19 09:43 Message generated for change (Comment added) made by bwarsaw You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=689298&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jacques Benoit (jbenoit) Assigned to: Mark Hammond (mhammond) Summary: email.Errors.HeaderParseError: Not a header, not a Initial Comment: Using Spambayes 1.0a1 and Outlook Plug-in Using Python 2.2.2 Some email messages are not processed correctly. The Outlook buttons "Delete as Spam" and "Recover from Spam" have no effect. A Pythonwin trace follows. An email message is provided. ===== Deleting and spam training message 'Lose 22.5lbs in 3 weeks for FREE! ' - FAILED to create email.message from: 'Received: from maili41.mxdat.org (ms5.mxdat.com [209.236.58.41]) by cybqc07.cybectec.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)\r\n\tid 1B84HXQH; Wed, 19 Feb 2003 07:13:59 -0500\r\nTo: jbenoit@cybectec.com\r\nDate: Wed, 19 Feb 2003 07:19:15 -0500\r\nMessage-ID: <1045657155.5456@green5>\r\nX-Mailer: Pine.GSO.4.31\r\nFrom: "Get Serious" \r\nReturn-Path: \r\nReply-To: \r\nSubject: Lose 22.5lbs in 3 weeks for FREE!\r\n \r\n************************************************** ********************\r\nPLEASE DO NOT REPLY TO THIS EMAIL - To unsubscribe, please see the\r\nsubscription management section at the bottom of this newsletter.\r\n*************************************** *******************************\r\n\n\n\r\n"I Couldn\'t Face Another Holiday Being Called the \'FAT ONE\'... \r\nThank God I Found Apple Cider Vinegar Enhanced!" \r\n\r\nGet Your Free Bottle & SEE FOR YOURSELF! \r\n\r\nhttp://209.236.60.3/lc1/go.php?10 \r\n\r\nCLICK HERE FOR DETAILS! \r\nNo Crash Diets! No Painful Excercise! \r\n\r\nhttp://209.236.60.3/lc1/go.php?10 \r\n\r\n=========================== ========================== \r\n\r\n\r\nEnjoy your day,\r\n\r\n\r\nDaily Max Deal Chopper\r\n\r\n\r\n######################## ######################################### #####\r\nIf you no longer wish to receive your edition of the Daily Max Deal Chop \r\nNewsletter, please follow the link below and follow the simple \r\nunsubscribe instructions.\r\n\r\nhttp://209.236.60.3/unsub.ht m\r\n\r\nThe use and unauthorized reproduction of this message and delivery header \r\ninformation is strictly prohibited. This e-mail is meant for informational \r\npurposes only. JudoMonkey makes no guarantees in connection with the \r\nproduct(s) or service(s) presented.\r\n############################# ######################################### \r\n\r\nworabvg^plorpgrp(pbz\r\n' pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "D:\PROGRA~1\Python22\lib\site- packages\win32com\server\policy.py", line 275, in _Invoke_ return self._invoke_(dispid, lcid, wFlags, args) File "D:\PROGRA~1\Python22\lib\site- packages\win32com\server\policy.py", line 280, in _invoke_ return S_OK, -1, self._invokeex_(dispid, lcid, wFlags, args, None, None) File "D:\PROGRA~1\Python22\lib\site- packages\win32com\server\policy.py", line 510, in _invokeex_ return apply(func, args) File "D:\Program Files\Spambayes\spambayes- 1.0a1\Outlook2000\addin.py", line 315, in OnClick if train.train_message(msgstore_message, True, self.manager, rescore = True): File "D:\Program Files\Spambayes\spambayes- 1.0a1\Outlook2000\train.py", line 43, in train_message stream = msg.GetEmailPackageObject() File "D:\Program Files\Spambayes\spambayes- 1.0a1\Outlook2000\msgstore.py", line 565, in GetEmailPackageObject msg = email.message_from_string(text) File "D:\PROGRA~1\Python22 \Lib\email\__init__.py", line 52, in message_from_string return Parser(_class, strict=strict).parsestr(s) File "D:\PROGRA~1\Python22 \Lib\email\Parser.py", line 75, in parsestr return self.parse(StringIO(text), headersonly=headersonly) File "D:\PROGRA~1\Python22 \Lib\email\Parser.py", line 62, in parse self._parseheaders(root, fp) File "D:\PROGRA~1\Python22 \Lib\email\Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( email.Errors.HeaderParseError: Not a header, not a continuation: ``**************************************************** ******************'' ===== ---------------------------------------------------------------------- >Comment By: Barry A. Warsaw (bwarsaw) Date: 2003-06-06 16:27 Message: Logged In: YES user_id=12800 who you calling a monkeyboy, squakhead? ---------------------------------------------------------------------- Comment By: Barry A. Warsaw (bwarsaw) Date: 2003-06-06 16:22 Message: Logged In: YES user_id=12800 wut shud i feed my 6" burmeez phyton, monkeyboy? (ignore this -- i'm testing the mailing list :) ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-02-19 10:47 Message: Logged In: YES user_id=645698 Great. Thanks for helping out here. ---------------------------------------------------------------------- Comment By: Jacques Benoit (jbenoit) Date: 2003-02-19 10:46 Message: Logged In: YES user_id=715810 Installed alpha 2. Same error in the Python... File "D:\PROGRA~1\Python22\Lib\email\Parser.py", line 128, in _parseheaders raise Errors.HeaderParseError( email.Errors.HeaderParseError: Not a header, not a continuation: ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-02-19 10:08 Message: Logged In: YES user_id=645698 Might this have been fixed in alpha 2? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=689298&group_id=61702 From matt at mondoinfo.com Sat Jun 7 16:10:32 2003 From: matt at mondoinfo.com (Matthew Dixon Cowles) Date: Sat Jun 7 16:19:56 2003 Subject: [spambayes-dev] More obvious logarithmic expiration data Message-ID: <1054952982.19.1541@sake.mondoinfo.com> I mentioned a while ago that I'd do a little more work based on the statistics that I had collected that showed that tokens that figured in scoring were likely to be used for scoring again soon. I instrumented classifier.py and hammie.py to compute several scores and log them when computing a score. Each time SpamBayes computes a score, it also computes scores using only tokens that had been used in scoring in the previous 24 hours, the previous week, the previous two weeks, and the previous 30 days. Here are some results: 2587 sets of scores processed Number of scores that differ from actual score by 0.00 6885 by 0.01 or less 633 by 0.10 or less 179 by 0.20 or less 32 by more than 0.20 32 (The repeated 32 isn't a bug, I checked.) Because of a flaw in the way I set up the log, the "actual" score isn't quite the actual score. Rather, it's the score that used only tokens that have been used in the last 30 days. But I'm convinced that it's very near to the actual score. Also encouragingly, the score changes that happen don't seem to move the scores out of the standard 0.0-0.2 and 0.9-1.0 categories much: Moved out of spam Moved out of ham Restricted to one day 13 8 Restricted to one week 3 0 Restricted to two weeks 2 0 If I were cleverer, I'd have guessed all this from the number of posts in which people have said that they've trained SpamBayes on only a couple of hundred emails and that it's already working well for them. But then I wouldn't have the fabulous collection of ambiguous and invalid data that came before looking at how often tokens are used for scoring . Judging from this data, I could relatively painlessly use a database that contains only those tokens that have figured in scoring in the last ten days or so. That's about 11% of the 273487 tokens in my database. You'd need to bootstrap the process, presumably by counting a token as used when it's first trained on. Waiting for a token to be used before making it eligible for use has a certain theoretical elegance but results might suffer . And, of course, it's not really time that counts but rather the number of emails seen. I seem to get something like 150 emails per day. So that 10-day period is really 1500 emails scored. Adding an extra field to the "saved state" entry and recording the number of emails scored there and in the WordInfo record seems practical on the face of it. Ironically, I started collecting these statistics when I was using a laptop with a tiny hard disk. Now, with 60G at my disposal, the 23M that my database takes up is pretty trifling. Regards, Matt From noreply at sourceforge.net Sat Jun 7 20:55:30 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat Jun 7 22:55:41 2003 Subject: [spambayes-dev] [ spambayes-Bugs-750069 ] Spambayes Enabled, No Decimal Values Reported Message-ID: Bugs item #750069, was opened at 2003-06-06 09:10 Message generated for change (Comment added) made by gbertoli You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=750069&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: George Bertoli (gbertoli) Assigned to: Mark Hammond (mhammond) Summary: Spambayes Enabled, No Decimal Values Reported Initial Comment: Running Windows 2000 professional, Outlook 2000 SP3 Log files attached. Think this might be result of Ad-Aware or Spybot removing or disabling Python? ---------------------------------------------------------------------- >Comment By: George Bertoli (gbertoli) Date: 2003-06-07 22:55 Message: Logged In: YES user_id=795444 Further notes, Spambayes was installed and working fine for two weeks before it stopped working (stopped reporting decimal values and resultant no filtering). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=750069&group_id=61702 From gward at python.net Sun Jun 8 17:47:52 2003 From: gward at python.net (Greg Ward) Date: Sun Jun 8 16:48:00 2003 Subject: [spambayes-dev] Evaluating a training corpus Message-ID: <20030608204751.GA19819@cthulhu.gerg.ca> I'm mulling ways to evaluate the quality of a training corpus, and was wondering what the rest of you have tried. My current technique is pretty bogus: train on the complete corpus, and then score every message in the corpus using the resulting database. Obviously this is a self-fulfilling prophecy, but at least it highlights spam that are *really* different from other spam (and ditto for ham). I know there's code lurking in there somewhere (timcv.py?) for training on 90% of the corpus, and then evaluating the other 10% under the resulting database. That got me to thinking: why not build a complete training database, and then do this: foreach message: remove message from database (ie. untrain) score message report score put message back in database That removes the "self-fulfilling prophecy bit", and the arbitrary nature of the 10%/90% selection. But it should preserve the property of highlighting unusual spam or ham. Seems to me like this should do a pretty good job of finding misclassified messages, at least. Has anyone else tried something like this? Is there code out there already? Greg -- Greg Ward http://www.gerg.ca/ OUR PLAN HAS FAILED STOP JOHN DENVER IS NOT TRULY DEAD STOP HE LIVES ON IN HIS MUSIC STOP PLEASE ADVISE FULL STOP From nas at python.ca Sun Jun 8 15:13:23 2003 From: nas at python.ca (Neil Schemenauer) Date: Sun Jun 8 17:10:03 2003 Subject: [spambayes-dev] Evaluating a training corpus In-Reply-To: <20030608204751.GA19819@cthulhu.gerg.ca> References: <20030608204751.GA19819@cthulhu.gerg.ca> Message-ID: <20030608211323.GA19297@glacier.arctrix.com> Greg Ward wrote: > I'm mulling ways to evaluate the quality of a training corpus, and was > I know there's code lurking in there somewhere (timcv.py?) for training > on 90% of the corpus, and then evaluating the other 10% under the > resulting database. mboxtest.py is probably the easiest to get going. I think timcv.py gives better results but it's a little more trouble to setup your test data. See README.txt for a short explaination of the tools. If you want to use timcv.py, you can use splitndirs.py to create the test data. HTH, Neil From T.A.Meyer at massey.ac.nz Mon Jun 9 10:21:03 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Jun 8 17:21:43 2003 Subject: [spambayes-dev] Version information repository Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301EE9A54@its-xchg4.massey.ac.nz> > Tony> Main: 0.10, Spambayes Beta1, May 2003 > Tony> Hammie: 0.10, ? Beta1, Jan 2003 > Tony> POP3Proxy: 0.03, Spambayes POP3 Proxy Alpha3, May 2003 > Tony> SMTPProxy: 0.01, Spambayes SMTP Proxy Alpha1, May 2003 > Tony> IMAPFilter: 0.01, Spambayes IMAP Filter Alpha1, May 2003 > Tony> NotesFilter: 0.01, Spambayes Lotus Notes Filter Alpha1, Mar 2003 [Skip] > I think the POP3Proxy is sufficiently well-tested to be > considered beta unless you have some significant known issues > which are unresolved. I was waiting for a comment like this ;) In the Python world, what's the difference between alpha & beta? Is it purely a stability difference, or does 'feature completion' come into it as well? The only significant issue I can think of with pop3proxy is the one that Remi mentioned (i.e. malformed messages), which is really a spambayes issue more than a pop3proxy one. I think TimS is working on this at the moment (?). =Tony Meyer From T.A.Meyer at massey.ac.nz Mon Jun 9 10:26:40 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Jun 8 17:27:17 2003 Subject: [spambayes-dev] Evaluating a training corpus Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301EE9A62@its-xchg4.massey.ac.nz> > mboxtest.py is probably the easiest to get going. I think > timcv.py gives better results but it's a little more trouble > to setup your test data. See README.txt for a short > explanation of the tools. If you want to use timcv.py, you > can use splitndirs.py to create the test data. Which is preferred, timtest or timcv? The readme has: [timcv] is the preferred way to test when possible: it makes best use of limited data, and interpreting results is straightforward. But also: [timtest] is a much harder test than timcv, because it trains on N-1 times less data, and makes each classifier predict against N-1 times more data than it's been taught about. And I would have thought that a harder test was a better test. (I presume that if I understood more statistics I could answer this myself...). =Tony Meyer From tim.one at comcast.net Sun Jun 8 18:42:50 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Jun 8 17:43:21 2003 Subject: [spambayes-dev] Evaluating a training corpus In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301EE9A62@its-xchg4.massey.ac.nz> Message-ID: [Meyer, Tony] > Which is preferred, timtest or timcv? The readme has: > [timcv] is the preferred way to test when possible: So which part of "preferred" is unclear there ? > ... > But also: > [timtest] is a much harder test than timcv, because it trains on > N-1 times > less data, and makes each classifier predict against N-1 times > more data than it's been taught about. > And I would have thought that a harder test was a better test. (I > presume that if I understood more statistics I could answer this > myself...). Carry it to an extreme: train on 1 ham and 1 spam, then score a million msgs against that 2-message database. That's as hard as it gets, but unlikely to be predictive of real-life usage. If you have a thousand msgs in your database, and score against 100 per day, then timcv is quite close to real-life per-day usage. If you have 100 msgs in your database, and score against 1000 per day, then timtest is closer, but also harder to make sense of since each msg is scored N-1 different times (by each of N-1 different classifiers). timcv scores each msg exactly once, so is easier to make sense of. Pick your poison accordingly. From tim.one at comcast.net Sun Jun 8 18:49:30 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Jun 8 17:53:05 2003 Subject: [spambayes-dev] Version information repository In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301EE9A54@its-xchg4.massey.ac.nz> Message-ID: [Meyer, Tony] > I was waiting for a comment like this ;) In the Python world, what's > the difference between alpha & beta? Is it purely a stability > difference, or does 'feature completion' come into it as well? Historically, yes: nothing is guaranteed to remain the same across alpha releases. When a beta comes out, it comes with a promise that APIs are frozen, unless a catastrophic bug is discovered that requires changing them. New features can be snuck into a beta too, provided they don't change behavior of existing features. The next level after beta is "release candidate", to which only pure bugfixes can be applied. That works well for Python, but we're not constrained by it here. In the absence of a better idea, I prefer to stick to it, though. From T.A.Meyer at massey.ac.nz Mon Jun 9 10:55:05 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Jun 8 17:55:47 2003 Subject: [spambayes-dev] Evaluating a training corpus Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301EE9A98@its-xchg4.massey.ac.nz> > [Meyer, Tony] > > Which is preferred, timtest or timcv? The readme has: > > [timcv] is the preferred way to test when possible: > > So which part of "preferred" is unclear there ? :p [explanation] > Pick your poison accordingly. Thanks for this. Timtest is probably closer to my real-life usage since I keep a pretty small Outlook database (going by the 'if it works, why make it bigger' theory). In terms of posting any results to the list, does it matter which poison is chosen? I've historically used timtest, but only because that was the one used in the example 'how to test' that Mark posted a while back. =Tony Meyer =Tony Meyer From tim.one at comcast.net Sun Jun 8 19:00:17 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Jun 8 18:02:16 2003 Subject: [spambayes-dev] Evaluating a training corpus In-Reply-To: <20030608204751.GA19819@cthulhu.gerg.ca> Message-ID: [Greg Ward] > I'm mulling ways to evaluate the quality of a training corpus, and was > wondering what the rest of you have tried. My current technique is > pretty bogus: train on the complete corpus, and then score every > message in the corpus using the resulting database. Obviously this > is a self-fulfilling prophecy, but at least it highlights spam that > are *really* different from other spam (and ditto for ham). What is the purpose of testing for you? A useful answer will contain at least one number . > I know there's code lurking in there somewhere (timcv.py?) for > training on 90% of the corpus, and then evaluating the other 10% > under the resulting database. timcv does that 10 times (or N times, for whatever N you choose), training on (N-1)/N of the messages and scoring the remaining 1/N of them. > That got me to thinking: why not build a complete training database, > and then do this: > > foreach message: > remove message from database (ie. untrain) > score message > report score > put message back in database That's what timcv does if you set N equal to the number of messages (M) in the database. In outline: partition the msgs into N groups, each with about M/N msgs foreach group: remove group from database score group report scores put group back in database > That removes the "self-fulfilling prophecy bit", and the arbitrary > nature of the 10%/90% selection. M seems as arbitrary as 10 to me . > But it should preserve the property of highlighting unusual spam or > ham. Seems to me like this should do a pretty good job of finding > misclassified messages, at least. I think most people have found that breaking the msgs into 10 groups does an excellent job of finding misclassified msgs already. When we were running python.org tests, that's where my reports of misclassified msgs came from! BTW, after getting the misclassifed msgs into the right classes, it's not unusual to find more misclassified msgs by running it again. Sometimes this goes on for several iterations. > Has anyone else tried something like this? Is there code out there > already? timcv is a general "cross validation" driver (that's what "cv" stands for). It's a standard statistical testing technique, and you can make N as large (or small) as you like. For purposes of predicting real-live behavior, pick N so that (N-1)/N * M is about equal to the number of msgs you expect to train on. timcv then builds and tests N classifiers of that size, testing each against the M/N withheld from it. From tim.one at comcast.net Sun Jun 8 19:05:55 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Jun 8 18:08:26 2003 Subject: [spambayes-dev] Evaluating a training corpus In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301EE9A98@its-xchg4.massey.ac.nz> Message-ID: [Meyer, Tony] > Thanks for this. Timtest is probably closer to my real-life usage > since I keep a pretty small Outlook database (going by the 'if it > works, why make it bigger' theory). Same here: I use 3 distinct Outlook 2000s regularly, they all have databases with about 1000 msgs in them, and I rarely bother to train any of them anymore. > In terms of posting any results to the list, does it matter which > poison is chosen? I've historically used timtest, but only because > that was the one used in the example 'how to test' that Mark posted a > while back. We're still working on the meaning of "preferred" here ? timcv is faster, its results are easier to interpret, and it's generally more realistic given the relatively small amount of data most people have to throw at it. Best to view timtest as a tool for extreme testing by extreme researchers with extreme needs. From T.A.Meyer at massey.ac.nz Mon Jun 9 11:11:09 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Jun 8 18:11:48 2003 Subject: [spambayes-dev] Evaluating a training corpus Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301EE9AB9@its-xchg4.massey.ac.nz> > We're still working on the meaning of "preferred" here > ? timcv is faster, its results are easier to > interpret, and it's generally more realistic given the > relatively small amount of data most people have to throw at > it. Best to view timtest as a tool for extreme testing by > extreme researchers with extreme needs. Yes, this is the meaning I was looking for ;). Something that absolves me of the need to make a choice of my own... :) =Tony Meyer From noreply at sourceforge.net Sun Jun 8 20:27:40 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 8 22:27:49 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751127 ] Outlook AddIn Installtion Failure - WinXP Message-ID: Bugs item #751127, was opened at 2003-06-08 19:27 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751127&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Michael Williams (lamwilli) Assigned to: Mark Hammond (mhammond) Summary: Outlook AddIn Installtion Failure - WinXP Initial Comment: I attempted to install the Spambayes Outlook AddIn from: SpamBayes-Outlook-Setup-002.exe to my WinXP machine. Relevent configuration info follows: OS Name Microsoft Windows XP Professional Version 5.1.2600 Service Pack 1 Build 2600 Application name Outlook Version 9.0 Build 3821 AKA MS Outlook 2000 SR-1 (9.0.0.3821) The entire contents of the log file was: ---------------------------------------------------- Traceback (most recent call last): File "", line 15, in DllRegisterServer File "E:\src\pythonex\com\win32com\server\register.py", line 405, in RegisterClasses MemoryError ---------------------------------------------------- The error message at installation time was: ---------------------------------------------------- The instruction at "0x77f51ed3" referenced memory at "0xfffffff8". The memory could not be "read" ---------------------------------------------------- See attached... ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751127&group_id=61702 From skip at pobox.com Sun Jun 8 22:55:53 2003 From: skip at pobox.com (Skip Montanaro) Date: Sun Jun 8 22:56:01 2003 Subject: [spambayes-dev] Version information repository In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301EE9A54@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301EE9A54@its-xchg4.massey.ac.nz> Message-ID: <16099.63289.290613.698791@montanaro.dyndns.org> Tony> [Skip] >> I think the POP3Proxy is sufficiently well-tested to be >> considered beta unless you have some significant known issues >> which are unresolved. Tony> I was waiting for a comment like this ;) In the Python world, Tony> what's the difference between alpha & beta? Is it purely a Tony> stability difference, or does 'feature completion' come into it as Tony> well? Generally speaking, both feature completion and stability affect the transition from alpha to beta. In my experience with Python, it seems that the alpha-to-beta transition can get held up waiting for some expected functionality to arrive. Once in beta it's much less frequent to see any new functionality, only bug fixes. If the bad message problem is a general Spambayes problem and the functionality is complete, I would say that pop3proxy can go to beta. Since you do have several bits which are considered still in alpha, a 1.0a3 release for Spambayes package as a whole would be prudent. Skip From noreply at sourceforge.net Mon Jun 9 05:35:05 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 9 07:35:14 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751283 ] addin installs but does not work (stays as unchecked in COM) Message-ID: Bugs item #751283, was opened at 2003-06-09 13:35 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751283&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Radek Stachowiak (radek) Assigned to: Mark Hammond (mhammond) Summary: addin installs but does not work (stays as unchecked in COM) Initial Comment: here is my spamlog1: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy. py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy. py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy. py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751283&group_id=61702 From popiel at wolfskeep.com Mon Jun 9 13:11:29 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Mon Jun 9 15:11:34 2003 Subject: [spambayes-dev] More obvious logarithmic expiration data In-Reply-To: Message from Matthew Dixon Cowles of "Sat, 07 Jun 2003 15:10:32 CDT." <1054952982.19.1541@sake.mondoinfo.com> References: <1054952982.19.1541@sake.mondoinfo.com> Message-ID: <20030609191129.7EAB32DE97@cashew.wolfskeep.com> In message: <1054952982.19.1541@sake.mondoinfo.com> Matthew Dixon Cowles writes: >I mentioned a while ago that I'd do a little more work based on the >statistics that I had collected that showed that tokens that figured >in scoring were likely to be used for scoring again soon. > >I instrumented classifier.py and hammie.py to compute several scores >and log them when computing a score. Each time SpamBayes computes a >score, it also computes scores using only tokens that had been used >in scoring in the previous 24 hours, the previous week, the previous >two weeks, and the previous 30 days. > >Here are some results: > >2587 sets of scores processed >Number of scores that differ from actual score >by 0.00 6885 >by 0.01 or less 633 >by 0.10 or less 179 >by 0.20 or less 32 >by more than 0.20 32 Are these numbers from the within-24-hours number to the within-30-days number, or the within-7-days number to rhe within-30-days number (given that later you say you're comaparing against the 30-days number, not actual), or some combination of both? >Also encouragingly, the score changes that happen don't seem to move >the scores out of the standard 0.0-0.2 and 0.9-1.0 categories much: > > Moved out of spam Moved out of ham >Restricted to one day 13 8 >Restricted to one week 3 0 >Restricted to two weeks 2 0 This is much clearer data. >If I were cleverer, I'd have guessed all this from the number of >posts in which people have said that they've trained SpamBayes on >only a couple of hundred emails and that it's already working well >for them. But then I wouldn't have the fabulous collection of >ambiguous and invalid data that came before looking at how often >tokens are used for scoring . Yep. Empiricism beats clever theory, here. >Judging from this data, I could relatively painlessly use a database >that contains only those tokens that have figured in scoring in the >last ten days or so. That's about 11% of the 273487 tokens in my >database. Nifty. Do you have any provision for retaining (or desire to retain) words that were used a lot, but suddenly go through an N+1 day dry spell where they aren't used at all? >You'd need to bootstrap the process, presumably by counting a token >as used when it's first trained on. Waiting for a token to be used >before making it eligible for use has a certain theoretical elegance >but results might suffer . Yeah, that's likely the best bootstrap. Alternately, you could base off words appearing rather than being used, which has some value in not continually dropping noise words like 'the' and relearning that they're worthless, then not using them because they're within the .4-.6 exclusion range, then dropping them, then relearning them, etc... >And, of course, it's not really time that counts but rather the >number of emails seen. I'm not so convinced of this. One of the things we're dealing with is spam mutation rate, which I believe is independent of how much mail any one person receives. >Ironically, I started collecting these statistics when I was using a >laptop with a tiny hard disk. Now, with 60G at my disposal, the 23M >that my database takes up is pretty trifling. Indeed. I've got mine capped at about 21M, by only considering mail within the last 4 months... but it wouldn't significantly hurt my disk usage (out of if 30-some gig) if I didn't bother. I have far more space than this consumed by keeping archival copies of PennMUSH patch releases going back a decade... - Alex From matt at mondoinfo.com Mon Jun 9 16:56:43 2003 From: matt at mondoinfo.com (Matthew Dixon Cowles) Date: Mon Jun 9 16:56:53 2003 Subject: [spambayes-dev] More obvious logarithmic expiration data In-Reply-To: <20030609191129.7EAB32DE97@cashew.wolfskeep.com> References: <1054952982.19.1541@sake.mondoinfo.com> <20030609191129.7EAB32DE97@cashew.wolfskeep.com> Message-ID: <1055186806.42.2463@sake.mondoinfo.com> [me] >>2587 sets of scores processed >>Number of scores that differ from actual score >>by 0.00 6885 >>by 0.01 or less 633 >>by 0.10 or less 179 >>by 0.20 or less 32 >>by more than 0.20 32 [Alex Popiel] > Are these numbers from the within-24-hours number to the > within-30-days number, or the within-7-days number to rhe > within-30-days number (given that later you say you're comaparing > against the 30-days number, not actual), or some combination of > both? It's a combination. Four scores were computed for each message and the within-two-weeks, within-one-week, and within-24-hours scores were compared to the within-30-days score. Presumably, the larger differences are from the comparisons with the results that use the shorter cutoffs. >> Judging from this data, I could relatively painlessly use a >> database that contains only those tokens that have figured in >> scoring in the last ten days or so. That's about 11% of the 273487 >> tokens in my database. > Nifty. Do you have any provision for retaining (or desire to > retain) words that were used a lot, but suddenly go through an N+1 > day dry spell where they aren't used at all? I've thought a bit of that. It might be useful to bias the delete function toward retaining a token that hadn't been used for longer periods as a function of hamcount+spamcount. Some more work could determine if that's a valuable strategy. >>And, of course, it's not really time that counts but rather the >>number of emails seen. > > I'm not so convinced of this. One of the things we're dealing > with is spam mutation rate, which I believe is independent of > how much mail any one person receives. I agree but I meant something simpler than that. If I were on vacation for two weeks and therefore hadn't scored any messages in that time, it wouldn't make sense to expire my entire database. Regards, Matt From popiel at wolfskeep.com Mon Jun 9 15:09:03 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Mon Jun 9 17:09:07 2003 Subject: [spambayes-dev] More obvious logarithmic expiration data In-Reply-To: Message from Matthew Dixon Cowles of "Mon, 09 Jun 2003 15:56:43 CDT." <1055186806.42.2463@sake.mondoinfo.com> References: <1054952982.19.1541@sake.mondoinfo.com> <20030609191129.7EAB32DE97@cashew.wolfskeep.com> <1055186806.42.2463@sake.mondoinfo.com> Message-ID: <20030609210903.BD15E2DE97@cashew.wolfskeep.com> In message: <1055186806.42.2463@sake.mondoinfo.com> Matthew Dixon Cowles writes: > >It's a combination. Four scores were computed for each message and >the within-two-weeks, within-one-week, and within-24-hours scores >were compared to the within-30-days score. Presumably, the larger >differences are from the comparisons with the results that use the >shorter cutoffs. I've seen strange things come out of the data wherein more training made things worse, so I wouldn't take the above presumption on faith. ;-) >I've thought a bit of that. It might be useful to bias the delete >function toward retaining a token that hadn't been used for longer >periods as a function of hamcount+spamcount. Some more work could >determine if that's a valuable strategy. *nod* This is why I was leaning towards average appearance frequency over the lifetime of the token instead of time since last use in my own thoughts. Of course, I still haven't done anything beyond thought-experiments. >I agree but I meant something simpler than that. If I were on >vacation for two weeks and therefore hadn't scored any messages in >that time, it wouldn't make sense to expire my entire database. I keep forgetting that some people score messages only when a human is about to look at them. I've got my rules in procmail, so even if I go on vacation, the scoring takes place. - Alex From noreply at sourceforge.net Mon Jun 9 20:31:06 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 9 22:31:12 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751686 ] Stopped filtering easy messages, log file errors Message-ID: Bugs item #751686, was opened at 2003-06-09 19:31 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751686&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Bennett (ttennebkram) Assigned to: Mark Hammond (mhammond) Summary: Stopped filtering easy messages, log file errors Initial Comment: I've had SpamBayes up and running for about a week. After doing a lot of new filtering today, it will no longer filter my inbox. I see from the log file that it is trying to. Setup: Outlook 2000 Windows 2000 SpamBayes binary install, 0.2a? Sample: AssertionError Error filtering message '' Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 81, in filter_folder File "E:\src\spambayes\Outlook2000\filter.py", line 15, in filter_message File "E:\src\spambayes\Outlook2000\manager.py", line 440, in score File "e:\src\spambayes\spambayes\classifier.py", line 217, in chi2_spamprob File "e:\src\spambayes\spambayes\classifier.py", line 465, in _getclues File "e:\src\spambayes\spambayes\classifier.py", line 319, in probability AssertionError ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751686&group_id=61702 From noreply at sourceforge.net Mon Jun 9 20:46:05 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 9 22:46:11 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751693 ] Get multiple dialog boxes; close one, 2nd copy appears Message-ID: Bugs item #751693, was opened at 2003-06-09 19:46 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751693&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Bennett (ttennebkram) Assigned to: Mark Hammond (mhammond) Summary: Get multiple dialog boxes; close one, 2nd copy appears Initial Comment: When I click the Anti-Spam button in Outlook, I do get the dialog box. When I click Close, the dialog pops up again, and I have to hit Close a second time. One time I actually got 3 dialog boxes in a row. I installed on a friend's machine and saw that same thing. I think it's harmless (?), but odd. I didn't see it logged in the database yet (looked for "multiple"). I've seen this on two different machines. Machine 1 Setup: Outlook 2000 Windows 2000 SpamBayes binary install, 0.2a? Machine 2 Setup: Outlook XP Windows XP SpamBayes binar install BTW, VERY COOL SOFTWARE!!!! ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751693&group_id=61702 From noreply at sourceforge.net Mon Jun 9 21:11:02 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 9 23:11:07 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751699 ] Keep getting "Error processing missed messages!" Message-ID: Bugs item #751699, was opened at 2003-06-09 20:11 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751699&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Bennett (ttennebkram) Assigned to: Mark Hammond (mhammond) Summary: Keep getting "Error processing missed messages!" Initial Comment: When I restart Outlook it doesn't pick up messages that it has missed. Though similar to another bug I reported (about errors during filtering), the errors in the log file look completely different, so perhaps this is a different issue, or might provide more clues if it's the same issue. Outlook 2000 on Windows 2000 SpamBayes binary release 02a Error processing missed messages! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\addin.py", line 671, in OnConnection File "E:\src\spambayes\Outlook2000\addin.py", line 945, in ProcessMissedMessag es File "E:\src\spambayes\Outlook2000\addin.py", line 163, in ProcessMessage File "E:\src\spambayes\Outlook2000\filter.py", line 15, in filter_message File "E:\src\spambayes\Outlook2000\manager.py", line 440, in score File "e:\src\spambayes\spambayes\classifier.py", line 217, in chi2_spamprob File "e:\src\spambayes\spambayes\classifier.py", line 465, in _getclues File "e:\src\spambayes\spambayes\classifier.py", line 319, in probability AssertionError pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 203, in OnItemAdd ... etc... Attaching full log file. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751699&group_id=61702 From noreply at sourceforge.net Mon Jun 9 21:17:58 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 9 23:18:02 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751699 ] Keep getting "Error processing missed messages!" Message-ID: Bugs item #751699, was opened at 2003-06-09 23:11 Message generated for change (Comment added) made by tim_one You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751699&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Bennett (ttennebkram) Assigned to: Mark Hammond (mhammond) Summary: Keep getting "Error processing missed messages!" Initial Comment: When I restart Outlook it doesn't pick up messages that it has missed. Though similar to another bug I reported (about errors during filtering), the errors in the log file look completely different, so perhaps this is a different issue, or might provide more clues if it's the same issue. Outlook 2000 on Windows 2000 SpamBayes binary release 02a Error processing missed messages! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\addin.py", line 671, in OnConnection File "E:\src\spambayes\Outlook2000\addin.py", line 945, in ProcessMissedMessag es File "E:\src\spambayes\Outlook2000\addin.py", line 163, in ProcessMessage File "E:\src\spambayes\Outlook2000\filter.py", line 15, in filter_message File "E:\src\spambayes\Outlook2000\manager.py", line 440, in score File "e:\src\spambayes\spambayes\classifier.py", line 217, in chi2_spamprob File "e:\src\spambayes\spambayes\classifier.py", line 465, in _getclues File "e:\src\spambayes\spambayes\classifier.py", line 319, in probability AssertionError pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 203, in OnItemAdd ... etc... Attaching full log file. ---------------------------------------------------------------------- >Comment By: Tim Peters (tim_one) Date: 2003-06-09 23:17 Message: Logged In: YES user_id=31435 It looks like your database is corrupt, and needs to be retrained from scratch. If you do that, I suggest training it on about equal numbers of ham and spam (you've got a 10-to- 1 imbalance now, which gives the classifier a warped view of the world -- it tries to make up for that, but may not be able to -- the math and the code weren't developed or tested or tuned with such a lopsided mix). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751699&group_id=61702 From noreply at sourceforge.net Wed Jun 11 06:56:55 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 11 08:57:02 2003 Subject: [spambayes-dev] [ spambayes-Bugs-752568 ] Lost DB on Outlook exit Message-ID: Bugs item #752568, was opened at 2003-06-11 07:56 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=752568&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: matt tagliaferri (mtagliaf) Assigned to: Mark Hammond (mhammond) Summary: Lost DB on Outlook exit Initial Comment: I think I've narrowed this down - it appears that the database is lost when I shut down Windows and it closes Outlook as part of the shutdown process. If I close Outlook myself, the database is retained. matt tag PS: kick-butt program. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=752568&group_id=61702 From T.A.Meyer at massey.ac.nz Thu Jun 12 18:38:35 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Jun 12 01:39:18 2003 Subject: [spambayes-dev] Outlook Express and other 'dumb' clients Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301EEA5D2@its-xchg4.massey.ac.nz> The "Tim's sister" metric has recently become a little more important to me, since my fianc? wants to use a spam filter, and she has similar usage patterns as the described sisters. Annoyingly, she's using Outlook Express, which, although it works with pop3proxy, isn't anywhere as nice as other solutions (e.g. the Outlook plugin and some of the hammie solutions). I had a thought about how to make things easier, and wanted to run it past the list first, in case there are glaring problems that I'm overlooking, since it'll be a bit of coding to get it to work. There are two thoughts, really. The first is to write a script like mboxtrain that will take OE mbx/dbx files. I *think* that this shouldn't be too tricky if I piggyback off other projects. It doesn't really help that much, but it would be a start. An alternative (this is the main idea) is to run a (local) IMAP server. OE works fine with IMAP, so you could connect to the Spambayes IMAP server as well as whatever normal POP3/IMAP/Hotmail connections you already have. The server would have two folders - "train as ham" and "train as spam". The idea would be that you use the "copy to folder" option to move messages to one of those two folders, and can train that way. You'd still use imapfilter/pop3proxy for classification purposes. You could actually just move spam, rather than copy it. This has the advantage of not having to use the browser to train, and is more intuitive (I think) than the forward/bounce method of smtpproxy (which doesn't work well with OE, anyway). Thoughts? =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Jun 12 18:53:04 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Jun 12 01:53:44 2003 Subject: [spambayes-dev] Feature Requests Tracker Mail Items Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301EEA5D5@its-xchg4.massey.ac.nz> Would one of the people with magic powers (Mark? Skip? Tim?) be able to make the feature requests (and patches, I guess) come through to -dev rather than spambayes? It seems (to me at least) that they ought to follow along with the bug reports. (And they're of more interest to developers, who might just implement the request, than users, anyway). From tim_one at email.msn.com Thu Jun 12 03:59:10 2003 From: tim_one at email.msn.com (Tim Peters) Date: Thu Jun 12 03:00:09 2003 Subject: [spambayes-dev] Feature Requests Tracker Mail Items In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301EEA5D5@its-xchg4.massey.ac.nz> Message-ID: [Meyer, Tony] > Would one of the people with magic powers (Mark? Skip? Tim?) be able > to make the feature requests (and patches, I guess) come through to > -dev rather than spambayes? It seems (to me at least) that they > ought to follow along with the bug reports. (And they're of more > interest to developers, who might just implement the request, than > users, anyway). Good idea -- done. From noreply at sourceforge.net Thu Jun 12 05:35:32 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 12 07:35:35 2003 Subject: [spambayes-dev] [ spambayes-Bugs-753215 ] Should be more fault-tolerent Message-ID: Bugs item #753215, was opened at 2003-06-12 11:35 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=753215&group_id=61702 Category: hammie Group: None Status: Open Resolution: None Priority: 5 Submitted By: Raphael Manfredi (rmanfredi) Assigned to: Nobody/Anonymous (nobody) Summary: Should be more fault-tolerent Initial Comment: I've got the following stack trace, probably due to a badly encoded MIME extension. Traceback (most recent call last): File "/home/ram/bin/spambayes/hammiefilter.py", line 178, in ? main() File "/home/ram/bin/spambayes/hammiefilter.py", line 174, in main action(msg) File "/home/ram/bin/spambayes/hammiefilter.py", line 112, in filter return h.filter(msg) File "/home/ram/bin/spambayes/hammie.py", line 108, in filter spambayes.hammiebulk.main() File "/home/ram/bin/spambayes/hammie.py", line 38, in _scoremsg spambayes.hammiebulk.main() File "./spambayes/classifier.py", line 217, in chi2_spamprob File "./spambayes/classifier.py", line 436, in _getclues File "./spambayes/compatsets.py", line 374, in __init__ File "./spambayes/compatsets.py", line 333, in _update File "./spambayes/tokenizer.py", line 1052, in tokenize File "./spambayes/tokenizer.py", line 1106, in tokenize_headers File "/usr/lib/python2.2/email/Header.py", line 92, in decode_header dec = email.base64MIME.decode(encoded) File "/usr/lib/python2.2/email/base64MIME.py", line 179, in decode dec = a2b_base64(s) binascii.Error: Incorrect padding ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=753215&group_id=61702 From gward at python.net Thu Jun 12 21:56:18 2003 From: gward at python.net (Greg Ward) Date: Thu Jun 12 20:56:22 2003 Subject: [spambayes-dev] Evaluating a training corpus In-Reply-To: References: <20030608204751.GA19819@cthulhu.gerg.ca> Message-ID: <20030613005618.GA6245@cthulhu.gerg.ca> [me, last Sunday] > I'm mulling ways to evaluate the quality of a training corpus, and was > wondering what the rest of you have tried. [Tim's response] > What is the purpose of testing for you? A useful answer will contain at > least one number . Primary purpose is to find misclassified messages. Secondary purpose is to give me a warm fuzzy feeling that Spambayes is #1 (ie. correctly classifies mail that it has not been trained on). (There, I got a number in.) > timcv does that 10 times (or N times, for whatever N you choose), training > on (N-1)/N of the messages and scoring the remaining 1/N of them. [...] > That's what timcv does if you set N equal to the number of messages (M) in > the database. In outline: Hmmm, OK. I guess I could use timcv.py then, but rearranging my 18 corpora into 10 directories each would is a bit inconvenient. So I tried an end-run around timcv.py by modifying my scoring script to untrain, score, and retrain. Here's a simplified version: """ import sys from spambayes import hammie from spambayes import tokenizer def score(tokens, label): (prob, clues) = bayes.spamprob(tokens, True) high_clues = ["%s:%.3f" % clue for clue in clues[-5:]] high_clues = ", ".join(high_clues) print "%s: %.3f: %s" % (label, prob, high_clues) (db_filename, msg_filename) = sys.argv[1:] hammie = hammie.open(db_filename, mode="w") bayes = hammie.bayes # Read and tokenize message (which must be spam) message = open(msg_filename).read() tokens = list(tokenizer.tokenize(message)) # Score with that message (presumably) in the database. score(tokens, "initial") # Untrain (ie. remove this message from the database) and score again # (this is where we assume the message is spam). bayes.unlearn(tokens, True) score(tokens, "unlearn") # Retrain and score one last time. Should give identical results # to the initial scoring... but doesn't! bayes.learn(tokens, True) score(tokens, "relearn") """ ...does that look correct? It seems to work with a pickle store, but I'm getting weird results with a DB store. I think that's another issue though -- see my next post... Greg -- Greg Ward http://www.gerg.ca/ Jesus Saves -- and you can too, by redeeming these valuable coupons! From gward at python.net Thu Jun 12 22:19:28 2003 From: gward at python.net (Greg Ward) Date: Thu Jun 12 21:19:32 2003 Subject: [spambayes-dev] Pickle vs DB inconsistencies Message-ID: <20030613011928.GB6245@cthulhu.gerg.ca> I'm getting inconsistent results using the same training corpus when I store the database to a pickle vs a DB file. Here's how I created the training databases (once DB, once pickle): $ hammie.py -d -p db/default.db -g corpus/default/ham -s corpus/default/spam $ hammie.py -D -p db/default.pkl -g corpus/default/ham -s corpus/default/spam Results are unsurprising: $ ll db/default.{db,pkl} -rw-rw-r-- 1 greg dev 2600960 Jun 12 20:20 db/default.db -rw-rw-r-- 1 greg dev 2277665 Jun 12 20:16 db/default.pkl Now I try to score a message with each database: $ msg=corpus/checkins/ham/cur/19OLiy-0000XO-00:2,S $ hammie.py -f -d -p db/default.db < $msg | grep X-Spambayes X-Spambayes-Classification: unsure; 0.26 $ hammie.py -f -D -p db/default.pkl < $msg | grep X-Spambayes X-Spambayes-Classification: ham; 0.15 Huh?!? My own scoring script (which just exists because I like one line of output per scored message) shows the same thing: $ ./score -d db/default.db $msg ? 0.258 corpus/checkins/ham/cur/19OLiy-0000XO-00:2,S $ ./score -d db/default.pkl $msg N 0.153 corpus/checkins/ham/cur/19OLiy-0000XO-00:2,S The other neat feature of my "score" script is its -v option, which dumps all the clues. -v on the above two runs reveals that the set of clue tokens are *nearly* identical, but the scores of each token are subtly different between DB and pickle. Some excerpts: '*H*': 1.000 '*H*': 0.991 '*S*': 0.515 '*S*': 0.297 'to:spambayes': 0.001 'to:spambayes': 0.002 'from:Greg': 0.003 'from:Greg': 0.006 'system.': 0.006 'system.': 0.012 'binary': 0.007 'binary': 0.014 '(not': 0.009 '(not': 0.018 [...] 'taking': 0.370 'taking': 0.372 'are': 0.377 'are': 0.377 'reply-to:none': 0.379 'reply-to:none': 0.379 'for': 0.382 'for': 0.382 'privileges': 0.386 [not in the pickle store] 'header:Received:2': 0.392 'header:Received:2': 0.392 'windows': 0.397 'windows': 0.398 'unable': 0.606 'unable': 0.604 'west': 0.617 'west': 0.614 [...] 'notified': 0.946 'staff': 0.937 'hereby': 0.949 'notified': 0.941 'click': 0.955 'federal': 0.941 'message-id:skip:3 30': 0.965 'click': 0.955 'belonging': 0.978 'belonging': 0.959 'los': 0.984 'los': 0.970 'medical': 0.984 'medical': 0.970 'message-id:skip:t 20': 0.988 'message-id:skip:t 20': 0.976 'street,': 0.990 'street,': 0.980 (The correspondence gets jumbled near the end because the tokens are sorted by score; it appears that the variance is higher near the top end.) Anybody have a clue WTF is going on here? I'm running a several-days-old CVS spambayes, so I'll try "cvs up" first. And then I guess I'll start picking through the DB and pickle files manually to see if those differences are visible that way. But I have no idea what that will tell me ... Greg -- Greg Ward http://www.gerg.ca/ Outside of a dog, a book is man's best friend. Inside of a dog, it's too dark to read. From T.A.Meyer at massey.ac.nz Fri Jun 13 14:28:42 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Jun 12 21:35:40 2003 Subject: [spambayes-dev] Pickle vs DB inconsistencies Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301EEA7A0@its-xchg4.massey.ac.nz> > Anybody have a clue WTF is going on here? Not much of one, but: > I'm running a > several-days-old CVS spambayes, so I'll try "cvs up" first. The only semi-recent thing I can think of that might effect this sort of thing are Mark's changes to the DB classifier. The two main points, IIRC, were that the classifier now doesn't cache hapaxes and stores a list of changed words so that not all tokens are saved. It's possible that you have a cvs that has the first changes he checked in (which were buggy, IIRC), and not the fix. Apart from trying current cvs, you might also try checking out cvs from the 28th of May or before, which doesn't have these changes, and see if that fixes it. > And then I guess I'll start picking through the DB and pickle > files manually to see if those differences are visible that way. > But I have no idea what that will tell me ... BTW you can use the DBImpExp script to do this if you want to (to convert to text/pickles/db). What happens if you only train on a single message? Do you get the same result? =Tony Meyer From gward at python.net Thu Jun 12 22:40:10 2003 From: gward at python.net (Greg Ward) Date: Thu Jun 12 21:40:13 2003 Subject: [spambayes-dev] Pickle vs DB inconsistencies In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301EEA7A0@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F1301EEA7A0@its-xchg4.massey.ac.nz> Message-ID: <20030613014010.GA6598@cthulhu.gerg.ca> On 13 June 2003, Meyer, Tony said: > The only semi-recent thing I can think of that might effect this sort of > thing are Mark's changes to the DB classifier. Lucky guess (I guess) -- I cvs up'd and the problem went away. False alarm. Thanks Tony! Greg -- Greg Ward http://www.gerg.ca/ A man without religion is like a fish without a bicycle. From skip at pobox.com Thu Jun 12 22:01:41 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Jun 12 22:01:49 2003 Subject: [spambayes-dev] Evaluating a training corpus In-Reply-To: <20030613005618.GA6245@cthulhu.gerg.ca> References: <20030608204751.GA19819@cthulhu.gerg.ca> <20030613005618.GA6245@cthulhu.gerg.ca> Message-ID: <16105.12421.375631.365376@montanaro.dyndns.org> Greg> Hmmm, OK. I guess I could use timcv.py then, but rearranging my Greg> 18 corpora into 10 directories each would is a bit inconvenient. Not really. Take a look at utilities/splitndirs.py. It can split an mbox file across a user-specified number of directories. Skip From gward at python.net Thu Jun 12 23:01:46 2003 From: gward at python.net (Greg Ward) Date: Thu Jun 12 22:01:52 2003 Subject: [spambayes-dev] Pickle vs DB inconsistencies In-Reply-To: <20030613014010.GA6598@cthulhu.gerg.ca> References: <1ED4ECF91CDED24C8D012BCF2B034F1301EEA7A0@its-xchg4.massey.ac.nz> <20030613014010.GA6598@cthulhu.gerg.ca> Message-ID: <20030613020146.GA6579@cthulhu.gerg.ca> No wait, false alarm on the false alarm. I *am* still seeing inconsistent behaviour between pickle and DB stores, only this time I have to score/untrain/score/retrain/score to show the difference. Here's the code I'm using (same as I posted earlier this evening): """ import sys from spambayes import hammie from spambayes import tokenizer def score(tokens, label): (prob, clues) = bayes.spamprob(tokens, True) high_clues = ["%s:%.3f" % clue for clue in clues[-5:]] high_clues = ", ".join(high_clues) print "%s: %.3f: %s" % (label, prob, high_clues) (db_filename, msg_filename) = sys.argv[1:] usedb = db_filename.endswith(".db") # assume pickle otherwise hammie = hammie.open(db_filename, usedb=usedb, mode="w") bayes = hammie.bayes # Read and tokenize message (which must be spam) message = open(msg_filename).read() tokens = list(tokenizer.tokenize(message)) # Score with that message (presumably) in the database. score(tokens, "initial") # Untrain (ie. remove this message from the database) and score again # (this is where we assume the message is spam). bayes.unlearn(tokens, True) score(tokens, "unlearn") # Retrain and score one last time. Should give identical results # to the initial scoring... but doesn't! bayes.learn(tokens, True) score(tokens, "relearn") """ First, let's score/untrain/score/retrain/score the same message with two copies of the same training database (one pickle, one DB) (sorry about the long lines): $ ./simplescore db/default.db $msg initial: 0.995: volume:0.987, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 unlearn: 0.482: volume:0.986, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 relearn: 1.000: volume:0.987, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 $ ./simplescore db/default.pkl $msg initial: 0.995: volume:0.987, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 unlearn: 0.272: volume:0.986, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 relearn: 0.995: volume:0.987, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 I see two problems here: * untraining on this message gives a different score for the pickle and DB store * after retraining, the score with the DB store is not the same as the initial score (or with the retrained pickle store) Now let's repeat the experiment: $ ./simplescore db/default.db $msg initial: 1.000: volume:0.987, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 unlearn: 0.482: volume:0.986, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 relearn: 1.000: volume:0.987, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 $ ./simplescore db/default.pkl $msg initial: 0.995: volume:0.987, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 unlearn: 0.272: volume:0.986, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 relearn: 0.995: volume:0.987, hands:0.988, materials:0.991, 24/7:0.994, purchase:0.994 Still getting inconsistent results after untraining. At least the DB store has settled down and gives the same results initially and after retraining. Too bad it's inconsistent with the pickle store! ;-( *Now* WTF is going on? Greg -- Greg Ward http://www.gerg.ca/ This message transmitted with 100% recycled electrons. From gward at python.net Thu Jun 12 23:06:27 2003 From: gward at python.net (Greg Ward) Date: Thu Jun 12 22:06:30 2003 Subject: [spambayes-dev] Evaluating a training corpus In-Reply-To: <16105.12421.375631.365376@montanaro.dyndns.org> References: <20030608204751.GA19819@cthulhu.gerg.ca> <20030613005618.GA6245@cthulhu.gerg.ca> <16105.12421.375631.365376@montanaro.dyndns.org> Message-ID: <20030613020627.GB6598@cthulhu.gerg.ca> On 12 June 2003, Skip Montanaro said: > > Greg> Hmmm, OK. I guess I could use timcv.py then, but rearranging my > Greg> 18 corpora into 10 directories each would is a bit inconvenient. > > Not really. Take a look at utilities/splitndirs.py. It can split an mbox > file across a user-specified number of directories. I said a *bit* inconvenient, not unachievable. It's just that I already have 17,945 files spread across 166 directories in my play area; the last thing I need is to take the 17,908 files that comprise my 18 training corpora and hardlink them into 360 new directories... grumble... Greg PS. details: this is for mail.python.org : 300+ recipient addresses whittled down into 18 corpora of related addresses; input data is the three email harvesting runs (2002-09, 2002-10, and 2003-06) I've done on python.org. It's fun, but slow going. -- Greg Ward http://www.gerg.ca/ Any priest or shaman must be presumed guilty until proven innocent. From T.A.Meyer at massey.ac.nz Fri Jun 13 15:29:27 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Jun 12 22:30:43 2003 Subject: [spambayes-dev] Pickle vs DB inconsistencies Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301EEA800@its-xchg4.massey.ac.nz> > No wait, false alarm on the false alarm. I *am* still seeing > inconsistent behaviour between pickle and DB stores, only this time I > have to score/untrain/score/retrain/score to show the difference. [...] > *Now* WTF is going on? Before you cvs-up'd, how old was the source you were using? Pre 27/05/03? If not, could you try running exactly the same test with a cvs from that date? =Tony Meyer From noreply at sourceforge.net Thu Jun 12 21:17:53 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 12 23:17:57 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-753708 ] Support POP over SSL Message-ID: Feature Requests item #753708, was opened at 2003-06-12 20:17 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=753708&group_id=61702 Category: pop3proxy Group: None Status: Open Priority: 5 Submitted By: rick gregory (rickg17) Assigned to: Nobody/Anonymous (nobody) Summary: Support POP over SSL Initial Comment: Enable POPproxy to proxy a connection to a POP3 server over SSL, supporting either port 110 or 995. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=753708&group_id=61702 From tim.one at comcast.net Fri Jun 13 01:16:27 2003 From: tim.one at comcast.net (Tim Peters) Date: Fri Jun 13 00:18:32 2003 Subject: [spambayes-dev] Pickle vs DB inconsistencies In-Reply-To: <20030613020146.GA6579@cthulhu.gerg.ca> Message-ID: [Greg Ward, wrestling with mysteries] I'd presume that the pickle code is correct. What you're seeing is consistent with that (the scores appear the same after untraining and retraining when using the pickle, but don't when using the DB). I suspect there's something wrong with the DB code, as for months we've gotten reports of odd bugs from DB users that nobody using the pickle code has reported. The storage format isn't so interesting as that the pickle code uses a plain Python dict *during* training and scoring -- at the start, that code was so simple it was obviously correct. It all got a lot more complicated, via layers of indirection, to cater to the DB backend, but the runtime dict is still a lot simpler than building a funky cache by hand on top of an external DB. I still haven't tried the DB code. For better clues, replace your %.3f and %.2f formats with %.17g, i.e. print values to full machine precision (or print repr(some_float) -- close to the same thing). Then we don't have to guess whether values are "just close", we can see whether they are (or aren't) in fact identical. Also, as Tony said, if you train on just a couple messages, it will be straightforward to Pronounce on exactly what should have happened, down to the 17th digit. That would tell us for sure which scheme is hosed, and then digging will reveal how. From mhammond at skippinet.com.au Fri Jun 13 16:28:05 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Fri Jun 13 01:28:55 2003 Subject: [spambayes-dev] Outlook Express and other 'dumb' clients In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301EEA5D2@its-xchg4.massey.ac.nz> Message-ID: <014d01c3316c$91de4d80$530f8490@eden> That all sounds great. Another option for filtering may be to have the pop3proxy *not* deliver any spam or uncertain messages, but instead "move" them to an IMAP folder in our server. I don't know enough about IMAP to know if this is feasable, particularly when trying to track items in these folders for incremental training purposes. You are right - there is a bit of work there :) Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1752 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes-dev/attachments/20030613/d5b6e321/winmail.bin From T.A.Meyer at massey.ac.nz Fri Jun 13 20:19:07 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri Jun 13 03:19:47 2003 Subject: [spambayes-dev] Outlook Express and other 'dumb' clients Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1301EEA8EB@its-xchg4.massey.ac.nz> > That all sounds great. Another option for filtering may be > to have the pop3proxy *not* deliver any spam or uncertain > messages, but instead "move" them to an IMAP folder in our > server. I don't know enough about IMAP to know if this is > feasible, particularly when trying to track items in these > folders for incremental training purposes. That's an interesting idea, and as far as I can tell it should work. > You are right - there is a bit of work there :) Before I go ahead and try this out, I don't suppose anyone knows of an IMAP server written in Python? (where's the imapserv module when you need it? :) I've googled, but there doesn't seem to be anything. =Tony Meyer From anthony at interlink.com.au Fri Jun 13 19:47:01 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Fri Jun 13 05:28:35 2003 Subject: [spambayes-dev] Outlook Express and other 'dumb' clients In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1301EEA8EB@its-xchg4.massey.ac.nz> Message-ID: <200306130847.h5D8l2TD002799@localhost.localdomain> >>> "Meyer, Tony" wrote > Before I go ahead and try this out, I don't suppose anyone knows of an > IMAP server written in Python? (where's the imapserv module when you > need it? :) I believe one of the twisted guys was working on one. I plan to spend some time on it for my own porpoises (hopefully soon). -- Anthony Baxter It's never too late to have a happy childhood. From gward at python.net Fri Jun 13 09:17:08 2003 From: gward at python.net (Greg Ward) Date: Fri Jun 13 08:17:11 2003 Subject: [spambayes-dev] Pickle vs DB inconsistencies In-Reply-To: References: <20030613020146.GA6579@cthulhu.gerg.ca> Message-ID: <20030613121708.GA7638@cthulhu.gerg.ca> On 13 June 2003, Tim Peters said: > Also, as Tony said, if you train on just a couple messages, it will be > straightforward to Pronounce on exactly what should have happened, down to > the 17th digit. That would tell us for sure which scheme is hosed, and then > digging will reveal how. Just tried it with a two-message corpus (one spam, one ham), and it looks like DB and pickle are the same now. Gotta go to work now; will dig into it this evening. Greg -- Greg Ward http://www.gerg.ca/ Pointers are Arrays; Code is Data; Time is Money From noreply at sourceforge.net Fri Jun 13 12:49:08 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 13 14:49:13 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-751204 ] Recalculate spam % when you "delete as spam" Message-ID: Feature Requests item #751204, was opened at 2003-06-09 02:36 Message generated for change (Comment added) made by swassbac You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=751204&group_id=61702 Category: None Group: None Status: Closed Priority: 5 Submitted By: Dan Boccabella (swassbac) Assigned to: Nobody/Anonymous (nobody) Summary: Recalculate spam % when you "delete as spam" Initial Comment: I frequently go through my SPAM folder that I have set to autmatically filter to and sort by SPAM %. This way I can take a closer look at ones it wasn't as sure about. Near the top there are always some that I said delete as spam to. I would think those would be updated to 100% when I click the delete as spam button. On a related note, if I say recover as SPAM, why can't it set the SPAM % to 0%? Thanks for a great product! Dan ---------------------------------------------------------------------- >Comment By: Dan Boccabella (swassbac) Date: 2003-06-13 13:49 Message: Logged In: YES user_id=797027 I understand that 0%/100% maybe an exaggeration, but just recalcing to move the number closer to the right end (so I don't still see 0% in the SPAM folder). Is there any white/black list functionality planned? Seems that I still get an occasional message from a co-worker in quarantine - it would be nice to be able to say always (no matter what the content) accept mail from this person. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-12 00:56 Message: Logged In: YES user_id=552329 The "Delete as spam" button does (I believe) rescore the message. The thing is that even if you train a message as ham/spam, that doesn't mean that it will score 0%/100%. It should move closer to the correct end, but the score depends on how messages have been classified that include the same sort of tokens (features) as that message. For example, if I trained a message as spam, and then trained an identical message as ham, the score should be 50% - each token is present in one ham message and one spam message. There's more information (probably clearer!) on the website under documentation (start with the FAQ). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=751204&group_id=61702 From noreply at sourceforge.net Sat Jun 14 20:56:09 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat Jun 14 22:56:19 2003 Subject: [spambayes-dev] [ spambayes-Bugs-754729 ] Install fails with removable hard drive Message-ID: Bugs item #754729, was opened at 2003-06-14 19:56 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=754729&group_id=61702 Category: Outlook Group: v1.0 (example) Status: Open Resolution: None Priority: 5 Submitted By: David Martin (hurricane51) Assigned to: Mark Hammond (mhammond) Summary: Install fails with removable hard drive Initial Comment: Windows XP Pro SP2 Outlook 2002 (XP) with all Windows Update patches When trying to install with a "removable Hard Disk" device (in this case a CompactFlach reader with no media installed), I received the following error: Inbox - Microsoft Outlook: OUTLOOK.EXE - NO DISK There is no disk in the drive. Please insert a disk into drive \Device\Harddisk2\DR4. Installation log was clean, however, the plugin did not appear in Outlook. This drive is only used for downloading photros from my digicam. Otherwise, it is never used, it is not the TEMP drive, etc. See attached GIF image. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=754729&group_id=61702 From mhammond at skippinet.com.au Mon Jun 16 12:20:12 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun Jun 15 21:20:53 2003 Subject: [spambayes-dev] Options changes Message-ID: <067601c333a5$70e40d00$530f8490@eden> I am making a few changes to Options that I will detail a little later, but there are a couple of things that I would like some feedback on: 1) Options does a simple "print" for invalid values. Eg: print "Attempted to set [%s] %s with invalid value %s (%s)" % \ (sect, opt, val, type(val)) It makes sense to me that an exception would be better here. It would prevent strange "silent" errors in programs, and would also allow programs to lean on the validation code already in place (ie, they could catch the exception and tell the user). We would need to ensure that when loading a file (which the user may have edited), these errors are simply printed and the file load continues. But when the program itself is setting the option to an invalid value, an exception would be more useful. Can anyone tell me quickly how pop3proxy's config server validates the form input? 2) Saving to a temp file. Currently we use TemporaryFile() then shutil.copyfileobj it to the real file. These may be on different devices. It makes more sense to me to save it to a ".tmp" file in the correct directory, then remove and move the file across. It wouldn't be thread-safe, but a) not much else here is, and b) and "thread-safe" program that tries to write to the same config file at the same time is insane anyway . These config files are likely to be small, so it isn't really a big deal, but still did strike me. Any thoughts on either of these? Mark. From T.A.Meyer at massey.ac.nz Mon Jun 16 14:29:14 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Jun 15 21:30:06 2003 Subject: [spambayes-dev] Options changes Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130207E026@its-xchg4.massey.ac.nz> > 1) Options does a simple "print" for invalid values. > It makes sense to me that an exception would be better > here. It used to do this (wait until all the options had been parsed, print an error for each one, and stop). I took it out when I redid the options structure because no matter how much testing I did, I kept missing little errors. The hope was to let people continue to function while pointing out the errors. The plan always was to return it, though: In the to-do list: o Once the above is done, and we have waited a suitable time, stop allowing invalid options in configuration files Things seem to be stable again with the options, so now is probably good enough a time. > Can anyone tell me quickly how pop3proxy's config server > validates the form input? In a nutshell: * It checks that options.is_valid(section, option, value) is True for each option. (In UserInterface.py). * It checks that there aren't duplicate ports, and that the number of servers matches the number of ports (ProxyUI.py, and similar in ImapUI.py). > 2) Saving to a temp file. Currently we use TemporaryFile() > then shutil.copyfileobj it to the real file. These may be on > different devices. It makes more sense to me to save it to a > ".tmp" file in the correct directory, then remove and move > the file across. That's what I originally did, and then I thought that the temp file routines must be there for a reason, so I figured I should use them :) No problems with changing it here. =Tony Meyer From noreply at sourceforge.net Sun Jun 15 19:51:34 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 15 21:51:41 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-751204 ] Recalculate spam % when you "delete as spam" Message-ID: Feature Requests item #751204, was opened at 2003-06-09 19:36 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=751204&group_id=61702 Category: None Group: None Status: Closed Priority: 5 Submitted By: Dan Boccabella (swassbac) Assigned to: Nobody/Anonymous (nobody) Summary: Recalculate spam % when you "delete as spam" Initial Comment: I frequently go through my SPAM folder that I have set to autmatically filter to and sort by SPAM %. This way I can take a closer look at ones it wasn't as sure about. Near the top there are always some that I said delete as spam to. I would think those would be updated to 100% when I click the delete as spam button. On a related note, if I say recover as SPAM, why can't it set the SPAM % to 0%? Thanks for a great product! Dan ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-16 13:51 Message: Logged In: YES user_id=552329 White/black lists are really no good. What happens when your co-worker's address is spoofed by a spammer? If you do really, really want them, the SpamAtBay beta has them. This is a commerical offshoot of spambayes that's in beta at the moment. You can get a free beta by emailing Sean True (address and more details in the archives; don't have it at hand, sorry). ---------------------------------------------------------------------- Comment By: Dan Boccabella (swassbac) Date: 2003-06-14 06:49 Message: Logged In: YES user_id=797027 I understand that 0%/100% maybe an exaggeration, but just recalcing to move the number closer to the right end (so I don't still see 0% in the SPAM folder). Is there any white/black list functionality planned? Seems that I still get an occasional message from a co-worker in quarantine - it would be nice to be able to say always (no matter what the content) accept mail from this person. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-12 17:56 Message: Logged In: YES user_id=552329 The "Delete as spam" button does (I believe) rescore the message. The thing is that even if you train a message as ham/spam, that doesn't mean that it will score 0%/100%. It should move closer to the correct end, but the score depends on how messages have been classified that include the same sort of tokens (features) as that message. For example, if I trained a message as spam, and then trained an identical message as ham, the score should be 50% - each token is present in one ham message and one spam message. There's more information (probably clearer!) on the website under documentation (start with the FAQ). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=751204&group_id=61702 From noreply at sourceforge.net Sun Jun 15 19:56:58 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 15 21:57:01 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-755095 ] IMAP Filter: move trained messages Message-ID: Feature Requests item #755095, was opened at 2003-06-16 13:56 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=755095&group_id=61702 Category: None Group: None Status: Open Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: IMAP Filter: move trained messages Initial Comment: >From private email. > For dialup slowcoaches like me, it would be nice if there > were a trained_folder option in .spambayesrc, so that when I > run imapfilter.py -t, trained messages get moved from my > INBOX.untrained to INBOX.read. I can then set up my mail > reader to move read messages from INBOX to INBOX.untrained. > > If I *don't* move messages out of the training folder (by > hand ATM), it takes fifteen minutes or so to train, just > because of all the headers it reads. This seems a reasonable request. It would be better if it could somehow move the messages back to wherever they came from, but I don't see any way of doing that. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=755095&group_id=61702 From noreply at sourceforge.net Sun Jun 15 19:58:17 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 15 21:58:20 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-755098 ] IMAP Filter: progress indicator Message-ID: Feature Requests item #755098, was opened at 2003-06-16 13:58 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=755098&group_id=61702 Category: None Group: None Status: Open Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: IMAP Filter: progress indicator Initial Comment: > Would be helpul if imapfilter.py -tv or -cv (note the 'v's) were to: > > a) Show some indication of progress. Say, print a "." for > every 10 messages whose headers are read, or a "*" if any of > those 10 messages had their bodies read too. > > b). For classification, print a summary of how many spam, ham > and unsure were classified. This could be a separate switch, > or part of -v. Doing anything over dialup is slow, and > checking INBOX, INBOX.Spam and INBOX.Unsure after every > classification is a chore. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=755098&group_id=61702 From T.A.Meyer at massey.ac.nz Mon Jun 16 15:44:26 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sun Jun 15 22:45:28 2003 Subject: [spambayes-dev] Outlook Express and other 'dumb' clients Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130207E087@its-xchg4.massey.ac.nz> > > Before I go ahead and try this out, I don't suppose anyone > > knows of an IMAP server written in Python? > > I believe one of the twisted guys was working on one. I plan to spend > some time on it for my own porpoises (hopefully soon). Actually twisted seems to have one that is "semi-stable". There's a IMAPServer class in the API, at least. (Wonder why my googling didn't find this?) New question: I know nothing about twisted other than I've seen the name in c.l.p at various times. Should I: (a) learn how to use twisted and create the OE/IMAP thing based on that (possibly meaning that users would need to install twisted) (b) wait for Anthony to write his, get him to contribute it, and base it on that (c) write my own (c) has the appeal that I can start now, and I don't need a fully-fledged server anyway (I don't think), but has the reinventing the wheel problem. (a) looks complicated. (b) has the word 'hopefully' before soon . Advice? =Tony Meyer From noreply at sourceforge.net Sun Jun 15 20:56:05 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 15 22:56:10 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-751206 ] Any chance for an exchange version? Message-ID: Feature Requests item #751206, was opened at 2003-06-09 19:39 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=751206&group_id=61702 Category: None Group: None Status: Open Priority: 5 Submitted By: Dan Boccabella (swassbac) Assigned to: Nobody/Anonymous (nobody) Summary: Any chance for an exchange version? Initial Comment: I have tried dozens of anti-spam products and this is the first one that has been nearly perfect in filtering the 250-300 daily spam messages from my inbox. We have been considering deploying another product that works on exchange. Do you have any plans for a server-side version? I would be willing to kick in some money towards the effort. Dan ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-16 14:56 Message: Logged In: YES user_id=552329 See the FAQ question about a server side version. In particular, how do you see this working? Does each user have their own database? Do they train just as individuals do now? If so, then why not just install the current binary? It certainly works with Exchange. If there is a 'master' db, then where do the spam & unsures go? To some quarentine where some admin has to clear them out individually? To the user (if so, then why not just get them to use the binary?). Server-side apps have a lot of complex decisions to be made, many of which will be site-specific. Feel free to be more specific, and maybe something will be created, but I don't think it's all that likely. Personally, it seems to me that the whole strength of statistical filtering like this is the individuality. That's lost if you server-side it, and without any obvious benefits. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=751206&group_id=61702 From mhammond at skippinet.com.au Mon Jun 16 14:51:37 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Sun Jun 15 23:52:24 2003 Subject: [spambayes-dev] Changes to Options.py checked in Message-ID: <001001c333ba$96852e30$530f8490@eden> FYI, I just split the classes from Options.py into a new OptionsClass.py. This means that it is possible to reuse the classes without loading the SpamBayes config file. There should not be any changes to the existing options, or the general semantics of how options are used. I have checked pop3proxy and hammie, and both appear to work fine. A quick "grep" shows that no one relies on where the classes are defined, so I think everything will be fine. Here is the full checkin comment: Split the class into its own file, so it is possible to use other instances *without* loading the default SpamBayes options. (Outlook now uses this file format, but stores its UI options in a different file). Allow lists to be used for multi-valued options (previously only tuples were supported.) Allow the first entry in the "defaults" table to be a sub-class of Option - this allows custom packing/unpacking - currently only used by Outlook to manage folder IDs (which are a list of tuples) Mark. From noreply at sourceforge.net Sun Jun 15 22:04:44 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 16 00:04:49 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-751206 ] Any chance for an exchange version? Message-ID: Feature Requests item #751206, was opened at 2003-06-09 02:39 Message generated for change (Comment added) made by swassbac You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=751206&group_id=61702 Category: None Group: None Status: Open Priority: 5 Submitted By: Dan Boccabella (swassbac) Assigned to: Nobody/Anonymous (nobody) Summary: Any chance for an exchange version? Initial Comment: I have tried dozens of anti-spam products and this is the first one that has been nearly perfect in filtering the 250-300 daily spam messages from my inbox. We have been considering deploying another product that works on exchange. Do you have any plans for a server-side version? I would be willing to kick in some money towards the effort. Dan ---------------------------------------------------------------------- >Comment By: Dan Boccabella (swassbac) Date: 2003-06-15 23:04 Message: Logged In: YES user_id=797027 You would still have an individual database - but the main advantages would be it would start with a pre-trained db (in our office, I have 2 folders setup for this - of course every person will customize their database going forward but we can all agree we don't want viagra, penis enlargement, etc.) setup by the company and it would run without outlook running. The problem with the outlook-based product is you need to leave your office computer on and logged in 24/7. If I didn't then when I check my e-mail from my cell phone, pda, or outlook web access, I'd have 100s of spams in my inbox. With server-side coded, there would also be no requirement ot install anything client side. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-15 21:56 Message: Logged In: YES user_id=552329 See the FAQ question about a server side version. In particular, how do you see this working? Does each user have their own database? Do they train just as individuals do now? If so, then why not just install the current binary? It certainly works with Exchange. If there is a 'master' db, then where do the spam & unsures go? To some quarentine where some admin has to clear them out individually? To the user (if so, then why not just get them to use the binary?). Server-side apps have a lot of complex decisions to be made, many of which will be site-specific. Feel free to be more specific, and maybe something will be created, but I don't think it's all that likely. Personally, it seems to me that the whole strength of statistical filtering like this is the individuality. That's lost if you server-side it, and without any obvious benefits. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=751206&group_id=61702 From ta-meyer at ihug.co.nz Mon Jun 16 18:42:05 2003 From: ta-meyer at ihug.co.nz (Tony Meyer) Date: Mon Jun 16 01:42:43 2003 Subject: [spambayes-dev] RE: [Spambayes-checkins] spambayes/spambayes OptionsClass.py, NONE, 1.1 Options.py, 1.54, 1.55 In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130206344F@its-xchg4.massey.ac.nz> Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13C8CC78@its-xchg4.massey.ac.nz> [939-940 of Options.py] > + # Set verbosity of this options instance to an option value! > + options.verbose = options["globals", "verbose"] What does this do? As far as I understand things, these two should be equivalent without this line, and rudimentary testing seems to indicate that they are. =Tony Meyer From anthony at interlink.com.au Mon Jun 16 16:34:41 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Mon Jun 16 07:39:32 2003 Subject: [spambayes-dev] Outlook Express and other 'dumb' clients In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130207E087@its-xchg4.massey.ac.nz> Message-ID: <200306160534.h5G5Yg45022131@localhost.localdomain> >>> "Meyer, Tony" wrote > Actually twisted seems to have one that is "semi-stable". There's a > IMAPServer class in the API, at least. (Wonder why my googling didn't > find this?) Beats me. Googling for "site:twistedmatrix.com python imap server twisted" doesn't produce much, either. Note that the "semi-stable" refers to the API, not to the code quality. > (a) learn how to use twisted and create the OE/IMAP thing based on that > (possibly meaning that users would need to install twisted) Yes. This is going to be the happiest path. > (b) wait for Anthony to write his, get him to contribute it, and base it > on that Unlikely that the stuff I'm doing will be of much/any use to you. I'm simply bolting an MH backend onto the twisted IMAP server so that I can use a GUI mailer with MH support, and still use the command line tools. > (c) write my own > (c) has the appeal that I can start now, and I don't need a > fully-fledged server anyway (I don't think), but has the reinventing the > wheel problem. We (the python community) really don't need to go down the perl path of everyone re-inventing the same wheel, and each implementing a subset of the full functionality of said wheel. Particularly when the wheel is as complex as IMAP. > (a) looks complicated. (b) has the word 'hopefully' before soon . Learning to use twisted is going to be a whole pile less work than writing a full imap server. Really. Anthony From noreply at sourceforge.net Mon Jun 16 06:02:06 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 16 08:02:14 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-751206 ] Any chance for an exchange version? Message-ID: Feature Requests item #751206, was opened at 2003-06-08 23:39 Message generated for change (Comment added) made by seantrue You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=751206&group_id=61702 Category: None Group: None Status: Open Priority: 5 Submitted By: Dan Boccabella (swassbac) Assigned to: Nobody/Anonymous (nobody) Summary: Any chance for an exchange version? Initial Comment: I have tried dozens of anti-spam products and this is the first one that has been nearly perfect in filtering the 250-300 daily spam messages from my inbox. We have been considering deploying another product that works on exchange. Do you have any plans for a server-side version? I would be willing to kick in some money towards the effort. Dan ---------------------------------------------------------------------- >Comment By: Sean True (seantrue) Date: 2003-06-16 04:02 Message: Logged In: YES user_id=114311 Dan -- please feel free to get in touch with me directly. We're working on a commercial version of the Outlook plugin, and that's my _day_ job. We've been kicking around how to integrate that with a server side implementation, and have some thoughts. We include a pretrained database in the Outlook version, and it can be imported and exported (and hence, shared) You can reach me at seant@iname.com. -- Sean ---------------------------------------------------------------------- Comment By: Dan Boccabella (swassbac) Date: 2003-06-15 20:04 Message: Logged In: YES user_id=797027 You would still have an individual database - but the main advantages would be it would start with a pre-trained db (in our office, I have 2 folders setup for this - of course every person will customize their database going forward but we can all agree we don't want viagra, penis enlargement, etc.) setup by the company and it would run without outlook running. The problem with the outlook-based product is you need to leave your office computer on and logged in 24/7. If I didn't then when I check my e-mail from my cell phone, pda, or outlook web access, I'd have 100s of spams in my inbox. With server-side coded, there would also be no requirement ot install anything client side. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-15 18:56 Message: Logged In: YES user_id=552329 See the FAQ question about a server side version. In particular, how do you see this working? Does each user have their own database? Do they train just as individuals do now? If so, then why not just install the current binary? It certainly works with Exchange. If there is a 'master' db, then where do the spam & unsures go? To some quarentine where some admin has to clear them out individually? To the user (if so, then why not just get them to use the binary?). Server-side apps have a lot of complex decisions to be made, many of which will be site-specific. Feel free to be more specific, and maybe something will be created, but I don't think it's all that likely. Personally, it seems to me that the whole strength of statistical filtering like this is the individuality. That's lost if you server-side it, and without any obvious benefits. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=751206&group_id=61702 From noreply at sourceforge.net Mon Jun 16 18:33:37 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 16 20:33:51 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743384 ] Show Spam Clues Marks Message as Read Message-ID: Bugs item #743384, was opened at 2003-05-26 10:07 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743384&group_id=61702 Category: Outlook Group: None Status: Open >Resolution: Postponed Priority: 5 Submitted By: Paul C Coyle (pccoyle) Assigned to: Mark Hammond (mhammond) Summary: Show Spam Clues Marks Message as Read Initial Comment: After using "Show Spam Clues" on an unread message in the Spam folder, the message status changed form Read to Unread. My concern here is that an perhaps an acknowledge has been sent to the spammer showing the account as active? ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 10:33 Message: Logged In: YES user_id=14198 I can't fix this, as I don't know how. Adding the original as an attachment causes Outlook to mark the original as "read" - but this read status is not reflected in the message itself. I'm out of ideas for this. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-26 10:13 Message: Logged In: YES user_id=14198 I can reproduce this, but I am not at all sure I will be able to do anything about it. If you are concerned about auto-responses, then I suggest you customize outlook accordingly. Note however that spammers can check account details in other ways - especially if the body of the HTML message references anything that is downloaded from a server (such as an image). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743384&group_id=61702 From noreply at sourceforge.net Mon Jun 16 18:34:29 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 16 20:34:35 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743122 ] Outlook 2002 auto-complete breaks Message-ID: Bugs item #743122, was opened at 2003-05-25 22:06 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743122&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2002 auto-complete breaks Initial Comment: Just installed SpamBayes and I think it breaks the auto-complete popup list for email addresses used in the To field in Outlook. The popup still appears but only with the first letter of each email address - the rest of the email address doesn't appear. I've uninstalled and reinstalled SpamBayes several times to confirm this problem. I'm running Outlook 2002 (SP2) on Windows 2000 (SP3). ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 10:34 Message: Logged In: YES user_id=14198 Apparently this problem only happens *after* a SpamBayes dialog has been shown, but works fine until then. Can someone else confirm this? ---------------------------------------------------------------------- Comment By: Martin Worger (worger) Date: 2003-06-02 19:55 Message: Logged In: YES user_id=751487 I am running the same - W2K SP3 & OL2K and get the same problem. ---------------------------------------------------------------------- Comment By: Jonathan Evatt (spidawebs) Date: 2003-06-02 06:54 Message: Logged In: YES user_id=48654 I am also getting this problem. Have installed the app on Outlook XP (2002) SP2 over Win XP SP1. If I start up Outlook the problem only seems to occur AFTER I actually run the spambayes configuration routine. If I do not access that then the problem does not (so far - limited testing) seem to occur. Jonathan ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743122&group_id=61702 From T.A.Meyer at massey.ac.nz Tue Jun 17 13:50:54 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Jun 16 20:51:34 2003 Subject: [spambayes-dev] Outlook Express and other 'dumb' clients Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130207E34A@its-xchg4.massey.ac.nz> > Beats me. Googling for "site:twistedmatrix.com python imap > server twisted" doesn't produce much, either. And here I am, relying on google to know everything. . > Note that the > "semi-stable" refers to the API, not to the code quality. Ah, I didn't realise that; thanks. > > (a) learn how to use twisted and create the OE/IMAP thing based on > > that (possibly meaning that users would need to install twisted) > Yes. This is going to be the happiest path. [...] > Learning to use twisted is going to be a whole pile less work > than writing a full imap server. Really. Alright then, you've convinced me. I suppose that knowing twisted may have other benefits someday anyway. I'll get to this at some point, once I catch up with real (i.e. paid) work. =Tony Meyer From noreply at sourceforge.net Mon Jun 16 19:53:51 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 16 21:53:59 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743122 ] Outlook 2002 auto-complete breaks Message-ID: Bugs item #743122, was opened at 2003-05-25 04:06 Message generated for change (Comment added) made by seantrue You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743122&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Outlook 2002 auto-complete breaks Initial Comment: Just installed SpamBayes and I think it breaks the auto-complete popup list for email addresses used in the To field in Outlook. The popup still appears but only with the first letter of each email address - the rest of the email address doesn't appear. I've uninstalled and reinstalled SpamBayes several times to confirm this problem. I'm running Outlook 2002 (SP2) on Windows 2000 (SP3). ---------------------------------------------------------------------- >Comment By: Sean True (seantrue) Date: 2003-06-16 17:53 Message: Logged In: YES user_id=114311 Fixing this bug will require a complete rewrite to remove all MFC based dialogs from SpamBayes. We're about 80% of the way there on SpamAtBay, and it's _not_ fun. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-16 16:34 Message: Logged In: YES user_id=14198 Apparently this problem only happens *after* a SpamBayes dialog has been shown, but works fine until then. Can someone else confirm this? ---------------------------------------------------------------------- Comment By: Martin Worger (worger) Date: 2003-06-02 01:55 Message: Logged In: YES user_id=751487 I am running the same - W2K SP3 & OL2K and get the same problem. ---------------------------------------------------------------------- Comment By: Jonathan Evatt (spidawebs) Date: 2003-06-01 12:54 Message: Logged In: YES user_id=48654 I am also getting this problem. Have installed the app on Outlook XP (2002) SP2 over Win XP SP1. If I start up Outlook the problem only seems to occur AFTER I actually run the spambayes configuration routine. If I do not access that then the problem does not (so far - limited testing) seem to occur. Jonathan ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743122&group_id=61702 From noreply at sourceforge.net Mon Jun 16 19:54:46 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 16 21:54:51 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751693 ] Get multiple dialog boxes; close one, 2nd copy appears Message-ID: Bugs item #751693, was opened at 2003-06-09 18:46 Message generated for change (Comment added) made by seantrue You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751693&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Bennett (ttennebkram) Assigned to: Mark Hammond (mhammond) Summary: Get multiple dialog boxes; close one, 2nd copy appears Initial Comment: When I click the Anti-Spam button in Outlook, I do get the dialog box. When I click Close, the dialog pops up again, and I have to hit Close a second time. One time I actually got 3 dialog boxes in a row. I installed on a friend's machine and saw that same thing. I think it's harmless (?), but odd. I didn't see it logged in the database yet (looked for "multiple"). I've seen this on two different machines. Machine 1 Setup: Outlook 2000 Windows 2000 SpamBayes binary install, 0.2a? Machine 2 Setup: Outlook XP Windows XP SpamBayes binar install BTW, VERY COOL SOFTWARE!!!! ---------------------------------------------------------------------- >Comment By: Sean True (seantrue) Date: 2003-06-16 17:54 Message: Logged In: YES user_id=114311 This appears to happen whenever multiple explorers are open. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751693&group_id=61702 From noreply at sourceforge.net Mon Jun 16 20:16:13 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 16 22:16:17 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751693 ] Get multiple dialog boxes; close one, 2nd copy appears Message-ID: Bugs item #751693, was opened at 2003-06-09 19:46 Message generated for change (Comment added) made by ttennebkram You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751693&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Bennett (ttennebkram) Assigned to: Mark Hammond (mhammond) Summary: Get multiple dialog boxes; close one, 2nd copy appears Initial Comment: When I click the Anti-Spam button in Outlook, I do get the dialog box. When I click Close, the dialog pops up again, and I have to hit Close a second time. One time I actually got 3 dialog boxes in a row. I installed on a friend's machine and saw that same thing. I think it's harmless (?), but odd. I didn't see it logged in the database yet (looked for "multiple"). I've seen this on two different machines. Machine 1 Setup: Outlook 2000 Windows 2000 SpamBayes binary install, 0.2a? Machine 2 Setup: Outlook XP Windows XP SpamBayes binar install BTW, VERY COOL SOFTWARE!!!! ---------------------------------------------------------------------- >Comment By: Mark Bennett (ttennebkram) Date: 2003-06-16 19:16 Message: Logged In: YES user_id=797668 Thanks for your report. This bug has previously been reported, and has already been fixed in the source-code - so you can expect to see the fix in the next binary release. You may wish to subscribe to the spambayes-announce list to see when each version is released - details at http://mail.python.org/mailman-21/listinfo/spambayes-announce If you are technically minded, you may like to run from the CVS version of the source-code, and you should find the fix immediately - see the main SpamBayes pages for details. ---------------------------------------------------------------------- Comment By: Mark Bennett (ttennebkram) Date: 2003-06-16 19:16 Message: Logged In: YES user_id=797668 Thanks seantrue. Yes, if I close all of my Outlook windows except my main inbox, I only get one dialog box. I usually have my contacts open in a second window. ---------------------------------------------------------------------- Comment By: Sean True (seantrue) Date: 2003-06-16 18:54 Message: Logged In: YES user_id=114311 This appears to happen whenever multiple explorers are open. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751693&group_id=61702 From noreply at sourceforge.net Mon Jun 16 20:16:25 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 16 22:16:30 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743113 ] Multiple profile issues Message-ID: Bugs item #743113, was opened at 2003-05-25 21:47 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743113&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Multiple profile issues Initial Comment: The only problem I have is that I use multiple outlook profiles. At work, I use Exchange and at home I use a personal folder for my mail delivery, which contains its own inbox, junk and maybe folders. When I initiate Outlook with my work folder, it really screws up the buttons. The manager button doesn't work, and I get a duplicate pair of buttons appearing at the start of the standard toolbar. Fix is to disable the COM object, exit, enter, re-enable COM object, (exit and re-enter?). The only thing I can do right now is to disable the COM object before moving profiles. What would be great would be either to have a button that toggles the functionality on/off, or (ideally) get it to recognise the different profiles. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 12:16 Message: Logged In: YES user_id=14198 I think the new configuration options have largely fixed this. Each profile uses the same database, but different configuration. A different database may be desirable too, but one step at a time - at least now SpamBayes *works* with multiple profiles. I even tested it :) ---------------------------------------------------------------------- Comment By: Brian Williams (bwims) Date: 2003-05-28 02:50 Message: Logged In: YES user_id=626762 I had a look at addin.py and it seems like it ought to be easy enough to specify a separate toolbar for the buttons. This might be the way forward since the layout of the standard toolbar varies considerable depending on which folder you are looking at, whereas special toolbars do not, and it seems to be that part of the problem occurs when you start off in the top level folder. Best regards, Brian Williams ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743113&group_id=61702 From noreply at sourceforge.net Mon Jun 16 20:18:19 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 16 22:18:25 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751699 ] Keep getting "Error processing missed messages!" Message-ID: Bugs item #751699, was opened at 2003-06-09 20:11 Message generated for change (Comment added) made by ttennebkram You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751699&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Bennett (ttennebkram) Assigned to: Mark Hammond (mhammond) Summary: Keep getting "Error processing missed messages!" Initial Comment: When I restart Outlook it doesn't pick up messages that it has missed. Though similar to another bug I reported (about errors during filtering), the errors in the log file look completely different, so perhaps this is a different issue, or might provide more clues if it's the same issue. Outlook 2000 on Windows 2000 SpamBayes binary release 02a Error processing missed messages! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\addin.py", line 671, in OnConnection File "E:\src\spambayes\Outlook2000\addin.py", line 945, in ProcessMissedMessag es File "E:\src\spambayes\Outlook2000\addin.py", line 163, in ProcessMessage File "E:\src\spambayes\Outlook2000\filter.py", line 15, in filter_message File "E:\src\spambayes\Outlook2000\manager.py", line 440, in score File "e:\src\spambayes\spambayes\classifier.py", line 217, in chi2_spamprob File "e:\src\spambayes\spambayes\classifier.py", line 465, in _getclues File "e:\src\spambayes\spambayes\classifier.py", line 319, in probability AssertionError pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 203, in OnItemAdd ... etc... Attaching full log file. ---------------------------------------------------------------------- >Comment By: Mark Bennett (ttennebkram) Date: 2003-06-16 19:18 Message: Logged In: YES user_id=797668 Thanks Tim! Rebuilding the training DB did fix the problem. And this time I had about 5,000 of spam and ham in the training set. ---------------------------------------------------------------------- Comment By: Tim Peters (tim_one) Date: 2003-06-09 20:17 Message: Logged In: YES user_id=31435 It looks like your database is corrupt, and needs to be retrained from scratch. If you do that, I suggest training it on about equal numbers of ham and spam (you've got a 10-to- 1 imbalance now, which gives the classifier a warped view of the world -- it tries to make up for that, but may not be able to -- the math and the code weren't developed or tested or tuned with such a lopsided mix). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751699&group_id=61702 From noreply at sourceforge.net Mon Jun 16 21:22:32 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 16 23:22:45 2003 Subject: [spambayes-dev] [ spambayes-Bugs-749277 ] Should prevent "Deleted Items" being target folder Message-ID: Bugs item #749277, was opened at 2003-06-05 11:49 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749277&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Should prevent "Deleted Items" being target folder Initial Comment: As a user pointed out: I set up the filter to automatically move spam to ?deleted items?. Then I thought about it, and when I re-read the documentation, it seems SpamBayes will automatically assume anything in my deleted items is spam and add it to the database. If this is the case, then is it fair to say that after I?ve read non-spam (ham) email, and delete it as normal, Spambayes would flag the ham as spam? He is correct, and this would be evil. We either must display a severe warning, or simply prevent it. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 13:22 Message: Logged In: YES user_id=14198 Checking in FolderSelector.py; /cvsroot/spambayes/spambayes/Outlook2000/dialogs/FolderSelector.p new revision: 1.18; previous revision: 1.17 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749277&group_id=61702 From noreply at sourceforge.net Mon Jun 16 21:23:17 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 16 23:23:20 2003 Subject: [spambayes-dev] [ spambayes-Bugs-753215 ] binascii.Error decoding message Message-ID: Bugs item #753215, was opened at 2003-06-12 21:35 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=753215&group_id=61702 Category: hammie Group: None Status: Open Resolution: None Priority: 5 Submitted By: Raphael Manfredi (rmanfredi) Assigned to: Nobody/Anonymous (nobody) >Summary: binascii.Error decoding message Initial Comment: I've got the following stack trace, probably due to a badly encoded MIME extension. Traceback (most recent call last): File "/home/ram/bin/spambayes/hammiefilter.py", line 178, in ? main() File "/home/ram/bin/spambayes/hammiefilter.py", line 174, in main action(msg) File "/home/ram/bin/spambayes/hammiefilter.py", line 112, in filter return h.filter(msg) File "/home/ram/bin/spambayes/hammie.py", line 108, in filter spambayes.hammiebulk.main() File "/home/ram/bin/spambayes/hammie.py", line 38, in _scoremsg spambayes.hammiebulk.main() File "./spambayes/classifier.py", line 217, in chi2_spamprob File "./spambayes/classifier.py", line 436, in _getclues File "./spambayes/compatsets.py", line 374, in __init__ File "./spambayes/compatsets.py", line 333, in _update File "./spambayes/tokenizer.py", line 1052, in tokenize File "./spambayes/tokenizer.py", line 1106, in tokenize_headers File "/usr/lib/python2.2/email/Header.py", line 92, in decode_header dec = email.base64MIME.decode(encoded) File "/usr/lib/python2.2/email/base64MIME.py", line 179, in decode dec = a2b_base64(s) binascii.Error: Incorrect padding ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=753215&group_id=61702 From noreply at sourceforge.net Mon Jun 16 22:03:47 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 00:03:52 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751699 ] Keep getting "Error processing missed messages!" Message-ID: Bugs item #751699, was opened at 2003-06-10 13:11 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751699&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Bennett (ttennebkram) Assigned to: Mark Hammond (mhammond) Summary: Keep getting "Error processing missed messages!" Initial Comment: When I restart Outlook it doesn't pick up messages that it has missed. Though similar to another bug I reported (about errors during filtering), the errors in the log file look completely different, so perhaps this is a different issue, or might provide more clues if it's the same issue. Outlook 2000 on Windows 2000 SpamBayes binary release 02a Error processing missed messages! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\addin.py", line 671, in OnConnection File "E:\src\spambayes\Outlook2000\addin.py", line 945, in ProcessMissedMessag es File "E:\src\spambayes\Outlook2000\addin.py", line 163, in ProcessMessage File "E:\src\spambayes\Outlook2000\filter.py", line 15, in filter_message File "E:\src\spambayes\Outlook2000\manager.py", line 440, in score File "e:\src\spambayes\spambayes\classifier.py", line 217, in chi2_spamprob File "e:\src\spambayes\spambayes\classifier.py", line 465, in _getclues File "e:\src\spambayes\spambayes\classifier.py", line 319, in probability AssertionError pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 203, in OnItemAdd ... etc... Attaching full log file. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:03 Message: Logged In: YES user_id=14198 Dupe of 706520 assert fails in classifier. But I am checking in a change to catch this exception and report an error. ---------------------------------------------------------------------- Comment By: Mark Bennett (ttennebkram) Date: 2003-06-17 12:18 Message: Logged In: YES user_id=797668 Thanks Tim! Rebuilding the training DB did fix the problem. And this time I had about 5,000 of spam and ham in the training set. ---------------------------------------------------------------------- Comment By: Tim Peters (tim_one) Date: 2003-06-10 13:17 Message: Logged In: YES user_id=31435 It looks like your database is corrupt, and needs to be retrained from scratch. If you do that, I suggest training it on about equal numbers of ham and spam (you've got a 10-to- 1 imbalance now, which gives the classifier a warped view of the world -- it tries to make up for that, but may not be able to -- the math and the code weren't developed or tested or tuned with such a lopsided mix). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751699&group_id=61702 From noreply at sourceforge.net Mon Jun 16 22:08:14 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 00:08:18 2003 Subject: [spambayes-dev] [ spambayes-Bugs-754729 ] Install fails with removable hard drive Message-ID: Bugs item #754729, was opened at 2003-06-15 12:56 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=754729&group_id=61702 Category: Outlook Group: v1.0 (example) >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: David Martin (hurricane51) Assigned to: Mark Hammond (mhammond) Summary: Install fails with removable hard drive Initial Comment: Windows XP Pro SP2 Outlook 2002 (XP) with all Windows Update patches When trying to install with a "removable Hard Disk" device (in this case a CompactFlach reader with no media installed), I received the following error: Inbox - Microsoft Outlook: OUTLOOK.EXE - NO DISK There is no disk in the drive. Please insert a disk into drive \Device\Harddisk2\DR4. Installation log was clean, however, the plugin did not appear in Outlook. This drive is only used for downloading photros from my digicam. Otherwise, it is never used, it is not the TEMP drive, etc. See attached GIF image. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:08 Message: Logged In: YES user_id=14198 Dupe of 740843 No Disk Error with Outlook 2000 on startup ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=754729&group_id=61702 From noreply at sourceforge.net Mon Jun 16 22:08:37 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 00:08:41 2003 Subject: [spambayes-dev] [ spambayes-Bugs-752568 ] Lost DB on Outlook exit Message-ID: Bugs item #752568, was opened at 2003-06-11 22:56 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=752568&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: matt tagliaferri (mtagliaf) Assigned to: Mark Hammond (mhammond) Summary: Lost DB on Outlook exit Initial Comment: I think I've narrowed this down - it appears that the database is lost when I shut down Windows and it closes Outlook as part of the shutdown process. If I close Outlook myself, the database is retained. matt tag PS: kick-butt program. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:08 Message: Logged In: YES user_id=14198 Thanks for your report. This bug has previously been reported, and has already been fixed in the source-code - so you can expect to see the fix in the next binary release. You may wish to subscribe to the spambayes-announce list to see when each version is released - details at http://mail.python.org/mailman-21/listinfo/spambayes-announce If you are technically minded, you may like to run from the CVS version of the source-code, and you should find the fix immediately - see the main SpamBayes pages for details. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=752568&group_id=61702 From noreply at sourceforge.net Mon Jun 16 22:09:00 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 00:09:02 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751699 ] Keep getting "Error processing missed messages!" Message-ID: Bugs item #751699, was opened at 2003-06-10 13:11 Message generated for change (Settings changed) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751699&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Mark Bennett (ttennebkram) Assigned to: Mark Hammond (mhammond) Summary: Keep getting "Error processing missed messages!" Initial Comment: When I restart Outlook it doesn't pick up messages that it has missed. Though similar to another bug I reported (about errors during filtering), the errors in the log file look completely different, so perhaps this is a different issue, or might provide more clues if it's the same issue. Outlook 2000 on Windows 2000 SpamBayes binary release 02a Error processing missed messages! Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\addin.py", line 671, in OnConnection File "E:\src\spambayes\Outlook2000\addin.py", line 945, in ProcessMissedMessag es File "E:\src\spambayes\Outlook2000\addin.py", line 163, in ProcessMessage File "E:\src\spambayes\Outlook2000\filter.py", line 15, in filter_message File "E:\src\spambayes\Outlook2000\manager.py", line 440, in score File "e:\src\spambayes\spambayes\classifier.py", line 217, in chi2_spamprob File "e:\src\spambayes\spambayes\classifier.py", line 465, in _getclues File "e:\src\spambayes\spambayes\classifier.py", line 319, in probability AssertionError pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 203, in OnItemAdd ... etc... Attaching full log file. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:03 Message: Logged In: YES user_id=14198 Dupe of 706520 assert fails in classifier. But I am checking in a change to catch this exception and report an error. ---------------------------------------------------------------------- Comment By: Mark Bennett (ttennebkram) Date: 2003-06-17 12:18 Message: Logged In: YES user_id=797668 Thanks Tim! Rebuilding the training DB did fix the problem. And this time I had about 5,000 of spam and ham in the training set. ---------------------------------------------------------------------- Comment By: Tim Peters (tim_one) Date: 2003-06-10 13:17 Message: Logged In: YES user_id=31435 It looks like your database is corrupt, and needs to be retrained from scratch. If you do that, I suggest training it on about equal numbers of ham and spam (you've got a 10-to- 1 imbalance now, which gives the classifier a warped view of the world -- it tries to make up for that, but may not be able to -- the math and the code weren't developed or tested or tuned with such a lopsided mix). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751699&group_id=61702 From noreply at sourceforge.net Mon Jun 16 22:10:06 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 00:10:10 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751283 ] addin installs but does not work (stays as unchecked in COM) Message-ID: Bugs item #751283, was opened at 2003-06-09 21:35 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751283&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Radek Stachowiak (radek) Assigned to: Mark Hammond (mhammond) Summary: addin installs but does not work (stays as unchecked in COM) Initial Comment: here is my spamlog1: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy. py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy. py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy. py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:10 Message: Logged In: YES user_id=14198 Please see the latest Troubleshooting.html for info on how to reenable. http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/spambayes/spambayes/Outlook2000/docs/troubleshooting.html?rev=HEAD ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751283&group_id=61702 From noreply at sourceforge.net Mon Jun 16 22:16:54 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 00:16:58 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743515 ] Unable to expand folders in folder selection dialog. Message-ID: Bugs item #743515, was opened at 2003-05-26 17:58 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743515&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Paul Pedriana (ppedriana) Assigned to: Mark Hammond (mhammond) Summary: Unable to expand folders in folder selection dialog. Initial Comment: On my computer I do the following steps: 1 - Install the software with Outlook 2002. This succeeds 2 - Call forth the Anti-Spam manager dialog. 3 - Click the "Train Now..." button. This calls forth the "Training" dialog. 4 - Click either of the two "Browse" buttons. This calls forth the "Select Folders..." dialog, which has a tree control with two top level nodes: "Public Folders" and "Personal Folders." 5 - Clicking on the "Public Folders" node expands it, as expected. 6 - Clicking on the "Personal Folders" node does nothing; it does not expand as expected. --------------------------------------- Looking at the log file, the following lines are present: Traceback (most recent call last): "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 384, in OnTreeItemExpanding "E:\src\spambayes\Outlook2000\dialogs\FolderSelector.py", line 74, in _BuildFoldersMAPI pywintypes.com_error: (-2147221246, 'Invalid window handle', None, None) win32ui: Exception in OnNotify() handler The exception appears to happen in a call to FolderSpec() from the _BuildFoldersMAPI function. Here is a snippet of _BuildFoldersMAPI containing the call to FolderSpec: try: child_folder = manager.message_store.GetFolder(temp_id) except pythoncom.com_error: # Bad folder for some reason - ignore it. child_folder = None if child_folder is not None: spec = FolderSpec(child_folder.GetID(), name) # If we have no children at all, indicate # the item is not expandable. table = child_folder.OpenEntry().GetHierarchyTable(0) ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:16 Message: Logged In: YES user_id=14198 Thanks for your report. This bug has previously been reported, and has already been fixed in the source-code - so you can expect to see the fix in the next binary release. You may wish to subscribe to the spambayes-announce list to see when each version is released - details at http://mail.python.org/mailman-21/listinfo/spambayes-announce If you are technically minded, you may like to run from the CVS version of the source-code, and you should find the fix immediately - see the main SpamBayes pages for details. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743515&group_id=61702 From noreply at sourceforge.net Mon Jun 16 22:23:48 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 00:27:45 2003 Subject: [spambayes-dev] [ spambayes-Bugs-731538 ] exceptions.AssertionError: What kind of ID is 'None Message-ID: Bugs item #731538, was opened at 2003-05-03 05:05 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Brian Takita (btakita) Assigned to: Mark Hammond (mhammond) Summary: exceptions.AssertionError: What kind of ID is 'None Initial Comment: The Delete as Spam button does not work either. It gives email spam a score if I request it. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:23 Message: Logged In: YES user_id=14198 I can recreate this when there is no spam folder defined. Fixed. Checking in addin.py; new revision: 1.62; previous revision: 1.61 ---------------------------------------------------------------------- Comment By: Brian Takita (btakita) Date: 2003-05-04 08:21 Message: Logged In: YES user_id=620484 Windows 2000 Outlook XP Python 2.2 SpamBayes 1.0a2 ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-03 21:41 Message: Logged In: YES user_id=14198 Sorry - my mistake - you already did that - thanks! ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-03 21:40 Message: Logged In: YES user_id=14198 Can you please see the about.html file that comes with the package (click "About" on the main Window), see how to find the log for your sesssion, and attach it to this bug? Thanks ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-05-03 05:50 Message: Logged In: YES user_id=645698 Can you give us a bit more info here? What platform are you running on, what version of python, what version of Outlook, what version of spambayes (alpha1, alpha2, current cvs)? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=731538&group_id=61702 From noreply at sourceforge.net Mon Jun 16 22:33:34 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 00:33:38 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751686 ] Stopped filtering easy messages, log file errors Message-ID: Bugs item #751686, was opened at 2003-06-10 12:31 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751686&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Mark Bennett (ttennebkram) Assigned to: Mark Hammond (mhammond) Summary: Stopped filtering easy messages, log file errors Initial Comment: I've had SpamBayes up and running for about a week. After doing a lot of new filtering today, it will no longer filter my inbox. I see from the log file that it is trying to. Setup: Outlook 2000 Windows 2000 SpamBayes binary install, 0.2a? Sample: AssertionError Error filtering message '' Traceback (most recent call last): File "E:\src\spambayes\Outlook2000\filter.py", line 81, in filter_folder File "E:\src\spambayes\Outlook2000\filter.py", line 15, in filter_message File "E:\src\spambayes\Outlook2000\manager.py", line 440, in score File "e:\src\spambayes\spambayes\classifier.py", line 217, in chi2_spamprob File "e:\src\spambayes\spambayes\classifier.py", line 465, in _getclues File "e:\src\spambayes\spambayes\classifier.py", line 319, in probability AssertionError ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:33 Message: Logged In: YES user_id=14198 Please do a full re-train on your database - that should solve it. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751686&group_id=61702 From noreply at sourceforge.net Mon Jun 16 22:34:41 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 00:34:45 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751127 ] WinXP MemoryError at installation Message-ID: Bugs item #751127, was opened at 2003-06-09 12:27 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751127&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Michael Williams (lamwilli) Assigned to: Mark Hammond (mhammond) >Summary: WinXP MemoryError at installation Initial Comment: I attempted to install the Spambayes Outlook AddIn from: SpamBayes-Outlook-Setup-002.exe to my WinXP machine. Relevent configuration info follows: OS Name Microsoft Windows XP Professional Version 5.1.2600 Service Pack 1 Build 2600 Application name Outlook Version 9.0 Build 3821 AKA MS Outlook 2000 SR-1 (9.0.0.3821) The entire contents of the log file was: ---------------------------------------------------- Traceback (most recent call last): File "", line 15, in DllRegisterServer File "E:\src\pythonex\com\win32com\server\register.py", line 405, in RegisterClasses MemoryError ---------------------------------------------------- The error message at installation time was: ---------------------------------------------------- The instruction at "0x77f51ed3" referenced memory at "0xfffffff8". The memory could not be "read" ---------------------------------------------------- See attached... ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:34 Message: Logged In: YES user_id=14198 Do you get the same error after a reboot? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751127&group_id=61702 From noreply at sourceforge.net Mon Jun 16 22:35:53 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 00:35:57 2003 Subject: [spambayes-dev] [ spambayes-Bugs-749814 ] Add-In Load Failure Message-ID: Bugs item #749814, was opened at 2003-06-06 07:30 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749814&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Robert Garber (rgarber) Assigned to: Mark Hammond (mhammond) Summary: Add-In Load Failure Initial Comment: The Add-In appears in list of Outlook add-ins but with no check mark. When I check the box and click OK it does not load or save the check mark. Environment Win 98 Outlook 2000 Executable install Log file is attached with an error. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:35 Message: Logged In: YES user_id=14198 Dupe of 725449 Binary plugin fails on Win9x: encoding exception ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749814&group_id=61702 From noreply at sourceforge.net Mon Jun 16 22:36:29 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 00:36:36 2003 Subject: [spambayes-dev] [ spambayes-Bugs-749309 ] Spambayes button does not load Message-ID: Bugs item #749309, was opened at 2003-06-05 13:57 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749309&group_id=61702 Category: Outlook Group: v1.0 (example) >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Harvey Schoenman (hschoenman) Assigned to: Mark Hammond (mhammond) Summary: Spambayes button does not load Initial Comment: Spambayes (binary version) installed with no errors but after starting Outlook 2000 with SR-1 there was no Spambayes button on the toolbar. I checked for a log file (one is attached). I also looked in Tools, Options, Other, Advanced, Com Add-ins. Spambayes.outlookaddin was present but unchecked. I was able to check it but the checkmark did not "stick." Whenever I went back in, it remained unchecked. The log file contains the following error: pythoncom error: Failed to call the universal dispatcher -Harvey Schoenman ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:36 Message: Logged In: YES user_id=14198 Dupe of 725449 Binary plugin fails on Win9x: encoding exception ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=749309&group_id=61702 From noreply at sourceforge.net Tue Jun 17 00:57:25 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 02:57:36 2003 Subject: [spambayes-dev] [ spambayes-Bugs-755738 ] Latest CVS outllok doesn't work Message-ID: Bugs item #755738, was opened at 2003-06-17 08:57 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=755738&group_id=61702 Category: Outlook Group: v1.0 (example) Status: Open Resolution: None Priority: 5 Submitted By: Fredrik Rodland (fmmr) Assigned to: Mark Hammond (mhammond) Summary: Latest CVS outllok doesn't work Initial Comment: I have been using spambayes for some months. The latest cvs-version fails with the following traceback: pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 275, in _Invoke_ return self._invoke_(dispid, lcid, wFlags, args) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 280, in _invoke_ return S_OK, -1, self._invokeex_(dispid, lcid, wFlags, args, None, None) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 541, in _invokeex_ return apply(func, args) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\O utlook2000\addin.py", line 609, in OnSelectionChange self.SetupUI() File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\O utlook2000\addin.py", line 500, in SetupUI Tag = "SpamBayesCommand.Manager") File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\O utlook2000\addin.py", line 564, in _AddControl item = parent.Controls.Add(Type=control_type, Temporary=False) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\client\__init__.py", line 451, in __getattr__ return apply(self._ApplyTypes_, args) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\client\__init__.py", line 445, in _ApplyTypes_ return self._get_good_object_(apply (self._oleobj_.InvokeTypes, (dispid, 0, wFlags, retType, argTypes) + args), user, resultCLSID) pywintypes.com_error: (-2147352567, 'Det oppstod et unntak.', (0, None, None, None, 0, -2147467259), None) Message 'Kunder med utest?ende hos Stocknet.' had a Spam classification of 'No' The traceback is repeated each time a message is selected. I tried the the cvs-version ca a week ago as well with the same result. I unregistered the plugin before installing it, and also tried this multiple times. The new toolbar doen not seem to work. When clicking the "Anti-Spam"-button nothing happens. When selecting the spam-folder, the "Delete as spam"-button does not change to "recover-from-spam", the latter does not show up at all. OS: win XP home outllook 2000 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=755738&group_id=61702 From noreply at sourceforge.net Tue Jun 17 00:57:49 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 02:57:56 2003 Subject: [spambayes-dev] [ spambayes-Bugs-755738 ] Latest CVS outllok doesn't work Message-ID: Bugs item #755738, was opened at 2003-06-17 08:57 Message generated for change (Settings changed) made by fmmr You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=755738&group_id=61702 Category: Outlook >Group: None Status: Open Resolution: None Priority: 5 Submitted By: Fredrik Rodland (fmmr) Assigned to: Mark Hammond (mhammond) Summary: Latest CVS outllok doesn't work Initial Comment: I have been using spambayes for some months. The latest cvs-version fails with the following traceback: pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 275, in _Invoke_ return self._invoke_(dispid, lcid, wFlags, args) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 280, in _invoke_ return S_OK, -1, self._invokeex_(dispid, lcid, wFlags, args, None, None) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 601, in _invokeex_ return DesignatedWrapPolicy._invokeex_( self, dispid, lcid, wFlags, args, kwArgs, serviceProvider) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\server\policy.py", line 541, in _invokeex_ return apply(func, args) File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\O utlook2000\addin.py", line 609, in OnSelectionChange self.SetupUI() File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\O utlook2000\addin.py", line 500, in SetupUI Tag = "SpamBayesCommand.Manager") File "c:\Programfiler\_UTIL\spambayes_cvs\spambayes\O utlook2000\addin.py", line 564, in _AddControl item = parent.Controls.Add(Type=control_type, Temporary=False) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\client\__init__.py", line 451, in __getattr__ return apply(self._ApplyTypes_, args) File "C:\PROGRA~1\_DEV\Python22\lib\site- packages\win32com\client\__init__.py", line 445, in _ApplyTypes_ return self._get_good_object_(apply (self._oleobj_.InvokeTypes, (dispid, 0, wFlags, retType, argTypes) + args), user, resultCLSID) pywintypes.com_error: (-2147352567, 'Det oppstod et unntak.', (0, None, None, None, 0, -2147467259), None) Message 'Kunder med utest?ende hos Stocknet.' had a Spam classification of 'No' The traceback is repeated each time a message is selected. I tried the the cvs-version ca a week ago as well with the same result. I unregistered the plugin before installing it, and also tried this multiple times. The new toolbar doen not seem to work. When clicking the "Anti-Spam"-button nothing happens. When selecting the spam-folder, the "Delete as spam"-button does not change to "recover-from-spam", the latter does not show up at all. OS: win XP home outllook 2000 ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=755738&group_id=61702 From noreply at sourceforge.net Tue Jun 17 07:29:32 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 09:30:21 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743113 ] Multiple profile issues Message-ID: Bugs item #743113, was opened at 2003-05-25 11:47 Message generated for change (Comment added) made by bwims You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743113&group_id=61702 Category: Outlook Group: None Status: Closed Resolution: Fixed Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Multiple profile issues Initial Comment: The only problem I have is that I use multiple outlook profiles. At work, I use Exchange and at home I use a personal folder for my mail delivery, which contains its own inbox, junk and maybe folders. When I initiate Outlook with my work folder, it really screws up the buttons. The manager button doesn't work, and I get a duplicate pair of buttons appearing at the start of the standard toolbar. Fix is to disable the COM object, exit, enter, re-enable COM object, (exit and re-enter?). The only thing I can do right now is to disable the COM object before moving profiles. What would be great would be either to have a button that toggles the functionality on/off, or (ideally) get it to recognise the different profiles. ---------------------------------------------------------------------- Comment By: Brian Williams (bwims) Date: 2003-06-17 13:29 Message: Logged In: YES user_id=626762 Mark, what are the new configuration options? Are they in the Outlook installer, or the python distribution? I have the latest Outlook installer and when I move from a profile that uses a pst for delivery to one with an Exchange mailbox, it all goes haywire with buttons appearing twice and not working when pressed. What did I miss? Thanks, Brian Williams ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-17 02:16 Message: Logged In: YES user_id=14198 I think the new configuration options have largely fixed this. Each profile uses the same database, but different configuration. A different database may be desirable too, but one step at a time - at least now SpamBayes *works* with multiple profiles. I even tested it :) ---------------------------------------------------------------------- Comment By: Brian Williams (bwims) Date: 2003-05-27 16:50 Message: Logged In: YES user_id=626762 I had a look at addin.py and it seems like it ought to be easy enough to specify a separate toolbar for the buttons. This might be the way forward since the layout of the standard toolbar varies considerable depending on which folder you are looking at, whereas special toolbars do not, and it seems to be that part of the problem occurs when you start off in the top level folder. Best regards, Brian Williams ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743113&group_id=61702 From Paul.Moore at atosorigin.com Tue Jun 17 15:32:35 2003 From: Paul.Moore at atosorigin.com (Moore, Paul) Date: Tue Jun 17 09:33:53 2003 Subject: [spambayes-dev] Archiving this list on gmane Message-ID: <16E1010E4581B049ABC51D4975CEDB880113DB6A@UKDCX001.uk.int.atosorigin.com> Would it be OK if I arranged for this list to be archived on gmane (www.gmane.org)? The main spambayes list and the CVS checkins list are already there, so I assume that there is not likely to be a problem with this...? Paul Moore From skip at pobox.com Tue Jun 17 09:49:54 2003 From: skip at pobox.com (Skip Montanaro) Date: Tue Jun 17 09:50:06 2003 Subject: [spambayes-dev] Archiving this list on gmane In-Reply-To: <16E1010E4581B049ABC51D4975CEDB880113DB6A@UKDCX001.uk.int.atosorigin.com> References: <16E1010E4581B049ABC51D4975CEDB880113DB6A@UKDCX001.uk.int.atosorigin.com> Message-ID: <16111.7298.136778.895242@montanaro.dyndns.org> Paul> Would it be OK if I arranged for this list to be archived on gmane Paul> (www.gmane.org)? Sure. S From tdickenson at devmail.geminidataloggers.co.uk Tue Jun 17 16:05:15 2003 From: tdickenson at devmail.geminidataloggers.co.uk (Toby Dickenson) Date: Tue Jun 17 10:05:19 2003 Subject: [spambayes-dev] My first ever false positive..... Message-ID: <200306171505.15686.tdickenson@devmail.geminidataloggers.co.uk> Ive had my first ever false positive last week...... My training regime over the last few months has involved a full train on my whole mailbox overnight. One afternoon last week I received an overtly commercial email from someone I had started talking with that morning. There were no ham clues in the message, and it scored 0.91. If the message had arrived a few hours later (after my overnight train) it would have scored differently. The header clues from my previous conversation would have been included, and it would have scored 0.26. I could also have avoided this false positive by automatically training on messages classified as ham as they arrive. I will try adding this to my training regime, and will report results. From bill at parducci.net Tue Jun 17 09:11:05 2003 From: bill at parducci.net (bill parducci) Date: Tue Jun 17 11:11:10 2003 Subject: [spambayes-dev] musings on latest enhancement Message-ID: <3EEF2F89.8000208@parducci.net> i was browsing through the notes on the latest updates in CVS and came across this, which gave me pause: 'Nonsense' HTML tags are stripped rather than replaced with a space (e.g. Wrinkle Reduction becomes "Wrinkle" and "Reduction" rather than "Wr", "inkle", "Reduc" and "tion"). does this mean that will be igonored? i wonder if it wouldn't be of value to treat the 'nonsense tags' as a tokens (e.g. append the list of tokens to the end of the text being scored) in addition to 'removing' them? b From tim.one at comcast.net Tue Jun 17 12:33:57 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Jun 17 11:37:35 2003 Subject: [spambayes-dev] musings on latest enhancement In-Reply-To: <3EEF2F89.8000208@parducci.net> Message-ID: [bill parducci] > i was browsing through the notes on the latest updates in CVS and came > across this, which gave me pause: > > 'Nonsense' HTML tags are stripped rather than replaced with a space > (e.g. Wrinkle Reduction becomes > "Wrinkle" and "Reduction" rather than "Wr", "inkle", "Reduc" and > "tion"). > > does this mean that will be igonored? Yes, by spambayes. > i wonder if it wouldn't be of value to treat the 'nonsense tags' as a > tokens (e.g. append the list of tokens to the end of the text being > scored) in addition to 'removing' them? You can try it. I doubt it will help; it will certainly bloat database size due to creating more hapaxes in the presence of this junk. From popiel at wolfskeep.com Tue Jun 17 10:14:40 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Tue Jun 17 12:14:44 2003 Subject: [spambayes-dev] Archiving this list on gmane In-Reply-To: Message from "Moore, Paul" of "Tue, 17 Jun 2003 14:32:35 BST." <16E1010E4581B049ABC51D4975CEDB880113DB6A@UKDCX001.uk.int.atosorigin.com> References: <16E1010E4581B049ABC51D4975CEDB880113DB6A@UKDCX001.uk.int.atosorigin.com> Message-ID: <20030617161440.6052E2DE9C@cashew.wolfskeep.com> In message: <16E1010E4581B049ABC51D4975CEDB880113DB6A@UKDCX001.uk.int.atosorig in.com> "Moore, Paul" writes: >Would it be OK if I arranged for this list to be archived on gmane = >(www.gmane.org)? The main spambayes list and the CVS checkins list are = >already there, so I assume that there is not likely to be a problem with = >this...? +1 - Alex From T.A.Meyer at massey.ac.nz Wed Jun 18 13:46:57 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Jun 17 20:47:35 2003 Subject: [spambayes-dev] Release plan for this Friday Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130207E607@its-xchg4.massey.ac.nz> Unless anyone objects, I'm going to put up a release (a3) this Friday (NZ time). It doesn't seem like there is anything much left to do, apart from possibly some Outlook plug-in improvements, and those users will most likely wait for the next binary, anyway. Speak up if there are objections! I'll update the website, post to spambayes and -announce. =Tony Meyer From noreply at sourceforge.net Tue Jun 17 18:53:04 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 17 20:53:16 2003 Subject: [spambayes-dev] [ spambayes-Bugs-743113 ] Multiple profile issues Message-ID: Bugs item #743113, was opened at 2003-05-25 21:47 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743113&group_id=61702 Category: Outlook Group: None Status: Closed Resolution: Fixed Priority: 5 Submitted By: Mark Hammond (mhammond) Assigned to: Mark Hammond (mhammond) Summary: Multiple profile issues Initial Comment: The only problem I have is that I use multiple outlook profiles. At work, I use Exchange and at home I use a personal folder for my mail delivery, which contains its own inbox, junk and maybe folders. When I initiate Outlook with my work folder, it really screws up the buttons. The manager button doesn't work, and I get a duplicate pair of buttons appearing at the start of the standard toolbar. Fix is to disable the COM object, exit, enter, re-enable COM object, (exit and re-enter?). The only thing I can do right now is to disable the COM object before moving profiles. What would be great would be either to have a button that toggles the functionality on/off, or (ideally) get it to recognise the different profiles. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-18 10:53 Message: Logged In: YES user_id=14198 Sorry, I should have been clearer. This is currently CVS only. A new binary will be out soon (I am running out of features I can simply add and bugs I can simply fix!) ---------------------------------------------------------------------- Comment By: Brian Williams (bwims) Date: 2003-06-17 23:29 Message: Logged In: YES user_id=626762 Mark, what are the new configuration options? Are they in the Outlook installer, or the python distribution? I have the latest Outlook installer and when I move from a profile that uses a pst for delivery to one with an Exchange mailbox, it all goes haywire with buttons appearing twice and not working when pressed. What did I miss? Thanks, Brian Williams ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-17 12:16 Message: Logged In: YES user_id=14198 I think the new configuration options have largely fixed this. Each profile uses the same database, but different configuration. A different database may be desirable too, but one step at a time - at least now SpamBayes *works* with multiple profiles. I even tested it :) ---------------------------------------------------------------------- Comment By: Brian Williams (bwims) Date: 2003-05-28 02:50 Message: Logged In: YES user_id=626762 I had a look at addin.py and it seems like it ought to be easy enough to specify a separate toolbar for the buttons. This might be the way forward since the layout of the standard toolbar varies considerable depending on which folder you are looking at, whereas special toolbars do not, and it seems to be that part of the problem occurs when you start off in the top level folder. Best regards, Brian Williams ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=743113&group_id=61702 From T.A.Meyer at massey.ac.nz Wed Jun 18 14:55:16 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Jun 17 21:55:55 2003 Subject: [spambayes-dev] Message classes Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130207E64C@its-xchg4.massey.ac.nz> It seems to me that we have too many base message clases. There are: Corpus.Message (storage) message.Message (pop3proxy, imapfilter, Outlook?) msgs.Msg (testing) email.Message (mboxutils) Is it really necessary to have all of these? Could Corpus.Message, message.Message be combined? Could mboxutils use message.Message? (msgs.Msg seems quite different). Does anyone know the answers to these, or do I need to try and figure them out myself? :) =Tony Meyer From mhammond at skippinet.com.au Wed Jun 18 13:27:01 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Tue Jun 17 22:28:49 2003 Subject: [spambayes-dev] Message classes In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130207E64C@its-xchg4.massey.ac.nz> Message-ID: <000701c33541$1a04e2a0$530f8490@eden> > Corpus.Message (storage) AFAIK, this isn't used outside of TimS's code. > message.Message (pop3proxy, imapfilter, Outlook?) Outlook intends moving here, but I'm still not sure the pain is worth the gain at this stage. > msgs.Msg (testing) NFI > email.Message (mboxutils) I think email.Message could be an internal implementation detail of message.Message. I think we need a clear API for message.Message, including all transformations to and from "external" representations, and clear semantics for the object once "transformed" from the Outside world. I believe it is in these transformations that the oft mentioned "choke point" for handling badly formed mail must live. Also, as I mentioned, I am semi-keen to try falling back to rfc822 when email.Message chokes, just for the ascii. Mark. -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1964 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes-dev/attachments/20030618/1d43315d/winmail.bin From anthony at interlink.com.au Tue Jun 17 18:27:12 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Jun 17 22:53:58 2003 Subject: [spambayes-dev] breakage in imapfilter Message-ID: <200306170727.h5H7RC4x006550@localhost.localdomain> The code below is from ImapUI.py - the stuff in the ImportError section is totally busted. I'm not sure what it's supposed to be doing, so I'll leave it for someone who's more acquainted with the code to fix... class IMAPUserInterface(UserInterface.UserInterface): """Serves the HTML user interface for the proxies.""" def __init__(self, cls, imap, pwd): global classifier # Only offer SSL if it is available try: from imaplib import IMAP_SSL except ImportError: parm_list = list(parm_map) parm_list.remove(("imap", "use_ssl")) parm_map = tuple(parm_list) UserInterface.UserInterface.__init__(self, cls, parm_map) classifier = cls self.imap = imap self.imap_pwd = pwd self.imap_logged_in = False From T.A.Meyer at massey.ac.nz Wed Jun 18 16:14:06 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Jun 17 23:15:04 2003 Subject: [spambayes-dev] breakage in imapfilter Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130207E6A8@its-xchg4.massey.ac.nz> > The code below is from ImapUI.py - the stuff in the > ImportError section is totally busted. I'm not sure what it's > supposed to be doing, so I'll leave it for someone who's more > acquainted with the code to fix... In what way is it busted? It works for me. If the ImportError is raised (ie IMAP_SSL is not available, as in pre Python 2.3), then it removes the "imap", "use_ssl" option from the list of options to be presented (parm_map). This is parm_map before the ImportError: (('IMAP Options', None), ('imap', 'server'), ('imap', 'username'), ('imap', 'pas sword'), ('imap', 'use_ssl'), ('Header Options', None), ('pop3proxy', 'notate_to '), ('pop3proxy', 'notate_subject'), ('Headers', 'include_score'), ('Headers', ' include_thermostat'), ('Headers', 'include_evidence'), ('pop3proxy', 'add_mailid _to'), ('pop3proxy', 'strip_incoming_mailids'), ('Storage Options', None), ('Sto rage', 'persistent_storage_file'), ('Storage', 'messageinfo_storage_file'), ('St atistics Options', None), ('Categorization', 'ham_cutoff'), ('Categorization', ' spam_cutoff'), ('Classifier', 'experimental_ham_spam_imbalance_adjustment')) This is it afterwards: (('IMAP Options', None), ('imap', 'server'), ('imap', 'username'), ('imap', 'pas sword'), ('Header Options', None), ('pop3proxy', 'notate_to'), ('pop3proxy', 'no tate_subject'), ('Headers', 'include_score'), ('Headers', 'include_thermostat'), ('Headers', 'include_evidence'), ('pop3proxy', 'add_mailid_to'), ('pop3proxy', 'strip_incoming_mailids'), ('Storage Options', None), ('Storage', 'persistent_st orage_file'), ('Storage', 'messageinfo_storage_file'), ('Statistics Options', No ne), ('Categorization', 'ham_cutoff'), ('Categorization', 'spam_cutoff'), ('Clas sifier', 'experimental_ham_spam_imbalance_adjustment')) They are identical, except that the second is missing the option that it is meant to be. This is tested with Python 2.2.3 & 2.2.2. I haven't tested with 2.3, but the ImportError shouldn't be raised in that case (because IMAP_SSL should be available). =Tony Meyer From anthony at interlink.com.au Wed Jun 18 14:39:47 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Jun 17 23:40:58 2003 Subject: [spambayes-dev] breakage in imapfilter In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130207E6A8@its-xchg4.massey.ac.nz> Message-ID: <200306180339.h5I3dmDk015537@localhost.localdomain> >>> "Meyer, Tony" wrote > In what way is it busted? It works for me. If the ImportError is > raised (ie IMAP_SSL is not available, as in pre Python 2.3), then it > removes the "imap", "use_ssl" option from the list of options to be > presented (parm_map). Because you assign to parm_map, it's a local. I made the fix of making it explicitly global, and all is happy. Anthony -- Anthony Baxter It's never too late to have a happy childhood. From T.A.Meyer at massey.ac.nz Wed Jun 18 16:44:14 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Jun 17 23:45:05 2003 Subject: [spambayes-dev] breakage in imapfilter Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130207E6CF@its-xchg4.massey.ac.nz> > Because you assign to parm_map, it's a local. I made the fix > of making it explicitly global, and all is happy. Oh - was the message to -dev before the check-in? I ran my test after cvs-up'ing (because I received the check-in message before the -dev message). So all is fine now? =Tony Meyer From anthony at interlink.com.au Wed Jun 18 14:48:01 2003 From: anthony at interlink.com.au (Anthony Baxter) Date: Tue Jun 17 23:49:00 2003 Subject: [spambayes-dev] breakage in imapfilter In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F130207E6CF@its-xchg4.massey.ac.nz> Message-ID: <200306180348.h5I3m2oQ015633@localhost.localdomain> >>> "Meyer, Tony" wrote > > Because you assign to parm_map, it's a local. I made the fix > > of making it explicitly global, and all is happy. > > Oh - was the message to -dev before the check-in? I ran my test after > cvs-up'ing (because I received the check-in message before the -dev > message). Technically no. It actually ended up being that way because my stupid laptop decided not to send any email out yesterday. :-( From noreply at sourceforge.net Tue Jun 17 22:23:44 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 18 00:23:47 2003 Subject: [spambayes-dev] [ spambayes-Patches-756413 ] Add a SQL-based storage class Message-ID: Patches item #756413, was opened at 2003-06-18 16:23 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=756413&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: Add a SQL-based storage class Initial Comment: The attached patches add a SQL based storage class that can be used via pop3proxy. It assumes that the user has mySQL and MySQLdb installed, and has already set up the table as suggested on the spambayes mailing list. There is a great deal of room for improvement here. In particular, having a separate table for the tokens is probably a waste (as discussed somewhat abstractly on the list). If anyone wants an SQL solution, here it is. This should be able to be patched into the current cvs for a reasonable while yet. If you do end up using this, please let the list know. If a suitably large number of people do (at least 2!), then it could be checked in to the main code. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=756413&group_id=61702 From noreply at sourceforge.net Tue Jun 17 22:24:55 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 18 00:25:01 2003 Subject: [spambayes-dev] [ spambayes-Patches-756413 ] Add a SQL-based storage class Message-ID: Patches item #756413, was opened at 2003-06-18 16:23 Message generated for change (Settings changed) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=756413&group_id=61702 Category: None Group: None Status: Open Resolution: None >Priority: 1 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: Add a SQL-based storage class Initial Comment: The attached patches add a SQL based storage class that can be used via pop3proxy. It assumes that the user has mySQL and MySQLdb installed, and has already set up the table as suggested on the spambayes mailing list. There is a great deal of room for improvement here. In particular, having a separate table for the tokens is probably a waste (as discussed somewhat abstractly on the list). If anyone wants an SQL solution, here it is. This should be able to be patched into the current cvs for a reasonable while yet. If you do end up using this, please let the list know. If a suitably large number of people do (at least 2!), then it could be checked in to the main code. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=756413&group_id=61702 From noreply at sourceforge.net Tue Jun 17 22:51:51 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 18 00:51:54 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751693 ] Get multiple dialog boxes; close one, 2nd copy appears Message-ID: Bugs item #751693, was opened at 2003-06-10 12:46 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751693&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Mark Bennett (ttennebkram) Assigned to: Mark Hammond (mhammond) Summary: Get multiple dialog boxes; close one, 2nd copy appears Initial Comment: When I click the Anti-Spam button in Outlook, I do get the dialog box. When I click Close, the dialog pops up again, and I have to hit Close a second time. One time I actually got 3 dialog boxes in a row. I installed on a friend's machine and saw that same thing. I think it's harmless (?), but odd. I didn't see it logged in the database yet (looked for "multiple"). I've seen this on two different machines. Machine 1 Setup: Outlook 2000 Windows 2000 SpamBayes binary install, 0.2a? Machine 2 Setup: Outlook XP Windows XP SpamBayes binar install BTW, VERY COOL SOFTWARE!!!! ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-18 14:51 Message: Logged In: YES user_id=14198 This was fixed in the process of (hopefully) fixing all toolbar issues, particularly multiple window and outlook 2003. Checking in addin.py; new revision: 1.63; previous revision: 1.62 ---------------------------------------------------------------------- Comment By: Mark Bennett (ttennebkram) Date: 2003-06-17 12:16 Message: Logged In: YES user_id=797668 Thanks for your report. This bug has previously been reported, and has already been fixed in the source-code - so you can expect to see the fix in the next binary release. You may wish to subscribe to the spambayes-announce list to see when each version is released - details at http://mail.python.org/mailman-21/listinfo/spambayes-announce If you are technically minded, you may like to run from the CVS version of the source-code, and you should find the fix immediately - see the main SpamBayes pages for details. ---------------------------------------------------------------------- Comment By: Mark Bennett (ttennebkram) Date: 2003-06-17 12:16 Message: Logged In: YES user_id=797668 Thanks seantrue. Yes, if I close all of my Outlook windows except my main inbox, I only get one dialog box. I usually have my contacts open in a second window. ---------------------------------------------------------------------- Comment By: Sean True (seantrue) Date: 2003-06-17 11:54 Message: Logged In: YES user_id=114311 This appears to happen whenever multiple explorers are open. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751693&group_id=61702 From T.A.Meyer at massey.ac.nz Wed Jun 18 18:05:56 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Jun 18 01:06:36 2003 Subject: [spambayes-dev] Message classes Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130207E74B@its-xchg4.massey.ac.nz> > I think email.Message could be an internal implementation > detail of message.Message. I think we need a clear API for > message.Message, including all transformations to and from > "external" representations, and clear semantics for the > object once "transformed" from the Outside world. I believe > it is in these transformations that the oft mentioned "choke > point" for handling badly formed mail must live. Also, as I > mentioned, I am semi-keen to try falling back to rfc822 when > email.Message chokes, just for the ascii. +1. For the moment, I'm going to remove Corpus.Message, and change FileCorpus.FileMessage to inherit from message.Message instead. As far as I can see Corpus.Message and message.Message are trying to do much of the same thing, anyway. I'll leave msgs.Msg alone since it's only for testing. I think that mboxutils is much the same as the Outlook plugin, in that it one day will use message.Message, but it's too much hassle for the moment. Once message.Message has the 'choke point' code, the gain should be worth the pain. I've done a quick change and pop3proxy seems to still work, but I'll wait to check things in until after the alpha3 release I want to do on Friday. This will give others (hopefully including TimS) a chance to correct my misunderstandings... If I feel inspired I might start work on writing up a clear message.Message API, but otherwise not :) =Tony Meyer From skip at pobox.com Wed Jun 18 07:47:22 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Jun 18 07:47:33 2003 Subject: [spambayes-dev] Re: [Spambayes-checkins] spambayes/Outlook2000 addin.py, 1.62, 1.63 In-Reply-To: References: Message-ID: <16112.20810.58407.602288@montanaro.dyndns.org> >From a recent checkin message: Mark> Only create a SpamBayes toolbar once. If the toolbar was created Mark> for one explorer Window, newly created Windows don't see the new Mark> toolbar (but they do once outlook is restarted). What's an "explorer Window"? Is that a Microsoft term or does it refer to Internet Explorer or do a typical Windows file system explorer? naive-ly y'rs, Skip From noreply at sourceforge.net Wed Jun 18 05:54:29 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 18 07:54:34 2003 Subject: [spambayes-dev] [ spambayes-Patches-756413 ] Add a SQL-based storage class Message-ID: Patches item #756413, was opened at 2003-06-17 23:23 Message generated for change (Comment added) made by montanaro You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=756413&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 1 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: Add a SQL-based storage class Initial Comment: The attached patches add a SQL based storage class that can be used via pop3proxy. It assumes that the user has mySQL and MySQLdb installed, and has already set up the table as suggested on the spambayes mailing list. There is a great deal of room for improvement here. In particular, having a separate table for the tokens is probably a waste (as discussed somewhat abstractly on the list). If anyone wants an SQL solution, here it is. This should be able to be patched into the current cvs for a reasonable while yet. If you do end up using this, please let the list know. If a suitably large number of people do (at least 2!), then it could be checked in to the main code. ---------------------------------------------------------------------- >Comment By: Skip Montanaro (montanaro) Date: 2003-06-18 06:54 Message: Logged In: YES user_id=44345 A couple quick observations without having tried the code: * the diffs all seem to be reversed and are not context diffs, so it will be tough to apply them if the surrounding code changes at all * don't you need to specify a database name in the connect call? * maybe it should be called MySQLClassifier to admit other SQL implementations later. For the short term, perhaps a dummy SQLClassifier could inherit from classifier.Classifier and be the base class for MySQLClassifier. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=756413&group_id=61702 From mhammond at skippinet.com.au Wed Jun 18 23:55:39 2003 From: mhammond at skippinet.com.au (Mark Hammond) Date: Wed Jun 18 08:56:33 2003 Subject: [spambayes-dev] Re: [Spambayes-checkins] spambayes/Outlook2000addin.py, 1.62, 1.63 In-Reply-To: <16112.20810.58407.602288@montanaro.dyndns.org> Message-ID: <00de01c33598$ebf810f0$530f8490@eden> > >From a recent checkin message: > > Mark> Only create a SpamBayes toolbar once. If the > toolbar was created > Mark> for one explorer Window, newly created Windows > don't see the new > Mark> toolbar (but they do once outlook is restarted). > > What's an "explorer Window"? Is that a Microsoft term or > does it refer to > Internet Explorer or do a typical Windows file system explorer? The Outlook object model has a few of UI related objects - an "Explorer" (displays a collection (and maybe a preview pane) - ie, the main window) and an "Inspector" (views a single item - mail, contact etc) among them. I have managed to ignore Inspectors until now :) Combine this with "Internet Explorer" and "Windows Explorer" - both of which are different things (even thouhh for latter embeds the former), and confusion is to be expected <0.1 wink> Mark. From noreply at sourceforge.net Wed Jun 18 07:22:06 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 18 09:22:10 2003 Subject: [spambayes-dev] [ spambayes-Bugs-751283 ] addin installs but does not work (stays as unchecked in COM) Message-ID: Bugs item #751283, was opened at 2003-06-09 21:35 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751283&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Duplicate Priority: 5 Submitted By: Radek Stachowiak (radek) Assigned to: Mark Hammond (mhammond) Summary: addin installs but does not work (stays as unchecked in COM) Initial Comment: here is my spamlog1: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy. py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy. py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy. py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-18 23:22 Message: Logged In: YES user_id=14198 Dupe of 740311 - 'skip_max_word_size' error on XP ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-06-17 14:10 Message: Logged In: YES user_id=14198 Please see the latest Troubleshooting.html for info on how to reenable. http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/spambayes/spambayes/Outlook2000/docs/troubleshooting.html?rev=HEAD ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=751283&group_id=61702 From noreply at sourceforge.net Wed Jun 18 07:23:22 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 18 09:23:36 2003 Subject: [spambayes-dev] [ spambayes-Bugs-742440 ] Outlook 2003 doesn't display toolbar Message-ID: Bugs item #742440, was opened at 2003-05-24 02:21 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742440&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Kenneth Haynes (kh4yn3s) Assigned to: Mark Hammond (mhammond) >Summary: Outlook 2003 doesn't display toolbar Initial Comment: I am running Windows XP, SP1. I have the Beta 2 version of Office that includes Outlook 2003. When I tried to install using the windows installer, I got no error messages, but neither did I get the configuration buttons on the menu. spambayes1.log: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\ken\Application Data\SpamBayes\default_configuration.pck' Loaded bayes database from 'C:\Documents and Settings\ken\Application Data\SpamBayes\default_bayes_database.db' Loaded message database from 'C:\Documents and Settings\ken\Application Data\SpamBayes\default_message_database.db' Bayes database initialized with 0 spam and 0 good messages Loaded databases in 4.68803ms SpamAddin - Disconnecting from Outlook spambayes2.log and spambayes3.log all look the same as spambayes1.log. ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-18 23:23 Message: Logged In: YES user_id=14198 I'm hoping this is fixed in CVS - will know soon :) ---------------------------------------------------------------------- Comment By: Cort Schaefer (corts) Date: 2003-05-26 14:47 Message: Logged In: YES user_id=541173 This looks exactly like what I am getting running the same configuration. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=742440&group_id=61702 From noreply at sourceforge.net Wed Jun 18 17:43:31 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 18 19:43:39 2003 Subject: [spambayes-dev] [ spambayes-Patches-756413 ] Add a SQL-based storage class Message-ID: Patches item #756413, was opened at 2003-06-18 16:23 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=756413&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 1 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: Add a SQL-based storage class Initial Comment: The attached patches add a SQL based storage class that can be used via pop3proxy. It assumes that the user has mySQL and MySQLdb installed, and has already set up the table as suggested on the spambayes mailing list. There is a great deal of room for improvement here. In particular, having a separate table for the tokens is probably a waste (as discussed somewhat abstractly on the list). If anyone wants an SQL solution, here it is. This should be able to be patched into the current cvs for a reasonable while yet. If you do end up using this, please let the list know. If a suitably large number of people do (at least 2!), then it could be checked in to the main code. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-19 11:43 Message: Logged In: YES user_id=552329 Is there a diff how-to somewhere? I looked for one a while back, but couldn't find anything (apart from the manual for diff itself). I was just guessing... The connect call connects to the mySQL server. The USE command just after that connects to the appropriate database. I haven't used mySQLdb much, so I don't know for sure this is the correct way to do it, but it did work. The base class is a good idea. I'll do this if I ever put this in the main code. ---------------------------------------------------------------------- Comment By: Skip Montanaro (montanaro) Date: 2003-06-18 23:54 Message: Logged In: YES user_id=44345 A couple quick observations without having tried the code: * the diffs all seem to be reversed and are not context diffs, so it will be tough to apply them if the surrounding code changes at all * don't you need to specify a database name in the connect call? * maybe it should be called MySQLClassifier to admit other SQL implementations later. For the short term, perhaps a dummy SQLClassifier could inherit from classifier.Classifier and be the base class for MySQLClassifier. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=756413&group_id=61702 From noreply at sourceforge.net Wed Jun 18 18:03:33 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 18 20:03:43 2003 Subject: [spambayes-dev] [ spambayes-Patches-756413 ] Add a SQL-based storage class Message-ID: Patches item #756413, was opened at 2003-06-17 23:23 Message generated for change (Comment added) made by montanaro You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=756413&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 1 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: Add a SQL-based storage class Initial Comment: The attached patches add a SQL based storage class that can be used via pop3proxy. It assumes that the user has mySQL and MySQLdb installed, and has already set up the table as suggested on the spambayes mailing list. There is a great deal of room for improvement here. In particular, having a separate table for the tokens is probably a waste (as discussed somewhat abstractly on the list). If anyone wants an SQL solution, here it is. This should be able to be patched into the current cvs for a reasonable while yet. If you do end up using this, please let the list know. If a suitably large number of people do (at least 2!), then it could be checked in to the main code. ---------------------------------------------------------------------- >Comment By: Skip Montanaro (montanaro) Date: 2003-06-18 19:03 Message: Logged In: YES user_id=44345 "diff -c oldfile newfile" should be sufficient to generate a patch that will be more forgiving of slight changes to the files being patched. I don't know if the patch command will even accept plain diffs. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-18 18:43 Message: Logged In: YES user_id=552329 Is there a diff how-to somewhere? I looked for one a while back, but couldn't find anything (apart from the manual for diff itself). I was just guessing... The connect call connects to the mySQL server. The USE command just after that connects to the appropriate database. I haven't used mySQLdb much, so I don't know for sure this is the correct way to do it, but it did work. The base class is a good idea. I'll do this if I ever put this in the main code. ---------------------------------------------------------------------- Comment By: Skip Montanaro (montanaro) Date: 2003-06-18 06:54 Message: Logged In: YES user_id=44345 A couple quick observations without having tried the code: * the diffs all seem to be reversed and are not context diffs, so it will be tough to apply them if the surrounding code changes at all * don't you need to specify a database name in the connect call? * maybe it should be called MySQLClassifier to admit other SQL implementations later. For the short term, perhaps a dummy SQLClassifier could inherit from classifier.Classifier and be the base class for MySQLClassifier. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=756413&group_id=61702 From noreply at sourceforge.net Wed Jun 18 19:17:58 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 18 21:18:01 2003 Subject: [spambayes-dev] [ spambayes-Support Requests-756965 ] I sent in the OCX/DLL problem Message-ID: Support Requests item #756965, was opened at 2003-06-18 18:17 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498104&aid=756965&group_id=61702 Category: Install Problem (example) Group: None Status: Open Priority: 5 Submitted By: Kathy Gauvin (kmgauvin) Assigned to: Nobody/Anonymous (nobody) Summary: I sent in the OCX/DLL problem Initial Comment: Still having the same problem. I went to Tools, Options, Other, Advanced Options, and then looked at the Comm Add-ins, and there was nothing there. The Add-In Manager had nothing checked, Custom forms had nothing. I looked on every page and had nothing checked. What do I do to make the Spambayes install work? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498104&aid=756965&group_id=61702 From noreply at sourceforge.net Thu Jun 19 04:27:24 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 19 06:32:50 2003 Subject: [spambayes-dev] [ spambayes-Bugs-757131 ] "Bad file descriptor" starting pop3proxy_service Message-ID: Bugs item #757131, was opened at 2003-06-19 10:27 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757131&group_id=61702 Category: pop3proxy Group: None Status: Open Resolution: None Priority: 5 Submitted By: Steve Tregidgo (smst) Assigned to: Nobody/Anonymous (nobody) Summary: "Bad file descriptor" starting pop3proxy_service Initial Comment: (Running SpamBayes alpha2 release under Python 2.2.3 (Win2k) with the additional module pop3proxy_service.py.) I tried to diagnose a 'hdrtxt' AttributeError (which has now disappeared, so I have no traceback -- I remember only that it was raised explicitly by Message.__getattr__ for a FileMessage instance) while reviewing messages through the pop3proxy interface. I stopped the pop3proxy service (using Win2k's "Services" panel), enabled the verbose option in bayescustomize.ini and started the service again. The service silently failed to start; in the Event Viewer was the following traceback: The instance's SvcRun() method failed File "C:\PYTHON22\lib\site-packages\win32\lib\win32serviceutil.py", line 635, in SvcRun self.SvcDoRun() File "C:\Program Files\spambayes-1.0a2\windows\pop3proxy_service.py", line 68, in SvcDoRun pop3proxy.state.createWorkers() File "C:\Program Files\spambayes-1.0a2\pop3proxy.py", line 1198, in createWorkers self.hamCorpus.removeExpiredMessages() File "C:\Program Files\spambayes-1.0a2\spambayes\Corpus.py", line 264, in removeExpiredMessages for msg in self: File "C:\Program Files\spambayes-1.0a2\spambayes\Corpus.py", line 230, in __iter__ yield self[key] File "C:\Program Files\spambayes-1.0a2\spambayes\Corpus.py", line 216, in __getitem__ self.cacheMessage(amsg) File "C:\Program Files\spambayes-1.0a2\spambayes\Corpus.py", line 173, in cacheMessage self.unCacheMessage(keyToFlush) File "C:\Program Files\spambayes-1.0a2\spambayes\Corpus.py", line 180, in unCacheMessage print 'Flushing %s from corpus cache' % (key) exceptions.IOError: [Errno 9] Bad file descriptor Starting pop3proxy from a command prompt worked fine, with the print statement being executed properly. I suspected that the running of pop3proxy as a service was causing sys.stdout to be broken somehow, but restarting through "Services" again, catching the IOError and raising information about stdout gives the following, which seems reasonable: ', mode 'w' at 0x008D7708> I can continue to run the service with 'verbose' set to False of course, but it would be handy to be able to switch that flag back on in case my 'hdrtxt' error returns. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757131&group_id=61702 From noreply at sourceforge.net Thu Jun 19 09:01:06 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 19 11:01:15 2003 Subject: [spambayes-dev] [ spambayes-Bugs-757287 ] syntax error in FolderSelector.PY Message-ID: Bugs item #757287, was opened at 2003-06-19 15:01 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757287&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Emile van Sebille (evansebille) Assigned to: Mark Hammond (mhammond) Summary: syntax error in FolderSelector.PY Initial Comment: Line 91 in FolderSelector.py per displayed cvs is missing the trailing colon not allowing the spam and ham training selections to be set. except pythoncom.com_error, details ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757287&group_id=61702 From noreply at sourceforge.net Thu Jun 19 10:42:25 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 19 12:42:28 2003 Subject: [spambayes-dev] [ spambayes-Bugs-757370 ] Traceback when training on a certain message Message-ID: Bugs item #757370, was opened at 2003-06-19 18:42 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757370&group_id=61702 Category: hammie Group: None Status: Open Resolution: None Priority: 5 Submitted By: Daniel Buchmann (avalon) Assigned to: Nobody/Anonymous (nobody) Summary: Traceback when training on a certain message Initial Comment: Using current CVS, mboxtrain crashes on the attached Unix mbox file, with the following traceback: Training ham (tss2.mbox): Reading as Unix mbox Traceback (most recent call last): File "/home/db/spambayes/mboxtrain.py", line 294, in ? main() File "/home/db/spambayes/mboxtrain.py", line 281, in main train(h, g, False, force, trainnew) File "/home/db/spambayes/mboxtrain.py", line 214, in train mbox_train(h, path, is_spam, force) File "/home/db/spambayes/mboxtrain.py", line 148, in mbox_train if msg_train(h, msg, is_spam, force): File "/home/db/spambayes/mboxtrain.py", line 80, in msg_train h.train(msg, is_spam) File "/home/db/spambayes/hammie.py", line 150, in train spambayes.hammiebulk.main() File "./spambayes/classifier.py", line 276, in learn File "./spambayes/classifier.py", line 402, in _add_msg File "./spambayes/storage.py", line 215, in _wordinfoget File "/usr/lib/python2.2/shelve.py", line 65, in get if self.dict.has_key(key): TypeError: key type must be string What could be wrong? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757370&group_id=61702 From noreply at sourceforge.net Thu Jun 19 10:42:26 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 19 12:42:32 2003 Subject: [spambayes-dev] [ spambayes-Bugs-757371 ] Traceback when training on a certain message Message-ID: Bugs item #757371, was opened at 2003-06-19 18:42 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757371&group_id=61702 Category: hammie Group: None Status: Open Resolution: None Priority: 5 Submitted By: Daniel Buchmann (avalon) Assigned to: Nobody/Anonymous (nobody) Summary: Traceback when training on a certain message Initial Comment: Using current CVS, mboxtrain crashes on the attached Unix mbox file, with the following traceback: Training ham (buggy.mbox): Reading as Unix mbox Traceback (most recent call last): File "/home/db/spambayes/mboxtrain.py", line 294, in ? main() File "/home/db/spambayes/mboxtrain.py", line 281, in main train(h, g, False, force, trainnew) File "/home/db/spambayes/mboxtrain.py", line 214, in train mbox_train(h, path, is_spam, force) File "/home/db/spambayes/mboxtrain.py", line 148, in mbox_train if msg_train(h, msg, is_spam, force): File "/home/db/spambayes/mboxtrain.py", line 80, in msg_train h.train(msg, is_spam) File "/home/db/spambayes/hammie.py", line 150, in train spambayes.hammiebulk.main() File "./spambayes/classifier.py", line 276, in learn File "./spambayes/classifier.py", line 402, in _add_msg File "./spambayes/storage.py", line 215, in _wordinfoget File "/usr/lib/python2.2/shelve.py", line 65, in get if self.dict.has_key(key): TypeError: key type must be string What could be wrong? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757371&group_id=61702 From noreply at sourceforge.net Thu Jun 19 10:43:46 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 19 12:43:52 2003 Subject: [spambayes-dev] [ spambayes-Bugs-757370 ] Traceback when training on a certain message Message-ID: Bugs item #757370, was opened at 2003-06-19 18:42 Message generated for change (Settings changed) made by avalon You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757370&group_id=61702 Category: hammie Group: None >Status: Deleted Resolution: None Priority: 5 Submitted By: Daniel Buchmann (avalon) Assigned to: Nobody/Anonymous (nobody) Summary: Traceback when training on a certain message Initial Comment: Using current CVS, mboxtrain crashes on the attached Unix mbox file, with the following traceback: Training ham (tss2.mbox): Reading as Unix mbox Traceback (most recent call last): File "/home/db/spambayes/mboxtrain.py", line 294, in ? main() File "/home/db/spambayes/mboxtrain.py", line 281, in main train(h, g, False, force, trainnew) File "/home/db/spambayes/mboxtrain.py", line 214, in train mbox_train(h, path, is_spam, force) File "/home/db/spambayes/mboxtrain.py", line 148, in mbox_train if msg_train(h, msg, is_spam, force): File "/home/db/spambayes/mboxtrain.py", line 80, in msg_train h.train(msg, is_spam) File "/home/db/spambayes/hammie.py", line 150, in train spambayes.hammiebulk.main() File "./spambayes/classifier.py", line 276, in learn File "./spambayes/classifier.py", line 402, in _add_msg File "./spambayes/storage.py", line 215, in _wordinfoget File "/usr/lib/python2.2/shelve.py", line 65, in get if self.dict.has_key(key): TypeError: key type must be string What could be wrong? ---------------------------------------------------------------------- >Comment By: Daniel Buchmann (avalon) Date: 2003-06-19 18:43 Message: Logged In: YES user_id=184577 Oops.... submitted twice. Deleting duplicate. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757370&group_id=61702 From noreply at sourceforge.net Thu Jun 19 10:45:27 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 19 12:45:32 2003 Subject: [spambayes-dev] [ spambayes-Bugs-757370 ] Traceback when training on a certain message Message-ID: Bugs item #757370, was opened at 2003-06-19 18:42 Message generated for change (Settings changed) made by avalon You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757370&group_id=61702 Category: hammie Group: None Status: Deleted Resolution: None Priority: 5 Submitted By: Daniel Buchmann (avalon) Assigned to: Nobody/Anonymous (nobody) Summary: Traceback when training on a certain message Initial Comment: Using current CVS, mboxtrain crashes on the attached Unix mbox file, with the following traceback: Training ham (tss2.mbox): Reading as Unix mbox Traceback (most recent call last): File "/home/db/spambayes/mboxtrain.py", line 294, in ? main() File "/home/db/spambayes/mboxtrain.py", line 281, in main train(h, g, False, force, trainnew) File "/home/db/spambayes/mboxtrain.py", line 214, in train mbox_train(h, path, is_spam, force) File "/home/db/spambayes/mboxtrain.py", line 148, in mbox_train if msg_train(h, msg, is_spam, force): File "/home/db/spambayes/mboxtrain.py", line 80, in msg_train h.train(msg, is_spam) File "/home/db/spambayes/hammie.py", line 150, in train spambayes.hammiebulk.main() File "./spambayes/classifier.py", line 276, in learn File "./spambayes/classifier.py", line 402, in _add_msg File "./spambayes/storage.py", line 215, in _wordinfoget File "/usr/lib/python2.2/shelve.py", line 65, in get if self.dict.has_key(key): TypeError: key type must be string What could be wrong? ---------------------------------------------------------------------- Comment By: Daniel Buchmann (avalon) Date: 2003-06-19 18:43 Message: Logged In: YES user_id=184577 Oops.... submitted twice. Deleting duplicate. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757370&group_id=61702 From noreply at sourceforge.net Thu Jun 19 10:46:12 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 19 12:46:17 2003 Subject: [spambayes-dev] [ spambayes-Bugs-757370 ] Traceback when training on a certain message Message-ID: Bugs item #757370, was opened at 2003-06-19 18:42 Message generated for change (Settings changed) made by avalon You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757370&group_id=61702 Category: hammie Group: None Status: Deleted Resolution: None Priority: 5 Submitted By: Daniel Buchmann (avalon) Assigned to: Nobody/Anonymous (nobody) Summary: Traceback when training on a certain message Initial Comment: Using current CVS, mboxtrain crashes on the attached Unix mbox file, with the following traceback: Training ham (tss2.mbox): Reading as Unix mbox Traceback (most recent call last): File "/home/db/spambayes/mboxtrain.py", line 294, in ? main() File "/home/db/spambayes/mboxtrain.py", line 281, in main train(h, g, False, force, trainnew) File "/home/db/spambayes/mboxtrain.py", line 214, in train mbox_train(h, path, is_spam, force) File "/home/db/spambayes/mboxtrain.py", line 148, in mbox_train if msg_train(h, msg, is_spam, force): File "/home/db/spambayes/mboxtrain.py", line 80, in msg_train h.train(msg, is_spam) File "/home/db/spambayes/hammie.py", line 150, in train spambayes.hammiebulk.main() File "./spambayes/classifier.py", line 276, in learn File "./spambayes/classifier.py", line 402, in _add_msg File "./spambayes/storage.py", line 215, in _wordinfoget File "/usr/lib/python2.2/shelve.py", line 65, in get if self.dict.has_key(key): TypeError: key type must be string What could be wrong? ---------------------------------------------------------------------- Comment By: Daniel Buchmann (avalon) Date: 2003-06-19 18:43 Message: Logged In: YES user_id=184577 Oops.... submitted twice. Deleting duplicate. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757370&group_id=61702 From noreply at sourceforge.net Thu Jun 19 16:37:13 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 19 18:37:20 2003 Subject: [spambayes-dev] [ spambayes-Bugs-757287 ] syntax error in FolderSelector.PY Message-ID: Bugs item #757287, was opened at 2003-06-20 01:01 Message generated for change (Comment added) made by mhammond You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757287&group_id=61702 Category: Outlook Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Emile van Sebille (evansebille) Assigned to: Mark Hammond (mhammond) Summary: syntax error in FolderSelector.PY Initial Comment: Line 91 in FolderSelector.py per displayed cvs is missing the trailing colon not allowing the spam and ham training selections to be set. except pythoncom.com_error, details ---------------------------------------------------------------------- >Comment By: Mark Hammond (mhammond) Date: 2003-06-20 08:37 Message: Logged In: YES user_id=14198 Ooops! Fixed. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=757287&group_id=61702 From greg at python.org Thu Jun 19 23:14:26 2003 From: greg at python.org (Greg Ward) Date: Thu Jun 19 22:14:29 2003 Subject: [spambayes-dev] 1070 spam, 1 false positive Message-ID: <20030620021425.GA28324@cthulhu.gerg.ca> Gee, I don't think I told the spambayes-dev crowd that I switched mail.python.org over to Spambayes last weekend. Anyways, so far it has rejected 1070 spam messages, and we just got our first false positive today: """ Return-Path: <*CENSORED*@aol.com> Envelope-To: Tutor@python.org Received: from imo-d04.mx.aol.com ([205.188.157.36]) by mail.python.org with esmtp (Exim 4.05) id 19Sp8l-000636-00 for Tutor@python.org; Wed, 18 Jun 2003 22:27:07 -0400 Received: from *CENSORED*@aol.com by imo-d04.mx.aol.com (mail_out_v36.3.) id 8.bd.3377bddd (4402) for ; Wed, 18 Jun 2003 22:27:01 -0400 (EDT) From: *CENSORED*@aol.com Message-ID: Date: Wed, 18 Jun 2003 22:27:01 EDT Subject: tutor To: Tutor@python.org MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="part1_bd.3377bddd.2c227975_bounda ry" X-Mailer: 8.0 for Windows sub 6011 X-Spam-Status: SPAM (default 0.994) Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit UNSUBSCRIBE PLEASE *CENSORED*@aol.com """ Oh wait, it was *really* a multipart/alternative message with text/plain and text/html; the above is just how mutt renders it for me. But never mind that; there's some interesting stuff going on here. Background: I grouped the 300+ recipient addresses on mail.python.org into 18 clusters of similar addresses; tutor@python.org falls into the grab-bag "default" cluster (which is what "default 0.994" means in the X-Spam-Status header: in the context of the "default" training DB, this message scored 0.994). But note that this message was sent to the wrong address: admin requests should never be sent to the list post address! So, while this is certainly not spam in the UBE sense, it *is* undesired mail for tutor@python.org. If I score this message under the "list-misc" or "list-owner" training -- which are for *-request and *-owner addresses, ie. the *right* place to send this sort of message -- it scores 0.02 and 0.05 respectively. So if this guy had sent his message to the right address, it would have been accepted without problems; Spambayes successfully blocked him from bothering the whole list with his off-topic request. Cool! I guess I should go unsubscribe the poor slob now... Greg -- Greg Ward http://www.gerg.ca/ Always look on the bright side of life. From tim.one at comcast.net Fri Jun 20 00:40:36 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Jun 19 23:43:11 2003 Subject: [spambayes-dev] 1070 spam, 1 false positive In-Reply-To: <20030620021425.GA28324@cthulhu.gerg.ca> Message-ID: [Greg Ward] > Gee, I don't think I told the spambayes-dev crowd that I switched > mail.python.org over to Spambayes last weekend. Anyways, so far it > has rejected 1070 spam messages, and we just got our first false > positive today: > > """ > Return-Path: <*CENSORED*@aol.com> spambayes considers all email from AOL to be spam, you know . > Envelope-To: Tutor@python.org > Received: from imo-d04.mx.aol.com ([205.188.157.36]) > by mail.python.org with esmtp (Exim 4.05) > id 19Sp8l-000636-00 > for Tutor@python.org; Wed, 18 Jun 2003 22:27:07 -0400 > Received: from *CENSORED*@aol.com > by imo-d04.mx.aol.com (mail_out_v36.3.) id 8.bd.3377bddd > (4402) for ; Wed, 18 Jun 2003 22:27:01 > -0400 (EDT) From: *CENSORED*@aol.com > Message-ID: > Date: Wed, 18 Jun 2003 22:27:01 EDT > Subject: tutor > To: Tutor@python.org > MIME-Version: 1.0 > Content-Type: multipart/alternative; > boundary="part1_bd.3377bddd.2c227975_boundary" > X-Mailer: 8.0 for Windows sub 6011 > X-Spam-Status: SPAM (default 0.994) > > Content-Type: text/plain; charset="US-ASCII" > Content-Transfer-Encoding: 7bit > > UNSUBSCRIBE PLEASE > > > *CENSORED*@aol.com > """ > > Oh wait, it was *really* a multipart/alternative message with > text/plain and text/html; the above is just how mutt renders it for > me. But never mind that; there's some interesting stuff going on > here. > > Background: I grouped the 300+ recipient addresses on mail.python.org > into 18 clusters of similar addresses; tutor@python.org falls into the > grab-bag "default" cluster (which is what "default 0.994" means in the > X-Spam-Status header: in the context of the "default" training DB, > this message scored 0.994). It would be interesting to see the whole "clue" list -- I'm guessing there must have been more damaging stuff in the HTML part (spambayes looks at all text/* parts). > But note that this message was sent to the wrong address: admin > requests should never be sent to the list post address! So, while > this is certainly not spam in the UBE sense, it *is* undesired mail > for tutor@python.org. If I score this message under the "list-misc" > or "list-owner" training -- which are for *-request and *-owner > addresses, ie. the *right* place to send this sort of message -- it > scores 0.02 and 0.05 respectively. Cool! In my old python.org tests, you'll recall that the most significant source of false positives was the same kind of thing: multipart/alternative administrivia requests with one- or two- word text/plain parts. I used a single database then, though. > So if this guy had sent his message to the right address, it would > have been accepted without problems; Spambayes successfully blocked > him from bothering the whole list with his off-topic request. Cool! > > I guess I should go unsubscribe the poor slob now... Or sell him penis reduction pills. I can't imagine any other reason for why he couldn't see the list admin URL at the bottom of every email he got from the Tutor list! takes-one-to-know-one-ly y'rs - tim From greg at python.org Fri Jun 20 09:03:17 2003 From: greg at python.org (Greg Ward) Date: Fri Jun 20 08:03:20 2003 Subject: [spambayes-dev] 1070 spam, 1 false positive In-Reply-To: References: <20030620021425.GA28324@cthulhu.gerg.ca> Message-ID: <20030620120317.GA29094@cthulhu.gerg.ca> On 19 June 2003, Tim Peters said: > spambayes considers all email from AOL to be spam, you know . Try <0.01 wink> -- wait till you see the incriminating clues! > It would be interesting to see the whole "clue" list -- I'm guessing there > must have been more damaging stuff in the HTML part (spambayes looks at all > text/* parts). Yup, forgot to post that last night. First here's my config filen -- or at least the [Tokenizer] section: basic_header_tokenize: True basic_header_skip: received envelope-to delivered-to delivery-date x-spam-flag x-spam-status content-type list-* record_header_absence: True address_headers: from to cc sender reply-to mine_received_headers: True And here's the complete token list: Y 0.996 save/ham/cur/1056075169.22878_58.mail:2,S '*H*': 0.007 '*S*': 1.000 'date:EDT': 0.016 'message-id:skip:b 20': 0.172 'charset:us-ascii': 0.270 'content-type:text/plain': 0.273 'reply-to:none': 0.380 'date:Wed': 0.384 'header:Received:2': 0.391 'date:2003': 0.630 'header:MIME-Version:1': 0.647 'date:Jun': 0.655 'to:addr:python.org': 0.686 'to:python.org': 0.686 'x-mailer:Windows': 0.689 'please': 0.718 'unsubscribe': 0.781 'email addr:aol.com': 0.845 'to:addr:tutor': 0.845 'content-type:multipart/alternative': 0.916 'x-mailer:for': 0.924 'received:aol.com': 0.965 'received:mx.aol.com': 0.965 'return-path:aol.com': 0.973 'content-type:text/html': 0.978 'x-mailer:sub': 0.978 'from:addr:aol.com': 0.983 'from:aol.com': 0.983 ...so your flippant remark about AOL was not all that far off! Oh yeah, here's what I get using the "list-misc" training DB -- ie. the DB that would have been used if this message had been sent (correctly) to tutor-request@python.org: N 0.024 save/ham/cur/1056075169.22878_58.mail:2,S '*H*': 0.957 '*S*': 0.004 'date:EDT': 0.029 'message-id:@aol.com': 0.081 'message-id:aol.com': 0.081 'message-id:skip:b 20': 0.081 'x-mailer:for': 0.141 'to:Tutor': 0.183 'x-mailer:8.0': 0.183 'charset:us-ascii': 0.222 'content-type:text/plain': 0.228 'received:mx.aol.com': 0.230 'email addr:aol.com': 0.268 'subject:tutor': 0.268 'please': 0.297 'x-mailer:Windows': 0.324 'received:com': 0.335 'header:Received:2': 0.349 'unsubscribe': 0.373 'from:no real name:2**0': 0.622 'date:2003': 0.663 'date:Jun': 0.681 'content-type:multipart/alternative': 0.743 'content-type:text/html': 0.931 All very interesting, no doubt. Greg -- Greg Ward http://www.gerg.ca/ Time flies like an arrow; fruit flies like a banana. From mj at zope.com Fri Jun 20 11:08:44 2003 From: mj at zope.com (Martijn Pieters) Date: Fri Jun 20 10:09:55 2003 Subject: [spambayes-dev] Re: 1070 spam, 1 false positive In-Reply-To: <20030620021425.GA28324@cthulhu.gerg.ca> References: <20030620021425.GA28324@cthulhu.gerg.ca> Message-ID: <20030620140844.GC6654@zope.com> On Thu, Jun 19, 2003 at 10:14:26PM -0400, Greg Ward wrote: > Gee, I don't think I told the spambayes-dev crowd that I switched > mail.python.org over to Spambayes last weekend. Anyways, so far it has > rejected 1070 spam messages, and we just got our first false positive > today: Sorry to be a party pooper, but there were more false positives; I rescued 2 earlier this week. The following message was also marked as spam: --------------------------------------------- Return-Path: Envelope-To: zope-announce@zope.org Received: from amsfep13-int.chello.nl ([213.46.243.24]) by mail.python.org with esmtp (Exim 4.05) id 19RtI8-0007ds-00 for zope-announce@zope.org; Mon, 16 Jun 2003 08:40:56 -0400 Received: from skullsplitter ([213.132.141.11]) by amsfep13-int.chello.nl (InterMail vM.5.01.05.17 201-253-122-126-117-20021021) with SMTP id <20030616124025.UVYR16676.amsfep13-int.chello.nl@skullsplitter>; Mon, 16 Jun 2003 14:40:25 +0200 Message-ID: <017601c33404$ef0101c0$8901a8c0@skullsplitter> Reply-To: "Tom Deprez" From: "Tom Deprez" To: , , , , Subject: EuroPython news Date: Mon, 16 Jun 2003 14:43:45 +0200 Organization: Aragne.com MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2800.1158 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165 X-Antivirus: avast! (VPS 13/06/2003), Outbound message X-Antivirus-Status: Clean X-Spam-Status: SPAM (lists-zope 0.840) Content-Length: 443 Hi all, The EuroPython 2003 brochure is ready and available for download at: http://www.europython.org/other/brochure Interested in the countries represented at EuroPython? Here is a list: Spain Belgium Germany Austria United Kingdom Tunisia Portugal Switzerland Ireland Sweden United States France Lithuania Finland Poland Denmark Czech Republic Italy Netherlands Canada See you within 8 days! Regards, EuroPython --------------------------------------------- As was this one: --------------------------------------------- Return-Path: Envelope-To: zope-dev@zope.org Received: from [62.70.18.196] (helo=dns.activemedia.no) by mail.python.org with esmtp (Exim 4.05) id 19SKzr-0000NR-00 for zope-dev@zope.org; Tue, 17 Jun 2003 14:15:55 -0400 Received: from amavis by dns.activemedia.no with scanned-ok (Exim 3.36 #1) id 19SKzH-0002Fx-00 for zope-dev@zope.org; Tue, 17 Jun 2003 20:15:19 +0200 Received: from nidelven by dns.activemedia.no with local (Exim 3.36 #1) id 19SKzG-0002Fj-00 for zope-dev@zope.org; Tue, 17 Jun 2003 20:15:18 +0200 From: "Morten W. Petersen" To: zope-dev@zope.org Subject: Renaming a product X-Mailer: NeoMail 1.25 X-IPAddress: 80.202.17.36 MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Message-Id: Date: Tue, 17 Jun 2003 20:15:18 +0200 X-Virus-Scanned: by AMaViS 0.3.12 X-AntiAbuse: This header was added to track abuse, please include it with any +abuse report X-AntiAbuse: Primary Hostname - dns.activemedia.no X-AntiAbuse: Original Domain - zope.org X-AntiAbuse: Originator/Caller UID/GID - [32940 1441] / [32940 1441] X-AntiAbuse: Sender Address Domain - dns.activemedia.no X-Spam-Status: SPAM (lists-zope 0.854) Content-Length: 304 Hi all, does anyone know how to rename a product, so that instances of the product stored in ZODB will work (with the new name) after the rename? Thanks, Morten W. Petersen -- Technologies: Zope, Linux, Python, HTML, CSS, PHP Homepage: http://www.nidelven-it.no Phone number: (+47) 45 44 00 69 --------------------------------------------- -- Martijn Pieters | Software Engineer mailto:mj@zope.com | Zope Corporation http://www.zope.com/ | Creators of Zope http://www.zope.org/ --------------------------------------------- From greg at python.org Fri Jun 20 23:06:30 2003 From: greg at python.org (Greg Ward) Date: Fri Jun 20 22:06:34 2003 Subject: [spambayes-dev] Re: 1070 spam, 1 false positive In-Reply-To: <20030620140844.GC6654@zope.com> References: <20030620021425.GA28324@cthulhu.gerg.ca> <20030620140844.GC6654@zope.com> Message-ID: <20030621020630.GD29094@cthulhu.gerg.ca> On 20 June 2003, Martijn Pieters said: > Sorry to be a party pooper, but there were more false positives; I rescued > 2 earlier this week. The following message was also marked as spam: Darn. But there's good news: > From: "Tom Deprez" > To: , > , > , > , > > Subject: EuroPython news > Date: Mon, 16 Jun 2003 14:43:45 +0200 This one was rejected fairly early in the Spambayes regime. I just scored it with the current training set, and it scored < 0.1. Also, for some reason the envelope recipient of that message was *just* zope-announce@zope.org, in spite of what the "To" header says. I bet if that message had really been sent to europython@python.org, it would have been flagged UNSURE. No way to tell now, though, since I don't have the training DB from Monday. > From: "Morten W. Petersen" > To: zope-dev@zope.org > Subject: Renaming a product > X-Mailer: NeoMail 1.25 > X-IPAddress: 80.202.17.36 > MIME-Version: 1.0 > Content-Type: text/plain; charset=iso-8859-1 > Message-Id: > Date: Tue, 17 Jun 2003 20:15:18 +0200 > X-Virus-Scanned: by AMaViS 0.3.12 > X-AntiAbuse: This header was added to track abuse, please include it with > any > +abuse report > X-AntiAbuse: Primary Hostname - dns.activemedia.no > X-AntiAbuse: Original Domain - zope.org > X-AntiAbuse: Originator/Caller UID/GID - [32940 1441] / [32940 1441] > X-AntiAbuse: Sender Address Domain - dns.activemedia.no > X-Spam-Status: SPAM (lists-zope 0.854) And this one was treated very badly because of the X-AntiAbuse headers; here's how it scores with the current DB: Y 0.869 save/ham/cur/19SKzr-0000NR-00:2,S '*H*': 0.060 '*S*': 0.797 'all,': 0.065 'message-id:skip:d 10': 0.065 'zodb': 0.065 'does': 0.086 'product,': 0.092 'thanks,': 0.092 '(with': 0.155 'instances': 0.155 'python,': 0.155 'return-path:skip:d 10': 0.155 'date:0200': 0.173 'date:Tue': 0.191 'anyone': 0.230 'content-type:text/plain': 0.266 'received:62': 0.303 'product': 0.379 'know': 0.380 'header:Received:3': 0.388 'to:no real name:2**0': 0.610 'date:Jun': 0.627 'after': 0.635 'charset:iso-8859-1': 0.641 'work': 0.645 'proto:http': 0.656 'stored': 0.666 'new': 0.715 'to:addr:zope-dev': 0.789 'to:dev': 0.789 'url:www': 0.800 'number:': 0.811 'x-antiabuse:Address': 0.811 'x-antiabuse:Caller': 0.811 'x-antiabuse:Domain': 0.811 'x-antiabuse:GID': 0.811 'x-antiabuse:Hostname': 0.811 'x-antiabuse:Original': 0.811 'x-antiabuse:Originator': 0.811 'x-antiabuse:Primary': 0.811 'x-antiabuse:Sender': 0.811 'x-antiabuse:This': 0.811 'x-antiabuse:UID': 0.811 'x-antiabuse:abuse': 0.811 'x-antiabuse:added': 0.811 'x-antiabuse:any': 0.811 'x-antiabuse:header': 0.811 'x-antiabuse:include': 0.811 'x-antiabuse:please': 0.811 'x-antiabuse:report': 0.811 'x-antiabuse:track': 0.811 'x-antiabuse:was': 0.811 'x-antiabuse:with': 0.811 'x-antiabuse:zope.org': 0.811 'phone': 0.971 But if I add x-antiabuse to basic_header_skip, it comes through fine: N 0.085 save/ham/cur/19SKzr-0000NR-00:2,S '*H*': 0.877 '*S*': 0.047 'all,': 0.065 'message-id:skip:d 10': 0.065 'zodb': 0.065 'does': 0.086 'product,': 0.092 'thanks,': 0.092 '(with': 0.155 'instances': 0.155 'python,': 0.155 'return-path:skip:d 10': 0.155 'date:0200': 0.173 'date:Tue': 0.191 'anyone': 0.230 'content-type:text/plain': 0.266 'received:62': 0.303 'product': 0.379 'know': 0.380 'header:Received:3': 0.388 'to:no real name:2**0': 0.610 'date:Jun': 0.627 'after': 0.635 'charset:iso-8859-1': 0.641 'work': 0.645 'proto:http': 0.656 'stored': 0.666 'new': 0.715 'to:addr:zope-dev': 0.789 'to:dev': 0.789 'url:www': 0.800 'number:': 0.811 'phone': 0.971 I'm building training DBs with x-antiabuse excluded now, to see how it helps/hurts. Another lively Friday night chez Greg... Greg -- Greg Ward http://www.gerg.ca/ Never put off till tomorrow what you can put off till the day after tomorrow. From noreply at sourceforge.net Sat Jun 21 05:47:08 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat Jun 21 07:47:18 2003 Subject: [spambayes-dev] [ spambayes-Bugs-758360 ] 'OptionsClass' object has no attribute 'skip_max_word_size' Message-ID: Bugs item #758360, was opened at 2003-06-21 13:47 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=758360&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Wolfgang Hartl (wuff42) Assigned to: Mark Hammond (mhammond) Summary: 'OptionsClass' object has no attribute 'skip_max_word_size' Initial Comment: Windows 2000 SP3, Outlook 2002 SP2 Spambayse Outlook Plugin, Binary Version => Plugin does not work (no anti-Spam item in toolbar). => File c:\%TEMP%\spambayes1.log contains the following error messages: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\wh100011\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=758360&group_id=61702 From noreply at sourceforge.net Sat Jun 21 15:19:16 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sat Jun 21 17:19:30 2003 Subject: [spambayes-dev] [ spambayes-Bugs-758552 ] Spambayes Outlook addin (dll registration failed) Message-ID: Bugs item #758552, was opened at 2003-06-21 14:19 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=758552&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jeffrey Williams (wjeff) Assigned to: Mark Hammond (mhammond) Summary: Spambayes Outlook addin (dll registration failed) Initial Comment: Spambayes Outlook Addin version 002 Download and ran installer, with Outlook shutdown, installation failed with the error that the dll failed to register, attempted to run "regsvr32.exe spambayes_addin.dll" from the spambayes directory, which resulted in the same error message. The install log file was created, but empty. Error Message: DllRegisterServer in spambayes_addin.dll failed. Return code was: 0xc0000005 Outlook version info: Outlook 2000 SP3 (9.0.0.6627) Corporate or Workgroup - Security Update Hope this helps Jeff ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=758552&group_id=61702 From T.A.Meyer at massey.ac.nz Sun Jun 22 15:14:17 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Sat Jun 21 22:14:55 2003 Subject: [spambayes-dev] A3 release Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F130207ED35@its-xchg4.massey.ac.nz> As you may have noticed, I released a3 just now. I don't have the ability to create a tar.gz package right at the moment, so if someone wants to do that for me and add it, that would be great. If not, I'll try to do it tomorrow. This is somewhat later than the Friday I said, but never mind. This is also my first sf release, so I might have screwed something up. If you notice something, please fix it if you can! =Tony Meyer From noreply at sourceforge.net Sun Jun 22 20:51:23 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 22 22:51:28 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-755098 ] IMAP Filter: progress indicator Message-ID: Feature Requests item #755098, was opened at 2003-06-16 13:58 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=755098&group_id=61702 Category: None Group: None >Status: Closed Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: IMAP Filter: progress indicator Initial Comment: > Would be helpul if imapfilter.py -tv or -cv (note the 'v's) were to: > > a) Show some indication of progress. Say, print a "." for > every 10 messages whose headers are read, or a "*" if any of > those 10 messages had their bodies read too. > > b). For classification, print a summary of how many spam, ham > and unsure were classified. This could be a separate switch, > or part of -v. Doing anything over dialup is slow, and > checking INBOX, INBOX.Spam and INBOX.Unsure after every > classification is a chore. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-23 14:51 Message: Logged In: YES user_id=552329 (a) With the 'v' switch a '.' is now printed for every message header that is retrieved (training or filtering), and this is replaced with a '*' if the body is also received. It's much simpler to print for every message rather than every 10. Let me know if this isn't sufficient. (b) Done. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=755098&group_id=61702 From noreply at sourceforge.net Sun Jun 22 20:56:22 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 22 22:56:35 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-680629 ] Outlook plugin: Delete as spam marks as read Message-ID: Feature Requests item #680629, was opened at 2003-02-05 15:30 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=680629&group_id=61702 Category: Outlook Group: None >Status: Closed Priority: 1 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Mark Hammond (mhammond) Summary: Outlook plugin: Delete as spam marks as read Initial Comment: Personally I think it would be nice if the "delete as spam" button marked the mail item as read. Note that I'm not saying that mail that is filtered as spam should be marked as read - it shouldn't (by default). If others agree, this would be a nice addition. Perhaps as an option in the prefs. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-23 14:56 Message: Logged In: YES user_id=552329 Done. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-05-04 13:43 Message: Logged In: YES user_id=14198 Out of interest, which behaviour would you rather have? (a) Mark all messages classified as Spam as read. (b) Mark all messages actioned with the "Delete as Spam" button as read. My answer is that, in a perfect world, I'd like to see two checkboxes on the Anti-Spam Manager combo box, labelled: [] Mark messages as read when the filter moves them to the Spam folder [] Mark messages as read when they are moved using the "Delete as Spam" button ---------------------------------------------------------------------- Comment By: Tim Stone (timstone4) Date: 2003-02-27 04:45 Message: Logged In: YES user_id=645698 This is an interesting thread. I think it should move to the main list. Pop3proxy has a very similar configuration function, which manages options into bayescustomize.ini (by default). This is another area that we should solve the problem once... ---------------------------------------------------------------------- Comment By: Piers Haken (piersh) Date: 2003-02-08 00:38 Message: Logged In: YES user_id=10551 i don't care if you do this or not (since spambayes catches all my spam ;-) ), but please don't mark any automatically- filtered spam as 'read' - it would be a pain to check for FPs if you did. thx. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-02-08 00:05 Message: Logged In: YES user_id=14198 Fair enough :) ---------------------------------------------------------------------- Comment By: Paul Moore (pmoore) Date: 2003-02-07 22:09 Message: Logged In: YES user_id=113328 I'd like the "Mark as read" option. Most unsures and false negatives which are spam, I can identify by subject, and hence I don't open (and I don't use the preview pane). But it's not crucial - Ctrl-Q does a very quick "Mark as read" anyway... ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-02-05 16:17 Message: Logged In: YES user_id=552329 Agreed that it is not necessary. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-02-05 16:11 Message: Logged In: YES user_id=14198 Yep, I see that makring as read could be useful in that they have been reviewed, but then I would expect Outlook's normal mechanism to still work and mark it read. I have my preview pane mark as read after 2 seconds :) Re the INI file - my problem is that the GUI needs to modify these options, and I don't see how it is trivial to keep the fairly "free-form" INI file format supported by configparser, while only writing out certain elements and not others and also keeping comments etc intact. I'll make a deal - help me with the options problem, and I will give you 5 free option . Let's take it to email... ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-02-05 15:56 Message: Logged In: YES user_id=552329 My reasoning was that if the user manually selects to delete it as spam, then it is as good as read. Those that are moving via the filter have not been read. Personally I still wade through the filtered spam to check it for false positives, and mark the messages as read as I go (so that the 'unread' display is the number of messages I haven't checked). If I choose delete as spam, I then have to go to the spam folder and mark it as read. In any case, no big deal if you disagree, it was just a thought :) Re: the ini file: looking at the ini, it doesn't seem to have anything that couldn't be in the GUI. Most of it would probably fit in the "advanced" dialog. It would probably be good if the ini was only for 'beta' options - anything that is for public use should be in the GUI. And if a 'beta' option moves to 'public', then it doesn't matter (much) if it breaks, because those using beta options should be upgrading anyway. Moving the existing settings (most of which should be exposed I think) would mean breaking existing code, but maybe just this once? Maybe this discussion should move to the list? (maybe I should have posted this there originally?) ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-02-05 15:41 Message: Logged In: YES user_id=14198 I'm not too sure this should happen unless the filter also marks the items as read - otherwise you still end up with many spam in the spam folder unread, and only the ones you move manually marked as read. I'm also kinda stuck about what to do with "options". Currently, options managed by the GUI are in a pickle, while other options are in the .ini file. I don't object to having new, outlook specific options in the INI file, but I do object to all our existing code breaking should we decide later to move this option into the GUI. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-02-05 15:31 Message: Logged In: YES user_id=552329 And who else to decide on this, but Mark :) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=680629&group_id=61702 From noreply at sourceforge.net Sun Jun 22 20:57:20 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 22 22:57:25 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-716437 ] Version information in GUI somewhere Message-ID: Feature Requests item #716437, was opened at 2003-04-07 11:52 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=716437&group_id=61702 Category: Outlook Group: None >Status: Closed Priority: 1 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Mark Hammond (mhammond) Summary: Version information in GUI somewhere Initial Comment: With the growing number of users, especially those using the binary, it would be good to have a version number printed somewhere in the GUI for people when they are reporting bugs. Greyed out text in the manager dialog, or even something in the about.html would work fine. I'll leave it to you to find somewhere appropriate :) ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-23 14:57 Message: Logged In: YES user_id=552329 Done ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-24 19:57 Message: Logged In: YES user_id=552329 If the version could be appended to the log file, that would be good, too. ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-04-07 12:01 Message: Logged In: YES user_id=552329 :) Well, for the binaries, your 001/002 system would work. For the full source releases, there's a __version__ attribute (1.0a2 at the moment, I think). I don't really know for CVS (maybe just 'cvs'?), but anyone using the cvs code should be able to describe when they retrieved it. ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-04-07 11:54 Message: Logged In: YES user_id=14198 I'm not sure *what* version to report though. I will find the "where" if you tell me that "what" ;) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=716437&group_id=61702 From tim.one at comcast.net Mon Jun 23 00:49:48 2003 From: tim.one at comcast.net (Tim Peters) Date: Sun Jun 22 23:51:06 2003 Subject: [spambayes-dev] 1070 spam, 1 false positive In-Reply-To: <20030620120317.GA29094@cthulhu.gerg.ca> Message-ID: [Greg Ward] > ... > Yup, forgot to post that last night. First here's my config filen -- > or at least the [Tokenizer] section: > > basic_header_tokenize: True That's a dangerous one -- although I think you've already figured out why the hard way. > basic_header_skip: received envelope-to delivered-to delivery-date > x-spam-flag x-spam-status content-type list-* The problem is that any random header line can yield a misleading clue by accident, and there may be no end of adding to this list. > record_header_absence: True > address_headers: from to cc sender reply-to > mine_received_headers: True Those should be fine (for a single-source corpus), although we've had reports of mixed results with mine_received_headers. I'm not sure why. "It stands to reason" that it should help . > 'date:2003': 0.663 > 'date:Jun': 0.681 Any idea where those came from? They have the form of synthesized tokens (keyword colon stuff), but I don't recall anything in the tokenizer that synthesizes tokens with keyword "date". From T.A.Meyer at massey.ac.nz Mon Jun 23 17:10:17 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon Jun 23 00:10:56 2003 Subject: [spambayes-dev] RE: [Spambayes] imapfilter Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1302206D5D@its-xchg4.massey.ac.nz> [Moved from spambayes@python.org] > I'll give both a spin in the next day or so (checking P...FLAGS and > trying your modification). Thanks. > > If anyone has both time and access to a server that has > > messages with this 'NotJunk' flag, then they could play > > around and figure out what the rules about it are. > I'll poke at this as well. That would be great. I googled about it, and there was one very similar message, but it didn't really hold any useful information. It does suggest that something is special about this flag. > And thanks for correcting my misunderstanding about how flags are > supposed to work. Obviously, I should read the RFC, > especially if I am to contribute a patch that doesn't break things > for other servers. My biggest mistake when originally writing imapfilter was to base it on another imap filter and fix things that didn't work, instead of going from the RFC in the first place. That said, the RFC is a pain to work through, and hopefully can be mostly used as a reference now. If you don't want to read the RFC, you can always run the idea behind the patch past -dev first, where hopefully someone will point out any potential problem :) > Should we move this discussion to -dev? Done :) =Tony Meyer From noreply at sourceforge.net Tue Jun 24 00:12:33 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 24 02:12:44 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-755095 ] IMAP Filter: move trained messages Message-ID: Feature Requests item #755095, was opened at 2003-06-16 13:56 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=755095&group_id=61702 Category: None Group: None >Status: Closed Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Tony Meyer (anadelonbrin) Summary: IMAP Filter: move trained messages Initial Comment: >From private email. > For dialup slowcoaches like me, it would be nice if there > were a trained_folder option in .spambayesrc, so that when I > run imapfilter.py -t, trained messages get moved from my > INBOX.untrained to INBOX.read. I can then set up my mail > reader to move read messages from INBOX to INBOX.untrained. > > If I *don't* move messages out of the training folder (by > hand ATM), it takes fifteen minutes or so to train, just > because of all the headers it reads. This seems a reasonable request. It would be better if it could somehow move the messages back to wherever they came from, but I don't see any way of doing that. ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-24 18:12 Message: Logged In: YES user_id=552329 Done in imapfilter 1.49 There are two new options, one for ham and one for spam. If they are set with the name of a folder then trained messages will be moved into that folder. Note that if you have several folders that include mail to be trained, mail from all the folders will end up in the same folder. This has the potential to confuse people (and I don't see an easy way out of it), so this option isn't exposed via the ui, you'll have to edit the config file manually to use it. Let me know if anything needs changing. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=755095&group_id=61702 From gward at python.net Tue Jun 24 12:52:26 2003 From: gward at python.net (Greg Ward) Date: Tue Jun 24 11:52:31 2003 Subject: [spambayes-dev] 1070 spam, 1 false positive In-Reply-To: References: <20030620120317.GA29094@cthulhu.gerg.ca> Message-ID: <20030624155226.GA4027@cthulhu.gerg.ca> On 22 June 2003, Tim Peters said: > > basic_header_tokenize: True > > That's a dangerous one -- although I think you've already figured out why > the hard way. > > > basic_header_skip: received envelope-to delivered-to delivery-date > > x-spam-flag x-spam-status content-type list-* > > The problem is that any random header line can yield a misleading clue by > accident, and there may be no end of adding to this list. The thing is, every header on that list is there for a very good reason. But I can see your point: every addition *also* has a very good reason for it. Hmmm. I guess I should try it without basic_header_tokenize at all and see how it does. > > 'date:2003': 0.663 > > 'date:Jun': 0.681 > > Any idea where those came from? They have the form of synthesized tokens > (keyword colon stuff), but I don't recall anything in the tokenizer that > synthesizes tokens with keyword "date". Beats me. In my "default" corpus (right now: 418 ham, 583 spam, roughly half of both from June 2003), these tokens are unsurprisingly quite common: >>> h = hammie.open("db/default.db", usedb=True) >>> h.bayes.db["date:2003"] (283, 192) >>> h.bayes.db['date:Jun'] (317, 193) So *some* bit of code in there is tokenizing the "Date:" header. Seems like a good idea to me, since junk mail often has non-RFC-conformant date headers. Greg -- Greg Ward http://www.gerg.ca/ All of science is either physics or stamp collecting. From noreply at sourceforge.net Tue Jun 24 10:15:21 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 24 12:15:29 2003 Subject: [spambayes-dev] [ spambayes-Bugs-759911 ] Toolbar Icon Missing Message-ID: Bugs item #759911, was opened at 2003-06-24 11:15 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=759911&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Andrew Cira (acira) Assigned to: Mark Hammond (mhammond) Summary: Toolbar Icon Missing Initial Comment: "Recover from Spam Icon" is missing from the "Spam" folder and "Possible Spam" folder. However, "Delete as Spam" toolbar icon is visible in all three locations (Inbox, Spam, Possible Spam). The "Recover from Spam Icon" had been visible up until yesterday afternoon. I verified that the "SpamBayes.OutlookAddin" is in fact checked in the Com Add-Ins. I used the SpamBayes-Outlook-Setup-002.exe, so this is a binary installation. Attached are logfiles. Workstation Info: Dell Dimension 4100 Windows 2000 service pack 3 Microsoft Office 2000 If you need more information, please let me know. Andrew Cira ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=759911&group_id=61702 From noreply at sourceforge.net Tue Jun 24 10:26:33 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 24 12:26:36 2003 Subject: [spambayes-dev] [ spambayes-Bugs-759917 ] Pickle vs. DB: inconsistent behaviour when database missing Message-ID: Bugs item #759917, was opened at 2003-06-24 12:26 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=759917&group_id=61702 Category: hammie Group: None Status: Open Resolution: None Priority: 5 Submitted By: Greg Ward (gward) Assigned to: Nobody/Anonymous (nobody) Summary: Pickle vs. DB: inconsistent behaviour when database missing Initial Comment: hammie.py behaves inconsistently when the database file is missing, depending on whether it was run with -D (pickle) or -d (DBM). In particular, -D scores the message at 0.5, and -d crashes with a traceback. Example (neither spambayes.db or spambayes.pkl exist): $ hammie.py -f -D -p spambayes.pkl < $msg | grep "^X-Spambayes" X-Spambayes-Classification: unsure; 0.50 $ hammie.py -f -d -p spambayes.db < $msg | grep "^X-Spambayes" Traceback (most recent call last): File "/usr/local/bin/hammie.py", line 6, in ? spambayes.hammiebulk.main() File "/home/greg/lib/python/spambayes/hammiebulk.py", line 180, in main h = hammie.open(pck, usedb, mode) File "/home/greg/lib/python/spambayes/hammie.py", line 260, in open b = storage.DBDictClassifier(filename, mode) File "/home/greg/lib/python/spambayes/storage.py", line 147, in __init__ self.load() File "/home/greg/lib/python/spambayes/storage.py", line 155, in load self.dbm = dbmstorage.open(self.db_name, self.mode) File "/home/greg/lib/python/spambayes/dbmstorage.py", line 59, in open return f(*args) File "/home/greg/lib/python/spambayes/dbmstorage.py", line 41, in open_best return f(*args) File "/home/greg/lib/python/spambayes/dbmstorage.py", line 12, in open_db3hash return bsddb3.hashopen(*args) File "/usr/local/lib/python2.2/site-packages/bsddb3/__init__.py", line 160, in hashopen d.open(file, _db.DB_HASH, flags, mode) bsddb3._db.DBNoSuchFileError: (2, 'No such file or directory') Arguably, both behaviours are wrong: hammie.py should terminate with a clear, simple, "No such file or directory" error message. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=759917&group_id=61702 From tim.one at comcast.net Tue Jun 24 15:31:29 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Jun 24 14:34:06 2003 Subject: [spambayes-dev] 1070 spam, 1 false positive In-Reply-To: <20030624155226.GA4027@cthulhu.gerg.ca> Message-ID: >>> basic_header_tokenize: True >> That's a dangerous one -- although I think you've already figured >> out why the hard way. >>> basic_header_skip: received envelope-to delivered-to delivery-date >>> x-spam-flag x-spam-status content-type list-* >> The problem is that any random header line can yield a misleading >> clue by accident, and there may be no end of adding to this list. [Greg Ward] > The thing is, every header on that list is there for a very good > reason. Sure. I'm not saying basic_header_skip is dangerous, I'm saying basic_header_tokenize is dangerous. That's why it's off by default. basic_header_skip has no effect unless basic_header_skip is forced True. > But I can see your point: every addition *also* has a very > good reason for it. Hmmm. I guess I should try it without > basic_header_tokenize at all and see how it does. In early tests of mine, enabling basic_header_tokenize gave worse results. It's an experiment (added, IIRC, by Jeremy) which didn't get enough testing either way to decide. >>> 'date:2003': 0.663 >>> 'date:Jun': 0.681 >> Any idea where those came from? They have the form of synthesized >> tokens (keyword colon stuff), but I don't recall anything in the >> tokenizer that synthesizes tokens with keyword "date". I understand these now -- they're a side effect of enabling basic_header_tokenize. > Beats me. In my "default" corpus (right now: 418 ham, 583 spam, > roughly half of both from June 2003), these tokens are unsurprisingly > quite common: > >>>> h = hammie.open("db/default.db", usedb=True) >>>> h.bayes.db["date:2003"] > (283, 192) >>>> h.bayes.db['date:Jun'] > (317, 193) > > So *some* bit of code in there is tokenizing the "Date:" header. Yes, basic_header_tokenize tokenizes all header lines, except for those squashed via basic_header_skip. > Seems like a good idea to me, since junk mail often has > non-RFC-conformant date headers. basic_header_tokenize doesn't know anything about RFC compliance, it treats all header lines exactly the same way (as sequences of meaningless characters). If you want to experiment with compliance of Date headers, try Skip's extract_dow option. In tests that did show a weak but highly significant correlation between different Date times and spam-vs-ham, but it was too weak to make any difference to bottom-line results. From gward at python.net Tue Jun 24 15:36:52 2003 From: gward at python.net (Greg Ward) Date: Tue Jun 24 14:37:01 2003 Subject: [spambayes-dev] 1070 spam, 1 false positive In-Reply-To: References: <20030624155226.GA4027@cthulhu.gerg.ca> Message-ID: <20030624183652.GA4566@cthulhu.gerg.ca> On 24 June 2003, Tim Peters said: > > Seems like a good idea to me, since junk mail often has > > non-RFC-conformant date headers. > > basic_header_tokenize doesn't know anything about RFC compliance, it treats > all header lines exactly the same way (as sequences of meaningless > characters). I should have been more explicit. Tokens like "June" or "Canada/Eastern" in the "Date:" header are probably good spam indicators, since they would never appear in a correct "Date:" header -- they should be "Jun" or "-0500". Greg -- Greg Ward http://www.gerg.ca/ Monday is an awful way to spend one seventh of your life. From noreply at sourceforge.net Tue Jun 24 14:48:29 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 24 16:48:32 2003 Subject: [spambayes-dev] [ spambayes-Bugs-760062 ] Traceback untraining a single message Message-ID: Bugs item #760062, was opened at 2003-06-24 16:48 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=760062&group_id=61702 Category: hammie Group: None Status: Open Resolution: None Priority: 5 Submitted By: Greg Ward (gward) Assigned to: Greg Ward (gward) Summary: Traceback untraining a single message Initial Comment: If I use hammie.py to untrain on a single message, it crashes with NameError: $ hammie.py -d -p 10.db -U -s msg Untraining spam (msg): 1 Traceback (most recent call last): File "/usr/local/bin/hammie.py", line 6, in ? spambayes.hammiebulk.main() File "/home/greg/lib/python/spambayes/hammiebulk.py", line 204, in main h.store() File "/home/greg/lib/python/spambayes/hammie.py", line 246, in store self.bayes.store() File "/home/greg/lib/python/spambayes/storage.py", line 192, in store assert word not in self.wordinfo, \ NameError: global name 'word' is not defined ('msg' is a Maildir containing exactly one message; it's a subset of the spam folder used to create this training database) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=760062&group_id=61702 From noreply at sourceforge.net Tue Jun 24 14:52:40 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Tue Jun 24 16:52:52 2003 Subject: [spambayes-dev] [ spambayes-Bugs-760062 ] Traceback untraining a single message Message-ID: Bugs item #760062, was opened at 2003-06-24 16:48 Message generated for change (Comment added) made by gward You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=760062&group_id=61702 Category: hammie Group: None Status: Open Resolution: None Priority: 5 Submitted By: Greg Ward (gward) >Assigned to: Mark Hammond (mhammond) Summary: Traceback untraining a single message Initial Comment: If I use hammie.py to untrain on a single message, it crashes with NameError: $ hammie.py -d -p 10.db -U -s msg Untraining spam (msg): 1 Traceback (most recent call last): File "/usr/local/bin/hammie.py", line 6, in ? spambayes.hammiebulk.main() File "/home/greg/lib/python/spambayes/hammiebulk.py", line 204, in main h.store() File "/home/greg/lib/python/spambayes/hammie.py", line 246, in store self.bayes.store() File "/home/greg/lib/python/spambayes/storage.py", line 192, in store assert word not in self.wordinfo, \ NameError: global name 'word' is not defined ('msg' is a Maildir containing exactly one message; it's a subset of the spam folder used to create this training database) ---------------------------------------------------------------------- >Comment By: Greg Ward (gward) Date: 2003-06-24 16:52 Message: Logged In: YES user_id=14422 It looks like the assert in store() could never have worked, since 'word' is never defined in that method. Reassigning to mhammond, since he wrote that code (rev 1.7-1.8, late May 2003). ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=760062&group_id=61702 From gward at python.net Tue Jun 24 18:49:26 2003 From: gward at python.net (Greg Ward) Date: Tue Jun 24 17:49:30 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies In-Reply-To: <20030613011928.GB6245@cthulhu.gerg.ca> References: <20030613011928.GB6245@cthulhu.gerg.ca> Message-ID: <20030624214925.GA4644@cthulhu.gerg.ca> On 12 June 2003, I said: > I'm getting inconsistent results using the same training corpus when I > store the database to a pickle vs a DB file. Here's how I created the > training databases (once DB, once pickle): I've spent most of this afternoon digging into this, and I am utterly, completely, totally, absolutely STUMPED. I need help. Here's my test setup: * corpus with 10 spam, 10 ham * train twice to "10.db" and "10.pkl" -- the former a Berkeley DB file, the latter a pickle I'm having at least two problems, both related to score/unlearn/score/relearn/score on a single message. Specifically: * the second score (post unlearning) is different for the two storages: the DB file scores my test message 1.000 after removing it from the training database, and the pickle file scores it 0.056. (The message in question is a spam that doesn't look like other spam, so the results with the pickle file make more sense to me.) * at the end of the full cycle, the pickle database is unchanged (verified by diff'ing the output of dbExpImp), but the DB storage is different. Specifically, it appears that the spam count of every token in the unlearned/relearned message is incremented by one. The final clue: this weird behaviour only happens using my 'simplescore' script (which I'll attach) (and yes, it is simple). If I score/unlearn/score/relearn/score with five distinct invocations of hammie.py, things appear to work just fine, and I get identical results with the pickle and DB storages. I've scoured my simplescore script to see if there's anything screwy there, but I sure can't see it. Either it needs a second pair of eyeballs, or there's something wrong with untraining/retraining on a message in a DB storage within the same process. So, if you have a minute, could you look over the attached simplescore script and see if it looks sane to you, ie. does it unlearn/relearn in the correct way? (Note that there are two implementations in there: one that tokenizes the message only once and uses slightly under-the-hood calls, and the other that uses top-level calls and tokenizes the message many times. I get identical results with both versions.) And if you have ten minutes, could you download http://www.gerg.ca/spambayes-test-unlearn.tar.gz and unpack it, and then try cd test python simplescore 10.db spam/cur/19S0vv-0003B0-00:2,S python simplescore 10.pkl spam/cur/19S0vv-0003B0-00:2,S and see if *you* can figure out what the hell is going on. Note that 'save/' contains copies of 10.db and 10.pkl as originally trained. Greg -- Greg Ward http://www.gerg.ca/ No man is an island, but some of us are long peninsulas. -------------- next part -------------- #!/usr/bin/env python2.2 import sys from spambayes import hammie from spambayes import tokenizer def score(hammie, tokens, label): (prob, clues) = hammie.bayes.spamprob(tokens, True) high_clues = ["%s:%.3f" % clue for clue in clues[-5:]] high_clues = ", ".join(high_clues) print "%s: %.3f: %s" % (label, prob, high_clues) def allscores(hammie, message): # Tokenize message once bayes = hammie.bayes tokens = list(tokenizer.tokenize(message)) # Score with that message (presumably) in the database. score(hammie, tokens, "initial") # Untrain (ie. remove this message from the database) and score again # (assume the message is spam). bayes.unlearn(tokens, True) score(hammie, tokens, "unlearn") # Retrain and score one last time. Should give identical results # to the initial scoring... but doesn't! bayes.learn(tokens, True) score(hammie, tokens, "relearn") # def score(hammie, msg, label): # (prob, clues) = hammie.score(msg, True) # high_clues = ["%s:%.3f" % clue for clue in clues[-5:]] # high_clues = ", ".join(high_clues) # print "%s: %.3f: %s" % (label, prob, high_clues) # def allscores(hammie, message): # score(hammie, message, "initial") # hammie.untrain(message, True) # score(hammie, message, "unlearn") # hammie.train(message, True) # score(hammie, message, "relearn") def main(): args = sys.argv[1:] if len(args) != 2: sys.exit("usage: simplescore db_file msg_file") (db_filename, msg_filename) = sys.argv[1:] usedb = db_filename.endswith(".db") # assume pickle otherwise h = hammie.open(db_filename, usedb=usedb, mode="w") message = open(msg_filename).read() allscores(h, message) h.bayes.store() main() From T.A.Meyer at massey.ac.nz Wed Jun 25 13:28:17 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Jun 24 20:29:32 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13022072A5@its-xchg4.massey.ac.nz> >From my testing: * Unless dbexpimp.py is broken, the 10.pkl and 10.db you supplied were not identical. There are two tokens that have different counts: 'header:MIME-Version:1' and header:Mime-Version:1 (2,5 vs 4,3 and 4,3 vs 2,3 respectively). I'm not sure what this means! * I couldn't use the same test messages as you because the filenames weren't valid for a win32 system and I couldn't unpack them. I grabbed a random message of my own to use as a test and changed the simplescore script, adding a initial learn (since I can't unlearn one that's not in the db). * Is the message that you give to simplescore one of the ones that was trained? It should be, because you can't untrain a message that hasn't been trained (you might get negative counts). However, even given all of this, I also get the db count 1 higher for each token. This problem goes away if after every learn/unlearn call there is a save call [1]. This would be why the problem doesn't occur running hammie multiple times. I'll keep looking... =Tony Meyer [1] Although this gave me the db error that was recently submitted as a bug, which is also where Greg ran into it, I presume. I think 'word' should be 'key' - it makes sense and seems to work. I think (sorry Mark!) that it's in here that there is a problem. From T.A.Meyer at massey.ac.nz Wed Jun 25 13:34:12 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Jun 24 20:34:50 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13022072A9@its-xchg4.massey.ac.nz> > However, even given all of this, I also get the db count 1 > higher for each token. This problem goes away if after every > learn/unlearn call there is a save call [1]. This would be ~~~~ I mean store(), of course. =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Jun 25 13:53:04 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Jun 24 20:54:17 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13022072BB@its-xchg4.massey.ac.nz> > This problem goes away if after every > learn/unlearn call there is a [store] call. This would be > why the problem doesn't occur running hammie multiple times. > > I'll keep looking... In case anyone is looking at this at the same time. I have almost narrowed this down. Each word whose count is wrong is a "singleton" (as defined in storage.py), and is not in the changed words list. The comments say that none of the singletons should be in the changed words list, but many are (but they end up fine!). I'll keep going... =Tony Meyer From T.A.Meyer at massey.ac.nz Wed Jun 25 14:06:55 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Tue Jun 24 21:08:22 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13022072C7@its-xchg4.massey.ac.nz> Ok, I think I have this figured out now. The DBDictClassifier currently tries to be efficient by not storing "singleton" words (i.e. words that have only appeared once) in the wordinfo cache, but saving them directly to the database. This is all fine, except that they are *not* saved to the database until store() is called. This means that between a call to _wordinfoset() and a call to store() the counts are unreliable. To get around this, we need to either sync the db in the _wordinfoset function (seems to be expensive), or cache the words after all, or something else. Anyway, this is how it seems to me - I could be wrong! If Mark or someone more familiar with this stuff could look at it, that would be great. =Tony Meyer From tim.one at comcast.net Tue Jun 24 23:22:53 2003 From: tim.one at comcast.net (Tim Peters) Date: Tue Jun 24 22:24:09 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13022072C7@its-xchg4.massey.ac.nz> Message-ID: [Tony Meyer] > Ok, I think I have this figured out now. > > The DBDictClassifier currently tries to be efficient by not storing > "singleton" words (i.e. words that have only appeared once) in the > wordinfo cache, but saving them directly to the database. This is all > fine, except that they are *not* saved to the database until store() > is called. This means that between a call to _wordinfoset() and a > call to store() the counts are unreliable. You're saying that if d is an open Shelf object, then after d[string] = whatever the value of the access expression d[string] is unreliable unless a d.sync() call intervenes? That's scary, if so -- or a bug. The "whatever" thingies we're storing are not mutable objects (they're immutable tuples), so the caution about *mutating* Shelf values in a Shelf opened with the default writeback=False doesn't apply in spambayes. > To get around this, we need to either sync the db in the _wordinfoset > function (seems to be expensive), or cache the words after all, or > something else. > > Anyway, this is how it seems to me - I could be wrong! If Mark or > someone more familiar with this stuff could look at it, that would be > great. I haven't used this part of the code in real life. Other questions that pop up: + Why does _wordinfoset() start with if record and ... ? For example, how could record==None possibly arise? + If a word is deleted, what's stopping _wordinfoget() from sucking it out of the database anyway? That is, I believe the except clause in _wordinfoget() should start with: if self.changed_words.get(word) is WORD_DELETED: return None From noreply at sourceforge.net Wed Jun 25 00:40:35 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Wed Jun 25 02:40:43 2003 Subject: [spambayes-dev] [ spambayes-Bugs-760286 ] spambayes_admin.dll failed to register Message-ID: Bugs item #760286, was opened at 2003-06-25 16:40 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=760286&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: John Mamutil (javinm) Assigned to: Mark Hammond (mhammond) Summary: spambayes_admin.dll failed to register Initial Comment: This bug has already been reported but it did not have a log file entry - so I have provided what appeared in my log file: Traceback (most recent call last): File "", line 15, in DllRegisterServer File "E:\src\pythonex\com\win32com\server\register.py" , line 388, in RegisterClasses IndexError: list index out of range Hoping to see a fix soon. will installing the Source code version help? Thanks, John ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=760286&group_id=61702 From banneker at progress.org Wed Jun 25 11:55:53 2003 From: banneker at progress.org (Hanno Beck) Date: Wed Jun 25 10:56:04 2003 Subject: [spambayes-dev] proposed question for Spambayes FAQ Message-ID: Dear Friends, If other people run into this problem, then maybe it's a good candidate for the FAQ: I have the Outlook plugin of Spambayes and every time I exit from Outlook, it forgets all the training. Whether I reboot the computer or simply close Outlook and reopen it, either way it doesn't retain any training information. What might be the solution to this? (This is on a Windows 98 machine running Outlook 2000.) Thanks in advance, Hanno T. Beck The Benjamin Banneker Center From skip at pobox.com Wed Jun 25 16:37:53 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Jun 25 16:38:02 2003 Subject: [spambayes-dev] Interesting unsure Message-ID: <16122.2081.275212.576909@montanaro.dyndns.org> I got an interesting spam just now. Besides the very short message which was little more than an tag wrapping an tag: doorknob WI-6-5RJ0 molder y-nbs

e--HRdW detect 0O-Q-Xmn marijuana (interesting nonsense words - doorknob? molder?) the subject had umlauts over many of the vowels: ousp W?nt to mak? l?ve lik? a te?n? so of course, I got several tokens which the classifier ignored. The debug and classification headers were X-Spambayes-Debug: '*H*': 0.21; '*S*': 0.66; 'doorknob': 0.09; 'subject:?': 0.23; 'detect': 0.26; 'header:Message-ID:1': 0.37; 'header:Reply-To:1': 0.61; 'url:com': 0.61; 'url:www': 0.67; 'header:Received:2': 0.76; 'subject:\xf6': 0.84; 'content-type:text/html': 0.87; 'url:gif': 0.93 X-Spambayes-Classification: unsure; 0.73 It's not clear much can be done, though it might be interesting to try an option to map Latin-1 accented characters to their unadorned ASCII counterparts, at least in subjects (strip_subject_accents?). For instance, "subject:teen" and "subject:love" are both pretty spammy in my database but "subject:te?n" and "subject:l?ve" don't occur at all. Even "subject:make" is more spammy than hammy. The problem with trying such an experiment isn't that it might not be worthwhile, but that if it's a new spammer technique, there won't be many messages in our existing spam/ham databases which would exercise the technique. Skip From popiel at wolfskeep.com Wed Jun 25 15:07:06 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Wed Jun 25 17:07:11 2003 Subject: [spambayes-dev] Interesting unsure In-Reply-To: Message from Skip Montanaro of "Wed, 25 Jun 2003 15:37:53 CDT." <16122.2081.275212.576909@montanaro.dyndns.org> References: <16122.2081.275212.576909@montanaro.dyndns.org> Message-ID: <20030625210706.963BA2DDF0@cashew.wolfskeep.com> In message: <16122.2081.275212.576909@montanaro.dyndns.org> Skip Montanaro writes: >I got an interesting spam just now. [...] >the subject had umlauts over many of the vowels: > > ousp W=E4nt to mak=EB l=F6ve lik=EB a te=EBn? > >so of course, I got several tokens which the classifier ignored. > > X-Spambayes-Debug: '*H*': 0.21; '*S*': 0.66; 'doorknob': 0.09; >=09 'subject:?': 0.23; 'detect': 0.26; 'header:Message-ID:1': 0.37; >=09 'header:Reply-To:1': 0.61; 'url:com': 0.61; 'url:www': 0.67; >=09 'header:Received:2': 0.76; 'subject:\xf6': 0.84; >=09 'content-type:text/html': 0.87; 'url:gif': 0.93 > X-Spambayes-Classification: unsure; 0.73 > >It's not clear much can be done, though it might be interesting to try >an option to map Latin-1 accented characters to their unadorned ASCII >counterparts, at least in subjects (strip_subject_accents?). I suspect that would have serious detrimental effects for foreign language users. >The problem with trying such an experiment isn't that it might not be >worthwhile, but that if it's a new spammer technique, there won't be >many messages in our existing spam/ham databases which would exercise >the technique. I don't see this as any different from any of the other neologisms that spammers come up with; if they persist in using such words (and you're still training), then the odd words with accents will quickly become strong spam indicators. No need for us to do anything... it's already going to be handled properly. - Alex From barry at python.org Wed Jun 25 22:37:25 2003 From: barry at python.org (Barry Warsaw) Date: Wed Jun 25 17:37:26 2003 Subject: [spambayes-dev] Interesting unsure In-Reply-To: <16122.2081.275212.576909@montanaro.dyndns.org> References: <16122.2081.275212.576909@montanaro.dyndns.org> Message-ID: <1056576577.14344.28.camel@geddy> On Wed, 2003-06-25 at 16:37, Skip Montanaro wrote: > the subject had umlauts over many of the vowels: > > ousp W?nt to mak? l?ve lik? a te?n? This is a good argument for tiered spam defenses. I doubt such a message would get through python.org for example. 8-bit characters in Subject headers are not legal according to RFC 2822. -Barry From skip at pobox.com Wed Jun 25 17:40:23 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Jun 25 17:40:35 2003 Subject: [spambayes-dev] Interesting unsure In-Reply-To: <20030625210706.963BA2DDF0@cashew.wolfskeep.com> References: <16122.2081.275212.576909@montanaro.dyndns.org> <20030625210706.963BA2DDF0@cashew.wolfskeep.com> Message-ID: <16122.5831.713436.903045@montanaro.dyndns.org> >> It's not clear much can be done, though it might be interesting to >> try an option to map Latin-1 accented characters to their unadorned >> ASCII counterparts, at least in subjects (strip_subject_accents?). Alex> I suspect that would have serious detrimental effects for foreign Alex> language users. My thought was that whether or not to enable the option would be under user control. If it's a good spaminator for me why should I suffer because the effbot's native language includes accented characters? (No offsense intended toward any Swedes who might be reading this, BTW. ;-) >> The problem with trying such an experiment isn't that it might not be >> worthwhile, but that if it's a new spammer technique, there won't be >> many messages in our existing spam/ham databases which would exercise >> the technique. Alex> I don't see this as any different from any of the other neologisms Alex> that spammers come up with; if they persist in using such words Alex> (and you're still training), then the odd words with accents will Alex> quickly become strong spam indicators. No need for us to do Alex> anything... it's already going to be handled properly. Except note that they weren't accenting every vowel and there were many other accents to choose from. The message I received had "mak?" and "te?n". There are several other accented characters with "a" or "e" as their base character. I would have to receive many messages using this technique to build up enough such odd words to make a difference. I think that's the spammer's basic idea with this - keep it readable but fly below the word count radar. Like I said, "subject:love" is very spammy for me, but I'd never seem "subject:l?ve" before, so it wasn't used to score the message. The fundamental problem when dealing with new spam techniques is (and will always be, I think) when to mount a counterattack. That's certainly the case here. Skip From skip at pobox.com Wed Jun 25 17:45:45 2003 From: skip at pobox.com (Skip Montanaro) Date: Wed Jun 25 17:45:54 2003 Subject: [spambayes-dev] Interesting unsure In-Reply-To: <1056576577.14344.28.camel@geddy> References: <16122.2081.275212.576909@montanaro.dyndns.org> <1056576577.14344.28.camel@geddy> Message-ID: <16122.6153.934275.332711@montanaro.dyndns.org> Barry> On Wed, 2003-06-25 at 16:37, Skip Montanaro wrote: >> the subject had umlauts over many of the vowels: >> >> ousp W?nt to mak? l?ve lik? a te?n? Barry> This is a good argument for tiered spam defenses. I doubt such a Barry> message would get through python.org for example. 8-bit Barry> characters in Subject headers are not legal according to RFC Barry> 2822. Maybe so, but what's to keep the spammers from doing the correct quoted-printable dance with the subject, you know: Subject: =?ISO-8859-1?...= ? Would I see the umlauts in my MUA or not? If so, it doesn't really matter whether they faked the header or not. I'm running Sendmail 8.11.6. It let it through. (The message was sent to webmaster@mojam.com.) Skip From popiel at wolfskeep.com Wed Jun 25 16:37:20 2003 From: popiel at wolfskeep.com (T. Alexander Popiel) Date: Wed Jun 25 18:37:23 2003 Subject: [spambayes-dev] Interesting unsure In-Reply-To: Message from Skip Montanaro of "Wed, 25 Jun 2003 16:40:23 CDT." <16122.5831.713436.903045@montanaro.dyndns.org> References: <16122.2081.275212.576909@montanaro.dyndns.org> <20030625210706.963BA2DDF0@cashew.wolfskeep.com> <16122.5831.713436.903045@montanaro.dyndns.org> Message-ID: <20030625223720.1CE152DDF0@cashew.wolfskeep.com> In message: <16122.5831.713436.903045@montanaro.dyndns.org> Skip Montanaro writes: > >Except note that they weren't accenting every vowel and there were >many other accents to choose from. The message I received had >"mak=EB" and "te=EBn". There are several other accented characters >with "a" or "e" as their base character. I would have to receive >many messages using this technique to build up enough such odd words >to make a difference. Also note that the message was already on the high end of unsure (0.78), and spammers don't seem to be all that creative in the nonsense words they use ("jvw", "sliceor", "tiper", and "tipor" have all shown up frequently enough for me so that they're strong spam indicators). It wouldn't take much to tip the mail into the spam bucket, and the spammers will probably stick with a couple misspellings more than long enough to be recognizable. >I think that's the spammer's basic idea with this - keep it readable >but fly below the word count radar. Certainly... and it'll work for about a day, each time they come up with new misspellings. It's probably too expensive (or too technical) for them to generate fresh accent permutations for every spam they send. >The fundamental problem when dealing with new spam techniques is (and >will always be, I think) when to mount a counterattack. That's >certainly the case here. Agreed. My gut feeling is not to bother until it's been an actual problem for a week. - Alex From gward at python.net Wed Jun 25 23:14:08 2003 From: gward at python.net (Greg Ward) Date: Wed Jun 25 22:14:16 2003 Subject: [spambayes-dev] Interesting unsure In-Reply-To: <1056576577.14344.28.camel@geddy> References: <16122.2081.275212.576909@montanaro.dyndns.org> <1056576577.14344.28.camel@geddy> Message-ID: <20030626021408.GA7700@cthulhu.gerg.ca> On 25 June 2003, Barry Warsaw said: > On Wed, 2003-06-25 at 16:37, Skip Montanaro wrote: > > > the subject had umlauts over many of the vowels: > > > > ousp W?nt to mak? l?ve lik? a te?n? > > This is a good argument for tiered spam defenses. I doubt such a > message would get through python.org for example. 8-bit characters in > Subject headers are not legal according to RFC 2822. Actually, it *would* get through python.org, because the regex I devised requires 3 8-bit characters in a row -- precisely to allow this sort of thing! (Actually it's to avoid penalizing the occasional European who uses a few unencoded Latin-1 chars, innocently unaware that (s)he's violating an RFC. Of couse this should never happen if people use correct MUAs...) Greg -- Greg Ward http://www.gerg.ca/ When you make your mark in the world, watch out for guys with erasers. From gward at python.net Wed Jun 25 23:19:21 2003 From: gward at python.net (Greg Ward) Date: Wed Jun 25 22:19:24 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13022072A5@its-xchg4.massey.ac.nz> References: <1ED4ECF91CDED24C8D012BCF2B034F13022072A5@its-xchg4.massey.ac.nz> Message-ID: <20030626021921.GB7700@cthulhu.gerg.ca> On 25 June 2003, Meyer, Tony said: > * Unless dbexpimp.py is broken, the 10.pkl and 10.db you supplied were > not identical. There are two tokens that have different counts: > 'header:MIME-Version:1' and header:Mime-Version:1 (2,5 vs 4,3 and 4,3 vs > 2,3 respectively). I'm not sure what this means! Oops -- I must have screwed up. I just retrained, and diff'ing the dbExpImp output shows that they're the same. > * I couldn't use the same test messages as you because the filenames > weren't valid for a win32 system and I couldn't unpack them. I grabbed > a random message of my own to use as a test and changed the simplescore > script, adding a initial learn (since I can't unlearn one that's not in > the db). Arggh, bloody Maildir. Almost but not *quite* the perfect mail folder format. Well, you seem to have figured it out anyways. > * Is the message that you give to simplescore one of the ones that was > trained? It should be, because you can't untrain a message that hasn't > been trained (you might get negative counts). Yes, I made that mistake a couple of times, and now I'm super-careful that the message being unlearned is indeed in the training corpus. Greg -- Greg Ward http://www.gerg.ca/ "Very funny, Scotty. Now beam my *clothes* down." From T.A.Meyer at massey.ac.nz Thu Jun 26 16:04:18 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Jun 25 23:04:57 2003 Subject: [spambayes-dev] Weird cvs commit traceback Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1302207674@its-xchg4.massey.ac.nz> I just committed an update to the website faq and got a strange traceback in my cvs window: Traceback (innermost last): File "/cvsroot/spambayes/CVSROOT/syncmail", line 336, in ? main() File "/cvsroot/spambayes/CVSROOT/syncmail", line 329, in main blast_mail(subject, people, specs[1:], contextlines, fromhost) File "/cvsroot/spambayes/CVSROOT/syncmail", line 253, in blast_mail resp = conn.sendmail(address, people, s.getvalue()) File "/usr/lib/python1.5/smtplib.py", line 452, in sendmail if not (200 <= self.ehlo()[0] <= 299): File "/usr/lib/python1.5/smtplib.py", line 305, in ehlo (code,msg)=self.getreply() File "/usr/lib/python1.5/smtplib.py", line 257, in getreply raise SMTPServerDisconnected("Connection unexpectedly closed") smtplib.SMTPServerDisconnected: Connection unexpectedly closed Traceback (innermost last): File "/cvsroot/spambayes/CVSROOT/syncmail", line 336, in ? main() File "/cvsroot/spambayes/CVSROOT/syncmail", line 329, in main blast_mail(subject, people, specs[1:], contextlines, fromhost) File "/cvsroot/spambayes/CVSROOT/syncmail", line 253, in blast_mail resp = conn.sendmail(address, people, s.getvalue()) File "/usr/lib/python1.5/smtplib.py", line 452, in sendmail if not (200 <= self.ehlo()[0] <= 299): File "/usr/lib/python1.5/smtplib.py", line 305, in ehlo (code,msg)=self.getreply() File "/usr/lib/python1.5/smtplib.py", line 257, in getreply raise SMTPServerDisconnected("Connection unexpectedly closed") smtplib.SMTPServerDisconnected: Connection unexpectedly closed Any idea what this means? (If it's relevant, I'm using TortoiseCVS on WinXP). =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Jun 26 16:05:54 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Jun 25 23:06:38 2003 Subject: [spambayes-dev] proposed question for Spambayes FAQ Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1302207678@its-xchg4.massey.ac.nz> > If other people run into this problem, then maybe it's a good > candidate for the FAQ: > > I have the Outlook plugin of Spambayes and every time I exit > from Outlook, it forgets all the training. Whether I reboot > the computer or simply close Outlook and reopen it, either > way it doesn't retain any training information. This probably is a good FAQ candidate, so I've added it. We can probably remove it once the next release (which fixes this) has been out for a while. It's online now, but here's the answer I wrote: """This is a known problem with the 002 release of the plug-in (or earlier). What is probably happening is that Outlook is not shutting down properly or completely, or is being forced to shut down too quickly. This version of the plug-in only saves training/configuration data on shut-down, so this can result in training/configuration being lost. To get around this problem, exit Outlook (not by restarting/shutting down the machine), and make sure that it is completely shut down (use the Task Manager to check). This should save your training and configuration. The 003 release (and later) saves as required, so avoids this issue.""" =Tony Meyer From tim.one at comcast.net Thu Jun 26 00:24:23 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed Jun 25 23:26:31 2003 Subject: [spambayes-dev] Weird cvs commit traceback In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1302207674@its-xchg4.massey.ac.nz> Message-ID: [Tony Meyer] > I just committed an update to the website faq and got a strange > traceback in my cvs window: > > Traceback (innermost last): > File "/cvsroot/spambayes/CVSROOT/syncmail", line 336, in ? > main() > File "/cvsroot/spambayes/CVSROOT/syncmail", line 329, in main > blast_mail(subject, people, specs[1:], contextlines, fromhost) > File "/cvsroot/spambayes/CVSROOT/syncmail", line 253, in blast_mail > resp = conn.sendmail(address, people, s.getvalue()) > File "/usr/lib/python1.5/smtplib.py", line 452, in sendmail > if not (200 <= self.ehlo()[0] <= 299): > File "/usr/lib/python1.5/smtplib.py", line 305, in ehlo > (code,msg)=self.getreply() > File "/usr/lib/python1.5/smtplib.py", line 257, in getreply > raise SMTPServerDisconnected("Connection unexpectedly closed") > smtplib.SMTPServerDisconnected: Connection unexpectedly closed > Traceback (innermost last): > File "/cvsroot/spambayes/CVSROOT/syncmail", line 336, in ? > main() > File "/cvsroot/spambayes/CVSROOT/syncmail", line 329, in main > blast_mail(subject, people, specs[1:], contextlines, fromhost) > File "/cvsroot/spambayes/CVSROOT/syncmail", line 253, in blast_mail > resp = conn.sendmail(address, people, s.getvalue()) > File "/usr/lib/python1.5/smtplib.py", line 452, in sendmail > if not (200 <= self.ehlo()[0] <= 299): > File "/usr/lib/python1.5/smtplib.py", line 305, in ehlo > (code,msg)=self.getreply() > File "/usr/lib/python1.5/smtplib.py", line 257, in getreply > raise SMTPServerDisconnected("Connection unexpectedly closed") > smtplib.SMTPServerDisconnected: Connection unexpectedly closed > > Any idea what this means? (If it's relevant, I'm using TortoiseCVS on > WinXP). Not to worry. This isn't your Python, it's an instance of Python running on the SourceForge box. It doesn't matter which client you're using. The CVS server is trying to send out the checkin mail, and had a problem with that on their end. Your change should still be checked in just fine, but the checkins list may not see email about it. From barry at python.org Thu Jun 26 04:27:39 2003 From: barry at python.org (Barry Warsaw) Date: Wed Jun 25 23:27:40 2003 Subject: [spambayes-dev] Interesting unsure In-Reply-To: <20030626021408.GA7700@cthulhu.gerg.ca> References: <16122.2081.275212.576909@montanaro.dyndns.org> <1056576577.14344.28.camel@geddy> <20030626021408.GA7700@cthulhu.gerg.ca> Message-ID: <1056598072.16044.0.camel@geddy> On Wed, 2003-06-25 at 22:14, Greg Ward wrote: > Actually, it *would* get through python.org, because the regex I devised > requires 3 8-bit characters in a row -- precisely to allow this sort of > thing! (Actually it's to avoid penalizing the occasional European who > uses a few unencoded Latin-1 chars, innocently unaware that (s)he's > violating an RFC. Of couse this should never happen if people use > correct MUAs...) Ha! From T.A.Meyer at massey.ac.nz Thu Jun 26 16:39:34 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Jun 25 23:40:09 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13022076C6@its-xchg4.massey.ac.nz> > Arggh, bloody Maildir. Almost but not *quite* the perfect > mail folder format. Well, you seem to have figured it out anyways. Well, definitely figured out the bit of the code that the error is in (because I can output a list of tokens that will be the ones that are different), and *probably* figured out the problem, but not a solution. I'm hoping Mark will get to these soon (when he has a chance), since he wrote that bit of the code and probably can see a solution. It is a fairly significant problem, although I'm not sure how many Spambayes apps it will effect (because I think most actually do store() after changing). (As an aside, his (skippinet) address seems to be broken today, or at least my mail to him is bouncing...he may have more urgent things to fix!) =Tony Meyer From T.A.Meyer at massey.ac.nz Thu Jun 26 16:47:23 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Wed Jun 25 23:48:05 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F13022076D3@its-xchg4.massey.ac.nz> > You're saying that if d is an open Shelf object, then after > > d[string] = whatever > > the value of the access expression > > d[string] > > is unreliable unless a > > d.sync() > > call intervenes? That's scary, if so -- or a bug. I believe so. It's reliable as long as 'string' was already a token, but not otherwise. It doesn't give a random value, at least, it just fails to update it. (So d[string] would return whatever value it had before the d[string] = whatever line). > I haven't used this part of the code in real life. You and your pickles ;) > Other questions that pop up: > > + Why does _wordinfoset() start with > > if record and ... > > ? For example, how could record==None possibly arise? I wondered this too. I think it was Mark coding for all possible situations. > + If a word is deleted, what's stopping _wordinfoget() from sucking it > out of the database anyway? That is, I believe the except clause in > _wordinfoget() should start with: > > if self.changed_words.get(word) is WORD_DELETED: > return None I didn't notice this, but your logic makes sense to me. =Tony Meyer From tim.one at comcast.net Thu Jun 26 00:49:31 2003 From: tim.one at comcast.net (Tim Peters) Date: Wed Jun 25 23:50:29 2003 Subject: [spambayes-dev] Interesting unsure In-Reply-To: <16122.2081.275212.576909@montanaro.dyndns.org> Message-ID: [Skip] > ... > the subject had umlauts over many of the vowels: > > ousp W?nt to mak? l?ve lik? a te?n? > > so of course, I got several tokens which the classifier ignored. The > debug and classification headers were For body (but not header) tokenization, the option replace_nonascii_chars (off by default) is very effective against junk like this, at least for those whose ham is mostly 7-bit ASCII. That option replaces each "funny character" with a question mark. So, e.g., any oddball spelling for "o" in "love" turns the token into "l?ve"; the occasional Euro-name in ham isn't really hurt by this at all. I expect it would also be effective if applied to headers. OTOH, I don't recall getting any Unsures where this would have tipped the score into my Spam range. Indeed, my Unsures this week are utterly dominated by trash bouncing back to various webmaster and admin addresses due to the Sobig worm forging sender addresses, like """ From the U.S. Courts Hostmaster: Content violation found in email message. From: webmaster@python.org To: pacer@psc.uscourts.gov File(s): details.pif Matching filename: *.pif """ It occurs to me that I haven't had "a spam problem" since last year -- now I've got "a virus bounce problem" <0.5 wink>! From tim.one at comcast.net Thu Jun 26 01:04:45 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Jun 26 00:08:02 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F13022076D3@its-xchg4.massey.ac.nz> Message-ID: [Tim] >> You're saying that if d is an open Shelf object, then after >> >> d[string] = whatever >> >> the value of the access expression >> >> d[string] >> >> is unreliable unless a >> >> d.sync() >> >> call intervenes? That's scary, if so -- or a bug. [Tony Meyer] > I believe so. It's reliable as long as 'string' was already a token, > but not otherwise. It doesn't give a random value, at least, it just > fails to update it. (So d[string] would return whatever value it had > before the d[string] = whatever line). I don't follow this: if string wasn't already a token, then how could d[string] have returned *any* value before d[string] was assigned to? Maybe I don't understand what you mean by "was already a token". It would help most if you could post a tiny self-contained executable program illustrating what "not reliable" means. I believe that would be a horrendous bug in the database package (Python's or Berkeley's, not spambayes's -- the spambayes storage.py has at least the deleted-word bug I posted about last time). Here's my failed tiny self-contained attempt at provoking unreliable behavior (current Python 2.3 CVS on Windows): >>> import shelve >>> import bsddb >>> db = bsddb.hashopen('whatever') >>> d = shelve.Shelf(db) >>> d['abc'] = 1, 2 >>> d['abc'] (1, 2) >>> d['abc'] = 3, 4 >>> d['abc'] (3, 4) >>> d['x'] = 12 >>> d['x'] = 42 >>> d['x'] 42 >>> d['y'] Traceback (most recent call last): File "", line 1, in ? File "C:\CODE\PYTHON\lib\shelve.py", line 118, in __getitem__ f = StringIO(self.dict[key]) File "C:\CODE\PYTHON\lib\bsddb\__init__.py", line 86, in __getitem__ return self.db[key] KeyError: 'y' >>> From T.A.Meyer at massey.ac.nz Thu Jun 26 17:48:14 2003 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Thu Jun 26 00:48:56 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies Message-ID: <1ED4ECF91CDED24C8D012BCF2B034F1302207722@its-xchg4.massey.ac.nz> Ah well, I almost had it right before... Ok more investigation prompted by trying to come up with an example for Tim. (Note, I wasn't saying that the database package was broken, just that the _wordinfo* functions in storage.py were). I can now get a list of the incorrect words by putting a print statement in two places - either all those words for which _wordinfodel() is called, or all those words for whom the "del self.changed_words[word]" line does not raise an exception in _wordinfoset(). I guess the problem is not what I guessed before (to my credit, I said that I was unsure, and that I had narrowed it down, which was true ;), but along the lines of the delete issue that Tim pointed out. I was somewhat on the right track... The problem (I am more sure, but still in the unsure range ;) is when tokens are deleted before they are written to the db. (A much nicer and easier to solve problem :) Here's example code: from spambayes.storage import DBDictClassifier from spambayes.classifier import WordInfo d = DBDictClassifier("fail.db") print "Should not be an entry" print d._wordinfoget("tok") w = WordInfo() w.hamcount = 1 d._wordinfoset("tok", w) print "Should have a ham count of 1, spam count of 0" print d._wordinfoget("tok") w.hamcount -=1 # not really necessary d._wordinfodel("tok") #d.store() # uncomment this line and it will work print "Should not be an entry (or have ham and spam of 0)" print d._wordinfoget("tok") =Tony Meyer From skip at pobox.com Thu Jun 26 08:42:55 2003 From: skip at pobox.com (Skip Montanaro) Date: Thu Jun 26 10:00:11 2003 Subject: [spambayes-dev] Interesting unsure In-Reply-To: References: <16122.2081.275212.576909@montanaro.dyndns.org> Message-ID: <16122.59983.374544.678237@montanaro.dyndns.org> Tim> For body (but not header) tokenization, the option Tim> replace_nonascii_chars (off by default) is very effective against Tim> junk like this, at least for those whose ham is mostly 7-bit ASCII. Tim> That option replaces each "funny character" with a question mark. Tim> So, e.g., any oddball spelling for "o" in "love" turns the token Tim> into "l?ve"; I'd like to simply strip the accents. With the current scheme you still wind up with four related tokens, "love", "l?ve", "lov?" and "l?v?", all prefaced by "subject:". Since what the spammer wants you to read in all instances is "love", I think that's the target we should aim at where possible. Tim> Indeed, my Unsures this week are utterly dominated by trash Tim> bouncing back to various webmaster and admin addresses due to the Tim> Sobig worm forging sender addresses, like ... Mine too. Tim> It occurs to me that I haven't had "a spam problem" since last year Tim> -- now I've got "a virus bounce problem" <0.5 wink>! I just classify them as spam. It's actually unclear to me why these "anti-virus" programs feel the need to reply to such messages. Most of the time the sender is forged anyway, so the reply goes to someone who doesn't have the virus. Skip From noreply at sourceforge.net Thu Jun 26 09:40:57 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 26 11:41:08 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761250 ] .dll doesn't stay checked in COM Add-Ins window Message-ID: Bugs item #761250, was opened at 2003-06-26 15:40 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: James Noser (byjiminy) Assigned to: Mark Hammond (mhammond) Summary: .dll doesn't stay checked in COM Add-Ins window Initial Comment: I downloaded some Windows security patches yesterday and SpamBayes no longer appeared in the toolbar. I uninstalled and reinstalled both SpamBayes and Outlook. I tried to instal Spambayes using the binary and the addin.py source. The COM add-in appeared in the Outlook menu, I checked it, and chose OK. Then when I opened the COM add-in window again, the Spam Bayes add-in wasn't checked. The attached log file documents three tries. I removed the .dll from the COM add-in window, and then added it in again. Sometimes this caused Outlook to Crash. Strangely enough, the log file shows the application trying to access files on the E: drive. Both Outlook and Spambayes are installed on my C: drive. As a note, I've passed this program to several associates who love the software. They worry though, that my problems may come their way. Let me know if I can offer any other information. Thanks PS - Windows 2000 Professional, Office 2000 Proffessional ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 From noreply at sourceforge.net Thu Jun 26 11:19:16 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 26 13:19:29 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761316 ] failed binary install Message-ID: Bugs item #761316, was opened at 2003-06-26 13:19 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761316&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Andrew Shepard (sheparda) Assigned to: Mark Hammond (mhammond) Summary: failed binary install Initial Comment: running win98 and Outlook 2000 spambayes1.log file attached-- starts with 2 lines: "SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher" followed by series of "tracebacks" to E: drive ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761316&group_id=61702 From noreply at sourceforge.net Thu Jun 26 11:22:05 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 26 13:22:12 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761318 ] Moved mail is unfiltered Message-ID: Bugs item #761318, was opened at 2003-06-26 12:22 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761318&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Welsh (markw_x) Assigned to: Mark Hammond (mhammond) Summary: Moved mail is unfiltered Initial Comment: When I use Outlook's rules wizard to move my emails to a different folder, they aren't filtered by the add-in. Both the inbox, where the message arrives for a second before the rule is used, and the destination folder, where the message is moved by the rule, are listed as folders that should be filtered by the add-in. The spam that arrives is given a spam score but not filtered accordingly. This is an excerpt from the log for a piece of spam that was given a rating of 100%: Message 'Get Your NEW Car ASAP' had a Spam classification of 'Yes' Training on message 'Get Your NEW Car ASAP' - trained as good Training on message 'Get Your NEW Car ASAP' - trained as spam It was trained as spam because I clicked the "Delete as Spam" button, not because of the filtering. I have Outlook 2002 SP-2. I'm using the latest binary of the add-in (SpamBayes-Outlook-Setup-002.exe). The spam filtering works fine for all other mail that's left in the inbox and not moved by a rule. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761318&group_id=61702 From noreply at sourceforge.net Thu Jun 26 12:43:06 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 26 14:43:12 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761359 ] Folder view results in ImportError Message-ID: Bugs item #761359, was opened at 2003-06-26 13:43 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761359&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark Welsh (markw_x) Assigned to: Mark Hammond (mhammond) Summary: Folder view results in ImportError Initial Comment: When I open the anti-spam manager and navigate to any window with a folder tree view, I end up with an ImportError in my log (attached). I haven't noticed any immediate side-effects, but the add-in does crash Outlook periodically, so I thought this might be at least partially to blame. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761359&group_id=61702 From noreply at sourceforge.net Thu Jun 26 13:22:38 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 26 15:22:43 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-761383 ] Bounce back spam messages Message-ID: Feature Requests item #761383, was opened at 2003-06-26 15:22 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=761383&group_id=61702 Category: Interface Improvements (example) Group: None Status: Open Priority: 5 Submitted By: Dale (daledude) Assigned to: Nobody/Anonymous (nobody) Summary: Bounce back spam messages Initial Comment: I think the ability to configure the option to "bounce" spam back to the spammer would help a bit in eliminating repeat spam from the same spammer. Depending on the spammer of course. Possibly allow configurable text for the bounce so it looks like a true qmail/postfix/sendmail bounce? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=761383&group_id=61702 From noreply at sourceforge.net Thu Jun 26 13:36:52 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 26 15:37:02 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761250 ] .dll doesn't stay checked in COM Add-Ins window Message-ID: Bugs item #761250, was opened at 2003-06-26 11:40 Message generated for change (Comment added) made by jhihn1 You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: James Noser (byjiminy) Assigned to: Mark Hammond (mhammond) Summary: .dll doesn't stay checked in COM Add-Ins window Initial Comment: I downloaded some Windows security patches yesterday and SpamBayes no longer appeared in the toolbar. I uninstalled and reinstalled both SpamBayes and Outlook. I tried to instal Spambayes using the binary and the addin.py source. The COM add-in appeared in the Outlook menu, I checked it, and chose OK. Then when I opened the COM add-in window again, the Spam Bayes add-in wasn't checked. The attached log file documents three tries. I removed the .dll from the COM add-in window, and then added it in again. Sometimes this caused Outlook to Crash. Strangely enough, the log file shows the application trying to access files on the E: drive. Both Outlook and Spambayes are installed on my C: drive. As a note, I've passed this program to several associates who love the software. They worry though, that my problems may come their way. Let me know if I can offer any other information. Thanks PS - Windows 2000 Professional, Office 2000 Proffessional ---------------------------------------------------------------------- Comment By: Jason Hihn (jhihn1) Date: 2003-06-26 15:36 Message: Logged In: YES user_id=142836 I have the same problem. here's my log: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 From noreply at sourceforge.net Thu Jun 26 13:43:32 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 26 15:43:45 2003 Subject: [spambayes-dev] [ spambayes-Bugs-760286 ] spambayes_admin.dll failed to register Message-ID: Bugs item #760286, was opened at 2003-06-24 23:40 Message generated for change (Comment added) made by jimmythegeek You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=760286&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: John Mamutil (javinm) Assigned to: Mark Hammond (mhammond) Summary: spambayes_admin.dll failed to register Initial Comment: This bug has already been reported but it did not have a log file entry - so I have provided what appeared in my log file: Traceback (most recent call last): File "", line 15, in DllRegisterServer File "E:\src\pythonex\com\win32com\server\register.py" , line 388, in RegisterClasses IndexError: list index out of range Hoping to see a fix soon. will installing the Source code version help? Thanks, John ---------------------------------------------------------------------- Comment By: Jimmy theGeek (jimmythegeek) Date: 2003-06-26 12:43 Message: Logged In: YES user_id=398582 I ran into the same thing; the docs/troubleshooting.html says "Note that running regsvr32.exe spambayes_addin.dll from the SpamBayes directory may also solve this problem" It did solve the problem. I am training now. Hope this helps. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=760286&group_id=61702 From noreply at sourceforge.net Thu Jun 26 14:03:32 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 26 16:03:37 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761407 ] Outlook profiles confuse things Message-ID: Bugs item #761407, was opened at 2003-06-26 16:03 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761407&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Mark H Bramhall (beaugeste) Assigned to: Mark Hammond (mhammond) Summary: Outlook profiles confuse things Initial Comment: If one runs Outlook in multiple configurations, by selecting a profile at Outlook startup, the plugin gets quite confused. I don't know enough about Outlook profiles and/or how/where a plugin would store pre-profile information, but something is needed. If I switch to a new profile, which has a different Exchange server, then the plugin looks for info from the old server (I get a popup saying the old server is "slow in responding", which is just a timer-based thing in the general Outlook access routines). After a while things sort of seem to settle, but not quite right. Each profile should have a complete set of its own settings as one could have completely different folders pre-profle. Yeah, I know, gripe, gripe, but it will make or break you for many Outlook users. If someone could point me close to the right things places (in the source/plugin as well as to the right documentation -- I said I didn't know much about Outlook!) then I'd try to look at it... /s/ MarkB ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761407&group_id=61702 From noreply at sourceforge.net Thu Jun 26 14:42:54 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 26 16:43:02 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-761383 ] Bounce back spam messages Message-ID: Feature Requests item #761383, was opened at 2003-06-26 14:22 Message generated for change (Comment added) made by montanaro You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=761383&group_id=61702 Category: Interface Improvements (example) Group: None >Status: Closed Priority: 5 Submitted By: Dale (daledude) Assigned to: Nobody/Anonymous (nobody) Summary: Bounce back spam messages Initial Comment: I think the ability to configure the option to "bounce" spam back to the spammer would help a bit in eliminating repeat spam from the same spammer. Depending on the spammer of course. Possibly allow configurable text for the bounce so it looks like a true qmail/postfix/sendmail bounce? ---------------------------------------------------------------------- >Comment By: Skip Montanaro (montanaro) Date: 2003-06-26 15:42 Message: Logged In: YES user_id=44345 Not a good idea in general. Bouncing spam does one of three things: 1. Aggravates the person whose email address was forged. 2. Causes another message to bounce back telling you about the account's over quota or recently closed status. 3. Tells the spammers (if they are actually listening to that email address) that your email is valid and that they should send you more spam. All three cases are bad. I've never encountered a scenario where bounding the mail helps. If you can bounce it at the MTA (during the SMTP session) that can help, but not after the message has been received and delivered. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=761383&group_id=61702 From noreply at sourceforge.net Thu Jun 26 16:17:33 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 26 18:17:37 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761499 ] pop3proxy_service doesn't stop when shutdown from browser Message-ID: Bugs item #761499, was opened at 2003-06-26 22:17 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761499&group_id=61702 Category: pop3proxy Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jason Foster (the_onceler) Assigned to: Nobody/Anonymous (nobody) Summary: pop3proxy_service doesn't stop when shutdown from browser Initial Comment: If you click "save and shutdown" in your broswer, pop3proxy shuts down, but in services the pop3proxy_service still lists as started. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761499&group_id=61702 From noreply at sourceforge.net Thu Jun 26 16:26:16 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Thu Jun 26 18:26:19 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761507 ] Outlook Express reports pop3proxy not responding. Message-ID: Bugs item #761507, was opened at 2003-06-26 22:26 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761507&group_id=61702 Category: pop3proxy Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jason Foster (the_onceler) Assigned to: Nobody/Anonymous (nobody) Summary: Outlook Express reports pop3proxy not responding. Initial Comment: I am running v 1.0a3 and it, along with v1.0a2 for some reason now likes to break connection with Outlook Express. Outlook Express states the POP3 server is not responding and no matter how many times you tell it to wait, it never finishes getting the messages. If you tell Outlook Express to get messages again, it will start downloading the messages from the beginning again with no problem. pop3proxy is not generating any error messages and it doesn't crash. I thought this could be a problem with a message, but if I delete all the messages out of my mailbox on the pop3 server to the message it stoped downloading on, it has no problem getting the rest of them all. I don't think it's a corrupt database because I have just recreated the problem on a new machine that I started with a fresh install. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761507&group_id=61702 From tim.one at comcast.net Thu Jun 26 21:46:20 2003 From: tim.one at comcast.net (Tim Peters) Date: Thu Jun 26 20:49:26 2003 Subject: [spambayes-dev] Re: Pickle vs DB inconsistencies In-Reply-To: <1ED4ECF91CDED24C8D012BCF2B034F1302207722@its-xchg4.massey.ac.nz> Message-ID: [Tony Meyer] > ... > Here's example code: > > from spambayes.storage import DBDictClassifier > from spambayes.classifier import WordInfo > d = DBDictClassifier("fail.db") > print "Should not be an entry" > print d._wordinfoget("tok") > w = WordInfo() > w.hamcount = 1 > d._wordinfoset("tok", w) > print "Should have a ham count of 1, spam count of 0" > print d._wordinfoget("tok") > w.hamcount -=1 # not really necessary > d._wordinfodel("tok") > #d.store() # uncomment this line and it will work > print "Should not be an entry (or have ham and spam of 0)" > print d._wordinfoget("tok") OK, I checked the change in I mentioned before, and now this program prints """ Should not be an entry None Should have a ham count of 1, spam count of 0 WordInfo(0, 1) Should not be an entry (or have ham and spam of 0) None """ Note that it should not have a spam and ham count of 0 at the end, it should return None (as it does now). As the WordInfo class comment says, # Invariant: For use in a classifier database, at least one of # spamcount and hamcount must be non-zero. I also checked in other, more cosmetic changes. If it breaks something, let me know. From noreply at sourceforge.net Fri Jun 27 01:11:57 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 27 03:12:07 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761250 ] .dll doesn't stay checked in COM Add-Ins window Message-ID: Bugs item #761250, was opened at 2003-06-26 17:40 Message generated for change (Comment added) made by gnistan You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: James Noser (byjiminy) Assigned to: Mark Hammond (mhammond) Summary: .dll doesn't stay checked in COM Add-Ins window Initial Comment: I downloaded some Windows security patches yesterday and SpamBayes no longer appeared in the toolbar. I uninstalled and reinstalled both SpamBayes and Outlook. I tried to instal Spambayes using the binary and the addin.py source. The COM add-in appeared in the Outlook menu, I checked it, and chose OK. Then when I opened the COM add-in window again, the Spam Bayes add-in wasn't checked. The attached log file documents three tries. I removed the .dll from the COM add-in window, and then added it in again. Sometimes this caused Outlook to Crash. Strangely enough, the log file shows the application trying to access files on the E: drive. Both Outlook and Spambayes are installed on my C: drive. As a note, I've passed this program to several associates who love the software. They worry though, that my problems may come their way. Let me know if I can offer any other information. Thanks PS - Windows 2000 Professional, Office 2000 Proffessional ---------------------------------------------------------------------- Comment By: Johan Nilsson (gnistan) Date: 2003-06-27 09:11 Message: Logged In: YES user_id=810664 I ran it on Outlook 2002 Swedish, this result did I get: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\bheljoni\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' Removed it from COM addins and added it again: Registered: SpamBayes.OutlookAddin SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\bheljoni\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize ---------------------------------------------------------------------- Comment By: Jason Hihn (jhihn1) Date: 2003-06-26 21:36 Message: Logged In: YES user_id=142836 I have the same problem. here's my log: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 From noreply at sourceforge.net Fri Jun 27 02:01:00 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 27 04:01:04 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761670 ] Unexpected unicode key in bsd db Message-ID: Bugs item #761670, was opened at 2003-06-27 01:01 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761670&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Christian Stork (cst) Assigned to: Nobody/Anonymous (nobody) Summary: Unexpected unicode key in bsd db Initial Comment: On Debian (unstable) with python 2.3b1: cst@comuter% pdb mboxtrain.py -n -d ~/.hammiedb -g ~/ Mail.d/Box -g ~/Mail.d/roeinst -g ~/Mail.d/friends -g ~/ Mail.d/freunde -g ~/Mail.d/deadmonkey-list -s ~/Mail.d/ spam Training ham (/home/chs/Mail.d/Box): Reading as Maildir Trained 0 out of 40 messages Training ham (/home/chs/Mail.d/roeinst): Reading as Maildir Trained 0 out of 2902 messages Training ham (/home/chs/Mail.d/friends): Reading as Maildir Trained 2 out of 3690 messages Training ham (/home/chs/Mail.d/freunde): Reading as Maildir Traceback (most recent call last): File "/home/chs/bin/pdb", line 43, in ? execfile(script) File "mboxtrain.py", line 294, in ? main() File "mboxtrain.py", line 281, in main train(h, g, False, force, trainnew) File "mboxtrain.py", line 216, in train maildir_train(h, os.path.join(path, "cur"), is_spam, force) File "mboxtrain.py", line 110, in maildir_train if not msg_train(h, msg, is_spam, force): File "mboxtrain.py", line 80, in msg_train h.train(msg, is_spam) File "/home/chs/src/spambayes/spambayes/hammie.py", line 150, in train self.bayes.learn(tokenize(msg), is_spam) File "/home/chs/src/spambayes/spambayes/classifier.py", line 276, in learn self._add_msg(wordstream, is_spam) File "/home/chs/src/spambayes/spambayes/classifier.py", line 402, in _add_msg record = self._wordinfoget(word) File "/home/chs/src/spambayes/spambayes/storage.py", line 215, in _wordinfoget r = self.db.get(word) File "/usr/lib/python2.3/shelve.py", line 110, in get if self.dict.has_key(key): File "/usr/lib/python2.3/bsddb/__init__.py", line 112, in has_key return self.db.has_key(key) TypeError: String or Integer object expected for key, unicode found > /usr/lib/python2.3/bsddb/__init__.py(112)has_key() -> return self.db.has_key(key) (Pdb) p key u'filename:fname piece:Huettenfete01- Wegbeschreibung_H\xfctte' ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761670&group_id=61702 From noreply at sourceforge.net Fri Jun 27 02:13:59 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 27 04:14:02 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761674 ] Corrupting X-Spambayes-Trained: headers Message-ID: Bugs item #761674, was opened at 2003-06-27 01:13 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761674&group_id=61702 Category: hammie Group: None Status: Open Resolution: None Priority: 5 Submitted By: Christian Stork (cst) Assigned to: Nobody/Anonymous (nobody) Summary: Corrupting X-Spambayes-Trained: headers Initial Comment: After playing around with mboxtrain.py (using the -n and -f options in different combinations) I found some of my mails corrupted, eg ...headers Lines: 27 X-Spambayes-Trained: ham Hallo Christian, ...message body OR ...headers Lines: 176 6 X-Spambayes-Trained: ham ...body OR ...headers Lines: 42 X-Spambayes-Trained: ham -Spambayes-Trained: ham Hey Chris: ...body I ran spambayes 1.0a3 with the fix for bug#721664 on Debian unstable. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761674&group_id=61702 From noreply at sourceforge.net Fri Jun 27 02:23:14 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 27 04:23:17 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761677 ] mboxtrain.py's -n option has no effect Message-ID: Bugs item #761677, was opened at 2003-06-27 01:23 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761677&group_id=61702 Category: hammie Group: None Status: Open Resolution: None Priority: 5 Submitted By: Christian Stork (cst) Assigned to: Nobody/Anonymous (nobody) Summary: mboxtrain.py's -n option has no effect Initial Comment: In mboxtrain.py(train) the logic for trainnew is flawed. The 'new' dir will never be learned. train() should look something like this: def train(h, path, is_spam, force, trainnew): if not os.path.exists(path): raise ValueError("Nonexistent path: %s" % path) elif os.path.isfile(path): mbox_train(h, path, is_spam, force) elif os.path.isdir(os.path.join(path, "cur")): maildir_train(h, os.path.join(path, "cur"), is_spam, force) if trainnew: maildir_train(h, os.path.join(path, "new"), is_spam, force) elif os.path.isdir(path): mhdir_train(h, path, is_spam, force) else: raise ValueError("Unable to determine mailbox type: " + path) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761677&group_id=61702 From noreply at sourceforge.net Fri Jun 27 06:45:16 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 27 08:45:22 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761797 ] "Failed to call the universal dispatcher" Message-ID: Bugs item #761797, was opened at 2003-06-27 12:45 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761797&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Simon Caddy (simoncaddy) Assigned to: Mark Hammond (mhammond) Summary: "Failed to call the universal dispatcher" Initial Comment: My Spambayes log shows this error: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding Any ideas...? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761797&group_id=61702 From noreply at sourceforge.net Fri Jun 27 09:56:03 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 27 11:56:08 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761674 ] Corrupting X-Spambayes-Trained: headers Message-ID: Bugs item #761674, was opened at 2003-06-27 01:13 Message generated for change (Comment added) made by cst You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761674&group_id=61702 Category: hammie Group: None >Status: Closed >Resolution: Invalid Priority: 5 Submitted By: Christian Stork (cst) Assigned to: Nobody/Anonymous (nobody) Summary: Corrupting X-Spambayes-Trained: headers Initial Comment: After playing around with mboxtrain.py (using the -n and -f options in different combinations) I found some of my mails corrupted, eg ...headers Lines: 27 X-Spambayes-Trained: ham Hallo Christian, ...message body OR ...headers Lines: 176 6 X-Spambayes-Trained: ham ...body OR ...headers Lines: 42 X-Spambayes-Trained: ham -Spambayes-Trained: ham Hey Chris: ...body I ran spambayes 1.0a3 with the fix for bug#721664 on Debian unstable. ---------------------------------------------------------------------- >Comment By: Christian Stork (cst) Date: 2003-06-27 08:56 Message: Logged In: YES user_id=23468 Sorry, this bug was an artifact caused by mutt's way of caching headers in maildir mailboxes. After rebuilding hcache.db everything is fine again. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761674&group_id=61702 From noreply at sourceforge.net Fri Jun 27 11:17:49 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 27 13:18:04 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761250 ] .dll doesn't stay checked in COM Add-Ins window Message-ID: Bugs item #761250, was opened at 2003-06-26 15:40 Message generated for change (Comment added) made by jbehring20 You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: James Noser (byjiminy) Assigned to: Mark Hammond (mhammond) Summary: .dll doesn't stay checked in COM Add-Ins window Initial Comment: I downloaded some Windows security patches yesterday and SpamBayes no longer appeared in the toolbar. I uninstalled and reinstalled both SpamBayes and Outlook. I tried to instal Spambayes using the binary and the addin.py source. The COM add-in appeared in the Outlook menu, I checked it, and chose OK. Then when I opened the COM add-in window again, the Spam Bayes add-in wasn't checked. The attached log file documents three tries. I removed the .dll from the COM add-in window, and then added it in again. Sometimes this caused Outlook to Crash. Strangely enough, the log file shows the application trying to access files on the E: drive. Both Outlook and Spambayes are installed on my C: drive. As a note, I've passed this program to several associates who love the software. They worry though, that my problems may come their way. Let me know if I can offer any other information. Thanks PS - Windows 2000 Professional, Office 2000 Proffessional ---------------------------------------------------------------------- Comment By: Jason Behring (jbehring20) Date: 2003-06-27 17:17 Message: Logged In: YES user_id=811023 I am also running into the same problems on Windows 98 with Outllok 2000. removed the .dll and tried to add it again manually and this is the error I got: OUTLOOK caused an invalid page fault in module PYTHON23.DLL at 0187:1e03ff78. Registers: EAX=505c3a43 CS=0187 EIP=1e03ff78 EFLGS=00010246 EBX=005a3b70 SS=018f ESP=0056b834 EBP=1e0b8ec0 ECX=005fdc70 DS=018f ESI=005fdc70 FS=4daf EDX=00705130 ES=018f EDI=1e03ff70 GS=0000 Bytes at CS:EIP: 8b 48 54 f6 c5 40 74 1f 8b 80 a4 00 00 00 85 c0 Stack dump: 00000000 1e0352fd 005fdc70 00000000 1e0b8ea8 005a3b60 00000002 005f4da0 00000020 1e03ff5d 005fdc70 1e03ff70 00000000 1e0b8ea8 00000001 1e03fd38 This is my first post and I don't see where to upload the log file, so I'll just add it here. SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding ---------------------------------------------------------------------- Comment By: Johan Nilsson (gnistan) Date: 2003-06-27 07:11 Message: Logged In: YES user_id=810664 I ran it on Outlook 2002 Swedish, this result did I get: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\bheljoni\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' Removed it from COM addins and added it again: Registered: SpamBayes.OutlookAddin SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\bheljoni\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize ---------------------------------------------------------------------- Comment By: Jason Hihn (jhihn1) Date: 2003-06-26 19:36 Message: Logged In: YES user_id=142836 I have the same problem. here's my log: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 From peter at webreply.com Fri Jun 27 14:59:02 2003 From: peter at webreply.com (Peter McA'Nulty) Date: Fri Jun 27 14:00:18 2003 Subject: [spambayes-dev] Outlook 2003 Beta 2 Technical Refresh Message-ID: <002601c33cd5$cb6cc670$1501a8c0@Peter8100> Has anyone installed the latest Technical Refresh? We're still waiting for ours, but one of our SpamAtBay customer has gotten his and the addin seems to work fine for him -- which I admit I hoped for, but didn't actually _expect_ ;) If anyone out there has the latest refresh installed, can you confirm that Spambayes addin works (or not)? Actually _hoping_, for once, to get a product from Microsoft... Peter McA'Nulty Chief Bottle Washer SpamAtBay -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes-dev/attachments/20030627/661ef93c/attachment.htm From noreply at sourceforge.net Fri Jun 27 17:06:14 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 27 19:06:24 2003 Subject: [spambayes-dev] [ spambayes-Bugs-758552 ] Spambayes Outlook addin (dll registration failed) Message-ID: Bugs item #758552, was opened at 2003-06-21 17:19 Message generated for change (Comment added) made by hallstevenson You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=758552&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Jeffrey Williams (wjeff) Assigned to: Mark Hammond (mhammond) Summary: Spambayes Outlook addin (dll registration failed) Initial Comment: Spambayes Outlook Addin version 002 Download and ran installer, with Outlook shutdown, installation failed with the error that the dll failed to register, attempted to run "regsvr32.exe spambayes_addin.dll" from the spambayes directory, which resulted in the same error message. The install log file was created, but empty. Error Message: DllRegisterServer in spambayes_addin.dll failed. Return code was: 0xc0000005 Outlook version info: Outlook 2000 SP3 (9.0.0.6627) Corporate or Workgroup - Security Update Hope this helps Jeff ---------------------------------------------------------------------- Comment By: Hall Stevenson (hallstevenson) Date: 2003-06-27 19:06 Message: Logged In: YES user_id=120676 Same error here and per a followup to a duplicate of this error, running the "regsrvr spambayes...." workaround in the installed dir does *not* work for me. Running WinXP Pro, fully up-to-date, and Office XP SP2, also fully up-to-date. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=758552&group_id=61702 From noreply at sourceforge.net Fri Jun 27 19:03:30 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Fri Jun 27 21:03:51 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761250 ] .dll doesn't stay checked in COM Add-Ins window Message-ID: Bugs item #761250, was opened at 2003-06-26 11:40 Message generated for change (Comment added) made by jhihn1 You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: James Noser (byjiminy) Assigned to: Mark Hammond (mhammond) Summary: .dll doesn't stay checked in COM Add-Ins window Initial Comment: I downloaded some Windows security patches yesterday and SpamBayes no longer appeared in the toolbar. I uninstalled and reinstalled both SpamBayes and Outlook. I tried to instal Spambayes using the binary and the addin.py source. The COM add-in appeared in the Outlook menu, I checked it, and chose OK. Then when I opened the COM add-in window again, the Spam Bayes add-in wasn't checked. The attached log file documents three tries. I removed the .dll from the COM add-in window, and then added it in again. Sometimes this caused Outlook to Crash. Strangely enough, the log file shows the application trying to access files on the E: drive. Both Outlook and Spambayes are installed on my C: drive. As a note, I've passed this program to several associates who love the software. They worry though, that my problems may come their way. Let me know if I can offer any other information. Thanks PS - Windows 2000 Professional, Office 2000 Proffessional ---------------------------------------------------------------------- Comment By: Jason Hihn (jhihn1) Date: 2003-06-27 21:03 Message: Logged In: YES user_id=142836 PLEASE PEOPLE, BE ADVISED: That it is not actually looking for a E: drive. When the python code was compiled, it stored the path information of the source files. It is telling the developers what function in what file had the problem, along with how it got there. It is not actually looking for a file. ---------------------------------------------------------------------- Comment By: Jason Behring (jbehring20) Date: 2003-06-27 13:17 Message: Logged In: YES user_id=811023 I am also running into the same problems on Windows 98 with Outllok 2000. removed the .dll and tried to add it again manually and this is the error I got: OUTLOOK caused an invalid page fault in module PYTHON23.DLL at 0187:1e03ff78. Registers: EAX=505c3a43 CS=0187 EIP=1e03ff78 EFLGS=00010246 EBX=005a3b70 SS=018f ESP=0056b834 EBP=1e0b8ec0 ECX=005fdc70 DS=018f ESI=005fdc70 FS=4daf EDX=00705130 ES=018f EDI=1e03ff70 GS=0000 Bytes at CS:EIP: 8b 48 54 f6 c5 40 74 1f 8b 80 a4 00 00 00 85 c0 Stack dump: 00000000 1e0352fd 005fdc70 00000000 1e0b8ea8 005a3b60 00000002 005f4da0 00000020 1e03ff5d 005fdc70 1e03ff70 00000000 1e0b8ea8 00000001 1e03fd38 This is my first post and I don't see where to upload the log file, so I'll just add it here. SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding ---------------------------------------------------------------------- Comment By: Johan Nilsson (gnistan) Date: 2003-06-27 03:11 Message: Logged In: YES user_id=810664 I ran it on Outlook 2002 Swedish, this result did I get: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\bheljoni\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' Removed it from COM addins and added it again: Registered: SpamBayes.OutlookAddin SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\bheljoni\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize ---------------------------------------------------------------------- Comment By: Jason Hihn (jhihn1) Date: 2003-06-26 15:36 Message: Logged In: YES user_id=142836 I have the same problem. here's my log: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 From noreply at sourceforge.net Sat Jun 28 22:56:28 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 29 00:56:31 2003 Subject: [spambayes-dev] [ spambayes-Patches-762577 ] Outlook 2003 Workarounds Message-ID: Patches item #762577, was opened at 2003-06-29 04:56 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=762577&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Adam Walker (xenogeist) Assigned to: Nobody/Anonymous (nobody) Summary: Outlook 2003 Workarounds Initial Comment: This patch makes the toolbar display in OL 2003 and adds a option to run a timer to look for new mail because outlook 2003 doesn't fire the events properly. A diff -u is attached. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498105&aid=762577&group_id=61702 From noreply at sourceforge.net Sat Jun 28 23:30:11 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 29 01:30:33 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761797 ] "Failed to call the universal dispatcher" Message-ID: Bugs item #761797, was opened at 2003-06-27 08:45 Message generated for change (Comment added) made by chealer You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761797&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Simon Caddy (simoncaddy) Assigned to: Mark Hammond (mhammond) Summary: "Failed to call the universal dispatcher" Initial Comment: My Spambayes log shows this error: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding Any ideas...? ---------------------------------------------------------------------- Comment By: Philippe Cloutier (chealer) Date: 2003-06-29 01:30 Message: Logged In: YES user_id=738765 It seems you have the same problem as the one reported in 761316. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761797&group_id=61702 From noreply at sourceforge.net Sat Jun 28 23:37:56 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 29 01:38:07 2003 Subject: [spambayes-dev] [ spambayes-Bugs-740311 ] 'skip_max_word_size' error on XP Message-ID: Bugs item #740311, was opened at 2003-05-20 04:09 Message generated for change (Comment added) made by chealer You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Rony Steelandt (bucodi) Assigned to: Mark Hammond (mhammond) Summary: 'skip_max_word_size' error on XP Initial Comment: Installed the binary installer 002 on XP Pro No spamb buttons added Here is the logfile SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\rony.BUCODI\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' ---------------------------------------------------------------------- Comment By: Philippe Cloutier (chealer) Date: 2003-06-29 01:37 Message: Logged In: YES user_id=738765 Same installation, same problem here too. Except that I'm on SP2... I installed 5 times, launched Outlook 5 times too, and still got nothing before I read your troobleshooting section. I don't know anything about Python, so I don't know if that's relevant, but it looks like this error message only appears 1 time out of 2 (in my 2nd and 4th log), although the bug I've noticed no button in Outlook everytime. If you want me to test some solution use cheal (removethispart)@hotpop.com ---------------------------------------------------------------------- Comment By: Tony Meyer (anadelonbrin) Date: 2003-06-04 23:26 Message: Logged In: YES user_id=552329 I'm sure this wasn't the bug that I originally posted too, but never mind :) FWIW: I tried to install 002 with Outlook XP SP1 (WinXP SP1) and it failed with the skip_max_word_size thing. I then upgraded to Outlook XP SP2 and tried again. I then got the no disk error (e:, which is my CD drive). I tried putting a disk, in the drive and then got the could not register error. CVS still works fine :) ---------------------------------------------------------------------- Comment By: Sebastian J?rges (sjuerges) Date: 2003-05-27 12:01 Message: Logged In: YES user_id=740980 Same here. WinXP Service Pack 1 / OutlookXP ---------------------------------------------------------------------- Comment By: Harald Meyer (hmeyer666) Date: 2003-05-23 16:36 Message: Logged In: YES user_id=785713 Same here with Win2000 and Outlook XP ---------------------------------------------------------------------- Comment By: Stefan Franke (sfranke) Date: 2003-05-20 18:28 Message: Logged In: YES user_id=431616 I have the same one with Outlook2000 (SR-1) on Win2k. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=740311&group_id=61702 From noreply at sourceforge.net Sun Jun 29 04:36:48 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 29 06:36:52 2003 Subject: [spambayes-dev] [ spambayes-Bugs-762641 ] Assertation Error on some Email Message-ID: Bugs item #762641, was opened at 2003-06-29 10:36 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=762641&group_id=61702 Category: pop3proxy Group: None Status: Open Resolution: None Priority: 5 Submitted By: Why do you need this (beyond-thoughts) Assigned to: Nobody/Anonymous (nobody) Summary: Assertation Error on some Email Initial Comment: Hello, I got an assertation error when receiving the message below. It was in classifier.py.probability (line 307). There it says: assert hamcount <= nham I was able to "fix" this by making a comment out of this line. Thanks, Christoph Becker-Freyseng >From - Sun Jun 29 04:54:38 2003 X-UIDL: NLM"!ede"!Og8"!WMU!! X-Mozilla-Status: 0001 X-Mozilla-Status2: 00000000 Date: 29 Jun 2003 02:50:16 +0000 From: Mail System Internal Data Subject: DON'T DELETE THIS MESSAGE -- FOLDER INTERNAL DATA Message-ID: <1056855016@mmc-startup.com> X-IMAP: 1056854405 0000000001 X-UIDL: NLM"!ede"!Og8"!WMU!! Status: RO X-Spambayes-Classification: ham X-Spambayes-Spam-Probability: 4.32571756193e-011 X-Spambayes-MailId: 1056855278 This text is part of the internal format of your mail folder, and is not a real message. It is created automatically by the mail system software. If deleted, important folder data will be lost, and it will be re-created with the data reset to initial values. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=762641&group_id=61702 From noreply at sourceforge.net Sun Jun 29 04:49:01 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 29 06:49:08 2003 Subject: [spambayes-dev] [ spambayes-Bugs-762641 ] Assertation Error on some Email Message-ID: Bugs item #762641, was opened at 2003-06-29 10:36 Message generated for change (Comment added) made by beyond-thoughts You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=762641&group_id=61702 Category: pop3proxy Group: None Status: Open Resolution: None Priority: 5 Submitted By: Why do you need this (beyond-thoughts) Assigned to: Nobody/Anonymous (nobody) Summary: Assertation Error on some Email Initial Comment: Hello, I got an assertation error when receiving the message below. It was in classifier.py.probability (line 307). There it says: assert hamcount <= nham I was able to "fix" this by making a comment out of this line. Thanks, Christoph Becker-Freyseng >From - Sun Jun 29 04:54:38 2003 X-UIDL: NLM"!ede"!Og8"!WMU!! X-Mozilla-Status: 0001 X-Mozilla-Status2: 00000000 Date: 29 Jun 2003 02:50:16 +0000 From: Mail System Internal Data Subject: DON'T DELETE THIS MESSAGE -- FOLDER INTERNAL DATA Message-ID: <1056855016@mmc-startup.com> X-IMAP: 1056854405 0000000001 X-UIDL: NLM"!ede"!Og8"!WMU!! Status: RO X-Spambayes-Classification: ham X-Spambayes-Spam-Probability: 4.32571756193e-011 X-Spambayes-MailId: 1056855278 This text is part of the internal format of your mail folder, and is not a real message. It is created automatically by the mail system software. If deleted, important folder data will be lost, and it will be re-created with the data reset to initial values. ---------------------------------------------------------------------- >Comment By: Why do you need this (beyond-thoughts) Date: 2003-06-29 10:49 Message: Logged In: YES user_id=186848 I just sent this bug-report to myself (to store it locally) and guess ... Indeed this email crashed spambayes again. Here's the trace: error: uncaptured python exception, closing channel <__main__.ServerLineReader c onnected at 0xff0540> (exceptions.AssertionError: [C:\PYTHON22\lib\asyncore.py|p oll|100] [C:\PYTHON22\lib\asyncore.py|handle_read_event|397] [C:\PYTHON22\lib\as ynchat.py|handle_read|130] [C:\Programme\spambayes\pop3proxy.py|found_terminator |147] [C:\Programme\spambayes\pop3proxy.py|onServerLine|215] [C:\Programme\spamb ayes\pop3proxy.py|onResponse|276] [C:\Programme\spambayes\pop3proxy.py|onTransac tion|377] [C:\Programme\spambayes\pop3proxy.py|onRetr|428] [C:\Programme\spambay es\spambayes\classifier.py|chi2_spamprob|223] [C:\Programme\spambayes\spambayes\ classifier.py|_getclues|451] [C:\Programme\spambayes\spambayes\classifier.py|pro bability|307]) ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=762641&group_id=61702 From noreply at sourceforge.net Sun Jun 29 10:55:01 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 29 12:55:09 2003 Subject: [spambayes-dev] [ spambayes-Bugs-762769 ] Python error invoking COM method. Message-ID: Bugs item #762769, was opened at 2003-06-29 09:55 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=762769&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Constanty DeCinko III (cdecinko) Assigned to: Mark Hammond (mhammond) Summary: Python error invoking COM method. Initial Comment: Delete as SPAM button does nothing. Training database keeps disappearing? Log is filled with: pythoncom error: Python error invoking COM method. Traceback (most recent call last): File "E:\src\pythonex\com\win32com\server\policy.py", line 275, in _Invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 280, in _invoke_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 342, in OnClick File "E:\src\spambayes\Outlook2000\msgstore.py", line 241, in GetFolder File "E:\src\spambayes\Outlook2000\msgstore.py", line 194, in NormalizeID exceptions.AssertionError: What kind of ID is 'None'? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=762769&group_id=61702 From noreply at sourceforge.net Sun Jun 29 11:33:03 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 29 13:33:06 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-762783 ] TMDA capabilities Message-ID: Feature Requests item #762783, was opened at 2003-06-29 17:33 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=762783&group_id=61702 Category: None Group: None Status: Open Priority: 5 Submitted By: Why do you need this (beyond-thoughts) Assigned to: Nobody/Anonymous (nobody) Summary: TMDA capabilities Initial Comment: I just switched from TMDA (http://tmda.net/) to Spambayes. Watching emails that were neither in black- or whitelist was to annoying. Of course those people get a reply (please see at TMDA-Homepage to see how it works) but about 50% don't understand the reply-mail. (I don't know why -- the text is very clear!). Spambayes doesn't queue emails that can't be classified -- you receive them as "unsure". I think combining both techniques would cut down wrong positive and wrong negative and the "unsure" numbers. Additionaly it might be useful for training having a definite black- and whitelist. So what are good points (files, classes, methods) to add such a feature? Is it dependent whether pop3proxy, hammie, ... is used? Thank You, Christoph Becker-Freyseng ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=762783&group_id=61702 From noreply at sourceforge.net Sun Jun 29 16:22:00 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Sun Jun 29 18:22:20 2003 Subject: [spambayes-dev] [ spambayes-Bugs-725449 ] Binary plugin fails on Win9x: encoding exception Message-ID: Bugs item #725449, was opened at 2003-04-22 00:41 Message generated for change (Comment added) made by hschoenman You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725449&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: dan maer (dmara) Assigned to: Mark Hammond (mhammond) Summary: Binary plugin fails on Win9x: encoding exception Initial Comment: Running pure Windows (98SE) and Outlook 2000. No Python installed. I can't get the plugin to initialize. I go to Tools/options/other/advanced options/Com Addins and check the SpamBayes plug-in box, but it won't stayed checked, and I've uninstalled/reinstalled Outlook and the plugin but no go. I've got this working great, really great, on Winnt 4.0 and Xp. Logfile being attached for upload... Dan ---------------------------------------------------------------------- Comment By: Harvey Schoenman (hschoenman) Date: 2003-06-29 18:22 Message: Logged In: YES user_id=794308 Since this has been open for a while, I just wondered if you were going to work on it (the installer). If not, would you please publish the procedure for Python install and manual install of Spambayes, such as what user usertgo did? Also, when is the next binary release as you alluded to fixing it for that release? -hschoenman@adelphia.net ---------------------------------------------------------------------- Comment By: Helge Stenstr?m (helge) Date: 2003-05-02 05:27 Message: Logged In: YES user_id=19115 Installation fails on Windows 2000. I have: * http://starship.python.net/crew/mhammond/downloads/SpamBayes-Outlook-Setup-002.exe (2968028 bytes) * ActivePython-2.2.2-224-win32-ix86.msi The log file talks about files on E:\, which isn't mounted. An earlier log file (which seems to be overwritten, I can't find it) claimed to have written stuff to c:/Documents and Settings/erahege/Application Data/SpamBayes/.pck, but that file is nonexistent. I have NOT installed http://starship.python.net/crew/mhammond/downloads/win32all-152.exe, beacuse the instructions on http://starship.python.net/crew/mhammond/spambayes/ doesn't say so. My log file is very similar to the one already uploaded. I will upload my log file if I find out how to do it. ---------------------------------------------------------------------- Comment By: J (usertgo) Date: 2003-04-26 01:03 Message: Logged In: YES user_id=763953 ok, since i liked it so much on win2000 i did the python install & manual install of the spambayes outlook addin & its working good now, so i guess it was the installer. thanks ---------------------------------------------------------------------- Comment By: J (usertgo) Date: 2003-04-24 01:10 Message: Logged In: YES user_id=763953 I also get the same errors as in the logfile on a Win98SE installation w/Outlook 2000 SP1, but it works on Win2000 w/same Outlook. If you have any workarounds (w/regedit?) please let us know... ---------------------------------------------------------------------- Comment By: dan maer (dmara) Date: 2003-04-22 23:06 Message: Logged In: YES user_id=759684 Ok Mark... Anyway to bypass the installer issue and get it working by manual means? Dan ---------------------------------------------------------------------- Comment By: Mark Hammond (mhammond) Date: 2003-04-22 19:19 Message: Logged In: YES user_id=14198 This is an issue with the "Installer" tool I use. I will try and sus it out before the next binary release. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=725449&group_id=61702 From noreply at sourceforge.net Mon Jun 30 02:35:30 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 30 04:35:47 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761797 ] "Failed to call the universal dispatcher" Message-ID: Bugs item #761797, was opened at 2003-06-27 12:45 Message generated for change (Comment added) made by simoncaddy You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761797&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Simon Caddy (simoncaddy) Assigned to: Mark Hammond (mhammond) Summary: "Failed to call the universal dispatcher" Initial Comment: My Spambayes log shows this error: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding Any ideas...? ---------------------------------------------------------------------- >Comment By: Simon Caddy (simoncaddy) Date: 2003-06-30 08:35 Message: Logged In: YES user_id=810828 Yeah, thanks... I have now seem at the top of the docs page "current binary installer will not work with Windows 98"... but no solution!? ---------------------------------------------------------------------- Comment By: Philippe Cloutier (chealer) Date: 2003-06-29 05:30 Message: Logged In: YES user_id=738765 It seems you have the same problem as the one reported in 761316. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761797&group_id=61702 From paoloinvernizzi at dmsware.com Mon Jun 30 16:45:51 2003 From: paoloinvernizzi at dmsware.com (Paolo Invernizzi) Date: Mon Jun 30 09:46:26 2003 Subject: [spambayes-dev] Socket problem Message-ID: <3F003F0F.8010601@dmsware.com> Hi all, I've some problems with the current pop3proxy.py script. I'm running spambayes CVS on a FreeBSD 5.1 machine, with python 2.2.3 On my machine I'm lunching the pop3proxy.py with cron at startup, and I've a fetchmail process that check the pop3proxy every 10 minutes for pop3 updates... and all is fine. But... sometimes during startup, or after a one or two days, the process exits with the following error... Running the pop3proxy, lanched by cron at startup, works fine, but SpamBayes POP3 Proxy Beta1, version 0.1 (May 2003), using SpamBayes POP3 Proxy Web Interface Alpha2, version 0.02 and engine SpamBayes Beta1, version 0.1 (May 2003). Loading database... Done. SMTP Listener on port 2525 is proxying 127.0.0.1:25 Listener on port 8110 is proxying kpnqwest.it:110 User interface url is http://localhost:8880/ Traceback (most recent call last): File "/usr/local/bin/pop3proxy.py", line 731, in ? run() File "/usr/local/bin/pop3proxy.py", line 725, in run main(state.servers, state.proxyPorts, state.uiPort, state.launchUI) File "/usr/local/bin/pop3proxy.py", line 664, in main Dibbler.run(launchBrowser=launchUI) File "/usr/local/lib/python2.2/site-packages/spambayes/Dibbler.py", line 528, in run asyncore.loop(map=context._map) File "/usr/local/lib/python2.2/asyncore.py", line 206, in loop poll_fun (timeout, map) File "/usr/local/lib/python2.2/asyncore.py", line 83, in poll r,w,e = select.select (r,w,e, timeout) ValueError: filedescriptor out of range in select() Fredrik Lundh suggested me to use pool, instead of select, so I turned asyncore to: asyncore.loop(map=context._map, use_poll=1) Now the error reported is: SpamBayes POP3 Proxy Beta1, version 0.1 (May 2003), using SpamBayes POP3 Proxy Web Interface Alpha2, version 0.02 and engine SpamBayes Beta1, version 0.1 (May 2003). Loading database... Done. SMTP Listener on port 2525 is proxying 127.0.0.1:25 Listener on port 8110 is proxying kpnqwest.it:110 User interface url is http://localhost:8880/ error: uncaptured python exception, closing channel <__main__.BayesProxyListener listening :8110 at 0x83c9f6c> (socket.error:(24, 'Too many open files') [/usr/local/lib/python2.2/asyncore.py|poll3|184] [/usr/local/lib/python2.2/asyncore.py|handle_read_event|391] [/usr/local/lib/python2.2/site-packages/spambayes/Dibbler.py|handle_accept|280] [/usr/local/bin/pop3proxy.py|__init__|337] [/usr/local/bin/pop3proxy.py|__init__|180] [/usr/local/bin/pop3proxy.py|__init__|134] [/usr/local/lib/python2.2/asyncore.py|create_socket|260]) error: uncaptured python exception, closing channel <__main__.BayesProxy connected 192.168.0.2:63485 at 0x84d8aac> (exceptions.AttributeError:'_socket.socket' object has no attribute 'isClosed' [/usr/local/lib/python2.2/asyncore.py|poll3|184] [/usr/local/lib/python2.2/asyncore.py|handle_read_event|397] [/usr/local/lib/python2.2/asynchat.py|handle_read|82] [/usr/local/bin/pop3proxy.py|recv|360] [/usr/local/lib/python2.2/asyncore.py|recv|353] [/usr/local/lib/python2.2/asynchat.py|handle_close|149] [/usr/local/bin/pop3proxy.py|close|368] [/usr/local/lib/python2.2/asyncore.py|__getattr__|372]) Traceback (most recent call last): File "/usr/local/bin/pop3proxy.py", line 731, in ? run() File "/usr/local/bin/pop3proxy.py", line 725, in run main(state.servers, state.proxyPorts, state.uiPort, state.launchUI) File "/usr/local/bin/pop3proxy.py", line 664, in main Dibbler.run(launchBrowser=launchUI) File "/usr/local/lib/python2.2/site-packages/spambayes/Dibbler.py", line 528, in run asyncore.loop(map=context._map,use_poll=1) File "/usr/local/lib/python2.2/asyncore.py", line 206, in loop poll_fun (timeout, map) File "/usr/local/lib/python2.2/asyncore.py", line 190, in poll3 obj.handle_error() File "/usr/local/lib/python2.2/site-packages/spambayes/Dibbler.py", line 205, in handle_error asynchat.async_chat.handle_error(self) File "/usr/local/lib/python2.2/asyncore.py", line 427, in handle_error self.close() File "/usr/local/bin/pop3proxy.py", line 368, in close if not self.isClosed: File "/usr/local/lib/python2.2/asyncore.py", line 372, in __getattr__ return getattr (self.socket, attr) AttributeError: '_socket.socket' object has no attribute 'isClosed' Any advice?? I've started a thread on this on c.l.python also http://article.gmane.org/gmane.comp.python.general/72706 --- Paolo Invernizzi From noreply at sourceforge.net Mon Jun 30 09:23:54 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 30 11:24:15 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761250 ] .dll doesn't stay checked in COM Add-Ins window Message-ID: Bugs item #761250, was opened at 2003-06-26 15:40 Message generated for change (Comment added) made by byjiminy You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: James Noser (byjiminy) Assigned to: Mark Hammond (mhammond) Summary: .dll doesn't stay checked in COM Add-Ins window Initial Comment: I downloaded some Windows security patches yesterday and SpamBayes no longer appeared in the toolbar. I uninstalled and reinstalled both SpamBayes and Outlook. I tried to instal Spambayes using the binary and the addin.py source. The COM add-in appeared in the Outlook menu, I checked it, and chose OK. Then when I opened the COM add-in window again, the Spam Bayes add-in wasn't checked. The attached log file documents three tries. I removed the .dll from the COM add-in window, and then added it in again. Sometimes this caused Outlook to Crash. Strangely enough, the log file shows the application trying to access files on the E: drive. Both Outlook and Spambayes are installed on my C: drive. As a note, I've passed this program to several associates who love the software. They worry though, that my problems may come their way. Let me know if I can offer any other information. Thanks PS - Windows 2000 Professional, Office 2000 Proffessional ---------------------------------------------------------------------- >Comment By: James Noser (byjiminy) Date: 2003-06-30 15:23 Message: Logged In: YES user_id=810155 I've added the log file from an installation attempt. I ran addin.py. Here is the log. Thanks again ---------------------------------------------------------------------- Comment By: Jason Hihn (jhihn1) Date: 2003-06-28 01:03 Message: Logged In: YES user_id=142836 PLEASE PEOPLE, BE ADVISED: That it is not actually looking for a E: drive. When the python code was compiled, it stored the path information of the source files. It is telling the developers what function in what file had the problem, along with how it got there. It is not actually looking for a file. ---------------------------------------------------------------------- Comment By: Jason Behring (jbehring20) Date: 2003-06-27 17:17 Message: Logged In: YES user_id=811023 I am also running into the same problems on Windows 98 with Outllok 2000. removed the .dll and tried to add it again manually and this is the error I got: OUTLOOK caused an invalid page fault in module PYTHON23.DLL at 0187:1e03ff78. Registers: EAX=505c3a43 CS=0187 EIP=1e03ff78 EFLGS=00010246 EBX=005a3b70 SS=018f ESP=0056b834 EBP=1e0b8ec0 ECX=005fdc70 DS=018f ESI=005fdc70 FS=4daf EDX=00705130 ES=018f EDI=1e03ff70 GS=0000 Bytes at CS:EIP: 8b 48 54 f6 c5 40 74 1f 8b 80 a4 00 00 00 85 c0 Stack dump: 00000000 1e0352fd 005fdc70 00000000 1e0b8ea8 005a3b60 00000002 005f4da0 00000020 1e03ff5d 005fdc70 1e03ff70 00000000 1e0b8ea8 00000001 1e03fd38 This is my first post and I don't see where to upload the log file, so I'll just add it here. SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding ---------------------------------------------------------------------- Comment By: Johan Nilsson (gnistan) Date: 2003-06-27 07:11 Message: Logged In: YES user_id=810664 I ran it on Outlook 2002 Swedish, this result did I get: SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\bheljoni\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff File "E:\src\Installer\iu.py", line 274, in importHook File "E:\src\Installer\iu.py", line 353, in doimport File "E:\src\spambayes\spambayes\tokenizer.py", line 659, in ? exceptions.AttributeError: 'OptionsClass' object has no attribute 'skip_max_word_size' Removed it from COM addins and added it again: Registered: SpamBayes.OutlookAddin SpamAddin - Connecting to Outlook Created new configuration file 'C:\Documents and Settings\bheljoni\Application Data\SpamBayes\default_configuration.pck' pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 156, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 71, in import_core_spambayes_stuff exceptions.ImportError: cannot import name tokenize ---------------------------------------------------------------------- Comment By: Jason Hihn (jhihn1) Date: 2003-06-26 19:36 Message: Logged In: YES user_id=142836 I have the same problem. here's my log: SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding SpamAddin - Connecting to Outlook pythoncom error: Failed to call the universal dispatcher Traceback (most recent call last): File "E:\src\pythonex\com\win32com\universal.py", line 170, in dispatch File "E:\src\pythonex\com\win32com\server\policy.py", line 322, in _InvokeEx_ File "E:\src\pythonex\com\win32com\server\policy.py", line 601, in _invokeex_ File "E:\src\pythonex\com\win32com\server\policy.py", line 541, in _invokeex_ File "E:\src\spambayes\Outlook2000\addin.py", line 655, in OnConnection File "E:\src\spambayes\Outlook2000\manager.py", line 475, in GetManager File "E:\src\spambayes\Outlook2000\manager.py", line 141, in __init__ File "E:\src\spambayes\Outlook2000\manager.py", line 182, in LocateDataDirectory File "E:\src\python-cvs\lib\ntpath.py", line 269, in isdir exceptions.LookupError: no codec search functions registered: can't find encoding ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761250&group_id=61702 From noreply at sourceforge.net Mon Jun 30 14:31:49 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 30 16:31:55 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761670 ] Unexpected unicode key in bsd db Message-ID: Bugs item #761670, was opened at 2003-06-27 03:01 Message generated for change (Comment added) made by montanaro You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761670&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Christian Stork (cst) Assigned to: Nobody/Anonymous (nobody) Summary: Unexpected unicode key in bsd db Initial Comment: On Debian (unstable) with python 2.3b1: cst@comuter% pdb mboxtrain.py -n -d ~/.hammiedb -g ~/ Mail.d/Box -g ~/Mail.d/roeinst -g ~/Mail.d/friends -g ~/ Mail.d/freunde -g ~/Mail.d/deadmonkey-list -s ~/Mail.d/ spam Training ham (/home/chs/Mail.d/Box): Reading as Maildir Trained 0 out of 40 messages Training ham (/home/chs/Mail.d/roeinst): Reading as Maildir Trained 0 out of 2902 messages Training ham (/home/chs/Mail.d/friends): Reading as Maildir Trained 2 out of 3690 messages Training ham (/home/chs/Mail.d/freunde): Reading as Maildir Traceback (most recent call last): File "/home/chs/bin/pdb", line 43, in ? execfile(script) File "mboxtrain.py", line 294, in ? main() File "mboxtrain.py", line 281, in main train(h, g, False, force, trainnew) File "mboxtrain.py", line 216, in train maildir_train(h, os.path.join(path, "cur"), is_spam, force) File "mboxtrain.py", line 110, in maildir_train if not msg_train(h, msg, is_spam, force): File "mboxtrain.py", line 80, in msg_train h.train(msg, is_spam) File "/home/chs/src/spambayes/spambayes/hammie.py", line 150, in train self.bayes.learn(tokenize(msg), is_spam) File "/home/chs/src/spambayes/spambayes/classifier.py", line 276, in learn self._add_msg(wordstream, is_spam) File "/home/chs/src/spambayes/spambayes/classifier.py", line 402, in _add_msg record = self._wordinfoget(word) File "/home/chs/src/spambayes/spambayes/storage.py", line 215, in _wordinfoget r = self.db.get(word) File "/usr/lib/python2.3/shelve.py", line 110, in get if self.dict.has_key(key): File "/usr/lib/python2.3/bsddb/__init__.py", line 112, in has_key return self.db.has_key(key) TypeError: String or Integer object expected for key, unicode found > /usr/lib/python2.3/bsddb/__init__.py(112)has_key() -> return self.db.has_key(key) (Pdb) p key u'filename:fname piece:Huettenfete01- Wegbeschreibung_H\xfctte' ---------------------------------------------------------------------- >Comment By: Skip Montanaro (montanaro) Date: 2003-06-30 15:31 Message: Logged In: YES user_id=44345 Can you try out the attached patch and let us know how it works? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761670&group_id=61702 From noreply at sourceforge.net Mon Jun 30 18:15:18 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 30 20:15:27 2003 Subject: [spambayes-dev] [ spambayes-Bugs-675811 ] Dead buttons left on uninstall Message-ID: Bugs item #675811, was opened at 2003-01-28 12:37 Message generated for change (Comment added) made by anadelonbrin You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=675811&group_id=61702 Category: Outlook Group: None Status: Open Resolution: None Priority: 5 Submitted By: Tony Meyer (anadelonbrin) Assigned to: Mark Hammond (mhammond) Summary: Dead buttons left on uninstall Initial Comment: The toolbar buttons are temporary, which causes problems if they are moved. If they are permanent, then we are left with dead buttons if we uninstall the plugin (why would we do this? ;p ). ---------------------------------------------------------------------- >Comment By: Tony Meyer (anadelonbrin) Date: 2003-07-01 12:15 Message: Logged In: YES user_id=552329 For anyone reading this because they have buttons left over after uninstalling, you can get rid of them by resetting the toolbar. (Right click on the toolbar, click customize, then select the standard toolbar, then click reset). Presumably the next release, when we have our own toolbar, will remove this issue and this can be closed. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=675811&group_id=61702 From noreply at sourceforge.net Mon Jun 30 19:15:04 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 30 21:15:07 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761670 ] Unexpected unicode key in bsd db Message-ID: Bugs item #761670, was opened at 2003-06-27 01:01 Message generated for change (Comment added) made by cst You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761670&group_id=61702 Category: None Group: None Status: Open Resolution: None Priority: 5 Submitted By: Christian Stork (cst) Assigned to: Nobody/Anonymous (nobody) Summary: Unexpected unicode key in bsd db Initial Comment: On Debian (unstable) with python 2.3b1: cst@comuter% pdb mboxtrain.py -n -d ~/.hammiedb -g ~/ Mail.d/Box -g ~/Mail.d/roeinst -g ~/Mail.d/friends -g ~/ Mail.d/freunde -g ~/Mail.d/deadmonkey-list -s ~/Mail.d/ spam Training ham (/home/chs/Mail.d/Box): Reading as Maildir Trained 0 out of 40 messages Training ham (/home/chs/Mail.d/roeinst): Reading as Maildir Trained 0 out of 2902 messages Training ham (/home/chs/Mail.d/friends): Reading as Maildir Trained 2 out of 3690 messages Training ham (/home/chs/Mail.d/freunde): Reading as Maildir Traceback (most recent call last): File "/home/chs/bin/pdb", line 43, in ? execfile(script) File "mboxtrain.py", line 294, in ? main() File "mboxtrain.py", line 281, in main train(h, g, False, force, trainnew) File "mboxtrain.py", line 216, in train maildir_train(h, os.path.join(path, "cur"), is_spam, force) File "mboxtrain.py", line 110, in maildir_train if not msg_train(h, msg, is_spam, force): File "mboxtrain.py", line 80, in msg_train h.train(msg, is_spam) File "/home/chs/src/spambayes/spambayes/hammie.py", line 150, in train self.bayes.learn(tokenize(msg), is_spam) File "/home/chs/src/spambayes/spambayes/classifier.py", line 276, in learn self._add_msg(wordstream, is_spam) File "/home/chs/src/spambayes/spambayes/classifier.py", line 402, in _add_msg record = self._wordinfoget(word) File "/home/chs/src/spambayes/spambayes/storage.py", line 215, in _wordinfoget r = self.db.get(word) File "/usr/lib/python2.3/shelve.py", line 110, in get if self.dict.has_key(key): File "/usr/lib/python2.3/bsddb/__init__.py", line 112, in has_key return self.db.has_key(key) TypeError: String or Integer object expected for key, unicode found > /usr/lib/python2.3/bsddb/__init__.py(112)has_key() -> return self.db.has_key(key) (Pdb) p key u'filename:fname piece:Huettenfete01- Wegbeschreibung_H\xfctte' ---------------------------------------------------------------------- >Comment By: Christian Stork (cst) Date: 2003-06-30 18:15 Message: Logged In: YES user_id=23468 Works fine with Skip's patch. Thanks, Chris. ---------------------------------------------------------------------- Comment By: Skip Montanaro (montanaro) Date: 2003-06-30 13:31 Message: Logged In: YES user_id=44345 Can you try out the attached patch and let us know how it works? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761670&group_id=61702 From noreply at sourceforge.net Mon Jun 30 19:20:55 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 30 21:20:58 2003 Subject: [spambayes-dev] [ spambayes-Bugs-761670 ] Unexpected unicode key in bsd db Message-ID: Bugs item #761670, was opened at 2003-06-27 03:01 Message generated for change (Comment added) made by montanaro You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761670&group_id=61702 Category: None Group: None >Status: Closed >Resolution: Fixed Priority: 5 Submitted By: Christian Stork (cst) >Assigned to: Skip Montanaro (montanaro) Summary: Unexpected unicode key in bsd db Initial Comment: On Debian (unstable) with python 2.3b1: cst@comuter% pdb mboxtrain.py -n -d ~/.hammiedb -g ~/ Mail.d/Box -g ~/Mail.d/roeinst -g ~/Mail.d/friends -g ~/ Mail.d/freunde -g ~/Mail.d/deadmonkey-list -s ~/Mail.d/ spam Training ham (/home/chs/Mail.d/Box): Reading as Maildir Trained 0 out of 40 messages Training ham (/home/chs/Mail.d/roeinst): Reading as Maildir Trained 0 out of 2902 messages Training ham (/home/chs/Mail.d/friends): Reading as Maildir Trained 2 out of 3690 messages Training ham (/home/chs/Mail.d/freunde): Reading as Maildir Traceback (most recent call last): File "/home/chs/bin/pdb", line 43, in ? execfile(script) File "mboxtrain.py", line 294, in ? main() File "mboxtrain.py", line 281, in main train(h, g, False, force, trainnew) File "mboxtrain.py", line 216, in train maildir_train(h, os.path.join(path, "cur"), is_spam, force) File "mboxtrain.py", line 110, in maildir_train if not msg_train(h, msg, is_spam, force): File "mboxtrain.py", line 80, in msg_train h.train(msg, is_spam) File "/home/chs/src/spambayes/spambayes/hammie.py", line 150, in train self.bayes.learn(tokenize(msg), is_spam) File "/home/chs/src/spambayes/spambayes/classifier.py", line 276, in learn self._add_msg(wordstream, is_spam) File "/home/chs/src/spambayes/spambayes/classifier.py", line 402, in _add_msg record = self._wordinfoget(word) File "/home/chs/src/spambayes/spambayes/storage.py", line 215, in _wordinfoget r = self.db.get(word) File "/usr/lib/python2.3/shelve.py", line 110, in get if self.dict.has_key(key): File "/usr/lib/python2.3/bsddb/__init__.py", line 112, in has_key return self.db.has_key(key) TypeError: String or Integer object expected for key, unicode found > /usr/lib/python2.3/bsddb/__init__.py(112)has_key() -> return self.db.has_key(key) (Pdb) p key u'filename:fname piece:Huettenfete01- Wegbeschreibung_H\xfctte' ---------------------------------------------------------------------- >Comment By: Skip Montanaro (montanaro) Date: 2003-06-30 20:20 Message: Logged In: YES user_id=44345 fixed in storage.py 1.14 ---------------------------------------------------------------------- Comment By: Christian Stork (cst) Date: 2003-06-30 20:15 Message: Logged In: YES user_id=23468 Works fine with Skip's patch. Thanks, Chris. ---------------------------------------------------------------------- Comment By: Skip Montanaro (montanaro) Date: 2003-06-30 15:31 Message: Logged In: YES user_id=44345 Can you try out the attached patch and let us know how it works? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498103&aid=761670&group_id=61702 From noreply at sourceforge.net Mon Jun 30 19:38:34 2003 From: noreply at sourceforge.net (SourceForge.net) Date: Mon Jun 30 21:38:42 2003 Subject: [spambayes-dev] [ spambayes-Feature Requests-762783 ] TMDA capabilities Message-ID: Feature Requests item #762783, was opened at 2003-06-30 03:33 Message generated for change (Comment added) made by anthonybaxter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=762783&group_id=61702 Category: None Group: None Status: Open Priority: 5 Submitted By: Why do you need this (beyond-thoughts) Assigned to: Nobody/Anonymous (nobody) Summary: TMDA capabilities Initial Comment: I just switched from TMDA (http://tmda.net/) to Spambayes. Watching emails that were neither in black- or whitelist was to annoying. Of course those people get a reply (please see at TMDA-Homepage to see how it works) but about 50% don't understand the reply-mail. (I don't know why -- the text is very clear!). Spambayes doesn't queue emails that can't be classified -- you receive them as "unsure". I think combining both techniques would cut down wrong positive and wrong negative and the "unsure" numbers. Additionaly it might be useful for training having a definite black- and whitelist. So what are good points (files, classes, methods) to add such a feature? Is it dependent whether pop3proxy, hammie, ... is used? Thank You, Christoph Becker-Freyseng ---------------------------------------------------------------------- >Comment By: Anthony Baxter (anthonybaxter) Date: 2003-07-01 11:38 Message: Logged In: YES user_id=29957 I can't imagine that this feature would ever be added to spambayes. It's a completely different approach to spam-filtering, with almost nothing in common with the existing approach. Spambayes will gradually improve as you train it further - the initial flurry of unsures is probably just insufficient training. Note also that you can adjust the cutoffs to end up with more or less unsures. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=498106&aid=762783&group_id=61702