From pankaj.panging at bankofamerica.com Thu Apr 2 00:44:55 2015
From: pankaj.panging at bankofamerica.com (Panging, Pankaj)
Date: Wed, 01 Apr 2015 22:44:55 +0000
Subject: [Spambayes] Exception during training
Message-ID: <3C1E24A2FFDBE64F89ACF858E2BBFA41183361DC@smtp_mail.bankofamerica.com>

Hi

We are trying out Spambayes and running into a problem during training.
When we click 'Start Training' in the configuration window, a message pops
up saying: Gocr.exe exception.

Can you please direct us to a solution for that issue?

Thanks
Pankaj Panging
VP, Info security Engg
GIS
Bank of America

----------------------------------------------------------------------
This message, and any attachments, is for the intended recipient(s) only,
may contain information that is privileged, confidential and/or proprietary
and subject to important terms and conditions available at
http://www.bankofamerica.com/emaildisclaimer. If you are not the intended
recipient, please delete this message.

From bjn5053 at gmail.com Thu Apr 2 07:54:23 2015
From: bjn5053 at gmail.com (bjn5053)
Date: Wed, 1 Apr 2015 22:54:23 -0700
Subject: [Spambayes] Exception during training
In-Reply-To: <3C1E24A2FFDBE64F89ACF858E2BBFA41183361DC@smtp_mail.bankofamerica.com>
References: <3C1E24A2FFDBE64F89ACF858E2BBFA41183361DC@smtp_mail.bankofamerica.com>
Message-ID: <032401d06d09$79480280$6bd80780$@gmail.com>

Check out this thread - it worked for me.

https://mail.python.org/pipermail/spambayes/2012-May/023186.html

Good luck!
bjn5053

From dale at BriannasSaladDressing.com Thu Apr 9 19:37:04 2015
From: dale at BriannasSaladDressing.com (Dale Schroeder)
Date: Thu, 09 Apr 2015 12:37:04 -0500
Subject: [Spambayes] X-Spambayes-Exception
Message-ID: <5526B8C0.90307@BriannasSaladDressing.com>

This week I started getting two different spam mails that are not
classified and do not show up in the "Review" training webpage. They cause
an "X-Spambayes-Exception" header to be created with the following content:

"Traceback (most recent call last):
.  File "/usr/bin/sb_server.py", line 555, in onRetr
.    evidence=True)
.  File "/usr/lib/python2.7/dist-packages/spambayes/classifier.py", line 169, in chi2_spamprob
.    clues = self._getclues(wordstream)
.  File "/usr/lib/python2.7/dist-packages/spambayes/classifier.py", line 471, in _getclues
.    for word in set(wordstream):
.  File "/usr/lib/python2.7/dist-packages/spambayes/tokenizer.py", line 1260, in tokenize
.    for tok in self.tokenize_body(msg):
.  File "/usr/lib/python2.7/dist-packages/spambayes/tokenizer.py", line 1669, in tokenize_body
.    text, tokens = cracker(text)
.  File "/usr/lib/python2.7/dist-packages/spambayes/tokenizer.py", line 961, in analyze
.    tokens.extend(self.tokenize(m))
.  File "/usr/lib/python2.7/dist-packages/spambayes/tokenizer.py", line 1071, in tokenize
.    scheme, netloc, path, params, query, frag = urlparse.urlparse(url)
.  File "/usr/lib/python2.7/urlparse.py", line 143, in urlparse
.    tuple = urlsplit(url, scheme, allow_fragments)
.  File "/usr/lib/python2.7/urlparse.py", line 191, in urlsplit
.    raise ValueError("Invalid IPv6 URL")
.ValueError: Invalid IPv6 URL"

I have not seen this problem before, so could someone tell me what's
happening here and whether it can be fixed? I'm running Spambayes 1.1b1-1
on Debian Jessie with python 2.7.9-1.

Thanks,
Dale Schroeder

From skip.montanaro at gmail.com Sat Apr 11 19:01:20 2015
From: skip.montanaro at gmail.com (Skip Montanaro)
Date: Sat, 11 Apr 2015 13:01:20 -0400
Subject: [Spambayes] X-Spambayes-Exception
In-Reply-To: <5526B8C0.90307@BriannasSaladDressing.com>
References: <5526B8C0.90307@BriannasSaladDressing.com>
Message-ID:

On Thu, Apr 9, 2015 at 1:37 PM, Dale Schroeder wrote:
> raise ValueError("Invalid IPv6 URL")
> .ValueError: Invalid IPv6 URL"

Dale,

Can you identify the IPv6 URL it seems to be complaining about?

Skip

From dale at BriannasSaladDressing.com Mon Apr 13 16:08:04 2015
From: dale at BriannasSaladDressing.com (Dale Schroeder)
Date: Mon, 13 Apr 2015 09:08:04 -0500
Subject: [Spambayes] X-Spambayes-Exception
In-Reply-To:
References: <5526B8C0.90307@BriannasSaladDressing.com>
Message-ID: <552BCDC4.30205@BriannasSaladDressing.com>

On 04/11/2015 12:01 PM, Skip Montanaro wrote:
> On Thu, Apr 9, 2015 at 1:37 PM, Dale Schroeder wrote:
>> raise ValueError("Invalid IPv6 URL")
>> .ValueError: Invalid IPv6 URL"
> Dale,
>
> Can you identify the IPv6 URL it seems to be complaining about?
>
> Skip

Skip,

I have been unsuccessful in getting an ipv6 address using dig, nslookup, or
host. All return ipv4 addresses even when specifying ipv6 only. Note that
all are of the new tld *.science domain type. Today's list includes:

dvasiga.science = 66.248.215.124
bintime.science = 198.52.182.235
buneisme.science = 66.248.215.67

Contact info for all three from a whois lookup is admin at tvspots4less.com.

It appears they are hiding any ipv6 information, or it doesn't exist.
Dale

From skip.montanaro at gmail.com Mon Apr 13 16:19:05 2015
From: skip.montanaro at gmail.com (Skip Montanaro)
Date: Mon, 13 Apr 2015 09:19:05 -0500
Subject: [Spambayes] X-Spambayes-Exception
In-Reply-To: <552BCDC4.30205@BriannasSaladDressing.com>
References: <5526B8C0.90307@BriannasSaladDressing.com> <552BCDC4.30205@BriannasSaladDressing.com>
Message-ID:

Dale,

Looking more deeply at the traceback, SpamBayes is finding a URL-ish
thing in the body of the message which it tries to split using
urlparse.urlparse(). Can you gather all things which look like URLs in
the body of the message and send them to me? My guess is that one (or
more) of them is ill-formed. SpamBayes should at least not barf when
it encounters such a "URL".

Skip

From dale at BriannasSaladDressing.com Mon Apr 13 17:07:53 2015
From: dale at BriannasSaladDressing.com (Dale Schroeder)
Date: Mon, 13 Apr 2015 10:07:53 -0500
Subject: [Spambayes] X-Spambayes-Exception
In-Reply-To:
References: <5526B8C0.90307@BriannasSaladDressing.com> <552BCDC4.30205@BriannasSaladDressing.com>
Message-ID: <552BDBC9.6070007@BriannasSaladDressing.com>

On 04/13/2015 9:19 AM, Skip Montanaro wrote:
> Dale,
>
> Looking more deeply at the traceback, SpamBayes is finding a URL-ish
> thing in the body of the message which it tries to split using
> urlparse.urlparse(). Can you gather all things which look like URLs in
> the body of the message and send them to me? My guess is that one (or
> more) of them is ill-formed. SpamBayes should at least not barf when
> it encounters such a "URL".
>
> Skip

Skip,

I chose one without images, as it is much smaller in size. Here is what I
see:

http://www.buneisme.science/l/lt7CD1234N101DW/104L361DO808V34NK1914045N1123604291
http://www.buneisme.science/l/lt7IW1234F101JT/104P361DV808T34IB1914045G1123604291
http://www.buneisme.science/im/TD1234T101AG/104R361A808F34LP1914045V1123604291/img1101104252.gif
http://www.buneisme.science/l/lc3QO1234R101YP/104U361ED808J34DJ1914045H1123604291

This looks like the malformed html reference:

<a href="http://[http://www.buneisme.science/unsQE1234UU101MJ/104M361HX808HL34B1914045OQ1123604291]">Update preferences</a>

It appears in line 45 of the code - zipped and attached - as an "Update
Preferences" link. All of the others have a similar line for the Update
Preferences link, and all have syntax just like the example.

Thanks for your help.

Dale

-------------- next part --------------
A non-text attachment was scrubbed...
Name: dot_science_spam.zip
Type: application/zip
Size: 1404 bytes
Desc: not available
URL:

From skip.montanaro at gmail.com Mon Apr 13 17:19:21 2015
From: skip.montanaro at gmail.com (Skip Montanaro)
Date: Mon, 13 Apr 2015 10:19:21 -0500
Subject: [Spambayes] X-Spambayes-Exception
In-Reply-To: <552BDBC9.6070007@BriannasSaladDressing.com>
References: <5526B8C0.90307@BriannasSaladDressing.com> <552BCDC4.30205@BriannasSaladDressing.com> <552BDBC9.6070007@BriannasSaladDressing.com>
Message-ID:

On Mon, Apr 13, 2015 at 10:07 AM, Dale Schroeder wrote:
> This looks like the malformed html reference:
> <a
> href="http://[http://www.buneisme.science/unsQE1234UU101MJ/104M361HX808HL34B1914045OQ1123604291]">Update
> preferences</a>

Yeah, I can see where urlparse.urlparse might misinterpret that as IPv6.
I don't have ready access to the source here at work, but will try to
check things out from home later.

Skip

From ijcharingworth at yahoo.co.uk Mon Apr 27 14:25:37 2015
From: ijcharingworth at yahoo.co.uk (Ian James)
Date: Mon, 27 Apr 2015 13:25:37 +0100
Subject: [Spambayes] Strange behaviour......
Message-ID: <553E2AC1.502@yahoo.co.uk>

After using Spambayes for some years (with Outlook 2007), I have suddenly
experienced completely unpredictable behaviour.

(1) It no longer filters automatically - I have to go into Spambayes
Manager and select "Filter";

(2) It then filters out all the possible/Spam - but it does not appear in
the designated folders; in fact it has been totally and permanently deleted
from Outlook and apparently cannot be retrieved.

I've now uninstalled it and am very nervous about re-installing it, as I
know (from having checked my mail using the webmail interface before I
actually downloaded it using Outlook) that Spambayes is actually deleting
mail I do not wish to be deleted.

Is this a known problem - and will reinstalling cure it?

Just a thought - one thing that has changed recently is that I have
installed Thunderbird and set it as my default - I'm doing some testing on
it for a friend - can't see how that could cause this problem, but just
wondering.....

Thanks for your help.

Regards
Ian

From dale at BriannasSaladDressing.com Mon Apr 27 17:57:16 2015
From: dale at BriannasSaladDressing.com (Dale Schroeder)
Date: Mon, 27 Apr 2015 10:57:16 -0500
Subject: [Spambayes] X-Spambayes-Exception
In-Reply-To:
References: <5526B8C0.90307@BriannasSaladDressing.com> <552BCDC4.30205@BriannasSaladDressing.com> <552BDBC9.6070007@BriannasSaladDressing.com>
Message-ID: <553E5C5C.2050109@BriannasSaladDressing.com>

On 04/13/2015 10:19 AM, Skip Montanaro wrote:
> On Mon, Apr 13, 2015 at 10:07 AM, Dale Schroeder wrote:
>> This looks like the malformed html reference:
>> href="http://[http://www.buneisme.science/unsQE1234UU101MJ/104M361HX808HL34B1914045OQ1123604291]">Update
>> preferences
> Yeah, I can see where urlparse.urlparse might misinterpret that as
> IPv6. I don't have ready access to the source here at work, but will
> try to check things out from home later.
>
> Skip

Skip,

Just wondering if you think this can be fixed?

Dale
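
For anyone hitting the same traceback, here is a minimal sketch of one
possible workaround. It is not the actual SpamBayes fix. It only shows the
general idea of catching the ValueError so that one bad link does not abort
classification of the whole message. The standard library appears to raise
"Invalid IPv6 URL" here because the text between "//" and the next "/" is
taken as the host, and "[http:" contains a "[" with no matching "]". The
names safe_urlparse and url_tokens, and the token strings, are hypothetical
and chosen only for illustration. The sketch uses the Python 3 import; on
Python 2.7, as in the traceback, the same function lives in the urlparse
module.

```python
# Python 3 spelling; on Python 2.7 use `from urlparse import urlparse`.
from urllib.parse import urlparse

# The malformed link quoted earlier in the thread.
BAD_URL = ("http://[http://www.buneisme.science/unsQE1234UU101MJ/"
           "104M361HX808HL34B1914045OQ1123604291]")


def safe_urlparse(url):
    """Parse url, returning None instead of raising on malformed input."""
    try:
        return urlparse(url)
    except ValueError:
        # For example "Invalid IPv6 URL", raised when the host part
        # contains an unmatched "[" or "]".
        return None


def url_tokens(url):
    """Hypothetical helper: turn one URL-like string into tokens."""
    parsed = safe_urlparse(url)
    if parsed is None:
        # Hypothetical token name, not one SpamBayes is known to emit.
        return ["url:malformed"]
    return ["proto:" + parsed.scheme, "url:" + parsed.netloc.lower()]
```

Returning a distinct token for the unparseable case, rather than silently
skipping the link, keeps the malformed URL available as a clue, since
ill-formed links of this kind may themselves be a useful spam signal.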