From dave at boost-consulting.com Wed Nov 1 15:44:56 2006 From: dave at boost-consulting.com (David Abrahams) Date: Wed, 01 Nov 2006 09:44:56 -0500 Subject: [Spambayes] BadIMAPResponseError: "Cannot find saved message" References: Message-ID: <87d587b507.fsf@pereiro.luannocracy.com> Matt Stegman writes: > Hello, > > I've just started using Spambayes 1.1a2, and I'm quite happy with it (I > had been using the Outlook plugin until now). However, I'm seeing an > intermittent problem with sb_imapfilter.py. It will routinely crash while > talking to the IMAP server, and always with the same error message: > > Traceback (most recent call last): > File "/usr/bin/sb_imapfilter.py", line 1284, in ? > run() > File "/usr/bin/sb_imapfilter.py", line 1266, in run > imap_filter.Filter() > File "/usr/bin/sb_imapfilter.py", line 1067, in Filter > self.unsure_folder, self.ham_folder) > File "/usr/bin/sb_imapfilter.py", line 970, in Filter > msg.Save() > File "/usr/bin/sb_imapfilter.py", line 723, in Save > raise BadIMAPResponseError("recent", "Cannot find saved message") > __main__.BadIMAPResponseError: The command 'recent' failed to give an OK response. > Cannot find saved message > > Usually it'll run OK for a day or two, but sometimes for just a few hours > before crashing. If I don't notice for a while and messages pile up in > the inbox without being classified, Spambayes will sometimes crash before > classifying a dozen or less messages, when I start it again. I've tried > training several times, and have never seen this error; it's only appeared > when classifying. Also, the server runs Courier IMAP, but I don't know > the version. I'm using IMAP over SSL, on port 993. > > I found that I'm not the first person to see this error message: > > http://mail.python.org/pipermail/spambayes-dev/2004-November/003232.html And not the last either. I can confirm that I've seen the same effect. 
Unfortunately, IIRC, nobody is actively maintaining sb_imapfilter.py because none of the spambayesians really understands IMAP well. We need someone who understands the protocol to fix these things. -- Dave Abrahams Boost Consulting www.boost-consulting.com From dave at boost-consulting.com Wed Nov 1 15:57:58 2006 From: dave at boost-consulting.com (David Abrahams) Date: Wed, 01 Nov 2006 09:57:58 -0500 Subject: [Spambayes] What do the filter/webservers do? Message-ID: <873b93b4eh.fsf@pereiro.luannocracy.com> I can't find a clear explanation of what is supposed to be happening when I run, e.g. sb_imapfilter.py, with no options. Here are some questions: --- The docs say it is "classifying" my email. Does that mean it is moving supposed spam to the designated spam folder and supposed ham to the designated ham folder? --- The sb_imapfilter.py --help contains the following: We never delete mail, unless you use the -e/purge option, but we do mark a lot as deleted, and your mail client might remove that for you. We try to only mark as deleted once the moved/altered message is correctly saved... I guess that means it's not really removing the messages. Is there a mail client that will show me the messages that are marked deleted but not expunged? --- When I run sb_imapfilter.py -l 1 the program exits instead of continuing to run. However, if I visit the webpage that is supposed to be enabled, it is blank (no 404 error) and I get this message on the console that launched sb_imapfilter.py: channel 3: open failed: administratively prohibited: open failed Does anyone know what that means? 
--- Thanks, -- Dave Abrahams Boost Consulting www.boost-consulting.com From dave at boost-consulting.com Wed Nov 1 18:38:18 2006 From: dave at boost-consulting.com (David Abrahams) Date: Wed, 01 Nov 2006 12:38:18 -0500 Subject: [Spambayes] BadIMAPResponseError: "Cannot find saved message" References: <87d587b507.fsf@pereiro.luannocracy.com> Message-ID: <87odrr9iet.fsf@pereiro.luannocracy.com> David Abrahams writes: >> I found that I'm not the first person to see this error message: >> >> http://mail.python.org/pipermail/spambayes-dev/2004-November/003232.html > > And not the last either. I can confirm that I've seen the same > effect. > > Unfortunately, IIRC, nobody is actively maintaining sb_imapfilter.py > because none of the spambayesians really understands IMAP well. We > need someone who understands the protocol to fix these things. I believe this may be due to a race condition as described in http://mail.python.org/pipermail/python-list/2004-March/213029.html which might explain why http://mail.python.org/pipermail/spambayes-dev/2004-November/003235.html says that adding debugging output cures the problem. It's probably safe when training, but I just lost a whole lotta messages when classifying because of this bug :( -- Dave Abrahams Boost Consulting www.boost-consulting.com From menhinick at xtra.co.nz Wed Nov 1 20:41:55 2006 From: menhinick at xtra.co.nz (MENHINICK Design Ltd) Date: Thu, 2 Nov 2006 08:41:55 +1300 Subject: [Spambayes] Junk mail marked as read - letter icon still shows in system tray Message-ID: Hi, I seem to have a small glitch? in my Spambayes installtion? The filter seems to be working extremely well but I have ticked options for all 'Spam' and 'Junk Suspects' to be marked as read. The junk mail is 'un-bolded' which appears to be marked as read but the letter icon still appears in the system tray; is this the intended action?? I was hoping that marking spam as read would not still bring 'spam' to my attention. 
WinXP Pro SP2 Outlook 2000 SP3 Regards Mike ******************** Mike Menhinick MENHINICK Design Ltd PO Box 32164 93 Victoria Rd Devonport Auckland PH/FX +64(9)4450034 menhinick at xtra.co.nz ******************** From jeremy.wallis at virgin.net Thu Nov 2 00:49:23 2006 From: jeremy.wallis at virgin.net (Jeremy Wallis) Date: Wed, 1 Nov 2006 23:49:23 -0000 Subject: [Spambayes] Simple Question... Message-ID: <000001c6fe10$5bfbfa90$0200a8c0@office> Hi, Do I turn off Outlook's built in junk filter or leave it running? I'm inclined to turn it off unless there's a good reason not to. Thanks. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061101/ccc451c3/attachment.html From cjw at sympatico.ca Thu Nov 2 01:40:56 2006 From: cjw at sympatico.ca (Colin J. Williams) Date: Wed, 01 Nov 2006 19:40:56 -0500 Subject: [Spambayes] Use of spambyes with any other mail package than outlook express Message-ID: <45493E98.20101@sympatico.ca> This is to enquire whether anyone has successfully used the new version, other than with Outlook Express. I had a stab at it, but failed. If anyone has succeeded, I will try again. Colin W. From skip at pobox.com Thu Nov 2 02:21:23 2006 From: skip at pobox.com (skip at pobox.com) Date: Wed, 1 Nov 2006 19:21:23 -0600 Subject: [Spambayes] Use of spambyes with any other mail package than outlook express In-Reply-To: <45493E98.20101@sympatico.ca> References: <45493E98.20101@sympatico.ca> Message-ID: <17737.18451.855617.113090@montanaro.dyndns.org> Colin> This is to enquire whether anyone has successfully used the new Colin> version, other than with Outlook Express. Colin> I had a stab at it, but failed. If anyone has succeeded, I will Colin> try again. I use SpamBayes just fine on a Mac. Can you describe what your environment is and what doesn't work? 
Skip From Jant at skynet.be Thu Nov 2 13:45:25 2006 From: Jant at skynet.be (Jan Tuerlinckx) Date: Thu, 2 Nov 2006 13:45:25 +0100 Subject: [Spambayes] ????? Message-ID: <005c01c6fe7c$d6e7e7c0$0301a8c0@DF53XV1J> POP3 proxy running on 110, proxying to Pop.skynet.be:110. Active POP3 conversations: 0. POP3 conversations this session: 12. Emails classified this session: 1 spam, 22 ham, 1 unsure. Total emails trained: Spam: 121 Ham: 258 More statistics... Till now no one SPAM OR unsure are derivated to the folders JUNK MAIL or POSSIBLE JUNK -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061102/38d50758/attachment.html From jsp at PKC.com Thu Nov 2 14:17:43 2006 From: jsp at PKC.com (Jesse Pelton) Date: Thu, 2 Nov 2006 08:17:43 -0500 Subject: [Spambayes] Simple Question... In-Reply-To: <000001c6fe10$5bfbfa90$0200a8c0@office> Message-ID: <16E2027582CDB74180896CDB4B8CC1F901D04B52@PKCVT01.pkc.com> I turn it off. Actually, I run with Cached Exchange Mode turned off, and the junk filter only runs when it's on. There's some server-side filtering that I can't control, though, so I have SpamBayes filter my "Junk E-mail" and "Junk E-mail candidates" folders. SpamBayes does a better job of filtering for me than Exchange/Outlook, though this apparently is not true for everyone. ________________________________ From: spambayes-bounces+jsp=pkc.com at python.org [mailto:spambayes-bounces+jsp=pkc.com at python.org] On Behalf Of Jeremy Wallis Sent: Wednesday, November 01, 2006 6:49 PM To: spambayes at python.org Subject: [Spambayes] Simple Question... Hi, Do I turn off Outlook's built in junk filter or leave it running? I'm inclined to turn it off unless there's a good reason not to... Thanks. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/spambayes/attachments/20061102/29723071/attachment.htm From grevsen at gmail.com Thu Nov 2 15:23:00 2006 From: grevsen at gmail.com (Vibe Grevsen) Date: Thu, 2 Nov 2006 15:23:00 +0100 Subject: [Spambayes] Unwanted stock solicitations References: <000b01c6f63c$78785320$0600000a@kasper> <17724.54543.797457.531620@montanaro.dyndns.org> <003901c6f6be$3b39a310$0600000a@kasper> <17724.65310.502709.225367@montanaro.dyndns.org> Message-ID: <001301c6fe8a$6a6cb950$0600000a@kasper> Hi friends, as promised I'm continuing my tests on implementing OCR under Windows. FYI I'm running from sources recently downloaded through CVS. > >> ocr = os.popen("ocrad -s %s -c %s -x %s -f %s 2>/dev/null" % > > Vibe> What is the meaning of the last '2' in the os.popen()-call? > > It's a Unix-ism that will probably not work on Windows. It sends error > messages to the bit bucket. Ok, I did a little read-up on this. 2> is supported by WinNT, 2k and XP I just newer saw it used before. 2> is not supported in Win9x and ME. However /dev/null is - of course - not found in Windows. Equivalent is nul (case insensitive). Better use os.path.devnull like shown here. Parenthesis required for string formatting! ocr = os.popen( ( "ocrad -s %s -c %s -x %s < %s 2>" + os.path.devnull ) % (scale, charset, orf, pnmfile)) Now the surprise is that this executes 100% correctly from the interpreter, but it does not when spambayes runs. I still need to check up on exactly what is going on in Spambayes here. Maybe you could hint on other parts of the sources I should check for the next lead? Finally I was surprised to find that ocrad -s4 -x out.txt >ocr.txt logo.pgm did produce an ocr.txt but no out.txt for this image http://www.unlockaarhus.dk/dev/logo.pgm. Maybe it's only a problem with small images? Could you please test if this is the case under Unix as well? 
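The stderr redirection discussed above can also be done without any shell syntax. What follows is only a minimal sketch, assuming a modern Python with the `subprocess` module instead of the `os.popen` call used in the thread. The helper name `run_discarding_stderr` and the sample path are hypothetical, and a child Python process stands in for ocrad so the example runs anywhere. Passing the command as a list means a path containing spaces needs no quoting, and `os.devnull` names the null device for the current platform.

```python
import os
import subprocess
import sys


def run_discarding_stderr(argv):
    """Run argv (a list, so no shell quoting is involved) and return its stdout.

    stderr is sent to the platform's null device, opened via os.devnull.
    """
    with open(os.devnull, "w") as sink:
        return subprocess.check_output(argv, stderr=sink, universal_newlines=True)


# Stand-in for ocrad (an assumption for illustration only): a child Python
# process that writes a warning to stderr and echoes its first argument.
fake_path = "C:/Documents and Settings/logo.pgm"  # hypothetical path with spaces
child = [
    sys.executable,
    "-c",
    "import sys; sys.stderr.write('warning\\n'); print(sys.argv[1])",
    fake_path,
]
text = run_discarding_stderr(child)
```

Because no shell parses the command, the same call should behave the same way on Windows and Unix, which is the portability concern raised in the messages above.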
Happy coding :)

Vibe

From pl at symbolic.it Thu Nov 2 15:40:02 2006
From: pl at symbolic.it (Luigi Pugnetti)
Date: Thu, 02 Nov 2006 15:40:02 +0100
Subject: [Spambayes] Unwanted stock solicitations
In-Reply-To: <001301c6fe8a$6a6cb950$0600000a@kasper>
References: <000b01c6f63c$78785320$0600000a@kasper>
 <17724.54543.797457.531620@montanaro.dyndns.org>
 <003901c6f6be$3b39a310$0600000a@kasper>
 <17724.65310.502709.225367@montanaro.dyndns.org>
 <001301c6fe8a$6a6cb950$0600000a@kasper>
Message-ID: <1162478403.27899.22.camel@localhost.localdomain>

On Thu, 2006-11-02 at 15:23 +0100, Vibe Grevsen wrote:
> Hi friends,
>
> as promised I'm continuing my tests on implementing OCR under Windows.
> FYI I'm running from sources recently downloaded through CVS.
>
>
> > >> ocr = os.popen("ocrad -s %s -c %s -x %s -f %s 2>/dev/null" %
> >
> > Vibe> What is the meaning of the last '2' in the os.popen()-call?
> >
> > It's a Unix-ism that will probably not work on Windows. It sends error
> > messages to the bit bucket.
>
> Ok, I did a little read-up on this.
>
> 2> is supported by WinNT, 2k and XP I just newer saw it used before.
> 2> is not supported in Win9x and ME.
>
> However /dev/null is - of course - not found in Windows. Equivalent is nul (case insensitive).
> Better use os.path.devnull like shown here. Parenthesis required for string formatting!
>
> ocr = os.popen( ( "ocrad -s %s -c %s -x %s < %s 2>" + os.path.devnull ) %
> (scale, charset, orf, pnmfile))
>

Or better, use os.popen3 and discard the stderr output.

On Windows you have to put quotes around pnmfile to protect against spaces
in the path (on Linux you should have them too, but it's unlikely you will
get a path with a space).

On Windows there is also another caveat: you should also put quotes around
the ocrad path, but if you do that you have to quote the whole command.
To illustrate, the command should be:

ocr_cmd = r'""ocrad_path" -s %s -c %s "%s""' % (scale, charset, pnmfile)
fin, fout, ferr = os.popen3(ocr_cmd)

but that doesn't work on Linux.
If you quote only ocrad_path or only pnmfile, you don't need the quotes
around the command as a whole.
You can sidestep the issue (as you have done) by putting ocrad in the PATH
and not quoting it. In this case you only need to quote pnmfile, and it
works on both Linux and Windows.

> Now the surprise is that this executes 100% correctly from the interpreter, but it does not when spambayes runs.
> I still need to check up on exactly what is going on in Spambayes here.
>
> Maybe you could hint on other parts of the sources I should check for the next lead?
>
>
> Finally I was surprised to find that
>
> ocrad -s4 -x out.txt >ocr.txt logo.pgm
>
> did produce an ocr.txt but no out.txt for this image http://www.unlockaarhus.dk/dev/logo.pgm.
>
> Maybe it's only a problem with small images? Could you please test if this is the case under Unix as well?

Using -s (and other flags as well) disables -x.
The orf file is never used. It is probably left over from the start, before
Skip introduced the scale parameter.

>
>
> Happy coding :)
>
> Vibe
> _______________________________________________
> SpamBayes at python.org
> http://mail.python.org/mailman/listinfo/spambayes
> Check the FAQ before asking: http://spambayes.sf.net/faq.html
--
Luigi Pugnetti
Symbolic S.p.A.
V.le Mentana, 29
I-43100 Parma
Italy
Tel: +39 0521 708811
Fax: +39 0521 776190

From cristin at guinnsmith.com Thu Nov 2 16:08:53 2006
From: cristin at guinnsmith.com (cristin at guinnsmith.com)
Date: Thu, 2 Nov 2006 09:08:53 -0600
Subject: [Spambayes] (no subject)
Message-ID: <20061102151908.677791E4003@bag.python.org>

I am using SpamBayes 1.1 on Microsoft Windows XP version 2002 and my
SpamBayes icon buttons are no longer working: "Delete as Spam" is nowhere to
be seen, and when I click on the Spam or SpamBayes icons nothing happens. My
hourglass pops up for a few seconds like it's trying to work, but nothing
changes.

Sincerely,

Cristin Wilson
Receptionist
Guinn, Smith, & Co.
Phone 972-255-7120

From skip at pobox.com Thu Nov 2 16:24:46 2006
From: skip at pobox.com (skip at pobox.com)
Date: Thu, 2 Nov 2006 09:24:46 -0600
Subject: [Spambayes] Unwanted stock solicitations
In-Reply-To: <1162478403.27899.22.camel@localhost.localdomain>
References: <000b01c6f63c$78785320$0600000a@kasper>
 <17724.54543.797457.531620@montanaro.dyndns.org>
 <003901c6f6be$3b39a310$0600000a@kasper>
 <17724.65310.502709.225367@montanaro.dyndns.org>
 <001301c6fe8a$6a6cb950$0600000a@kasper>
 <1162478403.27899.22.camel@localhost.localdomain>
Message-ID: <17738.3518.257803.256673@montanaro.dyndns.org>

Thanks Luigi & Vibe for the feedback. I'll try to make those changes to the
code this evening and check into CVS.

Skip

From amedee at amedee.be Thu Nov 2 16:33:16 2006
From: amedee at amedee.be (Amedee Van Gasse)
Date: Thu, 2 Nov 2006 16:33:16 +0100 (CET)
Subject: [Spambayes] Use of spambyes with any other mail package than outlook express
In-Reply-To: <45493E98.20101@sympatico.ca>
References: <45493E98.20101@sympatico.ca>
Message-ID: <1164.213.118.146.74.1162481596.squirrel@amedee.be>

On Thu, November 2, 2006 1:40, Colin J. Williams said:
> This is to enquire whether anyone has successfully used the new version,
> other than with Outlook Express.
>
> I had a stab at it, but failed. If anyone has succeeded, I will try
> again.

Works like a charm on Debian 3.1 as a procmail filter.

--
Amedee

From pl at symbolic.it Thu Nov 2 17:22:15 2006
From: pl at symbolic.it (Luigi Pugnetti)
Date: Thu, 02 Nov 2006 17:22:15 +0100
Subject: [Spambayes] image analyze
Message-ID: <1162484535.27899.38.camel@localhost.localdomain>

Hello,

I think there is an error in the frame selection inside PIL_decode_parts().

for frame in ImageSequence.Iterator(image):
    # Assume the pixel with the largest value is the
    # background.
    bg = max(frame.histogram())
    if bg < bgpix:
        image = frame
        bgpix = bg

Outside this loop, image will (in my opinion) always be the last frame of
the original image.
Apart from this, the algorithm could be better. In my tests, using stddev
and sum2 yields better results.
For a (very) rough test you may use the attached code snippet.
Of course all of these are very simple tests and may be wrong, but your test
failed for a pattern that was common a few days ago (see the attached image).

--
Luigi Pugnetti
Symbolic S.p.A.
V.le Mentana, 29
I-43100 Parma
Italy
Tel: +39 0521 708811
Fax: +39 0521 776190
-------------- next part --------------
A non-text attachment was scrubbed...
Name: test_frame.py
Type: text/x-python
Size: 4002 bytes
Desc: not available
Url : http://mail.python.org/pipermail/spambayes/attachments/20061102/1a07460f/attachment-0001.py
-------------- next part --------------
A non-text attachment was scrubbed...
Name: feedback.gif.gz
Type: application/x-gzip
Size: 16684 bytes
Desc: not available
Url : http://mail.python.org/pipermail/spambayes/attachments/20061102/1a07460f/attachment-0001.bin

From cjw at sympatico.ca Thu Nov 2 18:21:23 2006
From: cjw at sympatico.ca (Colin J. Williams)
Date: Thu, 02 Nov 2006 12:21:23 -0500
Subject: [Spambayes] Use of spambyes with any other mail package than outlook express
In-Reply-To: <1164.213.118.146.74.1162481596.squirrel@amedee.be>
References: <45493E98.20101@sympatico.ca>
 <1164.213.118.146.74.1162481596.squirrel@amedee.be>
Message-ID: <454A2913.3000609@sympatico.ca>

Amedee Van Gasse wrote:
> On Thu, November 2, 2006 1:40, Colin J. Williams said:
>
>> This is to enquire whether anyone has successfully used the new version,
>> other than with Outlook Express.
>>
>> I had a stab at it, but failed. If anyone has succeeded, I will try
>> again.
>>
>
>
> Works like a charm on Debian 3.1 as a procmail filter.
>
So far, two happy Linux users and one Mac.

I use Windows XP with Firefox.

I'll try again and try to respond to Skip's question in a few days.
Meanwhile, I'm happy with the crude Firefox filter which has produced not false positives so far. The filter picks up messages with "image/gif". Colin W From peterb at zeta.org.au Thu Nov 2 23:44:25 2006 From: peterb at zeta.org.au (Peter Barker) Date: Fri, 3 Nov 2006 09:44:25 +1100 Subject: [Spambayes] UnicodeDecodeError: ordinal not in range(128) Message-ID: <200611030944.25574.peterb@zeta.org.au> I am using sb_bnfilter.py version 1.1a3 on linux (actually CVS version from Aug 21), and am getting an increasing number of spam messages with 8-bit characters in messages labelled as 7-bit. These cause sb_bnfilter.py to give an error such as: UnicodeDecodeError: 'ascii' codec can't decode byte 0xb0 in position 1: ordinal not in range(128) and the message is not classified. I set the option replace_nonascii_chars: True in .spambayesrc, and some characters no longer caused problems (as they were replaced with ?). However any 8-bit character in the Subject: header still caused problems, also the character 0xb0 somewhere in messages. I used hexdump on the message to look for the reported 8-bit character. I could find the reported character in the message (or headers), except in the case of 0xb0, which did not appear to be anywhere in the message when dumped with hexdump. I have attached a message which reports the 0xb0 problem. -------------- next part -------------- An embedded message was scrubbed... 
From: "Israeli Brokerage services Ltd" Subject: part time job Date: Mon, 23 Oct 2006 06:32:22 +1000 (EST) Size: 21533 Url: http://mail.python.org/pipermail/spambayes/attachments/20061103/b6b4d7c5/attachment.mht From grevsen at gmail.com Fri Nov 3 02:10:50 2006 From: grevsen at gmail.com (Vibe Grevsen) Date: Fri, 3 Nov 2006 02:10:50 +0100 Subject: [Spambayes] Unwanted stock solicitations References: <000b01c6f63c$78785320$0600000a@kasper> <17724.54543.797457.531620@montanaro.dyndns.org> <003901c6f6be$3b39a310$0600000a@kasper> <17724.65310.502709.225367@montanaro.dyndns.org> Message-ID: <002c01c6fee4$e802e420$0600000a@kasper> Hi again, good news - I fiddled a bit more and got it working under Windows :) :) :) >> ocr = os.popen( ( "ocrad -s %s -c %s -x %s < %s 2>" + os.path.devnull ) % >> (scale, charset, orf, pnmfile)) > or better use os.popen3 and discard stderr output. os.popen3() does not seem to support the read()-method? > On windows you have to put quote around pnmfile to protect against space > in path (also un linux you should have them but it's unlikely you get a > path with a space). Oh, YES, you're absolutely right. Thank you for this suggestion. > On windows there is also an other caveat. > you should put quote also around ocrad path but if you do that you have > to quote everything. > to explain the command should be: > ocr_cmd = r'""ocrad_path" -s %s -c %s "%s""'%(scale, charset, pnmfile) > fin, fout, ferrr = os.popen3(ocr_cmd) I tested your suggestion, but it seemed to resolve wrong in the interpreter. 
Also, I could not call read() on the result of popen3(), so I changed it a bit:

# u: unicode support, r: raw string
ocr_cmd = ur'ocrad -s %s -c %s "%s"' % (scale, charset, pnmfile)
ocr = os.popen( ocr_cmd )

I also tested this:

# u: unicode support, r: raw string
ocr_cmd = ( ur'ocrad -s %s -c %s < "%s" 2>' + os.path.devnull ) % (scale, charset, pnmfile)
ocr = os.popen( ocr_cmd )

Both work on Windows, so Skip can pick whichever he likes best ;)

>> Maybe you could hint on other parts of the sources I should check for the next lead?

With the above change I only had to do one more thing...
Comment out the check for ocrad, and then OCR works. (Assuming ocrad 0.16 is in the path.)

This means that we should probably work on testing the find_program and is_executable procedures.
As soon as they are finished I could probably start on a new exe-installer version.
I think I have figured out how to include PIL in the exe as well.

>> ocrad -s4 -x out.txt >ocr.txt logo.pgm
>> did produce an ocr.txt but no out.txt for this image http://www.unlockaarhus.dk/dev/logo.pgm.

> using -s (and other flags as well) disable -x.

Hmm, bug, no, better undocumented feature? :)
(At least it's not explained in the ocrad readme as far as I can see...)

> orf file is never used. probably is there from the start before skip
> introduce the scale parameter

Actually, I think he tries to count the number of lines in orf:

for line in open(orf):
    ...

But this could of course be done directly on ocr.read().
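Counting the lines directly from the OCR output, as suggested above, might look like the following. This is only a sketch under stated assumptions: the helper name `text_line_tokens` is hypothetical, skipping blank lines is a guess rather than confirmed SpamBayes behaviour, and `math.log2` needs Python 3.3 or later. Only the `image-text-lines:%d` token format with its log2 bucket is taken from the SpamBayes code quoted elsewhere in this thread.

```python
import math


def text_line_tokens(ocr_text):
    """Return a set holding a log2-bucketed line-count token for OCR output.

    ocr_text is the string read from the OCR process's stdout. Blank lines
    are skipped (an assumption made for this sketch).
    """
    lines = [line for line in ocr_text.splitlines() if line.strip()]
    tokens = set()
    if lines:
        # Same token shape as the snippet quoted in the thread.
        tokens.add("image-text-lines:%d" % int(math.log2(len(lines))))
    return tokens


# Sample string standing in for what ocr.read() would return.
sample_tokens = text_line_tokens("buy now\nhot stock\n\ncall today\nact fast\n")
```

Working on the captured text this way would remove the need for the separate orf file discussed above.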
Happy coding :) Vibe From skip at pobox.com Fri Nov 3 03:53:55 2006 From: skip at pobox.com (skip at pobox.com) Date: Thu, 2 Nov 2006 20:53:55 -0600 Subject: [Spambayes] Use of spambyes with any other mail package than outlook express In-Reply-To: <454A2913.3000609@sympatico.ca> References: <45493E98.20101@sympatico.ca> <1164.213.118.146.74.1162481596.squirrel@amedee.be> <454A2913.3000609@sympatico.ca> Message-ID: <17738.44867.937456.42096@montanaro.dyndns.org> Colin> I'll try again and try to respond to Skip's question in some Colin> days. Meanwhile, I'm happy with the crude Firefox filter which Colin> has produced not false positives so far. The filter picks up Colin> messages with "image/gif". When you have the time to report back, let us know which SpamBayes application you're using (probably sb_server.py or sb_imapfilter.py), what version of SpamBayes you're using (1.0.4, 1.1a2 and 1.1a3 are the most recent ones), what your settings are (default perhaps?) and what the composition of your training database is (number of ham, number of spam, how long have you been gathering mails?). Thx, Skip From pl at symbolic.it Fri Nov 3 09:48:36 2006 From: pl at symbolic.it (Luigi Pugnetti) Date: Fri, 03 Nov 2006 09:48:36 +0100 Subject: [Spambayes] Unwanted stock solicitations In-Reply-To: <002c01c6fee4$e802e420$0600000a@kasper> References: <000b01c6f63c$78785320$0600000a@kasper> <17724.54543.797457.531620@montanaro.dyndns.org> <003901c6f6be$3b39a310$0600000a@kasper> <17724.65310.502709.225367@montanaro.dyndns.org> <002c01c6fee4$e802e420$0600000a@kasper> Message-ID: <1162543717.29026.8.camel@localhost.localdomain> On Fri, 2006-11-03 at 02:10 +0100, Vibe Grevsen wrote: > Hi again, > > good news - I fiddled a bit more and got it working under Windows :) :) :) > Hello, > > >> ocr = os.popen( ( "ocrad -s %s -c %s -x %s < %s 2>" + os.path.devnull ) % > >> (scale, charset, orf, pnmfile)) > > > or better use os.popen3 and discard stderr output. 
> > os.popen3() does not seem to support the read()-method? Are you sure? I have used It on at least a couple of version of python. If you look into the code snippet I sent yesterday you may see few line commented calling ocrad using popen3 and it works (at least on of a few computers I tested it) > > > > On windows you have to put quote around pnmfile to protect against space > > in path (also un linux you should have them but it's unlikely you get a > > path with a space). > > Oh, YES, you're absolutely right. > Thank you for this suggestion. > > > > On windows there is also an other caveat. > > you should put quote also around ocrad path but if you do that you have > > to quote everything. > > to explain the command should be: > > > ocr_cmd = r'""ocrad_path" -s %s -c %s "%s""'%(scale, charset, pnmfile) > > fin, fout, ferrr = os.popen3(ocr_cmd) > > I tested your suggestion, but it seemed to resolve wrong in the interpreter. > Also popen3() could not be read() so I changed it a bit Strange. It's work for me > > # u: unicode support, r: raw string > ocr_cmd = ur'ocrad -s %s -c %s "%s"' % (scale, charset, pnmfile) > ocr = os.popen( ocr_cmd ) > > I also tested this > > # u: unicode support, r: raw string > ocr_cmd = ( ur'ocrad -s %s -c %s < "%s" 2>' + os.path.devnull ) % (scale, charset, pnmfile) > ocr = os.popen( ocr_cmd ) > > Both working in windows so Skip can pick whichever he likes best ;) Does 2> really work? I think it kind of works because it's ignored by cmd.exe but it's a unix sh construct (I'm not really sure if it even works using csh derived shells) > > > >> Maybe you could hint on other parts of the sources I should check for the next lead? > > With the above change I only had to do one more thing... > Comment out the check for ocrad, then OCR is working. (Assuming ocrad 0.16 is in the path.) > > This means that we should probably work on testing the find_program and is_executable procedures. 
> As soon as they are finished I could probably start on a new exe-installer-version. > I think I figured how to include PIL in the exe aswell. > > > >> ocrad -s4 -x out.txt >ocr.txt logo.pgm > >> did produce an ocr.txt but no out.txt for this image http://www.unlockaarhus.dk/dev/logo.pgm. > > > using -s (and other flags as well) disable -x. > > Hmm, bug, no, better undocumented feature? :) oh yes, you have to look inside the source code to find it > (At least it's not explained in the ocrad readme as far as I can see...) > > > > orf file is never used. probably is there from the start before skip > > introduce the scale parameter > > Actually he tries to count the number of lines in orf I think no, he looks for line starting with line that probably it's related to the number of line in the output. > > for line in open(orf): > ... > > But this could of course be done directly on ocr.read(). > > > Happy coding :) > > Vibe > _______________________________________________ > SpamBayes at python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -- Luigi Pugnetti Symbolic S.p.A. V.le Mentana, 29 I-43100 Parma Italy Tel: +39 0521 708811 Fax: +39 0521 776190 From grevsen at gmail.com Fri Nov 3 11:26:31 2006 From: grevsen at gmail.com (Vibe Grevsen) Date: Fri, 3 Nov 2006 11:26:31 +0100 Subject: [Spambayes] Unwanted stock solicitations Message-ID: <001501c6ff32$8cd31810$0600000a@kasper> Hi Luigi, > > os.popen3() does not seem to support the read()-method? > Are you sure? ------------ Yep, seems so at least in Python 2.5: Python 2.5 (r25:51908, Sep 19 2006, 09:52:17) [MSC v.1310 32 bit (Intel)] on win32 Type "help", "copyright", "credits" or "license" for more information. 
>>> import os >>> ocr_cmd = ur'ocrad -s %s -c %s "%s"' % ("2", "ascii", "c:\logo.pgm") >>> os.popen3( ocr_cmd ).read() __main__:1: RuntimeWarning: tp_compare didn't return -1 or -2 for exception Traceback (most recent call last): File "", line 1, in AttributeError: 'tuple' object has no attribute 'read' >>> os.popen( ocr_cmd ).read() '- Unlo_k Arhus -\n\n' >>> > commented calling ocrad using popen3 and it works (at least on of a few > computers I tested it) Hmm, strange. For the record, which OS are you using it on? > > > On windows you have to put quote around pnmfile to protect against space > > > in path (also un linux you should have them but it's unlikely you get a > > > path with a space). > > > Oh, YES, you're absolutely right. > > Thank you for this suggestion. > > > > > On windows there is also an other caveat. > > > you should put quote also around ocrad path but if you do that you have > > > to quote everything. > > > to explain the command should be: > > > > ocr_cmd = r'""ocrad_path" -s %s -c %s "%s""'%(scale, charset, pnmfile) > > > fin, fout, ferrr = os.popen3(ocr_cmd) > > > I tested your suggestion, but it seemed to resolve wrong in the interpreter. > > Also popen3() could not be read() so I changed it a bit > Strange. It's work for me The tripple-quoting resolved wrong I think. I don't understand it completely. Maybe because it's already a raw string? > > # u: unicode support, r: raw string > > ocr_cmd = ur'ocrad -s %s -c %s "%s"' % (scale, charset, pnmfile) > > ocr = os.popen( ocr_cmd ) > > > > I also tested this > > > > # u: unicode support, r: raw string > > ocr_cmd = ( ur'ocrad -s %s -c %s < "%s" 2>' + os.path.devnull ) % (scale, charset, pnmfile) > > ocr = os.popen( ocr_cmd ) > > > > Both working in windows so Skip can pick whichever he likes best ;) > Does 2> really work? 
I think it kind of works because it's ignored by > cmd.exe but it's a unix sh construct (I'm not really sure if it even > works using csh derived shells) Yes, it works in NT, XP and 2k: http://shorterlink.com/?P3W1A9 (You may even learn some tricks from that article) > > > using -s (and other flags as well) disable -x. > > > > Hmm, bug, no, better undocumented feature? :) > oh yes, you have to look inside the source code to find it Okay ... > > > orf file is never used. probably is there from the start before skip > > > introduce the scale parameter > > > > Actually he tries to count the number of lines in orf I think > no, he looks for line starting with line that probably it's related to > the number of line in the output. Not sure - I'd never even opened a .py file ten days ago :) - but this ctokens.add("image-text-lines:%d" % int(log2(nlines))) makes me suspicious... Skip? Happy coding :) Vibe From pl at symbolic.it Fri Nov 3 11:35:21 2006 From: pl at symbolic.it (Luigi Pugnetti) Date: Fri, 03 Nov 2006 11:35:21 +0100 Subject: [Spambayes] Unwanted stock solicitations In-Reply-To: <001501c6ff32$8cd31810$0600000a@kasper> References: <001501c6ff32$8cd31810$0600000a@kasper> Message-ID: <1162550123.29026.41.camel@localhost.localdomain> On Fri, 2006-11-03 at 11:26 +0100, Vibe Grevsen wrote: > Hi Luigi, > > > > > os.popen3() does not seem to support the read()-method? > > > Are you sure? > > ------------ > > Yep, seems so at least in Python 2.5: > > Python 2.5 (r25:51908, Sep 19 2006, 09:52:17) [MSC v.1310 32 bit (Intel)] on win32 > Type "help", "copyright", "credits" or "license" for more information. 
> >>> import os > >>> ocr_cmd = ur'ocrad -s %s -c %s "%s"' % ("2", "ascii", "c:\logo.pgm") > > >>> os.popen3( ocr_cmd ).read() > __main__:1: RuntimeWarning: tp_compare didn't return -1 or -2 for exception > Traceback (most recent call last): > File "", line 1, in > AttributeError: 'tuple' object has no attribute 'read' > > >>> os.popen( ocr_cmd ).read() > '- Unlo_k Arhus -\n\n' ok. this is the problem. popen returns 3 files descriptor you have to use something like: fin, fout, ferr = os.popen3(cmd) fout.read() > >>> > > > > commented calling ocrad using popen3 and it works (at least on of a few > > computers I tested it) > > Hmm, strange. For the record, which OS are you using it on? > > > > > > > On windows you have to put quote around pnmfile to protect against space > > > > in path (also un linux you should have them but it's unlikely you get a > > > > path with a space). > > > > > Oh, YES, you're absolutely right. > > > Thank you for this suggestion. > > > > > > > > On windows there is also an other caveat. > > > > you should put quote also around ocrad path but if you do that you have > > > > to quote everything. > > > > to explain the command should be: > > > > > > ocr_cmd = r'""ocrad_path" -s %s -c %s "%s""'%(scale, charset, pnmfile) > > > > fin, fout, ferrr = os.popen3(ocr_cmd) > > > > > I tested your suggestion, but it seemed to resolve wrong in the interpreter. > > > Also popen3() could not be read() so I changed it a bit > > > Strange. It's work for me > > The tripple-quoting resolved wrong I think. I don't understand it completely. > Maybe because it's already a raw string? 
> > > > > > # u: unicode support, r: raw string > > > ocr_cmd = ur'ocrad -s %s -c %s "%s"' % (scale, charset, pnmfile) > > > ocr = os.popen( ocr_cmd ) > > > > > > I also tested this > > > > > > # u: unicode support, r: raw string > > > ocr_cmd = ( ur'ocrad -s %s -c %s < "%s" 2>' + os.path.devnull ) % (scale, charset, pnmfile) > > > ocr = os.popen( ocr_cmd ) > > > > > > Both working in windows so Skip can pick whichever he likes best ;) > > > Does 2> really work? I think it kind of works because it's ignored by > > cmd.exe but it's a unix sh construct (I'm not really sure if it even > > works using csh derived shells) > > Yes, it works in NT, XP and 2k: http://shorterlink.com/?P3W1A9 > (You may even learn some tricks from that article) interesting. thank you for the info. > > > > > > > using -s (and other flags as well) disable -x. > > > > > > Hmm, bug, no, better undocumented feature? :) > > > oh yes, you have to look inside the source code to find it > > Okay ... > > > > > > > orf file is never used. probably is there from the start before skip > > > > introduce the scale parameter > > > > > > Actually he tries to count the number of lines in orf I think > > > no, he looks for line starting with line that probably it's related to > > the number of line in the output. > > Not sure - I'd never even opened a .py file ten days ago :) - but this > > ctokens.add("image-text-lines:%d" % > int(log2(nlines))) > > makes me suspicious... Skip? > > > > Happy coding :) > > Vibe > _______________________________________________ > SpamBayes at python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -- Luigi Pugnetti Symbolic S.p.A. 
V.le Mentana, 29 I-43100 Parma Italy Tel: +39 0521 708811 Fax: +39 0521 776190 From grevsen at gmail.com Fri Nov 3 11:48:24 2006 From: grevsen at gmail.com (Vibe Grevsen) Date: Fri, 3 Nov 2006 11:48:24 +0100 Subject: [Spambayes] image analyze Message-ID: <000501c6ff35$985d2ab0$0600000a@kasper> Hello, ... > A part from this the algorithm could be better. From my test > using stddev and sum2 yield better result. Maybe mean filtering the parts before processing could also be a good idea? Should dim moving artefacts. -- Happy coding :) Vibe -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061103/0238181f/attachment.html From dfjey at comax.jp Fri Nov 3 10:47:57 2006 From: dfjey at comax.jp (Sibyl Morales) Date: Fri, 3 Nov 2006 10:47:57 +0100 Subject: [Spambayes] prevailing Message-ID: An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061103/dbf0e9ff/attachment.htm -------------- next part -------------- A non-text attachment was scrubbed... 
Name: trestle.gif
Type: image/gif
Size: 6463 bytes
Desc: not available
Url : http://mail.python.org/pipermail/spambayes/attachments/20061103/dbf0e9ff/attachment.gif

From pl at symbolic.it Fri Nov 3 16:20:28 2006
From: pl at symbolic.it (Luigi Pugnetti)
Date: Fri, 03 Nov 2006 16:20:28 +0100
Subject: [Spambayes] Analyzing text in image spam (was: Spam in Images)
In-Reply-To: <17637.49519.192540.305301@montanaro.dyndns.org>
References: <002001c6b5a9$3e36ff50$6701a8c0@Caladan> <21ae1b060608171957q7f4311baw900c0fa51cc41530@mail.gmail.com> <17637.12107.629913.149910@montanaro.dyndns.org> <17637.49519.192540.305301@montanaro.dyndns.org>
Message-ID: <1162567229.29026.64.camel@localhost.localdomain>

On Fri, 2006-08-18 at 08:32 -0500, skip at pobox.com wrote:
> Once you're ready to go, add the following to your SpamBayes options:
>
>     x-lookup_ip: True
>     lookup_ip_cache: ~/.dnscache
>

Is someone using this option?
To me seems that this option alone do nothing. You have to enable both x-lookup_ip and x-pick_apart_urls. Is it right or am I missing something?

Once both are enabled it seems to work but the mail processing is very very slow.

--
Luigi Pugnetti
Symbolic S.p.A.
V.le Mentana, 29
I-43100 Parma
Italy
Tel: +39 0521 708811
Fax: +39 0521 776190

From skip at pobox.com Fri Nov 3 16:56:07 2006
From: skip at pobox.com (skip at pobox.com)
Date: Fri, 3 Nov 2006 09:56:07 -0600
Subject: [Spambayes] Analyzing text in image spam (was: Spam in Images)
In-Reply-To: <1162567229.29026.64.camel@localhost.localdomain>
References: <002001c6b5a9$3e36ff50$6701a8c0@Caladan> <21ae1b060608171957q7f4311baw900c0fa51cc41530@mail.gmail.com> <17637.12107.629913.149910@montanaro.dyndns.org> <17637.49519.192540.305301@montanaro.dyndns.org> <1162567229.29026.64.camel@localhost.localdomain>
Message-ID: <17739.26263.153047.731107@montanaro.dyndns.org>

    >> Once you're ready to go, add the following to your SpamBayes options:
    >>
    >>     x-lookup_ip: True
    >>     lookup_ip_cache: ~/.dnscache
    >>

    Luigi> Is someone using this option? To me seems that this option alone
    Luigi> do nothing. You have to enable both x-lookup_ip and
    Luigi> x-pick_apart_urls. Is it right or am I missing something?

Perhaps. I can't recall. Do you have PyDNS installed?

    Luigi> Once both are enabled it seems to work but the mail processing is
    Luigi> very very slow.

First time through, yes. After that, it should (in theory) rely on its cache of IP address information. I may have some pending checkins for that though (*). Note also that a fairly small training database works for me (fewer than 100 hams, 250-300 spams). If you have a massive training database, then, yes, this will slow things down dramatically. The IP lookup and image OCR stuff changes the properties of your database enough that I think it's worth retraining from scratch.

Skip

(*) Alas, I didn't get around to checking stuff in last night. Maybe over the weekend.
S From pl at symbolic.it Fri Nov 3 17:32:13 2006 From: pl at symbolic.it (Luigi Pugnetti) Date: Fri, 03 Nov 2006 17:32:13 +0100 Subject: [Spambayes] Analyzing text in image spam (was: Spam in Images) In-Reply-To: <17739.26263.153047.731107@montanaro.dyndns.org> References: <002001c6b5a9$3e36ff50$6701a8c0@Caladan> <21ae1b060608171957q7f4311baw900c0fa51cc41530@mail.gmail.com> <17637.12107.629913.149910@montanaro.dyndns.org> <17637.49519.192540.305301@montanaro.dyndns.org> <1162567229.29026.64.camel@localhost.localdomain> <17739.26263.153047.731107@montanaro.dyndns.org> Message-ID: <1162571534.29026.80.camel@localhost.localdomain> On Fri, 2006-11-03 at 09:56 -0600, skip at pobox.com wrote: > >> Once you're ready to go, add the following to your SpamBayes options: > >> > >> x-lookup_ip: True > >> lookup_ip_cache: ~/.dnscache > >> > > Luigi> Is someone using this option? To me seems that this option alone > Luigi> do nothing. You have to enable both x-lookup_ip and > Luigi> x-pick_apart_urls. Is it right or am I missing something? > > Perhaps. I can't recall. Do you have PyDNS installed? Yes, I have PyDNS installed. I used tcpdump to monitor dns requests and there are no requests if x-pick_apart_urls is disabled. Looking into the code seems that the check for x-lookup_ip is inside a if(pick_url enabled) construct > > Luigi> Once both are enabled it seems to work but the mail processing is > Luigi> very very slow. > > First time through, yes. After that, it should (in theory) rely on its > cache of IP address information. I may have some pending checkins for that > though (*). Note also that a fairly small training database works for me (fewer > than 100 hams, 250-300 spams). If you have a massive training database, > then, yes, this will slow things down dramatically. The IP lookup and image > OCR stuff changes the properties of your database enough that I think it's > worth retraining from scratch. 
I have tried on a sample of 5000 emails but I stopped it because after more than half an hour it didn't finish. From tcpdump I could see a request every 1,2 seconds (or something like that) now even considering that not every mail contains an url it was very slow. As a note I tried it on windows XP with ocr scanning enabled but ocr alone was much faster. > > Skip > > (*) Alas, I didn't get around to checking stuff in last night. Maybe over > the weekend. > > S -- Luigi Pugnetti Symbolic S.p.A. V.le Mentana, 29 I-43100 Parma Italy Tel: +39 0521 708811 Fax: +39 0521 776190 From grevsen at gmail.com Fri Nov 3 19:53:18 2006 From: grevsen at gmail.com (Vibe Grevsen) Date: Fri, 3 Nov 2006 19:53:18 +0100 Subject: [Spambayes] Unwanted stock solicitations Message-ID: <003d01c6ff79$5586a150$0600000a@kasper> Hi Luigi, >> > > os.popen3() does not seem to support the read()-method? ... >> >>> os.popen3( ocr_cmd ).read() >> __main__:1: RuntimeWarning: tp_compare didn't return -1 or -2 for exception >> Traceback (most recent call last): >> File "", line 1, in >> AttributeError: 'tuple' object has no attribute 'read' > ok. this is the problem. popen returns 3 files descriptor > you have to use something like: > fin, fout, ferr = os.popen3(cmd) > fout.read() Oh, this is an array :) I missed that: >>> # os.popen3() returns [stdin, stdout, stderr] >>> os.popen3( ocr_cmd )[1].read() '- Unlo_k Arhus -\n\n' Yes, you're right, that's working and probably under win9x aswell. I'll test and let you know later. Happy coding :) Vibe From grevsen at gmail.com Sat Nov 4 00:17:59 2006 From: grevsen at gmail.com (Vibe Grevsen) Date: Sat, 4 Nov 2006 00:17:59 +0100 Subject: [Spambayes] Windows compatibility - OCR [was: Unwanted stock solicitations] Message-ID: <003601c6ff9e$4eb04b90$0600000a@kasper> Hi friends, OCR code's now been tweaked and tested to work in both WinXP and Win9x. This should work in unix as well. Here is a summary: 1. Put ocrad 0.16 in the path 2. 
Change the following in ImageStripper.py

    ocr = os.popen("ocrad -s %s -c %s -x %s < %s 2>ocrerr.txt" % (scale, charset, orf, pnmfile))

into this

    ocr_cmd = ur'ocrad -s %s -c %s "%s"' % (scale, charset, pnmfile)
    # os.popen3() returns [stdin, stdout, stderr]
    ocr = os.popen3( ocr_cmd )[1]

3. Change this

    if os.path.exists(program) and is_executable(program):

into this

    if os.path.exists(program + ".exe") or ( os.path.exists(program) and is_executable(program) ):

Because of the way the instruction is interpreted it does not produce fatal errors even if the file is not found.

4. Change this

    for line in open(orf):
        if line.startswith("lines"):
            nlines = int(line.split()[1])
            if nlines:
                ctokens.add("image-text-lines:%d" % int(log2(nlines)))

into this

    nlines = ctext.count('\n')
    if nlines:
        ctokens.add("image-text-lines:%d" % nlines )

5. Finally I suggest you change the default scale from 1 to 2 like in this line

    scale = options["Tokenizer", "ocrad_scale"] or 2

Compile and enjoy.

Happy coding :)
Vibe

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/spambayes/attachments/20061104/8fd0082e/attachment.html

From rcharmoz at axxessgroup.com Sat Nov 4 05:21:24 2006
From: rcharmoz at axxessgroup.com (rcharmoz at axxessgroup.com)
Date: Fri, 03 Nov 2006 23:21:24 -0500 (EST)
Subject: [Spambayes] Problem with POP3 Proxy: can't load Spambayes
Message-ID: <0J860089IUROYN91@mta4.srv.hcvlny.cv.net>

A non-text attachment was scrubbed...
Name: SpamBayesServer1.log
Type: application/octet-stream
Size: 110 bytes
Desc: not available
Url : http://mail.python.org/pipermail/spambayes/attachments/20061103/dafb5326/attachment.obj
-------------- next part --------------
I am using SpamBayes POP3 Proxy Version 1.0.4 (March 2005) (binary), with version 2.3.5 (#62, Feb 8 2005, 16:23:02) [MSC v.1200 32 bit (Intel)] of Python; my operating system is Windows 5.1.2600.2 (Service Pack 2). I have trained 0 ham and 1 spam.
The problem I am having is that I cannot make it work with outlook. I go to the configuration section, and load in all the parameters, but it does absolutely nothing

From skip at pobox.com Sat Nov 4 05:52:21 2006
From: skip at pobox.com (skip at pobox.com)
Date: Fri, 3 Nov 2006 22:52:21 -0600
Subject: [Spambayes] Problem with POP3 Proxy: can't load Spambayes
In-Reply-To: <0J860089IUROYN91@mta4.srv.hcvlny.cv.net>
References: <0J860089IUROYN91@mta4.srv.hcvlny.cv.net>
Message-ID: <17740.7301.987258.753171@montanaro.dyndns.org>

    rcharmoz> I am using SpamBayes POP3 Proxy Version 1.0.4 (March 2005)
    rcharmoz> (binary), with version 2.3.5 (#62, Feb 8 2005, 16:23:02) [MSC
    rcharmoz> v.1200 32 bit (Intel)] of Python; my operating system is
    rcharmoz> Windows 5.1.2600.2 (Service Pack 2). I have trained 0 ham and
    rcharmoz> 1 spam.

    rcharmoz> The problem I am having is that I cannot make it work with
    rcharmoz> outlook. I go to the configuration section, and load in all
    rcharmoz> the parameters, but it does absolutely nothing

Outlook or Outlook Express? For Outlook there is a plugin. You don't need to use the POP3 proxy. In fact, I've not heard of anyone using the POP3 proxy in preference to the plugin.

Skip

From pl at symbolic.it Sat Nov 4 09:32:33 2006
From: pl at symbolic.it (Luigi Pugnetti)
Date: Sat, 04 Nov 2006 09:32:33 +0100
Subject: [Spambayes] Windows compatibility - OCR [was: Unwanted stock solicitations]
In-Reply-To: <003601c6ff9e$4eb04b90$0600000a@kasper>
References: <003601c6ff9e$4eb04b90$0600000a@kasper>
Message-ID: <1162629154.29026.101.camel@localhost.localdomain>

On Sat, 2006-11-04 at 00:17 +0100, Vibe Grevsen wrote:
> 5. Finally I suggest you change the default scale from 1 to 2 like in
> this line
>
>     scale = options["Tokenizer", "ocrad_scale"] or 2

Hello,
changing this surely doesn't hurt, but ocrad_scale is already set to 2 in Options.py, so the default in this line should probably be removed (or set to 2 as you suggest).

>

--
Luigi Pugnetti
Symbolic S.p.A.
V.le Mentana, 29 I-43100 Parma Italy Tel: +39 0521 708811 Fax: +39 0521 776190 From pl at symbolic.it Sat Nov 4 09:52:36 2006 From: pl at symbolic.it (Luigi Pugnetti) Date: Sat, 04 Nov 2006 09:52:36 +0100 Subject: [Spambayes] Windows compatibility - OCR [was: Unwanted stock solicitations] In-Reply-To: <003601c6ff9e$4eb04b90$0600000a@kasper> References: <003601c6ff9e$4eb04b90$0600000a@kasper> Message-ID: <1162630356.29026.110.camel@localhost.localdomain> On Sat, 2006-11-04 at 00:17 +0100, Vibe Grevsen wrote: > Hi friends, > > OCR code's now been tweaked and tested to work in both WinXP and > Win9x. > This should work in unix as well. > > Here is a summary: > > 1. Put ocrad 0.16 in the path As a note, for Windows you need a copy of ocrad with skip patch that opens pnm files in binary mode otherwise ocrad will fail on a lot of files. Have you tried other ocr programs? I tried gocr and I think that its result are somewhat better but version 0.41 + pgm patch almost hangs (read it takes a _very_ long time to complete and uses all the available cpu) with some images, version 0.40 crashes on some other different kind of images. For Linux the latter result is somewhat better that the former (you get no tokens from the image but no other harm) but on Windows you get the Dr. Watson report windows that block the process (of course I may disable it but it's a system/user configuration). -- Luigi Pugnetti Symbolic S.p.A. V.le Mentana, 29 I-43100 Parma Italy Tel: +39 0521 708811 Fax: +39 0521 776190 From qycze at h137.p062.iij4u.or.jp Sat Nov 4 13:04:37 2006 From: qycze at h137.p062.iij4u.or.jp (Dobbin Combs) Date: Sat, 4 Nov 2006 13:04:37 +0100 Subject: [Spambayes] knelt cruise ship Message-ID: <03DC37B3503.3B786CD2C@centrumurody.com.pl> An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061104/6c2a7790/attachment.htm -------------- next part -------------- A non-text attachment was scrubbed... 
Name: anytime.gif Type: image/gif Size: 8944 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061104/6c2a7790/attachment.gif From grevsen at gmail.com Sat Nov 4 14:19:56 2006 From: grevsen at gmail.com (Vibe Grevsen) Date: Sat, 4 Nov 2006 14:19:56 +0100 Subject: [Spambayes] Windows compatibility - OCR [was: Unwanted Message-ID: <0f3d01c70013$ed5ad4c0$0600000a@kasper> Hi friends, >> OCR code's now been tweaked and tested to work in both WinXP and >> Win9x. >> This should work in unix as well. >> >> Here is a summary: >> >> 1. Put ocrad 0.16 in the path > > As a note, for Windows you need a copy of ocrad with skip patch that > opens pnm files in binary mode otherwise ocrad will fail on a lot of > files. Actually you're probably refering to my "patch"? (Ocrad/CygWin1.dll) http://mail.python.org/pipermail/spambayes/2006-October/019983.html If you have MinGW experience - which I don't - I think you can compile an exe-only which don't need the dll. But then I don't know if it is actually working because of the POSIX emulation or they did change the source. (I did not...) You're right Skip pointed it out in the ocrad forum, but the developer was reluctant to change this then so I don't actually know why 0.16 is working... Just know it is, which is fine for me. > Have you tried other ocr programs? No, not yet. Tony Meyer suggested Tesseract: http://mail.python.org/pipermail/spambayes-dev/2006-September/003750.html but there seemed to be build issues... I haven't tried.. I mailed with NoSpam Today! Support (spamassasin based) before I chose SB. They were doing research on FuzzyOcr and ImageInfo. Maybe we could ask again about their results. I believe FuzzyOcr is gocr-based? > I tried gocr and I think that its result are somewhat better but version > 0.41 + pgm patch almost hangs Ok, probably needs some tweaking then. 
Since the ocr is working with ocrad and - as you see below - I get very good results I will be moving on to the next area now. I think it is far more beneficial to do more research into the actual processing as you commented elsewhere than to start the whole testing/tweaking all over again with a new ocr engine. Of course that is just my opinion... >> 5. Finally I sugest you change the default scale from 1 to 2 like in >> this line >> >> scale = options["Tokenizer", "ocrad_scale"] or 2 > > changing this surely doesn't hurt but ocrad_scale it's already set to 2 > in Options.py Ok, I missed that. Don't know which one has prevalence. ImageStripper.py, Options.py or bayescustomize.ini. With 2 you should get this quality image tokens: watch out here comes the big one! srrl about blow your minds add srrl your radar mon nov ob companu name: stellar resource new (otc bb:srrl.ob) sumbol: srrl prlce: tl._ targe_: tio skip:r 10 ueru s_rong buu our last feature, posted cains ouer __o_ the span weekithose- are ridiculous cainsl cet srrl nowl will makinc stunninc skip:a 10 next weekl massiue campaicns are about startl watch srrl trade monday nou obl don't left out! That is about a 90% recognition or so. > probably should be removed (or set to 2 as you suggest) Then I suggest removal as you say. Better avoid redundancy ( clutter :) ) Happy coding :) Vibe From pl at symbolic.it Sat Nov 4 15:11:04 2006 From: pl at symbolic.it (Luigi Pugnetti) Date: Sat, 04 Nov 2006 15:11:04 +0100 Subject: [Spambayes] Windows compatibility - OCR [was: Unwanted In-Reply-To: <0f3d01c70013$ed5ad4c0$0600000a@kasper> References: <0f3d01c70013$ed5ad4c0$0600000a@kasper> Message-ID: <1162649465.29026.152.camel@localhost.localdomain> On Sat, 2006-11-04 at 14:19 +0100, Vibe Grevsen wrote: > Hi friends, > > >> OCR code's now been tweaked and tested to work in both WinXP and > >> Win9x. > >> This should work in unix as well. > >> > >> Here is a summary: > >> > >> 1. 
Put ocrad 0.16 in the path > > > > As a note, for Windows you need a copy of ocrad with skip patch that > > opens pnm files in binary mode otherwise ocrad will fail on a lot of > > files. > > Actually you're probably refering to my "patch"? (Ocrad/CygWin1.dll) > http://mail.python.org/pipermail/spambayes/2006-October/019983.html > > If you have MinGW experience - which I don't - I think you can compile > an exe-only which don't need the dll. But then I don't know if it is actually > working because of the POSIX emulation or they did change the source. > (I did not...) > > You're right Skip pointed it out in the ocrad forum, but the developer was > reluctant to change this then so I don't actually know why 0.16 is working... > Just know it is, which is fine for me. I was referring to this mail http://www.nabble.com/Ocrad-opens-files-in-text-mode-t2485744.html probably if you use cygwin emulation layer you have no problem with binary/text file. I have no experience with mingw but I compiled ocrad using it and I'm using the result (without cygwin dll) with no problem, but you have to open the pnm file in binary mode. > > > > > Have you tried other ocr programs? > > No, not yet. > > Tony Meyer suggested Tesseract: > http://mail.python.org/pipermail/spambayes-dev/2006-September/003750.html > but there seemed to be build issues... I haven't tried.. I built tesseract with no problem. I have done a very quick test with it and it's difficult to use (at least I was not able (on windows) to get any results if the image wasn't in the same folder of the executable). I tested few spam images and the results were poor. > > I mailed with NoSpam Today! Support (spamassasin based) before I chose SB. > They were doing research on FuzzyOcr and ImageInfo. Maybe we could ask > again about their results. I believe FuzzyOcr is gocr-based? Yes, they are using gocr. But as I said in my previous mail it has its own problems. 
> > > > > I tried gocr and I think that its result are somewhat better but version > > 0.41 + pgm patch almost hangs > > Ok, probably needs some tweaking then. > Since the ocr is working with ocrad and - as you see below - I get very > good results I will be moving on to the next area now. You are lucky. My results are so so. Probably I get a reduction of a 60/70% of spam with images (which in itself could be considered not bad) but way too much spam is not stopped. I'm going to recheck my environment to see if something is wrong. > > I think it is far more beneficial to do more research into the actual processing > as you commented elsewhere than to start the whole testing/tweaking all over > again with a new ocr engine. Of course that is just my opinion... Yes and no. We need a decent ocr engine to start with than we may focus on better image manipulation. At the moment spambayes have trouble with image for the following reason: - PIL sometimes fail to handle the image. I'm still investigating the issue but the images seems reasonably correct (IE, Firefox and many viewers, on linux and windows, are able to display them). It's quite rare and not a big issue - ocr results are poor. The worst case are when you get a sequenze of chars (char space char space ...) or a long word. both are ignored by spambayes. There are images which contain more than words and in this case we may get no tokens. In few cases if the colors used inside the image are changed you get a different result. I have no knowledge of image processing but I tried few simple operations (like scaling, sharpening, convert to gray, ...) but I got no results. They were all quick tests and the result are in no way conclusive. > > > >> 5. Finally I sugest you change the default scale from 1 to 2 like in > >> this line > >> > >> scale = options["Tokenizer", "ocrad_scale"] or 2 > > > > changing this surely doesn't hurt but ocrad_scale it's already set to 2 > > in Options.py > > Ok, I missed that. 
Don't know which one has prevalence. > ImageStripper.py, Options.py or bayescustomize.ini. from my understanding in Options.py you set the default values, bayescustomize.ini contain the values chosen by the user an in Imagestripper.py the programmer may embed it's values ignoring the user choice (joking) > > With 2 you should get this quality image tokens: > > watch > out > here > comes > the > big > one! > srrl > about > blow > your > minds > add > srrl > your > radar > mon > nov > ob > companu > name: > stellar > resource > new > (otc > bb:srrl.ob) > sumbol: > srrl > prlce: > tl._ > targe_: > tio > skip:r 10 > ueru > s_rong > buu > our > last > feature, > posted > cains > ouer > __o_ > the > span > weekithose- > are > ridiculous > cainsl > cet > srrl > nowl > will > makinc > stunninc > skip:a 10 > next > weekl > massiue > campaicns > are > about > startl > watch > srrl > trade > monday > nou > obl > don't > left > out! > > That is about a 90% recognition or so. Yes, sometimes the results are good and sometimes are much worst. In few cases a scaling factor of 3 it's better. Just now I'm doing a retraining with ocrad_scale set to 3. we will see in the next days if the result are better or worst > > > > probably should be removed (or set to 2 as you suggest) > > Then I suggest removal as you say. Better avoid redundancy ( clutter :) ) > > > > Happy coding :) > > Vibe > _______________________________________________ > SpamBayes at python.org > http://mail.python.org/mailman/listinfo/spambayes > Check the FAQ before asking: http://spambayes.sf.net/faq.html -- Luigi Pugnetti Symbolic S.p.A. V.le Mentana, 29 I-43100 Parma Italy Tel: +39 0521 708811 Fax: +39 0521 776190 From grevsen at gmail.com Sat Nov 4 19:27:40 2006 From: grevsen at gmail.com (Vibe Grevsen) Date: Sat, 4 Nov 2006 19:27:40 +0100 Subject: [Spambayes] Windows compatibility - OCR [was: Unwanted stock solicitations] Message-ID: <000701c7003e$ea0bc240$0600000a@kasper> Hi friends, >> >> 1. 
Put ocrad 0.16 in the path > I have no experience with mingw but I compiled ocrad > using it and I'm using the result (without cygwin dll) with no problem, Ok, but note that the sources posted in spambayes-something was 0.15! New version 0.16 can be downloaded here: http://ftp.gnu.org/gnu/ocrad/ocrad-0.16.tar.bz2 According to the changelog the character recognition was improved. If you built a 0.16 exe without cygwin1.dll I would like to test it. Can you post it somewhere together with a short desciption of how it was built? "Pretty please with sugar on top". >> > Have you tried other ocr programs? >> >> No, not yet. >> >> Tony Meyer suggested Tesseract: > I built tesseract with no problem. ... > I tested few spam images and the results were poor. >> I mailed with NoSpam Today! Support (spamassasin based) before I chose SB. >> They were doing research on FuzzyOcr and ImageInfo. Maybe we could ask >> again about their results. I believe FuzzyOcr is gocr-based? > > Yes, they are using gocr. But as I said in my previous mail it has its > own problems. Ok then it has at least been tried ... >> Since the ocr is working with ocrad and - as you see below - I get very >> good results I will be moving on to the next area now. > You are lucky. My results are so so. Probably I get a reduction of a > 60/70% of spam with images (which in itself could be considered not bad) > but way too much spam is not stopped. I expect results to vary and it is too early in my testing to tell, but today SB caught 17 of 18 spams. I changed spam cutoff to 0.7 however that didn't even seem nescessary. My database contains 845 spams and 1411 hams. Zero false positives! >> I think it is far more beneficial to do more research into the actual processing >> as you commented elsewhere than to start the whole testing/tweaking all over >> again with a new ocr engine. Of course that is just my opinion... > > Yes and no. 
We need a decent ocr engine to start with than we may focus > on better image manipulation. Yes... >> At the moment spambayes have trouble with image for the following >> reason: > > - PIL sometimes fail to handle the image. I'm still investigating the > issue but the images seems reasonably correct (IE, Firefox and many > viewers, on linux and windows, are able to display them). It's quite > rare and not a big issue Not an ocr problem, but a preprocessing problem... It's great that you are looking into this because I for one don't know python well enough to debug such issues. > - ocr results are poor. The worst case are when you get a sequenze of > chars (char space char space ...) or a long word. both are ignored by > spambayes Tokenizer problem, configurable. Not related to the ocr engine. > - There are images which contain more than words and in this case we may > get no tokens. I have seen many animations with moving artefacts. Usually not a problem, but it may be in the future. Again some filtering - which is preprocessing - might be a brilliant idea. > In few cases if the colors used inside the image are changed you get a > different result. We should work on filtering and histogram analysis to determine the correct threshold level for the ocr. If we find a better way than what ocrad already does then we can pass it via the -T parameter. Advanced filtering can even detect repetitive patterns or noise in the background and remove that. Sure a professional ocr engine like OmniPage Pro does huge amounts of preprocessing like i.e. automatic rotation correction etc. already, but that does not yet seem nescessary for our purpose... > I have no knowledge of image processing but I tried few simple > operations (like scaling, sharpening, convert to gray, ...) but I got no > results. They were all quick tests and the result are in no way > conclusive. I did a course in image analysis. 
I don't know python / PIL, but I could probably do some tests in Matlab when my numbers start to deteriorate. If you have a way to batch-extract images from emails or from a dbx-file or if you send me a zip of 100+ problematic spam images then I would be happy to run some tests i.e. on best scale factor and scaling algorithm. I can batch-convert them so only worry about extraction. > from my understanding in Options.py you set the default values, > bayescustomize.ini contain the values chosen by the user an in > Imagestripper.py the programmer may embed it's values ignoring the user > choice (joking) Something like that I think :) Did you try to change this in ImageStripper.py and did it make any change to the output? >> With 2 you should get this quality image tokens: >> >> watch >> out >> here >> comes >> the >> big >> one! ... >> That is about a 90% recognition or so. > Yes, sometimes the results are good and sometimes are much worst. In few > cases a scaling factor of 3 it's better. Just now I'm doing a retraining > with ocrad_scale set to 3. we will see in the next days if the result > are better or worst Yes, my initial suggestion was scaling by 4, but Skip argued to use 2. He did tests with different scales already. Intuitively a larger scale should be better. I found however that it slowed down the analysis. Now I don't know what ocrad does, but resampling might be better than resizing. Happy coding :) Vibe PS: How come my posts always show up as new threads? Using OE. Don't want to subscribe. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/spambayes/attachments/20061104/40e30dbd/attachment.htm From Rich at RBarger.com Sat Nov 4 23:50:31 2006 From: Rich at RBarger.com (Richard B Barger ABC APR) Date: Sat, 04 Nov 2006 16:50:31 -0600 Subject: [Spambayes] List confusion Message-ID: <454D1937.6C2D9D25@RBarger.com> I've found several of the recent exchanges interesting, but technically way over my head. Should some of this discussion be moved to the developer's list, rather than being on the general information list, or am I the only one who is mostly befuddled? Rich Barger From amedee at amedee.be Sun Nov 5 00:28:33 2006 From: amedee at amedee.be (Amedee Van Gasse) Date: Sun, 5 Nov 2006 00:28:33 +0100 (CET) Subject: [Spambayes] List confusion In-Reply-To: <454D1937.6C2D9D25@RBarger.com> References: <454D1937.6C2D9D25@RBarger.com> Message-ID: <1064.213.118.146.74.1162682913.squirrel@amedee.be> On Sat, November 4, 2006 23:50, Richard B Barger ABC APR said: > > I've found several of the recent exchanges interesting, but technically > way over my head. > > Should some of this discussion be moved to the developer's list, rather > than being on the general information list, or am I the only one who is > mostly befuddled? > If you are talking about the recent ocrad-related mails, I think you might have a valid point :-) However when the developed people come to a conclusion, it would be nice if we, mere mortals, get a brief non-technical summary. 
-- Amedee Van Gasse amedee at amedee.be From skip at pobox.com Sun Nov 5 19:50:30 2006 From: skip at pobox.com (skip at pobox.com) Date: Sun, 5 Nov 2006 12:50:30 -0600 Subject: [Spambayes] Windows compatibility - OCR [was: Unwanted stock solicitations] In-Reply-To: <000701c7003e$ea0bc240$0600000a@kasper> References: <000701c7003e$ea0bc240$0600000a@kasper> Message-ID: <17742.12918.349164.181405@montanaro.dyndns.org> I invite those of you working on the OCR stuff on Windows to subscribe to the spambayes-dev mailing list if you are not already subscribed: http://mail.python.org/mailman/listinfo/spambayes-dev Also, you should read the README-DEVEL.txt file in the top level directory of the CVS repository, especially if you want to test the various settings and have some hope of making apples-to-apples comparisons. Skip From skip at pobox.com Sun Nov 5 20:12:32 2006 From: skip at pobox.com (skip at pobox.com) Date: Sun, 5 Nov 2006 13:12:32 -0600 Subject: [Spambayes] Unwanted stock solicitations In-Reply-To: <1162478403.27899.22.camel@localhost.localdomain> References: <000b01c6f63c$78785320$0600000a@kasper> <17724.54543.797457.531620@montanaro.dyndns.org> <003901c6f6be$3b39a310$0600000a@kasper> <17724.65310.502709.225367@montanaro.dyndns.org> <001301c6fe8a$6a6cb950$0600000a@kasper> <1162478403.27899.22.camel@localhost.localdomain> Message-ID: <17742.14240.9321.436394@montanaro.dyndns.org> Okay, I'm finally actually editing the necessary files. 2> is supported by WinNT, 2k and XP I just newer saw it used before. 2> is not supported in Win9x and ME. I don't think we care about Win9x or WinME (though someone should feel free to demonstrate my ignorance here). >> However /dev/null is - of course - not found in Windows. Equivalent >> is nul (case insensitive). Better use os.path.devnull like shown >> here. Parenthesis required for string formatting! Correct. Will be checked in shortly. 
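
A minimal sketch of the null-device point above, not the code that was checked in to ImageStripper.py. It assumes the goal is simply to run an external program and discard its stderr on both Unix and Windows. The helper name run_quietly is hypothetical. The os module exposes the platform's null device as os.devnull, and passing the command as an argument list avoids the shell entirely, so "2>" support and quoting of paths are not a concern.

```python
import os
import subprocess
import sys


def run_quietly(argv):
    """Run argv (a list of strings), return its stdout as text, drop stderr."""
    # os.devnull is "/dev/null" on Unix and "nul" on Windows.
    with open(os.devnull, "w") as devnull:
        proc = subprocess.Popen(argv, stdout=subprocess.PIPE, stderr=devnull)
        out, _ = proc.communicate()
    return out.decode("utf-8", "replace")


# Example with a stand-in child process that writes to both streams:
#   run_quietly([sys.executable, "-c",
#                "import sys; print('text'); sys.stderr.write('noise')"])
# Only the stdout text comes back; the stderr output is discarded.
```

With an OCR program the list would hold the executable path and its arguments as separate items, so a temporary file name containing spaces would need no extra quoting.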
    Luigi> On windows you have to put quote around pnmfile to protect
    Luigi> against space in path ...

Not a problem here, since the pnmfile is named using the tempfile.mkstemp
function.  It won't contain any characters which require special treatment.

    >> Finally I was surprised to find that
    >>
    >>     ocrad -s4 -x out.txt >ocr.txt logo.pgm
    >>
    >> did produce an ocr.txt but no out.txt for this image
    >> http://www.unlockaarhus.dk/dev/logo.pgm.
    >>
    >> Maybe it's only a problem with small images? Could you please test if
    >> this is the case under Unix as well?

    Luigi> using -s (and other flags as well) disable -x.

Hmmm...  That sucks.  I see the lines in ocrad's code where that happens.  I
mailed a note to bug-ocrad asking why this is so.  Hopefully it's just a
simple bug that can be squashed.

    Luigi> orf file is never used. probably is there from the start before
    Luigi> skip introduce the scale parameter

Actually, yes, it is used:

    for line in open(orf):
        if line.startswith("lines"):
            nlines = int(line.split()[1])
            if nlines:
                ctokens.add("image-text-lines:%d" % int(log2(nlines)))

so no image-text-lines:NN tokens are generated.

Skip

From skip at pobox.com Sun Nov 5 20:19:35 2006
From: skip at pobox.com (skip at pobox.com)
Date: Sun, 5 Nov 2006 13:19:35 -0600
Subject: [Spambayes] image analyze
In-Reply-To: <1162484535.27899.38.camel@localhost.localdomain>
References: <1162484535.27899.38.camel@localhost.localdomain>
Message-ID: <17742.14663.428618.258080@montanaro.dyndns.org>

    Luigi> I think there is an errore in the frame selection inside
    Luigi> PIL_decode_parts().

    Luigi> for frame in ImageSequence.Iterator(image):
    Luigi>     # Assume the pixel with the largest value is the
    Luigi>     # background.
    Luigi>     bg = max(frame.histogram())
    Luigi>     if bg < bgpix:
    Luigi>         image = frame
    Luigi>         bgpix = bg

Luigi, I can't tell what you're doing in this change.  Can you describe it?
Also, in the future, please submit patches and bug reports to the SourceForge tracker (as context or unified diffs, not as complete replacements). Luigi> A part from this the algorithm could be better. From my test Luigi> using stddev and sum2 yield better result. No doubt my algorithm could be better. I'm afraid I just don't know what yours does and the ImageStat.Stat documentation isn't much help. ;-) Skip From skip at pobox.com Sun Nov 5 20:22:13 2006 From: skip at pobox.com (skip at pobox.com) Date: Sun, 5 Nov 2006 13:22:13 -0600 Subject: [Spambayes] Analyzing text in image spam (was: Spam in Images) In-Reply-To: <1162567229.29026.64.camel@localhost.localdomain> References: <002001c6b5a9$3e36ff50$6701a8c0@Caladan> <21ae1b060608171957q7f4311baw900c0fa51cc41530@mail.gmail.com> <17637.12107.629913.149910@montanaro.dyndns.org> <17637.49519.192540.305301@montanaro.dyndns.org> <1162567229.29026.64.camel@localhost.localdomain> Message-ID: <17742.14821.798671.601512@montanaro.dyndns.org> >> Once you're ready to go, add the following to your SpamBayes options: >> >> x-lookup_ip: True >> lookup_ip_cache: ~/.dnscache >> Luigi> Is someone using this option? To me seems that this option alone Luigi> do nothing. You have to enable both x-lookup_ip and Luigi> x-pick_apart_urls. Is it right or am I missing something? Yes, you are right. x-pick_apart_urls needs to be enabled to use x-lookup_ip. My apologies for not catching that before. 
Skip From skip at pobox.com Sun Nov 5 20:24:05 2006 From: skip at pobox.com (skip at pobox.com) Date: Sun, 5 Nov 2006 13:24:05 -0600 Subject: [Spambayes] Analyzing text in image spam (was: Spam in Images) In-Reply-To: <1162571534.29026.80.camel@localhost.localdomain> References: <002001c6b5a9$3e36ff50$6701a8c0@Caladan> <21ae1b060608171957q7f4311baw900c0fa51cc41530@mail.gmail.com> <17637.12107.629913.149910@montanaro.dyndns.org> <17637.49519.192540.305301@montanaro.dyndns.org> <1162567229.29026.64.camel@localhost.localdomain> <17739.26263.153047.731107@montanaro.dyndns.org> <1162571534.29026.80.camel@localhost.localdomain> Message-ID: <17742.14933.845491.787341@montanaro.dyndns.org> Luigi> Once both are enabled it seems to work but the mail processing is Luigi> very very slow. >> First time through, yes. After that, it should (in theory) rely on >> its cache of IP address information. I may have some pending >> checkins for that though (*). Note also that a fairly small training >> database works for me (fewer than 100 hams, 250-300 spams). If you >> have a massive training database, then, yes, this will slow things >> down dramatically. The IP lookup and image OCR stuff changes the >> properties of your database enough that I think it's worth retraining >> from scratch. Luigi> I have tried on a sample of 5000 emails but I stopped it because Luigi> after more than half an hour it didn't finish. From tcpdump I Luigi> could see a request every 1,2 seconds (or something like that) Luigi> now even considering that not every mail contains an url it was Luigi> very slow. As a note I tried it on windows XP with ocr scanning Luigi> enabled but ocr alone was much faster. I can't imagine a scenario where I would need 5000 emails to get decent results with SpamBayes. If that was the common case, everyone would give up on it long before it was of any use. I still suggest you try starting from scratch. 
Skip

From skip at pobox.com Sun Nov 5 20:29:39 2006
From: skip at pobox.com (skip at pobox.com)
Date: Sun, 5 Nov 2006 13:29:39 -0600
Subject: [Spambayes] Windows compatibility - OCR [was: Unwanted stock solicitations]
In-Reply-To: <003601c6ff9e$4eb04b90$0600000a@kasper>
References: <003601c6ff9e$4eb04b90$0600000a@kasper>
Message-ID: <17742.15267.613062.117876@montanaro.dyndns.org>

    Vibe> 4. Change this

    Vibe>     for line in open(orf):
    Vibe>         if line.startswith("lines"):
    Vibe>             nlines = int(line.split()[1])
    Vibe>             if nlines:
    Vibe>                 ctokens.add("image-text-lines:%d" %
    Vibe>                             int(log2(nlines)))

    Vibe> into this

    Vibe>     nlines = ctext.count('\n')
    Vibe>     if nlines:
    Vibe>         ctokens.add("image-text-lines:%d" %
    Vibe>                     nlines )

Not the same:

    % ocrad -x out.txt -o ocr.txt logo.pgm
    % wc -l ocr.txt
    2 ocr.txt
    % cat out.txt
    # Ocr Results File. Created by GNU Ocrad version 0.15
    source file logo.pgm
    total text blocks 1
    text block 1 0 0 199 50
    lines 1
    line 1 chars 15 height 11
    26 29 5 2; 1, '-'0
    31 23 7 12; 1, ' '0
    38 22 10 13; 2, 'U'1, 'u'0
    51 25 8 10; 1, 'n'0
    62 22 2 12; 2, 'l'1, '|'0
    67 11 23 24; 0
    89 22 7 13; 1, 'h'0
    96 23 7 12; 1, ' '0
    103 21 11 14; 1, 'A'0
    118 25 5 10; 1, 'r'0
    125 22 8 13; 1, 'h'0
    136 25 8 10; 1, 'u'0
    146 25 9 10; 1, '5'0
    155 23 7 12; 1, ' '0
    162 29 5 2; 1, '-'0

Note that the out.txt file suggests there is only one line in the file while
the actual file contains two.  It appears that's simply an off-by-one issue
(maybe ocrad always adds a blank line to the end of its output text), though
I've only looked at the above case and one other.
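
To make the discrepancy concrete, here is a small sketch (not the ImageStripper.py code) that contrasts counting newline characters with counting only the lines that actually contain text. The function names are hypothetical and the sample string is made up to mimic OCR output that ends in a blank line. Using bit_length() is an assumption of this sketch, standing in for the int(log2(nlines)) bucketing of the original loop.

```python
def count_newlines(ctext):
    """Line count as ctext.count('\\n') sees it, blank lines included."""
    return ctext.count("\n")


def count_text_lines(ctext):
    """Count only lines that contain something other than whitespace."""
    return sum(1 for line in ctext.splitlines() if line.strip())


def image_text_lines_token(ctext):
    """Build an image-text-lines token bucketed by floor(log2(nlines))."""
    nlines = count_text_lines(ctext)
    if not nlines:
        return None
    # For a positive int, bit_length() - 1 equals floor(log2(n)) exactly.
    return "image-text-lines:%d" % (nlines.bit_length() - 1)


# Made-up OCR text: one line of characters followed by a blank line.
sample = "- Unl h Arhu5 -\n\n"
```

Skipping blank lines sidesteps the question of whether a trailing empty line is always present, which a fixed "- 1" correction would have to assume.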
Skip

From skip at pobox.com Sun Nov 5 20:39:44 2006
From: skip at pobox.com (skip at pobox.com)
Date: Sun, 5 Nov 2006 13:39:44 -0600
Subject: [Spambayes] Unwanted stock solicitations
In-Reply-To: <17742.14240.9321.436394@montanaro.dyndns.org>
References: <000b01c6f63c$78785320$0600000a@kasper>
 <17724.54543.797457.531620@montanaro.dyndns.org>
 <003901c6f6be$3b39a310$0600000a@kasper>
 <17724.65310.502709.225367@montanaro.dyndns.org>
 <001301c6fe8a$6a6cb950$0600000a@kasper>
 <1162478403.27899.22.camel@localhost.localdomain>
 <17742.14240.9321.436394@montanaro.dyndns.org>
Message-ID: <17742.15872.181690.836868@montanaro.dyndns.org>

    >>> However /dev/null is - of course - not found in Windows. Equivalent
    >>> is nul (case insensitive). Better use os.path.devnull like shown
    >>> here. Parenthesis required for string formatting!

    skip> Correct.  Will be checked in shortly.

More Windows-friendly executable location and program execution has been
checked in for ImageStripper.py.

    Luigi> orf file is never used. probably is there from the start before
    Luigi> skip introduce the scale parameter

    skip> Actually, yes, it is used:

    skip>     for line in open(orf):
    skip>         if line.startswith("lines"):
    skip>             nlines = int(line.split()[1])
    skip>             if nlines:
    skip>                 ctokens.add("image-text-lines:%d" %
    skip>                             int(log2(nlines)))

    skip> so no image-text-lines:NN tokens are generated.

But it seemed better to use the count() suggestion, so I did.  Now we don't
care if the -s/-x thing in ocrad is a bug or not. ;-)

Skip

From grevsen at gmail.com Sun Nov 5 23:16:08 2006
From: grevsen at gmail.com (Vibe Grevsen)
Date: Sun, 5 Nov 2006 23:16:08 +0100
Subject: [Spambayes] Windows compatibility - OCR [was: Unwanted stock solicitations]
References: <003601c6ff9e$4eb04b90$0600000a@kasper>
 <17742.15267.613062.117876@montanaro.dyndns.org>
Message-ID: <000801c70127$ffc9b530$0600000a@kasper>

Hi there :)

> Vibe> 4.
Change this
>
>     Vibe>     for line in open(orf):
>     Vibe>         if line.startswith("lines"):
>     Vibe>             nlines = int(line.split()[1])
>     Vibe>             if nlines:
>     Vibe>                 ctokens.add("image-text-lines:%d" %
>     Vibe>                             int(log2(nlines)))
>
>     Vibe> into this
>
>     Vibe>     nlines = ctext.count('\n')
>     Vibe>     if nlines:
>     Vibe>         ctokens.add("image-text-lines:%d" %
>     Vibe>                     nlines )
>
> Not the same:
...
> Note that the out.txt file suggests there is only one line in the file while
> the actual file contains two. It appears that's simply an off-by-one issue
> (maybe ocrad always adds a blank line to the end of its output text), though
> I've only looked at the above case and one other.

You're right. Simply off-by-one. Tested on five images.

    nlines = ctext.count('\n') - 1

I also noted that the line number was often different from the perceived
line count. (I.e. if you look at the image and try to estimate the number of
lines). If python supports regexp's we could strip empty lines from the
output before the count... It may be a good idea, but I suspect it is not
significant however.

Happy coding :)
Vibe

From ltusly at sbcglobal.net Mon Nov 6 16:25:19 2006
From: ltusly at sbcglobal.net (L Tusly)
Date: Mon, 6 Nov 2006 10:25:19 -0500
Subject: [Spambayes] Problem w/Spam Bayes
Message-ID:

I have been using Spam Bayes successfully for 2-3 years with no complaint.
Starting yesterday, when I go in my Inbox and hit Spam the first time, it
works successfully; any times after that, it doesn't. Do I need to reload?

Laurie
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/spambayes/attachments/20061106/c9a6a791/attachment.htm

From kathiu at verizon.net Mon Nov 6 18:43:31 2006
From: kathiu at verizon.net (kathi)
Date: Mon, 06 Nov 2006 09:43:31 -0800
Subject: [Spambayes] problem
Message-ID: <000601c701cb$138aa270$6701a8c0@owner8686f9765>

Hi, sorry but I am having a problem and unable to fix it.
I accidently lost my junk suspects ( spam) on list and now when I get some emails they are there and then instantly gone....I can't find them anywhere. ( been waiting for a airline confirmation from Delta) 1. Where are those emails? (not in the regular spam or inbox) 2. how do I fix the problem? thanks so much Kathi Urie Shoreline Wa kathiu at verizon.net -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061106/0aed823f/attachment.htm From jeff at jldsystems.com Mon Nov 6 23:58:19 2006 From: jeff at jldsystems.com (Jeff Dombach) Date: 6 Nov 2006 22:58:19 -0000 Subject: [Spambayes] Trouble with Spambayes Message-ID: <20061106225819.18961.qmail@ns1.rnrtech.com> Hello all, I have been using Spambayes for awhile but it quit working. I have read the troubleshooting guide. I uninstalled and reinstalled the outlook add-in several times. I have no toolbar buttons. It does not appear that any email is being moved to the junk folder. Help, I can't stand filtering 50-60 pieces of spam manually. Enthusiastically, Jeff Dombach, Senior Principal Designer JLD Systems...Your source for embedded solutions! 2818 Marietta Avenue Lancaster PA 17601 Cell 717.314.3792 jeff at jldsystems.com AIM JLDSystems From Felix.Lai at Bain.com Tue Nov 7 02:28:50 2006 From: Felix.Lai at Bain.com (Lai, Felix) Date: Tue, 7 Nov 2006 09:28:50 +0800 Subject: [Spambayes] Spambayes in Outlook 2003 using IMAP Message-ID: Hi there, I finally got Spambayes Outlook plug-in to work with Outlook 2003 using IMAP email mode, however the "Spam" column I've created in Inbox is always empty even though filtering works properly, once the email ends up in Junk or Junk Suspect, the score shows up, but not when it's still in Inbox. How come and is there a cure? :) Felix -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/spambayes/attachments/20061107/bac926a2/attachment.htm From bruss at westnet.com.au Tue Nov 7 13:02:51 2006 From: bruss at westnet.com.au (Roland) Date: Tue, 7 Nov 2006 20:02:51 +0800 Subject: [Spambayes] OE problem Message-ID: <009801c70264$a6dee880$0a00a8c0@office> I can't get much satisfaction out of SpamBayes. Maybe I am doing it all wrong. Is SpamBayes only for MS Outlook and not for Outlook Express (OE). Every advice I could find so far was always for Outlook. I find it quite cumbersome to go into the message properties --> details --> message source and then copy and paste the message into "Train on a message, mbox file or dbx file" and but I'm still getting all the usual annoying nonsense I thought I would get rid off after starting to use SpamBayes. Please help if you can! Roland -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061107/a3f50a27/attachment.html From cjw at sympatico.ca Tue Nov 7 14:01:18 2006 From: cjw at sympatico.ca (Colin J. Williams) Date: Tue, 07 Nov 2006 08:01:18 -0500 Subject: [Spambayes] Use of spambyes with any other mail package than outlook express In-Reply-To: <17737.18451.855617.113090@montanaro.dyndns.org> References: <45493E98.20101@sympatico.ca> <17737.18451.855617.113090@montanaro.dyndns.org> Message-ID: <4550839E.4090305@sympatico.ca> skip at pobox.com wrote: > Colin> This is to enquire whether anyone has successfully used the new > Colin> version, other than with Outlook Express. > > Colin> I had a stab at it, but failed. If anyone has succeeded, I will > Colin> try again. > > I use SpamBayes just fine on a Mac. Can you describe what your environment > is and what doesn't work? > > Skip > > Skip, Some days ago, I responded: I'll try again and try to respond to Skip's question in some days. Meanwhile, I'm happy with the crude Firefox filter which has produced not false positives so far. 
The filter picks up messages with "image/gif".

Colin W

Since then, I've found that the Firefox filter is adequate for my needs.
I'll wait for a more mature version of SpamBayes.

Good luck with your project and thanks for your help.

Colin W.

From jarribas at bioingenieria.es Tue Nov 7 16:19:27 2006
From: jarribas at bioingenieria.es (=?iso-8859-1?Q?Jes=FAs_Arribas?=)
Date: Tue, 7 Nov 2006 16:19:27 +0100
Subject: [Spambayes] Spambayes problem
Message-ID:

Spambayes doesn't work with Outlook 2003 rules.

My rules are "IF the email is for the account A move it to the folder B"
because I have to manage several email accounts.

If I disable my Outlook 2003 rules, Spambayes works perfectly and all my spam
is removed and moved to my Junk Email folder.

If I enable my Outlook 2003 rules, the emails are not moved to my Junk Email
folder.

If I use the utility Filtering Messages ---> Start Now, all the spam messages
that were not moved are correctly moved to my Junk Email folder.

All of this happens regardless of how I configure the Background filtering.
I have tried all the positions of the timers and the problem goes on.

So, what can I do?

Thank you in advance.

Jesús Arribas Calonge
Departamento I+D
--------------------------------------------------------------------------------
BIOINGENIERIA ARAGONESA S.L.
Polígono Industrial Alcalde Caballero
C/ Monasterio de las Huelgas 2, nave 6
50.014 ZARAGOZA
Tfno: (+34) 976 798 200
Fax: (+34) 976 798 201
www.sabia.es
--------------------------------------------------------------------------------
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/spambayes/attachments/20061107/973a9247/attachment.htm
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available Type: image/gif Size: 861 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061107/973a9247/attachment.gif -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 82 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061107/973a9247/attachment-0001.gif From bishop at aeroprise.com Tue Nov 7 19:59:32 2006 From: bishop at aeroprise.com (Peter Bishop) Date: Tue, 7 Nov 2006 10:59:32 -0800 Subject: [Spambayes] good karma or not? Message-ID: I would like to get some good karma. I am fairly new to SpamBayes, but as CTO of my company, I had designed an approach to spam, and asked my field support manager to start implementing it, when he told me he was already using SpamBayes with great success. Your approach was very similar to the approach I had designed, and so I investigated your software over the course of a couple of days, and I love it! I am now recommending SpamBayes as our corporate answer to spam, and providing some guidance within the company to employees who want to use it. I have also started tentatively answering some of the questions on this list, but I notice that the emails go directly to the person asking the question. Does anyone know how many answers each question gets, and how good they all are? Should I keep doing this, or are others giving much better advice? Peter Bishop Aeroprise, Inc. Take advantage of the Aeroprise Enterprise Discovery and Personalization System for both Smart Clients and standard browsers available only with the Aeroprise Mobile Gateway. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061107/c119d77d/attachment.html From skip at pobox.com Tue Nov 7 20:35:17 2006 From: skip at pobox.com (skip at pobox.com) Date: Tue, 7 Nov 2006 13:35:17 -0600 Subject: [Spambayes] good karma or not? 
In-Reply-To: References: Message-ID: <17744.57333.221675.530725@montanaro.dyndns.org> Peter> I have also started tentatively answering some of the questions Peter> on this list, but I notice that the emails go directly to the Peter> person asking the question. Does anyone know how many answers Peter> each question gets, and how good they all are? Should I keep Peter> doing this, or are others giving much better advice? Peter, Generally speaking, a "reply all" is appropriate when answering questions which were posted to the list. The list is set up to not munge the Reply-To header, for several good reasons: http://woozle.org/~neale/papers/reply-to-still-harmful.html So, keep on answering questions. The good karma will continue to flow. I have no idea how good the questions or the answers are. Skip From jsp at PKC.com Tue Nov 7 20:40:20 2006 From: jsp at PKC.com (Jesse Pelton) Date: Tue, 7 Nov 2006 14:40:20 -0500 Subject: [Spambayes] good karma or not? In-Reply-To: Message-ID: <16E2027582CDB74180896CDB4B8CC1F901D04B6D@PKCVT01.pkc.com> Thanks for helping out! It frees up the developers to focus on development (or the rest of their lives), lets them know you appreciate their good work, spreads the load, and fills in the gaps when other users on the list (like me) aren't available for some reason. If you use your e-mail client's "Reply to all," everyone on the list should see the response. The mailing list software is smart enough that it only sends the message once to the person you're replying to. For instance, your address appears in my address list along with spambayes at python.org, but you should only receive one copy of this message. -jesse- ________________________________ From: spambayes-bounces+jsp=pkc.com at python.org [mailto:spambayes-bounces+jsp=pkc.com at python.org] On Behalf Of Peter Bishop Sent: Tuesday, November 07, 2006 2:00 PM To: spambayes at python.org Subject: [Spambayes] good karma or not? I would like to get some good karma. 
I am fairly new to SpamBayes, but as CTO of my company, I had designed an approach to spam, and asked my field support manager to start implementing it, when he told me he was already using SpamBayes with great success. Your approach was very similar to the approach I had designed, and so I investigated your software over the course of a couple of days, and I love it! I am now recommending SpamBayes as our corporate answer to spam, and providing some guidance within the company to employees who want to use it. I have also started tentatively answering some of the questions on this list, but I notice that the emails go directly to the person asking the question. Does anyone know how many answers each question gets, and how good they all are? Should I keep doing this, or are others giving much better advice? Peter Bishop Aeroprise, Inc. Take advantage of the Aeroprise Enterprise Discovery and Personalization System for both Smart Clients and standard browsers available only with the Aeroprise Mobile Gateway. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061107/a205e5d4/attachment.htm From shawn at 12pointdesign.com Wed Nov 8 02:27:52 2006 From: shawn at 12pointdesign.com (Shawn K. Hall) Date: Tue, 7 Nov 2006 17:27:52 -0800 Subject: [Spambayes] Spambayes problem In-Reply-To: Message-ID: <105501c702d5$1c2ed7b0$1e00a8c0@Yoda> > Spambayes doesn't work with Outlook 2003 rules. Yes it does. You need to have SpamBayes ALSO filter the target folder of your message rule, not just the inbox. * Spambayes > Spambayes Manager > Filtering > "filter folders as messages arrive" > browse. You can select more than one. Regards, Shawn K. 
Hall http://12PointDesign.com/ From dontreply at abbey.co.uk Tue Nov 7 21:48:35 2006 From: dontreply at abbey.co.uk (dontreply at abbey.co.uk) Date: Tue, 07 Nov 2006 15:48:35 -0500 Subject: [Spambayes] Security Alert Your Account Was Suspend Message-ID: An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061107/7033c971/attachment.htm From Peggy at flscs.com Wed Nov 8 20:41:09 2006 From: Peggy at flscs.com (Peg Wiederholt) Date: Wed, 8 Nov 2006 12:41:09 -0700 Subject: [Spambayes] OOPS! Message-ID: <177EF33D86F3984ABEBF53E94A39428401CDCD@network-servant.flscs.local> I sent something to Spam that I didn't want to. How do I get it back? Thanks, -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061108/8fd00e75/attachment.htm From skip at pobox.com Wed Nov 8 22:00:36 2006 From: skip at pobox.com (skip at pobox.com) Date: Wed, 8 Nov 2006 15:00:36 -0600 Subject: [Spambayes] OOPS! In-Reply-To: <177EF33D86F3984ABEBF53E94A39428401CDCD@network-servant.flscs.local> References: <177EF33D86F3984ABEBF53E94A39428401CDCD@network-servant.flscs.local> Message-ID: <17746.17780.587407.265909@montanaro.dyndns.org> Peg> I sent something to Spam that I didn't want to. How do I get it Peg> back? Peg, I assume you're using the Outlook plugin. I think you can just open your spam mailbox, select the message and click the "not spam" button. Skip From amedee at amedee.be Thu Nov 9 13:00:11 2006 From: amedee at amedee.be (Amedee Van Gasse) Date: Thu, 9 Nov 2006 13:00:11 +0100 (CET) Subject: [Spambayes] server side spambayes In-Reply-To: <45531C9E.22FE.00C6.0@cdenv.be> References: <45531C9E.22FE.00C6.0@cdenv.be> Message-ID: <39923.82.146.101.169.1163073611.squirrel@amedee.be> I am using spambayes for almost a year now, and I'm very happy with it. Most spam gets caught (>95%), and no false positives after initial training. 
Currently I am using spambayes as a procmail filter, in ~/.procmailrc.
For training, I have a few imap folders: ham, unsure, spam, confirmed-ham,
confirmed-spam. On average, 1 or 2 messages per day arrive in one of the
first 3 folders. I move them to one of the confirmed folders. (This setup is
described somewhere on the wiki.) A nightly cron job then trains on the
confirmed folders.

This works great, but it's client-side filtering so it only works for me.
Recently my spouse showed interest in spambayes because the spam filtering
that our isp does is - honestly - crap. Too many false positives and too
many getting through. I'm also hosting a few (low traffic) mailing lists on
my server, for friends and for a few associations I'm in.

Because of this, I am interested in using spambayes at the server level.
I have found this page:
http://spambayes.sourceforge.net/server_side.html
with "postfix notes from Jonathan St-Andre"

I still have some questions.

If I understand it correctly, the only thing this setup does is tag emails
with a spam score, just like I am already doing in procmail. Is that a
correct interpretation? So it is still up to me to classify the emails into
the correct ham/spam folders using procmail?

Is it possible to block emails at the server level when they have a spam
score above a certain threshold? For example, in my current setup (client
side), everything with a spam score of .99 gets moved to /dev/null by
procmail. At the postfix server level, I would like to reject emails with a
spam score of .99 with an SMTP error, for example error 554: Transaction
failed. Is that possible, and if yes, how can I do that?

I might have some other questions as I go along, but let's start with these.
-- Amedee Van Gasse amedee at amedee.be From skip at pobox.com Thu Nov 9 13:19:04 2006 From: skip at pobox.com (skip at pobox.com) Date: Thu, 9 Nov 2006 06:19:04 -0600 Subject: [Spambayes] server side spambayes In-Reply-To: <39923.82.146.101.169.1163073611.squirrel@amedee.be> References: <45531C9E.22FE.00C6.0@cdenv.be> <39923.82.146.101.169.1163073611.squirrel@amedee.be> Message-ID: <17747.7352.557987.283874@montanaro.dyndns.org> Amedee> If I understand it correctly, the only thing this setup does, is Amedee> tagging emails with a spam score, just like I am already doing Amedee> in procmail. Is that a correct interpretation? So it is still Amedee> up to me to classify the emails in the correct ham/spam folders Amedee> using procmail? Is it possible to block emails at the server Amedee> level when they have a spam score above a certain treshold? In my "server side" setup, all I do is copy by personal training to the server every now and then. I set up the procmailrc on the server to forward unsure and spam to me. That keeps the crap out of most mailing lists and I still have a point at which to review the results. Skip From amedee at amedee.be Thu Nov 9 15:59:49 2006 From: amedee at amedee.be (Amedee Van Gasse) Date: Thu, 9 Nov 2006 15:59:49 +0100 (CET) Subject: [Spambayes] server side spambayes In-Reply-To: <17747.7352.557987.283874@montanaro.dyndns.org> References: <45531C9E.22FE.00C6.0@cdenv.be> <39923.82.146.101.169.1163073611.squirrel@amedee.be> <17747.7352.557987.283874@montanaro.dyndns.org> Message-ID: <54323.82.146.101.169.1163084389.squirrel@amedee.be> On Thu, November 9, 2006 13:19, skip at pobox.com said: > > Amedee> If I understand it correctly, the only thing this setup does, > is > Amedee> tagging emails with a spam score, just like I am already doing > Amedee> in procmail. Is that a correct interpretation? So it is still > Amedee> up to me to classify the emails in the correct ham/spam > folders > Amedee> using procmail? 
Is it possible to block emails at the server > Amedee> level when they have a spam score above a certain treshold? > > In my "server side" setup, all I do is copy by personal training to the > server every now and then. I set up the procmailrc on the server to > forward > unsure and spam to me. That keeps the crap out of most mailing lists and > I > still have a point at which to review the results. > > Skip > Ah yes, interesting. I'll make a mental note of that: find out the difference between ~/.procmailrc and /etc/procmailrc. I know the theory, but haven't played with it yet. But that would be off topic on this list. -- Amedee From skip at pobox.com Thu Nov 9 16:19:48 2006 From: skip at pobox.com (skip at pobox.com) Date: Thu, 9 Nov 2006 09:19:48 -0600 Subject: [Spambayes] server side spambayes In-Reply-To: <54323.82.146.101.169.1163084389.squirrel@amedee.be> References: <45531C9E.22FE.00C6.0@cdenv.be> <39923.82.146.101.169.1163073611.squirrel@amedee.be> <17747.7352.557987.283874@montanaro.dyndns.org> <54323.82.146.101.169.1163084389.squirrel@amedee.be> Message-ID: <17747.18196.565841.314635@montanaro.dyndns.org> Amedee> I'll make a mental note of that: find out the difference between Amedee> ~/.procmailrc and /etc/procmailrc. I know the theory, but Amedee> haven't played with it yet. But that would be off topic on this Amedee> list. Well, actually, I tweak the .procmailrc of the logs that need the spam filtering functionality. I haven't messed with /etc/procmailrc. Sorry that my message was ambiguous on that point. Skip From bishop at aeroprise.com Thu Nov 9 23:26:42 2006 From: bishop at aeroprise.com (Peter Bishop) Date: Thu, 9 Nov 2006 14:26:42 -0800 Subject: [Spambayes] OOPS! 
In-Reply-To: <17746.17780.587407.265909@montanaro.dyndns.org> Message-ID: My Outlook client, in my Spam Certain folder has a command labelled: Recover from Spam This command puts the mail back in the inbox, and trains that it is not spam (these actions are specified in the SpamBayes->SpamBayes Manager->Training->Incremental Training section of the SpamBayes manager. Peter Bishop Aeroprise, Inc. Take advantage of the Aeroprise Enterprise Discovery and Personalization System for both Smart Clients and standard browsers available only with the Aeroprise Mobile Gateway. -----Original Message----- From: spambayes-bounces at python.org [mailto:spambayes-bounces at python.org] On Behalf Of skip at pobox.com Sent: Wednesday, November 08, 2006 1:01 PM To: Peg Wiederholt Cc: spambayes at python.org Subject: Re: [Spambayes] OOPS! Peg> I sent something to Spam that I didn't want to. How do I get it Peg> back? Peg, I assume you're using the Outlook plugin. I think you can just open your spam mailbox, select the message and click the "not spam" button. Skip _______________________________________________ SpamBayes at python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html From steve at holdenweb.com Fri Nov 10 12:11:03 2006 From: steve at holdenweb.com (Steve Holden) Date: Fri, 10 Nov 2006 05:11:03 -0600 Subject: [Spambayes] 1.1a1 inordinately high memory usage Message-ID: <45545E47.9080103@holdenweb.com> Is this just me, or do recent SpamBayes releases use a *lot* of memory? I mean over quarter of a *gigabyte*? I downloaded what was then the current alpha because I started to suffer from the dreaded "DBRunRecoveryError". I downloaded a binary installer for 1.1a3 and that suffered the same problem after a couple of mails got processed (I threw away the previous training database and gritted my teeth for a retrain). So I somehow ended up with a source install that identifies itself as "spambayes-1.1a1". 
I'm running the local service under Windows XP and as I write the pythonservice.exe associated with Spambayes is idling at around 167 MB (though the Application Data directory occupies less than 7 MB). I just heard my disk thrashing and presumed this was Spambayes processing a regular mail collection. Sure enough I saw the process go up to 319 MB, and it's now hovering at 200 MB again. I am assuming this is something to do with recent additions to OCR the recent flood of stock offer spam. I like the way the spam has disappeared, but I'm not sure I can stand this price. I "only" have 1 GB of physical memory. Up until my Spambayes re-install this was enough! You may ask why I'm not running 1.1a3. Good question. When I install that using the binary installer and try to set my options for mail server and port number I see the following:
500 Server error
Traceback (most recent call last):
  File "spambayes\Dibbler.pyc", line 476, in found_terminator
  File "spambayes\UserInterface.pyc", line 884, in onChangeopts
  File "spambayes\ProxyUI.pyc", line 782, in reReadOptions
  File "sb_server.pyc", line 1007, in _recreateState
  File "sb_server.pyc", line 1022, in prepare
  File "sb_server.pyc", line 822, in prepare
  File "sb_server.pyc", line 889, in createWorkers
  File "spambayes\Stats.pyc", line 60, in __init__
AttributeError: 'NoneType' object has no attribute 'get_statistics_start_date'
So that's a bit of a non-starter for me. I'd appreciate Cc'd replies about this if it's not too much trouble. I thought the list would have been full of this but I didn't see anything in the archive.
regards Steve -- Steve Holden +44 150 684 7255 +1 800 494 3119 Holden Web LLC/Ltd http://www.holdenweb.com Skype: holdenweb http://holdenweb.blogspot.com Recent Ramblings http://del.icio.us/steve.holden From tameyer at ihug.co.nz Fri Nov 10 23:43:39 2006 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat, 11 Nov 2006 11:43:39 +1300 Subject: [Spambayes] 1.1a1 inordinately high memory usage In-Reply-To: <45545E47.9080103@holdenweb.com> References: <45545E47.9080103@holdenweb.com> Message-ID: <7F077C8B-8155-43D0-ADDB-EFBEC4C5544B@ihug.co.nz> > Is this just me, or do recent SpamBayes releases use a *lot* of > memory? > [...] There was a bug with sb_server in 1.1a1, which means that it shouldn't be used. It was fixed in 1.1a2. > I am assuming this is something to do with recent additions to OCR the > recent flood of stock offer spam. If you haven't enabled the x-crack-images option, you're not doing any OCR. IAC, 1.1a1 doesn't have any of this (it predated the image spam issue). > You may ask why I'm not running 1.1a3. Indeed :) > Good question. When I install > that using the binary installer and try to set my options for mail > server and port number I see the following: > > 500 Server error > > Traceback (most recent call last): > File "spambayes\Dibbler.pyc", line 476, in found_terminator > File "spambayes\UserInterface.pyc", line 884, in onChangeopts > File "spambayes\ProxyUI.pyc", line 782, in reReadOptions > File "sb_server.pyc", line 1007, in _recreateState > File "sb_server.pyc", line 1022, in prepare > File "sb_server.pyc", line 822, in prepare > File "sb_server.pyc", line 889, in createWorkers > File "spambayes\Stats.pyc", line 60, in __init__ > AttributeError: 'NoneType' object has no attribute > 'get_statistics_start_date' > > So that's a bit of a non-starter for me. If you open a bug report and assign it to anadelonbrin, I'll try and find time to fix this. 
For the meantime, you can open the bayescustomize.ini file in a text editor and change the server & port number there. =Tony.Meyer From kotto at xtra.co.nz Fri Nov 10 22:27:13 2006 From: kotto at xtra.co.nz (Kerry Otto) Date: Sat, 11 Nov 2006 10:27:13 +1300 Subject: [Spambayes] Reeor messageiunder MS outlook Message-ID: <000001c7050e$fcfdf010$0202fea9@KERRY> Your product was recommended on a Talk back program with Newstalk ZB based out of Auckland New Zealand. I was getting a lot of spam (penny stocks, adult products, prescriptions, situation vacant our of Greece etc. I have loaded the product but have a problem when I first turn on the computor and access MS Outlook for emails with the error message as per attached. It seems to get through when started in safe mode which is the option but it takes a lot longer to load. I have not sent Microsoft an error report. Can you help? Kind Regards Kerry Otto kotto at xtra.co.nz Mob: 0275 868 615 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061111/12499a59/attachment-0001.html -------------- next part -------------- A non-text attachment was scrubbed... Name: Error Message1.doc Type: application/msword Size: 31232 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061111/12499a59/attachment-0002.doc -------------- next part -------------- A non-text attachment was scrubbed... Name: Error Message.doc Type: application/msword Size: 31744 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061111/12499a59/attachment-0003.doc From rwoelfel at wi.rr.com Fri Nov 10 22:50:17 2006 From: rwoelfel at wi.rr.com (Ronald Woelfel) Date: Fri, 10 Nov 2006 15:50:17 -0600 Subject: [Spambayes] Info Message-ID: How do I go about finding and seeing what addresses are in my spam list? How do I remove it? 
Ron From steve at holdenweb.com Sat Nov 11 01:05:18 2006 From: steve at holdenweb.com (Steve Holden) Date: Fri, 10 Nov 2006 18:05:18 -0600 Subject: [Spambayes] 1.1a1 inordinately high memory usage In-Reply-To: <7F077C8B-8155-43D0-ADDB-EFBEC4C5544B@ihug.co.nz> References: <45545E47.9080103@holdenweb.com> <7F077C8B-8155-43D0-ADDB-EFBEC4C5544B@ihug.co.nz> Message-ID: <455513BE.8060607@holdenweb.com> Tony Meyer wrote: >> Is this just me, or do recent SpamBayes releases use a *lot* of memory? >> [...] > > There was a bug with sb_server in 1.1a1, which means that it shouldn't > be used. It was fixed in 1.1a2. > [...] >> File "spambayes\Stats.pyc", line 60, in __init__ >> AttributeError: 'NoneType' object has no attribute >> 'get_statistics_start_date' >> >> So that's a bit of a non-starter for me. > > If you open a bug report and assign it to anadelonbrin, I'll try and > find time to fix this. For the meantime, you can open the > bayescustomize.ini file in a text editor and change the server & port > number there. > Bug report submitted and assigned as requested. Thanks for your helpful response. regards Steve -- Steve Holden +44 150 684 7255 +1 800 494 3119 Holden Web LLC/Ltd http://www.holdenweb.com Skype: holdenweb http://holdenweb.blogspot.com Recent Ramblings http://del.icio.us/steve.holden From dave at boost-consulting.com Sat Nov 11 02:26:08 2006 From: dave at boost-consulting.com (David Abrahams) Date: Fri, 10 Nov 2006 20:26:08 -0500 Subject: [Spambayes] Same message keeps getting through my filter Message-ID: <87r6wahiz3.fsf@pereiro.luannocracy.com> An embedded message was scrubbed... 
From: "Maureen Gallagher" Subject: hi info Date: Fri, 10 Nov 2006 23:20:18 +0180 Size: 1110 Url: http://mail.python.org/pipermail/spambayes/attachments/20061110/1fa59356/attachment.mht From dave at boost-consulting.com Sat Nov 11 02:27:45 2006 From: dave at boost-consulting.com (David Abrahams) Date: Fri, 10 Nov 2006 20:27:45 -0500 Subject: [Spambayes] server side spambayes References: <45531C9E.22FE.00C6.0@cdenv.be> <39923.82.146.101.169.1163073611.squirrel@amedee.be> Message-ID: <87lkmihiwe.fsf@pereiro.luannocracy.com> "Amedee Van Gasse" writes: > I am using spambayes for almost a year now, and I'm very happy with it. > Most spam gets caught (>95%), and no false positives after initial > training. > > Currently I am using spambayes as a procmail filter, in ~/.procmailrc. > For training, I have a few imap folders: ham, unsure, spam, confirmed-ham, > confirmed spam. On average, 1 or 2 messages per day arrive in one of the > first 3 folders. I move them to one of the confirmed folders. > (This setup is described somewhere on the wiki) > A nightly cron job then trains on the confirmed folders. > > This works great, but it's client-side filtering so it only works for me. > Recently my spouse showed interest in spambayes because the spam filtering > that our isp does is - honestly - crap. Too many false positives and too > many getting trough. > I'm also hosting a few (low traffic) mailing lists on my server, for > friends and for a few associations I'm in. > > Because of this, I am interested in using spambayes at the server level. I'm doing something that sounds almost exactly like what you're doing, except on the server side. I'm using IMAP, which allows me to maintain the folders on the server. 
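
As a rough illustration of the ham/unsure/spam split that both setups sort mail into, the classification step can be sketched as a threshold check on the spam score. This is only a sketch under stated assumptions: `folder_for_score` is a hypothetical helper, not a SpamBayes function, the 0.20 and 0.90 cutoffs are assumed to match the usual defaults, and the handling of scores that land exactly on a cutoff is a guess.

```python
def folder_for_score(score, ham_cutoff=0.20, spam_cutoff=0.90):
    """Map a spam score in [0, 1] to one of three folder names.

    Hypothetical helper for illustration only. The cutoff defaults are
    assumptions; use whatever values the local configuration sets.
    """
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must be between 0 and 1")
    if score < ham_cutoff:
        return "ham"
    if score > spam_cutoff:
        return "spam"
    # Everything between the two cutoffs is left for a human to review.
    return "unsure"


if __name__ == "__main__":
    for s in (0.05, 0.50, 0.99):
        print(s, folder_for_score(s))
```

Raising `ham_cutoff` or lowering `spam_cutoff` narrows the unsure band, at the cost of more mistakes in the other two folders.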
-- Dave Abrahams Boost Consulting www.boost-consulting.com From tameyer at ihug.co.nz Sat Nov 11 02:24:10 2006 From: tameyer at ihug.co.nz (Tony Meyer) Date: Sat, 11 Nov 2006 14:24:10 +1300 Subject: [Spambayes] Reeor messageiunder MS outlook In-Reply-To: <000001c7050e$fcfdf010$0202fea9@KERRY> References: <000001c7050e$fcfdf010$0202fea9@KERRY> Message-ID: > I have loaded the product but have a problem when I first turn on > the computor and access MS Outlook for emails with the error > message as per attached. [Generic Windows crash dialog screenshot attached] This is probably a problem with DEP. IIRC, you can disable DEP on a per-program (Outlook, in this case) basis via the System Control Panel. =Tony.Meyer -- Please always include the list (spambayes at python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. http://www.massey.ac.nz/~tameyer/writing/reply_all.html explains this. From skip at pobox.com Sat Nov 11 05:43:52 2006 From: skip at pobox.com (skip at pobox.com) Date: Fri, 10 Nov 2006 22:43:52 -0600 Subject: [Spambayes] Same message keeps getting through my filter In-Reply-To: <87r6wahiz3.fsf@pereiro.luannocracy.com> References: <87r6wahiz3.fsf@pereiro.luannocracy.com> Message-ID: <17749.21768.580158.589519@montanaro.dyndns.org> David> Today I updated my spambayes from CVS and started training it David> afresh. I've been getting hundreds of copies of the enclosed David> message, and despite having put several in my spam training David> folder, they keep getting classified as ham or unsure. David> Is this a bug, or my misconception, or something I'm doing wrong? David> If a bug, can I do anything to help you fix it? Hard to tell. You didn't post the generated clues/tokens. Is there any chance you mistakenly trained it as a ham once or twice? 
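
To show why one or two stray ham trainings could matter, here is a minimal sketch of how a per-token spam probability can be estimated from training counts, with a Robinson-style pull toward a neutral prior when evidence is thin. The function name, the example counts, and the 0.45 / 0.5 constants are assumptions chosen for illustration; they are not read from anyone's database or configuration.

```python
def token_spam_probability(spam_count, ham_count, n_spam, n_ham,
                           strength=0.45, unknown_prob=0.5):
    """Estimate how spammy a token is from training counts.

    spam_count / ham_count: trained spam / ham messages containing the token.
    n_spam / n_ham: total trained spam / ham messages.
    strength, unknown_prob: assumed prior weight and neutral probability.
    """
    spam_ratio = spam_count / max(n_spam, 1)
    ham_ratio = ham_count / max(n_ham, 1)
    if spam_ratio + ham_ratio == 0:
        # Token never seen in training: fall back to the neutral prior.
        return unknown_prob
    raw = spam_ratio / (spam_ratio + ham_ratio)
    n = spam_count + ham_count
    # Blend the raw estimate with the prior; more sightings mean less blending.
    return (strength * unknown_prob + n * raw) / (strength + n)


if __name__ == "__main__":
    # Token seen only in spam: a strong spam clue.
    print(token_spam_probability(3, 0, n_spam=10, n_ham=10))
    # Same token after two copies were also trained as ham: close to neutral.
    print(token_spam_probability(3, 2, n_spam=10, n_ham=10))
```

With these made-up counts, the token goes from a strong spam clue to a nearly neutral one once the two ham trainings are added, which is the kind of effect that would keep a message in unsure.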
Skip From amedee at amedee.be Sat Nov 11 11:50:26 2006 From: amedee at amedee.be (Amedee Van Gasse) Date: Sat, 11 Nov 2006 11:50:26 +0100 Subject: [Spambayes] Info In-Reply-To: References: Message-ID: <1163242226.5918.12.camel@localhost> On Friday 10-11-2006 at 18:24 [timezone -0600], Ronald Woelfel wrote: > I am missing an email that I should be getting. I want to check, if possible, the > spambayes list to see if it is blocking the email. From your mail headers I learned that you are using Outlook as your MUA. Are you using the spambayes outlook plugin? In that case, spambayes does not delete any spam. It only moves spam to a designated spam folder. So check this spam folder. If you are still missing an email, the problem lies somewhere else, not with spambayes. > Are you in Belgium? Yes. And you, from Germany? I presume you are _not_ the Ronald Woelfel that has written books about SuSE and is involved against software patents? -- Amedee Van Gasse amedee at amedee.be From amedee at amedee.be Sat Nov 11 11:56:27 2006 From: amedee at amedee.be (Amedee Van Gasse) Date: Sat, 11 Nov 2006 11:56:27 +0100 Subject: [Spambayes] Same message keeps getting through my filter In-Reply-To: <87r6wahiz3.fsf@pereiro.luannocracy.com> References: <87r6wahiz3.fsf@pereiro.luannocracy.com> Message-ID: <1163242587.5918.17.camel@localhost> On Friday 10-11-2006 at 20:26 [timezone -0500], David Abrahams wrote: > Today I updated my spambayes from CVS and started training it afresh. > I've been getting hundreds of copies of the enclosed message, and > despite having put several in my spam training folder, they keep > getting classified as ham or unsure. > > Is this a bug, or my misconception, or something I'm doing wrong? > > If a bug, can I do anything to help you fix it? > > Thanks I have seen exactly the same message too, classified as unsure. And I didn't purge my database, it is trained with a few hundred messages.
So if even I get them as unsure, I can imagine that someone with a new database will see even more of them. I think it is a new kind of spam, sent by botnets, and spambayes is still learning about them. I hope... -- Amedee Van Gasse amedee at amedee.be From yiannis at mavrikios.fr Sat Nov 11 19:03:41 2006 From: yiannis at mavrikios.fr (Yiannis MAVRIKIOS) Date: Sat, 11 Nov 2006 19:03:41 +0100 Subject: [Spambayes] outlook add-in stopped functionning. Imossible de re-install it Message-ID: <000001c705bb$b9481280$0b01a8c0@passion2> Hello, I have trouble with the spambayes add-in for Outlook. Suddenly it stopped working (version 1.03). Clicking on the "recover from spam", "delete as spam", etc. buttons no longer had any effect. The automatic filtering to the JUNK and JUNK SUSPECTED folders stopped working as well. I tried to uninstall it (using the Windows procedure) and re-install it (version 1.04), but with no success. I deleted the spambayes Outlook buttons (right click on them and then delete). Then I tried to uninstall and re-install again (restarting Windows in between each time) with no result. In the Windows control panel spambayes seems to be installed normally. Under Tools/Options/Others/AdvancedOptions spambayes appears (without being checked) but I can neither check it nor install it ... In Outlook, there are no spambayes menus. Can anybody help, or do I need to reinstall Outlook altogether ...!? Thanks in advance Yiannis -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061111/a4bbe999/attachment.html From ariejan at prettel.nl Sun Nov 12 09:41:17 2006 From: ariejan at prettel.nl (Arie-Jan) Date: Sun, 12 Nov 2006 09:41:17 +0100 Subject: [Spambayes] spambayes Message-ID: <002901c70636$5292c6e0$0301a8c0@workgroup> Hello, I downloaded SpamBayes, but the configuration for me is far to difficult. I am from the Nederlands so i hardly can read the gide.
It is to technical and my experians with computer is not so big. Plaes make a translation in dutch. Urlier i downloaded Spamfighter. It was more easy to work with because ist is full automatic. Greatings Arie-Jan -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061112/79a28ced/attachment.html From amedee at amedee.be Sun Nov 12 11:05:11 2006 From: amedee at amedee.be (Amedee Van Gasse) Date: Sun, 12 Nov 2006 11:05:11 +0100 Subject: [Spambayes] spambayes In-Reply-To: <002901c70636$5292c6e0$0301a8c0@workgroup> References: <002901c70636$5292c6e0$0301a8c0@workgroup> Message-ID: <1163325911.9967.24.camel@localhost> On Sunday 12-11-2006 at 09:41 [timezone +0100], Arie-Jan wrote: > Hello, > I downloaded SpamBayes, but the configuration for me is far to > difficult. I am from the Nederlands so i hardly can read the gide. It > is to technical and my experians with computer is not so big. Plaes > make a translation in dutch. > Urlier i downloaded Spamfighter. It was more easy to work with because > ist is full automatic. > > Greatings Arie-Jan
*****
For those of you who don't understand Dutch:
1. I asked Arie-Jan what mail program he uses, I guess OE.
2. I explained that afaik none of the devs speak Dutch.
3. I mangled the relevant FAQ entry through one of the online translators.
4. I asked him to clean up the translation, and send the result back to the devs. I'm sure one of you will find a nice spot for it somewhere on the website.
*****
Hello Arie-Jan,
Strange, a Dutchman who doesn't understand English, when most of your compatriots pride themselves on exactly that. :) You neglected to say which mail program you use, so now I have to guess. Based on your email headers I suspect it is Outlook Express. But you could also make it easy for me and simply tell me.
You ask for a Dutch translation, but as far as I know none of the programmers speaks Dutch. And it is all volunteer work. This Fleming is not a programmer, but I am not a professional translator either. Besides, I don't use Outlook Express; I don't even use Windows!
If you want a quick but not very accurate translation of a web page, you can use the popular Babelfish: http://babelfish.altavista.com/ Babelfish appears to be offline, but if you search Google for "online translation", the second result is Worldlingo: http://www.worldlingo.com/en/websites/url_translator.html
What you need is question 4.21 of the FAQ. Machine-translated, it becomes:
----------SNIP----------
4.21 How do I set up SpamBayes and Outlook Express?
1. Download the latest version of SpamBayes from the download page.
2. Run the installer. You want "sb_server", not the Outlook plug-in.
3. Open Outlook Express.
4. From the "Tools" menu, choose "Accounts".
5. Select your mail account and click "Properties".
6. Click the "Servers" tab.
7. Make a note of what you currently have in the "Incoming mail server" entry (the name of your POP3 server).
8. Change the "Incoming mail server" from the name of your POP3 server to "localhost" (no quotes).
9. Click "OK".
10. Click "Close".
11. Open the SpamBayes configuration page at http://localhost:8880/config (you can do this via the tray application, or just type that address into your browser).
12. Enter the name of your POP3 server in the top box ("Remote Servers"). This is what you wrote down in step 7.
13. Enter "110" (no quotes) in the second box ("SpamBayes Ports").
14. Tick the "unsure" and "spam" boxes in the "Notate To" option, about halfway down the page.
15. Click the "Save configuration" button at the end of the page.
Everything should now be set up. Try doing a send/receive: mail should arrive as normal, but any mail that SpamBayes is unsure about will have 'unsure,' (1.0.x) or 'unsure at spambayes.invalid' (1.1.x) in the recipient list, and any mail that SpamBayes thinks is spam will have 'spam,' (1.0.x) or 'spam at spambayes.invalid' (1.1.x) in the recipient list. You can use the Outlook Express Rules Wizard to create rules that automatically move these messages to other folders, for example:
1. Create two new folders in Outlook Express, one called "Possible Junk" and one called "Junk Mail".
2. Do Tools->Message Rules->Mail.
3. Click "New" to create a new rule (I think that if there are no existing rules it will do this for you).
4. Click "Where the To line contains people" in the top box, and "Move it to the specified folder" in the second box.
5. Click the blue "contains people" in the third box and type "unsure" (no quotes), then click "Add", then click "OK".
6. Click the blue "specified folder" in the third box and find and select the "Possible Junk" folder that you made in step 1, then click "OK".
7. In the fourth box, name the rule "SpamBayes unsure".
8. Repeat steps 3 through 8, but type "spam" rather than "unsure", choose the "Junk Mail" folder, and name the rule "SpamBayes spam".
9. Click OK.
Mail will now be split between your Inbox, the Possible Junk folder, and the Junk Mail folder, depending on how it was classified. You do training by double-clicking the envelope icon and filling in the review page that opens.
Note that there is a flaw in this method: if you get mail from someone who has "unsure" or "spam" in their email address, those messages will also be moved. (The problem occurs because Outlook Express is so limited in the filtering it can do.) There is a way to work around this, so ask the mailing list if it is a problem. When the final 1.1 release is made, it will avoid this problem (instead of just "unsure", it adds "unsure at spambayes.invalid").
If you have any more questions, please look through the rest of this FAQ, and if you can't find the answer, ask the mailing list.
----------SNIP----------
This is a typical machine translation: in Dutch it sounds fairly ridiculous, but you already get a rough idea of what it is about. If I may give you some advice: take the time to turn this text into proper Dutch, and send the result to the developers. All in the spirit of "volunteer work". They will certainly be grateful to you.
Kind regards, -- Amedee Van Gasse amedee at amedee.be From krxj at wilsonadvertising.co.nz Sun Nov 12 13:19:12 2006 From: krxj at wilsonadvertising.co.nz (Leonora Sanders) Date: Sun, 12 Nov 2006 15:19:12 +0300 Subject: [Spambayes] size street Message-ID: An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061112/1af1bd99/attachment.html -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 6223 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061112/1af1bd99/attachment.gif From jeff at ezsolution.com Sun Nov 12 21:34:05 2006 From: jeff at ezsolution.com (Jeff) Date: Sun, 12 Nov 2006 15:34:05 -0500 Subject: [Spambayes] outlook add-in stopped functionning. Imossible de re-install In-Reply-To: Message-ID: I am still having the same problems as Yiannis. Uninstalled Spambayes, Firefox 2.0, Zone Alarm and then reinstalled Spambayes and I cannot get it to work.
Jeff From rjs826 at sbcglobal.net Sun Nov 12 23:15:30 2006 From: rjs826 at sbcglobal.net (Ron Shuck) Date: Sun, 12 Nov 2006 14:15:30 -0800 (PST) Subject: [Spambayes] Out of Office AutoReply: [Spam?] Mail Delivery (failure asaracevic@sfchronicle.com) Message-ID: <20061112221530.68789.qmail@web81202.mail.mud.yahoo.com> On articles about various companies it would be helpful if the stock price could be included in quotes or similar bracketing right after their name. Several other reading sources do this. It makes it easier to reference the stock in your mind and to category it for possible investments. Thank you. Ron Shuck Sonoma, CA. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061112/9b1e6394/attachment.htm From peter.dolukhanov at blueyonder.co.uk Mon Nov 13 16:26:27 2006 From: peter.dolukhanov at blueyonder.co.uk (Peter Dolukhanov) Date: Mon, 13 Nov 2006 15:26:27 -0000 Subject: [Spambayes] Outlook Problems Message-ID: <008801c70738$1bc8dca0$4001a8c0@soulfuric> Hi, I am having issues whilst using SpamBayes Add-In with Outlook 2003. SpamBayes stopped responding to Spam emails entirely, and the buttons on the toolbar did nothing. To remedy this I uninstalled SpamBayes and launched Outlook, the Toolbar however remained. So, I deleted that from the Toolbar menu, checked the Add-Ins menu and SpamBayes was removed. I reinstalled the plugin, now the plugin has started working but the Toolbar has not reappeared. This basically means that I cannot "Delete as Spam" and "Recover from Spam". I have tried creating a new Toolbar, but I cannot find a location where the commands can be added from. Is there any way of reinstalling the Toolbar? Thanks, Pete -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/spambayes/attachments/20061113/a4dbcd79/attachment.html From wewantourfreedomback at knology.net Mon Nov 13 16:57:59 2006 From: wewantourfreedomback at knology.net (WE WANT OUR FREEDOM BACK) Date: Mon, 13 Nov 2006 10:57:59 -0500 Subject: [Spambayes] STOP SCHOOL VIOLENCE, PROTECT OUR KIDS, STOP GAY MARRIAGE, STOPP ILLEGAL IMMIGRATION Message-ID: <20061113162546.AF9471E4018@bag.python.org> NOW THERES SOMETHING YOU CAN DO www.wewantourfreedomback.com From jyppro at free.fr Mon Nov 13 18:13:24 2006 From: jyppro at free.fr (jyppro at free.fr) Date: Mon, 13 Nov 2006 18:13:24 +0100 Subject: [Spambayes] Outlook 2003 Solution Message-ID: <1163438004.4558a7b4c3598@imp1-g19.free.fr> All problems are in outlook : Outlook has disabled Spambayes and keep information in memory : Menu: ?/About Microsoft Office Outlook/disactivated elements/ you must activate spambayes Restart outlook Tools/Options/Others/Advanced Options/COM complements to activate if necessary From kef at wideopenwest.com Mon Nov 13 17:36:57 2006 From: kef at wideopenwest.com (Ken Fulks) Date: Mon, 13 Nov 2006 10:36:57 -0600 Subject: [Spambayes] Outlook Warning Message Message-ID: <004c01c70741$efdfaab0$6501a8c0@asus3200> I have installed Spambayes on several computers in our church office to eliminate a significant amount of spam that they receive. Most users are very happy with the program and its effectiveness, and the fact that they do not need to do anything for it to work. However, on only one computer, an Outlook warning pops up every time they create a new email message. The warning states that "Another program is trying to access the Outlook address book. Do you want to allow this access?" The user can then select whether to allow the access for anywhere from 1-10 minutes. This does not happen on the other 4 computers using Spambayes with Outlook. 
All of the computers are similar model Dell Dimension desktops with Pentium 4 processors, Windows XP Pro with SP2 and all updates. All use Office 2002 and the Outlook version that comes with that. They are connected in a peer to peer network and email service is provided by Yahoo mail, which is why they get tons of spam. (I am not in charge of which service we use, just a volunteer network support person.) The Windows Knowledge base recommends using the Outlook Add-In Manager to disable the particular add-in. (Tools/Options/Other/General-advanced Options/Add-In Manager) However, then Spambayes does not work. Do you have any suggestions on how to eliminate this warning message whenever a new email is created in Outlook? Thanks, Ken PS. I learned about Spambayes from Maximum PC magazine. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061113/870dfe67/attachment.html From amedee at amedee.be Mon Nov 13 18:32:14 2006 From: amedee at amedee.be (Amedee Van Gasse) Date: Mon, 13 Nov 2006 18:32:14 +0100 Subject: [Spambayes] Outlook Warning Message In-Reply-To: <004c01c70741$efdfaab0$6501a8c0@asus3200> References: <004c01c70741$efdfaab0$6501a8c0@asus3200> Message-ID: <1163439134.11582.1.camel@localhost> On Monday 13-11-2006 at 10:36 [timezone -0600], Ken Fulks wrote: > I have installed Spambayes on several computers in our church office > to eliminate a significant amount of spam that they receive. Most > users are very happy with the program and its effectiveness, and the > fact that they do not need to do anything for it to work. > > However, on only one computer, an Outlook warning pops up every time > they create a new email message. The warning states that "Another > program is trying to access the Outlook address book. Do you want to > allow this access?" The user can then select whether to allow the > access for anywhere from 1-10 minutes.
This does not happen on the > other 4 computers using Spambayes with Outlook. > > All of the computers are similar model Dell Dimension desktops with > Pentium 4 processors, Windows XP Pro with SP2 and all updates. All use > Office 2002 and the Outlook version that comes with that. They are > connected in a peer to peer network and email service is provided by > Yahoo mail, which is why they get tons of spam. (I am not in charge of > which service we use, just a volunteer network support person.) > > The Windows Knowledge base recommends using the Outlook Add-In Manager > to disable the particular add-in. > (Tools/Options/Other/General-advanced Options/Add-In Manager) > However, then Spambayes does not work. > > Do you have any suggestions on how to eliminate this warning message > whenever a new email is created in Outlook? > > Thanks, > > Ken > > PS. I learned about Spambayes from Maximum PC magazine. Hello Ken, I am not one of the Spambayes developers, just a user like you. But I am quite sure Spambayes does not need the Outlook Address Book. I am quite certain there is another application doing this. I suggest scanning for spyware. Greets, Amedee Van Gasse amedee at amedee.be From balint.varga at vargabor.hu Mon Nov 13 17:52:46 2006 From: balint.varga at vargabor.hu (=?iso-8859-2?Q?Varga_B=E1lint?=) Date: Mon, 13 Nov 2006 17:52:46 +0100 Subject: [Spambayes] Translation to Hungarian Message-ID: <000b01c70744$27f61e00$87dafea9@vba> Hello Developers, I've been using SpamBayes for one year, and I'm very happy with it. In my experience this is the strongest spam filter I've ever tried, congratulations. If you need somebody to translate the English language file to Hungarian, I would be happy to help! Best regards, B?lint Varga Varga Winery Ltd. www.vargabor.hu Tel.: +36-70/3828140 Fax.: +36-1/424-0799 E-mail: balint.varga at vargabor.hu Skype: balintvarga -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/spambayes/attachments/20061113/a558439b/attachment.htm From tadowguy at gmail.com Mon Nov 13 19:47:28 2006 From: tadowguy at gmail.com (Matt Fischer) Date: Mon, 13 Nov 2006 11:47:28 -0700 Subject: [Spambayes] cutoff settings Message-ID: <47e5681d0611131047k34a4e190jcf0fe3802dcb9a55@mail.gmail.com> What are the default ham/spam cutoff settings? (And where are they?) I want to change my cut-offs so that I have fewer Unsure and more Spam, as I get 10-20 Unsures per day and 99.999999% are Spam. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061113/0cd89d90/attachment.html From jeff at ezsolution.com Mon Nov 13 19:55:35 2006 From: jeff at ezsolution.com (Jeff) Date: Mon, 13 Nov 2006 13:55:35 -0500 Subject: [Spambayes] outlook add-in stopped functionning. Imossible de re-install In-Reply-To: <1163438772.4558aab4813fc@imp1-g19.free.fr> Message-ID: -----Original Message----- Ok, I reactivated the addin, restarted and checked the Advanced Options/COM is active. It now appears that Spambayes is filtering email since it's going into the Junk Suspect folder, but I don't have the move to spam or recover from spam buttons I used to have. Install them? Jeff >From: jyppro at free.fr [mailto:jyppro at free.fr] >Sent: Monday, November 13, 2006 12:26 PM >To: Jeff >Subject: Re: [Spambayes] outlook add-in stopped functionning. Imossible de re-install > > >According to Jeff: > >> I am still having the same problems as Yiannis. Uninstalled Spambayes, >> Firefox 2.0, Zone Alarm and then reinstalled Spambayes and I cannot >> get it to work. 
>> >> Jeff >> >> _______________________________________________ >> SpamBayes at python.org http://mail.python.org/mailman/listinfo/spambayes >> Check the FAQ before asking: http://spambayes.sf.net/faq.html >> > > >The problem is in Outlook: > >Outlook has disabled Spambayes and remembers that setting. > >Menu: > >?/About Microsoft Office Outlook/Disabled Items: re-enable Spambayes there, then restart Outlook. > >If necessary, also activate it under Tools/Options/Other/Advanced Options/COM Add-Ins. From hadfield at tradestar.net Tue Nov 14 14:40:53 2006 From: hadfield at tradestar.net (John Hadfield) Date: Tue, 14 Nov 2006 13:40:53 -0000 Subject: [Spambayes] Spam-bayes not working Message-ID: <008301c707f2$818efbe0$472a8356@default> Hello I installed the Spam-Bayes software on my computer. I can find two files: About Spam-Bayes and Troubleshooting guide. I can't find any other files after completing the installation. There is no toolbar or anything else in Outlook Express. Should I de-install and re-install again to see if I can get it to work? Best regards John Hadfield TRADESTAR UK LTD 6 Scott Close, Bicester, Oxfordshire Tel: +44 01869 240 560 Fax +44 01869 240 916 email: hadfield at tradestar.net www.tradestar.net -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061114/dcb3d73a/attachment.html From rf at tampabay.rr.com Tue Nov 14 15:10:25 2006 From: rf at tampabay.rr.com (RR - RF) Date: Tue, 14 Nov 2006 09:10:25 -0500 Subject: [Spambayes] Stopped working Message-ID: <000001c707f6$a42cb0d0$c901a8c0@thehood> Hello I have been using Spambayes for some time now with MS Outlook 2003 (11.8010.8107) SP2 without any problems. My OS is Windows XP Pro with SP2. Spambayes version 1.0.4 I recently had some adware slip into the computer which I had cleaned out. When checking my mail I tried to use Spambayes and it did not work. 
I checked to see if it was still installed as an add-in with Outlook and at least it was still on the toolbar but checking the Tools> options> other> advanced options (under general)> Com add-ins, I noticed that Spambayes was unchecked . I placed the check mark in the box, clicked OK, closed and restarted Outlook and it still did not work. I checked following the above and the box was unchecked again. I tried this a couple of times and still it did not work. I proceeded to reinstall Spambayes and it still did not work. I uninstalled Spambayes and reinstalled it and still the same results. I had the tool bar for Spambayes but it still is not working. The Add-in Manager does not show Spambayes but the Com Add-Ins does show the program there but still unchecked. I repeated the above a few times with no results. Again, the program has been working great for some time now but it has just stopped working. I have had other hits here with adware on the computer before and it did not affect Outlook or Spambayes. There are no traces of any BHOs or adware in the computer at this time. Is it possible that something in the registry is causing Spambayes to not operate? If so, how do I fix this situation? Spambayes seems to be the only program / add-in that works with Outlook and I would like to get it working again before my folders explode with the recent evasion of spam with word lists in them to try to get around the filters. These word lists in the subjects and bodies of the emails seem to be working for the spammers lately, but your program is catching about 80 percent of the junk. Any help would greatly be appreciated. Sincerely, RF -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/spambayes/attachments/20061114/f65bbf62/attachment.html From jsp at PKC.com Tue Nov 14 15:11:21 2006 From: jsp at PKC.com (Jesse Pelton) Date: Tue, 14 Nov 2006 09:11:21 -0500 Subject: [Spambayes] Spam-bayes not working In-Reply-To: <008301c707f2$818efbe0$472a8356@default> Message-ID: <16E2027582CDB74180896CDB4B8CC1F901D04B85@PKCVT01.pkc.com> It sounds like you may not have completed the manual steps required to set up SpamBayes with Outlook Express. See http://spambayes.sourceforge.net/faq.html#does-spambayes-work-with-outlook-express for a brief explanation of why this is necessary, and http://spambayes.sourceforge.net/faq.html#how-do-i-set-up-spambayes-and-outlook-express for the steps. ________________________________ From: spambayes-bounces at python.org [mailto:spambayes-bounces at python.org] On Behalf Of John Hadfield Sent: Tuesday, November 14, 2006 8:41 AM To: spambayes at python.org Subject: [Spambayes] Spam-bayes not working Hello I installed the Spam-Bayes software on my computer. I can find two files: About Spam-Bayes and Troubleshooting guide. I can't find any other files after completing the installation. There is no toolbar or anything else in Outlook Express. Should I de-install and re-install again to see if I can get it to work? Best regards John Hadfield TRADESTAR UK LTD 6 Scott Close, Bicester, Oxfordshire Tel: +44 01869 240 560 Fax +44 01869 240 916 email: hadfield at tradestar.net www.tradestar.net -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/spambayes/attachments/20061114/2980f60e/attachment.htm From bishop at aeroprise.com Tue Nov 14 19:03:11 2006 From: bishop at aeroprise.com (Peter Bishop) Date: Tue, 14 Nov 2006 10:03:11 -0800 Subject: [Spambayes] cutoff settings In-Reply-To: <47e5681d0611131047k34a4e190jcf0fe3802dcb9a55@mail.gmail.com> Message-ID: The whole question of how to set the filtering parameters for Certain Spam and Possible Spam is interesting. I believe that the real trade-off is that SpamBayes needs a certain amount of training. Also, we are living in an environment in which the generators of Spam are trying to get through SpamBayes (among other filters) with some success. The good news is that SpamBayes automatically adapts to new attempts to get through it, as long as you keep training it on the new Spam (and any time it can't tell that a real email is real). The whole point is to make the Certain Spam folder really be CERTAIN. This way you only need to look at it in a cursory manner in order to determine that it really is certain. The Possible Spam folder is really used to identify which emails are sufficiently questionable that SpamBayes needs further training. Even so, having a Possible Spam folder that holds 90-99% spam is still a lot more productive than having this number of emails in your regular inbox, because you are in "spam-detection" mode when looking at the Possible Spam folder rather than being in "email-reading" mode as when you look at your inbox. Thus the default parameter for Possible Spam is 15% and the default parameter for Certain Spam is 90%. You can play with these, but you need a significant window between these two in order to get enough emails to allow SpamBayes to adapt to changing spam attacks. I found it was not difficult to get most of my good emails to have very low spam scores, so a very low number on Possible Spam is good. The best way to learn how to set these values is to display the spam scores in Outlook. 
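A minimal sketch of the two-threshold scheme described above, assuming scores are percentages and using the 15% and 90% defaults quoted in this message. The function and parameter names are illustrative only (they are not SpamBayes' actual option names), and treating a score exactly on a threshold as belonging to the higher bucket is also an assumption.

```python
def classify(score, unsure_threshold=15.0, spam_threshold=90.0):
    """Map a spam score (0-100) to a folder using two cutoffs.

    Hypothetical helper: only the default values come from the message;
    the names and the inclusive comparisons are assumptions.
    """
    if score >= spam_threshold:
        return "Certain Spam"
    if score >= unsure_threshold:
        return "Possible Spam"
    return "Inbox"


if __name__ == "__main__":
    # Show where a few sample scores would be filed.
    for s in (3.0, 50.0, 95.0):
        print(f"{s:5.1f}% -> {classify(s)}")
```

Raising unsure_threshold or lowering spam_threshold narrows the Possible Spam window, which is the trade-off the message warns about.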
You can add the spam score column to your outlook display of your inbox and your possible spam folder, and your certain spam folder. This way you can quickly assess how to set these parameters to minimize the possibility of getting a good email into Certain Spam and getting spam in your inbox. Don't try to minimize the number of spams in the Possible Spam folder, just keep the amount of spam here to a reasonably large percentage of total spam so SpamBayes will be trained on new spam attack methods. Peter Bishop Aeroprise, Inc. Take advantage of the Aeroprise Enterprise Discovery and Personalization System for both Smart Clients and standard browsers available only with the Aeroprise Mobile Gateway. _____ On Behalf Of Matt Fischer Subject: [Spambayes] cutoff settings What are the default ham/spam cutoff settings? (And where are they?) I want to change my cut-offs so that I have less Unsure and more Spam, as I get 10-20 Unsures per day and 99.999999% are Spam. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061114/ec496b48/attachment.html From bishop at aeroprise.com Tue Nov 14 19:11:48 2006 From: bishop at aeroprise.com (Peter Bishop) Date: Tue, 14 Nov 2006 10:11:48 -0800 Subject: [Spambayes] Outlook Warning Message In-Reply-To: <1163439134.11582.1.camel@localhost> Message-ID: My guess is a firewall or firewall-like software on this one computer. There should be a setting in this code to allow the new email setup with SpamBayes to work without comment. Peter Bishop Aeroprise, Inc. Take advantage of the Aeroprise Enterprise Discovery and Personalization System for both Smart Clients and standard browsers available only with the Aeroprise Mobile Gateway. 
-----Original Message----- Sent: Monday, November 13, 2006 9:32 AM On Monday 13-11-2006 at 10:36 [timezone -0600], Ken Fulks wrote: > However, on only one computer, an Outlook warning pops up every time they create a new email message. The warning states that "Another program is trying to access the Outlook address book. Do you want to allow this access?" The user can then select whether to allow the access for anywhere from 1-10 minutes. This does not happen on the other 4 computers using Spambayes with Outlook. From psturley at softbyte2000.co.uk Wed Nov 15 12:03:02 2006 From: psturley at softbyte2000.co.uk (Pete Sturley) Date: Wed, 15 Nov 2006 11:03:02 -0000 Subject: [Spambayes] SpamBayes Possible Bug Message-ID: <002a01c708a5$9e717450$ab7ba8c0@Dell8300> Hi Sorry but I can't make any sense of your bug reporting, having created an account etc. so I'm resorting to this method. Since installing SpamBayes 1.0.4 on my Outlook 2003 SP2 system, I am now getting errors whenever I open an appointment in the calendar. I'm not sure if it's anything to do with you but it's a bit of a coincidence that it's started happening since installing your add-on. I hope you can help as I really like SpamBayes and it seems to be working very well. Kind regards Pete Sturley This is what I get: Visual Studio Just-In-Time Debugger An unhandled exception ("Object expected") occurred in OUTLOOK.EXE [184] or An unhandled exception ("'h' is not an object") occurred in OUTLOOK.EXE [184] List of debuggers follows. Do you want to debug using the selected debugger? Click No. This repeats numerous times until it eventually continues. 
Here is my SpamBayes log:
Loaded bayes database from 'C:\Documents and Settings\Pete Sturley\Application Data\SpamBayes\default_bayes_database.db'
Loaded message database from 'C:\Documents and Settings\Pete Sturley\Application Data\SpamBayes\default_message_database.db'
Bayes database initialized with 48 spam and 1636 good messages
SpamBayes Outlook Addin Binary Version 1.0.4 (March 2005) starting (with engine SpamBayes Engine Version 0.3 (January 2004))
on Windows 5.1.2600 (Service Pack 2)
using Python 2.3.5 (#62, Feb 8 2005, 16:23:02) [MSC v.1200 32 bit (Intel)]
SpamBayes: Watching (for filtering) in 'Personal Folders/Inbox'
SpamBayes: Watching (for incremental training) in 'Personal Folders/Junk E-mail'
Processing missed spam in folder 'Inbox' by starting a timer
FAILED to add the toolbar item 'SpamBayesCommand.Manager' - (-2147352567, 'Exception occurred.', (0, None, None, None, 0, -2147467259), None)
The above toolbar message is common - recreating the toolbar...
Message 'New George Michael Double CD - down to GBP 6.99 delivered' in 'Personal Folders/Inbox' had a Spam classification of 'Yes'
Message '[#XDV-478188]: Web Server Down' in 'Personal Folders/Inbox' had a Spam classification of 'No'
Message 'inalienable VGLV' in 'Personal Folders/Inbox' had a Spam classification of 'No'
Message 'all-important LADF' in 'Personal Folders/Inbox' had a Spam classification of 'No'
Moving and spam training message 'all-important LADF' - Training on message 'all-important LADF' in 'Personal Folders/Inbox - trained as spam
Moving and spam training message 'inalienable VGLV' - Training on message 'inalienable VGLV' in 'Personal Folders/Inbox - trained as spam
Message 'Stop Living Check-to-Check' in 'Personal Folders/Inbox' had a Spam classification of 'Unsure'
Message '[#XDV-478188]: Web Server Down' in 'Personal Folders/Inbox' had a Spam classification of 'No'
Moving and spam training message 'Stop Living Check-to-Check' - Training on message 'Stop Living Check-to-Check' in 'Personal Folders/Junk Suspects - trained as spam
Message 'RE: MBYB Images on Home Page' in 'Personal Folders/Inbox' had a Spam classification of 'No'
Message '[#XDV-478188]: Web Server Down' in 'Personal Folders/Inbox' had a Spam classification of 'No'
Message 'RE: MBYB Images on Home Page' in 'Personal Folders/Inbox' had a Spam classification of 'No'
Message '[#XDV-478188]: Web Server Down' in 'Personal Folders/Inbox' had a Spam classification of 'No'
Message 'Re: [#XDV-478188]: Web Server Down' in 'Personal Folders/Inbox' had a Spam classification of 'No'
Message 'RE:Application Tracker Request from UKPA website [#3171694]' in 'Personal Folders/Inbox' had a Spam classification of 'No'
Message 'RE: Toast' in 'Personal Folders/Inbox' had a Spam classification of 'No'
-------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061115/5f494135/attachment.html From webmaster at freeantispam.org Thu Nov 16 04:42:24 2006 From: webmaster at freeantispam.org (Martin) Date: Wed, 15 Nov 2006 22:42:24 -0500 Subject: [Spambayes] new button for spambayes Message-ID: <455BDE20.9040608@freeantispam.org> Hi, Hopefully this is the right way to contact members of the Spambayes project. I thought I'd contribute a bit and make a small button (in the RSS or XML style you see nowadays) for spambayes, since you don't seem to have any of those, with an improvised logo (well, a simplified version considering the space). I tried to use the colors found on the spambayes web site. I hope you'll use it! If you like it, we could consider making a bigger logo and button. Bye! -- Martin Free Anti Spam http://www.freeantispam.org -------------- next part -------------- A non-text attachment was scrubbed... 
Name: spambayes-small.png Type: image/png Size: 275 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061115/d684a171/attachment.png From amedee at amedee.be Thu Nov 16 11:58:05 2006 From: amedee at amedee.be (Amedee Van Gasse) Date: Thu, 16 Nov 2006 11:58:05 +0100 (CET) Subject: [Spambayes] new button for spambayes In-Reply-To: <455BDE20.9040608@freeantispam.org> References: <455BDE20.9040608@freeantispam.org> Message-ID: <39359.82.146.101.169.1163674685.squirrel@amedee.be> On Thu, November 16, 2006 4:42, Martin said: > Hi, > > Hopefully this is the right way to contact members of the Spambayes > project. > > I thought I'd contribute a bit and make a some small button (in the RSS > or XML style you see nowadays) for spambayes, since you don't seem to > have any of those, with an improvised logo (well, a simplified version > considering the space). I tried to use the colors found on the spambayes > web site. I hope you'll use it! > > If you like it, we could consider making a bigger logo and button. > > Bye! That is _SO_ cool! Really. -- Amedee Van Gasse From tsiegenthaler at mdm-architects.com Thu Nov 16 19:50:39 2006 From: tsiegenthaler at mdm-architects.com (Tad Siegenthaler) Date: Thu, 16 Nov 2006 10:50:39 -0800 Subject: [Spambayes] SpamBayes error message in Outlook Message-ID: <000f01c709b0$1c589550$6d01a8c0@TAD> Hello, I have SpamBayes installed on my boss's computer, and he gets an error message in Outlook 2003 when he first clicks on the Inbox folder on our IMAP account: Do you know a fix for this? Thank you! P.S. - great product! Tad Siegenthaler Maple Dell + McClelland Architects, LLP 380 Stevens Avenue, Suite 308 Solana Beach, CA 92075 P 858.755.5848 F 858.755.5850 tsiegenthaler at mdm-architects.com -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/spambayes/attachments/20061116/b861517f/attachment.html -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 18921 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061116/b861517f/attachment.jpe From ian at ninprodata.com Tue Nov 14 02:11:26 2006 From: ian at ninprodata.com (Ian Colvard) Date: Mon, 13 Nov 2006 17:11:26 -0800 Subject: [Spambayes] Outlook plug-in problem Message-ID: <772A7AA6EEDD48479A8B9B82A43F60C1127503@MAIL01.ninpro.local> Hello I have had Spambayes for a few years and we use it at work. We had the exchange server die and when we got a new one we had to retrain everyone's filters. All went well until I got to one computer. The training wasn't working because it said the program wasn't installed. So I uninstalled the program and then downloaded the newest version and installed it, but when I open Outlook 2003 it doesn't show up at all. I don't know how to make it show up inside Outlook. Any input would be appreciated. Ian Colvard -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061113/7e4f0b30/attachment.html From compubds at rogers.com Thu Nov 16 23:23:20 2006 From: compubds at rogers.com (Bruce Sherwin) Date: Thu, 16 Nov 2006 17:23:20 -0500 (EST) Subject: [Spambayes] I don't totally understand how to use your SPAMBAYES Message-ID: <20061116222320.90710.qmail@web88208.mail.re2.yahoo.com> I want to use it with my Outlook Express. I don't have a clue of how to use it effectively. Thank you for your help! Bruce Sherwin & Family compubds at rogers.com -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061116/5b15fa15/attachment.html From ammodump at theammodump.com Sat Nov 18 02:24:56 2006 From: ammodump at theammodump.com (Mr. A.J. 
O'Neill) Date: Sat, 18 Nov 2006 14:24:56 +1300 Subject: [Spambayes] Suggestion Message-ID: <005401c70ab0$5be53ba0$3201a8c0@hood> More years ago than I care to remember I did a Masters thesis on incorporating time dependent query terms in search queries used for searching "News" feeds. Part of the thesis involved implementing a test system. One of the many steps involved in the processing was the removal (or ignoring of) punctuation before searching for search tokens. I draw your attention to the following extract from a Spam Clues report
'beneficiary' 0.844828 0 1
'beneficiary.' 0.844828 0 1
I would argue that there is no difference between these two tokens and that the inclusion of the punctuation adds nothing to the process but in this instance is likely to give the token a lower score than may be appropriate. I further draw your attention to the following extracts from the same Spam Clues report:
'+31633775038' 0.844828 0 1
'30%' 0.844828 0 1
'65%to' 0.844828 0 1
'7.5.430' 0.867197 4 2
'17/11/2006' 0.909938 1 2
'268.14.7/537' 0.909938 1 2
'5:56' 0.909938 1 2
While strings of numbers such as TCP/IP addresses may be useful in differentiating spam from ham, generally numbers, digits and amounts for currency are not good choices for tokens. In particular the above date '17/11/2006' and time '5:56' tokens can normally be considered to be random and are unlikely to be of any use in classifying spam/ham. I also used a stop list of words which are so common that they are useless to index or use in search engines or other search indexes. Below are a number of instances of words which I believe are not appropriate tokens to use to differentiate between spam and ham emails.
'under' 0.814607 3 1
'its' 0.862812 1 1
'us.' 0.862812 1 1
'our' 0.611666 16 2
'when' 0.637817 7 1
'that' 0.664752 19 3
'all' 0.674394 12 2
'around' 0.739628 4 1
'it,' 0.848794 1 1
'up,' 0.848794 1 1
'p.m.' 0.813589 7 2
'does' 0.814607 3 1
Generally I find the current version of SpamBayes to be a very useful tool but I would like the ability to permanently set the value of a token i.e. I'd like to be able to set the token 'pharmacy' to value 1.0 to ensure that all emails containing it are classified as spam; likewise I'd like to classify certain terms as having value 0.0 so that they are always classified as ham. Keep up the good work and I hope that my suggestions are worthwhile. Regards A.J. O'Neill M. App. Sc. M.B. Computing Grad. Dip. K.B.S. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061118/cdcd82e2/attachment.htm From shawn at 12pointdesign.com Sat Nov 18 14:18:49 2006 From: shawn at 12pointdesign.com (Shawn K. Hall) Date: Sat, 18 Nov 2006 05:18:49 -0800 Subject: [Spambayes] Suggestion In-Reply-To: <005401c70ab0$5be53ba0$3201a8c0@hood> Message-ID: <0cff01c70b14$162bf720$1e00a8c0@Yoda> Hi AJ, > 'beneficiary' 0.844828 0 1 > 'beneficiary.' 0.844828 0 1 > I would argue that there is no difference between these two > tokens and that the inclusion of the punctuation adds nothing > to the process but in this instance is likely to give the > token a lower score than may be appropriate. I disagree. In this particular case the distinction may not be clear, but, in some instances, especially when the spammer is a non-native English speaker, the near-random placement of punctuation in inappropriate locations can correctly identify spam. > While strings of numbers such as TCP/IP addresses may be > useful in differentiating spam from ham, generally numbers, > digits and amounts for currency are not good choices for > tokens. In particular the above date '17/11/2006' and time > '5:56' tokens can normally be considered to be random and are > unlikely to be of any use in classifying spam/ham. Yet again, tokenization for exclusion is not about "normalcy"; it's about distinction. 
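For reference, the 0.844828 reported for every token seen in 0 ham and 1 spam message is what a Robinson-style adjusted probability yields with a prior strength of 0.45 and a prior probability of 0.5. Those two values are, to the best of my knowledge, the SpamBayes defaults, but treat them, the function name, and the made-up training totals below as assumptions rather than the project's actual code. The totals cancel out whenever the ham count is zero.

```python
def adjusted_spamprob(hamcount, spamcount, nham, nspam, s=0.45, x=0.5):
    """Robinson-style adjusted spam probability for one token.

    hamcount/spamcount: messages of each kind containing the token.
    nham/nspam: total trained messages of each kind (hypothetical here).
    s, x: prior strength and prior probability (assumed defaults).
    """
    hamratio = hamcount / nham
    spamratio = spamcount / nspam
    p = spamratio / (hamratio + spamratio)  # raw estimate from counts
    n = hamcount + spamcount                # how much evidence we have
    return (s * x + n * p) / (s + n)        # shrink toward the prior


if __name__ == "__main__":
    nham, nspam = 1000, 1000  # arbitrary totals for illustration
    separate = adjusted_spamprob(0, 1, nham, nspam)
    merged = adjusted_spamprob(0, 2, nham, nspam)
    print(round(separate, 6))  # 0.844828
    print(round(merged, 6))    # 0.908163
```

Under these assumptions, folding 'beneficiary' and 'beneficiary.' into one token would raise the score from about 0.845 to about 0.908, which is the effect A.J. describes; the counter-argument in this reply is that the punctuated form can itself be the distinguishing evidence.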
Any token distinctive to a spam or ham message may help in determining a future message's value. For example, most chain letters have similar origins or have crossed certain paths that will leave timestamp tracks within the body of the message and in the header. Some spam munges date values to use tell-tale values (trust me, you'll know it when you see it), and quite often 419 spam will have distinct values within dollar figures, dates and times presented which SpamBayes can use to aid in the identification of unwanted email. Also, while all of these values taken individually may seem silly, they are NOT treated as singular processing values within the SpamBayes filtering system, but are joined with others to build a composite of all tokens - completely whitelisting tokens or token structures that /really do/ appear more frequently in spam would only serve to hinder SpamBayes' effectiveness. > I also used a stop list of words which are so common that > they are useless to index or use in search engines or other > search indexes. Below are a number of instances of words > which I believe are not appropriate tokens to use to > differentiate between spam and ham emails... It appears that your concern is primarily with the shorter strings. There is a method of increasing the minimum token length - IIRC the process was detailed on this list about 3 months ago. You might consider taking those steps if you want to eliminate the shorter tokens altogether. > ...I would like the ability to permanently set the value of a > token i.e. I'd like to be able to set the token 'pharmacy' to > value 1.0 to ensure that all emails containing it are > classified as spam; likewise I'd like to classify certain terms > as having value 0.0 so that they are always classified as ham. I think strict "whitelist" and "blacklist" functionality would be quite useful, too. It might even help reduce my dependency on additional applications for spam processing. Regards, Shawn K. 
Hall http://12PointDesign.com/ From adrian.robson at nepsweb.co.uk Sun Nov 19 02:45:56 2006 From: adrian.robson at nepsweb.co.uk (Adrian P Robson) Date: Sun, 19 Nov 2006 01:45:56 -0000 Subject: [Spambayes] Spambayes broken Message-ID: <000701c70b7c$7a6729e0$0400a8c0@adrian1> I have been using spambayes with Outlook Express, for a month or so with some success. I then decided to compare it against SpamExplorer. Unimpressed by this product I went back to Spambayes, but I cannot get it to work again. I get the following when I try to "review messages". I have uninstalled spambayes and reinstalled without success. Please help - SpamBayes is good and I want it back. ---------------------------------- 500 Server error Traceback (most recent call last): File "spambayes\Dibbler.pyc", line 470, in found_terminator File "spambayes\ProxyUI.pyc", line 548, in onReview KeyError: '{***HAM***}' --------------------------- From gregandariel at verizon.net Sun Nov 19 13:55:14 2006 From: gregandariel at verizon.net (greg and ariel) Date: Sun, 19 Nov 2006 07:55:14 -0500 Subject: [Spambayes] troubleshoot Message-ID: <000001c70bda$0cfcf930$a0f3fea9@D2X39K71> Sorry to bug you. I've read the troubleshooting file but don't understand the instructions. I am running XP home and outlook 2002. There are two users and only one user seems to have Spambayes. I understand I need to execute the command: "c:\Program Files\SpamBayes\bin\outlook_addin_register.exe" hkey_local_machine I just don't know how to do it. Where and how do I do this? Thanks for your help. I love this program. ariel -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061119/989bf79a/attachment.html From spmbn at hotmail.com Sun Nov 19 15:31:21 2006 From: spmbn at hotmail.com (Mark Peterson) Date: Sun, 19 Nov 2006 14:31:21 -0000 Subject: [Spambayes] Debug mode? 
Message-ID: Spambayes is sticking on large messages, and I want to submit a bug report. It fails without error, however, and there's no corresponding log file in my temp directory (the only ones there are from much earlier times of the day). Is there a debug mode which will give a full, verbose output for me to post with the bug details? Too late now, as I've managed to get this message by turning Spambayes off, but need to know for next time. Thanks From svnaras at imap.cs.utexas.edu Mon Nov 20 08:05:35 2006 From: svnaras at imap.cs.utexas.edu (svnaras at imap.cs.utexas.edu) Date: Mon, 20 Nov 2006 01:05:35 -0600 Subject: [Spambayes] Request for a trained spam filter Message-ID: <1164006335.456153bfc7fe9@webmailapp6.cc.utexas.edu> Hi, I'm a Masters student at the Dept of CS at UT, Austin. I'm doing a project related to spam generation and I need a well trained spam filter for the same. I downloaded the training set available online and trained the spambayes filter using it. But the accuracy that I got on a set of new spam messages was not too great. Is there some way of improving the accuracy? Since my project involves substantial experimentation I do not want to spend too much time in training the filter. So is it possible to get a trained spam filter from you? I'd really appreciate any help from you in this regard. Thanks, Sudheendra From mpas1342 at yahoo.de Mon Nov 20 15:50:19 2006 From: mpas1342 at yahoo.de (mpas mpas) Date: Mon, 20 Nov 2006 15:50:19 +0100 (CET) Subject: [Spambayes] spambayes pliugin for lotus notes Message-ID: <20061120145019.18247.qmail@web23408.mail.ird.yahoo.com> Is there a way to use SpamBayes with the Lotus Notes email program, or is Outlook the only software that can be used with SpamBayes on Windows systems? Thanks -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/spambayes/attachments/20061120/aa705777/attachment.htm From skip at pobox.com Mon Nov 20 16:58:40 2006 From: skip at pobox.com (skip at pobox.com) Date: Mon, 20 Nov 2006 09:58:40 -0600 Subject: [Spambayes] Request for a trained spam filter In-Reply-To: <1164006335.456153bfc7fe9@webmailapp6.cc.utexas.edu> References: <1164006335.456153bfc7fe9@webmailapp6.cc.utexas.edu> Message-ID: <17761.53424.836600.893686@montanaro.dyndns.org> Sudheendra> I'm a Masters student at the Dept of CS at UT, Austin. I'm Sudheendra> doing a project related to spam generation and I need a well Sudheendra> trained spam filter for the same. I downloaded the training Sudheendra> set available online and trained the spambayes filter using Sudheendra> it. But the accuracy that I got on a set of new spam Sudheendra> messages was not too great. Is there some way of improving Sudheendra> the accuracy? The qualitative nature of spam changes frequently. If you trained SpamBayes on a data set that was generated more than a couple months ago that dataset isn't going to reflect many of the characteristics found in spam today. What ham did you use? Sudheendra> Since my project involves substantial experimentation I do Sudheendra> not want to spend too much time in training the filter. So Sudheendra> is it possible to get a trained spam filter from you? Not really. A trained filter is created using a mix of good and bad email. For obvious privacy reasons very few people are willing to expose information about the email they receive. Start training from scratch and train on the mistakes SpamBayes makes on the mail you receive. After a handful of emails it should do a pretty good job properly filtering your email. Also, use the latest version (best to check out what's in the CVS repository) and use this page on the SpamBayes wiki http://www.entrian.com/sbwiki/TryOutThePreRelease to guide your installation and setup. 
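The "train on the mistakes" advice above can be sketched roughly as follows. ToyFilter is a deliberately crude stand-in invented for this illustration; it is not the SpamBayes API or its scoring method, and the 0.15/0.90 cutoffs (on a 0 to 1 scale) and the choice to count "unsure" results as mistakes are assumptions.

```python
from collections import Counter


class ToyFilter:
    """Hypothetical token-counting filter, used only to show the loop."""

    def __init__(self):
        self.spam = Counter()
        self.ham = Counter()

    def train(self, text, is_spam):
        # Count each distinct token once per message.
        target = self.spam if is_spam else self.ham
        target.update(set(text.lower().split()))

    def score(self, text):
        # Average, over known tokens, of the fraction of sightings in spam.
        votes = []
        for tok in set(text.lower().split()):
            s, h = self.spam[tok], self.ham[tok]
            if s + h:
                votes.append(s / (s + h))
        return sum(votes) / len(votes) if votes else 0.5  # 0.5 = no idea


def train_on_mistakes(filt, stream, ham_cutoff=0.15, spam_cutoff=0.90):
    """Train only on messages the filter got wrong or was unsure about."""
    trained = 0
    for text, is_spam in stream:
        score = filt.score(text)
        correct = score >= spam_cutoff if is_spam else score < ham_cutoff
        if not correct:
            filt.train(text, is_spam)
            trained += 1
    return trained


if __name__ == "__main__":
    mail = [
        ("cheap pills now", True),
        ("meeting at noon", False),
        ("cheap pills today", True),
        ("lunch meeting tomorrow", False),
    ]
    print(train_on_mistakes(ToyFilter(), mail))  # 2
```

Only the first two messages trigger training here; the later two are already classified correctly, which is why the training set stays small with this approach.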
Skip From jgriffis at thepluscos.com Mon Nov 20 17:54:01 2006 From: jgriffis at thepluscos.com (Julie Griffis) Date: Mon, 20 Nov 2006 11:54:01 -0500 Subject: [Spambayes] deleted junk email file Message-ID: How do I retrieve the junk e-mail folder in my inbox, I accidentally deleted it? -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061120/eb764343/attachment.htm From bishop at aeroprise.com Mon Nov 20 21:02:59 2006 From: bishop at aeroprise.com (Peter Bishop) Date: Mon, 20 Nov 2006 12:02:59 -0800 Subject: [Spambayes] troubleshoot In-Reply-To: <000001c70bda$0cfcf930$a0f3fea9@D2X39K71> Message-ID: Ariel, Just bring up your Command Prompt application (Should be somewhere in Accessories) Then just type in the command, and that will execute it. In the bad old days, all programs were invoked in this manner. It is easier for programmers to provide unusual miscellaneous functions packaged this way, so there are a few SpamBayes things (especially in configuration) that need to be dealt with this way. Improving the GUI on these things is just a matter of a little more programmer time. Anyone want to help out? Peter Bishop Aeroprise, Inc. Take advantage of the Aeroprise Enterprise Discovery and Personalization System for both Smart Clients and standard browsers available only with the Aeroprise Mobile Gateway. _____ From: spambayes-bounces at python.org [mailto:spambayes-bounces at python.org] On Behalf Of greg and ariel Sent: Sunday, November 19, 2006 4:55 AM To: spambayes at python.org Subject: [Spambayes] troubleshoot Sorry to bug you. I've read the troubleshooting file but don't understand the instructions. I am running XP home and outlook 2002. There are two users and only one user seems to have Spambayes. I understand I need to execute the command: "c:\Program Files\SpamBayes\bin\outlook_addin_register.exe" hkey_local_machine I just don't know how to do it. 
Where and how do I do this? Thanks for your help. I love this program. ariel -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061120/133304cd/attachment.html From mtavener at iname.com Mon Nov 20 22:39:53 2006 From: mtavener at iname.com (Mike Tavener) Date: Mon, 20 Nov 2006 21:39:53 -0000 Subject: [Spambayes] Lost addresses Message-ID: <01f601c70cec$6bd7d2f0$ef0010ac@carrera> Hello I use Windows XP and Outlook Express. I installed SpamBayes v1 from the cover DVD of PC Pro Iss 143. I then followed steps 1 and 2 under 'Configuration' in the Readme file which opens when installation is complete, but clicking on Save at the bottom of the Configuration page brought up details of a "500 Server error". I reversed the changes I had made, and then found a more comprehensive list of set-up instructions at FAQ 4.21, the first of which was to download the latest version of SpamBayes. I downloaded v1.1a3, but when I came to install it there was one file of which the installer was unable to replace the existing version. I therefore removed v1 before trying again to install v1.1a3. This time the installation proceeded to its end, but any attempt to send a message in Outlook Express now resulted in another "500 Server error" message (though with fewer lines than the last). I shan't bother you with the details of the two error messages because I have decided to remove SpamBayes from my computer and try a different product. But I would like your advice on this: all the entries in my Outlook Express address book have somehow been deleted and I cannot find any way to get them back. Has anyone else reported this problem? Or can you suggest a solution? Mike Tavener -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/spambayes/attachments/20061120/fbd4591a/attachment.htm From amedee at amedee.be Tue Nov 21 00:15:43 2006 From: amedee at amedee.be (Amedee Van Gasse) Date: Tue, 21 Nov 2006 00:15:43 +0100 Subject: [Spambayes] spambayes pliugin for lotus notes In-Reply-To: <20061120145019.18247.qmail@web23408.mail.ird.yahoo.com> References: <20061120145019.18247.qmail@web23408.mail.ird.yahoo.com> Message-ID: <1164064543.6628.0.camel@localhost> On Monday 20-11-2006 at 15:50 [timezone +0100], mpas mpas wrote: > is there a way to use spambayes in lotus notes email program or > outlook is the only software which could be used with spambayes on > windows systems ? > thanks http://spambayes.sourceforge.net/windows.html Bottom of the page: Non Outlook Solutions -- Amedee Van Gasse amedee at amedee.be From info at shakuhachiflutes.com Tue Nov 21 06:15:33 2006 From: info at shakuhachiflutes.com (Shakuhachi Flutes) Date: Tue, 21 Nov 2006 16:15:33 +1100 Subject: [Spambayes] white list Message-ID: <20061121051535.ZREZ15981.oaamta03sl.mx.bigpond.com@p4> Dear Sirs/ Madams, Thank you for your programme. This may be a dumb question but how do we link your programme (used with Outlook) to the Contact database? Our first "Spam" was from someone in our Contact list and I cannot see a way to form a "Safe Sender" list or "Whitelist" Thanks again. Sincerely, Andrew MacGregor PO Box 681, Lilydale Vic 3140 AUSTRALIA t/f +61-3-9739-7340 info at shakuhachiflutes.com www.shakuhachiflutes.com From amedee at amedee.be Tue Nov 21 08:52:53 2006 From: amedee at amedee.be (Amedee Van Gasse) Date: Tue, 21 Nov 2006 08:52:53 +0100 (CET) Subject: [Spambayes] white list In-Reply-To: <20061121051535.ZREZ15981.oaamta03sl.mx.bigpond.com@p4> References: <20061121051535.ZREZ15981.oaamta03sl.mx.bigpond.com@p4> Message-ID: <55056.82.146.101.169.1164095573.squirrel@amedee.be> On Tue, November 21, 2006 6:15, Shakuhachi Flutes said: > Dear Sirs/ Madams, > Thank you for your programme.
> This may be a dumb question but how do we link your programme (used with > Outlook) to the Contact database? > Our first "Spam" was from someone in our Contact list and I cannot see a > way > to form a "Safe Sender" list or "Whitelist" First, read this FAQ: http://spambayes.sourceforge.net/faq.html#id72 Second, have you tried training on that message with "recover from spam"? That usually works just as well as a "whitelist". Next time you get a message from that person, it won't be spam. Or at least the spam score will decrease. It helps if you show the spam score in Outlook. See http://www.entrian.com/sbwiki/OutlookPlugin -- Amedee Van Gasse From David.Seidman at dc.gov Tue Nov 21 19:19:54 2006 From: David.Seidman at dc.gov (Seidman, David (OCTO)) Date: Tue, 21 Nov 2006 13:19:54 -0500 Subject: [Spambayes] Telling SpamBayes a certain sender is OK Message-ID: <55B3FA88F3E9384CA28AA047179421350CBA31ED@sojsdc001.ojs.dcgov.priv> I get messages from Metro Alert on request, which I often delete after reading because there can be several of them when a subway line is experiencing trouble. I want to see them, but SpamBayes is convinced they are spam, even though I have never moved them into the Spam folder. How do I tell it that any message from Metro Alert should be left in the Inbox? David R. Seidman Program Management Office Office of the Chief Technology Officer District of Columbia Government (202) 727-1596 david.seidman at dc.gov -------------- next part -------------- An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/spambayes/attachments/20061121/84131084/attachment.htm From David.Seidman at dc.gov Tue Nov 21 20:07:14 2006 From: David.Seidman at dc.gov (Seidman, David (OCTO)) Date: Tue, 21 Nov 2006 14:07:14 -0500 Subject: [Spambayes] Telling SpamBayes a certain sender is OK Message-ID: <55B3FA88F3E9384CA28AA047179421350CBA31F0@sojsdc001.ojs.dcgov.priv> I get messages from Metro Alert on request, which I often delete after reading because there can be several of them when a subway line is experiencing trouble. I want to see them, but the SpamBayes Outlook plug-in is convinced they are spam, even though I have never moved them into the Spam folder. How do I tell it that any message from Metro Alert should be left in the Inbox? I am using the current version of SpamBayes, and I have attached the clues file for one of the Metro Alert messages that was rated at 99%. David R. Seidman Program Management Office Office of the Chief Technology Officer District of Columbia Government (202) 727-1596 david.seidman at dc.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061121/d6acf6e5/attachment.html -------------- next part -------------- A non-text attachment was scrubbed... 
Name: Spam Clues for Metro Alert.doc Type: application/msword Size: 37376 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061121/d6acf6e5/attachment-0001.doc From sethg at goodmanassociates.com Wed Nov 22 00:54:32 2006 From: sethg at goodmanassociates.com (Seth Goodman) Date: Tue, 21 Nov 2006 17:54:32 -0600 Subject: [Spambayes] Telling SpamBayes a certain sender is OK In-Reply-To: <55B3FA88F3E9384CA28AA047179421350CBA31F0@sojsdc001.ojs.dcgov.priv> Message-ID: Seidman, David (OCTO) wrote on Tuesday, November 21, 2006 1:07 PM -0500: > I get messages from Metro Alert on request, which I often delete > after reading because there can be several of them when a subway line > is experiencing trouble. I want to see them, but the SpamBayes > Outlook plug-in is convinced they are spam, even though I have never > moved them into the Spam folder. How do I tell it that any message > from Metro Alert should be left in the Inbox? I am using the current > version of SpamBayes, and I have attached the clues file for one of > the Metro Alert messages that was rated at 99%. The spam clues you sent are very helpful. First of all, it shows that you've trained lots of ham, but very little spam. # ham trained on: 6579 # spam trained on: 188 Though we don't know exactly why, Spambayes seems to have difficulty when the imbalance is this severe. Spambayes has a very good idea what you consider ham, but only a mild notion of what you consider spam. Look at the 'spamprob' values in the list of spam clues to see how the individual words (tokens) score. Spamprob=0.5 for a token means it is equally likely for a message containing that token to be spam or ham. Most of the words in the message were in this middle range, with no strong ham clues and only a few strong spam clues. In other words, this message is not typical for either ham or spam that you've trained. If you have more spam available, you could train Spambayes on that. 
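As a rough illustration of why such an imbalance can matter, here is a minimal sketch of how a per-token spamprob might be computed from training counts. The function name token_spamprob and the constants s=0.45 and x=0.5 are assumptions made for this example (they are believed to match the SpamBayes defaults for unknown-word strength and probability, but check your own configuration). Only the training totals 6579 and 188 come from the clues report discussed above; the token counts are invented.

```python
def token_spamprob(ham_count, spam_count, nham, nspam, s=0.45, x=0.5):
    """Estimate how spammy one token is, given training counts.

    ham_count / spam_count: messages of each kind containing the token.
    nham / nspam: total messages of each kind in the training set.
    s, x: assumed prior strength and prior probability for unseen tokens.
    """
    n = ham_count + spam_count
    if n == 0:
        return x  # never-seen token: fall back to the neutral prior
    # Each count is normalised by the size of its own training set.
    ham_ratio = ham_count / nham
    spam_ratio = spam_count / nspam
    raw = spam_ratio / (ham_ratio + spam_ratio)
    # Pull the raw estimate toward the prior when evidence is thin.
    return (s * x + n * raw) / (s + n)


# Hypothetical token seen in ten hams and one spam.
print("unbalanced:", round(token_spamprob(10, 1, nham=6579, nspam=188), 3))
print("balanced:  ", round(token_spamprob(10, 1, nham=1000, nspam=1000), 3))
```

Under these assumed settings the same token lands on the spam side of 0.5 in the unbalanced database and on the ham side in a balanced one, because a single spam occurrence out of 188 weighs far more than a single ham occurrence out of 6579. This is only one possible contributor to the behaviour described here, not a confirmed diagnosis.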
It's probably easier to retrain from scratch, and if you do it in small batches, you should get better results. While retraining, try to maintain a roughly comparable number of ham and spam in your training set. You can do this by training perhaps a dozen or two messages at a time, half ham and half spam. After training each group of messages, filter all your messages and select the ones that classify most incorrectly to train for the next round. When I do this, I start getting very good results at around 100 each of trained ham and spam. You don't need to train on thousands of messages, and it doesn't necessarily work better that way. -- Seth Goodman From benabusalami at 88.am Mon Nov 13 01:56:09 2006 From: benabusalami at 88.am (MR.BENJAMIN ABUSALAMI) Date: Mon, 13 Nov 2006 01:56:09 +0100 Subject: [Spambayes] FROM THE DESK OF:MR.BENJAMIN ABUSALAMI. Message-ID: FROM THE DESK OF:MR.BENJAMIN ABUSALAMI. DIRECTOR,INTERNATIONAL REMITTANCE FOREIGN OPERATIONS DEPT, UNION BANK OF NIGERIA PLC, LAGOS-NIGERIA. ATTENTION:SIR/MADAM YOUR INHERITANCE FUNDS OF $9.5M THIS IS TO NOTIFY YOU THAT YOUR OVER DUE INHERITANCE FUNDS HAS BEEN GAZZETED TO BE RELEASED, VIA KEY TELEX TRANSFER (KTT )-DIRECT WIRE TRANSFER TO YOU BY THE SENATE COMMITTEE FOR FOREIGN OVER DUE FUND TRANSFER. MEANWHILE,A WOMAN CAME TO MY OFFICE FEW DAYS AGO WITH A LETTER,CLAIMING TO BE YOUR TRUE REPRESENTATIVE. HERE ARE HER INFORMATIONS: NAME JANET WHITE BANK NAME: CITI BANK,YORK. ACCOUNT Number: 6503809428. PLEASE,DO RECONFIRM TO THIS OFFICE ,AS A MATTER OF URGENCY IF THIS WOMAN IS FROM YOU SO THAT THE FEDERAL GOVERNMENT WILL NOT BEHELD RESPONSIBLE FOR PAYING INTO THE WRONG ACCOUNT NAME. 
THE RESERVE BANK GOVERNOR,EXECUTIVE, BOARD OF DIRECTORS AND THE SENATE COMMITTEE FOR FOREIGN OVER DUE INHERITTANCE FUNDHAVE APPROVED AND ACCREDITED THIS REPUTABLE BANK WITH THE OFFICE OF THE DIRECTOR,INTERNATIONAL REMITTANCE / FOREIGN OPERATIONS,TO HANDLE AND TRANSFER ALL FOREIGN INHERITTANCE FUNDS THIS SECOND QUARTER PAYMENT OF THE YEAR. HOWEVER,WE SHALL PROCEED TO ISSUE ALL PAYMENTS DETAILS TO THE SAID MRS.WHITE,IF WE DO NOT HEAR FROM YOU WITHIN THE NEXT SEVEN WORKING DAYS FROM TODAY. CONGRATULATIONS IN ADVANCE. BEST REGARDS, MR.BENJAMIN ABUSALAMI. DIRECTOR, INTERNATIONAL REMITTANCE FOREIGN OPERATIONS DEPT, UNION BANK OF NIGERIA PLC, LAGOS-NIGERIA. ALL FOREIGN INHERITTANCE FUNDS CENTRAL BANK GOVERNOR BOARD OF DIRECTORS [FPD] ACCOUNTANT GENERAL OF NIGERIA From djvtxv at a1funding.com Wed Nov 22 09:25:38 2006 From: djvtxv at a1funding.com (Toby) Date: Wed, 22 Nov 2006 02:25:38 -0600 Subject: [Spambayes] famed plainclothes Message-ID: <001e01c70e10$d1e96cf0$b03a17d8@awgfd> But in those days we were terrified that Ralph Nader would win enough votes to put Oregon in the column on the right. I have heard that this concept is basically used for web base. As a result, these centers will serve as the foundation of a new, globally integrated model for delivering SOA-based services. Until last night, that is. In fantasy novels, this sort of stuff involves great concentration, or incantations, something. Coffee, that I require concentration, that I require instruction. Deadlocks prevent some transactions from continuing to work. Mullally, who runs the office of the United States Agency for International Development, or A. And at any lunch counter in New York City, or at Golden Olympic in Evanston, a buck twenty-five gets you as many mugs of coffee as you can drink before your bladder and kidneys fail permanently. We are using third party software which requires input as flat files. 
My family threw a party that night, with pizza and the big-screen TV in the basement set to CNN. It was heartbreaking to see the building returned to its original purpose, its silver walls and shining white roof and brand-new turf all ready for the game. I need to compare B with A. Here, curiously, we have turned the tables: It is Steyn, not Centanni and Wiig, who want us to know that the conversion is false. I have heard that this concept is basically used for web base. During the discussions that we were having with the trainer it was commented by the trainer that the next version coming would be something like . IBM will provide IT expertise to the new company, CMA CGM SYSTeMS, to help accelerate CMA CGM's performance in the global shipping industry. IBM will provide IT expertise to the new company, CMA CGM SYSTeMS, to help accelerate CMA CGM's performance in the global shipping industry. Seeing the German population grow much faster than the French population, this emasculation was confirmed for them by low birth-rate. The team was horrible, after all. But all terrorism is nihilism. My mom and I worked hard to make sure that Al Gore won Oregon that year, made phone calls and knocked on doors and handled mailers, the whole nine yards. But I scarcely know what to do with myself. It was heartbreaking to see the building returned to its original purpose, its silver walls and shining white roof and brand-new turf all ready for the game. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061122/dd12ec20/attachment.htm -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: image/gif Size: 20535 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061122/dd12ec20/attachment.gif From rcoe at CambridgeMA.GOV Wed Nov 22 13:30:28 2006 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Wed, 22 Nov 2006 07:30:28 -0500 Subject: [Spambayes] Telling SpamBayes a certain sender is OK Message-ID: <42393C9DA7930245AB540667607F4F5007F66583@SPIKE.city> Why don't you try recreating your database? Maybe the unusually high ham/spam disparity is causing a problem. Bob _____ From: spambayes-bounces at python.org [mailto:spambayes-bounces at python.org] On Behalf Of Seidman, David (OCTO) Sent: Tuesday, November 21, 2006 2:07 PM To: 'spambayes at python.org' Subject: [Spambayes] Telling SpamBayes a certain sender is OK I get messages from Metro Alert on request, which I often delete after reading because there can be several of them when a subway line is experiencing trouble. I want to see them, but the SpamBayes Outlook plug-in is convinced they are spam, even though I have never moved them into the Spam folder. How do I tell it that any message from Metro Alert should be left in the Inbox? I am using the current version of SpamBayes, and I have attached the clues file for one of the Metro Alert messages that was rated at 99%. David R. Seidman Program Management Office Office of the Chief Technology Officer District of Columbia Government (202) 727-1596 david.seidman at dc.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061122/1c95400d/attachment.htm From rcoe at CambridgeMA.GOV Wed Nov 22 13:30:28 2006 From: rcoe at CambridgeMA.GOV (Coe, Bob) Date: Wed, 22 Nov 2006 07:30:28 -0500 Subject: [Spambayes] Lost addresses Message-ID: <42393C9DA7930245AB540667607F4F5007F66584@SPIKE.city> Retrieve the address book contents from your backup DVDs. 
If OE works at all like standard Outlook, you can open the retrieved .pst file along with the "real" one and simply copy the entries back where they belong. Bob _____ From: spambayes-bounces at python.org [mailto:spambayes-bounces at python.org] On Behalf Of Mike Tavener Sent: Monday, November 20, 2006 4:40 PM To: spambayes at python.org Subject: [Spambayes] Lost addresses Hello I use Windows XP and Outlook Express. I installed SpamBayes v1 from the cover DVD of PC Pro Iss 143. I then followed steps 1 and 2 under 'Configuration' in the Readme file which opens when installation is complete, but clicking on Save at the bottom of the Configuration page brought up details of a "500 Server error". I reversed the changes I had made, and then found a more comprehensive list of set-up instructions at FAQ 4.21, the first of which was to download the latest version of SpamBayes. I downloaded v1.1a3, but when I came to install ir there was one file of which the installer was unable to replace the existing version. I therefore removed v1 before trying again to install v1.1a3. This time the installation proceeded to its end, but any attempt to send a message in Outlook Express now resulted in another "500 Server error" message (though with fewer lines than the last). I shan't bother you with the details of the two error messages because I have decided to remove SpamBayes from my computer and try a different product. But I would like your advice on this: all the entries in my Outlook Express address book have somehow been deleted and I cannot find any way to get them back. Has anyone else reported this problem? Or can you suggest a solution? Mike Tavener -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/spambayes/attachments/20061122/a1d95dbd/attachment.html From greetings at webmail.2000Greetings.com Wed Nov 22 16:42:36 2006 From: greetings at webmail.2000Greetings.com (2000Greetings.com) Date: Wed, 22 Nov 2006 16:42:36 +0100 (CET) Subject: [Spambayes] you have received a 2000Greetings Card... Message-ID: <20061122154236.7FACC833416@p15139065.pureserver.info> An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061122/28d20f3f/attachment.html From gh at coastalwineco.com Thu Nov 23 00:23:48 2006 From: gh at coastalwineco.com (Gerry Hark) Date: Wed, 22 Nov 2006 15:23:48 -0800 Subject: [Spambayes] icons Message-ID: <000001c70e8d$445ad6d0$6401a8c0@work661563afb0> Greetings, How do I load the icons for suspect and junk? Gerry -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061122/db0d8a79/attachment.html From hzamora at houston.rr.com Thu Nov 23 05:28:31 2006 From: hzamora at houston.rr.com (hector zamora) Date: Wed, 22 Nov 2006 22:28:31 -0600 Subject: [Spambayes] problem with reinstalling SPAMbayes Message-ID: <000301c70eb7$dab88bc0$0400a8c0@hectorxp> To whom it may concern, I downloaded spambayes in late 2004 and it worked great until I had to uninstall it (spambayes doesn't have the ability to uninstall itself, I'm thinking update) and reinstall the program. At this point spambayes still filtered, but I was unable to recover junk suspects using the tool bar buttons. I went through your troubleshooting step related to tool bar buttons not working and still no go. Please help me with this issue. -Hector -------------- next part -------------- An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/spambayes/attachments/20061122/e924738c/attachment.html From spambayes at spandex.nildram.co.uk Thu Nov 23 11:54:49 2006 From: spambayes at spandex.nildram.co.uk (Spandex) Date: Thu, 23 Nov 2006 10:54:49 +0000 Subject: [Spambayes] Problem with struct.unpack in oe_mailbox.py Message-ID: <1158014585.20061123105449@nildram.co.uk> I'm running spambayes (1.0.4-3) on Debian unstable with Python 2.4.4c0 and a custom compiled 2.6.17 kernel. I'm using an AMD64 chip. sb_server starts up ok and proxies pop3 and smtp connections ok. I can train from the commandline ok. The problem comes when I try to train it from the web interface (using either mbox or dbx format). It bombs with the following error:-
----------------
Traceback (most recent call last):
  File "/usr/lib/python2.4/site-packages/spambayes/Dibbler.py", line 470, in found_terminator
    getattr(plugin, name)(**params)
  File "/usr/lib/python2.4/site-packages/spambayes/UserInterface.py", line 494, in onTrain
    content = self._convertToMbox(content)
  File "/usr/lib/python2.4/site-packages/spambayes/UserInterface.py", line 536, in _convertToMbox
    content = oe_mailbox.convertToMbox(content)
  File "/usr/lib/python2.4/site-packages/spambayes/oe_mailbox.py", line 444, in convertToMbox
    if header.isValid() and header.isMessages():
  File "/usr/lib/python2.4/site-packages/spambayes/oe_mailbox.py", line 117, in isValid
    return self.getEntry(0) == dbxFileHeader.MAGIC_NUMBER
  File "/usr/lib/python2.4/site-packages/spambayes/oe_mailbox.py", line 126, in getEntry
    self.dbxBuffer[dbxEntry * 4:(dbxEntry * 4) + 4])[0]
error: unpack str size does not match format
----------------
I'm wondering whether this is something to do with my machine architecture and the sizes of datatypes? But I'm stabbing in the dark. I can easily disable dbx support by commenting out.. content = oe_mailbox.convertToMbox(content) ..
around line 536 of UserInterface.py, and this does enable me to train on mbox format via the web interface, but I'd rather keep dbx support if possible. Any ideas? Thanks, Matt From sethg at goodmanassociates.com Thu Nov 23 21:14:09 2006 From: sethg at goodmanassociates.com (Seth Goodman) Date: Thu, 23 Nov 2006 14:14:09 -0600 Subject: [Spambayes] Suggestion In-Reply-To: <005401c70ab0$5be53ba0$3201a8c0@hood> Message-ID: Mr. A.J. O'Neill wrote on Friday, November 17, 2006 7:25 PM -0500: > One of the many steps involved in the processing was the removal (or > ignoring of) punctuation before searching for search tokens. I draw > your attention to the following extract from a Spam Clues report > > 'beneficiary' 0.844828 0 1 > 'beneficiary.' 0.844828 0 1 > > I would argue that there is no difference between these two tokens > and that the inclusion of the punctuation adds nothing to the process > but in this instance is likely to give the token a lower score than > may be appropriate. This type of specific choice in the tokenizer resulted from testing in a number of people's working environments. It was shown to improve classification empirically. This suggests that the intuition behind your argument, which I originally shared as well, is not correct for the purpose of classifying email as ham/spam at the time this was tested. A lot of the small choices in Spambayes turn out to be the results of empirical testing rather than intuition, and it's surprising (non-intuitive) how often our intuition about our own language is incorrect. If you're looking for a reason to explain the empirical results, one possibility is that it provides differentiation based on grammar, as opposed to just word occurrence. This is something that you normally don't get with a tokenizer that only recognizes words and not sentence structure. > I also used a stop list of words which are so common that they are > useless to index or use in search engines or other search indexes. 
> Below are a number of instances of words which I believe are not > appropriate tokens to use to differentiate between spam and ham > emails. There is a clash between the philosophy of naive Bayesian classification and rule-based schemes. The idea behind rule-based schemes is that we can tap human beings' pattern recognition ability to create rules that we run in a computer. Since we can recognize spam easily when we see it, we are the best experts to consult when forming a rule set. The problem with this notion is that computers are not currently capable of creating inferences in the same way as people because the system architecture is so different. While people can indeed reliably distinguish spam, often from only a part of the message, they cannot reliably tell you how they made the decision. The aim of naive Bayesian classification is to avoid all the particular problems of trying to construct a useful rule set and instead look at simple statistical properties of language that do not require human-like inference. The underlying model is fundamentally different. A Bayesian classifier is not trying to emulate a speaker of natural language. The approach has strengths as well as weaknesses. One of the strengths is that you don't have to decide what words you think are the best or worst spam indicators. If you tend to favor rule-based approaches, this also looks like a huge weakness. The classifier learns word probabilities by observing your message classifications. To the extent that you are surprised by the spam probabilities of individual words, you would make the classifier worse by manually overriding the training results on a token-by-token basis. This happens far more often than you would think. Words that are about as likely to appear in spam as in ham score somewhere near 0.5 and do not contribute to the final score. Another of the strengths is that the word probabilities vary widely among different recipients.
It's a strength because there is no such thing as a ham word list that will reliably avoid Bayesian classifiers. That's also a weakness, if you wish to apply Bayesian methods on a server without tracking the word probabilities separately for each mailbox. What this suggests is that it is equally difficult to come up with a list of words that the classifier should ignore that would work for most users. There is a fundamental disagreement in the approaches of Bayesian and rule-based systems. Proponents of rule-based systems believe that people can best identify what clues are most significant, while proponents of Bayesian systems either believe that people cannot reliably identify the most important clues, or even if they can, they don't care to do so. The last condition is important if spam avoidance is simply a utilitarian goal, not a hobby. Personally, I tried rule-based systems first and then experimented with Spambayes. I found that my intuition on word probabilities was indeed wrong a significant proportion of the time and the naive Bayesian approach did about as well as my rule-based system when it was at its peak. The Bayesian approach required much less maintenance and it works well for a wide variety of end-users without requiring insight from them. I still feel there are very useful rules to help detect spam that are complementary to word frequency. These are things such as whether the message comes from a particular mailing list, whether the sending IP is on a DNS blacklist that I choose, or to which one of my mailbox addresses the message is addressed. My own compromise on this is to either put them in the domain MTA, or to write Outlook rules that run before the Bayesian classifier. In terms of overall system architecture, I tend to believe that the rule-based approaches belong in the domain MTA, whenever possible, and should generate rejections during the SMTP session, preferably before DATA.
This eliminates most of the spam at the lowest possible system cost and with the largest savings in bandwidth. You can eliminate another significant amount of spam by running rule-based content filters, such as SpamAssassin, in the MTA. This is very expensive, so it is important to run it on as few messages as possible. This generates rejections at the end of DATA, which are still useful for legitimate messages that are improperly classified. For the spam that slips through global rule-based systems, it then makes sense to do computationally intensive and user-specific content filtering like Spambayes in the MUA. The spam load is hopefully reduced enough that the end-user doesn't mind scanning the junk folder for the occasional false positive. -- Seth Goodman From LoneWolf at Montana.com Fri Nov 24 22:34:58 2006 From: LoneWolf at Montana.com (Ron Garland) Date: Fri, 24 Nov 2006 14:34:58 -0700 Subject: [Spambayes] Question Message-ID: <200611242135.kAOLZ6i2032558@jollyroger.montana.com> Hi, I just installed SpamBayes and love it; however, I have my Inbox configured using sub-folders under my Inbox, and rules that send an email into one of five different email identities based on the email address in the To field of the email. As a result, the Junk Suspects folder isn't receiving any emails. As I receive emails into the various folders, if they are spam, I use your Delete As Spam button to remove them. Is SpamBayes "learning" that they are spam even if they don't get routed first through the Junk Suspects folder, or do I have to configure my Outlook with a single Inbox? Thanks, Ron Garland Lone Wolf Marketing 406-556-5085 Office 406-581-2889 Fax -------------- next part -------------- An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/spambayes/attachments/20061124/97172fce/attachment.html From Contacttanya at Raymondltd.com Wed Nov 22 17:34:27 2006 From: Contacttanya at Raymondltd.com (Tanya Miller) Date: Wed, 22 Nov 2006 10:34:27 -0600 Subject: [Spambayes] JOB OFFER Message-ID: Good Day, Are you looking for a lucrative job? The job takes only 3-5 hours a week , And it a chance for you to make over $3,000 extra per month depending on how usefull you are to the company . Also you do not need to resume at any office to get started ,Its a work from home and you do not pay any fee to get started . Try now without risking your current job.Do get back to me ifinterested . Thanks Tanya From neil at techspeak.com.au Mon Nov 27 08:24:49 2006 From: neil at techspeak.com.au (Neil Hymans- Technically Speaking) Date: Mon, 27 Nov 2006 15:24:49 +0800 Subject: [Spambayes] Outlook 2003 loading error Message-ID: <002a01c711f5$1fcae9c0$0301010a@office> Hi guys Love your work! I've just installed Outlook 03 on a new machine with Windows XP SP2. MS Update has done its thing and Outlook & all the Office bits have the most current updates available. When I installed the SpamBayes plugin for Outlook, Outlook crashed repeatedly. I have followed all the advice in the troubleshooting but with no success. (I used SpamBayes on my previous system with the same environment as above and it worked perfectly every time, so I know what I should be seeing.) I thought there might be two possible causes: a conflict with another Add-in (MS PST backup) or perhaps the fact that my PST was set up from my last configuration to display the "spam rating" field in the inbox. So I uninstalled PST backup and SpamBayes, restarted & closed Outlook without errors, then reinstalled SpamBayes - no joy. I then did the same thing after removing the Spam Rating field from my inbox - this didn't make any difference either. 
I doubt the attached log files are going to tell you much, but I'd be grateful for any suggestions you might have for anything else I should try. Best regards Neil Hymans -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061127/2c3637c8/attachment.htm -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes3.log Type: application/octet-stream Size: 37 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061127/2c3637c8/attachment.obj -------------- next part -------------- A non-text attachment was scrubbed... Name: spambayes2.log Type: application/octet-stream Size: 59 bytes Desc: not available Url : http://mail.python.org/pipermail/spambayes/attachments/20061127/2c3637c8/attachment-0001.obj From shawn at 12pointdesign.com Mon Nov 27 08:49:32 2006 From: shawn at 12pointdesign.com (Shawn K. Hall) Date: Sun, 26 Nov 2006 23:49:32 -0800 Subject: [Spambayes] Outlook 2003 loading error In-Reply-To: <002a01c711f5$1fcae9c0$0301010a@office> Message-ID: <01d101c711f8$93dbddd0$1e00a8c0@Yoda> Hi Neil, > I've just installed Outlook 03 on a new machine with Windows > XP SP2...When I installed the SpamBayes plugin for Outlook, > Outlook crashed repeatedly. Have you tested the ram already? http://memtest86.com/ When nothing else makes sense, I find it's usually the memory. Before reinstalling again, make sure you completely empty all temp files. With SB uninstalled, you might also try changing the macro security settings to low, then re-installing. I'd try emptying the temp files and installing with normal macro security first, though. Regards, Shawn K. 
Hall http://12PointDesign.com/ From rowan at sylvester-bradley.org Mon Nov 27 11:03:58 2006 From: rowan at sylvester-bradley.org (Rowan) Date: Mon, 27 Nov 2006 02:03:58 -0800 (PST) Subject: [Spambayes] Outlook 2000 Toolbar Buttons Not Working Message-ID: <7557493.post@talk.nabble.com> I'm using SpamBayes with Outlook 2000. The Delete as Spam and Recover from Spam buttons on the toolbar very rarely work (sometimes they work just a couple of times after a restart, but then they stop working). This is making SpamBayes pretty well useless for me, since I can't recover mail from the Junk Suspects folder, therefore SpamBayes never learns not to junk the messages I want. I'm on the point of giving up and looking for another spam filter. This forum seems to have loads of messages from people suffering with the same problem. Before I finally give up on SpamBayes, has anyone got a solution for this problem? Is anyone working on a solution? Does anyone understand what the problem is? Thanks - Rowan -- View this message in context: http://www.nabble.com/Outlook-2000-Toolbar-Buttons-Not-Working-tf2710826.html#a7557493 Sent from the Python - spambayes mailing list archive at Nabble.com. From dmoya at sailstx.org Mon Nov 27 17:11:14 2006 From: dmoya at sailstx.org (Dan Moya) Date: Mon, 27 Nov 2006 10:11:14 -0600 Subject: [Spambayes] Spam Clues: RE: Friendly Reminder Message-ID: <004401c7123e$aa78bc20$1d00a8c0@SAILS.local> Mail from this user keeps ending up in my suspected junk e-mail folder. I am trying to correct this problem and am seeking help.
Combined Score: 93% (0.927286) Internal ham score (*H*): 0.0343621 Internal spam score (*S*): 0.888935 # ham trained on: 1 # spam trained on: 2 34 Significant Tokens token spamprob #ham #spam 'from:addr:sailstx.org' 0.155172 1 0 "to:name:'dan moya'" 0.155172 1 0 'url:org' 0.155172 1 0 'url:rec-html40' 0.155172 1 0 'url:tr' 0.155172 1 0 'url:w3' 0.155172 1 0 'x-mailer:microsoft office outlook 11' 0.155172 1 0 'header:Date:1' 0.363946 1 1 'header:From:1' 0.363946 1 1 'header:MIME-Version:1' 0.363946 1 1 'header:Received:2' 0.363946 1 1 'header:Subject:1' 0.363946 1 1 'header:To:1' 0.363946 1 1 'proto:http' 0.363946 1 1 'url:www' 0.363946 1 1 '(210)' 0.844828 0 1 '2006' 0.844828 0 1 'antonio' 0.844828 0 1 'antonio,' 0.844828 0 1 'are' 0.844828 0 1 'december' 0.844828 0 1 'for' 0.844828 0 1 'have' 0.844828 0 1 'our' 0.844828 0 1 'please' 0.844828 0 1 'san' 0.844828 0 1 'services' 0.844828 0 1 'texas' 0.844828 0 1 'that' 0.844828 0 1 'the' 0.844828 0 1 'when' 0.844828 0 1 'you' 0.844828 0 1 'can' 0.908163 0 2 'subject: ' 0.908163 0 2 Message Stream Return-Path: Received: from sheffield.xo.com (sheffield.xo.com [207.155.252.71]) by safeguard.xo.com (ConcentricHost(3.4) MX) with ESMTP id CCFF28E9E3 for ; Mon, 27 Nov 2006 11:08:07 -0500 (EST) Received: from mallika1 (ip67-154-44-19.z44-154-67.customer.algx.net [67.154.44.19]) (as mrangarajan at sailstx.org) by sheffield.xo.com (ConcentricHost(2.54) Relay) with ESMTP id 212943D36 for ; Mon, 27 Nov 2006 10:45:23 -0500 (EST) From: "Mallika Rangarajan" To: "'Dan Moya'" Subject: RE: Friendly Reminder Date: Mon, 27 Nov 2006 09:38:53 -0600 Message-ID: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_NextPart_000_00E3_01C71207.DA185FD0" X-Mailer: Microsoft Office Outlook 11 Thread-Index: AccOWBSoU9Kc8MFSSxaXMqRp8gC/bAD3fjjQAAAyexAAANFEkA== X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.2869 In-Reply-To: <003801c71236$e7d26dd0$1d00a8c0 at SAILS.local> Disposition-Notification-To: "Mallika Rangarajan" 
X-AllowList: Yes X-MFData: [v2.3 t11,631114] X-UIDL: 708 December 15, 2006 Mallika Rangarajan Executive Administrative Assistant Human Resources Department San Antonio Independent Living Services 1028 S. Alamo San Antonio, Texas 78210 (210) 281-1878 ext. 19 V/TDD (210) 281-1759 FAX _____ From: Dan Moya [mailto:dmoya at sailstx.org] Sent: Monday, November 27, 2006 9:16 AM To: 'Mallika Rangarajan' Subject: RE: Friendly Reminder Question, when is our last day of December? _____ From: Mallika Rangarajan [mailto:mrangarajan at sailstx.org] Sent: Monday, November 27, 2006 9:11 AM To: 'Dan Moya' Subject: FW: Friendly Reminder When are you scheduling a fire drill so I can have the fire drill form ready? Please let me know asap. Thank you. Mallika Mallika Rangarajan Executive Administrative HR Manager Human Resources Department San Antonio Independent Living Services 1028 S. Alamo San Antonio, Texas 78210 (210) 281-1878 ext. 19 V/TDD (210) 281-1759 FAX _____ From: Kitty Brietzke [mailto:kbrietzke at sailstx.org] Sent: Wednesday, November 22, 2006 11:03 AM To: 'Mallika Rangarajan' Cc: 'Dan Moya' Subject: Friendly Reminder Just a friendly reminder that we need a fire drill for December. Kitty

All Message Tokens 117 unique tokens "'dan" "'mallika" '(210)' '1028' '11:03' '15,' '2006' '22,' '27,' '281-1759' '281-1878' '78210' '9:11' '9:16' '_____' 'alamo' 'antonio' 'antonio,' 'are' 'asap.' 'assistant' 'brietzke' 'can' 'cc:' 'cc:none' 'content-type:text/plain' 'dan' 'day' 'december' 'december.' 'december?' 'department' 'drill' 'email addr:sailstx.org]' 'email name:[mailto:dmoya' 'email name:[mailto:kbrietzke' 'email name:[mailto:mrangarajan' 'executive' 'ext.' 'fax' 'fire' 'for' 'form' 'friendly' 'from:' 'from:addr:mrangarajan' 'from:addr:sailstx.org' 'from:name:mallika rangarajan' 'fw:' 'have' 'header:Date:1' 'header:From:1' 'header:In-Reply-To:1' 'header:MIME-Version:1' 'header:Message-ID:1' 'header:Received:2' 'header:Return-Path:1' 'header:Subject:1' 'header:To:1' 'human' 'independent' 'just' 'kitty' 'know' 'last' 'let' 'living' 'mallika' 'manager' 'message-id:@sailstx.org' 'monday,' 'moya' "moya'" 'need' 'november' 'our' 'please' 'proto:http' 'question,' 'rangarajan' "rangarajan'" 're:' 'ready?' 'reminder' 'reply-to:none' 'resources' 'san' 'scheduling' 'sender:none' 'sent:' 'services' 'skip:a 10' 'subject:' 'subject: ' 'subject:: ' 'subject:Friendly' 'subject:Reminder' 'texas' 'thank' 'that' 'the' 'to:' 'to:2**0' 'to:addr:dmoya' 'to:addr:sailstx.org' "to:name:'dan moya'" 'url:org' 'url:rec-html40' 'url:tr' 'url:w3' 'url:www' 'v/tdd' 'wednesday,' 'when' 'x-mailer:microsoft office outlook 11' 'you' 'you.' -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061127/a9d23511/attachment-0001.htm -------------- next part -------------- An embedded message was scrubbed... 
From: "Mallika Rangarajan" Subject: RE: Friendly Reminder Date: Mon, 27 Nov 2006 09:38:53 -0600 Size: 16704 Url: http://mail.python.org/pipermail/spambayes/attachments/20061127/a9d23511/attachment-0001.mht From jsp at PKC.com Mon Nov 27 21:24:33 2006 From: jsp at PKC.com (Jesse Pelton) Date: Mon, 27 Nov 2006 15:24:33 -0500 Subject: [Spambayes] Spam Clues: RE: Friendly Reminder In-Reply-To: <004401c7123e$aa78bc20$1d00a8c0@SAILS.local> Message-ID: <16E2027582CDB74180896CDB4B8CC1F901D04BA6@PKCVT01.pkc.com> It looks like you've only trained on three messages, so SpamBayes apparently doesn't have much to go on. You appear to be using Outlook; if so, you can tell it that this message is not spam by using the "Recover from Spam" or "Not Spam" button (depending on your version) or dragging the message from the suspected junk folder to your inbox. Another possibility is that Outlook's spam filter (which as far as I can tell cannot be completely disabled unless you administer the Exchange server) is putting it in your junk suspects folder. If that's the case, you'll need to install 1.1a3 if you don't already have one of the 1.1 alphas. (Though it's alpha software, I've been running 1.1a2 for a year and a half with no problem.) This version allows you to move ham to a different folder. You'll probably want to filter your Junk folders and have SpamBayes move ham to your inbox. I also think it's a good idea to set up folders for SpamBayes to move spam and spam candidates to (separate from the ones Outlook uses). ________________________________ From: spambayes-bounces at python.org [mailto:spambayes-bounces at python.org] On Behalf Of Dan Moya Sent: Monday, November 27, 2006 11:11 AM To: spambayes at python.org Subject: [Spambayes] Spam Clues: RE: Friendly Reminder I keep getting this email user mail in my suspected junk e-mail. I am trying to correct this problem and am seeking help. 
Combined Score: 93% (0.927286) Internal ham score (*H*): 0.0343621 Internal spam score (*S*): 0.888935 # ham trained on: 1 # spam trained on: 2 34 Significant Tokens token spamprob #ham #spam 'from:addr:sailstx.org' 0.155172 1 0 "to:name:'dan moya'" 0.155172 1 0 'url:org' 0.155172 1 0 'url:rec-html40' 0.155172 1 0 'url:tr' 0.155172 1 0 'url:w3' 0.155172 1 0 'x-mailer:microsoft office outlook 11' 0.155172 1 0 'header:Date:1' 0.363946 1 1 'header:From:1' 0.363946 1 1 'header:MIME-Version:1' 0.363946 1 1 'header:Received:2' 0.363946 1 1 'header:Subject:1' 0.363946 1 1 'header:To:1' 0.363946 1 1 'proto:http' 0.363946 1 1 'url:www' 0.363946 1 1 '(210)' 0.844828 0 1 '2006' 0.844828 0 1 'antonio' 0.844828 0 1 'antonio,' 0.844828 0 1 'are' 0.844828 0 1 'december' 0.844828 0 1 'for' 0.844828 0 1 'have' 0.844828 0 1 'our' 0.844828 0 1 'please' 0.844828 0 1 'san' 0.844828 0 1 'services' 0.844828 0 1 'texas' 0.844828 0 1 'that' 0.844828 0 1 'the' 0.844828 0 1 'when' 0.844828 0 1 'you' 0.844828 0 1 'can' 0.908163 0 2 'subject: ' 0.908163 0 2 Message Stream Return-Path: Received: from sheffield.xo.com (sheffield.xo.com [207.155.252.71]) by safeguard.xo.com (ConcentricHost(3.4) MX) with ESMTP id CCFF28E9E3 for ; Mon, 27 Nov 2006 11:08:07 -0500 (EST) Received: from mallika1 (ip67-154-44-19.z44-154-67.customer.algx.net [67.154.44.19]) (as mrangarajan at sailstx.org) by sheffield.xo.com (ConcentricHost(2.54) Relay) with ESMTP id 212943D36 for ; Mon, 27 Nov 2006 10:45:23 -0500 (EST) From: "Mallika Rangarajan" To: "'Dan Moya'" Subject: RE: Friendly Reminder Date: Mon, 27 Nov 2006 09:38:53 -0600 Message-ID: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_NextPart_000_00E3_01C71207.DA185FD0" X-Mailer: Microsoft Office Outlook 11 Thread-Index: AccOWBSoU9Kc8MFSSxaXMqRp8gC/bAD3fjjQAAAyexAAANFEkA== X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.2869 In-Reply-To: <003801c71236$e7d26dd0$1d00a8c0 at SAILS.local> Disposition-Notification-To: "Mallika Rangarajan" 
X-AllowList: Yes X-MFData: [v2.3 t11,631114] X-UIDL: 708 December 15, 2006 Mallika Rangarajan Executive Administrative Assistant Human Resources Department San Antonio Independent Living Services 1028 S. Alamo San Antonio, Texas 78210 (210) 281-1878 ext. 19 V/TDD (210) 281-1759 FAX _____ From: Dan Moya [mailto:dmoya at sailstx.org] Sent: Monday, November 27, 2006 9:16 AM To: 'Mallika Rangarajan' Subject: RE: Friendly Reminder Question, when is our last day of December? _____ From: Mallika Rangarajan [mailto:mrangarajan at sailstx.org] Sent: Monday, November 27, 2006 9:11 AM To: 'Dan Moya' Subject: FW: Friendly Reminder When are you scheduling a fire drill so I can have the fire drill form ready? Please let me know asap. Thank you. Mallika Mallika Rangarajan Executive Administrative HR Manager Human Resources Department San Antonio Independent Living Services 1028 S. Alamo San Antonio, Texas 78210 (210) 281-1878 ext. 19 V/TDD (210) 281-1759 FAX _____ From: Kitty Brietzke [mailto:kbrietzke at sailstx.org] Sent: Wednesday, November 22, 2006 11:03 AM To: 'Mallika Rangarajan' Cc: 'Dan Moya' Subject: Friendly Reminder Just a friendly reminder that we need a fire drill for December. Kitty
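To illustrate why three training messages give the classifier so little to go on, the spamprob column in the clue report quoted above can be reproduced with a few lines of Python. This is only a sketch, not SpamBayes's own code: the function names are made up for illustration, and the constants 0.45 (unknown-word strength) and 0.5 (unknown-word probability) are assumed to be the default option values. Under those assumptions the numbers agree with the report.

```python
# Sketch only: reproduces the spamprob column of the clue report above.
# Function names are hypothetical; S and X are assumed to be the default
# unknown-word strength and unknown-word probability.

S = 0.45  # assumed default: weight given to the prior
X = 0.5   # assumed default: prior probability for an unseen token


def token_spamprob(ham_count, spam_count, nham, nspam):
    """Estimate a token's spam probability from training counts."""
    ham_ratio = ham_count / nham if nham else 0.0
    spam_ratio = spam_count / nspam if nspam else 0.0
    if ham_ratio + spam_ratio == 0:
        return X  # token never seen in training
    raw = spam_ratio / (ham_ratio + spam_ratio)
    n = ham_count + spam_count
    # Blend the raw estimate with the prior, weighting the raw estimate
    # by the number of training messages the token appeared in.
    return (S * X + n * raw) / (S + n)


def combined_score(ham_score, spam_score):
    """Combine the internal *H* and *S* indicators into the final score."""
    return (spam_score - ham_score + 1.0) / 2.0


if __name__ == "__main__":
    nham, nspam = 1, 2  # "# ham trained on: 1", "# spam trained on: 2"
    for token, ham, spam in [
        ("from:addr:sailstx.org", 1, 0),
        ("header:Date:1", 1, 1),
        ("texas", 0, 1),
        ("can", 0, 2),
    ]:
        prob = token_spamprob(ham, spam, nham, nspam)
        print(f"{token!r:28} {prob:.6f}")
    print(f"combined: {combined_score(0.0343621, 0.888935):.6f}")
```

With so little training data, any token seen once in spam and never in ham comes out at about 0.84, which is why ordinary words such as 'the' and 'you' show up as spam clues here. Training on more ham that contains those words should pull the values back toward the middle.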

All Message Tokens 117 unique tokens "'dan" "'mallika" '(210)' '1028' '11:03' '15,' '2006' '22,' '27,' '281-1759' '281-1878' '78210' '9:11' '9:16' '_____' 'alamo' 'antonio' 'antonio,' 'are' 'asap.' 'assistant' 'brietzke' 'can' 'cc:' 'cc:none' 'content-type:text/plain' 'dan' 'day' 'december' 'december.' 'december?' 'department' 'drill' 'email addr:sailstx.org]' 'email name:[mailto:dmoya' 'email name:[mailto:kbrietzke' 'email name:[mailto:mrangarajan' 'executive' 'ext.' 'fax' 'fire' 'for' 'form' 'friendly' 'from:' 'from:addr:mrangarajan' 'from:addr:sailstx.org' 'from:name:mallika rangarajan' 'fw:' 'have' 'header:Date:1' 'header:From:1' 'header:In-Reply-To:1' 'header:MIME-Version:1' 'header:Message-ID:1' 'header:Received:2' 'header:Return-Path:1' 'header:Subject:1' 'header:To:1' 'human' 'independent' 'just' 'kitty' 'know' 'last' 'let' 'living' 'mallika' 'manager' 'message-id:@sailstx.org' 'monday,' 'moya' "moya'" 'need' 'november' 'our' 'please' 'proto:http' 'question,' 'rangarajan' "rangarajan'" 're:' 'ready?' 'reminder' 'reply-to:none' 'resources' 'san' 'scheduling' 'sender:none' 'sent:' 'services' 'skip:a 10' 'subject:' 'subject: ' 'subject:: ' 'subject:Friendly' 'subject:Reminder' 'texas' 'thank' 'that' 'the' 'to:' 'to:2**0' 'to:addr:dmoya' 'to:addr:sailstx.org' "to:name:'dan moya'" 'url:org' 'url:rec-html40' 'url:tr' 'url:w3' 'url:www' 'v/tdd' 'wednesday,' 'when' 'x-mailer:microsoft office outlook 11' 'you' 'you.' -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061127/e01ba5da/attachment.html From rking at preflightventures.com Mon Nov 27 23:13:06 2006 From: rking at preflightventures.com (Rick King) Date: Mon, 27 Nov 2006 17:13:06 -0500 Subject: [Spambayes] Spambayes major problem and help needed Message-ID: We have been long-time users of SpamBayes (Windows XP) and have gotten 10+ people on SpamBayes, but please help us work out what to do about the major problem we are now having.
Using POP3 Proxy Binary ver 1.0.4 March '05 and I am Reviewing and Training messages 2 times a day! 1. A tremendous amount of Spam mail is getting thru ! Literally HUNDREDS of spam messages daily are getting into my Inbox and thousands are getting caught by Spambayes. 2. MANY of the same SPAM sources and addresses and subjects keep getting thru and into inbox over & over & over (for weeks now; esp one from PEQV Parametric ! VERY nasty and repetitive - many dozens a day) 3. Is it because the spammers are using all image messages? (i.e. the text looks all funky and is a picture to throw things off) 4. Is there an update to help? Thank you for any help! - Rick ___________________________________ Rick King PreFlight Ventures (919) 806-1166 www.preflightventures.com Based in Research Triangle Park, NC, PreFlight Ventures helps entrepreneurs and new ventures grow their business through corporate partnering, technology licensing, and acquisitions: http://preflightventures.com/cpc/ In addition, we provide coaching tools for entrepreneurs: self-assessment scorecard, guidebooks, audio PreFlight PowerTips (http://preflightpowertips.com), profiles distributed to corporate partners & investors, and conduct national workshops and talks on Corporate Partnering, "The Art of Telling Your Story", commercialization, and "Doing Smart Deals." <>< From shawn at 12pointdesign.com Tue Nov 28 04:49:45 2006 From: shawn at 12pointdesign.com (Shawn K. Hall) Date: Mon, 27 Nov 2006 19:49:45 -0800 Subject: [Spambayes] Spambayes major problem and help needed In-Reply-To: Message-ID: <05be01c712a0$3eb38c10$1e00a8c0@Yoda> Hi Rick, > 3. Is it because the spammers are using all image messages? > (i.e. the text looks all funky and is a picture to throw > things off) Yes. The developers are working on adding an image-processing capability, which will extract the text portions from images received in these emails. 
However, there is no expected release date, and little has been said about it in the last month or so. With December fast approaching, there's a good chance that it will not happen until early 2007, as open source developers lead just as busy lives as the rest of you, and December tends to be a very busy month. Regards, Shawn K. Hall http://12PointDesign.com/ From rwakefield at kusala.com Tue Nov 28 10:36:10 2006 From: rwakefield at kusala.com (Richard Wakefield, Kusala) Date: Tue, 28 Nov 2006 09:36:10 -0000 Subject: [Spambayes] SB not filtering Message-ID: <020301c712d0$a3b19410$2406a8c0@thor> Hi, I've been running SB for some time now. I'm really pleased with it. I like to leave it running 24/7 to clear out all the overnight spam. I've just reformatted my hard drive and reinstalled the latest version. Now SB does nothing with incoming mail until I click on "filter messages"; it won't work automatically. Any ideas? Kind regards Richard Wakefield Managing Director Kusala Web Developments Ltd Lydgate House, Lydgate Lane, Sheffield, S10 5FH 0114 267 6002 07971 040054 www.kusala.com -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061128/25b0d924/attachment.htm From kdsabel at dotnet.com Tue Nov 28 13:31:29 2006 From: kdsabel at dotnet.com (Kim and Dennis Sabel) Date: Tue, 28 Nov 2006 06:31:29 -0600 Subject: [Spambayes] Your recent mail to remove@audible.com Message-ID: <000501c712e9$21b3cd20$6401a8c0@COMPRM> I would like to remove myself from this application. I applied for the audible books last week and decided that I don't want to have this subscription. Please take my information off the account and unsubscribe me from this. Thank you, Kim Sabel kdsabel at dotnet.com -------------- next part -------------- An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/spambayes/attachments/20061128/c7fa366d/attachment.htm From jsp at PKC.com Tue Nov 28 14:04:50 2006 From: jsp at PKC.com (Jesse Pelton) Date: Tue, 28 Nov 2006 08:04:50 -0500 Subject: [Spambayes] Your recent mail to remove@audible.com In-Reply-To: <000501c712e9$21b3cd20$6401a8c0@COMPRM> Message-ID: <16E2027582CDB74180896CDB4B8CC1F901D04BAB@PKCVT01.pkc.com> Your message went to the SpamBayes list, but apparently not to the audible books vendor (presumably audible.com). I can't tell you how to remove yourself from their list, but if you're subscribed to the SpamBayes list, you can unsubscribe from it at http://mail.python.org/mailman/listinfo/spambayes. ________________________________ From: spambayes-bounces at python.org [mailto:spambayes-bounces at python.org] On Behalf Of Kim and Dennis Sabel Sent: Tuesday, November 28, 2006 7:31 AM To: spambayes at python.org Subject: [Spambayes] Your recent mail to remove at audible.com I would like to remove myself from this application. I applied for the audible books last week and decided that I don't want to have this subscription. Please take my information off the account and unsubscribe me from this. Thank you, Kim Sabel kdsabel at dotnet.com -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061128/dc91746f/attachment.html From tleish at hotmail.com Tue Nov 28 18:16:55 2006 From: tleish at hotmail.com (Thomas L) Date: Tue, 28 Nov 2006 08:16:55 -0900 Subject: [Spambayes] FW: Spambayes Windows Setup Message-ID: I am having problems getting this to run on Windows. If I download and install the executable, I don't also need to install Python, do I? That's only needed if I'm installing from source, correct?
When attempting to use the config web pages, I try to save my configuration and get: http://localhost:8880/changeopts ---------------------------------------------- 500 Server error Traceback (most recent call last): File "spambayes\Dibbler.pyc", line 476, in found_terminator File "spambayes\UserInterface.pyc", line 884, in onChangeopts File "spambayes\ProxyUI.pyc", line 782, in reReadOptions File "sb_server.pyc", line 1007, in _recreateState File "sb_server.pyc", line 1022, in prepare File "sb_server.pyc", line 822, in prepare File "sb_server.pyc", line 889, in createWorkers File "spambayes\Stats.pyc", line 60, in __init__ AttributeError: 'NoneType' object has no attribute 'get_statistics_start_date' ---------------------------------------------- Then after a few tries, I attempt to go back to the home page and get: http://localhost:8880/home ---------------------------------------------- 500 Server error Traceback (most recent call last): File "spambayes\Dibbler.pyc", line 476, in found_terminator File "spambayes\ProxyUI.pyc", line 184, in onHome File "sb_server.pyc", line 835, in buildStatusStrings AttributeError: 'NoneType' object has no attribute 'nspam' ---------------------------------------------- It's not until I stop and restart the program that I can get a valid page again. Even with the above errors, the config changes seem to be sticking. I confirmed that in the config file, however it doesn't seem to work with my Outlook Express. http://spambayes.sourceforge.net/faq.html#how-do-i-set-up-spambayes-and-outlook-express I followed the instructions from the above link step for step several times and have still yet to get it to work. I am a software developer, so I'm not a novice when it comes to this type of stuff. I've searched the list for solutions, and have seen other people ask similar questions to this, but no replies. I'm willing to test out a few things and try to dig down to figure out why it's not working. 
I've looked for logs to indicate any problems... found one, but the little information didn't seem to raise any red flags. Thank you, -Thomas _________________________________________________________________ Fixing up the home? Live Search can help http://imagine-windowslive.com/search/kits/default.aspx?kit=improve&locale=en-US&source=hmemailtaglinenov06&FORM=WLMTAG From peter at ptoye.com Tue Nov 28 19:44:28 2006 From: peter at ptoye.com (Peter Toye) Date: Tue, 28 Nov 2006 18:44:28 +0000 Subject: [Spambayes] Spambayes has stopped working Message-ID: <1148240990.20061128184428@ptoye.com> I am using SpamBayes POP3 Proxy Version 1.0.4 (March 2005) (binary), with version 2.3.5 (#62, Feb 8 2005, 16:23:02) [MSC v.1200 32 bit (Intel)] of Python; my operating system is Windows 5.1.2600.2 (Service Pack 2). I have trained 1610 ham and 2842 spam. My copy of Spambayes, taken from a magazine cover, has suddenly stopped connecting to the POP3 server. There didn't seem to be a time limitation and it's not asked for money or anything. If I put the server address into the client it works fine, so I can't see how it can be at the server end. The error message I get is: !28/11/2006, 18:07:24: FETCH - Server reports error. The response is: -ERR Can't connect to mail.interdns.co.uk:110: 10061 I tried using the Spambayes bug reporter, and this gave an error message: 500 Server error Traceback (most recent call last): File "spambayes\Dibbler.pyc", line 470, in found_terminator File "spambayes\UserInterface.pyc", line 1120, in onSubmitreport File "smtplib.pyc", line 240, in __init__ File "smtplib.pyc", line 302, in connect error: (10061, 'Connection refused') This isn't much help :-( I'll report it to the SF bug page. Regards, Peter Toye mailto:peter at ptoye.com www.ptoye.com From engineer.xyliu at gmail.com Wed Nov 29 03:59:51 2006 From: engineer.xyliu at gmail.com (engineer.xyliu) Date: Wed, 29 Nov 2006 10:59:51 +0800 Subject: [Spambayes] hi,nice to meet you. 
Message-ID: Hi, I'm doing research on anti-spam on my own, and I need a lot of spam samples. I have tried hard to attract spam, but so far I can only get about 300 spam messages per day. So, could you kindly help me with spam collection? I use engineer.xyliu at gmail.com to collect spam. Please send as many as possible to that mailbox. Or, if you have a spam archive, please send it to that mailbox too. Thanks in advance! Regards. engineer.xyliu at gmail -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061129/0d5fa77c/attachment.htm From skip at pobox.com Wed Nov 29 06:04:48 2006 From: skip at pobox.com (skip at pobox.com) Date: Tue, 28 Nov 2006 23:04:48 -0600 Subject: [Spambayes] OT: How to send a Unix mbox of mail to an Outlook user? Message-ID: <17773.5360.425512.406039@montanaro.dyndns.org> I have a bunch of email messages in typical Unix mbox format. I wrapped them up in a MIME digest and forwarded them to my correspondent, but cc'd my gmail address as well. Gmail didn't do anything useful with the attachment (like explode the digest into individual emails), so I'm skeptical that my Outlook-using correspondent will get them in a form he can use. I won't know until tomorrow sometime, but I'm skeptical this worked. Any thoughts? Thx, Skip From amedee at amedee.be Wed Nov 29 08:04:03 2006 From: amedee at amedee.be (Amedee Van Gasse) Date: Wed, 29 Nov 2006 08:04:03 +0100 Subject: [Spambayes] OT: How to send a Unix mbox of mail to an Outlook user? In-Reply-To: <17773.5360.425512.406039@montanaro.dyndns.org> References: <17773.5360.425512.406039@montanaro.dyndns.org> Message-ID: <1164783843.31259.20.camel@localhost> On Tuesday 28-11-2006 at 23:04 [time zone -0600], skip at pobox.com wrote: > I have a bunch of email messages in typical Unix mbox format. I wrapped > them up in a MIME digest and forwarded them to my correspondent, but cc'd my > gmail address as well.
Gmail didn't do anything useful with the attachment > (like explode the digest into individual emails), so I'm skeptical that my > Outlook-using correspondent will get them in a form he can use. I won't > know until tomorrow sometime, but I'm skeptical this worked. Any thoughts? > > Thx, > > Skip Skip, Your correspondent could install a local POP3 or IMAP server implementation, preferably something originally Unix/Linux that has been ported to Windows. Or he could use Cygwin. Once he has that local mailserver, he can use his Outlook client as usual. I know about a tool that directly converts .pst files to mbox format, but I don't know about any tool that does the opposite... (perhaps google knows?) -- Amedee From sethg at goodmanassociates.com Wed Nov 29 09:38:16 2006 From: sethg at goodmanassociates.com (Seth Goodman) Date: Wed, 29 Nov 2006 02:38:16 -0600 Subject: [Spambayes] OT: How to send a Unix mbox of mail to an Outlook user? In-Reply-To: <17773.5360.425512.406039@montanaro.dyndns.org> Message-ID: skip at pobox.com wrote on Tuesday, November 28, 2006 11:05 PM -0500: > I have a bunch of email messages in typical Unix mbox format. I > wrapped them up in a MIME digest and forwarded them to my > correspondent, but cc'd my gmail address as well. Gmail didn't do > anything useful with the attachment (like explode the digest into > individual emails), so I'm skeptical that my Outlook-using > correspondent will get them in a form he can use. I won't know > until tomorrow sometime, but I'm skeptical this worked. Any > thoughts? I ran across http://www.broobles.com/imapsize/mbox2eml.php. This converts the mbox file to eml (individual Outlook Express messages), so there's an extra step to get to Outlook. I tried it on an mbox saved from Evolution on a Debian box and it worked. If you don't have access to a Windows box, I'd be happy to convert some files for you. Another option is to read it into Eudora, which produces Eudora's special indexed mbox files. 
Outlook can import these directly for version 3 and earlier of Eudora (at least for Outlook2K). Yet another thing you could do is to make the mbox available through an IMAP server and have your correspondent read the messages using Outlook. Finally, since you mentioned Gmail, I ran into a Gmail uploader written in Python that understands mbox files. http://www.marklyon.org/gmail/default.htm -- Seth Goodman From epittman at martinauto.com Wed Nov 29 10:37:39 2006 From: epittman at martinauto.com (Edward Pittman) Date: Wed, 29 Nov 2006 10:37:39 +0100 Subject: [Spambayes] Can't get Spam Bayes to work Message-ID: <000601c7139a$03899cb0$2501a8c0@martinepittman> I have set up my Outlook Express accounts as described in the FAQ, section 4.21. But when I try to send and receive e-mails, I keep getting a request for the name and password for localhost on each of the accounts I have. Help!! Ed Pittman -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061129/9893c661/attachment.htm From kozel at lineart.cz Wed Nov 29 13:28:18 2006 From: kozel at lineart.cz (Pavel Strnad) Date: Wed, 29 Nov 2006 13:28:18 +0100 Subject: [Spambayes] spambayes installing Message-ID: <009d01c713b1$d98bb7a0$021a14ac@LINEART> Hi, I have a problem downloading the second requirement, "Mark Hammond's pywin32 extensions. Choose the version which corresponds to the version of Python you downloaded". Please help me. I have downloaded the latest Python installer (ver 2.5) and SpamBayes 1.1a3. Thanks, best regards, Pavel -------------- next part -------------- An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/spambayes/attachments/20061129/58675f49/attachment.html From jsp at PKC.com Wed Nov 29 14:37:04 2006 From: jsp at PKC.com (Jesse Pelton) Date: Wed, 29 Nov 2006 08:37:04 -0500 Subject: [Spambayes] spambayes installing In-Reply-To: <009d01c713b1$d98bb7a0$021a14ac@LINEART> Message-ID: <16E2027582CDB74180896CDB4B8CC1F901D04BB5@PKCVT01.pkc.com> The linked site that hosts the pywin32 extensions appears to have had problems. They are supposed to be at http://starship.python.net/crew/mhammond/win32/Downloads.html, but if you go there, you get a 404 error. If you go up a couple of levels, http://starship.python.net/crew/mhammond/ gives an explanation: "The starship's had some problems, but we're back online and now we have to wait for the crew to restore their web pages..." However, a quick Google search reveals that pywin32 development is hosted on SourceForge (http://sourceforge.net/projects/pywin32/). You can download the latest version there. But you might want to take an alternative approach. It sounds like you're trying to install from source, though you appear to be running on Windows, for which a stand-alone installer is available. Unless you really need to have the source, maybe you should try the installer (currently spambayes-1.1a3.exe at https://sourceforge.net/project/showfiles.php?group_id=61702&package_id=58141). ________________________________ From: spambayes-bounces at python.org [mailto:spambayes-bounces at python.org] On Behalf Of Pavel Strnad Sent: Wednesday, November 29, 2006 7:28 AM To: spambayes at python.org Subject: [Spambayes] spambayes installing Hi, I have a problem downloading the second requirement, "Mark Hammond's pywin32 extensions. Choose the version which corresponds to the version of Python you downloaded". Please help me. I have downloaded the latest Python installer (ver 2.5) and SpamBayes 1.1a3. Thanks, best regards, Pavel -------------- next part -------------- An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/spambayes/attachments/20061129/7f29eb64/attachment.htm From wsy at merl.com Wed Nov 29 14:36:29 2006 From: wsy at merl.com (Bill Y) Date: Wed, 29 Nov 2006 08:36:29 -0500 Subject: [Spambayes] hi,nice to meet you. In-Reply-To: (engineer.xyliu@gmail.com) References: Message-ID: <200611291336.kATDaTrf020455@spoo.merl.com> Sorry, Xyliu; I actually don't have it split out in a way that would be easy to send you. However, NIST publishes their corpus; the most recent one is 93,000 messages with adjudicated statuses. You might want to get that and use that. As another advantage, it's a published corpus, so any results you get are directly comparable. - Bill Yerazunis From pailin_pla at yahoo.com Wed Nov 29 21:20:14 2006 From: pailin_pla at yahoo.com (Pailin Lertyingyos) Date: Wed, 29 Nov 2006 12:20:14 -0800 (PST) Subject: [Spambayes] Server-Side SpamBayes Solution Message-ID: <603099.60579.qm@web31914.mail.mud.yahoo.com> Hi there, I'm interested in server-side SpamBayes solutions. Would you please let me know where I could find articles or feedback about them? Thanks in advance for your help. I appreciate it. Pailin -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/spambayes/attachments/20061129/e4d0225e/attachment.htm