[Spambayes] Problems with unheader.py

Anthony Baxter anthony@interlink.com.au
Wed, 25 Sep 2002 11:48:24 +1000


>>> Greg Ward wrote
> I've been playing around with unheader.py -- it looks like it's just the
> ticket for cleaning up some of the spam I've gathered.  Apart from
> adding Maildir support to it, I think I've found some bugs:

Quite probably. It's a quick hack :)


>   2) deSA() removes the first and last line of the body
> 
>   3) deSA() crashes on MIME messages -- assumes the result of
>      get_payload() is a string
> 
> #1 is easy to fix -- just remove the ability to read from stdin.  Anyone
> care?  It'll make my patch to add Maildir simpler.

I'd say kill it.
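
With stdin gone, the reader can be picked from the path alone. A rough
sketch, assuming the standard library's mailbox module; open_mailbox and
iter_messages are hypothetical names, not anything in unheader.py:

```python
import mailbox
import os


def open_mailbox(path):
    """Return a Maildir for a directory, otherwise treat path as an mbox file."""
    if os.path.isdir(path):
        # A Maildir is a directory holding cur/, new/ and tmp/
        return mailbox.Maildir(path, create=False)
    return mailbox.mbox(path, create=False)


def iter_messages(path):
    """Yield every message stored under path, closing the mailbox afterwards."""
    box = open_mailbox(path)
    try:
        for msg in box:
            yield msg
    finally:
        box.close()
```

This only covers reading; writing the cleaned messages back out is a
separate question.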

> #2 is *probably* an easy/silly bug, but I haven't looked into it yet.

Really? Hadn't seen that one.

> #3 makes it look like no one has used this code (the attempt to remove
> SA's "SPAM: " lines in particular) on a real spam corpus.  Really?

But SA turns the message into text/plain anyway... well, it does the
way I run it here...
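
For setups where SA leaves the MIME structure alone, something along
these lines might cope. It is only a sketch, assuming the standard email
package; strip_sa_lines and SA_PREFIX are made-up names, not what
unheader.py's deSA() actually does. It filters lines by the "SPAM: "
prefix instead of by position, and recurses when get_payload() returns a
list of parts rather than a string:

```python
import email
from email.message import Message

SA_PREFIX = "SPAM: "  # the prefix mentioned in this thread


def strip_sa_lines(msg: Message) -> None:
    """Drop lines starting with SA_PREFIX from every text part, in place."""
    if msg.is_multipart():
        # For multipart messages get_payload() is a list of sub-messages
        for part in msg.get_payload():
            strip_sa_lines(part)
        return
    payload = msg.get_payload()
    if not isinstance(payload, str) or msg.get_content_maintype() != "text":
        return  # leave non-text parts untouched
    lines = payload.splitlines(keepends=True)
    kept = [line for line in lines if not line.startswith(SA_PREFIX)]
    msg.set_payload("".join(kept))
```

Feed it the result of email.message_from_string() or a message pulled
from a mailbox. Parts with an encoded body (base64, say) pass through
unchanged, since the raw payload never starts with the prefix.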

Anthony

-- 
Anthony Baxter     <anthony@interlink.com.au>   
It's never too late to have a happy childhood.