[Mailman-Developers] Scrubber.py confusion, 2.1b3
Michael Meltzer
mjm@michaelmeltzer.com
Tue, 13 Aug 2002 12:52:06 -0400
I just eyeballed it, looking good, I on the "coding chain gang" for a few
hours, I make my CVS recipe tonight and give it a go. Thank You. I post the
MimeDel.py hack in a bit, right now my python is a graceful as a sludge
hammer :-)
MJM
----- Original Message -----
From: "Barry A. Warsaw" <barry@python.org>
To: "Michael Meltzer" <mjm@michaelmeltzer.com>
Cc: <Mailman-Developers@python.org>
Sent: Tuesday, August 13, 2002 11:38 AM
Subject: Re: [Mailman-Developers] Scrubber.py confusion, 2.1b3
>
> >>>>> "MM" == Michael Meltzer <mjm@michaelmeltzer.com> writes:
>
> MM> Actually I "reusing" the code from Scrubber.py in MimeDel.py
> MM> to turn attachments into links :-) I hardwired it for image
> MM> types but it is generic enough. Some sample output from my
> MM> "staging":
>
> MM> Name: beach.jpg Type: image/jpeg Size: 18853 bytes Desc:
> MM> not_available Url:
> MM>
http://www.michaelmeltzer.com/pipermail/meltzer-list/attachments/200208/12/b
each.jpg-0005.jpe
>
> Cool. I'm using a slightly different naming algorithm for the path.
>
> MM> It turned out to be a 4 line hack to filter_parts, 1 line at
> MM> the top and 10 lines to reformat the payload, the reset came
> MM> from save_attachment, very handle :-)
>
> Can you try to update it to current cvs? If it's really a 4 line
> hack, you've got to post it. :) I tried to write the Scrubber.py
> updates with you in mind, by factoring out some other functionality
> you might need.
>
> MM> I have to admit environment is nice to work in.
>
> :)
>
> MM> I am not sure my code it upto patch quality :-) The next step
> MM> would be a modification to the content filter page for the
> MM> type it should react to.
>
> MM> I would also subject(Scrubber.py needs this too) that the
> MM> filter pages list the extensions that it is allow to write. Or
> MM> the converse the extensions it should not write,
> MM> http://office.microsoft.com/Assistance/2000/Out2ksecFAQ.aspx.
would
> MM> be my start :-), save the masses someday :-)
>
> I've been thinking about this. I vaguely remember that someone did a
> patch to support pass-or-block semantics to the filter, but I can't
> put my finger on it now. I want to link Dan Mick's name to that, but
> does this ring a bell with anyone?
>
> MM> The issue with the directory is the number of files, not a
> MM> name clash
>
> Yep, I know.
>
> MM> , `ls -d archives/private/listname/attachments/* |
> MM> wc -l` > 1000 I think system performance will be
> MM> effected. Above 10,000 I know it would(it would also be a
> MM> problem for the http server on access). I can understand that
> MM> keeping the attachment from each email in it own directory,
> MM> but this way the "files version control" :-) groups them
> MM> together for access(assuming least regency theory) and make
> MM> cleaning out for space/inodes simple. it was just strftime
> MM> wielded on.
>
> I'm not sure I followed all that, but the current Scrubber.py does add
> the date directory to the path, so I think we're good here.
>
> -Barry