From julian at mehnle.net Wed Aug 1 03:34:39 2007 From: julian at mehnle.net (Julian Mehnle) Date: Wed, 1 Aug 2007 01:34:39 +0000 Subject: [Mailman-Developers] add_members: Support for invite & pre-generated passwords [patch] Message-ID: <200708010134.39830.julian@mehnle.net> Hi all, Here is a patch for the add_members command-line utility that adds support for inviting users and specifying pre-generated passwords: http://sourceforge.net/tracker/index.php?func=detail&aid=1765041&group_id=103&atid=300103 http://files.mehnle.net/software/mailman/mailman-add_members:passwords-input;invite-feature.diff If you specify a "--invite-only" option, the users are invited instead of being subscribed right away. You can also specify pre-generated passwords as the last "word" (of non- whitespace characters) in address list files. For example: John Doe PaSsWoRd jane.doe at example.org pAsSwOrD user.with.autogen.password at example.net Consider the patch as being in the public domain. I'd be happy if these features could make it into one of the next Mailman releases. I was missing them so hard, I actually sat down and started learning Python (I mostly just write Perl otherwise). Julian. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 189 bytes Desc: not available Url : http://mail.python.org/pipermail/mailman-developers/attachments/20070801/4d74e234/attachment.pgp From jeff at jab.org Thu Aug 2 04:17:44 2007 From: jeff at jab.org (Jeff Breidenbach) Date: Wed, 1 Aug 2007 19:17:44 -0700 Subject: [Mailman-Developers] Improving the archives In-Reply-To: <46A84F41.4020003@Newfield.org> References: <8BA8AA8B-2575-4794-AEB5-CF4CFAE99CE6@zone12.com> <874pjumgrg.fsf@uwakimon.sk.tsukuba.ac.jp> <877ioqys3k.fsf@athene.jamux.com> <2009EA3C-9E11-4B2A-BF57-A62C0EB11870@python.org> <46A84F41.4020003@Newfield.org> Message-ID: > What we really want to know is how many (non-empty) Message-ID > collisions are there that *don't* share a Date? This is the number of > messages that only-messageid loses, and that the composite identifier > method would not lose. It took longer than expected, but I now have numbers from looking at 2,151,896 messages spread over a few thousand lists. The appended script was run over a set of MH format raw messages. 704 messages fall into this category. Of these, 596 come from a single (malfunctioning and duplicate spewing) list server. I have not yet examined the remaining 208 messages, but I'll bet anything many also have duplicate message bodies. Or are spam. So for this data set, we have an upper bound of 0.01% messages in this category, possibly significantly less. Jeff #!/bin/bash # # Look for messages that # # Do collide with message-id # Don't collide with message-id + date DIR=/home/archive/Mail C1=0 C2=0 get_ineresting_messages() { cd $DIR/$1 for j in $(ls -U); do MSG_ID=$(cat $j | 822field message-id) MSG_DATE=$(cat $j | 822field date) if [ "$MSG_ID" != "" ]; then echo $MSG_DATE "|" $MSG_ID fi done |\ sort |\ uniq --separator='|' --skip-fields=1 --all-repeated |\ uniq --uniq } for i in $(ls $DIR | grep @); do DUP=$(get_ineresting_messages $i) DUP_CNT=$(echo -n "$DUP" | wc -l) MSG_CNT=$(cd $DIR/$i && ls -U | wc -w) C1=$(( C1 + MSG_CNT )) C2=$(( C2 + DUP_CNT )) if [ $DUP_CNT != 0 ]; then echo echo "=== collisions/messages: $C2/$C1 $i" echo "$DUP" else echo -n . 1>&2 fi done > > -Dale > _______________________________________________ > Mailman-Developers mailing list > Mailman-Developers at python.org > http://mail.python.org/mailman/listinfo/mailman-developers > Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py > Searchable Archives: http://www.mail-archive.com/mailman-developers%40python.org/ > Unsubscribe: http://mail.python.org/mailman/options/mailman-developers/jeff%40jab.org > > Security Policy: http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq01.027.htp > From jeff at jab.org Thu Aug 2 04:20:08 2007 From: jeff at jab.org (Jeff Breidenbach) Date: Wed, 1 Aug 2007 19:20:08 -0700 Subject: [Mailman-Developers] Improving the archives In-Reply-To: References: <8BA8AA8B-2575-4794-AEB5-CF4CFAE99CE6@zone12.com> <874pjumgrg.fsf@uwakimon.sk.tsukuba.ac.jp> <877ioqys3k.fsf@athene.jamux.com> <2009EA3C-9E11-4B2A-BF57-A62C0EB11870@python.org> <46A84F41.4020003@Newfield.org> Message-ID: > 704 messages fall into this category. Of these, 596 come from a > single (malfunctioning and duplicate spewing) list server. I have > not yet examined the remaining 208 messages, but I'll bet anything > many also have duplicate message bodies. Or are spam. So for this > data set, we have an upper bound of 0.01% messages in this > category, possibly significantly less. Correction. ... remaining 108 ... 0.005% ... From lionel at mamane.lu Sat Aug 4 07:30:26 2007 From: lionel at mamane.lu (Lionel Elie Mamane) Date: Sat, 4 Aug 2007 07:30:26 +0200 Subject: [Mailman-Developers] Major updates to Mailman 3.0 branch In-Reply-To: <61705022-9B57-47A5-BC3C-BCE160EBD80A@python.org> References: <61705022-9B57-47A5-BC3C-BCE160EBD80A@python.org> Message-ID: <20070804053026.GA473@capsaicin.mamane.lu> On Sat, Jul 21, 2007 at 04:25:07PM -0400, Barry Warsaw wrote: > I've just merged in my 'setuptools' branch to the official Mailman 3 > branch. > Also gone are the C wrapper programs, so Mailman is now a pure > Python application. I suppose that means that all architecture-dependent code is gone? > All the packages Mailman depends on live in the Cheeseshop (...). > And now that we're a setuptools-based project, when you build > Mailman (see below), these dependent packages will be automatically > downloaded and installed as necessary. With the hat of distribution packager hat on, we will need a way to disable that mechanism. To be sure that our automated builds are self-contained and will _not_ access the network in any way. Is there any way to do that? I didn't find an option for that in "python2.5 setup.py --help". -- Lionel From barry at python.org Sat Aug 4 15:25:46 2007 From: barry at python.org (Barry Warsaw) Date: Sat, 4 Aug 2007 09:25:46 -0400 Subject: [Mailman-Developers] Major updates to Mailman 3.0 branch In-Reply-To: <20070804053026.GA473@capsaicin.mamane.lu> References: <61705022-9B57-47A5-BC3C-BCE160EBD80A@python.org> <20070804053026.GA473@capsaicin.mamane.lu> Message-ID: <222722F4-D52B-4BDB-BBCB-C8556D3186FB@python.org> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On Aug 4, 2007, at 1:30 AM, Lionel Elie Mamane wrote: > On Sat, Jul 21, 2007 at 04:25:07PM -0400, Barry Warsaw wrote: > >> I've just merged in my 'setuptools' branch to the official Mailman 3 >> branch. > >> Also gone are the C wrapper programs, so Mailman is now a pure >> Python application. > > I suppose that means that all architecture-dependent code is gone? Yes. I'm going to try very hard not to bring them back. I think we can do that by using different mechanisms to integration Mailman with mail and web servers, though I haven't worked my way high enough up the stack yet. >> All the packages Mailman depends on live in the Cheeseshop (...). >> And now that we're a setuptools-based project, when you build >> Mailman (see below), these dependent packages will be automatically >> downloaded and installed as necessary. > > With the hat of distribution packager hat on, we will need a way to > disable that mechanism. To be sure that our automated builds are > self-contained and will _not_ access the network in any way. Is there > any way to do that? I didn't find an option for that in "python2.5 > setup.py --help". Yes, but this is a larger issue than just Mailman because just about any decent sized Python application your going to distribute these is going to be built around this type of architecture. Probably the same goes for Ruby and Perl but I don't know those language's packaging systems very well. I'm pretty certain eggs play well enough with various distribution package management systems. I would start here for more information: http://peak.telecommunity.com/DevCenter/setuptools - -Barry -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.7 (Darwin) iQCVAwUBRrR+W3EjvBPtnXfVAQJC1wP+MZ4zrVuFv4OrIC+4tb54s6ZFiOdrqAr9 EenGpf9pWG3Fgu4EnU1OXHK36vz4/SBv1kdoiUyxZ1l+6nE+wFzZ+uf4NekG4uV6 MqdHA3o/lJMfQ6ERpNPKEC9M3G2VzPkhgejAcYpU3ilTQhRbAJciADZ5jzruprZQ ZUnBQx+X9BI= =V/HV -----END PGP SIGNATURE----- From jeff at jab.org Wed Aug 8 06:44:19 2007 From: jeff at jab.org (Jeff Breidenbach) Date: Tue, 7 Aug 2007 21:44:19 -0700 Subject: [Mailman-Developers] Improving the archives In-Reply-To: References: <8BA8AA8B-2575-4794-AEB5-CF4CFAE99CE6@zone12.com> <877ioqys3k.fsf@athene.jamux.com> <2009EA3C-9E11-4B2A-BF57-A62C0EB11870@python.org> <46A84F41.4020003@Newfield.org> Message-ID: > What we really want to know is how many (non-empty) Message-ID > collisions are there that *don't* share a Date? This is the number of > messages that only-messageid loses, and that the composite identifier > method would not lose. I took a look at a larger dataset, 5.85 million messages from several thousand lists. Of the messages that share message-id but not date, most come from a small number of based web services. 875 come from forums.slimdevices.com 378 come from lists.openplans.org 265 come from nabble.com 164 come from egroups.com 135 come from yahoo.com 166 come from elsewhere That's 0.03% if you count all the messages. It is 0.008% if you discard the top three offenders, all of which I have contacted. I didn't try contacting Yahoo/eGroups because in my past experience, talking to a brick wall is easier. I have not analyzed how many of these messages are spam or have duplicate bodies, which further discounts the percentages. Hope this data helps. From Dale at Newfield.org Wed Aug 8 07:04:30 2007 From: Dale at Newfield.org (Dale Newfield) Date: Wed, 08 Aug 2007 01:04:30 -0400 Subject: [Mailman-Developers] Improving the archives In-Reply-To: References: <8BA8AA8B-2575-4794-AEB5-CF4CFAE99CE6@zone12.com> <877ioqys3k.fsf@athene.jamux.com> <2009EA3C-9E11-4B2A-BF57-A62C0EB11870@python.org> <46A84F41.4020003@Newfield.org> Message-ID: <46B94EDE.6070901@Newfield.org> Jeff Breidenbach wrote: > 5.85 million messages > That's 0.03% if you count all the messages. It is 0.008% if you > discard the top three offenders, all of which I have contacted. I'd say that's a strong argument for just using the Message-ID and simplifying this tremendously... ...Barry, do you disagree? (It can still be a base32 encoded SHA hash it to make it less user hostile.) http://wiki.list.org/display/DEV/Stable+URLs -Dale From rick at wzoeterwoude.net Fri Aug 10 02:26:45 2007 From: rick at wzoeterwoude.net (Rick van der Zwet) Date: Fri, 10 Aug 2007 02:26:45 +0200 Subject: [Mailman-Developers] [patch] Spam report functionality Message-ID: <46BBB0C5.1050109@wzoeterwoude.net> Hi All, The patch attached (created against 2.1.9), will add functionality where the moderator is able to report a moderated message as spam. This message will than either be stored on the disk or will be send -to a in the config- definable email address. Together will a small shell script it will be pretty easy to load this message into a spam learning program like sa-learn. This will make the moderators more happy, cause they are able to contribute to the improvement of the spam filter. I do believe this will gradually lower the amount of false negatives, cause having people to teach the spam filter is great :-) Best regards, /Rick -- http://rickvanderzwet.nl -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: mailman-spamreport.patch Url: http://mail.python.org/pipermail/mailman-developers/attachments/20070810/b989cce1/attachment.asc From iane at sussex.ac.uk Fri Aug 10 11:31:01 2007 From: iane at sussex.ac.uk (Ian Eiloart) Date: Fri, 10 Aug 2007 10:31:01 +0100 Subject: [Mailman-Developers] [patch] Spam report functionality In-Reply-To: <46BBB0C5.1050109@wzoeterwoude.net> References: <46BBB0C5.1050109@wzoeterwoude.net> Message-ID: <692C4079D0858858763B5917@lewes.staff.uscs.susx.ac.uk> --On 10 August 2007 02:26:45 +0200 Rick van der Zwet wrote: > Hi All, > > The patch attached (created against 2.1.9), will add functionality > where the moderator is able to report a moderated message as spam. This > message will than either be stored on the disk or will be send -to a in > the config- definable email address. > > Together will a small shell script it will be pretty easy to load this > message into a spam learning program like sa-learn. > > This will make the moderators more happy, cause they are able to > contribute to the improvement of the spam filter. I do believe this will > gradually lower the amount of false negatives, cause having people to > teach the spam filter is great :-) That's nice. Don't forget that you need to show good email as well as spam, otherwise SA will believe everything is spam. > Best regards, > /Rick -- Ian Eiloart IT Services, University of Sussex x3148 From rick at wzoeterwoude.net Fri Aug 10 12:02:03 2007 From: rick at wzoeterwoude.net (Rick van der Zwet) Date: Fri, 10 Aug 2007 12:02:03 +0200 Subject: [Mailman-Developers] [patch] Spam report functionality In-Reply-To: <692C4079D0858858763B5917@lewes.staff.uscs.susx.ac.uk> References: <46BBB0C5.1050109@wzoeterwoude.net> <692C4079D0858858763B5917@lewes.staff.uscs.susx.ac.uk> Message-ID: <46BC379B.1000903@wzoeterwoude.net> On 8/10/07 11:31 AM, Ian Eiloart wrote: > --On 10 August 2007 02:26:45 +0200 Rick van der Zwet [snip: spam patch proposal] >> This will make the moderators more happy, cause they are able to >> contribute to the improvement of the spam filter. I do believe this will >> gradually lower the amount of false negatives, cause having people to >> teach the spam filter is great :-) > > That's nice. Don't forget that you need to show good email as well as spam, > otherwise SA will believe everything is spam. Yes true, feeding SA initially with 1000 good (ham) messages and set autolearn to true will do this. But other implementations of spam filtering are also possible, there are plenty systems around. The patch any provide a way to have moderators report messages identified as spam, it is up the the email administrator what to do with this kind of messages. /Rick -- http://rickvanderzwet.nl From ACrosman at afsc.org Fri Aug 10 17:11:20 2007 From: ACrosman at afsc.org (Aaron Crosman) Date: Fri, 10 Aug 2007 11:11:20 -0400 Subject: [Mailman-Developers] Required template files Message-ID: I've been working that last day or two to help update the templates for the interim step of allowing CSS files on the templates. I've noticed several translations are missing some template files. Is this a known condition that behaves just fine, or is this a problem that needs to be addressed? Also, is there a list in the wiki of all the template files? I'm assuming the English translation is complete and current and I have based my work off that template, but other translations include files missing in the en folder, while more than 1/2 of the other languages are missing files present in en. By my count, of the 41 translations, at least 25 are missing files. Aaron From msapiro at value.net Sun Aug 19 08:50:39 2007 From: msapiro at value.net (Mark Sapiro) Date: Sat, 18 Aug 2007 23:50:39 -0700 Subject: [Mailman-Developers] Required template files In-Reply-To: Message-ID: Aaron Crosman wrote: >I've been working that last day or two to help update the templates for >the interim step of allowing CSS files on the templates. I've noticed >several translations are missing some template files. Is this a known >condition that behaves just fine, or is this a problem that needs to be >addressed? This is a problem with the specific translation, but the result is just that the English template is used instead. >Also, is there a list in the wiki of all the template files? No, but the files in templates/en/ constitute the list. >I'm assuming the English translation is complete and current and I have >based my work off that template, but other translations include files >missing in the en folder, Some obsolete templates (e.g. handle_opts.html) have not been removed from some translations, but any templates not in templates/en/ are not currently used. >while more than 1/2 of the other languages are >missing files present in en. By my count, of the 41 translations, at >least 25 are missing files. So the translations are incomplete. -- Mark Sapiro The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan