[Mailman-Users] Harvesting of email addresses for spam fromarchives

Mon Sep 8 21:33:36 CEST 2008

I've just checked myself and the HTML source still seems to allow robots:
<META NAME="robots" CONTENT="index,nofollow"> on each message and <META
NAME="robots" CONTENT="noindex,follow"> on the index page.  I would want
noindex and nofollow on both pages.  

Changing to private archives doesn't seem to make any difference to that,
does it only apply to new archiving?  The help is a bit vague here, does
public mean the data is prepared for public posting  (emails obscufacted)
and private mean they are not, or does private mean they are not put on the
web? i.e. which of private and public is actually the most secure?

Also search engines still seem to be able to see the data e.g. type 
"neoprene site:lists.shire.net/pipermail/dbamain/ " into Google, maybe this
will go in a few days?

David

> -----Original Message-----
> From: mailman-users-bounces+david=johmar.com at python.org 
> [mailto:mailman-users-bounces+david=johmar.com at python.org] On 
> Behalf Of David Beaumont
> Sent: 08 September 2008 20:19
> To: mailman-users at python.org
> Subject: Re: [Mailman-Users] Harvesting of email addresses 
> for spam fromarchives
> 
> Thanks is this still the case at  
> http://lists.shire.net/pipermail/dbamain/
> ?  We have just put a password on so I am hoping that will 
> stop robots too.
> 
> David 
> 
> > -----Original Message-----
> > From: Paul [mailto:opensource at unixoses.com] 
> > Sent: 08 September 2008 20:00
> > To: David Beaumont
> > Cc: mailman-users at python.org
> > Subject: Re: [Mailman-Users] Harvesting of email addresses 
> > for spam from archives
> > 
> > It helps to disallow but the site is allowing.  So possible 
> > some engines
> > will bot the whole site:
> > 
> > http://www.mail-archive.com/robots.txt
> >