[Mailman-Developers] Retrieving individual messages from raw Mailman mboxes via http

Brad Knowles brad.knowles at skynet.be
Thu Oct 9 15:57:50 EDT 2003


At 5:10 PM +0800 2003/10/05, Andy Sy wrote:

>  I. Which mbox index / mail summary file format to use?

	Import into Berkeley DB hash tables.  They are fast, easy to use, 
robust, and well supported by many languages.  The data can easily be 
extracted, and the tables can easily be rebuilt if necessary.

	Failing that, use a mailbox-directory format.
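
	As a rough illustration of the hash-table idea, here is a minimal 
sketch in Python.  It is not Mailman code.  It assumes the standard 
library's dbm module as a stand-in for a Berkeley DB hash table, that 
messages are looked up by Message-ID, and that each entry stores the 
byte offset and length of the message within the mbox.  The function 
names build_index and fetch_message are hypothetical.

```python
import dbm


def build_index(mbox_path, index_path):
    """Scan the mbox once and map each Message-ID to b'offset:length'."""
    with open(mbox_path, "rb") as mbox, dbm.open(index_path, "n") as index:
        start = None       # byte offset of the current message's "From " line
        msg_id = None      # Message-ID of the current message, once seen
        in_headers = False
        prev_blank = True  # a separator follows a blank line or starts the file
        offset = 0
        for line in mbox:
            if prev_blank and line.startswith(b"From "):
                # A new message begins, so record the one just finished.
                if start is not None and msg_id is not None:
                    index[msg_id] = b"%d:%d" % (start, offset - start)
                start, msg_id, in_headers = offset, None, True
            elif in_headers:
                if line in (b"\n", b"\r\n"):
                    in_headers = False  # blank line ends the header block
                elif line.lower().startswith(b"message-id:"):
                    msg_id = line.split(b":", 1)[1].strip()
            prev_blank = line in (b"\n", b"\r\n")
            offset += len(line)
        # Record the last message in the file.
        if start is not None and msg_id is not None:
            index[msg_id] = b"%d:%d" % (start, offset - start)


def fetch_message(mbox_path, index_path, msg_id):
    """Return the raw bytes of one message without rescanning the mbox."""
    with dbm.open(index_path, "r") as index:
        start, length = map(int, index[msg_id].split(b":"))
    with open(mbox_path, "rb") as mbox:
        mbox.seek(start)
        return mbox.read(length)
```

	Because the index holds only offsets, it can be thrown away and 
rebuilt from the mbox with a single pass of build_index.  Messages 
without a Message-ID header are skipped in this sketch; a real index 
would need a fallback key for them.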

>  II. index / mail summary file performance and maintenance
>
>  Mozilla .msf files can be regenerated on the fly but
>  for a 100MB mailbox (Python-list's is 600MB+!), it already takes
>  fairly long (a few minutes).  Assuming index file corruption is
>  very rare, then this should not be a real problem.

	I would be willing to bet that Berkeley DB files could be 
regenerated even faster -- much faster.

-- 
Brad Knowles, <brad.knowles at skynet.be>

"They that can give up essential liberty to obtain a little temporary
safety deserve neither liberty nor safety."
     -Benjamin Franklin, Historical Review of Pennsylvania.

GCS/IT d+(-) s:+(++)>: a C++(+++)$ UMBSHI++++$ P+>++ L+ !E-(---) W+++(--) N+
!w--- O- M++ V PS++(+++) PE- Y+(++) PGP>+++ t+(+++) 5++(+++) X++(+++) R+(+++)
tv+(+++) b+(++++) DI+(++++) D+(++) G+(++++) e++>++++ h--- r---(+++)* z(+++)


