Finding messages in huge mboxes

Bastiaan Welmers haasje at welmers.net
Fri Feb 6 06:02:00 EST 2004


Diez B. Roggisch wrote:

>> Anyone who has a better idea?
> 
> AFAIK MUAs usually use a mbox.index-file for faster access. The index is
> computed once, and updated whenever a new message is added. You could
> create this index quite easily yourself by looping over the mbox and
> pickling a list of tell'ed positions. If you also store the creation-date
> of the index and the filesize of the mbox-file, you should be able to
> create a function that will update the index whenever the underlying mbox
> has changed. Another approach would be to perform index-creation on a
> regular basis using cron.
Also a good idea. It's a Mailman archive, so I would
have to hack Mailman to create an index file alongside
the mbox file.
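
For reference, a rough sketch of the index approach described above, done outside Mailman rather than inside it. Everything named here is my own assumption and not part of Mailman or the mailbox module: the function names build_index, load_index and read_message, the ".index" suffix, and the rule that a message starts at a "From " line that follows a blank line (or the start of the file). Staleness is checked against the mbox file's size and modification time, a small variation on storing the index creation date.

```python
import os
import pickle


def build_index(mbox_path):
    """Scan the mbox once and return the byte offset of each message start."""
    offsets = []
    with open(mbox_path, "rb") as f:
        prev_blank = True  # the start of the file counts as "after a blank line"
        while True:
            pos = f.tell()  # position of the line we are about to read
            line = f.readline()
            if not line:
                break
            if prev_blank and line.startswith(b"From "):
                offsets.append(pos)
            prev_blank = line in (b"\n", b"\r\n")
    return offsets


def load_index(mbox_path, index_path=None):
    """Return the offsets, rebuilding the pickled index if the mbox changed."""
    index_path = index_path or mbox_path + ".index"  # hypothetical naming
    st = os.stat(mbox_path)
    stamp = (st.st_size, st.st_mtime)
    try:
        with open(index_path, "rb") as f:
            data = pickle.load(f)
        if data["stamp"] == stamp:
            return data["offsets"]
    except (OSError, EOFError, KeyError, TypeError, pickle.UnpicklingError):
        pass  # missing or unreadable index: fall through and rebuild
    offsets = build_index(mbox_path)
    with open(index_path, "wb") as f:
        pickle.dump({"stamp": stamp, "offsets": offsets}, f)
    return offsets


def read_message(mbox_path, offsets, n):
    """Return the raw bytes of message number n (0-based) using the index."""
    start = offsets[n]
    if n + 1 < len(offsets):
        end = offsets[n + 1]
    else:
        end = os.path.getsize(mbox_path)
    with open(mbox_path, "rb") as f:
        f.seek(start)
        return f.read(end - start)
```

With this, load_index("archive.mbox") only rescans the file when its size or mtime has changed, and read_message seeks straight to the wanted message. It rebuilds the whole index on any change; appending only the new offsets would be faster for an archive that just grows.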

/Bastiaan


