[Mailman-Users] importing large (1GB) mbox file, hitting a wall here..

Andrew White, PhD awhite at pdbti.org
Wed Dec 27 23:08:17 EST 2017


   First post- thanks much for your archives, been getting myself up to speed
   today and I hope to be a member of the community who gives back as well.

   I took over a large mailing list with a 12 year archive, which is about a
   1 GB .mbox file (about 35,000 messages). I need to upload it to a new
   Mailman install on a new server. I have a background in visual basic and
   command line SPSS, and have managed Mailman lists before, but I'm a little
   new on this part. Here's where I'm at- let me know where I'm off.

   For reference, the name of the list on the machine is dbt-l_pdbti.org.

   Per the FAQ (
   [1]https://wiki.list.org/DOC/How%20do%20I%20import%20an%20archive%20into%20a%20new%20mailing%20list%3F
   ), I uploaded the old .mbox into the correct folder (in this case
   archives/private/dbt-l_pdbti.org.mbox/).  This is a brand new list
   install, with no posts. I then ran bin/arch --wipe dbt-l_pdbti.org. When I
   checked the archives, only about 11,000 messages were imported. I saw in
   the arch help file there can be memory issues, and so to run things in
   chunks. So, I did this:

   bin/arch ---wipe q -s 0 e 10000 dbt-l_pdbti.org
   bin/arch -q -s 10001 e 20000 dbt-l_pdbti.org
   bin/arch  q -s 20001 e 30000 dbt-l_pdbti.org
   bin/arch  q -s 30001 e 40000 dbt-l_pdbti.org

   So when I do this, each piece works, but each piece overwrites the
   previous- in other words, rather than each chunk adding into the archives,
   only the most recent command seems to affect the archives. At the end of
   these commands, only messages 30,000 to 35,000 are showing up in the
   archives.

   I'm sure there is something I'm doing wrong here, but I'm feeling pretty
   stuck- is there something I'm leaving out?

   Appreciate the help-

   ........................................................................
   Andrew White, PhD
   Associate Director
   DBT-Linehan Board of Certification, Certified DBT Clinician*
   Licensed Clinical Psychologist
   Portland DBT Institute
   (503) 290.3281 (phone)
   (503) 231.8153 (fax)

   Please be aware that e-mail communication can be intercepted in
   transmission or misdirected. This e-mail message and any documents
   attached to it are confidential and may contain information that is
   protected from disclosure by various federal and state laws, including the
   HIPAA privacy rule (45 C.F.R., Part 164). This information is intended to
   be used solely by the entity or individual to whom this message is
   addressed. If you are not the intended recipient, be advised that any use,
   dissemination, forwarding, printing, or copying of this message without
   the sender's written permission is strictly prohibited and may be
   unlawful. Accordingly, if you have received this message in error, please
   notify the sender immediately with a copy to hipaa(at)pdbti.org and
   destroy this message. Please do not include personal identifying
   information such as your birth date, or personal medical information in
   any emails you send to us. No one can diagnose your condition from email
   or other written communications and is not a reliable mechanism for
   emergency communication.

References

   Visible links
   1. https://wiki.list.org/DOC/How%20do%20I%20import%20an%20archive%20into%20a%20new%20mailing%20list?


More information about the Mailman-Users mailing list