[Mailman-Users] (no subject)

Mark Symonds mark at symonds.net
Sat Jun 21 12:29:29 CEST 2003


Hello, 

Had Mailman working wonderfully here until three days ago, 
when a larger list (just over 2500 users) stopped functioning 
and I noticed the load average was stuck above 4 (usually 
it's very low): 

[root@tx exim]# w
  1:24pm  up 387 days,  2:26,  1 user,  load average: 4.21, 4.29, 4.24
USER     TTY      FROM              LOGIN@   IDLE   JCPU   PCPU  WHAT
mark     pts/0    ca1.symonds.net   1:19pm  0.00s  0.28s  0.01s  w
[root@tx exim]# ps auxw |grep python
mailman   6751 23.9  2.4 66712 12372 ?       R    Jun19 534:48 /usr/bin/python -S /var/mailman/cron/qrunner
mailman  21895 24.8  2.6 62840 13608 ?       R    Jun19 404:09 /usr/bin/python -S /var/mailman/cron/qrunner
mailman   3114 25.1  9.2 61756 46716 ?       R    Jun19 258:03 /usr/bin/python -S /var/mailman/cron/qrunner
mailman  16755 24.5 11.2 61592 56844 ?       R    06:18 104:49 /usr/bin/python -S /var/mailman/cron/qrunner
root     25531  0.0  0.1  1716  596 pts/0    S    13:24   0:00 grep python
[root@tx exim]#

I read FAQ 4.19, killed the qrunner processes, and removed the 
lockfiles.  Still no joy: 

[root@tx Mailman]# w
  4:44am  up 387 days, 17:46,  1 user,  load average: 1.08, 1.07, 1.26
USER     TTY      FROM              LOGIN@   IDLE   JCPU   PCPU  WHAT
mark     pts/1    ca1.symonds.net   4:37am  0.00s  0.24s  0.02s  w
[root@tx Mailman]# ps auxw |grep python
root     11804 96.9 11.3 59256 57492 ?       R    03:59  43:54 python ./qrunner

In the logs:

[root@tx mailman]# tail qrunner
Jun 21 06:08:03 2003 (15197) Could not acquire qrunner lock
Jun 21 06:09:02 2003 (15209) Could not acquire qrunner lock
Jun 21 06:10:01 2003 (15217) Could not acquire qrunner lock
Jun 21 06:11:03 2003 (15240) Could not acquire qrunner lock
Jun 21 06:12:02 2003 (15252) Could not acquire qrunner lock
Jun 21 06:13:02 2003 (15265) Could not acquire qrunner lock
Jun 21 06:14:02 2003 (15274) Could not acquire qrunner lock
Jun 21 06:15:02 2003 (15294) Could not acquire qrunner lock
Jun 21 06:16:02 2003 (15307) Could not acquire qrunner lock
Jun 21 06:17:01 2003 (15319) Could not acquire qrunner lock
[...ad infinitum...]
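
To see how long this has been going on without reading the whole log, something like the following could summarize the failures. This is only a minimal sketch: it assumes every relevant line has exactly the shape shown in the excerpt above, and the function name summarize_lock_failures is made up for illustration (it is not part of Mailman).

```python
import re
from collections import Counter

# Matches lines shaped like the excerpt above, for example:
#   Jun 21 06:08:03 2003 (15197) Could not acquire qrunner lock
# The line format is assumed from that excerpt only.
LOCK_FAILURE = re.compile(
    r"^(?P<stamp>\w{3} +\d{1,2} \d{2}:\d{2}):\d{2} \d{4} "
    r"\((?P<pid>\d+)\) Could not acquire qrunner lock\s*$"
)


def summarize_lock_failures(lines):
    """Return (total, first_stamp, last_stamp, per_hour) for lock failures.

    Stamps are "Mon DD HH:MM" strings; per_hour is a Counter keyed by
    "Mon DD HH". Lines that do not match the pattern are ignored.
    """
    per_hour = Counter()
    first = last = None
    total = 0
    for line in lines:
        match = LOCK_FAILURE.match(line)
        if match is None:
            continue
        total += 1
        stamp = match.group("stamp")
        if first is None:
            first = stamp
        last = stamp
        per_hour[stamp[:-3]] += 1  # drop ":MM" to bucket by hour
    return total, first, last, per_hour


if __name__ == "__main__":
    # Two lines copied from the excerpt above, plus one unrelated line.
    sample = [
        "Jun 21 06:08:03 2003 (15197) Could not acquire qrunner lock",
        "Jun 21 06:09:02 2003 (15209) Could not acquire qrunner lock",
        "some other log line",
    ]
    total, first, last, per_hour = summarize_lock_failures(sample)
    print(total, first, last, dict(per_hour))
```

To run it against the real log, pass an open file object for the qrunner log instead of the sample list. A steady count of about one failure per minute would suggest that each cron-started qrunner finds the lock still held.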

Also during this time: 

* When trying to subscribe via the mailing list web page, 
  clicking the "subscribe" button makes the page hang 
  until the browser times out. 

* The same happens when trying to authenticate via the admin web page. 

* This is the only broken list on that machine.  The rest
  are functioning normally in all respects. 

* I wondered whether someone had changed the domain preferences
  for the list, but config_list shows they are correct.
  I also ran check_db and check_perms; both report that all is OK. 

Environment: Exim 4, Red Hat 7.0, Mailman 2.0.13.

Ideas?

Sincerely, 

-- 
Mark Symonds
mark at symonds.net