[Mailman-Users] (no subject)
Mark Symonds
mark at symonds.net
Sat Jun 21 12:29:29 CEST 2003
Hello,
I had Mailman working wonderfully here until three days ago,
when a larger list (just over 2500 users) stopped functioning
and I noticed the load average was through the roof (it is
usually very low):
[root@tx exim]# w
1:24pm up 387 days, 2:26, 1 user, load average: 4.21, 4.29, 4.24
USER TTY FROM LOGIN@ IDLE JCPU PCPU WHAT
mark pts/0 ca1.symonds.net 1:19pm 0.00s 0.28s 0.01s w
[root@tx exim]# ps auxw |grep python
mailman 6751 23.9 2.4 66712 12372 ? R Jun19 534:48 /usr/bin/python -S /var/mailman/cron/qrunner
mailman 21895 24.8 2.6 62840 13608 ? R Jun19 404:09 /usr/bin/python -S /var/mailman/cron/qrunner
mailman 3114 25.1 9.2 61756 46716 ? R Jun19 258:03 /usr/bin/python -S /var/mailman/cron/qrunner
mailman 16755 24.5 11.2 61592 56844 ? R 06:18 104:49 /usr/bin/python -S /var/mailman/cron/qrunner
root 25531 0.0 0.1 1716 596 pts/0 S 13:24 0:00 grep python
[root@tx exim]#
I read FAQ 4.19, killed the qrunner processes, and removed the
lock files. Still no joy:
[root@tx Mailman]# w
4:44am up 387 days, 17:46, 1 user, load average: 1.08, 1.07, 1.26
USER TTY FROM LOGIN@ IDLE JCPU PCPU WHAT
mark pts/1 ca1.symonds.net 4:37am 0.00s 0.24s 0.02s w
[root@tx Mailman]# ps auxw |grep python
root 11804 96.9 11.3 59256 57492 ? R 03:59 43:54 python ./qrunner
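For anyone repeating the lock file cleanup mentioned above, here is a minimal sketch of a check for leftover lock files. The directory /var/mailman/locks and the one-hour threshold are assumptions for illustration, not values taken from this install, and stale_locks is a hypothetical helper, not part of Mailman.

```python
import os
import time


def stale_locks(lock_dir, max_age_seconds, now=None):
    """Return (name, age_in_seconds) for regular files in lock_dir
    whose modification time is older than max_age_seconds."""
    now = time.time() if now is None else now
    stale = []
    for name in sorted(os.listdir(lock_dir)):
        path = os.path.join(lock_dir, name)
        if not os.path.isfile(path):
            continue  # skip subdirectories and other non-files
        age = now - os.path.getmtime(path)
        if age > max_age_seconds:
            stale.append((name, age))
    return stale


if __name__ == "__main__":
    # Assumed location; adjust to wherever the local install keeps its locks.
    lock_dir = "/var/mailman/locks"
    if os.path.isdir(lock_dir):
        for name, age in stale_locks(lock_dir, 3600):
            print("%s: %.0f s old" % (name, age))
```

This only lists candidates; whether a given file is safe to delete still depends on whether the process that created it is alive.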
In the logs:
[root@tx mailman]# tail qrunner
Jun 21 06:08:03 2003 (15197) Could not acquire qrunner lock
Jun 21 06:09:02 2003 (15209) Could not acquire qrunner lock
Jun 21 06:10:01 2003 (15217) Could not acquire qrunner lock
Jun 21 06:11:03 2003 (15240) Could not acquire qrunner lock
Jun 21 06:12:02 2003 (15252) Could not acquire qrunner lock
Jun 21 06:13:02 2003 (15265) Could not acquire qrunner lock
Jun 21 06:14:02 2003 (15274) Could not acquire qrunner lock
Jun 21 06:15:02 2003 (15294) Could not acquire qrunner lock
Jun 21 06:16:02 2003 (15307) Could not acquire qrunner lock
Jun 21 06:17:01 2003 (15319) Could not acquire qrunner lock
[...ad infinitum...]
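To see how regularly these failures occur, the log excerpt above can be summarized with a short script. This is only a sketch: the line format is inferred from the ten entries shown, and summarize_lock_failures is a hypothetical helper, not part of Mailman.

```python
import re
from datetime import datetime

# Matches lines such as:
#   Jun 21 06:08:03 2003 (15197) Could not acquire qrunner lock
LINE_RE = re.compile(r"^(\w{3} +\d+ \d\d:\d\d:\d\d \d{4}) \((\d+)\) (.*)$")


def summarize_lock_failures(lines):
    """Return (count, first_timestamp, last_timestamp) for the
    'Could not acquire qrunner lock' entries found in lines."""
    stamps = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m and m.group(3) == "Could not acquire qrunner lock":
            stamps.append(datetime.strptime(m.group(1), "%b %d %H:%M:%S %Y"))
    if not stamps:
        return 0, None, None
    return len(stamps), min(stamps), max(stamps)


if __name__ == "__main__":
    sample = [
        "Jun 21 06:08:03 2003 (15197) Could not acquire qrunner lock",
        "Jun 21 06:09:02 2003 (15209) Could not acquire qrunner lock",
    ]
    count, first, last = summarize_lock_failures(sample)
    print(count, first, last)
```

In the excerpt the entries are roughly one minute apart, each with a new PID, which is consistent with a periodically launched qrunner giving up because another process still holds the lock.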
Also during this time:
* When subscribing via the mailing list web page, clicking
the "subscribe" button hangs at the page until the browser
times out.
* The same thing happens when trying to authenticate via the admin web page.
* This is the only broken list on that machine. The rest
are functioning normally in all respects.
* I thought perhaps someone had changed the domain preferences
for the list, but I checked them with config_list and they are
correct. I also ran check_db and check_perms; both report all's OK.
Exim 4, RedHat 7.0, Mailman 2.0.13.
Ideas?
Sincerely,
--
Mark Symonds
mark at symonds.net