[Mailman-Users] Kernel update breaks Mailman!!

Mark Sapiro mark at msapiro.net
Fri Feb 21 14:56:09 CET 2014


On 02/20/2014 02:04 PM, Lindsay Haisley wrote:
> 
> Here's a sampling of the qrunner log from the wee hours, before I
> started poking at the problem to try to fix it:
> 
> Feb 20 03:22:02 2014 (2447) IncomingRunner qrunner caught SIGINT.  Stopping.
> Feb 20 03:22:02 2014 (2447) IncomingRunner qrunner exiting.
> Feb 20 03:22:02 2014 (2445) BounceRunner qrunner caught SIGINT.  Stopping.
> Feb 20 03:22:02 2014 (2445) BounceRunner qrunner exiting.
> Feb 20 03:22:02 2014 (2446) CommandRunner qrunner caught SIGINT.  Stopping.
> Feb 20 03:22:02 2014 (2446) CommandRunner qrunner exiting.
> Feb 20 03:22:02 2014 (2451) RetryRunner qrunner caught SIGINT.  Stopping.
> Feb 20 03:22:02 2014 (2443) Master watcher caught SIGINT.  Restarting.
> Feb 20 03:22:02 2014 (2444) ArchRunner qrunner caught SIGINT.  Stopping.
> Feb 20 03:22:02 2014 (2444) ArchRunner qrunner exiting.
> Feb 20 03:22:02 2014 (2448) NewsRunner qrunner caught SIGINT.  Stopping.
> Feb 20 03:22:02 2014 (2450) VirginRunner qrunner caught SIGINT.  Stopping.
> Feb 20 03:22:02 2014 (2449) OutgoingRunner qrunner caught SIGINT.  Stopping.
> Feb 20 03:22:02 2014 (2451) RetryRunner qrunner exiting.
> Feb 20 03:22:02 2014 (2448) NewsRunner qrunner exiting.
> Feb 20 03:22:02 2014 (2450) VirginRunner qrunner exiting.
> Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit
> (pid: 2445, sig: None, sts: 2, class: BounceRunner, slice: 1/1) [restarting]
> Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit
> (pid: 2446, sig: None, sts: 2, class: CommandRunner, slice: 1/1) [restarting]
> Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit
> (pid: 2451, sig: None, sts: 2, class: RetryRunner, slice: 1/1) [restarting]
> Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit
> (pid: 2448, sig: None, sts: 2, class: NewsRunner, slice: 1/1) [restarting]
> Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit
> (pid: 2444, sig: None, sts: 2, class: ArchRunner, slice: 1/1) [restarting]
> Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit
> (pid: 2447, sig: None, sts: 2, class: IncomingRunner, slice: 1/1) [restarting]


OK, from what you report here and elsewhere, it appears the issue was
with OutgoingRunner not processing Mailman's 'out' queue. If the above
log excerpt (which appears to be from a mailmanctl restart) is complete,
you will note that there are three entries for most runners, e.g.

> Feb 20 03:22:02 2014 (2447) IncomingRunner qrunner caught SIGINT.  Stopping.
> Feb 20 03:22:02 2014 (2447) IncomingRunner qrunner exiting.
> Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit
> (pid: 2447, sig: None, sts: 2, class: IncomingRunner, slice: 1/1) [restarting]

But there is only one for OutgoingRunner:

> Feb 20 03:22:02 2014 (2449) OutgoingRunner qrunner caught SIGINT.  Stopping.

suggesting that it was hung and never terminated.
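
The comparison above can be done mechanically. The following is only a
rough sketch, not anything shipped with Mailman: the function name and the
regular expression are my own, and it assumes the log lines look like the
ones quoted in this thread and that the text covers a single restart (so
PIDs are not reused). It reports every runner that logged "caught SIGINT"
without a matching "exiting" line.

```python
import re

# Assumed line shape, taken from the excerpt quoted in this thread:
# "Feb 20 03:22:02 2014 (2447) IncomingRunner qrunner caught SIGINT.  Stopping."
# "Master" lines do not match because the name must end in "Runner".
RUNNER_LINE = re.compile(r"\((\d+)\) (\w+Runner) qrunner (caught SIGINT|exiting)")


def runners_that_never_exited(log_text):
    """Return {pid: runner} for runners that caught SIGINT but never logged 'exiting'."""
    caught = {}
    exited = set()
    for line in log_text.splitlines():
        match = RUNNER_LINE.search(line)
        if not match:
            continue
        pid, name, event = int(match.group(1)), match.group(2), match.group(3)
        if event == "caught SIGINT":
            caught[pid] = name
        else:
            exited.add(pid)
    return {pid: name for pid, name in caught.items() if pid not in exited}


sample = """\
Feb 20 03:22:02 2014 (2447) IncomingRunner qrunner caught SIGINT.  Stopping.
Feb 20 03:22:02 2014 (2447) IncomingRunner qrunner exiting.
Feb 20 03:22:02 2014 (2449) OutgoingRunner qrunner caught SIGINT.  Stopping.
"""
print(runners_that_never_exited(sample))  # {2449: 'OutgoingRunner'}
```

A runner showing up in the result is only a hint that it may have hung;
the log could also simply be truncated.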

Had it been me at that point, I would have stopped Mailman and made sure
it was completely stopped per the FAQ at <http://wiki.list.org/x/_4A9>,
and then started it to see if that fixed the problem. If the out queue
were still not being processed, I would try to trace the OutgoingRunner
process to see where it was hung.
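
As a rough illustration of the "make sure it is completely stopped" step,
here is a sketch that looks for leftover Mailman processes. It is my own
helper, not the FAQ's procedure, and it assumes a ps that accepts
"-eo pid=,args=" and that Mailman's processes have "qrunner" or
"mailmanctl" somewhere in their command line. The path in the sample is
purely illustrative.

```python
import subprocess


def find_mailman_processes(ps_output):
    """Return (pid, command line) pairs whose command mentions qrunner or mailmanctl."""
    found = []
    for line in ps_output.splitlines():
        parts = line.split(None, 1)
        if len(parts) != 2 or not parts[0].isdigit():
            continue  # skip blank or malformed lines
        pid, args = parts
        if "qrunner" in args or "mailmanctl" in args:
            found.append((int(pid), args))
    return found


def current_ps_output():
    """Run ps and return its output; assumes 'ps -eo pid=,args=' is supported."""
    result = subprocess.run(
        ["ps", "-eo", "pid=,args="],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


# Illustrative ps output; the install path is an assumption.
sample_ps = (
    " 2449 /usr/bin/python /usr/lib/mailman/bin/qrunner --runner=OutgoingRunner:0:1 -s\n"
    "  101 /usr/sbin/sshd\n"
)
leftovers = find_mailman_processes(sample_ps)
```

On a live system, find_mailman_processes(current_ps_output()) should come
back empty after Mailman has been stopped. Any PID it does return is a
candidate for tracing, for example with strace -p on Linux.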

-- 
Mark Sapiro <mark at msapiro.net>        The highway is for gamblers,
San Francisco Bay Area, California    better use your sense - B. Dylan

