[Mailman-Users] is driving me crazy

ArteryPlanet.Net :: Manuel Kissoyan manuel at arteryplanet.net
Mon Jan 23 01:09:12 CET 2006


Thanks Mark,

Here are answers to some of your questions. I already contacted cPanel; 
they reinstalled Mailman on the other server before we moved this account, 
and that didn't help. On this server we also deleted the list and recreated 
it from scratch.

I remember seeing qrunners in the server's process list while the mailing 
list was down, but when I restart Mailman it looks as if all of the lists 
had been down, because it starts sending the queued mail for every list.
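
One way to check whether more than one qrunner is working the same queue 
is to group the running qrunner processes by their runner slice. The 
sketch below is only an illustration. It assumes each qrunner was started 
with a --runner=Class:slice:count argument and that the listing comes from 
"ps -eo pid,args"; the function name find_duplicate_runners is made up for 
this example. It targets a current Python 3, not the Python 2.2 on this 
server.

```python
import re
from collections import defaultdict

# Assumed listing format: PID first, then the full command line, which is
# assumed to contain an argument such as --runner=VirginRunner:0:1.
RUNNER_RE = re.compile(r"^\s*(\d+)\s.*--runner=(\w+):(\d+):(\d+)")


def find_duplicate_runners(ps_output):
    """Return {runner_slice: [pids]} for slices owned by more than one process."""
    pids_by_slice = defaultdict(list)
    for line in ps_output.splitlines():
        match = RUNNER_RE.match(line)
        if match:
            pid, name, slice_no, slice_count = match.groups()
            key = "%s:%s:%s" % (name, slice_no, slice_count)
            pids_by_slice[key].append(int(pid))
    # Only slices claimed by two or more processes are suspicious.
    return {key: pids for key, pids in pids_by_slice.items() if len(pids) > 1}
```

To try it, pass in the text produced by 
subprocess.run(["ps", "-eo", "pid,args"], capture_output=True, text=True).stdout. 
An empty result means no runner slice is claimed by two processes, at 
least under the assumptions above.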

The qrunner log entries from the hour when the list went down are:

Jan 18 17:29:19 2006 (23105) VirginRunner qrunner started.
Jan 18 18:15:51 2006 (10568) Master qrunner detected subprocess exit
(pid: 23055, sig: None, sts: 1, class: VirginRunner, slice: 1/1) 
[restarting]
Jan 18 18:15:51 2006 (11632) VirginRunner qrunner started.
Jan 18 18:16:03 2006 (480) Master qrunner detected subprocess exit
(pid: 23105, sig: None, sts: 1, class: VirginRunner, slice: 1/1) 
[restarting]
Jan 18 18:16:03 2006 (480) Qrunner VirginRunner reached maximum restart 
limit of 10, not restarting.


These are currently the last lines in the error log:

Jan 22 23:51:22 2006 qrunner(1449): Traceback (most recent call last):
Jan 22 23:51:22 2006 qrunner(1449):   File 
"/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 270, in ?
Jan 22 23:51:22 2006 qrunner(1449):      main()
Jan 22 23:51:22 2006 qrunner(1449):   File 
"/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 230, in main
Jan 22 23:51:22 2006 qrunner(1449):      qrunner.run()
Jan 22 23:51:22 2006 qrunner(1449):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 70, in 
run
Jan 22 23:51:22 2006 qrunner(1449):      filecnt = self._oneloop()
Jan 22 23:51:22 2006 qrunner(1449):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 99, in 
_oneloop
Jan 22 23:51:22 2006 qrunner(1449):      msg, msgdata = 
self._switchboard.dequeue(filebase)
Jan 22 23:51:22 2006 qrunner(1449):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Switchboard.py", line 144, 
in dequeue
Jan 22 23:51:22 2006 qrunner(1449):      os.unlink(filename)
Jan 22 23:51:22 2006 qrunner(1449): OSError :  [Errno 2] No such file or 
directory: 
'/usr/local/cpanel/3rdparty/mailman/qfiles/in/1137973882.55214+4702c7f0c4fdea7d0473729ec90428cec740947e.pck'
Jan 22 23:51:23 2006 qrunner(20729): Traceback (most recent call last):
Jan 22 23:51:23 2006 qrunner(20729):   File 
"/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 270, in ?
Jan 22 23:51:23 2006 qrunner(20729):      main()
Jan 22 23:51:23 2006 qrunner(20713): Traceback (most recent call last):
Jan 22 23:51:23 2006 qrunner(20729):   File 
"/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 230, in main
Jan 22 23:51:23 2006 qrunner(20713):   File 
"/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 270, in ?
Jan 22 23:51:23 2006 qrunner(20729):      qrunner.run()
Jan 22 23:51:23 2006 qrunner(20729):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 70, in 
run
Jan 22 23:51:23 2006 qrunner(20713):      main()
Jan 22 23:51:23 2006 qrunner(20729):      filecnt = self._oneloop()
Jan 22 23:51:23 2006 qrunner(20713):   File 
"/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 230, in main
Jan 22 23:51:23 2006 qrunner(20729):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 99, in 
_oneloop
Jan 22 23:51:23 2006 qrunner(20713):      qrunner.run()
Jan 22 23:51:23 2006 qrunner(20729):      msg, msgdata = 
self._switchboard.dequeue(filebase)
Jan 22 23:51:23 2006 qrunner(20713):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 70, in 
run
Jan 22 23:51:23 2006 qrunner(20729):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Switchboard.py", line 144, 
in dequeue
Jan 22 23:51:23 2006 qrunner(20713):      filecnt = self._oneloop()
Jan 22 23:51:23 2006 qrunner(20729):      os.unlink(filename)
Jan 22 23:51:23 2006 qrunner(20713):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 99, in 
_oneloop
Jan 22 23:51:23 2006 qrunner(20713):      msg, msgdata = 
self._switchboard.dequeue(filebase)
Jan 22 23:51:23 2006 qrunner(20729): OSError :  [Errno 2] No such file or 
directory: 
'/usr/local/cpanel/3rdparty/mailman/qfiles/out/1137973882.55214+536fcf4d659766a32e9b94e92bfde66798394acb.pck'
Jan 22 23:51:23 2006 qrunner(20713):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Switchboard.py", line 143, 
in dequeue
Jan 22 23:51:23 2006 qrunner(20713):      fp = open(filename)
Jan 22 23:51:23 2006 qrunner(20713): IOError :  [Errno 2] No such file or 
directory: 
'/usr/local/cpanel/3rdparty/mailman/qfiles/archive/1137973882.55214+331cf1e3f1102872474d59b9f53a1fb197f0316f.pck'
Jan 22 23:52:41 2006 qrunner(21835): Traceback (most recent call last):
Jan 22 23:52:41 2006 qrunner(21835):   File 
"/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 270, in ?
Jan 22 23:52:41 2006 qrunner(21835):      main()
Jan 22 23:52:41 2006 qrunner(21835):   File 
"/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 230, in main
Jan 22 23:52:41 2006 qrunner(21835):      qrunner.run()
Jan 22 23:52:41 2006 qrunner(21835):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 70, in 
run
Jan 22 23:52:41 2006 qrunner(21835):      filecnt = self._oneloop()
Jan 22 23:52:41 2006 qrunner(21835):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 99, in 
_oneloop
Jan 22 23:52:41 2006 qrunner(21835):      msg, msgdata = 
self._switchboard.dequeue(filebase)
Jan 22 23:52:41 2006 qrunner(21835):   File 
"/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Switchboard.py", line 143, 
in dequeue
Jan 22 23:52:41 2006 qrunner(21835):      fp = open(filename)
Jan 22 23:52:41 2006 qrunner(21835): IOError :  [Errno 2] No such file or 
directory: 
'/usr/local/cpanel/3rdparty/mailman/qfiles/bounces/1137973961.4684539+a97050f66bcd30b95df8d86fb97378a830687d6f.pck'


About "Where are the posts going, i.e. which qfiles/* directories have 
entries": could you please clarify? The following are the directories in 
qfiles/:

drwxrwsr-x   11 mailman  mailman      4096 May 28  2004 ./
drwxrwsr-x   22 mailman  mailman      4096 Jul 20  2005 ../
drwxrws---    2 mailman  mailman      4096 Jan 22 23:51 archive/
drwxrws---    2 mailman  mailman      4096 Jan 22 23:52 bounces/
drwxrws---    2 mailman  mailman      4096 Jan 19 01:41 commands/
drwxrws---    2 mailman  mailman      8192 Jan 22 23:51 in/
drwxrws---    2 mailman  mailman      4096 May 28  2004 news/
drwxrws---    2 mailman  mailman     53248 Jan 22 23:51 out/
drwxrws---    2 mailman  mailman      4096 Jan 22 22:37 retry/
drwxrws---    2 mailman  mailman      8192 Dec 17 04:39 shunt/
drwxrws---    2 mailman  mailman     36864 Jan 22 23:44 virgin/
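
The listing above shows the queue directories but not how many entries 
each one currently holds. A minimal sketch for counting them follows. It 
assumes that queue entries are the files ending in .pck (as in the error 
log paths above); the function name count_queue_entries is only for 
illustration, and it is written for a current Python 3.

```python
import os


def count_queue_entries(qfiles_dir):
    """Map each queue subdirectory name to its number of .pck entries."""
    counts = {}
    for name in sorted(os.listdir(qfiles_dir)):
        path = os.path.join(qfiles_dir, name)
        if os.path.isdir(path):
            # Count only files that look like queue entries.
            counts[name] = sum(
                1 for entry in os.listdir(path) if entry.endswith(".pck")
            )
    return counts
```

It would be called as 
count_queue_entries("/usr/local/cpanel/3rdparty/mailman/qfiles"), run as 
root or the mailman user, since the subdirectories are not world-readable. 
A queue whose count keeps growing is probably one whose runner has stopped.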


About "Also, what happens if you move the lists/LIST_NAME/digest.mbox file 
aside? Does that help?"

Do you mean delete that file? Remember that we already removed this list 
and re-created it, so that file was created fresh before the list went 
down again.

Thank you very much for the help!


----- Original Message ----- 
From: "Mark Sapiro" <msapiro at value.net>
To: "ArteryPlanet.Net :: Manuel Kissoyan" <manuel at arteryplanet.net>; 
"mailman mailing list" <mailman-users at python.org>
Sent: Sunday, January 22, 2006 8:40 PM
Subject: Re: [Mailman-Users] is driving me crazy


> ArteryPlanet.Net :: Manuel Kissoyan wrote:
>
>>We moved this client from one server to another because his mailing list 
>>was going down every week; in fact, for some reason it shuts down the 
>>whole Mailman. I hope someone can help us with this. We reinstalled 
>>Mailman and also deleted and recreated the list. It ran for a month, and 
>>now it is again going down every week. Just a note: before we moved this 
>>list, Mailman was working without problems on this server, so it is 
>>something specific to this list. It is crazy. Any help?
>>
>
>
> I don't think these log entries/error reports are relevant to the
> issue. See below.
>
> First, see
> <http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq06.011.htp>.
>
> That said, in order to help, we need more specific information about
> the problem. I.e., at this point, is it just the list, or the whole
> Mailman server that's down? If the whole server, which if any queue
> runners are still running? What's in the 'qrunner' log? What current
> entries are in the 'error' log? Where are the posts going, i.e. which
> qfiles/* directories have entries?
>
> If it's only the one list, presumably the qrunners are OK, but the
> other questions apply. Also, what happens if you move the
> lists/LIST_NAME/digest.mbox file aside? Does that help?
>
> Also see
> <http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq01.023.htp>.
>
>
>>I did find some logs probably could help.....
>>
>>---------------
>>Jan 18 17:59:46 2006 admin(4202): 
>>@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
>>
>>admin(4202): [----- Mailman Version: 2.1.6 -----]
>>
>>admin(4202): [----- Traceback ------]
>>
>>admin(4202): Traceback (most recent call last):
>>
>>admin(4202):   File "/usr/local/cpanel/3rdparty/mailman/scripts/driver", 
>>line 109, in run_main
>>
>>admin(4202):     sys.stdout.write(tempstdout.getvalue())
>>
>>admin(4202): IOError: [Errno 32] Broken pipe
>>
>>admin(4202): [----- Python Information -----]
>>
>>admin(4202): sys.version     =   2.2.3 (#1, Feb  2 2005, 12:20:51)
>>
>>[GCC 3.2.3 20030502 (Red Hat Linux 3.2.3-49)]
>>
>>admin(4202): sys.executable  =   /usr/bin/python2
>>
>>admin(4202): sys.prefix      =   /usr
>>
>>admin(4202): sys.exec_prefix =   /usr
>>
>>admin(4202): sys.path        =   /usr
>>
>>admin(4202): sys.platform    =   linux2
>>
>>admin(4202): [----- Environment Variables -----]
>>
>>admin(4202):    PATH_INFO: /mercedes_striplin.net
>>
>>admin(4202):    HTTP_COOKIE: 
>>banned_striplin.net+admin=xxxxxxxxxxxxeaa7cdxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx64623739; 
>>mercedes_striplin.net+admin=xxxxxxxxxxxx4b51cexxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx66366534
>>
>>admin(4202):    SERVER_SOFTWARE: Apache
>>
>>admin(4202):    PYTHONPATH: /usr/local/cpanel/3rdparty/mailman
>>
>>admin(4202):    SCRIPT_FILENAME: 
>>/usr/local/cpanel/3rdparty/mailman/cgi-bin/admindb
>>
>>admin(4202):    SERVER_ADMIN: webmaster at striplin.net
>>
>>admin(4202):    SCRIPT_NAME: /mailman/admindb
>>
>>admin(4202):    REQUEST_METHOD: GET
>>
>>admin(4202):    HTTP_HOST: striplin.net
>>
>>admin(4202):    HTTP_KEEP_ALIVE: 300
>>
>>admin(4202):    SERVER_PROTOCOL: HTTP/1.1
>>
>>admin(4202):    QUERY_STRING:
>>
>>admin(4202):    REQUEST_URI: /mailman/admindb/mercedes_striplin.net
>>
>>admin(4202):    HTTP_ACCEPT: 
>>text/xml,application/xml,application/xhtml+xml,text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.5
>>
>>admin(4202):    HTTP_ACCEPT_CHARSET: ISO-8859-1,utf-8;q=0.7,*;q=0.7
>>
>>admin(4202):    HTTP_USER_AGENT: Mozilla/5.0 (Windows; U; Windows NT 5.1; 
>>en-US; rv:1.7.2) Gecko/20040804 Netscape/7.2 (ax)
>>
>>admin(4202):    HTTP_CONNECTION: keep-alive
>>
>>admin(4202):    SERVER_NAME: www.striplin.net
>>
>>admin(4202):    REMOTE_ADDR: 70.184.21.183
>>
>>admin(4202):    REMOTE_PORT: 3450
>>
>>admin(4202):    HTTP_ACCEPT_LANGUAGE: en-us,en;q=0.5
>>
>>admin(4202):    PATH_TRANSLATED: 
>>/home/okiebenz/public_html/mercedes_striplin.net
>>
>>admin(4202):    SERVER_PORT: 80
>>
>>admin(4202):    GATEWAY_INTERFACE: CGI/1.1
>>
>>admin(4202):    HTTP_ACCEPT_ENCODING: gzip,deflate
>>
>>admin(4202):    SERVER_ADDR: 67.18.66.132
>>
>>admin(4202):    DOCUMENT_ROOT: /home/okiebenz/public_html
>>---------------------
>
>
> This indicates a user has log-in cookies as the list-admin for the
> 'banned' and 'mercedes' lists and is going to the admindb page for
> mercedes and has possibly quit or stopped the browser before the
> requested page was returned.
>
> The actual error trace is not relevant (it just indicates the script
> driver is trying to write to Apache which has already closed the
> pipe), but the fact that there may have been a long delay in building
> the page may indicate a problem with the list's request.pck or other
> file.
>
>
>>Also, when I am logged in to the shell (in fact right now, after I 
>>restarted Mailman with the shell open), I am getting the following 
>>messages:
>>
>>
>>Traceback (most recent call last):
>>  File "/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 270, in ?
>>    main()
>>  File "/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 230, in main
>>    qrunner.run()
>>  File "/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 
>> 70, in run
>>    filecnt = self._oneloop()
>>  File "/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 
>> 99, in _oneloop
>>    msg, msgdata = self._switchboard.dequeue(filebase)
>>  File "/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Switchboard.py", 
>> line 143, in dequeue
>>    fp = open(filename)
>>IOError: [Errno 2] No such file or directory: 
>>'/usr/local/cpanel/3rdparty/mailman/qfiles/in/1137970125.9761269+9cb39062150386402d39380ce999525996a8a87c.pck'
>>Traceback (most recent call last):
>>  File "/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 270, in ?
>>    main()
>>  File "/usr/local/cpanel/3rdparty/mailman/bin/qrunner", line 230, in main
>>    qrunner.run()
>>  File "/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 
>> 70, in run
>>    filecnt = self._oneloop()
>>  File "/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Runner.py", line 
>> 99, in _oneloop
>>    msg, msgdata = self._switchboard.dequeue(filebase)
>>  File "/usr/local/cpanel/3rdparty/mailman/Mailman/Queue/Switchboard.py", 
>> line 143, in dequeue
>>    fp = open(filename)
>>IOError: [Errno 2] No such file or directory: 
>>'/usr/local/cpanel/3rdparty/mailman/qfiles/bounces/1137970235.609026+0692b87b5f7b8faa6b88f82f4aa6f0394d3a7c35.pck'
>
>
> These imply that there are multiple qrunners processing the same
> queues. Both runners see an entry and begin processing. The first
> unlinks the queue entry successfully, and the second is unable to
> because it is already gone. Thus the above errors (one from the 'in'
> queue and one from the 'bounces' queue).
>
>
> -- 
> Mark Sapiro <msapiro at value.net>       The highway is for gamblers,
> San Francisco Bay Area, California    better use your sense - B. Dylan
>
>
>
> 
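
The race described in the quoted reply can be reproduced in miniature. 
The sketch below is not Mailman code: the helper names dequeue and 
simulate_two_runners and the file name example.pck are invented for the 
illustration. Two simulated runners both scan the same directory before 
either one dequeues, so the second finds the entry already gone and gets 
Errno 2, as in the tracebacks.

```python
import errno
import os
import tempfile


def dequeue(path):
    """Read a queue entry and then remove it, as a single runner would."""
    with open(path, "rb") as fp:
        data = fp.read()
    os.unlink(path)
    return data


def simulate_two_runners():
    """Let two runners pick up the same entry; return the errno the loser hits."""
    with tempfile.TemporaryDirectory() as queue_dir:
        entry = os.path.join(queue_dir, "example.pck")
        with open(entry, "wb") as fp:
            fp.write(b"message")
        # Both runners list the directory before either dequeues anything.
        seen_by_first = os.listdir(queue_dir)
        seen_by_second = os.listdir(queue_dir)
        dequeue(os.path.join(queue_dir, seen_by_first[0]))
        try:
            dequeue(os.path.join(queue_dir, seen_by_second[0]))
        except OSError as exc:
            return exc.errno
    return None
```

Here the loser always fails at open(). In the real logs some runners fail 
at open() and others at os.unlink(), which fits a race where the exact 
timing decides which call notices first that the file is gone.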




