[Mailman-Users] htdig-mailman patch(es) - blank page resultsonlyso far

Mark Sapiro mark at msapiro.net
Tue Dec 8 00:50:02 CET 2009


Capps, John Mwrote:
>
>The url http://mailmansite.com/mailman/mmsearch/listname does in fact produce the desired result (referrer not known).



Just to be clear, it should produce a page like

htdig Archives Access Failure
CGI problem. -5-Field count -4- fields:

If you want to make another attempt to access a list archive then go
via the list users information page.

If this problem persists then please e-mail the following information
to the mailman at example.com:

    Referer not known
    /mailman/mmsearch/listname

It looks like that's what you got. That says at least Mailman's
mmsearch CGI is accessable and probably working.


>If that is all working correctly, the problem is in htdig. mmsearch
>just sets CONFIG_DIR in the environment to your HTDIG_CONF_LINK_DIR
>setting and then opens a pipe to and from the command in your
>HTDIG_HTSEARCH_PATH setting, writes the search parameters to the pipe
>and reads and displays the result.
>...
>What happens if you run '/usr/bin/htsearch' by hand?
><snip>
>
>If I run it from bash, specifying a config file with -c in the config directory, it also returns blank results (specifying text format).


format should be 'short' or 'long'


>Like So: /usr/bin/htsearch -c /var/lib/mailman/archives/htdig/listname.conf
>Enter value for words: recentword 
>Content-type: text/html
>
>Enter value for format: text
>#


What do you get if you enter 'short' instead of 'text'. It probably
doesn't matter. You should get an HTML search results page either way.


>Could it be the config file written?
>
><snip>
>database_dir: /var/lib/mailman/archives/private/listname/htdig
>start_url: http://mailmansite.com/mailman/htdig/listname/
>limit_urls_to: ${start_url}
>local_urls: http://mailmansite.com/mailman/htdig/listname/=/var/lib/mailman/archives/private/listname/
>local_urls_only: true
>url_part_aliases: http://mailmansite.com/mailman/htdig/listname/ *mm-htdig*
>script_name: http://mailmansite.com/mailman/mmsearch/bta_developers


I assume bta-developers is what is listname elsewhere


>noindex_end: <!--/htdig_noindex-->
>noindex_start: <!--htdig_noindex-->
>exclude_urls: /cgi-bin/ .cgi
><snip>
>(the following template content seems appropriate)


As does all the above.


>Well, I might go back to patch fun, but you do seem to have narrowed the problem down to htdig itself... perhaps some way it is creating the dbs?


Possibly. Here is the contents of archives/private/mailman/htdig

-rw-rw-r-- 1 mark mailman  24576 Sep 19 03:30 db.docdb
-rw-rw-r-- 1 mark mailman  24576 Sep 19 03:30 db.docs.index
-rw-rw-r-- 1 mark mailman  49152 Sep 19 03:30 db.excerpts
-rw-rw-r-- 1 mark mailman  49152 Sep 19 03:30 db.metaphone.db
-rw-rw-r-- 1 mark mailman  49152 Sep 19 03:30 db.soundex.db
-rw-rw-r-- 1 mark mailman 126976 Sep 19 03:30 db.words.db
-rw-rw-r-- 1 mark mailman  16384 Sep 19 03:30 db.words.db_weakcmpr
-rw-rw-r-- 1 root mailman   2070 Dec 26  2008 mailman.conf
-rw-rw-r-- 1 mark mailman      0 Sep 19 03:30 rundig_last_run

and here is a larger, more active list

-rw-rw-r-- 1 mark mailman 22503424 Dec  7 03:29 db.docdb
-rw-rw-r-- 1 mark mailman  2678784 Dec  7 03:29 db.docs.index
-rw-rw-r-- 1 mark mailman 57622528 Dec  7 03:29 db.excerpts
-rw-rw-r-- 1 mark mailman  5914624 Dec  7 03:30 db.metaphone.db
-rw-rw-r-- 1 mark mailman  5668864 Dec  7 03:30 db.soundex.db
-rw-rw-r-- 1 mark mailman 74221568 Dec  7 03:29 db.words.db
-rw-rw-r-- 1 mark mailman    16384 Dec  7 03:29 db.words.db_weakcmpr
-rw-rw-r-- 1 root mailman     2056 Dec 26  2008 gpc-talk.conf
-rw-rw-r-- 1 mark mailman        0 Dec  7 03:30 rundig_last_run

How to those compare with yours?


>I notice that the /var/lib/mailman/archives/private/listname/htdig directory does not have permissions appropriate for the Apache host to access....


That's as it should be. The directory should be g+rws and Mailman's
group, because the CGIs that access it are run through SETGID wrappers
that set Mailman's group. This is true of all Mailman's data.

-- 
Mark Sapiro <mark at msapiro.net>        The highway is for gamblers,
San Francisco Bay Area, California    better use your sense - B. Dylan



More information about the Mailman-Users mailing list