indexing web pages - in python?
Kevin T. Ryan
kevin.t.ryan at gmail.com
Wed Apr 18 22:18:17 EDT 2007
On Apr 18, 8:55 pm, Dan Stromberg <dstromb... at datallegro.com> wrote:
> Are there any open source search engines written in python for indexing a
> given collection of (internal only) html pages? Right now I'm talking
> about dozens, but hopefully it'll be hundreds or thousands at some point.
>
> I'm thinking some sort of CGI script, with perhaps a cron job that updates
> the indexes.
>
> I'm not particularly looking for something that has a full RDBMS behind
> it - just a file that stores indexes. I'll go with an RDBMS-based
> solution if I must, but I don't think that's really needed at this point.
>
> TIA
You could try:
http://gnosis.cx/download/indexer.py
There is an extensive write-up by the author at:
http://gnosis.cx/publish/programming/charming_python_15.txt
Might be something you'd be interested in ...
More information about the Python-list
mailing list