Mass Text Indexing Tools

Thomas Weholt thomas at cintra.no
Tue Oct 17 05:14:56 EDT 2000


I'm working on a cdrom indexing project with similar capabilities. It
uses PostgreSQL and a custom full-text search engine. It's still very
alpha, if not pre-alpha, so there's not user-friendly yet. The final
version should index documents/text too, but that part is still a bit
buggy.

Please post any information on the subject or email me personally.
Stuff like UdmSearch or Harvest ( look at freshmeat.net ) doesn't seem
to have any python connectivity. I email one of the developers on one
of the projects, Udmsearch I believe, and asked for Python support. He
told me they didn't have Python knowledge among their developers, so
if anybody in this group have any spare time you know what to do ....
;->

On Tue, 17 Oct 2000 08:52:38 GMT, "Ender" <kthangavelu at earthlink.net>
wrote:

>Does anyone know of some good mass text indexing/searching tools
>(preferrable open source) that are accessible from python. i've tried
>using popen2 calls to grep but it starts to flag around 50Mbs. text
> material consists of around a hundredb thousand small files (emails).




More information about the Python-list mailing list