[ANNOUNCE] Ransacker extendable search engine
Andreas Jung
ajung at sz-sb.de
Tue May 2 01:17:46 EDT 2000
On Mon, May 01, 2000 at 10:59:13PM -0400, Michal Wallace (sabren) wrote:
>
> -------------------------------------------------------------------------
> Announcing RANSACKER
> -------------------------------------------------------------------------
>
> Ransacker is a scriptable, incrementally-double-indexed search engine
> written in python.
>
> It's *scriptable* in that you can index any text with any key. This
> makes it easy to index content ("pages") stored in databases, file
> systems, the web, etc.
>
> It can index *incrementally*. This means you can add content or update
> the entry for a particular page without touching the rest of the
> index.
>
> It's *double-indexed*, meaning that not only does every word have a
> list of pages, every page has a list of words. This is used for the
> incremental indexer, but also allows you to determine which pages have
> the most in common. This will allow ransacker to produce "what's
> related" pages.
>
> Currently, ransacker ranks pages by number of times keywords
> appear. It does *not yet* support boolean queries, fuzzy matches, or
> other advanced searching features.
Did you make any benchmarks ? Can it handle 1GB data and more ? :-)
Andreas
More information about the Python-list
mailing list