Storing pairs of (int, int) in a database : which db to choose ?

Paul Rubin http
Tue Dec 23 16:04:50 EST 2003


andreif at mail.dntis.ro (Stormbringer) writes:
> The only thing that bothers me a little is the speed for building the
> index, I tried with around 5000 messages and I am not quite thrilled,
> it's not _extremly_ slow but it has to be faster for what I need.
> Perhaps I'll use the C++ version with some Python bindings.

Why not do some profiling first.  Maybe it's limited by i/o traffic
rather than cpu cycles.  I don't know how Lupy works but the one time
I messed with full text indexing, the bottleneck was definitely the
random disk accesses needed for every word of each update.  The
solution is to batch the updates.  Sorting is much less seek intensive
than random updates.




More information about the Python-list mailing list