Best way to handle large lists?

Sybren Stuvel sybrenUSE at YOURthirdtower.com.imagination
Tue Oct 3 09:57:17 EDT 2006


Bill Williams enlightened us with:
> I don't know enough about Python internals, but the suggested
> solutions all seem to involve scanning bigList. Can this presumably
> linear operation be avoided by using dict or similar to find all
> occurrences of smallist items in biglist and then deleting those
> occurrences?

And how would that beat O(n)? Every element of bigList has to be
scanned at one point, either to compare it to every earlier element in
bigList and eliminate it, or to compare it to every element in
smallList.

Run benchmarks on the suggestions, and see which is fastest for
yourself.

Sybren
-- 
Sybren Stüvel
Stüvel IT - http://www.stuvel.eu/



More information about the Python-list mailing list