[issue18771] Reduce the cost of hash collisions for set objects

Raymond Hettinger report at bugs.python.org
Sun Aug 18 00:41:01 CEST 2013


Raymond Hettinger added the comment:

I just made the patch a few minutes ago.  Am just starting to work on benchmarks.  I posted here so that you guys could help find the strengths and weaknesses of the approach.  

The theory is sound, but it is going to take a good deal of effort to isolate the effects (either good or bad) in realistic benchmarks.  The short, tight loops in timeit tend to put everything in cache and hide the effects  of good memory access patterns.

If would love to have you all team up with me on this one.  I could use help on timings, examining disassembly, and runs with cache grind.

This is just in the experimental phase, so it is premature to be throwing up obstacles with "show me your data" or "what does it do in situatation x".  At this point, you all could help out by running some timings designed to isolate the effects on high collision data big enough to not fit in  L2 cache.  I hope you assist in evaluating the idea with an open mind.

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue18771>
_______________________________________


More information about the Python-bugs-list mailing list