[Python-Dev] Tuning Python dicts

Antoine Pitrou solipsis at pitrou.net
Sat Apr 10 22:40:26 CEST 2010


Reid Kleckner <rnk <at> mit.edu> writes:
> 
> I think you're right about the number of collisions, though.  CPython
> dicts use a pretty low load factor (2/3) to keep collision counts
> down.  One of the major benefits cited in the paper is the ability to
> maintain performance in the face of higher load factors, so I may be
> able to bump up the load factor to save memory.  This would increase
> collisions, but then that wouldn't matter, because resolving them
> would only require looking within two consecutive cache lines.

Why wouldn't it matter? Hash collisions still involve more CPU work, even though
if you're not access memory a lot.


Antoine.




More information about the Python-Dev mailing list