Python vs. Java gzip performance

Caleb Hattingh caleb.hattingh at gmail.com
Fri Mar 17 15:04:02 EST 2006


I tried this:

from timeit import *

#Try readlines
print Timer('import
gzip;lines=gzip.GzipFile("gztest.txt.gz").readlines();[i+"1" for i in
lines]').timeit(200) # This is one line


# Try file object - uses buffering?
print Timer('import gzip;[i+"1" for i in
gzip.GzipFile("gztest.txt.gz")]').timeit(200) # This is one line

Produces:

3.90938591957
3.98982691765

Doesn't seem much difference, probably because the test file easily
gets into memory, and so disk buffering has no effect.   The file
"gztest.txt.gz" is a gzipped file with 1000 lines, each being "This is
a test file".




More information about the Python-list mailing list