reading files with error
Christian Stapfer
nil at dev.nul
Sun Sep 18 00:51:09 EDT 2005
Maurice Ling <mauriceling at acm.org> wrote in message
news:mailman.547.1127016906.509.python-list at python.org...
> jepler at unpythonic.net wrote:
>
>>I have written a program to do something similar. My strategy is:
>> * use os.read() to read 512 bytes at a time
>> * when os.read fails, seek to the next multiple of 512 bytes
>> and write '\0' * 512 to the output
>>I notice this code doesn't deal properly with short reads, but in my
>>experience
>>they never happen (because the disk error makes an entire block
>>unreadable, and
>>a block is never less than 512 bytes)
>>
>>I use this code on a unix-style system.
>>
>>def dd(src, target, bs=512):
>> src = os.open(src, os.O_RDONLY)
>> if os.path.exists(target):
>> target = os.open(target, os.O_WRONLY | os.O_APPEND, 0600)
>> existing = os.lseek(target, 0, SEEK_END)
>> else:
>> existing = 0
>> target = os.open(target, os.O_WRONLY | os.O_CREAT, 0600)
>>
>> total = os.lseek(src, 0, SEEK_END) / bs
>> os.lseek(src, existing, SEEK_SET)
>> os.lseek(target, existing, SEEK_SET)
>>
>> if existing: print "starting at", existing
>> i = existing / bs
>> f = 0
>> lastrem = -1
>>
>> last = start = time.time()
>> while 1:
>> try:
>> block = os.read(src, bs)
>> except os.error, detail:
>> if detail.errno == errno.EIO:
>> block = "\0" * bs
>> os.lseek(src, (i+1) * bs, SEEK_SET)
>> f = f + 1
>> else:
>> raise
>> if block == "": break
>>
>> i = i + 1
>> os.write(target, block)
>>
>> now = time.time()
>> if i % 1000 or now - last < 1: continue
>> last = now
>>
>> frac = i * 1. / total
>> rem = int((now-start) * (1-frac) / frac)
>> if rem < 60 or abs(rem - lastrem) > .5:
>> rm, rs = divmod(rem, 60)
>> lastrem = rem
>> spd = i * 512. / (now - start) / 1024 / 1024
>> sys.stderr.write("%8d %8d %8d %3.1f%% %6d:%02d %6.1fMB/s\r"
>> % (i, f, i-f, i * 100. / total, rm, rs, spd))
>> sys.stderr.write("\n")
>>
>>
> Sorry but what are SEEK_END and SEEK_SET?
The Python 2.3 documentation seems to specify the *numeric*
values of these constants only. But since Python's file
objects are "implemented using C's stdio package", you
can read
http://www.opengroup.org/onlinepubs/009695399/functions/lseek.html
Regards,
Christian Stapfer
More information about the Python-list
mailing list