signature for a file ?

Shagshag13 shagshag13 at yahoo.fr
Tue Jul 30 05:49:36 EDT 2002


Hello,

I would like to have your opinion on this...

I have many hard drives at home, which may contain the same files many times over, in many places/directories (-> I'm really disorganized).
I would like to sort these files out. To do this I'm planning to write a Python script that would compute some kind of CRC32,
MD5 or SHA checksum (I'm really not competent in this, so I need advice and pointers to some implementations, and I'd like to know which is
best for getting a unique, unambiguous signature for a file) and then use it to find duplicates: same size + same signature = probably
the same file.
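
To make the idea concrete, here is a minimal sketch of the "same size + same signature" approach. It assumes a current Python with the standard hashlib module and picks SHA-256 purely as an example; the names file_digest and find_duplicates are made up for illustration. Files are grouped by size first, so only files that share a size with at least one other file are actually hashed.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(roots):
    """Map (size, digest) to the sorted list of paths sharing that signature.

    Only signatures shared by two or more files are returned.
    """
    # Pass 1: group regular files by size (cheap, no file content is read).
    by_size = defaultdict(list)
    for root in roots:
        for dirpath, _dirnames, filenames in os.walk(root):
            for name in filenames:
                path = os.path.join(dirpath, name)
                if os.path.islink(path) or not os.path.isfile(path):
                    continue
                try:
                    by_size[os.path.getsize(path)].append(path)
                except OSError:
                    continue  # file vanished or is unreadable: skip it

    # Pass 2: hash only the files whose size is not unique.
    by_signature = defaultdict(list)
    for size, paths in by_size.items():
        if len(paths) < 2:
            continue
        for path in paths:
            try:
                by_signature[(size, file_digest(path))].append(path)
            except OSError:
                continue
    return {sig: sorted(p) for sig, p in by_signature.items() if len(p) > 1}
```

Calling find_duplicates(["/mnt/disk1", "/mnt/disk2"]) (hypothetical paths) would return one entry per group of probable duplicates. A CRC32 is only 32 bits wide, so accidental collisions become realistic with many files; a cryptographic hash makes them extremely unlikely. For complete certainty before deleting anything, the candidates can still be compared byte by byte, for example with filecmp.cmp(a, b, shallow=False).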

(I'm not at all concerned about how long this will take, as long as it is less than a week ;o). It should still be far quicker than doing it
by hand.)

(Another question: how can I find out how many disks/mount points are available on a system?)
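
As far as I know the standard library has no single portable call for this, so the following is only a rough sketch under stated assumptions: on Linux it reads /proc/mounts, on Windows it probes the drive letters A: to Z:, and anywhere else it simply falls back to "/". The names parse_mounts and list_mount_points are hypothetical.

```python
import os
import string
import sys


def parse_mounts(text):
    """Extract mount points from text in /proc/mounts format."""
    points = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) >= 2:
            # The second field is the mount point; spaces in it are
            # written as the escape sequence \040.
            points.append(fields[1].replace("\\040", " "))
    return points


def list_mount_points():
    """Best-effort list of mount points or drives on this machine."""
    if sys.platform.startswith("win"):
        # Assumption: a drive exists if its root path can be accessed.
        drives = [letter + ":\\" for letter in string.ascii_uppercase]
        return [d for d in drives if os.path.exists(d)]
    if os.path.exists("/proc/mounts"):
        with open("/proc/mounts") as f:
            return parse_mounts(f.read())
    # Assumption: no known mount table, so fall back to the root only.
    return ["/"]
```

On Linux the result also includes virtual filesystems such as /proc and /sys, so it would need filtering before being handed to a file scan.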

Do you think this could work? Do you have any other/better suggestions?

Thanks in advance,

s13.




