signature for a file ?

Shagshag13 shagshag13 at yahoo.fr
Wed Jul 31 05:18:32 EDT 2002


"Huaiyu Zhu" <huaiyu at gauss.almadan.ibm.com> a écrit dans le message de news: slrnake48e.467.huaiyu at gauss.almadan.ibm.com...
> Shagshag13 <shagshag13 at yahoo.fr> wrote:
> >
> >i had at home many hdds, that could contain many time same files, in many
> >places/directories (-> i'm really disorganized).  i would like to do some
> >sort on theses files. to do this i'm planning to write a python script that
> >would compute a kind of CRC32, MD5 or SHA (i'm really not competent in that
> >- so here i need advices and pointer to some implementations - and to know
> >which is the best to had a unique unambiguous signature for a file) and
> >then use it to find "doubles" : same size + same signature = probably same
> >file.
>
> That would be very useful indeed.  (Concurs another disorganized person :-)
>
> Here's a further question.  Once you get to know the identities of files,
> how do you know about the directories?  I have many directories that have
> identical subdirectories.  I'd like to build an inventory of maximal
> identical directories.  A and B are defined as maximal identical if they are
> identical but their parents are not.  The few ideas I have all produces
> combinatorial explosion.

hello,

i don't think about that yet, it was just a quick  idea to check double files.
if a had a cool idea for "maximal identical directories" i'll you know...

s13.





More information about the Python-list mailing list