signature for a file ?

DIG dig.list at telkel.net
Tue Jul 30 07:08:24 EDT 2002


Hi, Shagshag13 !

 On Tue, Jul 30, 2002 at 12:13:31PM +0200, Shagshag13 wrote:

> "Paul Rubin" <phr-n2002b at NOSPAMnightsong.com> a Иcrit dans le message de news: 7xeldlh6o8.fsf at ruckus.brouhaha.com...
> > 
> > Anyway if two files have the
> > same MD5 checksum, it's basically certain that the files are
> > identical.
> 
> thanks for this, (so no need to use SHA or CRC ?)

Practically -- no. 

> > So your basic strategy is:
> >
> >    for each file in your system:
> >       c = md5 checksum of the file
> >       print c and the pathname leading to the file
> >
> > Then take the output of the above and sort it so all the identical md5's
> > end up next to each other in the output.
> >
> > Then find all the sets of files with identical md5's and print out their
> > paths so you can investigate further, get rid of one copy, etc.
> 
> yes, i would like to do something like that !

And if you take into account the size of file, you will not have to run md5sum on EACH file. It will be sufficient to check with md5sum whether the files are identical ONLY IF their sizes are the same.


Regards,

-- 
DIG (Dmitri I GOULIAEV)




More information about the Python-list mailing list