diffing and uniqing directories

Dan Stromberg dstromberglists at gmail.com
Sun May 11 15:44:09 EDT 2008


On Sat, 26 Apr 2008 23:44:17 +0530, Rustom Mody wrote:

> Over years Ive collected tgz's of my directories. I would like to diff
> and uniq them
> 
> Now I guess it would be quite simple to write a script that does a walk
> or find through a pair of directory trees, makes a SHA1 of each file and
> then sorts out the files whose SHA1s are the same/different. What is
> more difficult for me to do is to write a visual/gui tool to help me do
> this.
> 
> I would guess that someone in the python world must have already done it
> [The alternative is to use some of the tools that come with version
> control systems like git. But if I knew more about that option I would
> not be stuck with tgzs in the first place ;-)]
> 
> So if there is such software known please let me know.
> 
> PS Also with the spam flood that has hit the python list I dont know if
> this mail is being read at all or Ive fallen off the list!

It doesn't have a GUI, but here's a python program I wrote for dividing 
large collections of files up into identical groups:

http://stromberg.dnsalias.org/~strombrg/equivalence-classes.html




More information about the Python-list mailing list