find files with identical contents, or print out a list of all files and their md5 sums and sizes

Project Description

Find files with identical contents. Optionally hard-link or delete such files. Works on Windows. Can print out the md5sums and sizes of all your files. Unlike most tools of its ilk, this one will work even if the list of the metadata of all your files is too large to fit into your RAM.

Dupfilefind is reasonably efficient for what it does. It first compares sizes (files of different sizes cannot have identical contents), then inode numbers (so it can tell when different filenames are actually links to the same underlying file), and only then resorts to computing MD5 sums of file contents. I also did some profiling and benchmarking to see how I could make it most efficient, and this is what I came up with.
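
The staged comparison described above can be sketched roughly as follows. This is an illustrative sketch only, not dupfilefind's actual code: the function names find_duplicates and md5_of_file are hypothetical, it holds all metadata in memory (unlike dupfilefind), and it assumes the platform reports meaningful inode numbers, falling back to treating each path as distinct when it does not.

```python
import hashlib
import os
from collections import defaultdict


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root):
    """Return a list of groups (sorted lists of paths) with identical contents.

    Hypothetical helper for illustration; not part of dupfilefind's API.
    """
    # Stage 1: group by size. A file with a unique size has no duplicate.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path) or not os.path.isfile(path):
                continue  # skip symlinks and special files
            by_size[os.path.getsize(path)].append(path)

    groups = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        # Stage 2: collapse names that are hard links to the same file.
        by_inode = defaultdict(list)
        for path in paths:
            st = os.stat(path)
            # Assumption: an inode number of 0 means "not available",
            # so fall back to the path itself as a unique key.
            key = (st.st_dev, st.st_ino) if st.st_ino else path
            by_inode[key].append(path)

        # Stage 3: hash one representative per inode, and only when
        # there is more than one distinct underlying file to compare.
        by_digest = defaultdict(list)
        for linked_paths in by_inode.values():
            if len(by_inode) > 1:
                key = md5_of_file(linked_paths[0])
            else:
                key = "same-inode"
            by_digest[key].extend(linked_paths)
        groups.extend(sorted(g) for g in by_digest.values() if len(g) > 1)
    return groups
```

Calling find_duplicates on a directory returns one list per set of identical files. The point of the ordering is that the cheap checks (a stat call per file) eliminate most candidates before any file contents are read.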

LICENCE

You may use this package under the GNU General Public License, version 2 or, at your option, any later version. You may use this package under the Transitive Grace Period Public Licence, version 1.0, or at your option, any later version. (You may choose to use this package under the terms of either licence, at your option.) See the file COPYING.GPL for the terms of the GNU General Public License, version 2. See the file COPYING.TGPPL.html for the terms of the Transitive Grace Period Public Licence, version 1.0.

See TGPPL.PDF for why the TGPPL exists, graphically illustrated in four slides.

Release History

1.6.9 (this version)
1.6.7
1.6.6
1.6.5
1.6.4
1.6.3-4
1.6.3-1
1.6.3
1.6.2-1
1.6.2
1.6.1-4
1.6.1
1.6.0
1.5.0
1.4.3
1.4.2
1.4.1-5
1.4.1
1.4.0
1.3.1
1.3.0
1.2.2
1.2.1
1.2.0
1.1.2
1.1.1
1.1.0
1.0.3

Download Files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

File name: dupfilefind-1.6.9.tar.gz (21.8 kB)
File type: Source
Upload date: Apr 22, 2010
