Skip to main content

Local file duplicate finder.

Project description

filedups


Given a sequence of paths (full directory paths), finds and groups duplicate files recursively.
Doesn't provide 100% accuracy. Reported files in a group might not be exactly the same.

Example Scan on Windows

How to use filedups on Windows

HOW TO USE

Go to src/filedups in terminal.
Put full paths of directories you want to search in in-dirs.txt file on separate lines.
Options (M, X are the number of bytes) (default value for M is 1024000 (1000 KB), default value for X is None):
--min-file-size M
--max-file-size X


Then run main.py:
For Linux:
python3 main.py in-dirs.txt


1000 KB minimum file size:
python3 main.py in-dirs.txt --min-file-size 1024000


200 KB minimum, 2000 KB maximum file size:
python3 main.py in-dirs.txt --min-file-size 204800 --max-file-size 2048000


For Windows:
py main.py in-dirs.txt


Results will be in a text file of current working directory of command line
, which starts with filedups and contains timestamp of the scan.

Notes

It takes at least 3 minutes to filter 284000 files to 40300 files and then find duplicates.
It takes at least 19 minutes to filter 286000 files to 140000 files and then find duplicates.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

filedups-1.0.0.tar.gz (10.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

filedups-1.0.0-py3-none-any.whl (18.0 kB view details)

Uploaded Python 3

File details

Details for the file filedups-1.0.0.tar.gz.

File metadata

  • Download URL: filedups-1.0.0.tar.gz
  • Upload date:
  • Size: 10.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for filedups-1.0.0.tar.gz
Algorithm Hash digest
SHA256 aa42734ca42f7b244e0f8ad5a40e9e5fc05d9f5dd1a9631f3d8661b5eafdcb4a
MD5 bb6a2aae091340f90c1735fa80a9f3af
BLAKE2b-256 3dfff8e8ed9b050485d34721abb0a42a1d7d60dcfd3cd2e697f48507cf91a918

See more details on using hashes here.

File details

Details for the file filedups-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: filedups-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 18.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for filedups-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 06da6b565284d4340a56ab40909fc3b06faef7a020639858ed8ada8295d52f3e
MD5 5f1f94da9529fcd5c4d0c9e82184dc69
BLAKE2b-256 92eba613d240101679ac312bfd4ea0fe89f9f338f4ae2afe22227eb695e36669

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page