duplicate image finder helps you find duplicate or similar images as well as delete them.
Project description
Duplicate Image Finder
Duplicate image finder uses image hashing to find similar/duplicate images in your local storage. All you gotta do is
- install,
- run (will setup the database with table) if no configuration is provided,
- run specifying which directory to look for images, and finally
- run asking it to show duplicate/similar images.
For example:
# installing
python3.9 -m pip install --user duplicate-image-finder
# show help
duplicate-image-finder --help
# add directory images and calculate hashes using 4 threads
duplicate-image-finder --add <directory> --parallel 4
# show the duplicate/similar images found in your browser
python duplicate-image-finder --show
Requirements
Lots, but all of them can be installed as dependencies as long as you are using python3.9
. Unfortunately, some of its dependencies have not been made available in python3.10
yet, so we are stuck there.
Poetry
Installing dependencies
poetry install
Running
poetry run python duplicate_image_finder/duplicate_finder.py --show
Testing
poetry run pytest
etc.
This duplicate image finder source code is inspired/partially copied from https://github.com/philipbl/duplicate-images.git.
Significant changes from the referred version are:
- moved from
mongodb
tosqlite
- Is probably better in terms of finding similar images (or perhaps I misunderstood the previous code)
Concepts/Technologies I learned/tried to learn while doing this:
poetry
for dependencypytest
for unit testpysqlite3
for databaseconcurrency
for performanceimagehash
for perpetual image hashing for finding similarity- grouping CLI arguments in python (mutually exclusive, etc) using
argparser
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for duplicate_image_finder-0.2.7.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | f06fcdbd70b12475a10e554ef123b0b4dc921fd886bf69e764a1e1d872748389 |
|
MD5 | b2ca40598bd4904012c539731e963e2c |
|
BLAKE2b-256 | 40efb8d547d87ce0711f9923df1248f34e0b2cd318e82078d61b11200bae37f8 |
Hashes for duplicate_image_finder-0.2.7-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 08505ba7841e7eac906651f18b64d4d8a48f8cf8e7e0444c63b69d0c11b0e5c7 |
|
MD5 | e19efce0fcc12d06a85cfdd51e48ee25 |
|
BLAKE2b-256 | f8267fee10cc41bbb76b96d74caabdda97330c2ffe165dadb39f578ab4ba9799 |