Very fast two-folder image duplicate finder built with pickle and cv2

Project description

imgdups

Most image duplicate checkers find duplicates within a single folder. This solution instead verifies that no image from one path (search) also exists in another path (target). It uses OpenCV to create image descriptors and caches them in a pickle file, so every run after the first is faster. With this approach it finds not just exact duplicates but also similar images, based on a match score.
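
The caching idea described above can be sketched roughly as follows. This is not the imgdups implementation: the function names, the cache layout (path mapped to modification time and descriptor) and the `compute` callback are all assumptions made for illustration. In a real run, `compute` would be whatever OpenCV-based descriptor extraction the tool performs.

```python
import os
import pickle


def load_cache(cache_file):
    """Return the cached {path: (mtime, descriptor)} dict, or {} if no cache exists."""
    if not os.path.exists(cache_file):
        return {}
    with open(cache_file, "rb") as handle:
        return pickle.load(handle)


def save_cache(cache_file, cache):
    """Write the cache dict to disk so the next run can reuse it."""
    with open(cache_file, "wb") as handle:
        pickle.dump(cache, handle)


def cached_descriptor(path, cache, compute):
    """Return the descriptor for path, recomputing only if the file changed.

    compute is a hypothetical callback (path -> descriptor); it stands in
    for the expensive image analysis step.
    """
    mtime = os.path.getmtime(path)
    entry = cache.get(path)
    if entry is not None and entry[0] == mtime:
        return entry[1]  # cache hit: skip the expensive step
    descriptor = compute(path)  # cache miss: compute and remember
    cache[path] = (mtime, descriptor)
    return descriptor
```

Keying on the modification time is one simple way to invalidate stale entries. Only load pickle files you created yourself, since unpickling untrusted data can execute arbitrary code.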

Requirements

Tested with Python 3.6+

sudo apt install python3 python3-pip

Option 1: Install from Source

git clone https://github.com/ChuckNorrison/imgdups
cd imgdups
pip3 install .

Option 2: Install from PyPI (recommended)

pip3 install imgdups

CLI Usage

imgdups --search "/path/to/reference" --target "/path/to/check"

Or, if the package is not installed, run the script from a git clone:

cd imgdups
python3 imgdups.py --search "/path/to/reference" --target "/path/to/check"

Python example

#!/usr/bin/env python3
import imgdups

SEARCH_PATH = "/path/to/reference"
TARGET_PATH = "/path/to/check"

img_dups = imgdups.ImgDups(TARGET_PATH, SEARCH_PATH)
duplicates = img_dups.find_duplicates()

for duplicate in duplicates:
    # print() does not apply logging-style arguments, so format with % explicitly
    print("%s == %s (score: %d)" % (
            duplicate["target"],
            duplicate["search"],
            duplicate["score"]
    ))

print("%d duplicates found" % len(duplicates))
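
Since each result carries a score, a small helper can narrow the list to the strongest matches. The sketch below is hypothetical and not part of imgdups: it only relies on the "target", "search" and "score" keys shown in the example above, and it assumes that a higher score means a closer match, which you should check against your own results.

```python
def filter_by_score(duplicates, min_score):
    """Keep matches whose score is at least min_score, best match first.

    Assumption: a higher score means a closer match.
    """
    strong = [d for d in duplicates if d["score"] >= min_score]
    return sorted(strong, key=lambda d: d["score"], reverse=True)


# Made-up sample data in the same shape as the find_duplicates() results
sample = [
    {"target": "a.jpg", "search": "b.jpg", "score": 40},
    {"target": "c.jpg", "search": "d.jpg", "score": 95},
]

for match in filter_by_score(sample, min_score=50):
    print("%s == %s (score: %d)" % (
        match["target"], match["search"], match["score"]))
```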

Download files

Source Distribution

imgdups-0.1.2.tar.gz (17.8 kB)

Uploaded Source

Built Distribution

imgdups-0.1.2-py3-none-any.whl (18.0 kB)

Uploaded Python 3

File details

Details for the file imgdups-0.1.2.tar.gz.

File metadata

  • Download URL: imgdups-0.1.2.tar.gz
  • Upload date:
  • Size: 17.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for imgdups-0.1.2.tar.gz
Algorithm Hash digest
SHA256 f095db94b9d5c3640510f2b45a21bfcda6905350ece983cbdcd59cfedc9bf17e
MD5 a6ab2266803c18dfbe430b7a2180f950
BLAKE2b-256 e12f1cf643187c95394c607c614098ef15b1abf2f522358dc930c8cc47f7c5f6
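
A downloaded file can be checked against the SHA256 digest listed above with Python's standard hashlib module. The helper names below are illustrative only, and the file path is assumed to be wherever you saved the download.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_sha256(path, expected_hex):
    """Compare a file's digest with an expected hex string, ignoring case and whitespace."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `matches_sha256("imgdups-0.1.2.tar.gz", "f095db94b9d5c3640510f2b45a21bfcda6905350ece983cbdcd59cfedc9bf17e")` should return True for an intact copy of the source distribution.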

File details

Details for the file imgdups-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: imgdups-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 18.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for imgdups-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 acc9abdb37f6bf1e9de1743e8a1fc9bda03f7de653b81fc89a5c46582e04059d
MD5 876547e63e9bd17f895f2aede72ec3fe
BLAKE2b-256 42c78c3d96b605f7b7dd580d7fd753421d539f7c560052bd05a25bcc184d9322
