Skip to main content

Very fast two folder image duplicate finder programmed with pickle and cv2

Project description

imgdups

Pylint

Most image duplicate checkers can find duplicates within a single folder. This solution can verify that no duplicates from one path (search) exists in another path (target). It will use opencv to create image descriptors and cache them into a pickle file for faster processing after it was run the first time. With this approach we can not just find exact duplicates but similar images based on a match score.

Requirements

Python 3.6+ was tested

sudo apt install python3 python3-pip

Option 1: Install from Source

git clone https://github.com/ChuckNorrison/imgdups
cd imgdups
pip3 install .

Option 2: Install from PyPi (recommended)

pip3 install imgdups

CLI Usage

imgdups --search "/path/to/reference" --target "/path/to/check"

or if not installed (git clone first)

cd imgdups
python3 imgdups.py --search "/path/to/reference" --target "/path/to/check"

Python example

#!/usr/bin/env python3
import imgdups

SEARCH_PATH = "/path/to/reference"
TARGET_PATH = "/path/to/check"

img_dups = imgdups.ImgDups(TARGET_PATH, SEARCH_PATH)
duplicates = img_dups.find_duplicates()

for duplicate in duplicates:
    print("%s == %s (score: %d)",
            duplicate["target"],
            duplicate["search"],
            duplicate["score"]
    )

print("%d duplicates found", len(duplicates))

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

imgdups-0.1.6.tar.gz (18.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

imgdups-0.1.6-py3-none-any.whl (18.3 kB view details)

Uploaded Python 3

File details

Details for the file imgdups-0.1.6.tar.gz.

File metadata

  • Download URL: imgdups-0.1.6.tar.gz
  • Upload date:
  • Size: 18.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.12

File hashes

Hashes for imgdups-0.1.6.tar.gz
Algorithm Hash digest
SHA256 9d7dba75c697a11353be5dc77b1365968d0062af78a061dd5efdab6aa6a857b1
MD5 e60150d1e0919ff13cc53318383fb56d
BLAKE2b-256 e314da7f56ad5913e279e11244b2ebb8cf1392adb4c0c3c9972850913bb0a5d2

See more details on using hashes here.

File details

Details for the file imgdups-0.1.6-py3-none-any.whl.

File metadata

  • Download URL: imgdups-0.1.6-py3-none-any.whl
  • Upload date:
  • Size: 18.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.12

File hashes

Hashes for imgdups-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 881bfabd536e3f55a53a67bd61eb19dd33b7e7f5da3921fc3a4f6c018869624e
MD5 089b7162ff65c0b019b357090e5fec96
BLAKE2b-256 edb54de509d966dbbb545ab1a2e6cc7172a2aca7a9f58f365f7aeb7a09359abf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page