
Very fast two-folder image duplicate finder built with pickle and cv2

Project description

imgdups


Most image duplicate checkers find duplicates within a single folder. This tool instead checks that no image from one path (search) also exists in another path (target). It uses OpenCV to compute image descriptors and caches them in a pickle file, so every run after the first is faster. Because images are compared by a match score, it finds similar images as well as exact duplicates.
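The caching idea can be illustrated with a minimal sketch. It is not the code used by imgdups: the function names, the cache layout (path mapped to modification time and descriptor) and the `compute` callable are assumptions made for illustration. In the real tool the descriptor would come from OpenCV; here it is any picklable value.

```python
import os
import pickle


def load_cache(cache_file):
    """Return the cached {path: (mtime, descriptor)} dict, or an empty one."""
    try:
        with open(cache_file, "rb") as handle:
            return pickle.load(handle)
    except (FileNotFoundError, EOFError, pickle.UnpicklingError):
        return {}


def cached_descriptors(paths, cache_file, compute):
    """Map each path to its descriptor, calling compute() only on cache misses.

    compute is a hypothetical callable (path -> picklable descriptor).
    """
    cache = load_cache(cache_file)
    result = {}
    dirty = False
    for path in paths:
        mtime = os.path.getmtime(path)
        entry = cache.get(path)
        # Recompute when the file is new or has changed since it was cached.
        if entry is None or entry[0] != mtime:
            entry = (mtime, compute(path))
            cache[path] = entry
            dirty = True
        result[path] = entry[1]
    if dirty:
        # Rewrite the pickle file only when something was added or updated.
        with open(cache_file, "wb") as handle:
            pickle.dump(cache, handle)
    return result
```

On a second run over unchanged files, `compute` is never called and all descriptors are read from the pickle file, which is where the speed-up comes from.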

Requirements

Tested with Python 3.6+.

sudo apt install python3 python3-pip

Option 1: Install from Source

git clone https://github.com/ChuckNorrison/imgdups
cd imgdups
pip3 install .

Option 2: Install from PyPI (recommended)

pip3 install imgdups

CLI Usage

imgdups --search "/path/to/reference" --target "/path/to/check"

Or, if the package is not installed, run the script from a clone of the repository:

cd imgdups
python3 imgdups.py --search "/path/to/reference" --target "/path/to/check"

Python example

#!/usr/bin/env python3
import imgdups

SEARCH_PATH = "/path/to/reference"
TARGET_PATH = "/path/to/check"

img_dups = imgdups.ImgDups(TARGET_PATH, SEARCH_PATH)
duplicates = img_dups.find_duplicates()

# Each result holds the matching file pair and its match score.
for duplicate in duplicates:
    print("%s == %s (score: %d)" % (
        duplicate["target"],
        duplicate["search"],
        duplicate["score"],
    ))

print("%d duplicates found" % len(duplicates))
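When one reference image matches several files, it can help to group the results. The sketch below assumes that `find_duplicates()` returns a list of dicts with the keys `target`, `search` and `score`, as the example above suggests. The helper name `group_by_search` and the sample rows are hypothetical.

```python
from collections import defaultdict


def group_by_search(duplicates):
    """Group result rows by the reference (search) image they matched."""
    groups = defaultdict(list)
    for row in duplicates:
        groups[row["search"]].append((row["target"], row["score"]))
    return dict(groups)


# Hypothetical rows shaped like the entries printed in the example above.
sample = [
    {"target": "/check/a.jpg", "search": "/reference/x.jpg", "score": 90},
    {"target": "/check/b.jpg", "search": "/reference/x.jpg", "score": 75},
    {"target": "/check/c.jpg", "search": "/reference/y.jpg", "score": 88},
]

for search, matches in group_by_search(sample).items():
    print(search, "->", matches)
```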
