Skip to main content

Very fast two folder image duplicate finder programmed with pickle and cv2

Project description

imgdups

Pylint

Most image duplicate checkers can find duplicates within a single folder. This solution can verify that no duplicates from one path (search) exists in another path (target). It will use opencv to create image descriptors and cache them into a pickle file for faster processing after it was run the first time. With this approach we can not just find exact duplicates but similar images based on a match score.

Requirements

Python 3.6+ was tested

sudo apt install python3 python3-pip

Option 1: Install from Source

git clone https://github.com/ChuckNorrison/imgdups
cd imgdups
pip3 install .

Option 2: Install from PyPi (recommended)

pip3 install imgdups

CLI Usage

imgdups --search "/path/to/reference" --target "/path/to/check"

or if not installed (git clone first)

cd imgdups
python3 imgdups.py --search "/path/to/reference" --target "/path/to/check"

Python example

#!/usr/bin/env python3
import imgdups

SEARCH_PATH = "/path/to/reference"
TARGET_PATH = "/path/to/check"

img_dups = imgdups.ImgDups(TARGET_PATH, SEARCH_PATH)
duplicates = img_dups.find_duplicates()

for duplicate in duplicates:
    print("%s == %s (score: %d)",
            duplicate["target"],
            duplicate["search"],
            duplicate["score"]
    )

print("%d duplicates found", len(duplicates))

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

imgdups-0.1.3.tar.gz (17.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

imgdups-0.1.3-py3-none-any.whl (18.1 kB view details)

Uploaded Python 3

File details

Details for the file imgdups-0.1.3.tar.gz.

File metadata

  • Download URL: imgdups-0.1.3.tar.gz
  • Upload date:
  • Size: 17.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for imgdups-0.1.3.tar.gz
Algorithm Hash digest
SHA256 0e7eff53a7168f21c40639766a160b46f31db9d0b30f4636367768ee5b0cb15c
MD5 cd25e852ec7c818eceb94b7344287e16
BLAKE2b-256 d3ca0583260c672aa95f438767bdfb5c75f3b0b86730c4936760e471d53aa67b

See more details on using hashes here.

File details

Details for the file imgdups-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: imgdups-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 18.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for imgdups-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 2c7f16181a4ee7de82b7413c54a2eb4ccbe614920c960fbd65b729453670778d
MD5 8f61aedcbeb50d724eb434c5373f968a
BLAKE2b-256 b56756942ae3b9759db9df9d19b8ea89528dc2ad62eaf58442665f3e3e3982b0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page