Package for image deduplication

Project description

imagededup is a Python package for finding duplicates in a collection of images using a variety of algorithms. It also provides a framework for evaluation and experimentation. The package offers the following functionality:

  • Finding duplicates in a directory using one of the following algorithms:
    • Convolutional Neural Network
    • Perceptual hashing
    • Difference hashing
    • Wavelet hashing
    • Average hashing
  • Generating features for images using one of the algorithms listed above.
  • A framework to evaluate the effectiveness of deduplication given a ground-truth mapping.
  • Plotting duplicates found for a given image file.
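
To give a feel for how the hashing-based methods in the list above work, here is a minimal pure-Python sketch of difference hashing followed by a Hamming-distance comparison. It is an illustration only and is not imagededup's implementation or API: the function names, the toy pixel grids, and the `max_distance` threshold are all assumptions made for this example. A real pipeline would first load each image, convert it to grayscale, and resize it to a small fixed grid, and those steps are skipped here.

```python
from typing import Dict, List, Sequence

# A grayscale image reduced to a small grid: rows of pixel intensities.
Grid = Sequence[Sequence[int]]


def dhash_bits(pixels: Grid) -> int:
    """Difference hash: one bit per horizontally adjacent pixel pair,
    set to 1 when the left pixel is brighter than its right neighbour."""
    bits = 0
    for row in pixels:
        for left, right in zip(row, row[1:]):
            bits = (bits << 1) | (1 if left > right else 0)
    return bits


def hamming_distance(a: int, b: int) -> int:
    """Number of bit positions in which two hashes differ."""
    return bin(a ^ b).count("1")


def find_duplicates(images: Dict[str, Grid],
                    max_distance: int = 1) -> Dict[str, List[str]]:
    """Map each image name to the other images whose hash lies within
    max_distance bits (hypothetical threshold, chosen for this toy data)."""
    hashes = {name: dhash_bits(grid) for name, grid in images.items()}
    return {
        name: [
            other
            for other, other_hash in hashes.items()
            if other != name
            and hamming_distance(hashes[name], other_hash) <= max_distance
        ]
        for name in hashes
    }


# Hypothetical 3x4 "images": the second is the first with every pixel
# brightened by 5, the third has a different gradient layout.
images = {
    "a.png": [[10, 20, 30, 40], [40, 30, 20, 10], [5, 5, 9, 1]],
    "a_brighter.png": [[15, 25, 35, 45], [45, 35, 25, 15], [10, 10, 14, 6]],
    "b.png": [[40, 30, 20, 10], [10, 20, 30, 40], [9, 1, 5, 5]],
}

duplicates = find_duplicates(images)
print(duplicates)
```

Because the hash encodes only whether brightness rises or falls between neighbouring pixels, the uniformly brightened copy hashes identically to the original, while the image with a different gradient layout differs in most bits and is not reported as a duplicate.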

Read the documentation at:

imagededup is compatible with Python 3.6+ and runs on Linux, MacOS X and Windows. It is distributed under the Apache 2.0 license.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

imagededup-0.2.2.tar.gz (60.6 kB)

Uploaded source

Built Distributions

imagededup-0.2.2-cp37-cp37m-win_amd64.whl (50.7 kB)

Uploaded cp37

imagededup-0.2.2-cp36-cp36m-win_amd64.whl (50.6 kB)

Uploaded cp36
