Skip to main content

Library for image hashing and deduplication.

Project description

# SDHash [![Build Status](https://travis-ci.org/horia141/sdhash.svg?branch=master)](https://travis-ci.org/horia141/sdhash)

A Python library for computing hashes of images which ignore perceptual differences.

## Usage

```python
import sdhash
from PIL import Image

i1 = Image.open('test1.png')
i2 = Image.open('test1_noise.png')
i3 = Image.open('test2.png')

sdhash.test_duplicate(i1, i2) # True
sdhash.test_duplicate(i1, i3) # False
sdhash.hash_image(i1) # [ an md5 output ]
```

## Background

Suppose you want to test that two images are identical. The naive approach of simply comparing the byte-array representation of the two is not good.

## Algorithm

## Installation ##

The Python image library and NumPy/SciPy etc.

Installation is simple, via `pip`:

```bash
pip install sdhash
```

## TODO

Resistance to rotation, mirroring etc.
Tunable knobs (for similarity detection etc.)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for sdhash, version 0.0.4
Filename, size File type Python version Upload date Hashes
Filename, size sdhash-0.0.4-py2-none-any.whl (5.5 kB) File type Wheel Python version py2 Upload date Hashes View hashes
Filename, size sdhash-0.0.4.tar.gz (4.0 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page