Skip to main content

Pure-Python library for computing fuzzy hashes (ssdeep)

Project description

ppdeep

This is a pure-Python library for computing context triggered piecewise hashes (CTPH), also called fuzzy hashes, or often ssdeep after the name of a popular tool. At a very high level, fuzzy hashing is a way to determine whether two inputs are similar, rather than identical. Fuzzy hashes are widely adopted in digital forensics and malware detection.

This implementation is based on SpamSum by Dr. Andrew Tridgell.

Usage

To compute a fuzzy hash, simply use hash() function:

>>> import ppdeep
>>> h1 = ppdeep.hash('The equivalence of mass and energy translates into the well-known E = mc²')
>>> h1
'3:RC0qYX4LBFA0dxEq4z2LRK+oCKI9VnXn:RvqpLB60dx8ilK+owX'
>>> h2 = ppdeep.hash('The equivalence of mass and energy translates into the well-known E = MC2')
>>> h2
'3:RC0qYX4LBFA0dxEq4z2LRK+oCKI99:RvqpLB60dx8ilK+oA'

To calculate level of similarity, use compare() function which returns an integer value from 0 to 100 (full match):

>>> ppdeep.compare(h1, h2)
34

Function hash_from_file() accepts a filename as argument and calculates the hash of the contents of the file:

>>> ppdeep.hash_from_file('.bash_history')
'1536:EXM36dG36x3KW732vOAcg3EP1qKlKozcK0z5G+lEPTssl/7eO7HOBF:tKlKozcWT0'

Installation

$ pip install ppdeep

If you want to use the latest version of the code, you can install it from Git:

$ git clone https://github.com/elceef/ppdeep.git
$ cd ppdeep
$ pip install .

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ppdeep-20260218.tar.gz (4.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ppdeep-20260218-py3-none-any.whl (4.8 kB view details)

Uploaded Python 3

File details

Details for the file ppdeep-20260218.tar.gz.

File metadata

  • Download URL: ppdeep-20260218.tar.gz
  • Upload date:
  • Size: 4.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.5

File hashes

Hashes for ppdeep-20260218.tar.gz
Algorithm Hash digest
SHA256 269f63c1585f580e8016c31e7f737f143a2ddf8440e5b0caf3ca192ce3532a2b
MD5 224115f7e5f723a466ddbdcf2dc2c9f0
BLAKE2b-256 8ae1c270b564c384f2ab430ee94f951101273f42214e51b0d2ff1797b7e306ae

See more details on using hashes here.

File details

Details for the file ppdeep-20260218-py3-none-any.whl.

File metadata

  • Download URL: ppdeep-20260218-py3-none-any.whl
  • Upload date:
  • Size: 4.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.5

File hashes

Hashes for ppdeep-20260218-py3-none-any.whl
Algorithm Hash digest
SHA256 97b24a17e5c9849b9d9af14918e434abc2043733cf619b3e2603737275b954bf
MD5 4ecfab26ad772db5b3b287b06d0503b1
BLAKE2b-256 40ca71a565a0b59b2b30a2dea1e0c030becd6255bbf1ae2c3247b45f896326c9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page