Skip to main content

Document fingerprint generator

Project description

# Fingerprint – Document Fingerprint Generator

## Fingerprint of a document Fingerprint is a signature of the document. In particular, it is a representative subset of hash values from the set of all hash values of a document. For more detail, please consider taking a look at [Winnowing: Local Algorithms for Document Fingerprinting](http://theory.stanford.edu/~aiken/publications/papers/sigmod03.pdf) (specifically Figure 2).

## Super simple to use Fingerprint is very simple to use. `python f = Fingerprint(kgram_len=4, window_len=5, base=10, modulo=1000) print f.generate(str="adorunrunrunadorunrun") print f.generate(fpath="/Users/test/docs/CHANGES.txt") ` The default values for the parameters are `python kgram_len = 50 window_len = 100 base = 101 modulo = sys.maxint `

## Install `sh pip install fingerprint `

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for finger-print, version 0.1.2
Filename, size File type Python version Upload date Hashes
Filename, size finger_print-0.1.2-py3-none-any.whl (4.7 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size finger-print-0.1.2.tar.gz (4.1 kB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page