# Fingerprint – Document Fingerprint Generator

## Fingerprint of a document

A fingerprint is a signature of a document: a representative subset of hash values chosen from the set of all hash values of the document. For more detail, see [Winnowing: Local Algorithms for Document Fingerprinting](http://theory.stanford.edu/~aiken/publications/papers/sigmod03.pdf) (specifically Figure 2).
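To make "representative subset" concrete, here is a minimal sketch of the window-selection step described in the winnowing paper: slide a window over the sequence of hash values and keep the minimum of each window, preferring the rightmost one on ties. The function name `winnow` and the sample hash values are illustrative assumptions. This is not the package's own code.

```python
def winnow(hashes, window_len):
    """Return (hash, position) pairs: the rightmost minimum of each window."""
    selected = []
    for start in range(len(hashes) - window_len + 1):
        window = hashes[start:start + window_len]
        smallest = min(window)
        # On ties, prefer the rightmost occurrence of the minimum.
        offset = window_len - 1 - window[::-1].index(smallest)
        candidate = (smallest, start + offset)
        # Neighbouring windows often share a minimum; record each position once.
        if not selected or selected[-1] != candidate:
            selected.append(candidate)
    return selected


# Illustrative hash values, one per k-gram of some document.
hashes = [77, 74, 42, 17, 98, 50, 17, 98, 8, 88, 67, 39, 77, 74, 42, 17, 98]
print(winnow(hashes, window_len=4))
# [(17, 3), (17, 6), (8, 8), (39, 11), (17, 15)]
```

Of the 17 hash values, only 5 are kept, yet every window of 4 consecutive hashes contributes at least one of them.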

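## Super simple to use

Fingerprint is very simple to use.

```python
# Import path assumed: the class is exposed at the top level of the package.
from fingerprint import Fingerprint

f = Fingerprint(kgram_len=4, window_len=5, base=10, modulo=1000)
# Fingerprint a string
print(f.generate(str="adorunrunrunadorunrun"))
# Fingerprint the contents of a file
print(f.generate(fpath="/Users/test/docs/CHANGES.txt"))
```

The default values for the parameters are:

```python
kgram_len = 50
window_len = 100
base = 101
modulo = sys.maxint
```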
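The parameter names suggest a polynomial (Karp-Rabin style) hash computed over every k-gram of the input. The sketch below shows one way `kgram_len`, `base` and `modulo` could fit together under that assumption. The helper `kgram_hashes` is hypothetical, and the package's actual hashing may differ.

```python
def kgram_hashes(text, kgram_len, base, modulo):
    """Hash each k-gram of `text` with a polynomial hash (hypothetical helper)."""
    hashes = []
    for start in range(len(text) - kgram_len + 1):
        value = 0
        for ch in text[start:start + kgram_len]:
            # Treat the k-gram as a number written in the given base, reduced by modulo.
            value = (value * base + ord(ch)) % modulo
        hashes.append(value)
    return hashes


hashes = kgram_hashes("adorunrunrunadorunrun", kgram_len=4, base=10, modulo=1000)
print(len(hashes))              # 18: a 21-character string has 21 - 4 + 1 four-grams
print(hashes[0] == hashes[12])  # True: both positions hold the 4-gram "ador"
```

A small `modulo` such as 1000 keeps the values readable but makes collisions between different k-grams more likely. A large modulus reduces that risk.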

## Install

```sh
pip install fingerprint
```

Release file: fingerprint-0.1.2.tar.gz (3.7 kB, source distribution), uploaded May 15, 2016.
