Document fingerprint generator
Project description
# Fingerprint – Document Fingerprint Generator
## Fingerprint of a document Fingerprint is a signature of the document. In particular, it is a representative subset of hash values from the set of all hash values of a document. For more detail, please consider taking a look at [Winnowing: Local Algorithms for Document Fingerprinting](http://theory.stanford.edu/~aiken/publications/papers/sigmod03.pdf) (specifically Figure 2).
## Super simple to use Fingerprint is very simple to use. `python f = Fingerprint(kgram_len=4, window_len=5, base=10, modulo=1000) print f.generate(str="adorunrunrunadorunrun") print f.generate(fpath="/Users/test/docs/CHANGES.txt") ` The default values for the parameters are `python kgram_len = 50 window_len = 100 base = 101 modulo = sys.maxint `
## Install `sh pip install fingerprint `
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for finger_print-0.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | aaaac87ebeb6a258d7c0f2cb9ac15c707660dd8795dfe420be19230604ce92a0 |
|
MD5 | 4db5b8d6d1726c603864b161d7ef7948 |
|
BLAKE2b-256 | 750cdfd8cdb4fb01bdafe75006ca63f4eb8126655482cc6332efcfbe3bc68020 |