Document fingerprint generator
Fingerprint -- Document Fingerprint Generator
Fingerprint of a document
A fingerprint is a signature of a document: a representative subset of the hash values computed over the whole document. For details, see the paper "Winnowing: Local Algorithms for Document Fingerprinting" (specifically Figure 2).
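To make "representative subset of hash values" concrete, here is a minimal sketch of the selection step that winnowing performs: slide a window over the sequence of k-gram hashes, keep the minimum of each window, and record it only when its position changes. It illustrates the general technique, not this package's internal code. The function name `winnow` and the sample hash list are illustrative assumptions.

```python
def winnow(hashes, window_len):
    """Return (hash, position) pairs chosen by winnowing.

    Illustrative sketch only; not the package's own implementation.
    """
    selected = []
    last_pos = -1
    for start in range(len(hashes) - window_len + 1):
        window = hashes[start:start + window_len]
        smallest = min(window)
        # On ties, take the rightmost occurrence of the minimum.
        offset = window_len - 1 - window[::-1].index(smallest)
        pos = start + offset
        # Record a fingerprint only when a new position is selected.
        if pos != last_pos:
            selected.append((smallest, pos))
            last_pos = pos
    return selected


# Illustrative sequence of k-gram hash values.
hashes = [77, 74, 42, 17, 98, 50, 17, 98, 8, 88, 67, 39, 77, 74, 42, 17, 98]
fingerprints = winnow(hashes, window_len=4)
print(fingerprints)
# [(17, 3), (17, 6), (8, 8), (39, 11), (17, 15)]
```

Seventeen hash values are reduced to five, and any repeated run at least as long as one window is guaranteed to contribute at least one selected value.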
Super simple to use
Create a Fingerprint object with the hashing parameters, then call generate with either a string or a file path.
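# Import path is an assumption; adjust it if the class is exposed elsewhere.
from fingerprint import Fingerprint

f = Fingerprint(kgram_len=4, window_len=5, base=10, modulo=1000)
# Fingerprint an in-memory string.
print(f.generate(str="adorunrunrunadorunrun"))
# Fingerprint the contents of a file.
print(f.generate(fpath="/Users/test/docs/CHANGES.txt"))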
The default values for the parameters are:
kgram_len = 50
window_len = 100
base = 101
modulo = sys.maxint
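The parameter names suggest a Karp-Rabin style polynomial hash over every substring of length kgram_len, with base as the polynomial base and modulo as the reduction modulus. That reading is an assumption, since the package internals are not shown here. Under that assumption, a rough sketch of how the k-gram hashes could be computed looks like this (the function name `kgram_hashes` is hypothetical):

```python
def kgram_hashes(text, kgram_len, base, modulo):
    """Polynomial hash of every k-gram in text, using a rolling update.

    Hypothetical helper; assumes base/modulo parameterize a Karp-Rabin hash.
    """
    if len(text) < kgram_len:
        return []
    # Weight of the leading character inside a k-gram.
    high = pow(base, kgram_len - 1, modulo)
    h = 0
    for ch in text[:kgram_len]:
        h = (h * base + ord(ch)) % modulo
    hashes = [h]
    for i in range(kgram_len, len(text)):
        # Drop the outgoing character, shift, then add the incoming one.
        h = (h - ord(text[i - kgram_len]) * high) % modulo
        h = (h * base + ord(text[i])) % modulo
        hashes.append(h)
    return hashes


text = "adorunrunrunadorunrun"
rolling = kgram_hashes(text, kgram_len=4, base=10, modulo=1000)
print(len(rolling))  # one hash per k-gram
```

A larger kgram_len makes matches more specific, while window_len controls how many of these hashes compete for selection in each window. A small modulo, such as 1000, keeps the numbers readable but raises the chance that different k-grams collide.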
Install
pip install fingerprint
Source Distribution
fingerprint-0.1.3.tar.gz (4.1 kB)
Built Distribution
Hashes for fingerprint-0.1.3-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | f6f045ea607ecd8e2979f807e29eb2caa094b51e1ede3a29d63879db55aeef93
MD5 | db5fb5e0efb69d22b98f6831dcd87fe1
BLAKE2b-256 | d578b5e4bac2a9aa43cb856598f17d209a812cf0f665d0ec187ed84203838393