Document fingerprint generator
Fingerprint -- Document Fingerprint Generator
A fingerprint is a signature of a document. More precisely, it is a representative subset of hash values chosen from the set of all hash values of the document. For more detail, see Winnowing: Local Algorithms for Document Fingerprinting (specifically Figure 2).
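As a rough illustration of how a representative subset of hashes can be chosen, here is a minimal sketch of the window-minimum selection rule described in the Winnowing paper. It is not the fingerprint module's own code: the function name winnow is hypothetical, and the input is a hypothetical sequence of k-gram hash values.

```python
def winnow(hashes, window_len):
    """Return (hash, position) pairs selected by a window-minimum rule.

    For every window of `window_len` consecutive hashes, the minimum is
    selected; on ties the rightmost occurrence wins. A position is
    recorded only once, even if it is the minimum of several windows.
    """
    selected = []
    last_pos = -1  # position of the most recently recorded hash
    for start in range(len(hashes) - window_len + 1):
        window = hashes[start:start + window_len]
        smallest = min(window)
        # Index of the rightmost occurrence of the minimum in this window.
        offset = window_len - 1 - window[::-1].index(smallest)
        pos = start + offset
        if pos != last_pos:
            selected.append((smallest, pos))
            last_pos = pos
    return selected


# Hypothetical sequence of k-gram hashes, for illustration only.
hashes = [77, 74, 42, 17, 98, 50, 17, 98, 8, 88, 67, 39, 77, 74, 42, 17, 98]
fingerprint = winnow(hashes, window_len=4)
print(fingerprint)
# [(17, 3), (17, 6), (8, 8), (39, 11), (17, 15)]
```

Only 5 of the 17 hashes survive, yet every window of 4 consecutive hashes contributes at least one of them, which is what makes the subset representative.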
Fingerprint Module Installation
The recommended way to install the fingerprint module is to simply use pip:
$ pip install fingerprint
Fingerprint officially supports Python >= 3.0.
How to use fingerprint?
>>> from fingerprint import Fingerprint
>>> fprint = Fingerprint(kgram_len=4, window_len=5, base=10, modulo=1000)
>>> fprint.generate(str="adorunrunrunadorunrun")
>>> fprint.generate(fpath="../CHANGES.txt")
The default values for the parameters are:
kgram_len = 50
window_len = 100
base = 101
modulo = sys.maxint
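To give a feel for what these parameters control, the sketch below hashes every k-gram of a string with a polynomial rolling hash (Karp-Rabin style). This is an assumption about how kgram_len, base and modulo are typically used in this kind of scheme, not a description of the module's internals; the function name kgram_hashes is hypothetical, and any text preprocessing the module may perform is ignored here.

```python
def kgram_hashes(text, kgram_len, base, modulo):
    """Hash every k-gram of `text` with a polynomial rolling hash.

    Each k-gram c0 c1 ... c(k-1) is hashed as
    (ord(c0) * base**(k-1) + ... + ord(c(k-1))) % modulo.
    """
    if len(text) < kgram_len:
        return []
    # Weight of the leading character, used when it leaves the k-gram.
    high = pow(base, kgram_len - 1, modulo)
    h = 0
    for ch in text[:kgram_len]:
        h = (h * base + ord(ch)) % modulo
    hashes = [h]
    for i in range(kgram_len, len(text)):
        outgoing = ord(text[i - kgram_len])
        # Drop the outgoing character, shift, then add the incoming one.
        h = ((h - outgoing * high) * base + ord(text[i])) % modulo
        hashes.append(h)
    return hashes


# Same parameter values as in the usage example above.
hashes = kgram_hashes("adorunrunrunadorunrun", kgram_len=4, base=10, modulo=1000)
print(len(hashes))  # 18 k-grams in a 21-character string
```

In a scheme like this, a larger kgram_len ignores shorter coincidental matches, and a larger modulo reduces hash collisions. window_len would then control how many consecutive k-gram hashes compete for selection into the fingerprint.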
Download files
Source Distribution
fingerprint-0.1.4.tar.gz (4.1 kB)
Built Distribution
Hashes for fingerprint-0.1.4-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | c3c6dff6af64997f2e533596cfdfd69fb4a3ffb0fe1a5eed3d27fd050d60d3aa
MD5 | 6abd9371786088d6dc291c7bce51dd75
BLAKE2b-256 | 16249da3bdba7d4c9e04612f05f483c21074eda390781cc1de0eef5dbb5ce741