Document fingerprint generator
Fingerprint -- Document Fingerprint Generator
A fingerprint is a signature of a document: a representative subset of hash values selected from the set of all hash values of that document. For more detail, see "Winnowing: Local Algorithms for Document Fingerprinting" (specifically Figure 2).
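To make "representative subset of hash values" concrete, here is a minimal sketch of the window-selection step used in winnowing. It is an illustration under stated assumptions, not the module's actual implementation: the function name `winnow` and the input hash sequence are hypothetical, and the tie-breaking rule (rightmost minimum) is one common choice.

```python
def winnow(hashes, window_len):
    """Return (hash, position) pairs: the minimum of each sliding window.

    Hypothetical helper for illustration; not part of the fingerprint module.
    """
    selected = []
    for start in range(len(hashes) - window_len + 1):
        window = hashes[start:start + window_len]
        smallest = min(window)
        # On ties, take the rightmost occurrence of the minimum.
        offset = window_len - 1 - window[::-1].index(smallest)
        pick = (smallest, start + offset)
        # Neighbouring windows often share their minimum; record it once.
        if not selected or selected[-1] != pick:
            selected.append(pick)
    return selected


# Illustrative sequence of k-gram hashes (made-up values).
example_hashes = [77, 74, 42, 17, 98, 50, 17, 98, 8, 88, 67, 39, 77, 74, 42, 17, 98]
print(winnow(example_hashes, window_len=4))
# [(17, 3), (17, 6), (8, 8), (39, 11), (17, 15)]
```

Only 5 of the 17 hashes survive, yet every window of 4 consecutive hashes contributes its minimum, which is why the subset remains representative of the whole document.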
Fingerprint Module Installation
The recommended way to install the fingerprint module is with pip:
$ pip install fingerprint
Fingerprint officially supports Python >= 3.0.
How to use fingerprint?
>>> import fingerprint
>>> fprint = fingerprint.Fingerprint(kgram_len=4, window_len=5, base=10, modulo=1000)
>>> fprint.generate(str="adorunrunrunadorunrun")
>>> fprint.generate(fpath="../CHANGES.txt")
The default values for the parameters are:
kgram_len = 50
window_len = 100
base = 101
modulo = sys.maxint
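The role of these parameters can be sketched with a polynomial rolling hash over k-grams. This is an assumption about how `kgram_len`, `base` and `modulo` interact, not a copy of the module's code; the function name `kgram_hashes` is hypothetical. Note that `sys.maxint` exists only in Python 2, so the sketch uses `sys.maxsize` as a Python 3 stand-in, which is also an assumption.

```python
import sys


def kgram_hashes(text, kgram_len, base=101, modulo=sys.maxsize):
    """Hash every k-gram of `text` with a polynomial rolling hash.

    Hypothetical helper for illustration; not part of the fingerprint module.
    """
    if len(text) < kgram_len:
        return []
    # Weight of the leading character in a k-gram: base^(k-1) mod modulo.
    high = pow(base, kgram_len - 1, modulo)
    h = 0
    for ch in text[:kgram_len]:
        h = (h * base + ord(ch)) % modulo
    hashes = [h]
    for i in range(kgram_len, len(text)):
        # Drop the outgoing character, then shift and add the incoming one.
        h = (h - ord(text[i - kgram_len]) * high) % modulo
        h = (h * base + ord(text[i])) % modulo
        hashes.append(h)
    return hashes


# Same parameters as the usage example above: 4-grams, base 10, modulo 1000.
hashes = kgram_hashes("adorunrunrunadorunrun", kgram_len=4, base=10, modulo=1000)
print(len(hashes))  # 18
print(hashes[0] == hashes[12])  # True: both positions hold the 4-gram "ador"
```

A text of n characters yields n - kgram_len + 1 hashes, and `window_len` then controls how many consecutive hashes compete for selection into the fingerprint.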