The Jaccard index measures exhaustive substring comparison of two strings. This package is a slightly modified Jaccard with pre-calculation accelerate results.
Project description
Jaccard Precalculated String Matcher
The Jaccard index measures exhaustive substring comparison of two strings. This package is a slightly modified Jaccard with pre-calculation accelerate results.
from jaccard_precalc.JaccardPrecalc import JaccardPrecalc
string_list = ['Andrew Matte, 123 Main St, Toronto, Canada']
jac = JaccardPrecalc(string_list)
# jac.search(query_string, number_of_results)
results = jac.search('Andy Matte, Toronto, CA', 1) # returns a list of dicts where each dict is {input: score}, sorted by top score
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
jaccard-precalc-0.1.6.tar.gz
(1.8 kB
view hashes)
Built Distribution
Close
Hashes for jaccard_precalc-0.1.6-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f52ad1a8a6a46dec54eb7f70aedcb8cc7ed397d6d26ef7ee986ce2b6d7e74f07 |
|
MD5 | 1be1dbf386e0879191ff050813ddb70c |
|
BLAKE2b-256 | 0c969edf33e8b5b0227ff8717a35884ae2ea72e70e8c0dcac12047f232d8abbe |