The Jaccard index measures exhaustive substring comparison of two strings. This package is a slightly modified Jaccard with pre-calculation accelerate results.
Project description
Jaccard Precalculated String Matcher
The Jaccard index measures exhaustive substring comparison of two strings. This package is a slightly modified Jaccard with pre-calculation accelerate results.
from jaccard_precalc.JaccardPrecalc import JaccardPrecalc
string_list = ['Andrew Matte, 123 Main St, Toronto, Canada']
jac = JaccardPrecalc(string_list)
# jac.search(query_string, number_of_results)
results = jac.search('Andy Matte, Toronto, CA', 1) # returns a list of dicts where each dict is {input: score}, sorted by top score
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
jaccard-precalc-0.1.5.tar.gz
(1.8 kB
view hashes)
Built Distribution
Close
Hashes for jaccard_precalc-0.1.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b2bc23ea04c4f32cf04ffcf850c4d393b05aa7f2b750d04297ff4ce92e658bc3 |
|
MD5 | f85f346c42dc4ff676929e5fa2bf4f8e |
|
BLAKE2b-256 | 206728c939c42f80ea59b69f23c7f5d648c15b9ee73a124958d24de6bd45fb05 |