Original, standard and customisable versions of the Jaro-Winkler functions.
Project description
JaroWinkler
>>> import jaro
>>> jaro.jaro_winkler_metric(u'SHACKLEFORD', u'SHACKELFORD')
0.9818181
>>> help(jaro)
Help on package jaro:

NAME
    jaro - Python translation of the original Jaro-Winkler functions.

DESCRIPTION
    The Jaro-Winkler functions compare two strings and return a score
    indicating how closely the strings match. The score ranges from 0
    (no match) to 1 (perfect match).

    Two null strings ('') will compare as equal. Strings should be
    unicode strings, and will be compared as given; the caller is
    responsible for capitalisations and trimming leading/trailing
    spaces.

    You should normally only need to use either the jaro_metric() or
    jaro_winkler_metric() functions defined here. If you want to
    implement your own, non-standard metrics, look at the comments and
    functions in the jaro.py submodule.

PACKAGE CONTENTS
    ...
    jaro
    strcmp95
    ...

FUNCTIONS
    jaro_metric(string1, string2)
        The standard, basic Jaro string metric.

    jaro_winkler_metric(string1, string2)
        The Jaro metric adjusted with Winkler's modification, which
        boosts the metric for strings whose prefixes match.

    original_metric(string1, string2)
        The same metric that would be returned from the reference
        Jaro-Winkler C code, taking as it does into account a typo
        table and adjustments for longer strings.

    ...

    custom_metric(string1, string2, typo_table, typo_scale, boost_threshold, pre_len, pre_scale, longer_prob)
        Calculate the Jaro-Winkler metric with parameters of your own
        choosing.

    ...
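To make the scores above easier to interpret, here is a minimal, self-contained sketch of the textbook Jaro and Jaro-Winkler calculations in Python 3. It is not the package's code: the names `jaro_sketch` and `jaro_winkler_sketch` are hypothetical, and the defaults for `prefix_scale` (0.1), `max_prefix` (4) and `boost_threshold` (0.7) are the commonly cited values, assumed here rather than taken from the package. It leaves out the typo table and the long-string adjustment that the help text attributes to original_metric().

```python
def jaro_sketch(s1, s2):
    """Basic Jaro similarity in [0, 1]. Illustrative only, not the package's code."""
    if s1 == s2:
        return 1.0  # also covers two empty strings, which compare as equal
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0

    # Two equal characters count as a match only within this distance.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0

    # Matched characters that line up in a different order are transpositions.
    seq1 = [c for c, m in zip(s1, matched1) if m]
    seq2 = [c for c, m in zip(s2, matched2) if m]
    transpositions = sum(a != b for a, b in zip(seq1, seq2)) // 2

    return (matches / len1
            + matches / len2
            + (matches - transpositions) / matches) / 3.0


def jaro_winkler_sketch(s1, s2, prefix_scale=0.1, max_prefix=4,
                        boost_threshold=0.7):
    """Jaro score boosted for a shared prefix (assumed default parameters)."""
    score = jaro_sketch(s1, s2)
    if score <= boost_threshold:
        return score  # weak matches get no prefix boost

    # Length of the common prefix, capped at max_prefix characters.
    prefix = 0
    for a, b in zip(s1, s2):
        if a != b or prefix == max_prefix:
            break
        prefix += 1
    return score + prefix * prefix_scale * (1.0 - score)


if __name__ == "__main__":
    print(round(jaro_winkler_sketch("SHACKLEFORD", "SHACKELFORD"), 4))  # 0.9818
    print(jaro_winkler_sketch("", ""))  # 1.0
```

Under these assumptions the sketch agrees with the SHACKLEFORD/SHACKELFORD score shown in the session above to the displayed precision. As the help text notes, any case folding or whitespace trimming has to happen before the strings are compared.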
Download files
Source Distribution
File details
Details for the file jaro_winkler-2.0.1.linux-x86_64.tar.gz.
File metadata
- Download URL: jaro_winkler-2.0.1.linux-x86_64.tar.gz
- Upload date:
- Size: 30.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.38.0 CPython/3.6.9
File hashes
Algorithm | Hash digest
---|---
SHA256 | c970834c8138dcb21b73412f442f09b37a2f532c4a46bff17cbf1741fa874243
MD5 | a56949d0fc914eb877f9b144d8dec279
BLAKE2b-256 | 334d1df12987bcdba89f49501c471ad6a00d9e99142d6d2470f05fd59e9c4d88