Original, standard and customisable versions of the Jaro-Winkler functions.
Project description
JaroWinkler
===========
<pre>
>>> import jaro
>>> jaro.jaro_winkler_metric(u'SHACKLEFORD', u'SHACKELFORD')
0.9818181
>>> help(jaro)
Help on package jaro:
<strong>NAME</strong>
jaro - Python translation of the original Jaro-Winkler functions.
<strong>DESCRIPTION</strong>
The Jaro-Winkler functions compare two strings and return a score indicating
how closely the strings match. The score ranges from 0 (no match) to 1
(perfect match).
Two null strings ('') will compare as equal. Strings should be unicode
strings, and will be compared as given; the caller is responsible for
capitalisations and trimming leading/trailing spaces.
You should normally only need to use either the jaro_metric() or
jaro_winkler_metric() functions defined here. If you want to implement your
own, non-standard metrics, look at the comments and functions in the jaro.py
submodule.
<strong>PACKAGE CONTENTS</strong>
...
jaro
strcmp95
...
<strong>FUNCTIONS</strong>
<strong>jaro_metric</strong>(string1, string2)
The standard, basic Jaro string metric.
<strong>jaro_winkler_metric</strong>(string1, string2)
The Jaro metric adjusted with Winkler's modification, which boosts
the metric for strings whose prefixes match.
<strong>original_metric</strong>(string1, string2)
The same metric that the reference Jaro-Winkler C code would return,
taking into account a typo table and adjustments for longer strings.
...
<strong>custom_metric</strong>(string1, string2, typo_table, typo_scale,
boost_threshold, pre_len, pre_scale, longer_prob)
Calculate the Jaro-Winkler metric with parameters of your own choosing.
...
</pre>
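To make the description above concrete, here is a minimal sketch of the textbook Jaro and Jaro-Winkler calculations. It is not the package's own code: the names `jaro_sketch` and `jaro_winkler_sketch` are hypothetical, and the defaults (prefix length 4, prefix scale 0.1, boost threshold 0.7) are the commonly cited Winkler values, assumed here rather than taken from the package. The typo table and long-string adjustments used by `original_metric` are deliberately left out.

```python
def jaro_sketch(s1, s2):
    """Plain Jaro similarity in [0, 1] (illustrative, not the package's code)."""
    if not s1 and not s2:
        return 1.0  # two empty strings compare as equal, as the description states
    if not s1 or not s2:
        return 0.0
    # Characters count as matching only if they sit within this distance.
    window = max(max(len(s1), len(s2)) // 2 - 1, 0)
    used = [False] * len(s2)
    matched1 = []
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len(s2), i + window + 1)
        for j in range(lo, hi):
            if not used[j] and s2[j] == ch:
                used[j] = True
                matched1.append(ch)
                break
    m = len(matched1)
    if m == 0:
        return 0.0
    matched2 = [s2[j] for j in range(len(s2)) if used[j]]
    # A transposition is a pair of matched characters that appear out of order.
    transpositions = sum(a != b for a, b in zip(matched1, matched2)) // 2
    m = float(m)
    return (m / len(s1) + m / len(s2) + (m - transpositions) / m) / 3.0


def jaro_winkler_sketch(s1, s2, prefix_len=4, prefix_scale=0.1,
                        boost_threshold=0.7):
    """Jaro score boosted for a shared prefix (parameter names are assumptions)."""
    score = jaro_sketch(s1, s2)
    if score <= boost_threshold:
        return score  # weak matches get no prefix boost
    prefix = 0
    for a, b in zip(s1[:prefix_len], s2[:prefix_len]):
        if a != b:
            break
        prefix += 1
    return score + prefix * prefix_scale * (1.0 - score)


# The caller handles case and surrounding whitespace before comparing.
a = u'  Shackleford '.strip().upper()
b = u'SHACKELFORD'
print(round(jaro_winkler_sketch(a, b), 7))
```

Under these assumptions the sketch gives about 0.98182 for the SHACKLEFORD/SHACKELFORD pair, in line with the output shown above. For real use, prefer the package's `jaro_metric()` and `jaro_winkler_metric()` functions.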
Download files
Source Distribution
jaro_winkler-1.0.2.tar.gz (28.8 kB)
Built Distribution
jaro_winkler-1.0.2-py2.7.egg (35.6 kB)
File details
Details for the file jaro_winkler-1.0.2.tar.gz.
File metadata
- Download URL: jaro_winkler-1.0.2.tar.gz
- Upload date:
- Size: 28.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | f418e1f5f7604d6edcce0515c678c9800dee097641c495b4fefb3006f4c5bac9
MD5 | f6695cd5f2452111378858ead99c5fb4
BLAKE2b-256 | c08ec52f2fd80c57131431d67c4ef61521119033751dd64f5adc453f4d795385
File details
Details for the file jaro_winkler-1.0.2-py2.7.egg.
File metadata
- Download URL: jaro_winkler-1.0.2-py2.7.egg
- Upload date:
- Size: 35.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | 1873bbc171be5fdb45e03934f01dd5e305858462733c93cfcd2cd4b79249804e
MD5 | 671062dabc09ef4f7825565ebe18e513
BLAKE2b-256 | 7b495d24f59e4f6954fcd222ceb81863d6d55bce24188db946ccde0395290363