JaroWinkler
===========

Original, standard and customisable versions of the Jaro-Winkler functions.

<pre>
>>> import jaro
>>> jaro.jaro_winkler_metric(u'SHACKLEFORD', u'SHACKELFORD')
0.9818181
>>> help(jaro)

Help on package jaro:

<strong>NAME</strong>
jaro - Python translation of the original Jaro-Winkler functions.

<strong>DESCRIPTION</strong>
The Jaro-Winkler functions compare two strings and return a score indicating
how closely the strings match. The score ranges from 0 (no match) to 1
(perfect match).

Two null strings ('') will compare as equal. Strings should be unicode
strings, and will be compared as given; the caller is responsible for
capitalisations and trimming leading/trailing spaces.

You should normally only need to use either the jaro_metric() or
jaro_winkler_metric() functions defined here. If you want to implement your
own, non-standard metrics, look at the comments and functions in the jaro.py
submodule.

<strong>PACKAGE CONTENTS</strong>
...
jaro
strcmp95
...

<strong>FUNCTIONS</strong>
<strong>jaro_metric</strong>(string1, string2)
The standard, basic Jaro string metric.

<strong>jaro_winkler_metric</strong>(string1, string2)
The Jaro metric adjusted with Winkler's modification, which boosts
the metric for strings whose prefixes match.

<strong>original_metric</strong>(string1, string2)
The same metric that would be returned from the reference Jaro-Winkler
C code, which takes into account a typo table and adjustments for
longer strings.
...

<strong>custom_metric</strong>(string1, string2, typo_table, typo_scale,
boost_threshold, pre_len, pre_scale, longer_prob)
Calculate the Jaro-Winkler metric with parameters of your own choosing.
...
</pre>
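
To make the scores above easier to interpret, here is a minimal pure-Python sketch of the textbook Jaro metric and Winkler's prefix boost. It is an illustration only, not the package's own code: the names `jaro_sketch`, `jaro_winkler_sketch` and `normalise`, and the default values `prefix_scale=0.1`, `max_prefix=4` and `boost_threshold=0.7`, are assumptions taken from the commonly cited formulation. The package's actual defaults may differ, and the sketch omits the typo table and the longer-string adjustment that `original_metric` applies.

```python
def normalise(text):
    """Hypothetical helper: the caller handles case and surrounding spaces."""
    return text.strip().upper()


def jaro_sketch(s1, s2):
    """Textbook Jaro similarity between two strings (illustrative only)."""
    if s1 == s2:
        return 1.0  # also covers two empty strings comparing as equal
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Two characters match if they are equal and no further apart than this.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Matched characters that line up in a different order are transposed;
    # each swapped pair counts as one transposition.
    seq1 = [c for c, hit in zip(s1, matched1) if hit]
    seq2 = [c for c, hit in zip(s2, matched2) if hit]
    transpositions = sum(a != b for a, b in zip(seq1, seq2)) // 2
    m = float(matches)
    return (m / len1 + m / len2 + (m - transpositions) / m) / 3.0


def jaro_winkler_sketch(s1, s2, prefix_scale=0.1, max_prefix=4,
                        boost_threshold=0.7):
    """Jaro score boosted for a shared prefix (assumed default parameters)."""
    score = jaro_sketch(s1, s2)
    if score <= boost_threshold:
        return score  # low scores are left unboosted
    prefix = 0
    for a, b in zip(s1[:max_prefix], s2[:max_prefix]):
        if a != b:
            break
        prefix += 1
    return score + prefix * prefix_scale * (1.0 - score)


# Close to the 0.9818181 shown in the session above.
print(jaro_winkler_sketch(normalise(u' Shackleford '), normalise(u'SHACKELFORD')))
```

For `SHACKLEFORD` and `SHACKELFORD`, all eleven characters match with one transposition (`LE` against `EL`), giving a Jaro score of about 0.9697. The four-character shared prefix `SHAC` then raises it to about 0.9818.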

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename                       Size     File type  Python version
jaro_winkler-1.0.2-py2.7.egg   35.6 kB  Egg        2.7
jaro_winkler-1.0.2.tar.gz      28.8 kB  Source     None
