a library for doing approximate and phonetic matching of strings.
Project description
Jellyfish is a python library for doing approximate and phonetic matching of strings.
Written by James Turk <james.p.turk@gmail.com> and Michael Stephens.
See https://github.com/jamesturk/jellyfish/graphs/contributors for contributors.
Source is available at http://github.com/jamesturk/jellyfish.
Included Algorithms
String comparison:
Levenshtein Distance
Damerau-Levenshtein Distance
Jaro Distance
Jaro-Winkler Distance
Match Rating Approach Comparison
Hamming Distance
Phonetic encoding:
American Soundex
Metaphone
NYSIIS (New York State Identification and Intelligence System)
Match Rating Codex
Example Usage
>>> import jellyfish >>> jellyfish.levenshtein_distance(u'jellyfish', u'smellyfish') 2 >>> jellyfish.jaro_distance(u'jellyfish', u'smellyfish') 0.89629629629629637 >>> jellyfish.damerau_levenshtein_distance(u'jellyfish', u'jellyfihs') 1
>>> jellyfish.metaphone(u'Jellyfish') 'JLFX' >>> jellyfish.soundex(u'Jellyfish') 'J412' >>> jellyfish.nysiis(u'Jellyfish') 'JALYF' >>> jellyfish.match_rating_codex(u'Jellyfish') 'JLLFSH'
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for jellyfish-0.5.6-cp27-none-macosx_10_11_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d0b22034df56d5faa34928457655722acf385d122da59c2cf7f4acd6bd49ea8b |
|
MD5 | d8cad247e707df994afba9f5f1571b3d |
|
BLAKE2b-256 | a4b5b96a9d4377466fedc512a232d4bc8faadc60281ca32a7d7b464aee45d884 |