Skip to main content

Levenshtein edit distance library

Project description

Levenshtein edit distance library for Python, Apache-licensed. Written by Lars Buitinck, Netherlands eScience Center, with contributions from Isaac Sijaranamual, University of Amsterdam.

Performs distance computations on either byte strings or Unicode codepoints.

Installation

Make sure you have Cython and a C++ compiler installed:

pip install cython

Installing a C++ compiler is so platform-dependent that I won’t show instructions. Consult your package manager.

Then:

python setup.py install

To run the tests, but not to actually use leven, you need six and Nose.

Usage

>>> from leven import levenshtein
>>> levenshtein("hello, world!", "goodbye, cruel world!")
13

About the implementation

The core algorithms have been implemented in C++. I used this instead of C to get templates, easier memory management and a better standard library, so the C++ code probably looks C-ish.

Todo

  • Implement Ukkonen’s algorithm for bounded Levenshtein distance

  • Implement Levenshtein automata for fast neighbor search in string spaces

  • Implement weighted Levenshtein distance

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

leven-1.0.4.tar.gz (20.2 kB view details)

Uploaded Source

File details

Details for the file leven-1.0.4.tar.gz.

File metadata

  • Download URL: leven-1.0.4.tar.gz
  • Upload date:
  • Size: 20.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for leven-1.0.4.tar.gz
Algorithm Hash digest
SHA256 732474a3b9e94003aab61fd57eb2cbdaf0ed32f9a85b63073bcc4fa1ed5d0aff
MD5 65bcec29cec3ccd8a053c97bd2a711e3
BLAKE2b-256 730237084115516cfd595ee2f9a873fffe8b85c6b1538523ff6a8b8dd7ff7d46

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page