Python extension computing string distances and similarities.

Reason this release was yanked:

Insecure, upgrade to 0.12.1

Project description

Introduction

The Levenshtein Python C extension module contains functions for fast computation of

  • Levenshtein (edit) distance, and edit operations

  • string similarity

  • approximate median strings, and generally string averaging

  • string sequence and set similarity

It supports both normal and Unicode strings.
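To make the first item concrete, here is a minimal pure-Python sketch of the Levenshtein (edit) distance using the standard row-by-row dynamic programme. It is an illustration only, not the extension's C implementation or its API; the function name edit_distance is hypothetical and chosen for this example.

```python
def edit_distance(a, b):
    """Return the Levenshtein distance between sequences a and b.

    Insertions, deletions and substitutions each cost 1. This is a
    reference sketch, not the code used by the C extension.
    """
    if len(a) < len(b):
        a, b = b, a  # iterate over the longer one, keep rows short
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty prefix takes i deletions
        for j, char_b in enumerate(b, start=1):
            substitution = previous[j - 1] + (0 if char_a == char_b else 1)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


print(edit_distance("kitten", "sitting"))  # → 3
```

The sketch works on any pair of sequences with comparable items, so byte strings and Unicode strings are handled the same way, as long as both arguments are of the same type.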

Python 2.2 or newer is required.

StringMatcher.py is an example SequenceMatcher-like class built on top of Levenshtein. It lacks some of SequenceMatcher’s functionality and adds some extras of its own.
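For orientation, the sketch below shows the standard library's difflib.SequenceMatcher, the class that StringMatcher is modeled on. It does not use StringMatcher itself, whose exact methods are not listed here; the helper name describe_edits is hypothetical.

```python
from difflib import SequenceMatcher


def describe_edits(a, b):
    """Return (ratio, opcodes) describing how to turn a into b."""
    matcher = SequenceMatcher(None, a, b)  # None: no "junk" filter
    return matcher.ratio(), matcher.get_opcodes()


source, target = "spam", "park"
ratio, opcodes = describe_edits(source, target)
print("similarity ratio:", ratio)
for tag, i1, i2, j1, j2 in opcodes:
    # tag is one of 'equal', 'replace', 'delete', 'insert'
    print(tag, repr(source[i1:i2]), "->", repr(target[j1:j2]))
```

A SequenceMatcher-like class would be expected to offer a similar ratio and opcode interface, but check StringMatcher.py for what it actually provides.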

Levenshtein.c can also be used as a pure C library. You only have to define the NO_PYTHON preprocessor symbol (-DNO_PYTHON) when compiling it. The functionality is similar to that of the Python extension. No separate documentation is provided yet, so read the source. The two builds are not interchangeable, however:

  • C functions exported when compiling with -DNO_PYTHON (see Levenshtein.h) are not exported when compiling as a Python extension (and vice versa)

  • The Unicode character type used with -DNO_PYTHON is wchar_t, while the Python extension uses Py_UNICODE; they may be the same, but don’t count on it

Documentation

gendoc.sh generates HTML API documentation. You probably want the self-contained rather than the includable version, so run ./gendoc.sh --selfcontained. It requires genextdoc.py and an already installed Levenshtein.

License

Levenshtein can be copied and/or modified under the terms of the GNU General Public License; see the file COPYING for the full license text.

History

This package was long missing from PyPI and was available only as a source checkout. We needed to restore it for the Go Mobile for Plone and Pywurfl projects, which depend on it.

The project is not under active development as far as the maintainer knows.

Source code

Authors

  • Maintainer: Mikko Ohtamaa <mikko at mfabrik.com>

  • David Necas (Yeti) <yeti at physics.muni.cz>

Changelog

10.1 - 10.2

  • Made python-Levenshtein Git compatible and made it use setuptools for PyPI upload

  • Created HISTORY.txt and made README reST compatible

Download files

Source Distribution

python-Levenshtein-0.10.2.tar.gz (45.0 kB)

File hashes

Hashes for python-Levenshtein-0.10.2.tar.gz

Algorithm    Hash digest
SHA256       49a3b3c3210157e2070eb46c0713e64f409efc8c9a7520632ddf16f8a9508bed
MD5          c8af7296dc640abdf511614ee677bbb8
BLAKE2b-256  323c46cd4e5b41d46ad309372b9b5de70776aa66d5db02bafb3444782b86a23c
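
A downloaded archive can be checked against the published SHA256 digest with Python's standard hashlib module. The following is a minimal sketch; the helper names sha256_of and matches_published_hash are hypothetical, and the file path is whatever location the archive was saved to.

```python
import hashlib

# SHA256 digest published for python-Levenshtein-0.10.2.tar.gz
EXPECTED_SHA256 = "49a3b3c3210157e2070eb46c0713e64f409efc8c9a7520632ddf16f8a9508bed"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at path, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """True if the file's SHA256 digest equals the expected hex digest."""
    return sha256_of(path) == expected.lower()
```

For example, matches_published_hash("python-Levenshtein-0.10.2.tar.gz") should return True only for an unmodified copy of the archive. A matching hash confirms the file's integrity; it says nothing about whether the release itself is safe to use.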
