Skip to main content

Fast string similarity algorithms in Rust

Project description

marhta

License

Fast string similarity algorithms

This library is still in development and may not be stable.

Installation

pip install marhta

Usage

from marhta import levenshtein_similarity, jaro_winkler_similarity

# Calculate string similarities
print(levenshtein_similarity("hello", "helo"))  # 0.8
print(jaro_winkler_similarity("martha", "marhta"))  # 0.961

# Find best matches
from marhta import levenshtein_match
strings = ["apple", "banana", "orange", "pear"]
matches = levenshtein_match("aple", strings)
print(matches)  # [("apple", 0.8), ("pear", 0.5)]

Features

  • Levenshtein distance and similarity measures
  • Jaro-Winkler distance and similarity measures
  • Fuzzy string matching with customizable thresholds

Performance

Written in Rust for improved performance, while maintaining a Pythonic API.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

marhta-0.1.3-cp312-cp312-manylinux_2_34_x86_64.whl (256.7 kB view details)

Uploaded CPython 3.12manylinux: glibc 2.34+ x86-64

marhta-0.1.3-cp311-cp311-manylinux_2_34_x86_64.whl (256.6 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.34+ x86-64

marhta-0.1.3-cp310-cp310-manylinux_2_34_x86_64.whl (256.6 kB view details)

Uploaded CPython 3.10manylinux: glibc 2.34+ x86-64

marhta-0.1.3-cp39-cp39-manylinux_2_34_x86_64.whl (256.5 kB view details)

Uploaded CPython 3.9manylinux: glibc 2.34+ x86-64

marhta-0.1.3-cp38-cp38-manylinux_2_34_x86_64.whl (256.5 kB view details)

Uploaded CPython 3.8manylinux: glibc 2.34+ x86-64

File details

Details for the file marhta-0.1.3-cp312-cp312-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for marhta-0.1.3-cp312-cp312-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 2fca4606ada34846ec6cfe18858678835f00d123099afcc9610c060647f526d2
MD5 609f842d6da95933bc51535e6758ab87
BLAKE2b-256 e14f7ce9f9519f0783c398f56bdc9a454cf34c372932cc813ed48473c520327a

See more details on using hashes here.

File details

Details for the file marhta-0.1.3-cp311-cp311-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for marhta-0.1.3-cp311-cp311-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 3e38663f71a336e9184a2ba28ee515482cfd0f12b78b7d1c1dd02cf0ae187c54
MD5 553c5815c8afe7264c5fb8f0cd2a147f
BLAKE2b-256 03a6bf91bef973e04e68e44d2f55c52b152da5fa14d429d79a756f41d59b3c7b

See more details on using hashes here.

File details

Details for the file marhta-0.1.3-cp310-cp310-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for marhta-0.1.3-cp310-cp310-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 e41965ba64169b05d14abc15ac86c2cf2f2f541c3730738b7c15ba6db9c24d85
MD5 d7dac76359370dc66fb011a2ac0b01a6
BLAKE2b-256 d28c97e6098fcc4b2009c9b8dd54f14752fe216709788257685daab02a1d3f39

See more details on using hashes here.

File details

Details for the file marhta-0.1.3-cp39-cp39-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for marhta-0.1.3-cp39-cp39-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 0d8f0afbd06ba830806affa46d507bde6b09db55f8218301a38f5f4ffba1aee8
MD5 5c8cf0f049826ff9943a2bfba24d1be6
BLAKE2b-256 890d850dc2438acc29ca99317078e2f452ac54c3ffb8edde1727aa2d3a9af8f5

See more details on using hashes here.

File details

Details for the file marhta-0.1.3-cp38-cp38-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for marhta-0.1.3-cp38-cp38-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 628469d0c7b73b75b877f7844d33598d32754aa5bacceee1224babd5b430436e
MD5 f2283078809943031d086464605a3f16
BLAKE2b-256 c5c47c21582d84ef297d62b3cb66d86ed15977cc625d23be6c8f38c911ba0d33

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page