Skip to main content

High-performance Amharic phonetic query corrector

Project description

Amharic Corrector

High-performance phonetic query correction library for Amharic.

Changelog

1.0.4

  • Single-word correction now uses phonetic skeleton similarity (consonant LCS) instead of frequency to rank candidates — produces phonetically correct results regardless of corpus frequency bias.
  • Removed Levenshtein from the inner skeleton index loop (was causing ~35% speed regression); kept it only in the transliteration scoring path where it adds precision.
  • Multi-word correction unchanged: co-occurrence / bigram context still drives ranking as intended.

1.0.3

  • Internal release.

1.0.2

  • Added instance-level caching to AmharicQueryCorrector to prevent shared state when multiple instances are used with different corpuses.
  • Minor performance improvements in candidate lookup.

Installation

pip install .

Usage

from amharic_corrector import AmharicQueryCorrector
c = AmharicQueryCorrector("path/to/corpus.json")
print(c.correct_phrase("mekanitu"))

CLI

amharic-correct "Kuratie" --model path/to/corpus.json

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

amharic_corrector-1.0.4.tar.gz (12.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

amharic_corrector-1.0.4-py3-none-any.whl (12.2 kB view details)

Uploaded Python 3

File details

Details for the file amharic_corrector-1.0.4.tar.gz.

File metadata

  • Download URL: amharic_corrector-1.0.4.tar.gz
  • Upload date:
  • Size: 12.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for amharic_corrector-1.0.4.tar.gz
Algorithm Hash digest
SHA256 6c08f5d97021d168c891e5e01bda6c59f4a699c51cb9a1ac349dc740bc374439
MD5 ba9b6643776ffc3009627b46f8f94295
BLAKE2b-256 f0f7a6fd66d3ab0fabbcb0e04c79ccaa55ed07448c34fea33e0076cab90322d6

See more details on using hashes here.

File details

Details for the file amharic_corrector-1.0.4-py3-none-any.whl.

File metadata

File hashes

Hashes for amharic_corrector-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 e7174dba6f5e848650d55746f5ea8e85648ce08988baeab9f48c8c497b36764f
MD5 b9b15a588c3a812bf713180b10e92ff2
BLAKE2b-256 37bc125912dab60c7f95bd186a88751f20f425ab775adfd1facf4e0b931ecc03

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page