High-performance Amharic phonetic query corrector
Amharic Corrector
High-performance phonetic query correction library for Amharic.
Changelog
1.0.4
- Single-word correction now ranks candidates by phonetic skeleton similarity (consonant LCS) instead of corpus frequency, so the closest phonetic match wins even when a less similar word is more frequent in the corpus.
- Removed Levenshtein from the inner skeleton index loop, where it was causing a ~35% speed regression; it is kept only in the transliteration scoring path, where it adds precision.
- Multi-word correction unchanged: co-occurrence / bigram context still drives ranking as intended.
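To illustrate the idea behind the 1.0.4 ranking change, here is a minimal sketch of consonant-skeleton similarity scored by longest common subsequence (LCS). It is not the library's implementation: the helper names (`skeleton`, `lcs_length`, `skeleton_similarity`, `rank_candidates`), the rule of dropping the Latin vowels `a e i o u` from a romanized word, the normalization by the longer skeleton, and the candidate strings are all assumptions made for the example.

```python
# Illustrative sketch only; none of these helpers are part of amharic_corrector.
VOWELS = set("aeiou")  # assumption: skeleton = romanized word minus Latin vowels


def skeleton(word: str) -> str:
    """Return the lowercase consonant skeleton of a romanized word."""
    return "".join(ch for ch in word.lower() if ch.isalpha() and ch not in VOWELS)


def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence, using a rolling DP row."""
    prev = [0] * (len(b) + 1)
    for ca in a:
        curr = [0]
        for j, cb in enumerate(b, start=1):
            if ca == cb:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def skeleton_similarity(query: str, candidate: str) -> float:
    """LCS of the two skeletons, normalized by the longer skeleton (0.0 to 1.0)."""
    sq, sc = skeleton(query), skeleton(candidate)
    longest = max(len(sq), len(sc))
    if longest == 0:
        return 0.0
    return lcs_length(sq, sc) / longest


def rank_candidates(query: str, candidates: list[str]) -> list[str]:
    """Sort candidates by skeleton similarity, best first; frequency is ignored."""
    return sorted(candidates, key=lambda c: skeleton_similarity(query, c), reverse=True)


# Made-up candidate strings, used only to show the ordering.
print(rank_candidates("mekanitu", ["makina", "mekinitu", "bet"]))
```

Because the score depends only on the consonant skeletons, a rare candidate with a matching skeleton outranks a frequent one with a weaker match, which is the behavior the changelog entry describes.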
1.0.3
- Internal release.
1.0.2
- Added instance-level caching to `AmharicQueryCorrector` to prevent shared state when multiple instances are used with different corpuses.
- Minor performance improvements in candidate lookup.
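The shared-state problem that the 1.0.2 entry refers to can be shown with a small, generic sketch. The two classes below are hypothetical stand-ins, not `AmharicQueryCorrector` itself: one keeps its cache as a class attribute, so every instance shares it, and the other creates the cache in `__init__`, so each instance has its own.

```python
# Illustrative sketch only; these classes are hypothetical, not the library's code.
class SharedCacheCorrector:
    _cache: dict[str, bool] = {}  # class attribute: one dict shared by all instances

    def __init__(self, vocabulary):
        self.vocabulary = set(vocabulary)

    def known(self, word: str) -> bool:
        if word not in self._cache:
            self._cache[word] = word in self.vocabulary
        return self._cache[word]


class InstanceCacheCorrector:
    def __init__(self, vocabulary):
        self.vocabulary = set(vocabulary)
        self._cache: dict[str, bool] = {}  # instance attribute: one dict per corpus

    def known(self, word: str) -> bool:
        if word not in self._cache:
            self._cache[word] = word in self.vocabulary
        return self._cache[word]


# With the shared cache, the answer computed for one corpus leaks into the other.
a, b = SharedCacheCorrector(["selam"]), SharedCacheCorrector([])
print(a.known("selam"), b.known("selam"))  # second answer is wrong for b's corpus

# With the per-instance cache, each corpus is answered independently.
c, d = InstanceCacheCorrector(["selam"]), InstanceCacheCorrector([])
print(c.known("selam"), d.known("selam"))
```

The per-instance variant costs one extra dictionary per corrector but keeps results tied to the corpus each instance was built from.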
Installation
pip install .
Usage
from amharic_corrector import AmharicQueryCorrector
c = AmharicQueryCorrector("path/to/corpus.json")
print(c.correct_phrase("mekanitu"))
CLI
amharic-correct "Kuratie" --model path/to/corpus.json
Download files
Source Distribution
amharic_corrector-1.0.4.tar.gz (12.5 kB)
Built Distribution
amharic_corrector-1.0.4-py3-none-any.whl (12.2 kB)
File details
Details for the file amharic_corrector-1.0.4.tar.gz.
File metadata
- Download URL: amharic_corrector-1.0.4.tar.gz
- Upload date:
- Size: 12.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 6c08f5d97021d168c891e5e01bda6c59f4a699c51cb9a1ac349dc740bc374439 |
| MD5 | ba9b6643776ffc3009627b46f8f94295 |
| BLAKE2b-256 | f0f7a6fd66d3ab0fabbcb0e04c79ccaa55ed07448c34fea33e0076cab90322d6 |
File details
Details for the file amharic_corrector-1.0.4-py3-none-any.whl.
File metadata
- Download URL: amharic_corrector-1.0.4-py3-none-any.whl
- Upload date:
- Size: 12.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | e7174dba6f5e848650d55746f5ea8e85648ce08988baeab9f48c8c497b36764f |
| MD5 | b9b15a588c3a812bf713180b10e92ff2 |
| BLAKE2b-256 | 37bc125912dab60c7f95bd186a88751f20f425ab775adfd1facf4e0b931ecc03 |