Arabic Spelling Correction
Project description
Description
Simple library to check the spelling of arabic sentences. This library uses a vocabulary that consists of +500K words, and uses 1-edit_distance and 2-edit_distance to correct the misspelled words. It also uses 1-ngram language models to correct the words depending on the previous context.
Installation
pip install ar-corrector
Usage
Correct word spelling
from ar_corrector.corrector import Corrector
corr = Corrector()
corr.spell_correct('بختب') # return the correction with the frequency
# [('بكتب', 52)]
corr.spell_correct('بختب', 4) # return top 4 correction with frequencies
# [('بكتب', 52), ('بخت', 4), ('بختم', 3), ('بعتب', 2)]
corr.spell_correct('لتمشتلميتلكب', 4) # return the same word
# لتمشتلميتلكب
corr.spell_correct('من') # return true
# True
Correct word spelling using the context
from ar_corrector.corrector import Corrector
corr = Corrector()
sent = 'أكدت قواءص التمذد في تشاد أنها تواضضل طريقها للعاحمة'
print(corr.contextual_correct(sent))
#أكدت قوات التمرد في تشاد أنها تواصل طريقها للعاصمة
sent = 'اتتنتهى حدث آبل المنتظو بالإعلاخ عن مموعة من المنتجات'
print(corr.contextual_correct(sent))
#انتهى حدث آبل المنتظر الإعلان عن مجموعة من المنتجات
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ar_corrector-1.1.2.tar.gz
(51.5 MB
view hashes)
Built Distribution
Close
Hashes for ar_corrector-1.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 143f668f2a6456113d43ab7471567395d6cd2bea0da7ccfce6fe25f2d0fdd7b7 |
|
MD5 | eccda9b13197e65c7040de58cb281382 |
|
BLAKE2b-256 | 94eac362ec9412d6466708065f872e975f3cd29dbac14d9f4a6e32d8a7a3a500 |