Arabic Spelling Correction

Description

A simple library for checking the spelling of Arabic sentences. It uses a vocabulary of more than 500K words and looks for candidates at edit distance 1 and edit distance 2 to correct misspelled words. It also uses a 1-ngram language model to choose corrections based on the preceding context.
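The edit-distance lookup described above can be sketched in the style of Norvig's well-known spelling corrector. This is a minimal illustration, not the library's actual code: the `ARABIC_LETTERS` alphabet, the `edits1`, `edits2`, and `suggest` helpers, and the toy vocabulary are all assumptions made for this example (the frequencies are borrowed from the usage output shown in this README).

```python
# Assumption: a basic Arabic alphabet; the library's real alphabet may differ.
ARABIC_LETTERS = "ءآأؤإئابةتثجحخدذرزسشصضطظعغفقكلمنهوىي"


def edits1(word):
    """Return every string at edit distance 1 from `word`."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [left + right[1:] for left, right in splits if right]
    transposes = [left + right[1] + right[0] + right[2:]
                  for left, right in splits if len(right) > 1]
    replaces = [left + c + right[1:]
                for left, right in splits if right for c in ARABIC_LETTERS]
    inserts = [left + c + right for left, right in splits for c in ARABIC_LETTERS]
    return set(deletes + transposes + replaces + inserts)


def edits2(word):
    """Return every string at edit distance 2 (two single edits) from `word`."""
    return {e2 for e1 in edits1(word) for e2 in edits1(e1)}


def suggest(word, vocab, top_n=5):
    """Hypothetical stand-in for a spell-correct call over a {word: frequency} dict."""
    if word in vocab:
        return True  # already a known word
    candidates = [w for w in edits1(word) if w in vocab]
    if not candidates:
        # Fall back to distance 2 only when distance 1 finds nothing.
        candidates = [w for w in edits2(word) if w in vocab]
    if not candidates:
        return word  # nothing close enough: give the word back unchanged
    ranked = sorted(candidates, key=lambda w: vocab[w], reverse=True)
    return [(w, vocab[w]) for w in ranked[:top_n]]


# Toy vocabulary for illustration only.
toy_vocab = {'بكتب': 61, 'برتب': 22, 'بختم': 21, 'بختي': 9, 'بخت': 7, 'من': 100}
print(suggest('بختب', toy_vocab, 2))
```

Trying distance 1 before distance 2 keeps the candidate set small in the common case, since the distance-2 set grows roughly with the square of the distance-1 set.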

Installation

pip install ar-corrector

Usage

Correct word spelling

from ar_corrector.corrector import Corrector
corr = Corrector()

corr.spell_correct('بختب')  # returns the 5 most frequent corrections
# [('بكتب', 61), ('برتب', 22), ('بختم', 21), ('بختي', 9), ('بخت', 7)]

corr.spell_correct('بختب', 2)  # returns the 2 most frequent corrections
# [('بكتب', 61), ('برتب', 22)]

corr.spell_correct('بختب', 1)  # returns the single most frequent correction
# [('بكتب', 61)]

corr.spell_correct('لتمشتلميتلكب', 4)  # no correction found: returns the same word
# لتمشتلميتلكب

corr.spell_correct('من')  # word is already correct: returns True
# True
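Judging only from the outputs shown above, `spell_correct` can return three different shapes: a list of `(word, frequency)` tuples, the input word as a plain string, or `True`. A small wrapper can collapse these into a single string. The `best_correction` helper below is hypothetical, not part of the library, and relies on the return shapes being exactly as shown in these examples.

```python
def best_correction(word, result):
    """Collapse the three return shapes shown above into one string.

    `result` is whatever spell_correct(word) returned (assumed shapes only).
    """
    if result is True:
        # The word was found in the vocabulary, so keep it.
        return word
    if isinstance(result, str):
        # No candidate was found and the word was returned unchanged.
        return result
    # Otherwise a list of (candidate, frequency) tuples, most frequent first.
    return result[0][0]


# Usage with the outputs listed in this README:
print(best_correction('بختب', [('بكتب', 61), ('برتب', 22)]))
print(best_correction('من', True))
```

With the real library this would be called as `best_correction(w, corr.spell_correct(w))`.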

Correct word spelling using the context

from ar_corrector.corrector import Corrector
corr = Corrector()

sent = 'أكدت قواءص التمذد في تشاد أنها تواضضل طريقها للعاحمة'
print(corr.contextual_correct(sent)) 
#أكدت قوات التمرد في تشاد أنها تواصل طريقها للعاصمة

sent = 'اتتنتهى حدث آبل المنتظو بالإعلاخ عن مموعة من المنتجات'
print(corr.contextual_correct(sent))
#انتهى حدث آبل المنتظر الإعلان عن مجموعة من المنتجات
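One way context could break ties between candidates is to prefer the candidate seen most often after the previous word. The sketch below illustrates that idea only. It is an assumption about the general approach, not the library's implementation, and `pick_with_context`, the candidate list, and the bigram counts are made-up illustrative values.

```python
def pick_with_context(prev_word, candidates, bigram_counts):
    """Pick the candidate that most often follows `prev_word`.

    candidates:    list of (word, frequency) tuples.
    bigram_counts: dict mapping (previous_word, word) to a count.
    Candidates never seen after `prev_word` fall back to their own frequency.
    """
    best = max(
        candidates,
        key=lambda c: (bigram_counts.get((prev_word, c[0]), 0), c[1]),
    )
    return best[0]


# Toy data for illustration only (counts are invented).
candidates = [('التمدد', 30), ('التمرد', 20)]
bigram_counts = {('قوات', 'التمرد'): 12}

print(pick_with_context('قوات', candidates, bigram_counts))
```

Here the less frequent candidate wins because it is the only one observed after the previous word. With no bigram evidence, the choice falls back to plain word frequency.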
