hAraCat 🐈 adds diacritics to Arabic text
hAraCat 🐈
hAraCat is a Python library for automatically adding diacritics to Medieval Arabic text.
Install 😻
pip install haracat
Use 🐱
Diacritics can be added as follows:
from haracat import diacritics_sentence
diacritics_sentence("الإجاج، مثلثة الأول: الستر.".split(" "))
>> الْإِجاجُ، مُثَلَّثَةَ الْأَوَّلِ: السِّتْرُ.
The sentence is tokenized first (here by splitting it on spaces), and the diacritics are then predicted for the resulting list of tokens.
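The tokenize-then-predict flow can be wrapped in a small helper. This is only a sketch: `tokenize` and `add_diacritics` are hypothetical names that are not part of haracat, and whitespace splitting is assumed to be an adequate tokenizer because that is what the example above uses. The only haracat call is `diacritics_sentence`, applied to a list of tokens exactly as shown above.

```python
from typing import List


def tokenize(sentence: str) -> List[str]:
    """Hypothetical helper: split a sentence on runs of whitespace.

    Unlike sentence.split(" "), this does not produce empty tokens
    when the text contains double spaces or leading/trailing spaces.
    Punctuation stays attached to the words, as in the example above.
    """
    return sentence.split()


def add_diacritics(sentence: str):
    """Hypothetical wrapper: tokenize, then hand the tokens to haracat."""
    # Imported lazily so that tokenize() can be used without haracat installed.
    from haracat import diacritics_sentence

    return diacritics_sentence(tokenize(sentence))
```

With haracat installed, `add_diacritics("الإجاج، مثلثة الأول: الستر.")` passes the same four tokens to `diacritics_sentence` as the example above.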
Credits
Khalid Alnajjar, Mika Hämäläinen, Niko Partanen and Jack Rueter