Skip to main content

Open-source Python toolkit for Arabic Natural Understanding, allowing people to integrate it in their system workflow.

Project description

SinaTools

Open Source Toolkit for Arabic NLP and NLU developed by SinaLab at Birzeit University. SinaTools is available through Python APIs, command lines, colabs, and online demos.

See the full list of Available Packages, which include: (1) Morphology Tagging, (2) Named Entity Recognition (NER), (3) Word Sense Disambiguation (WSD), (4) Semantic Relatedness, (5) Synonymy Extraction and Evaluation, (6) Relation Extraction, (7) Utilities (diacritic-based word matching, Jaccard similarly, parser, tokenizers, corpora processing, transliteration, etc).

See Demo Pages.

See the benchmarking, which shows that SinaTools outperformed all related toolkits.

Installation

To install SinaTools, ensure you are using Python version 3.10.8, then clone the GitHub repository.

Alternatively, you can execute the following command:

pip install sinatools

Installing Models and Data Files

Some modules in SinaTools require some data files and fine-tuned models to be downloaded. To download these models, please consult the DataDownload.

Documentation

For information, please refer to the main page or the online domuementation.

Citation

Tymaa Hammouda, Mustafa Jarrar, Mohammed Khalilia: SinaTools: Open Source Toolkit for Arabic Natural Language Understanding. In Proceedings of the 2024 AI in Computational Linguistics (ACLing 2024), Procedia Computer Science, Dubai. ELSEVIER.

License

SinaTools is available under the MIT License. See the LICENSE file for more information.

Reporting Issues

To report any issues or bugs, please contact us at "sina.institute.bzu@gmail.com" or visit SinaTools Issues.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

SinaTools-1.0.4.tar.gz (123.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

SinaTools-1.0.4-py2.py3-none-any.whl (256.9 kB view details)

Uploaded Python 2Python 3

File details

Details for the file SinaTools-1.0.4.tar.gz.

File metadata

  • Download URL: SinaTools-1.0.4.tar.gz
  • Upload date:
  • Size: 123.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.3

File hashes

Hashes for SinaTools-1.0.4.tar.gz
Algorithm Hash digest
SHA256 51f17ad238c4b8e5eb66cc204d45ef48e166a8fb786aae5cb7de6c56d031b676
MD5 03e3ad83028398e4e9b69e8d75987913
BLAKE2b-256 33a17ba779b5d250a2dcefed530edef0cf051642559239f484be2565cd080149

See more details on using hashes here.

File details

Details for the file SinaTools-1.0.4-py2.py3-none-any.whl.

File metadata

  • Download URL: SinaTools-1.0.4-py2.py3-none-any.whl
  • Upload date:
  • Size: 256.9 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.3

File hashes

Hashes for SinaTools-1.0.4-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 6ec6ceef2c7e79e781957e39b2092e618352a5bc79a31b7073e55a7bc2fd5bf1
MD5 9804b82c5dbeca32233d1231b988fd65
BLAKE2b-256 503c98e7a9e362dbaf4f6c76ac4ec3e1f47cf2e1441a335b19e41579e9b2285b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page