SIENA tool for efficient entity annotation.

Project description

Efficient entity annotation tool for Sinhala, English, or Sinhala-English code-switched text corpora.

Features

  • Allows annotating both Sinhala and English textual data
  • Fully compatible with Rasa 2.8.x NLU training data files
  • Allows exporting annotated NLU YAML files
  • Auto-annotates entities efficiently using novel NLP techniques, including reverse stemming
  • Easy-to-use SIENA CLI that spins up a GUI server locally
  • Read more in the docs
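
For readers unfamiliar with the Rasa 2.x NLU training data format mentioned above: examples are listed per intent in a YAML file, and entities are marked inline as `[entity text](entity_type)`. The sketch below is not part of SIENA and does not use its API; it is a minimal, standard-library illustration of how such inline annotations map to character offsets. The function name `extract_entities`, the sample sentences, and the `city` entity type are hypothetical, and only the simple parenthesised form is handled (not the extended `[text]{"entity": ...}` form).

```python
import re

# Rasa 2.x inline entity syntax: [entity text](entity_type)
ENTITY_PATTERN = re.compile(r"\[(?P<value>[^\]]+)\]\((?P<entity>[^)]+)\)")


def extract_entities(example):
    """Return (plain_text, entities) for one annotated example line.

    Hypothetical helper for illustration only; offsets are counted in
    Unicode code points of the plain (un-annotated) text.
    """
    entities = []
    plain_parts = []
    cursor = 0  # read position in the annotated string
    offset = 0  # length of the plain text built so far

    for match in ENTITY_PATTERN.finditer(example):
        # Copy the unannotated text that precedes this entity.
        before = example[cursor:match.start()]
        plain_parts.append(before)
        offset += len(before)

        # Record the entity span relative to the plain text.
        value = match.group("value")
        entities.append({
            "start": offset,
            "end": offset + len(value),
            "value": value,
            "entity": match.group("entity"),
        })
        plain_parts.append(value)
        offset += len(value)
        cursor = match.end()

    # Copy whatever follows the last entity.
    plain_parts.append(example[cursor:])
    return "".join(plain_parts), entities


# Hypothetical English and Sinhala-English code-switched examples.
samples = [
    "book a ticket to [Colombo](city)",
    "[කොළඹ](city) වලට ticket එකක් book කරන්න",
]
for sample in samples:
    text, found = extract_entities(sample)
    print(text, found)
```

Because the offsets refer to the plain text, `text[e["start"]:e["end"]]` always equals `e["value"]`, which is a convenient sanity check when reviewing exported annotations.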

Ongoing Research

  • Concurrent entity tagging for multiple users
  • Import/Export support for non-Rasa NLU data files and text corpora

Known Issues

  • Support for Rasa versions other than 2.8.x is still under development
  • Benchmark tests are in progress

📒 Docs: https://siena-nlp.github.io
📦 PyPI: https://pypi.org/project/siena/1.0.2/
🪵 Full Changelog: Refer to the relevant GitHub branch (v1.0.0)

