SIENA tool for efficient entity annotation.
Project description
Efficient entity annotation tool for Sinhala, English, or Sinhala-English code-switched text corpora.
Features
- Allows annotating both Sinhala and English textual data
- Fully compatible with Rasa 2.8.x NLU training data files
- Allows exporting annotated NLU YAML files
- Able to auto annotate entities efficiently based on novel NLP techniques including reverse-stemming
- Easy to use SIENA CLI that can spin up a GUI server, locally
- Read more on docs
Ongoing Research
- Concurrent entity tagging for multiple users
- Import/Export support for non-Rasa NLU data files and text corpora
Known Issues
- Support for Rasa versions other than 2.8.x is under ongoing development
- Benchmark tests are in progress
📒 Docs: https://siena-nlp.github.io
📦 PyPi: https://pypi.org/project/siena/1.0.2/
🪵 Full Changelog: Refer the relevant GitHub branch (v1.0.0)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
siena-1.0.2.tar.gz
(86.8 kB
view hashes)
Built Distribution
siena-1.0.2-py3-none-any.whl
(94.1 kB
view hashes)