Skip to main content

Anncorra is a python package for giving meaning to POS (Part of Speech) tags.

Project description

Tags of AnnCorra

GitHub issues GitHub forks GitHub stars GitHub
Indian Language Machine Translation (ILMT) project has taken the task of annotating corpora (AnnCorra) of several Indian languages and came up with tags which have been defined for the tagging schemes for POS (part of speech) tagging.

This repository would explain the POS (Part Of Speech) Tags along with examples.

Requirements

Package requires the following to run:

  • python (preferable version 3+)

Installation

Use the package manager pip to install foobar.

pip install anncorra

or

git clone https://github.com/kuldip-barot/anncorra.git
cd anncorra
python setup.py install

Usage

Import the package after installation.

>>> import anncorra
>>> anncorra.explain('NN')

The output of above command:

POS Tags :  NN
Full form :  Noun
Desription :  The tag NN tag set makes a distinction between noun singular (NN) and noun plural (NNS).
Example :
yaha bAta  galI_NN galI_RDP meM  phEla gayI
 'this' 'talk'  'lane'      'lane'         'in'    'spread' 'went'
 “The word was spread in every lane”.

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

References

AnnCorra : Annotating Corpora; Guidelines For POS And Chunk Annotation For Indian Languages

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

anncorra-0.0.4.tar.gz (9.6 kB view hashes)

Uploaded Source

Built Distribution

anncorra-0.0.4-py3-none-any.whl (10.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page