Skip to main content

Text tagger, based on the ELMo embeddings and recurrent neural network, with the simple sklearn-like interface

Project description


neuro_tagger
============

Text tagger based on recurrent neural network. It can be used as NER,
dependency parser, morphoanalyzer etc.

The goal of this project is creation of a simple Python package with
the sklearn-like interface for solution of different tasks of text
tagging (named entity recognition, dependency parsing, etc) in case
number of labeled texts is very small (not greater than several
thousands). Special word embeddings named as ELMo (**E**mbeddings from
**L**anguage **Mo**dels) ensure this possibility, because these
embeddings are contextual and they allow to design more simple and
separable feature space for words in texts.

ELMo embeddings are used as features of words in text, and different
variants of neural network architecture (BiLSTM, hybrid BiLSTM-CRF or
pure CRF) can be used as final classifier (tagger). I recommend to
use a special *TensorFlow Hub ELMo* for English NLP tasks and a
*DeepPavlov ELMo* for for same tasks in Russian.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

neuro_tagger-0.0.1.4.tar.gz (21.1 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page