
Text tagger based on ELMo embeddings and a recurrent neural network, with a simple sklearn-like interface



NeuroTagger
============

NeuroTagger is a text tagger based on a recurrent neural network. It can
be used as a named entity recognizer (NER), a dependency parser, a
morphological analyzer, etc.

The goal of this project is to create a simple Python package with
an sklearn-like interface for solving various text tagging tasks
(named entity recognition, dependency parsing, etc.) when the
number of labeled texts is very small (no more than several
thousand). Special word embeddings called `ELMo <https://arxiv.org/abs/1802.05365>`_
(Embeddings from Language Models) make this possible: because these
embeddings are contextual, they yield a simpler and more
separable feature space for the words in a text.

ELMo embeddings are used as features of the words in a text, and different
variants of neural network architecture (BiLSTM, hybrid BiLSTM-CRF, or
pure CRF) can be used as the final classifier (tagger). I recommend
using the `TensorFlow Hub ELMo <https://tfhub.dev/google/elmo/2>`_
for English NLP tasks and the `DeepPavlov ELMo
<http://docs.deeppavlov.ai/en/master/apiref/models/embedders.html#deeppavlov.models.embedders.elmo_embedder.ELMoEmbedder>`_
for the same tasks in Russian.
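
To illustrate what an sklearn-like interface for tagging can look like, here is a minimal sketch. It is not the NeuroTagger API: the class ``ToyTagger``, its ``default_tag`` parameter, and the token/tag list format are hypothetical names chosen for this example. In place of ELMo features and a BiLSTM-CRF classifier, it uses a trivial majority-tag lookup. Only the ``fit``/``predict`` convention is the point here.

```python
from collections import Counter, defaultdict


class ToyTagger:
    """Hypothetical majority-tag baseline; NOT the NeuroTagger API."""

    def __init__(self, default_tag="O"):
        # Tag assigned to tokens that were never seen during fit().
        self.default_tag = default_tag

    def fit(self, X, y):
        # X: list of token lists; y: list of tag lists of the same shape.
        if len(X) != len(y):
            raise ValueError("X and y must contain the same number of texts")
        counts = defaultdict(Counter)
        for tokens, tags in zip(X, y):
            if len(tokens) != len(tags):
                raise ValueError("each text needs exactly one tag per token")
            for token, tag in zip(tokens, tags):
                counts[token.lower()][tag] += 1
        # Remember the most frequent tag observed for every token.
        self.tag_of_ = {tok: c.most_common(1)[0][0] for tok, c in counts.items()}
        return self  # returning self allows chaining, as in scikit-learn

    def predict(self, X):
        if not hasattr(self, "tag_of_"):
            raise RuntimeError("call fit() before predict()")
        return [
            [self.tag_of_.get(tok.lower(), self.default_tag) for tok in tokens]
            for tokens in X
        ]


train_texts = [["Alice", "lives", "in", "Paris"], ["Bob", "visited", "Paris"]]
train_tags = [["B-PER", "O", "O", "B-LOC"], ["B-PER", "O", "B-LOC"]]

tagger = ToyTagger().fit(train_texts, train_tags)
predicted = tagger.predict([["Alice", "visited", "Berlin"]])
print(predicted)  # [['B-PER', 'O', 'O']]
```

The unseen token "Berlin" falls back to the default tag. That is exactly the weakness of a context-free lookup, and the kind of case where contextual embeddings such as ELMo are meant to help.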


