Skip to main content

EstNLTK neural -- EstNLTK's linguistic analysis based on neural models

Project description

EstNLTK neural -- EstNLTK's linguistic analysis based on neural models

This package contains EstNLTK's linguistic analysis tools that use neural models:

  • bert embeddings tagger;
  • bert-based named entity recognition;
  • bert-based morphological features tagger and disambiguator;
  • GliLem lemmatizer and morphological disambiguator;
  • stanza syntax tagger and stanza ensemble syntax tagger;
  • pronominal coreference tagger v1 (relies on stanza for input preprocessing);
  • [legacy] tensorflow-based neural morphological features tagger ( disambiguator );

Note: these tools require installation of deep learning frameworks (tensorflow, pytorch), and are demanding for computational resources; they also rely on large models which need to be downloaded separately.

The EstNLTK project is funded by EKT (Eesti Keeletehnoloogia Riiklik Programm).

Installation

EstNLTK-neural is available as a PyPI wheel:

pip install estnltk_neural

And as an Anaconda package:

conda install -c estnltk -c conda-forge estnltk_neural

Supported Python versions: 3.9+

Neural models

Models required by neural tools are large, and therefore cannot be distributed with this package. However, our tagger classes are implemented in a way that once you create an instance of a neural tagger, you'll be asked for a permission to download missing models, and if you give the permission, the model will be downloaded (and installed in a proper location) automatically. If needed, you can also change the default location where downloaded models will be placed, see this tutorial for details.

Documentation

EstNLTK's NLP component tutorials also cover information about neural taggers:

Source

The source of the last release is available at the main branch.

License

EstNLTK-neural is released under dual license - either GNU General Public License v2.0 or Apache 2.0 License.

EstNLTK-neural's GliLem lemmatizer and morphological disambiguator contains code that is licensed under Mozilla Public License 2.0 (MPL 2.0).

(C) University of Tartu (unless specified otherwise in the file headers)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

estnltk_neural-1.7.4.tar.gz (170.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

estnltk_neural-1.7.4-py3-none-any.whl (206.3 kB view details)

Uploaded Python 3

File details

Details for the file estnltk_neural-1.7.4.tar.gz.

File metadata

  • Download URL: estnltk_neural-1.7.4.tar.gz
  • Upload date:
  • Size: 170.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for estnltk_neural-1.7.4.tar.gz
Algorithm Hash digest
SHA256 80934ede0525657e13a5b7c4aa9b153c5f1e131160656f6193d548766650040b
MD5 3744ba2f7db5085359d92f06f2a7551b
BLAKE2b-256 41f77781c4498b30a88c5e93f3f70617fd05b074c9d0d7983b3f332249f03de7

See more details on using hashes here.

File details

Details for the file estnltk_neural-1.7.4-py3-none-any.whl.

File metadata

  • Download URL: estnltk_neural-1.7.4-py3-none-any.whl
  • Upload date:
  • Size: 206.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for estnltk_neural-1.7.4-py3-none-any.whl
Algorithm Hash digest
SHA256 e604c8ee45723e20d0acd056de9639214fbab820a1773e05caa8e441e97ed072
MD5 0f6ad3b6aef499b9308dd69d1ec908d5
BLAKE2b-256 40519dd1684a087506c9942312ec3b8e5e03e37968b2b661d38d1edf7a362516

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page