EstNLTK neural -- EstNLTK's linguistic analysis based on neural models
Project description
EstNLTK neural -- EstNLTK's linguistic analysis based on neural models
This package contains EstNLTK's linguistic analysis tools that use neural models:
- bert embeddings tagger;
- bert-based named entity recognition;
- bert-based morphological features tagger and disambiguator;
- GliLem lemmatizer and morphological disambiguator;
- stanza syntax tagger and stanza ensemble syntax tagger;
- pronominal coreference tagger v1 (relies on stanza for input preprocessing);
- [legacy] tensorflow-based neural morphological features tagger ( disambiguator );
Note: these tools require installation of deep learning frameworks (tensorflow, pytorch), and are demanding for computational resources; they also rely on large models which need to be downloaded separately.
The EstNLTK project is funded by EKT (Eesti Keeletehnoloogia Riiklik Programm).
Installation
EstNLTK-neural is available as a PyPI wheel:
pip install estnltk_neural
And as an Anaconda package:
conda install -c estnltk -c conda-forge estnltk_neural
Supported Python versions: 3.9+
Neural models
Models required by neural tools are large, and therefore cannot be distributed with this package. However, our tagger classes are implemented in a way that once you create an instance of a neural tagger, you'll be asked for a permission to download missing models, and if you give the permission, the model will be downloaded (and installed in a proper location) automatically. If needed, you can also change the default location where downloaded models will be placed, see this tutorial for details.
Documentation
EstNLTK's NLP component tutorials also cover information about neural taggers:
- bert embeddings tagger;
- named entity recognition (incl bert-based approaches);
- bert-based morphological features tagger and disambiguator;
- GliLem lemmatizer and morphological disambiguator;
- stanza-based syntax taggers;
- pronominal coreference tagger v1;
- [legacy] tensorflow-based neural morphological features tagger ( disambiguator )
Source
The source of the last release is available at the main branch.
License
EstNLTK-neural is released under dual license - either GNU General Public License v2.0 or Apache 2.0 License.
EstNLTK-neural's GliLem lemmatizer and morphological disambiguator contains code that is licensed under Mozilla Public License 2.0 (MPL 2.0).
(C) University of Tartu (unless specified otherwise in the file headers)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file estnltk_neural-1.7.4.tar.gz.
File metadata
- Download URL: estnltk_neural-1.7.4.tar.gz
- Upload date:
- Size: 170.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
80934ede0525657e13a5b7c4aa9b153c5f1e131160656f6193d548766650040b
|
|
| MD5 |
3744ba2f7db5085359d92f06f2a7551b
|
|
| BLAKE2b-256 |
41f77781c4498b30a88c5e93f3f70617fd05b074c9d0d7983b3f332249f03de7
|
File details
Details for the file estnltk_neural-1.7.4-py3-none-any.whl.
File metadata
- Download URL: estnltk_neural-1.7.4-py3-none-any.whl
- Upload date:
- Size: 206.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e604c8ee45723e20d0acd056de9639214fbab820a1773e05caa8e441e97ed072
|
|
| MD5 |
0f6ad3b6aef499b9308dd69d1ec908d5
|
|
| BLAKE2b-256 |
40519dd1684a087506c9942312ec3b8e5e03e37968b2b661d38d1edf7a362516
|