Skip to main content

A Python library for morphological tagging, lemmatization and dependency parsing

Project description

dl4dp

dl4dp is a Python NLP library, which provides tools for morphological tagging, lemmatization and dependency parsing. The main motivation for this library is to provide state-of-the-art tools for Slovak language, but the models can be build for any language with training data in Universal Dependencies.

Installation

The library supports Python 3.6 and later.

pip

The dl4dp is available on PyPi and can be installed via pip. To install simply run:

pip install dl4dp

To upgrade the previous installation to the newest release, use:

pip install dl4dp -U

From source

Alternatively, you can also install library from this git repository, which will give you more flexibility and allows you to start contributing to the dl4dp code. For this option, run:

git clone https://github.com/peterbednar/dl4dp.git
cd dl4dp
pip install -e .

Getting started with dl4dp

The library provides a command-line interface which allows you to train own model, create pipeline package for production deployment, or parse data. The following command will download Universal Dependencies archive and train morphological tagger model on English EWT treebank:

python -m dl4dp train tagger -t en_ewt

Similarly, the following command will train a model for dependency parser:

python -m dl4dp train parser -t en_ewt

Subsequently, you can create and install a pipeline package:

python -m dl4dp package install -t en_ewt

After the installation of the pipeline, it can be used for the parsing of input data in CoNLL-U format:

python -m dl4dp parse ~/.dl4dp/treebanks/en_ewt/en_ewt-ud-test.conllu output.conllu -m en_ewt-0.1.0

LICENSE

dl4dp is released under the MIT License. See the LICENSE file for more details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dl4dp-0.9.1.tar.gz (14.5 kB view details)

Uploaded Source

Built Distribution

dl4dp-0.9.1-py3-none-any.whl (16.0 kB view details)

Uploaded Python 3

File details

Details for the file dl4dp-0.9.1.tar.gz.

File metadata

  • Download URL: dl4dp-0.9.1.tar.gz
  • Upload date:
  • Size: 14.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.9.2

File hashes

Hashes for dl4dp-0.9.1.tar.gz
Algorithm Hash digest
SHA256 d6f86c9c0b1ae81ca3b2353f0dd248271038ebe12476625a599b5861ee8acde8
MD5 77de125452592efc1ea8552947a796fa
BLAKE2b-256 d29f028ab0ab1c294057bd72e85136305f5facace1e2800b53e4ff5beaffe0a2

See more details on using hashes here.

File details

Details for the file dl4dp-0.9.1-py3-none-any.whl.

File metadata

  • Download URL: dl4dp-0.9.1-py3-none-any.whl
  • Upload date:
  • Size: 16.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.9.2

File hashes

Hashes for dl4dp-0.9.1-py3-none-any.whl
Algorithm Hash digest
SHA256 79c8a166c5b14ae2b59cb8356fea505da6b1fb96b6308d80cdaa2679a63facef
MD5 b7ec36e2719009fe882e2348f640cb22
BLAKE2b-256 ba8d7e092055787529dc4092173d3778e1bacc616ee6b61accf9deb10a1b25c1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page