Skip to main content

An open source library for building end-to-end dialog systems and training chatbots.

Project description

DeepPavlov 1.0

License Apache 2.0 Python 3.6, 3.7, 3.8, 3.9, 3.10, 3.11 Downloads Static Badge Static Badge

DeepPavlov 1.0 is an open-source NLP framework built on PyTorch and transformers. DeepPavlov 1.0 is created for modular and configuration-driven development of state-of-the-art NLP models and supports a wide range of NLP model applications. DeepPavlov 1.0 is designed for practitioners with limited knowledge of NLP/ML.

Quick Links

name Description
⭐️ Demo Check out our NLP models in the online demo
📚 Documentation How to use DeepPavlov 1.0 and its features
🚀 Model List Find the NLP model you need in the list of available models
🪐 Contribution Guide Please read the contribution guidelines before making a contribution
🎛 Issues If you have an issue with DeepPavlov, please let us know
Forum Please let us know if you have a problem with DeepPavlov
📦 Blogs Read about our current development
🦙 Extended colab tutorials Check out the code tutorials for our models
🌌 Docker Hub Check out the Docker images for rapid deployment
👩‍🏫 Feedback Please leave us your feedback to make DeepPavlov better

Installation

  1. DeepPavlov supports Linux, Windows 10+ (through WSL/WSL2), MacOS (Big Sur+) platforms, Python 3.6, 3.7, 3.8, 3.9 and 3.10. Depending on the model used, you may need from 4 to 16 GB RAM.

  2. Create and activate a virtual environment:

    • Linux
    python -m venv env
    source ./env/bin/activate
    
  3. Install the package inside the environment:

    pip install deeppavlov
    

QuickStart

There is a bunch of great pre-trained NLP models in DeepPavlov. Each model is determined by its config file.

List of models is available on the doc page in the deeppavlov.configs (Python):

from deeppavlov import configs

When you're decided on the model (+ config file), there are two ways to train, evaluate and infer it:

GPU requirements

By default, DeepPavlov installs models requirements from PyPI. PyTorch from PyPI could not support your device CUDA capability. To run supported DeepPavlov models on GPU you should have CUDA compatible with used GPU and PyTorch version required by DeepPavlov models. See docs for details. GPU with Pascal or newer architecture and 4+ GB VRAM is recommended.

Command line interface (CLI)

To get predictions from a model interactively through CLI, run

python -m deeppavlov interact <config_path> [-d] [-i]
  • -d downloads required data - pretrained model files and embeddings (optional).
  • -i installs model requirements (optional).

You can train it in the same simple way:

python -m deeppavlov train <config_path> [-d] [-i]

Dataset will be downloaded regardless of whether there was -d flag or not.

To train on your own data you need to modify dataset reader path in the train config doc. The data format is specified in the corresponding model doc page.

There are even more actions you can perform with configs:

python -m deeppavlov <action> <config_path> [-d] [-i]
  • <action> can be
    • install to install model requirements (same as -i),
    • download to download model's data (same as -d),
    • train to train the model on the data specified in the config file,
    • evaluate to calculate metrics on the same dataset,
    • interact to interact via CLI,
    • riseapi to run a REST API server (see doc),
    • predict to get prediction for samples from stdin or from <file_path> if -f <file_path> is specified.
  • <config_path> specifies path (or name) of model's config file
  • -d downloads required data
  • -i installs model requirements

Python

To get predictions from a model interactively through Python, run

from deeppavlov import build_model

model = build_model(<config_path>, install=True, download=True)

# get predictions for 'input_text1', 'input_text2'
model(['input_text1', 'input_text2'])

where

  • install=True installs model requirements (optional),
  • download=True downloads required data from web - pretrained model files and embeddings (optional),
  • <config_path> is model name (e.g. 'ner_ontonotes_bert_mult'), path to the chosen model's config file (e.g. "deeppavlov/configs/ner/ner_ontonotes_bert_mult.json"), or deeppavlov.configs attribute (e.g. deeppavlov.configs.ner.ner_ontonotes_bert_mult without quotation marks).

You can train it in the same simple way:

from deeppavlov import train_model 

model = train_model(<config_path>, install=True, download=True)

To train on your own data you need to modify dataset reader path in the train config doc. The data format is specified in the corresponding model doc page.

You can also calculate metrics on the dataset specified in your config file:

from deeppavlov import evaluate_model 

model = evaluate_model(<config_path>, install=True, download=True)

DeepPavlov also allows to build a model from components for inference using Python.

License

DeepPavlov is Apache 2.0 - licensed.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deeppavlov-1.7.0.tar.gz (288.5 kB view details)

Uploaded Source

Built Distribution

deeppavlov-1.7.0-py3-none-any.whl (492.7 kB view details)

Uploaded Python 3

File details

Details for the file deeppavlov-1.7.0.tar.gz.

File metadata

  • Download URL: deeppavlov-1.7.0.tar.gz
  • Upload date:
  • Size: 288.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.8.10

File hashes

Hashes for deeppavlov-1.7.0.tar.gz
Algorithm Hash digest
SHA256 196c544b17a5ea8a36a2ad39f5e7aac6770d0506d1322be7a8f25c06bfeb5357
MD5 299e979b981549f3ca33f50cb345b27e
BLAKE2b-256 fc97f62e1ca402f5dcd41fe594e72a5e1410d35d2b710bf2e92d6a8980f59a8d

See more details on using hashes here.

File details

Details for the file deeppavlov-1.7.0-py3-none-any.whl.

File metadata

  • Download URL: deeppavlov-1.7.0-py3-none-any.whl
  • Upload date:
  • Size: 492.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.8.10

File hashes

Hashes for deeppavlov-1.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ebac0cdd6af7509e2102dde223285e0340d4babe9dfab83fac5176013462aee5
MD5 c5ccfbc2c8d5d584bbf633d473481ca0
BLAKE2b-256 aeafda9d957d8556c21d4df1de50ffe0e034f0fc1485b2e208e664951a7b75e7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page