Rasa NLU Components with PaddleNLP
Project description
Rasa NLU Components using PaddleNLP
Features
- Tokenizer and Dense featurizer using pre-trained models supported by PaddleNLP.
Usage
pip install rasa-paddlenlp
In your config.yml, use the following configuration:
language: zh
pipeline:
- name: "rasa_paddlenlp.nlu.paddlenlp_tokenizer.PaddleNLPTokenizer"
model_name: bert
model_weights: bert-wwm-ext-chinese
# Flag to check whether to split intents
intent_tokenization_flag: false
# Symbol on which intent should be split
intent_split_symbol: "_"
- name: "rasa_paddlenlp.nlu.paddlenlp_featurizer.PaddleNLPFeaturizer"
model_name: bert
model_weights: bert-wwm-ext-chinese
# rest of your configurations
Currently there is code to support BERT pre-trained models, we just need to add the model definitions and default weights in order for other PaddleNLP-supported models.
Credits
This package took inspiration from the following projects:
This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage
project template.
License
History
0.3.1 (2022-02-10)
- Made PaddleNLPTokenizer match V3 specification.
0.3.0 (2022-02-10)
- From now on the library supports Rasa V3
- Added XLNet and Roberta support (ERNIE will come soon)
0.2.0 (2022-01-13)
- First release on PyPI.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
rasa_paddlenlp-0.3.1.tar.gz
(17.0 kB
view hashes)
Built Distribution
Close
Hashes for rasa_paddlenlp-0.3.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5d7f4ed009f330ad8787580f5858601cebd19efa35c0dcc683cd6a4244ae6d4c |
|
MD5 | f1c01f5c4c1f2c2cb86a428e2fbf0634 |
|
BLAKE2b-256 | 93bd473503231575f8d46c1aec774336f91b6bea3402f52d05789b13641671e4 |