Transformers kit - Multi-task QA/Tagging/Multi-label Multi-Class Classification/Generation with BERT/ALBERT/T5/BERT
Project description
What is it
TFKit is a tool kit mainly for language generation.
It leverages the use of transformers on many tasks with different models in this all-in-one framework.
All you need is a little change of config.
Task Supported
With transformer models - BERT/ALBERT/T5/BART......
Text Generation | :memo: seq2seq language model |
Text Generation | :pen: causal language model |
Text Generation | :printer: once generation model / once generation model with ctc loss |
Text Generation | :pencil: onebyone generation model |
Getting Started
Learn more from the document.
How To Use
Step 0: Install
Simple installation from PyPI
pip install git+https://github.com/voidful/TFkit.git@refactor-dataset
Step 1: Prepare dataset in csv format
input, target
Step 2: Train model
tfkit-train \
--task clas \
--config xlm-roberta-base \
--train training_data.csv \
--test testing_data.csv \
--lr 4e-5 \
--maxlen 384 \
--epoch 10 \
--savedir roberta_sentiment_classificer
Step 3: Evaluate
tfkit-eval \
--task roberta_sentiment_classificer/1.pt \
--metric clas \
--valid testing_data.csv
Advanced features
Multi-task training
tfkit-train \
--task clas clas \
--config xlm-roberta-base \
--train training_data_taskA.csv training_data_taskB.csv \
--test testing_data_taskA.csv testing_data_taskB.csv \
--lr 4e-5 \
--maxlen 384 \
--epoch 10 \
--savedir roberta_sentiment_classificer_multi_task
Not maintained task
Due to time constraints, the following tasks are temporarily not supported
Classification | :label: multi-class and multi-label classification |
Question Answering | :page_with_curl: extractive qa |
Question Answering | :radio_button: multiple-choice qa |
Tagging | :eye_speech_bubble: sequence level tagging / sequence level with crf |
Self-supervise Learning | :diving_mask: mask language model |
Supplement
- transformers models list: you can find any pretrained models here
- nlprep: download and preprocessing data in one line
- nlp2go: create demo api as quickly as possible.
Contributing
Thanks for your interest.There are many ways to contribute to this project. Get started here.
License
Icons reference
Icons modify from Freepik from www.flaticon.com
Icons modify from Nikita Golubev from www.flaticon.com
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file tfkit-0.8.3.tar.gz
.
File metadata
- Download URL: tfkit-0.8.3.tar.gz
- Upload date:
- Size: 220.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2ea20f989ebb101f49217e43b28a4d4bc4ad9f543421a3002ff46250d3523597 |
|
MD5 | 1ad6801152b1421688479821d56d0ead |
|
BLAKE2b-256 | 212b8592ea66d05de408878d70facc2487b586644fcc198ec105112b0da76bd2 |
File details
Details for the file tfkit-0.8.3-py3-none-any.whl
.
File metadata
- Download URL: tfkit-0.8.3-py3-none-any.whl
- Upload date:
- Size: 54.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 429b50e72108d9c313667939ab4beb60e23bef89fa47a012e74c8fc17c5b5a5f |
|
MD5 | 51673df338ad5137c81469f4871350ac |
|
BLAKE2b-256 | 94abb63939a570f20a2c5d2a60961bdc8e58c67d424829efd5dfb960091fa65a |