# t2t-tuner

Convenient Text-to-Text Training for Transformers
```shell
pip install t2t-tuner
```

Requires PyTorch: either follow the PyTorch installation instructions or use a PyTorch container.
## Features

- Easy training for text-to-text generation tasks
- Training methods/features:
  - Supervised fine-tuning
  - Gradient checkpointing
  - Model parallelism
  - Soft prompt tuning (based on this paper)
  - Freeze encoder/decoder/embeddings
  - Print model summary
- Based on the wonderful HuggingFace Transformers library. Tested on T5-based models; in theory, it should also work with any other model that supports `AutoModelForSeq2SeqLM`.
This work is based on HuggingFace's run_translation.py script for text-to-text generation tasks. It provides (what I feel is) a more convenient interface for training and running inference with text-to-text generation models, along with easier access to existing features and some new ones I added myself.
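Since the trainer builds on run_translation.py, which reads JSON-lines files with one example per line, a training file in that style can be prepared with the standard library alone. This is a minimal sketch: the field names (`"src"`, `"tgt"`) and the assumption that t2t-tuner accepts this exact layout are illustrative, not confirmed by the library's docs.

```python
import json

# Illustrative records in the {"translation": {...}} JSON-lines layout used by
# HuggingFace's run_translation.py; field names here are hypothetical.
records = [
    {"translation": {"src": "translate English to German: Hello", "tgt": "Hallo"}},
    {"translation": {"src": "translate English to German: Thank you", "tgt": "Danke"}},
]

with open("train.json", "w", encoding="utf-8") as f:
    for record in records:
        f.write(json.dumps(record, ensure_ascii=False) + "\n")

# Read the file back to confirm it is valid JSON lines.
with open("train.json", encoding="utf-8") as f:
    loaded = [json.loads(line) for line in f]

print(len(loaded))  # number of training examples written
```

The resulting path can then be passed as the `train_file` argument shown in the snippet below.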
## Examples

A simple snippet:

```python
import t2t

trainer_arguments = t2t.TrainerArguments(
    model_name_or_path="t5-small",
    train_file=YOUR_DATASET,
)

trainer = t2t.Trainer(arguments=trainer_arguments)

# Train without validation
trainer.train(valid=False)
```
For more concrete examples, check out the linked notebooks.
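The soft prompt tuning feature listed above follows the idea of prepending trainable prompt embeddings to the input while the model's own weights stay frozen. The following is a framework-free sketch of that idea only; the names, dimensions, and stand-in embedding function are illustrative and are not t2t-tuner's API.

```python
import random

random.seed(0)

EMBED_DIM = 4   # illustrative embedding size
PROMPT_LEN = 3  # number of trainable soft-prompt vectors

# Trainable soft prompt: PROMPT_LEN free vectors, randomly initialised.
# During tuning, only these values would receive gradient updates.
soft_prompt = [[random.uniform(-0.5, 0.5) for _ in range(EMBED_DIM)]
               for _ in range(PROMPT_LEN)]

def embed_tokens(token_ids):
    """Stand-in for the frozen model's embedding table."""
    return [[float(t)] * EMBED_DIM for t in token_ids]

def prepend_soft_prompt(token_ids):
    # The frozen model sees the prompt vectors followed by the ordinary
    # token embeddings, so the prompt acts like extra learned context.
    return soft_prompt + embed_tokens(token_ids)

inputs = prepend_soft_prompt([101, 2054, 102])
print(len(inputs))  # PROMPT_LEN + number of tokens = 6
```

In a real setup the prompt vectors live in the model's embedding space and are optimised by backpropagation, while every original parameter is kept frozen.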
## Development

### Building Package

```shell
python3 -m pip install --upgrade build twine
python3 -m build
python3 -m twine upload dist/*
```