Seamless integration of tasks with huggingface models

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

tasknet

tasknet is an interface between Huggingface datasets and Huggingface Trainer.

Task templates

tasknet relies on task templates to avoid boilerplate codes. The task templates are correspond to Transformers AutoClasses:

SequenceClassification
TokenClassification
MultipleChoice

The task templates follow the same interface. They implement preprocess_function and compute_metrics. Look at tasks.py and use existing templates as a starting point to implement a custom task template.

Instanciating a task

Each task template is associated with specific fields. Classification has two text fields s1,s2, and a label y. Pass a dataset to a template, and fill-in the mapping between the dataset fields and the template fields to instanciate a task.

import tasknet as tn
from datasets import load_dataset

rte = tn.Classification(
    dataset=load_dataset("glue", "rte"),
    s1="sentence1", s2="sentence2", y="label"
)

class args:
  model_name='roberta-base'
  learning_rate = 3e-5 # see https://huggingface.co/docs/transformers/v4.24.0/en/main_classes/trainer#transformers.TrainingArguments

 
tasks = [rte]
model = tn.Model(tasks, args)
trainer = tn.Trainer(model, tasks, args)
trainer.train()

As you can see, tasknet is multitask by design. It works with list of tasks and the model creates a task_models_list attribute.

Installation

pip install tasknet

Additional examples:

Colab:

https://colab.research.google.com/drive/15Xf4Bgs3itUmok7XlAK6EEquNbvjD9BD?usp=sharing

tasknet vs jiant

jiant is another library comparable to tasknet. tasknet is a minimal extension of Trainer centered on task templates, while jiant builds a custom analog of Trainer from scratch called runner. tasknet is leaner and easier to extend. jiant is config-based while tasknet is designed for interative use and scripting.

Credit

This code uses some part of the examples of the transformers library and some code from multitask-learning-transformers.

Contact

You can request features on github or reach me at damien.sileo@inria.fr

@misc{sileod21-tasknet,
  author = {Sileo, Damien},
  doi = {10.5281/zenodo.561225781},
  month = {11},
  title = {{tasknet, multitask interface between Trainer and datasets}},
  url = {https://github.com/sileod/tasknet},
  version = {1.5.0},
  year = {2022}}

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

1.54.0

May 21, 2024

1.53.0

Mar 1, 2024

1.52.0

Nov 2, 2023

1.51.0

Nov 2, 2023

1.50.0

Nov 2, 2023

1.49.0

Sep 19, 2023

1.48.0

Aug 25, 2023

1.47.0

Jun 30, 2023

1.46.0

Jun 30, 2023

1.45.0

Jun 30, 2023

1.44.0

May 4, 2023

1.43.0

Apr 24, 2023

1.42.0

Apr 6, 2023

1.41.0

Mar 27, 2023

1.40.0

Mar 19, 2023

1.39.0

Mar 19, 2023

1.38.0

Mar 14, 2023

1.37.0

Mar 10, 2023

1.36.0

Mar 10, 2023

1.35.0

Feb 27, 2023

1.34.0

Feb 26, 2023

1.33.0

Feb 24, 2023

1.32.0

Feb 23, 2023

1.31.0

Feb 22, 2023

1.30.0

Feb 20, 2023

1.29.0

Feb 20, 2023

1.28.0

Feb 20, 2023

1.27.0

Feb 20, 2023

1.26.0

Feb 17, 2023

1.25.0

Feb 13, 2023

1.24.0

Feb 2, 2023

1.23.0

Feb 2, 2023

1.22.0

Jan 20, 2023

1.21.0

Jan 19, 2023

1.20.0

Jan 13, 2023

1.19.0

Jan 10, 2023

1.18.0

Jan 9, 2023

1.17.0

Dec 24, 2022

1.16.0

Dec 2, 2022

1.15.0

Nov 29, 2022

1.14.0

Nov 23, 2022

1.13.0

Nov 22, 2022

1.12.0

Nov 22, 2022

1.11.0

Nov 16, 2022

1.10.0

Nov 16, 2022

1.9.0

Nov 16, 2022

1.8.0

Nov 16, 2022

This version

1.7.0

Nov 16, 2022

1.6.0

Nov 15, 2022

1.5

Nov 9, 2022

1.4

Nov 9, 2022

1.3

Nov 9, 2022

1.2

Nov 7, 2022

1.1

Nov 7, 2022

1.0.0

Nov 7, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tasknet-1.7.0.tar.gz (17.2 kB view hashes)

Uploaded Nov 16, 2022 Source

Built Distribution

tasknet-1.7.0-py3-none-any.whl (13.0 kB view hashes)

Uploaded Nov 16, 2022 Python 3

Hashes for tasknet-1.7.0.tar.gz

Hashes for tasknet-1.7.0.tar.gz
Algorithm	Hash digest
SHA256	`bb38bf177dc2066f109e1307c7d5e161fd3df5eb12f7838d265a3fd4494428b6`
MD5	`4c1f063e57eee81d147f580f676590ff`
BLAKE2b-256	`f3145e91f98450a630c3287230728abe4b4af39a9ee7e739728b752cd67b1869`

Hashes for tasknet-1.7.0-py3-none-any.whl

Hashes for tasknet-1.7.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`94316d94b6a2febb53bdde036063d6ffdce2d64e300050f46ae99b046951761e`
MD5	`a988fd3714f400c60190c5fabc017d71`
BLAKE2b-256	`02afa07b707c09f6e00544b67380acde3d59a517981dd4500684317c0bbea254`