State-of-the-art Natural Language Processing toolkit for multi-task and transfer learning built on PyTorch.

These details have not been verified by PyPI

Project links

Homepage

Project description

`jiant` is an NLP toolkit

The multitask and transfer learning toolkit for natural language processing research

Why should I use jiant?

jiant supports multitask learning
jiant supports transfer learning
jiant supports 50+ natural language understanding tasks
jiant supports the following benchmarks:
- GLUE
- SuperGLUE
- XTREME
jiant is a research library and users are encouraged to extend, change, and contribute to match their needs!

A few additional things you might want to know about jiant:

jiant is configuration file driven
jiant is built with PyTorch
jiant integrates with datasets to manage task data
jiant integrates with transformers to manage models and tokenizers.

Getting Started

Get started with some simple Examples
Learn more about jiant by reading our Guides
See our list of supported tasks

Installation

To import jiant from source (recommended for researchers):

git clone https://github.com/nyu-mll/jiant.git
cd jiant
pip install -r requirements.txt

# Add the following to your .bash_rc or .bash_profile 
export PYTHONPATH=/path/to/jiant:$PYTHONPATH

If you plan to contribute to jiant, install additional dependencies with pip install -r requirements-dev.txt.

To install jiant from source (alternative for researchers):

git clone https://github.com/nyu-mll/jiant.git
cd jiant
pip install . -e

To install jiant from pip (recommended if you just want to train/use a model):

pip install jiant

We recommended that you install jiant in a virtual environment or a conda environment.

To check jiant was correctly installed, run a simple example.

Quick Introduction

The following example fine-tunes a RoBERTa model on the MRPC dataset.

Python version:

from jiant.proj.simple import runscript as run
import jiant.scripts.download_data.runscript as downloader

EXP_DIR = "/path/to/exp"

# Download the Data
downloader.download_data(["mrpc"], f"{EXP_DIR}/tasks")

# Set up the arguments for the Simple API
args = run.RunConfiguration(
   run_name="simple",
   exp_dir=EXP_DIR,
   data_dir=f"{EXP_DIR}/tasks",
   hf_pretrained_model_name_or_path="roberta-base",
   tasks="mrpc",
   train_batch_size=16,
   num_train_epochs=3
)

# Run!
run.run_simple(args)

Bash version:

EXP_DIR=/path/to/exp

python jiant/scripts/download_data/runscript.py \
    download \
    --tasks mrpc \
    --output_path ${EXP_DIR}/tasks
python jiant/proj/simple/runscript.py \
    run \
    --run_name simple \
    --exp_dir ${EXP_DIR}/ \
    --data_dir ${EXP_DIR}/tasks \
    --hf_pretrained_model_name_or_path roberta-base \
    --tasks mrpc \
    --train_batch_size 16 \
    --num_train_epochs 3

Examples of more complex training workflows are found here.

Contributing

The jiant project's contributing guidelines can be found here.

Looking for `jiant v1.3.2`?

jiant v1.3.2 has been moved to jiant-v1-legacy to support ongoing research with the library. jiant v2.x.x is more modular and scalable than jiant v1.3.2 and has been designed to reflect the needs of the current NLP research community. We strongly recommended any new projects use jiant v2.x.x.

jiant 1.x has been used in in several papers. For instructions on how to reproduce papers by jiant authors that refer readers to this site for documentation (including Tenney et al., Wang et al., Bowman et al., Kim et al., Warstadt et al.), refer to the jiant-v1-legacy README.

Citation

If you use jiant ≥ v2.0.0 in academic work, please cite it directly:

@misc{phang2020jiant,
    author = {Jason Phang and Phil Yeres and Jesse Swanson and Haokun Liu and Ian F. Tenney and Phu Mon Htut and Clara Vania and Alex Wang and Samuel R. Bowman},
    title = {\texttt{jiant} 2.0: A software toolkit for research on general-purpose text understanding models},
    howpublished = {\url{http://jiant.info/}},
    year = {2020}
}

If you use jiant ≤ v1.3.2 in academic work, please use the citation found here.

Acknowledgments

This work was made possible in part by a donation to NYU from Eric and Wendy Schmidt made by recommendation of the Schmidt Futures program, and by support from Intuit Inc.
We gratefully acknowledge the support of NVIDIA Corporation with the donation of a Titan V GPU used at NYU in this work.
Developer Jesse Swanson is supported by the Moore-Sloan Data Science Environment as part of the NYU Data Science Services initiative.

License

jiant is released under the MIT License.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

2.2.0

May 10, 2021

2.1.2

Dec 3, 2020

2.1.1

Nov 8, 2020

2.1.0

Oct 27, 2020

2.0.1

Oct 9, 2020

2.0.0

Oct 7, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jiant-2.2.0.tar.gz (165.3 kB view details)

Uploaded May 10, 2021 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

jiant-2.2.0-py3-none-any.whl (253.0 kB view details)

Uploaded May 10, 2021 Python 3

File details

Details for the file jiant-2.2.0.tar.gz.

File metadata

Download URL: jiant-2.2.0.tar.gz
Upload date: May 10, 2021
Size: 165.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.5

File hashes

Hashes for jiant-2.2.0.tar.gz
Algorithm	Hash digest
SHA256	`6355bcb185eab5c84fe9ceb8cf732efdabc121242cc8e5d61651fb290f4517fb`
MD5	`765ee98f15b86e8adba77bb3d2fae486`
BLAKE2b-256	`7c687a7bf075d2d37e16cff893ff4ce8c24a2460212425faa008e0758ab0db31`

See more details on using hashes here.

File details

Details for the file jiant-2.2.0-py3-none-any.whl.

File metadata

Download URL: jiant-2.2.0-py3-none-any.whl
Upload date: May 10, 2021
Size: 253.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.5

File hashes

Hashes for jiant-2.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8ffe25faf90e4e075d3516e2b7f06b86d1a493165141f82110a1f3dbe93c27e7`
MD5	`b477f62669fb949e7e78519a00e65161`
BLAKE2b-256	`2d6ac6d3e80d7ea2b36d282739bcca39bb99bdeb49e423e9f712705fa2d01850`

See more details on using hashes here.

jiant 2.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

`jiant` is an NLP toolkit

Getting Started

Installation

Quick Introduction

Contributing

Looking for `jiant v1.3.2`?

Citation

Acknowledgments

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

jiant 2.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

jiant is an NLP toolkit

Getting Started

Installation

Quick Introduction

Contributing

Looking for jiant v1.3.2?

Citation

Acknowledgments

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`jiant` is an NLP toolkit

Looking for `jiant v1.3.2`?