Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality.

These details have not been verified by PyPI

Project links

Homepage

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Optimum Graphcore

🤗 Optimum Graphcore is the interface between the 🤗 Transformers library and Graphcore IPUs. It provides a set of tools enabling model parallelization and loading on IPUs, training and fine-tuning on all the tasks already supported by Transformers while being compatible with the Hugging Face Hub and every model available on it out of the box.

What is an Intelligence Processing Unit (IPU)?

Quote from the Hugging Face blog post:

IPUs are the processors that power Graphcore’s IPU-POD datacenter compute systems. This new type of processor is designed to support the very specific computational requirements of AI and machine learning. Characteristics such as fine-grained parallelism, low precision arithmetic, and the ability to handle sparsity have been built into our silicon.

Instead of adopting a SIMD/SIMT architecture like GPUs, Graphcore’s IPU uses a massively parallel, MIMD architecture, with ultra-high bandwidth memory placed adjacent to the processor cores, right on the silicon die.

This design delivers high performance and new levels of efficiency, whether running today’s most popular models, such as BERT and EfficientNet, or exploring next-generation AI applications.

Poplar SDK setup

A Poplar SDK environment needs to be enabled to use this library. Please refer to Graphcore's Getting Started guide.

Install

To install the latest release of this package:

pip install optimum[graphcore]

Optimum Graphcore is a fast-moving project, and you may want to install from source.

pip install git+https://github.com/huggingface/optimum-graphcore.git

Running the examples

There are a number of examples provided in the examples directory. Each of these contains a README with command lines for running them on IPUs with Optimum Graphcore.

Please install the requirements for every example:

cd <example-folder>
pip install -r requirements.txt

How to use it?

🤗 Optimum Graphcore was designed with one goal in mind: make training and evaluation straightforward for any 🤗 Transformers user while leveraging the complete power of IPUs. There are two main classes one needs to know:

IPUTrainer: the trainer class that takes care of compiling the model to run on IPUs, and of performing training and evaluation.
IPUConfig: the class that specifies attributes and configuration parameters to compile and put the model on the device.

The IPUTrainer is very similar to the 🤗 Transformers Trainer, and adapting a script using the Trainer to make it work with IPUs will mostly consists of simply swapping the Trainer class for the IPUTrainer one. That's how most of the example scripts were adapted from their original counterparts.

Original script:

from transformers import Trainer, TrainingArguments

# A lot of code here

# Initialize our Trainer
trainer = Trainer(
    model=model,
    args=training_args,  # Original training arguments.
    train_dataset=train_dataset if training_args.do_train else None,
    eval_dataset=eval_dataset if training_args.do_eval else None,
    compute_metrics=compute_metrics,
    tokenizer=tokenizer,
    data_collator=data_collator,
)

Transformed version that can run on IPUs:

from optimum.graphcore import IPUConfig, IPUTrainer, IPUTrainingArguments

# A lot of the same code as the original script here

# Loading the IPUConfig needed by the IPUTrainer to compile and train the model on IPUs
ipu_config = IPUConfig.from_pretrained(
    training_args.ipu_config_name if training_args.ipu_config_name else model_args.model_name_or_path,
    cache_dir=model_args.cache_dir,
    revision=model_args.model_revision,
    use_auth_token=True if model_args.use_auth_token else None,
)

# Initialize our Trainer
trainer = IPUTrainer(
    model=model,
    ipu_config=ipu_config,
    # The training arguments differ a bit from the original ones, that is why we use IPUTrainingArguments
    args=training_args,
    train_dataset=train_dataset if training_args.do_train else None,
    eval_dataset=eval_dataset if training_args.do_eval else None,
    compute_metrics=compute_metrics,
    tokenizer=tokenizer,
    data_collator=data_collator,
)

Supported Models

The following model architectures and tasks are currently supported by 🤗 Optimum Graphcore:

	Pre-Training	Masked LM	Causal LM	Seq2Seq LM (Summarization, Translation, etc)	Sequence Classification	Token Classification	Question Answering	Multiple Choice	Image Classification
BART	:heavy_check_mark:		✗	:heavy_check_mark:	:heavy_check_mark:		✗
BERT	:heavy_check_mark:	:heavy_check_mark:	✗		:heavy_check_mark:	:heavy_check_mark:	:heavy_check_mark:	:heavy_check_mark:
ConvNeXt	:heavy_check_mark:								:heavy_check_mark:
DeBERTa	✗	✗			:heavy_check_mark:	:heavy_check_mark:	:heavy_check_mark:
GPT-2	:heavy_check_mark:		:heavy_check_mark:		:heavy_check_mark:	:heavy_check_mark:
HuBERT	✗				:heavy_check_mark:
LXMERT	✗						:heavy_check_mark:
RoBERTa	:heavy_check_mark:	:heavy_check_mark:	✗		:heavy_check_mark:	:heavy_check_mark:	:heavy_check_mark:	:heavy_check_mark:
T5	:heavy_check_mark:			:heavy_check_mark:
ViT	✗								:heavy_check_mark:
Wav2Vec2	:heavy_check_mark:

If you find any issue while using those, please open an issue or a pull request.

Project details

These details have not been verified by PyPI

Project links

Homepage

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.7.1

Jul 31, 2023

0.7.0

Jul 13, 2023

0.6.1

May 25, 2023

0.6.0

Apr 5, 2023

0.5.0

Dec 22, 2022

0.4.3

Dec 7, 2022

0.4.2

Nov 23, 2022

0.4.1

Oct 14, 2022

0.4.0

Oct 10, 2022

0.3.2

Aug 10, 2022

This version

0.3.1

Aug 3, 2022

0.3.0

May 31, 2022

0.2.3

Apr 25, 2022

0.2.2

Mar 7, 2022

0.2.1

Feb 23, 2022

0.2.0

Nov 26, 2021

0.1.0

Nov 5, 2021

0.0.1

Oct 11, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

optimum-graphcore-0.3.1.tar.gz (121.3 kB view hashes)

Uploaded Aug 3, 2022 Source

Built Distribution

optimum_graphcore-0.3.1-py3-none-any.whl (150.8 kB view hashes)

Uploaded Aug 3, 2022 Python 3

Hashes for optimum-graphcore-0.3.1.tar.gz

Hashes for optimum-graphcore-0.3.1.tar.gz
Algorithm	Hash digest
SHA256	`becc73e4d46e74580bf5b4f0fb583c1b2e7b03e0d485a1993de857d2dfd32b30`
MD5	`750d92236a8eba467687b2824cc37d14`
BLAKE2b-256	`0236c85a334c82144a8afc2e70789c2c20176c53235b792b9820d3f6d06f8a96`

Hashes for optimum_graphcore-0.3.1-py3-none-any.whl

Hashes for optimum_graphcore-0.3.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6ecc2edd2a9e7de48641902129f3f8fd90abee46cf457e1edf648ff7d4452131`
MD5	`00548544c8333f3aea1226c56d80aadf`
BLAKE2b-256	`31fb23b9dc8e3909a6590f5aa748fe4297fc01503dae91f94d1041d37d2ccb2b`