Implementation of Google Research's "RigL" sparse model training method in PyTorch.

Project description

rigl-torch

Warning: This repository is still in active development; results do not yet match those reported in the RigL paper. Coming soon!

An open-source PyTorch implementation of Google Research's paper Rigging the Lottery: Making All Tickets Winners (RigL), authored by Utku Evci (an AI Resident @ Google Brain), designed to be as versatile, simple, and fast as possible.
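
For context, RigL periodically drops the smallest-magnitude weights in each sparse layer and regrows the same number of connections where the dense gradient's magnitude is largest; the fraction of connections updated decays with a cosine schedule over training. A minimal sketch of that schedule (the function name is mine, not this repo's API):

import math

# fraction of each layer's connections updated at step t
# (cosine annealing from the RigL paper; alpha and T_end match the scheduler args below)
def update_fraction(t, alpha=0.3, T_end=10000):
    return (alpha / 2) * (1 + math.cos(t * math.pi / T_end))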

You only need to add 2 lines of code to your PyTorch project to use RigL to train your model with sparsity!
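
In short, the 2 lines are the scheduler construction and the optimizer-step wrapper (full example below):

pruner = RigLScheduler(model, optimizer, dense_allocation=0.1, T_end=T_end)
# ... inside the training loop ...
if pruner():
    optimizer.step()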

Other Implementations:

  • Google Research's official TensorFlow implementation: https://github.com/google-research/rigl

Setup:

  • Clone this repository: git clone https://github.com/McCrearyD/rigl-torch
  • Change into the repo: cd rigl-torch
  • Install dependencies: pip install -r requirements.txt
  • Install package (-e allows for modifications): pip install -e .
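
To confirm the install worked, a quick smoke test (the module path comes from the usage example below):

# should run without an ImportError
from rigl_torch.RigL import RigLScheduler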

Usage:

  • Run the tests: cd rigl-torch, then run pytest.

  • I have provided some example training scripts, slightly modified to add RigL's functionality: each adds a few parser statements and only the 2 required lines of RigL code! See them, with links to the original scripts, in the repository.

  • OR, more impressively, you can add the pruning power of RigL to your already existing training script with just 2 lines of code! Here is how:

import torch

from rigl_torch.RigL import RigLScheduler

# first, create your model
model = ... # note: only tested on torch.hub's ResNet networks (e.g. resnet18 / resnet50)

# create your dataset/dataloader
dataset = ...
dataloader = ...

# define your optimizer (recommended SGD w/ momentum)
optimizer = ...


# RigL runs best when you allow RigL's topology modifications to run for 75% of the total training iterations (batches)
# so, let's calculate T_end according to this
epochs = 100
total_iterations = len(dataloader) * epochs
T_end = int(0.75 * total_iterations)

# ------------------------------------ REQUIRED LINE # 1 ------------------------------------
# now, create the RigLScheduler object
pruner = RigLScheduler(model,                  # model you created
                       optimizer,              # optimizer (recommended = SGD w/ momentum)
                       dense_allocation=0.1,   # fraction of weights kept dense, a float between 0 and 1 (dense_allocation=0.1 => 90% sparse)
                       T_end=T_end,            # T_end hyperparam within the paper (recommended = 75% * total_iterations)
                       delta=100,              # delta hyperparam within the paper (recommended = 100)
                       alpha=0.3,              # alpha hyperparam within the paper (recommended = 0.3)
                       static_topo=False)      # if True, the topology is frozen; in other words, RigL will not do its job (useful for debugging)
# -------------------------------------------------------------------------------------------

... more code ...

# note: the outer epoch loop is elided here; T_end above counts iterations across all epochs
for data in dataloader:
    # do forward pass, calculate loss, etc.
    ...

    # instead of calling optimizer.step(), wrap it as such:

# ------------------------------------ REQUIRED LINE # 2 ------------------------------------
    if pruner():
# -------------------------------------------------------------------------------------------
        # pruner() returns True on ordinary iterations and False when RigL just
        # performed a topology update, so optimizer.step() is skipped on RigL steps
        optimizer.step()

# at any time you can print the RigLScheduler object to see the sparsity distributions, the number of training steps / RigL steps, etc.!
print(pruner)

# save the model's weights
torch.save(model.state_dict(), 'model.pt')
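
To sanity-check the result, a small helper like the one below (my sketch, not part of the library) measures the fraction of exactly-zero parameters, which should land near 1 - dense_allocation (biases and other unmasked parameters keep it slightly lower):

def measure_sparsity(model):
    # fraction of parameters that are exactly zero (i.e., masked out)
    total = sum(p.numel() for p in model.parameters())
    zeros = sum((p == 0).sum().item() for p in model.parameters())
    return zeros / total

# with dense_allocation=0.1, expect roughly 0.9
print(f"model sparsity: {measure_sparsity(model):.2%}")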

