FastSparse
Customizable Fastai+PyTorch implementation of sparse model training methods (SET, SNFS, RigL).
Warning: this repo is undergoing active development
TODOs:
- test dynamic training callback
- finish documenting this page
- PyTorch example
- under-the-hood explanation
- fully custom example
- Drop/Redist/Grow criterion
- implement distributed training (?)
Install
pip install fastsparse
How to use
Fastai example
With this package, you can train your model using the latest dynamic sparse training techniques. It only takes 4 additional lines of code!
from fastai.vision.all import *
from fastsparse.core import * # <-- import this package
path = untar_data(URLs.MNIST)
dls = ImageDataLoaders.from_folder(path, 'training', 'testing')
learn = cnn_learner(dls, resnet34, metrics=error_rate, pretrained=False)
sparse_hooks = sparsify_model(learn.model, sparsity=0.9) # <-- initial sparsity + enforce masks
cbs = DynamicSparseTrainingCallback(**RigL_kwargs) # <-- dynamic mask updates
learn.fit_one_cycle(1, cbs=cbs)
for h in sparse_hooks: h.remove() # <-- stop enforcing masks
Simply omit the DynamicSparseTrainingCallback to train a fixed-sparsity model as a baseline.
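For instance, a fixed-sparsity baseline run is identical to the example above except for the callback:

sparse_hooks = sparsify_model(learn.model, sparsity=0.9)  # masks stay fixed for the whole run
learn.fit_one_cycle(1)  # no DynamicSparseTrainingCallback
for h in sparse_hooks: h.remove()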
PyTorch example
TODO
Training with Large Batch Sizes
The authors of the Rigged Lottery paper hypothesize that the effectiveness of using gradient magnitude to determine which connections to grow is partly due to their large batch size (4096 for ImageNet). Those without access to multi-GPU clusters can achieve effective batch sizes of this scale by using fastai's GradientAccumulation callback, which has been tested to be compatible with this package's DynamicSparseTrainingCallback.
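For example, reusing learn from the fastai example above, this sketch accumulates gradients until 4096 samples have been seen before each optimizer step, so the per-batch size set in your DataLoaders can stay small:

cbs = [DynamicSparseTrainingCallback(**RigL_kwargs),
       GradientAccumulation(n_acc=4096)]  # effective batch size of 4096
learn.fit_one_cycle(1, cbs=cbs)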
Under-The-Hood
Here's what's going on.
When you run sparsify_model(learn.model, 0.9), this adds sparse masks and registers pre_forward hooks that enforce the masks on the weights during each forward pass.
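Conceptually, mask enforcement looks something like the following standalone sketch (simplified for illustration, not FastSparse's exact code; the weight_mask attribute name is hypothetical):

import torch
import torch.nn as nn

def _apply_mask(module, inputs):
    # called just before forward(): zero out pruned weights in place
    module.weight.data.mul_(module.weight_mask)

layer = nn.Linear(784, 256)
layer.weight_mask = (torch.rand_like(layer.weight) > 0.9).float()  # keep ~10% of weights
hook = layer.register_forward_pre_hook(_apply_mask)
layer(torch.randn(32, 784))  # pruned weights contribute nothing
hook.remove()  # stop enforcing the mask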
By default, a uniform sparsity distribution is used. Change the sparsity distribution to Erdos-Renyi with sparsify_model(learn.model, 0.9, sparse_init_f=erdos_renyi), or pass in your own custom function (see Customization below).
To avoid adding pre_forward hooks, use sparsify_model(learn.model, 0.9, enforce_masks=False).
When you add the DynamicSparseTrainingCallback callback, ... TODO complete section
Customization
There are several places where you can modify the behavior of fastsparse to implement custom variants. For an example, check out the implementation of RigL.
1. Initial sparsity distribution:
Define your own initial sparsity distribution by setting sparse_init_f in sparsify_model to a custom function. For example, this function (included in the library) makes the first layer dense and sets all following layers to a fixed sparsity.
def first_layer_dense_uniform(params:list, model_sparsity:float):
    # first entry keeps the first layer dense; remaining layers share the model-wide sparsity
    sparsities = [1.] + [model_sparsity] * (len(params) - 1)
    return sparsities
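Then pass it in when sparsifying (assuming the same sparse_init_f keyword shown in the Under-The-Hood section):

sparse_hooks = sparsify_model(learn.model, sparsity=0.9, sparse_init_f=first_layer_dense_uniform)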
2. Drop Criterion
...
3. Redistribute Criterion
...
4. Grow Criterion
...