Skip to main content

Condensa Programmable Model Compression Framework

Project description

A Programming System for Model Compression

Condensa is a framework for programmable model compression in Python. It comes with a set of built-in compression operators which may be used to compose complex compression schemes targeting specific combinations of DNN architecture, hardware platform, and optimization objective. To recover any accuracy lost during compression, Condensa uses a constrained optimization formulation of model compression and employs an Augmented Lagrangian-based algorithm as the optimizer.

Status: Condensa is under active development, and bug reports, pull requests, and other feedback are all highly appreciated. See the contributions section below for more details on how to contribute.

Supported Operators and Schemes

Condensa provides the following set of pre-built compression schemes:

The schemes above are built using one or more compression operators, which may be combined in various ways to define your own custom schemes.

Please refer to the documentation for a detailed description of available operators and schemes.

Prerequisites

Condensa requires:

  • A working Linux installation (we use Ubuntu 18.04)
  • NVIDIA drivers and CUDA 10+ for GPU support
  • Python 3.5 or newer
  • PyTorch 1.0 or newer

Installation

The most straightforward way of installing Condensa is via pip:

pip install condensa

Installation from Source

Retrieve the latest source code from the Condensa repository:

git clone https://github.com/NVlabs/condensa.git

Navigate to the source code directory and run the following:

pip install -e .

Test out the Installation

To check the installation, run the unit test suite:

bash run_all_tests.sh -v

Getting Started

The AlexNet Notebook contains a simple step-by-step walkthrough of compressing a pre-trained model using Condensa. Check out the examples folder for additional, more complex examples of using Condensa (note: some examples require the torchvision package to be installed).

Documentation

Documentation is available here. Please also check out the Condensa paper for a detailed description of Condensa's motivation, features, and performance results.

Contributing

We appreciate all contributions, including bug fixes, new features and documentation, and additional tutorials. You can initiate contributions via Github pull requests. When making code contributions, please follow the PEP 8 Python coding standard and provide unit tests for the new features. Finally, make sure to sign off your commits using the -s flag or adding Signed-off-By: Name<Email> in the commit message.

Citing Condensa

If you use Condensa for research, please consider citing the following paper:

@article{condensa2019,
    title = {A Programmable Approach to Model Compression},
    author = {Joseph, Vinu and Muralidharan, Saurav and Garg, Animesh and Garland, Michael and Gopalakrishnan, Ganesh},
    journal = {CoRR},
    volume = {1911.02497},
    year = {2019},
    url = {https://arxiv.org/abs/1911.02497}
}

Disclaimer

Condensa is a research prototype and not an official NVIDIA product. Many features are still experimental and yet to be properly documented.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

condensa-0.5.0b0.tar.gz (21.1 kB view details)

Uploaded Source

Built Distribution

condensa-0.5.0b0-py3-none-any.whl (36.0 kB view details)

Uploaded Python 3

File details

Details for the file condensa-0.5.0b0.tar.gz.

File metadata

  • Download URL: condensa-0.5.0b0.tar.gz
  • Upload date:
  • Size: 21.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.3

File hashes

Hashes for condensa-0.5.0b0.tar.gz
Algorithm Hash digest
SHA256 e9b1b2a18ec9c26890ab2d0438401b3371b90b661b6d01b116ef073b48e92513
MD5 56dfc184ab75f4d82541447d1a0424d2
BLAKE2b-256 0d7be2a6d20b2403d3d07d5eb731fc27c66fe4e54011afa33f1bb5dac30fd83d

See more details on using hashes here.

File details

Details for the file condensa-0.5.0b0-py3-none-any.whl.

File metadata

  • Download URL: condensa-0.5.0b0-py3-none-any.whl
  • Upload date:
  • Size: 36.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.3

File hashes

Hashes for condensa-0.5.0b0-py3-none-any.whl
Algorithm Hash digest
SHA256 0a2894ffd86423d0421f9501fe4fb14391caf95bd6e8fb1939eade7f7c03b6a5
MD5 ee3ea7aca66876c296bde856a9c64e0e
BLAKE2b-256 214eaa3898a90c55013f75af91cdc556266aa7d6b716db8a536c21b03d2592b3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page