NEMO (NEural Minimizer for pytOrch)

NEMO (NEural Minimizer for pytOrch) is a small library for minimizing Deep Neural Networks developed in PyTorch, aimed at deploying them on ultra-low-power, highly memory-constrained platforms, in particular (but not exclusively) PULP-based microcontrollers. NEMO features include:

  • deployment-related transformations such as BatchNorm folding, bias removal, weight equalization
  • collection of statistics on activations and weights
  • post-training quantization
  • quantization-aware fine-tuning, with partially automated precision relaxation
  • mixed-precision quantization
  • bit-accurate deployment model
  • export to ONNX
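
As an illustration of the first feature in the list, BatchNorm folding can be sketched in plain PyTorch. This is a minimal sketch and not NEMO's own implementation: the helper name `fold_batchnorm` is hypothetical, and it assumes a `Conv2d` followed by an affine `BatchNorm2d` in eval mode with the default zero padding mode.

```python
import torch
import torch.nn as nn


def fold_batchnorm(conv: nn.Conv2d, bn: nn.BatchNorm2d) -> nn.Conv2d:
    """Return a new Conv2d equivalent to `bn(conv(x))` in eval mode.

    Hypothetical helper for illustration; not part of the NEMO API.
    """
    folded = nn.Conv2d(
        conv.in_channels, conv.out_channels, conv.kernel_size,
        stride=conv.stride, padding=conv.padding, dilation=conv.dilation,
        groups=conv.groups, bias=True,
    )
    with torch.no_grad():
        # Per-output-channel scale that BatchNorm applies at inference time.
        scale = bn.weight / torch.sqrt(bn.running_var + bn.eps)
        conv_bias = (conv.bias if conv.bias is not None
                     else torch.zeros(conv.out_channels))
        # Scale each output filter, then absorb the mean and shift into the bias.
        folded.weight.copy_(conv.weight * scale.view(-1, 1, 1, 1))
        folded.bias.copy_((conv_bias - bn.running_mean) * scale + bn.bias)
    return folded


# Usage: compare the folded layer against the original conv + BN pair.
torch.manual_seed(0)
conv = nn.Conv2d(3, 8, kernel_size=3, padding=1, bias=False)
bn = nn.BatchNorm2d(8)
with torch.no_grad():
    # Give BatchNorm non-trivial statistics so the check is meaningful.
    bn.running_mean.uniform_(-1.0, 1.0)
    bn.running_var.uniform_(0.5, 2.0)
    bn.weight.uniform_(0.5, 1.5)
    bn.bias.uniform_(-0.5, 0.5)
conv.eval()
bn.eval()

with torch.no_grad():
    x = torch.randn(2, 3, 16, 16)
    reference = bn(conv(x))
    folded_out = fold_batchnorm(conv, bn)(x)
    max_abs_error = (reference - folded_out).abs().max().item()
print("max abs error after folding:", max_abs_error)
```

After folding, the BatchNorm layer no longer needs to be executed or stored on the target device, which is why this kind of transformation is useful before quantization and deployment.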

NEMO operates on three different "levels" of quantization-aware DNN representations, all built upon torch.nn.Module and torch.autograd.Function:

  • fake-quantized FQ: replaces regular activations (e.g., ReLU) with quantization-aware ones (PACT) and quantizes weights dynamically (with linear PACT-like quantization), maintaining full trainability (similar to the native PyTorch support, but not based on it).
  • quantized-deployable QD: replaces all functions with deployment-equivalent versions, trading off trainability for a more accurate representation of numerical behavior on real hardware.
  • integer-deployable ID: replaces all activation and weight tensors used along the network with integer-based ones. It aims at a bit-accurate representation of actual hardware behavior.

All the quantized representations support mixed-precision weights (signed and asymmetric) and activations (unsigned). The current version of NEMO targets per-layer quantization; work on per-channel quantization is in progress.
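
To make the levels above more concrete, the following is a minimal sketch of a PACT-style fake-quantized activation written as a `torch.autograd.Function`, together with the integer image that an integer-based representation would carry. It is not NEMO's code: the class name `PACTFakeQuant`, the use of rounding to the nearest level, and the chosen values of `alpha` and `bits` are assumptions made for illustration.

```python
import torch


class PACTFakeQuant(torch.autograd.Function):
    """Clip to [0, alpha], quantize uniformly, pass gradients straight through.

    Illustrative sketch only; rounding to nearest is an assumption.
    """

    @staticmethod
    def forward(ctx, x, alpha, bits):
        ctx.save_for_backward(x, alpha)
        eps = alpha / (2 ** bits - 1)  # quantization step (quantum)
        clipped = torch.min(torch.clamp(x, min=0.0), alpha)
        # The output is still floating point, but restricted to multiples of eps.
        return torch.round(clipped / eps) * eps

    @staticmethod
    def backward(ctx, grad_out):
        x, alpha = ctx.saved_tensors
        # Straight-through estimator: identity gradient inside the clip range.
        inside = ((x >= 0) & (x <= alpha)).to(grad_out.dtype)
        grad_x = grad_out * inside
        # Inputs clipped at the top contribute to the gradient of alpha.
        grad_alpha = (grad_out * (x > alpha).to(grad_out.dtype)).sum()
        return grad_x, grad_alpha, None


# Usage: 2-bit unsigned activation with a trainable clipping threshold.
bits = 2
alpha = torch.tensor(3.0, requires_grad=True)
x = torch.tensor([-1.0, 0.4, 1.6, 2.2, 5.0], requires_grad=True)

y = PACTFakeQuant.apply(x, alpha, bits)
y.sum().backward()

# Integer image: the real value is recovered as eps * int_image.
eps = alpha.detach() / (2 ** bits - 1)
int_image = torch.round(y.detach() / eps).to(torch.int64)

print("fake-quantized:", y.detach().tolist())
print("integer image: ", int_image.tolist())
print("grad wrt x:    ", x.grad.tolist())
print("grad wrt alpha:", alpha.grad.item())
```

The fake-quantized output stays in floating point and remains trainable, while the integer image with its quantum `eps` is the kind of information an integer-only representation works with.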

NEMO is organized as a Python library that can be applied with relatively small changes to an existing PyTorch-based script or training framework.

Installation and requirements

The NEMO library currently supports PyTorch >= 1.3.1 and runs on Python >= 3.5. To install it from PyPI, just run:

pip install pytorch-nemo

You can also install a development (editable) version of NEMO by cloning this repository:

git clone https://github.com/pulp-platform/nemo
cd nemo
pip install -e .

Then, you can import it in your script using

import nemo
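
To confirm which release pip actually installed, the distribution metadata can be queried from the standard library. This is a small sketch: the helper name `installed_version` is hypothetical, and `importlib.metadata` requires Python 3.8 or newer, which is more recent than the minimum Python version stated above.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str = "pytorch-nemo") -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


version = installed_version()
if version is None:
    print("pytorch-nemo is not installed in this environment")
else:
    print("pytorch-nemo version:", version)
```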

Example

Documentation

Full documentation for NEMO is under development (see doc folder). You can find a technical report covering the deployment-aware quantization methodology here: https://arxiv.org/abs/2004.05930

License

NEMO is released under the Apache 2.0 license; see the LICENSE file in the root of this repository for details.

Acknowledgements

NEMO is an outcome of the ALOHA Project, funded under the European Commission's Horizon 2020 Research and Innovation Programme, grant agreement no. 780788.

