Skip to main content

A Model Compression Toolkit for neural networks

Project description

Model Compression Toolkit (MCT)

tests

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware.
This project provides researchers, developers and engineers tools for optimizing and deploying state-of-the-art neural network on efficient hardware.
Specifically, this project applies constrained quantization and pruning schemes on a neural network.

Currently, this project only supports hardware friendly post training quantization (HPTQ) with Tensorflow 2 [1].

The MCT project is developed by researchers and engineers working at Sony Semiconductors Israel.

For more information, please visit our project website.

Table of Contents

Getting Started

This section provides a quick starting guide. We begin with installtion via source code or pip server. Then, we provide a short usage example.

Installation

See the MCT install guide for the pip package, and build from source.

From Source

git clone https://github.com/sony/model_optimization.git
python setup.py install

From PyPi - latest stable release

pip install model-compression-toolkit

A nightly package is also available (unstable):

pip install mct-nightly

To run MCT, one of the supported frameworks, Tenosflow/Pytorch, needs to be installed.

For using with Tensorflow please install the packages: tensorflow, tensorflow-model-optimization

For using with Pytorch (experimental) please install the packages: torch

Usage Example

For an example of how to use the post training quantization, using Keras, please use this link.

For an example using Pytorch (experimental), please use this link.

For more examples please see the tutorials' directory.

Supported Features

Quantization:

* Post Training Quantization for Keras models.
* Post Training Quantization for Pytorch models (experimental).
* Gradient based post training (Experimental, Keras only).
* Mixed-precision post training quantization (Experimental, Keras only).

Tensorboard Visualization (Experimental):

* CS Analyizer: compare a model compressed with the orignal model to analyze large accuracy drops.
* Activation statisicis and errors

MCT is tested with Tensorflow Version 2.7.

Results

Keras

As part of the MCT library, we have a set of example networks on image classification. These networks can be used as examples when using the package.

  • Image Classification Example with MobileNet V1 on ImageNet dataset
Network Name Float Accuracy 8Bit Accuracy Comments
MobileNetV1 [2] 70.558 70.418

For more results please see [1]

Pytorch

We quantized classification networks from the torchvision library. In the following table we present the ImageNet validation results for these models:

Network Name Float Accuracy 8Bit Accuracy
MobileNet V2 [3] 71.886 71.804
ResNet-18 [3] 69.86 69.74
SqueezeNet 1.1 [3] 58.128 57.848

Contributions

MCT aims at keeping a more up-to-date fork and welcomes contributions from anyone.

*You will find more information about contributions in the Contribution guide.

License

Apache License 2.0.

References

[1] Habi, H.V., Peretz, R., Cohen, E., Dikstein, L., Dror, O., Diamant, I., Jennings, R.H. and Netzer, A., 2021. HPTQ: Hardware-Friendly Post Training Quantization. arXiv preprint.

[2] MobilNet from Keras applications.

[3] TORCHVISION.MODELS

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mct-nightly-1.1.1.7022022.post2355.tar.gz (142.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mct_nightly-1.1.1.7022022.post2355-py3-none-any.whl (273.7 kB view details)

Uploaded Python 3

File details

Details for the file mct-nightly-1.1.1.7022022.post2355.tar.gz.

File metadata

  • Download URL: mct-nightly-1.1.1.7022022.post2355.tar.gz
  • Upload date:
  • Size: 142.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.12

File hashes

Hashes for mct-nightly-1.1.1.7022022.post2355.tar.gz
Algorithm Hash digest
SHA256 c4163eded7251874e1b91f6b4806053176d156e756a1ca223e3d42c3fc65478f
MD5 86a1b8bf9db4516bf633c0747c17e15d
BLAKE2b-256 d0ef20495218e89f7a5c3fb95423a9062a50845e05ec77a2691d63f436424d9e

See more details on using hashes here.

File details

Details for the file mct_nightly-1.1.1.7022022.post2355-py3-none-any.whl.

File metadata

  • Download URL: mct_nightly-1.1.1.7022022.post2355-py3-none-any.whl
  • Upload date:
  • Size: 273.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.12

File hashes

Hashes for mct_nightly-1.1.1.7022022.post2355-py3-none-any.whl
Algorithm Hash digest
SHA256 38ae4706616671ebf513b27177114a1f50c79ecfaaf781f69ff0ab58be50cb78
MD5 6ab5254ff96a5797c18fc9bca7008df7
BLAKE2b-256 b063cab42a028edf2e94026642c4eca2fc2b337cfd533af4ca51c9157d29af44

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page