Skip to main content

A Model Compression Toolkit for neural networks

Project description

Model Compression Toolkit (MCT)

tests

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient hardware constrained.
This project enables researchers, developers and engineers an easy way to optimized and deploy state-of-the-art neural network on efficient hardware.
Specifically this project apply constrained quantization and pruning scheme on a neural network.

Currently, this project only support hardware friendly post training quantization (HPTQ) with Tensorflow 2 [1].

MCT project is developed by researchers and engineers working on the Sony Semiconductors Israel.

Table of Contents

Getting Started

This section provides a quick starting guide start with installtion via source code or pip server then a short example usage.

Installation

See the MCT install guide for the pip package, and build from source.

From Source

git clone https://github.com/sony/model_optimization.git
python setup.py install

Example Usage

Here is a snapshot of a code that shown an example of how to use the post training quantization using keras.

import model_compression_toolkit as mct

# Set the batch size of the images at each calibration iteration.
batch_size = 50

# Create a representative data generator, which returns a list of images.
# Load a folder of images. 
folder = '/path/to/images/folder'

# The images can be preprocessed using a list of preprocessing functions.
def normalization(x):
    return (x - 127.5) / 127.5

# Create a FolderImageLoader instance which loads the images, preprocess them and enables you to sample batches of them.
image_data_loader = mct.FolderImageLoader(folder,
                                          preprocessing=[normalization],
                                          batch_size=batch_size)

# Create a Callable representative dataset for calibration purposes.
# The function should be called without any arguments, and should return a list numpy arrays (array for each model's input).
# For example: if the model has two input tensors - one with input shape of 32X32X3 and the second with input 
# shape of 224X224X3, and we calibrate the model using batches of 20 images,
# calling representative_data_gen() should return a list 
# of two numpy.ndarray objects where the arrays' shapes are [(20, 32, 32, 3), (20, 224, 224, 3)].
def representative_data_gen() -> list:
        return [image_data_loader.sample()]


# Create a model and quantize it using the representative_data_gen as the calibration images.
# Set the number of calibration iterations to 10.
quantized_model, quantization_info = mct.keras_post_training_quantization(model,
                                                                          representative_data_gen,
                                                                          n_iter=10)

For more example please see the tutorials' directory.

For more information, please visit out project website.

Supported Features

Quantization:

* Post Training Quantization 
* Gradient base post training using knowledge distillation (Experimental) 

Tensorboard Visualization (Experimental):

* CS Analyizer: compare comprased model with orignal model to analysis large accuracy drop.
* Activation statisicis and errors

Note that currently we only have full support for Keras layers, using the TensorFlow native layer may lead to unexpected behavior. This limitation will be removed in future releases.

MCT is test with Tensorflow Version 2.5.

Tutorials and Results

As part of the MCT library, we have a set of example network on image classification which can be used as an example while using the package.

  • Image Classification Example with MobileNet V1 on ImageNet dataset
Network Name Float Accuracy 8Bit Accuracy Comments
MobileNetV1 [2] 70.558 70.418

For more results please see [1]

Contributions

MCT aims at keeping a more up-to-date fork and welcomes contributions from anyone.

*You will find more information about contributions in the Contribution guide.

License

Apache License 2.0.

Refernce

[1] Habi, H.V., Peretz, R., Cohen, E., Dikstein, L., Dror, O., Diamant, I., Jennings, R.H. and Netzer, A., 2021. HPTQ: Hardware-Friendly Post Training Quantization. arXiv preprint.

[2] MobilNet from Keras applications.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mct-nightly-1.0.0.15112021-003056.tar.gz (91.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mct_nightly-1.0.0.15112021.post3056-py3-none-any.whl (176.3 kB view details)

Uploaded Python 3

File details

Details for the file mct-nightly-1.0.0.15112021-003056.tar.gz.

File metadata

  • Download URL: mct-nightly-1.0.0.15112021-003056.tar.gz
  • Upload date:
  • Size: 91.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.8.2 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.12

File hashes

Hashes for mct-nightly-1.0.0.15112021-003056.tar.gz
Algorithm Hash digest
SHA256 51b6e94cf3fd6cd02bee7041e4607152c8dc88df555120047018b3559b084c65
MD5 6dca7756aaef3cb2426d8b94ada113e7
BLAKE2b-256 e8a163a3d6908f40a055b902a9c2c0458926af3214912fb7607dfcf2d220bd6f

See more details on using hashes here.

File details

Details for the file mct_nightly-1.0.0.15112021.post3056-py3-none-any.whl.

File metadata

  • Download URL: mct_nightly-1.0.0.15112021.post3056-py3-none-any.whl
  • Upload date:
  • Size: 176.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.8.2 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.12

File hashes

Hashes for mct_nightly-1.0.0.15112021.post3056-py3-none-any.whl
Algorithm Hash digest
SHA256 d5a9c285f997f371d1cc613cc8c9734734da92feed6abf98008ab7dea2d3450f
MD5 6cd01c74f5ee76a79b6d20d8d17a38c0
BLAKE2b-256 e74622300d98b4880f3592ab0fda247813f0a262e304515928dc96b4379f99f8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page