Femtosense Model Optimization Toolkit

fmot

The Femtosense Model Optimization Toolkit (fmot) quantizes neural network models for deployment on Femtosense hardware.

Installation

git clone https://github.com/femtosense/fmot.git
cd fmot
pip install -e .
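
You can confirm the editable install from Python (printing fmot.__version__ is an assumption; the attribute may not be exposed in every release):

import fmot
print(fmot.__version__)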

Quantizing Models

Define your PyTorch model however you like. Once the model has been trained, it can be converted to the fmot.qat format by calling fmot.convert.convert_torch_to_qat. The resulting qat model is initially unquantized. To quantize it, pass the model, along with an iterable of sample inputs, to fmot.qat.control.quantize. These sample inputs are used to select an optimal quantization configuration. The resulting quantized model simulates fixed-point integer arithmetic exactly as it will be performed on Femtosense hardware.

import torch
import fmot

class MyModel(torch.nn.Module):
    def __init__(self, din, dout):
        super().__init__()
        weights = torch.rand(din, dout)
        self.weight = torch.nn.Parameter(weights)
        self.linear = torch.nn.Linear(dout, dout)

    def forward(self, x):
        x = torch.matmul(x, self.weight)
        x = x.relu()
        x = self.linear(x)
        x = torch.sigmoid(x)
        return x

model = MyModel(128, 256)

### TRAINING GOES HERE

# Convert the trained model to qat format
quant_model = fmot.convert.convert_torch_to_qat(model)
# Provide a set of sample inputs to choose an optimal quantization scheme
quant_model = fmot.qat.control.quantize(quant_model, [torch.randn(16, 128) for __ in range(20)])
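
The quantized model is still a PyTorch module, so it should be callable directly on new inputs to inspect its fixed-point behavior (a minimal sketch; the input shape follows the sample inputs above):

# Run the quantized model on a fresh batch; the outputs reflect the
# simulated fixed-point arithmetic described above.
with torch.no_grad():
    y = quant_model(torch.randn(16, 128))
print(y.shape)  # expected: torch.Size([16, 256])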


Fine-Tuning Quantized Models

Setting Custom Bitwidths

Emitting FQIR

Once your model has been quantized, it can be exported to FQIR, Femtosense's quantized intermediate representation, for deployment on Femtosense hardware.

Building and Viewing Sphinx Documentation

First, let's install Sphinx. On macOS:

brew install sphinx-doc

On other platforms, Sphinx can be installed with pip (pip install sphinx) or your system package manager.

Now, let's install some dependencies with pip:

cd docs
pip install -r requirements.txt

You can now build the documentation by running

make html

This documentation can be viewed in your browser with Open File (⌘O). Navigate to

{fmot_base}/docs/_build/html/index.html

Running Tests

Pruning Weight Matrices

Sparsifying Activations

Using Custom Layers
