High Granularity Quantization

Project description

High Granularity Quantization

License: Apache 2.0

HGQ is a framework for quantization-aware training of neural networks to be deployed on FPGAs. It allows the bitwidth to be optimized individually for each weight and each activation.
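To make "per-weight bitwidth" concrete, here is a minimal sketch in plain Python. It is not HGQ's API or training procedure: the helper names (`quantize_fixed`, `quantize_per_weight`) are hypothetical, and the bitwidths are picked by hand for illustration, whereas HGQ optimizes them during training. It only shows what it means for each weight to carry its own number of fractional bits.

```python
import math

def quantize_fixed(x, int_bits, frac_bits):
    """Round x to a signed fixed-point grid (hypothetical helper, not HGQ API).

    The grid has step 2**-frac_bits and covers [-2**int_bits, 2**int_bits - step].
    """
    step = 2.0 ** -frac_bits
    lo = -(2.0 ** int_bits)
    hi = 2.0 ** int_bits - step
    q = math.floor(x / step + 0.5) * step  # round to the nearest grid point
    return min(max(q, lo), hi)             # saturate at the representable range

def quantize_per_weight(weights, frac_bits, int_bits=1):
    """Quantize each weight with its own number of fractional bits."""
    return [quantize_fixed(w, int_bits, f) for w, f in zip(weights, frac_bits)]

weights = [0.8125, -0.30, 0.05, 0.49]
frac_bits = [4, 2, 0, 1]  # hand-picked here; an assumption for illustration only

quantized = quantize_per_weight(weights, frac_bits)

# Width per weight: 1 sign bit + 1 integer bit + its own fractional bits.
total_bits = sum(1 + 1 + f for f in frac_bits)
uniform_bits = len(weights) * (1 + 1 + max(frac_bits))

print(quantized)                 # [0.8125, -0.25, 0.0, 0.5]
print(total_bits, uniform_bits)  # 15 24
```

In this toy case, giving every weight the widest format would cost 24 bits in total, while the per-weight assignment costs 15. The weight quantized with zero fractional bits rounds to 0.0.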

Depending on the application, HGQ can achieve up to a 10x reduction in resource usage compared to the traditional AutoQKeras approach while maintaining the same accuracy. For more challenging tasks, where the model is already under-fitted, HGQ can still improve performance at the same on-board resource consumption. For more details, please refer to our paper (link coming not too soon).

This repository implements HGQ for tensorflow.keras models. It is independent of the QKeras project.

Warning:

This framework requires an unmerged PR of hls4ml. Please install it by running pip install "git+https://github.com/calad0i/hls4ml@HGQ-integration". Otherwise, conversion will fail with an unsupported layer error.

This package is still under development. Any API might change without notice at any time!
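As a quick sanity check before attempting a conversion, the snippet below tests whether an hls4ml module is importable at all. It is only a sketch: `is_installed` is a hypothetical helper, and it cannot tell whether the installed hls4ml is the HGQ-integration branch or a regular release.

```python
import importlib.util

def is_installed(module_name):
    """Return True if a top-level module can be found on the import path."""
    return importlib.util.find_spec(module_name) is not None

if not is_installed("hls4ml"):
    # The install command is the one given in the warning above.
    print('hls4ml not found; install it with: '
          'pip install "git+https://github.com/calad0i/hls4ml@HGQ-integration"')
```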

Download files

Source Distribution

HGQ-0.2.0b2.tar.gz (88.3 kB)

Uploaded Source

Built Distribution

HGQ-0.2.0b2-py3-none-any.whl (42.7 kB)

Uploaded Python 3
