High Granularity Quantization 2

These details have been verified by PyPI

Project links

repository

GitHub Statistics

Maintainers

Calad0i

These details have not been verified by PyPI

Project description

HGQ2: High Granularity Quantization 2

HGQ2 (High Granularity Quantization 2) is a quantization-aware training framework built on Keras v3, targeting real-time deep learning applications on edge devices like FPGAs. It provides a comprehensive set of tools for creating and training quantized neural networks with minimal effort.

HGQ2 implements an gradient-based automatic bitwidth optimization and quantization-aware training algorithm. By laveraging gradients, it allows for bitwidth optimization at arbitrary granularity, up to per-weight and per-activation level.

High Granularity: HGQ supports per-weight and per-activation bitwidth optimization, or any other lower granularity.
Automatic Quantization: Bit-widths are optimized via gradients, no need to manually tune them in general.
What you see is what you get: One get exactly what you get from Keras models from RTL models.
- still subject to machine float precision limitation.
Accurate Resource Estimation: EBOPs estimated by HGQ gives a good indication of the actual resource usage on FPGA, either upper limit of LUT (da4ml) or LUT + 55 * DSP (hls4ml).

In addition, this framework improves upon the old HGQ implementation in the following aspects:

Scalability: HGQ2 supports TensorFlow, JAX, and PyTorch. As XLA compilation inJAX and TensorFlow can significantly speed up the training process. Training speed on HGQ2 can be 1.2-5 times faster than the previous implementation.
Quantizers:
- Fixed-point: While the last implementation only optimizes the number of floating bits with one way of parameterizing the fixed-point numbers, HGQ2 supports multiple ways of parametrizing them, and allows of optimizing any part of them via gradients.
- Minifloat: Training with minifloat quantization is supported, also with surrogate gradients support (alpha quality).
More Layers: More layers are supported now, including the powerful EinsumDense(BatchNorm) layer and the MultiHeadAttention layer with bit-accurate softmax and scaled dot-product attention.

Installation

pip install HGQ2

If you are using da4ml, please make sure it is at least version 0.6:

pip install da4ml>=0.6

If you are using hls4ml, please make sure it is at least version 1.2:

pip install hls4ml>=1.2.0

Usage

Please refer to the documentation for more details on how to use the library.

A minimal example is shown below:

   import keras
   from hgq.layers import QDense, QConv2D
   from hgq.config import LayerConfigScope, QuantizerConfigScope

   # Setup quantization configuration
   # These values are the defaults, just for demonstration purposes here
   with (
      # Configuration scope for setting the default quantization type and overflow mode
      # The second configuration scope overrides the first one for the 'datalane' place
      QuantizerConfigScope(place='all', default_q_type='kbi', overflow_mode='SAT_SYM'),
      # Configuration scope for enabling EBOPs and setting the beta0 value
      QuantizerConfigScope(place='datalane', default_q_type='kif', overflow_mode='WRAP'),
      LayerConfigScope(enable_ebops=True, beta0=1e-5),
   ):
      model = keras.Sequential([
         QConv2D(32, (3, 3), activation='relu'),
         keras.layers.MaxPooling2D((2, 2)),
         keras.layers.Flatten(),
         QDense(10)
      ])

Citation

If you use HGQ2 in your research, please consider citing the following paper:

@inproceedings{hgq,
author = {Sun, Chang and Que, Zhiqiang and Aarrestad, Thea and Loncar, Vladimir and Ngadiuba, Jennifer and Luk, Wayne and Spiropulu, Maria},
title = {HGQ: High Granularity Quantization for Real-time Neural Networks on FPGAs},
year = {2026},
isbn = {9798400720796},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3748173.3779200},
doi = {10.1145/3748173.3779200},
booktitle = {Proceedings of the 2026 ACM/SIGDA International Symposium on Field Programmable Gate Arrays},
pages = {79–91},
numpages = {13},
keywords = {quantization-aware training, fpga, real-time inference, neural networks, hardware-software codesign, low-latency, quantization},
location = {USA},
series = {FPGA '26}
}

Project details

These details have been verified by PyPI

Project links

repository

GitHub Statistics

Maintainers

Calad0i

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.9

May 27, 2026

0.1.8

Mar 13, 2026

0.1.7

Feb 12, 2026

0.1.6

Jan 12, 2026

0.1.5

Dec 11, 2025

0.1.4

Nov 18, 2025

0.1.3

Nov 9, 2025

0.1.2

Sep 18, 2025

0.1.1

Sep 1, 2025

0.1.0

Jul 23, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hgq2-0.1.9.tar.gz (201.3 kB view details)

Uploaded May 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

hgq2-0.1.9-py3-none-any.whl (111.4 kB view details)

Uploaded May 27, 2026 Python 3

File details

Details for the file hgq2-0.1.9.tar.gz.

File metadata

Download URL: hgq2-0.1.9.tar.gz
Upload date: May 27, 2026
Size: 201.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for hgq2-0.1.9.tar.gz
Algorithm	Hash digest
SHA256	`865970ae20f97edd6ea69246a2beb66ad1cfce31a40e9599c92f8ef6e61ad4ad`
MD5	`9f2d4fc68a59e6323c21b50733309599`
BLAKE2b-256	`c6f839b61fc409aaf9918ab0800d8ce489918cf9a93473b69bdcdfc46b91722e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for hgq2-0.1.9.tar.gz:

Publisher: python-publish.yml on calad0i/HGQ2

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: hgq2-0.1.9.tar.gz
- Subject digest: 865970ae20f97edd6ea69246a2beb66ad1cfce31a40e9599c92f8ef6e61ad4ad
- Sigstore transparency entry: 1647749310
- Sigstore integration time: May 27, 2026
Source repository:
- Permalink: calad0i/HGQ2@4f0a45d1c0132efd61764fdb97d86153f331bc23
- Branch / Tag: refs/tags/v0.1.9
- Owner: https://github.com/calad0i
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@4f0a45d1c0132efd61764fdb97d86153f331bc23
- Trigger Event: release

File details

Details for the file hgq2-0.1.9-py3-none-any.whl.

File metadata

Download URL: hgq2-0.1.9-py3-none-any.whl
Upload date: May 27, 2026
Size: 111.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for hgq2-0.1.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`749754214b97f2531e1f8e609308755fe857064f47d350e9ffb878fb5017c43c`
MD5	`82d84c4dfb628c0ca24b5ddc411c4227`
BLAKE2b-256	`b4c74c7b7dd5707a421a4aef17d0d5ba74d0304f426f5abd78ea8aef541eef4b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for hgq2-0.1.9-py3-none-any.whl:

Publisher: python-publish.yml on calad0i/HGQ2

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: hgq2-0.1.9-py3-none-any.whl
- Subject digest: 749754214b97f2531e1f8e609308755fe857064f47d350e9ffb878fb5017c43c
- Sigstore transparency entry: 1647749423
- Sigstore integration time: May 27, 2026
Source repository:
- Permalink: calad0i/HGQ2@4f0a45d1c0132efd61764fdb97d86153f331bc23
- Branch / Tag: refs/tags/v0.1.9
- Owner: https://github.com/calad0i
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@4f0a45d1c0132efd61764fdb97d86153f331bc23
- Trigger Event: release

hgq2 0.1.9

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

HGQ2: High Granularity Quantization 2

Installation

Usage

Citation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance