Quantization-aware training in PyTorch

Project description

Brevitas

Brevitas is a PyTorch library for neural network quantization, with support for both post-training quantization (PTQ) and quantization-aware training (QAT).

Please note that Brevitas is a research project and not an official Xilinx product.

If you like this project please consider ⭐ this repo, as it is the simplest and best way to support it.

Requirements

Python >= 3.9, <3.13
Pytorch >= 1.12, <= 2.8 (more recent versions would be untested).
Windows, Linux or macOS.
GPU training-time acceleration (Optional but recommended).

Installation

You can install the latest release from PyPI:

pip install brevitas

Getting Started

Brevitas currently offers quantized implementations of the most common PyTorch layers used in DNN under brevitas.nn, such as QuantConv1d, QuantConv2d, QuantConvTranspose1d, QuantConvTranspose2d, QuantMultiheadAttention, QuantRNN, QuantLSTM etc., for adoption within PTQ and/or QAT. For each one of these layers, quantization of different tensors (inputs, weights, bias, outputs, etc) can be individually tuned according to a wide range of quantization settings.

As a reference for PTQ, Brevitas provides an example user flow for ImageNet classification models under brevitas_examples.imagenet_classification.ptq that quantizes an input torchvision model using PTQ under different quantization configurations (e.g. bit-width, granularity of scale, etc).

For more info, checkout our documentation.

Cite as

If you adopt Brevitas in your work, please cite it as:

@software{brevitas,
  author       = {Franco, Giuseppe and Pappalardo, Alessandro and Fraser, Nicholas J},
  title        = {Xilinx/brevitas},
  year         = {2025},
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.3333552},
  url          = {https://doi.org/10.5281/zenodo.3333552}
}

History

2025/08/28 - Release version 0.12.1, see the release notes.
2025/05/09 - Release version 0.12.0, see the release notes.
2024/10/10 - Release version 0.11.0, see the release notes.
2024/07/23 - Minor release version 0.10.3, see the release notes.
2024/02/19 - Minor release version 0.10.2, see the release notes.
2024/02/15 - Minor release version 0.10.1, see the release notes.
2023/12/08 - Release version 0.10.0, see the release notes.
2023/04/28 - Minor release version 0.9.1, see the release notes.
2023/04/21 - Release version 0.9.0, see the release notes.
2023/01/10 - Release version 0.8.0, see the release notes.
2021/12/13 - Release version 0.7.1, fix a bunch of issues. Added TVMCon 2021 tutorial notebook.
2021/11/03 - Re-release version 0.7.0 (build 1) on PyPI to fix a packaging issue.
2021/10/29 - Release version 0.7.0, see the release notes.
2021/06/04 - Release version 0.6.0, see the release notes.
2021/05/24 - Release version 0.5.1, fix a bunch of minor issues. See release notes.
2021/05/06 - Release version 0.5.0, see the release notes.
2021/03/15 - Release version 0.4.0, add support for __torch_function__ to QuantTensor.
2021/03/04 - Release version 0.3.1, fix bug w/ act initialization from statistics w/ IGNORE_MISSING_KEYS=1.
2021/03/01 - Release version 0.3.0, implements enum and shape solvers within extended dependency injectors. This allows declarative quantizers to be self-contained.
2021/02/04 - Release version 0.2.1, includes various bugfixes of QuantTensor w/ zero-point.
2021/01/30 - First release version 0.2.0 on PyPI.

Project details

Release history Release notifications | RSS feed

This version

0.12.1

Aug 28, 2025

0.12.0

May 9, 2025

0.11.0

Oct 10, 2024

0.10.3

Jul 23, 2024

0.10.2

Feb 19, 2024

0.10.1

Feb 15, 2024

0.10.0

Dec 8, 2023

0.9.1

Apr 28, 2023

0.9.0

Apr 21, 2023

0.8.0

Jan 10, 2023

0.7.1

Dec 14, 2021

0.7.0

Oct 29, 2021

0.6.0

Jun 4, 2021

0.5.1

May 24, 2021

0.5.0

May 6, 2021

0.4.0

Mar 15, 2021

0.3.1

Mar 4, 2021

0.2.1

Feb 4, 2021

0.2.0

Jan 30, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

brevitas-0.12.1.tar.gz (753.9 kB view details)

Uploaded Aug 28, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

brevitas-0.12.1-py3-none-any.whl (901.4 kB view details)

Uploaded Aug 28, 2025 Python 3

File details

Details for the file brevitas-0.12.1.tar.gz.

File metadata

Download URL: brevitas-0.12.1.tar.gz
Upload date: Aug 28, 2025
Size: 753.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.10.13

File hashes

Hashes for brevitas-0.12.1.tar.gz
Algorithm	Hash digest
SHA256	`c79fc11b91303448d240430d81c9027bfa7a7fb8a0b89d9f11f0e2c93e31a1f2`
MD5	`ab1b00358651182e694e7e1a844da107`
BLAKE2b-256	`7086027d3815241e8628061c203aad8b42849d28c8e54dfd83cbd597527a8f3e`

See more details on using hashes here.

File details

Details for the file brevitas-0.12.1-py3-none-any.whl.

File metadata

Download URL: brevitas-0.12.1-py3-none-any.whl
Upload date: Aug 28, 2025
Size: 901.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.10.13

File hashes

Hashes for brevitas-0.12.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1d6f370351a30a1dbdf951c899b8d28cd9f4c1c9d707e2eeac3f306a6dc04885`
MD5	`34e21ce525a96e3b1f0057273d850726`
BLAKE2b-256	`d40f8701e963112f152a45b468e2c6a6c12fbc608bde156c553eec33a1e61846`

See more details on using hashes here.

brevitas 0.12.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Brevitas

Requirements

Installation

Getting Started

Cite as

History

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes