k-bit optimizers and matrix multiplication routines.

Maintainers

timdettmers Titus-von-Koeller

Project description

bitsandbytes

bitsandbytes enables accessible large language models via k-bit quantization for PyTorch. We provide three main features for dramatically reducing memory consumption for inference and training:

8-bit optimizers use block-wise quantization to maintain 32-bit performance at a small fraction of the memory cost.
LLM.int8() or 8-bit quantization enables large language model inference with only half the required memory and without any performance degradation. This method uses vector-wise quantization to quantize most features to 8 bits and handles outliers separately with 16-bit matrix multiplication.
QLoRA or 4-bit quantization enables large language model training with several memory-saving techniques that don't compromise performance. This method quantizes a model to 4 bits and inserts a small set of trainable low-rank adaptation (LoRA) weights to allow training.
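
To make the block-wise idea concrete, here is a minimal pure-Python sketch. It assumes a plain linear int8 grid with one absolute-maximum scale per block; the function names and the tiny block size are illustrative only, and the library's actual kernels and quantization maps may differ.

```python
def quantize_blockwise(values, block_size=4):
    """Quantize floats to int8 codes with one absmax scale per block."""
    codes, scales = [], []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        # Each block gets its own scale, so an outlier only affects its own block.
        absmax = max(abs(v) for v in block) or 1.0  # avoid dividing by zero
        scales.append(absmax)
        codes.extend(round(v / absmax * 127) for v in block)
    return codes, scales


def dequantize_blockwise(codes, scales, block_size=4):
    """Map int8 codes back to approximate floats using the per-block scales."""
    return [
        code / 127 * scales[i // block_size]
        for i, code in enumerate(codes)
    ]


# Hypothetical optimizer state: small values followed by a block with outliers.
state = [0.01, -0.02, 0.03, 0.015, 5.0, -4.0, 2.5, 1.0]
codes, scales = quantize_blockwise(state)
restored = dequantize_blockwise(codes, scales)
max_error = max(abs(a - b) for a, b in zip(state, restored))

# With a single scale for the whole tensor, the small values collapse toward zero.
global_codes, _ = quantize_blockwise(state, block_size=len(state))
```

With per-block scales the small values in the first block keep their resolution, while the single-scale variant rounds the first entry to code 0.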

The library includes quantization primitives for 8-bit and 4-bit operations through bitsandbytes.nn.Linear8bitLt and bitsandbytes.nn.Linear4bit, and 8-bit optimizers through the bitsandbytes.optim module.
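
As a rough sketch of how these entry points might be used together, the helper below builds one layer of each kind plus an 8-bit Adam optimizer. The helper name, the layer sizes, and the chosen argument values are assumptions for illustration; check the project documentation for the recommended settings on your hardware.

```python
def build_quantized_modules(in_features=64, out_features=64):
    """Create an 8-bit layer, a 4-bit layer, and an 8-bit optimizer (illustrative)."""
    # Imported lazily so the sketch can be read or imported without the packages installed.
    import torch
    import bitsandbytes as bnb

    # LLM.int8() layer: has_fp16_weights=False stores int8 weights for inference,
    # and threshold sets the outlier cutoff handled by the 16-bit path.
    int8_layer = bnb.nn.Linear8bitLt(
        in_features, out_features, bias=True,
        has_fp16_weights=False, threshold=6.0,
    )

    # 4-bit layer using the NF4 data type, as in QLoRA.
    nf4_layer = bnb.nn.Linear4bit(
        in_features, out_features, bias=True,
        compute_dtype=torch.bfloat16, quant_type="nf4",
    )

    # 8-bit Adam applied to an ordinary full-precision module.
    model = torch.nn.Linear(in_features, out_features)
    optimizer = bnb.optim.Adam8bit(model.parameters(), lr=1e-3)
    return int8_layer, nf4_layer, model, optimizer
```

The quantized layers typically pack their weights when the module is moved to the accelerator, for example with `.to("cuda")` on a supported NVIDIA GPU.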

System Requirements

bitsandbytes has the following minimum requirements for all platforms:

Python 3.9+
PyTorch 2.2+
- Note: While we aim to provide wide backwards compatibility, we recommend using the latest version of PyTorch for the best experience.
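
A small check of these minimums can be sketched as follows. The helper names are hypothetical, and the version parsing is deliberately simplified: it keeps only the leading numeric components and ignores local or pre-release suffixes.

```python
import sys


def parse_release(version):
    """Return the leading numeric components of a version string as a tuple."""
    parts = []
    # Drop a local version suffix such as "+cu121" before splitting.
    for piece in version.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at a non-numeric component such as "dev20241112"
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(version, minimum):
    """Compare numerically, so that "2.10" counts as newer than "2.2"."""
    return parse_release(version)[:len(minimum)] >= tuple(minimum)


python_ok = sys.version_info >= (3, 9)
# In a real environment, torch.__version__ would be passed in here.
torch_ok = meets_minimum("2.2.0+cu121", (2, 2))
```

Comparing tuples of integers rather than raw strings avoids the common mistake of treating "2.10" as older than "2.2".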

Accelerator support:

Platform	Accelerator	Hardware Requirements	Support Status
🐧 Linux, glibc >= 2.24
x86-64	◻️ CPU	AVX2	〰️ Partial Support
	🟩 NVIDIA GPU `cuda`	SM50+ minimum, SM75+ recommended	✅ Full Support
	🟥 AMD GPU `cuda`	CDNA: gfx90a, gfx942; RDNA: gfx1100, gfx1200	🚧 In Development
	🟦 Intel GPU `xpu`	Data Center GPU Max Series, Arc A-Series (Alchemist), Arc B-Series (Battlemage)	🚧 In Development
	🟪 Intel Gaudi `hpu`	Gaudi1, Gaudi2, Gaudi3	🚧 In Development
aarch64	◻️ CPU		〰️ Partial Support
	🟩 NVIDIA GPU `cuda`	SM75, SM80, SM90, SM100	✅ Full Support
🪟 Windows 11 / Windows Server 2019+
x86-64	◻️ CPU	AVX2	〰️ Partial Support
	🟩 NVIDIA GPU `cuda`	SM50+ minimum, SM75+ recommended	✅ Full Support
	🟦 Intel GPU `xpu`	Arc A-Series (Alchemist), Arc B-Series (Battlemage)	🚧 In Development
🍎 macOS 13.1+
arm64	◻️ CPU	Apple M1+	🛣️ Future Roadmap
	⬜ Metal `mps`	Apple M1+	🛣️ Future Roadmap
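
The NVIDIA rows for x86-64 can be read as a simple rule on the GPU's compute capability. The sketch below encodes that rule; the function name and the returned labels are illustrative, and it does not cover the aarch64 rows, which list specific SM versions instead of a range.

```python
def classify_nvidia_support(major, minor):
    """Classify a CUDA compute capability against the x86-64 rows of the table."""
    capability = (major, minor)
    if capability < (5, 0):
        return "below minimum"
    if capability < (7, 5):
        return "supported (minimum)"
    return "recommended"


# With PyTorch installed, torch.cuda.get_device_capability() returns such a
# (major, minor) tuple for the current device.
examples = {
    "SM37": classify_nvidia_support(3, 7),
    "SM61": classify_nvidia_support(6, 1),
    "SM86": classify_nvidia_support(8, 6),
}
```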

📖 Documentation

❤️ Sponsors

The continued maintenance and development of bitsandbytes is made possible thanks to the generous support of our sponsors. Their contributions help ensure that we can keep improving the project and delivering valuable updates to the community.

License

bitsandbytes is MIT licensed.

We thank Fabio Cannizzo for his work on FastBinarySearch, which we use for CPU quantization.

How to cite us

If you found this library useful, please consider citing our work:

QLoRA

@article{dettmers2023qlora,
  title={{QLoRA}: Efficient Finetuning of Quantized {LLMs}},
  author={Dettmers, Tim and Pagnoni, Artidoro and Holtzman, Ari and Zettlemoyer, Luke},
  journal={arXiv preprint arXiv:2305.14314},
  year={2023}
}

LLM.int8()

@article{dettmers2022llmint8,
  title={LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale},
  author={Dettmers, Tim and Lewis, Mike and Belkada, Younes and Zettlemoyer, Luke},
  journal={arXiv preprint arXiv:2208.07339},
  year={2022}
}

8-bit Optimizers

@article{dettmers2022optimizers,
  title={8-bit Optimizers via Block-wise Quantization},
  author={Dettmers, Tim and Lewis, Mike and Shleifer, Sam and Zettlemoyer, Luke},
  journal={9th International Conference on Learning Representations, ICLR},
  year={2022}
}

Release history

Version	Release date
0.46.1 (this version)	Jul 2, 2025
0.46.0	May 27, 2025
0.45.5	Apr 7, 2025
0.45.4	Mar 25, 2025
0.45.3	Feb 24, 2025
0.45.2	Feb 6, 2025
0.45.1	Jan 23, 2025
0.45.0	Dec 5, 2024
0.44.1	Sep 30, 2024
0.44.0	Sep 24, 2024
0.44.0rc1 (pre-release)	Sep 23, 2024
0.43.3	Jul 30, 2024
0.43.2	Jul 23, 2024
0.43.1	Apr 11, 2024
0.43.0	Mar 8, 2024
0.42.0	Jan 8, 2024
0.41.3.post2	Dec 11, 2023
0.41.3.post1	Dec 11, 2023
0.41.3	Dec 6, 2023
0.41.2.post2	Nov 9, 2023
0.41.2.post1	Nov 8, 2023
0.41.2	Nov 8, 2023
0.41.1	Aug 4, 2023
0.41.0	Jul 22, 2023
0.40.2	Jul 17, 2023
0.40.1.post1	Jul 15, 2023
0.40.1	Jul 14, 2023
0.40.0.post4	Jul 12, 2023
0.40.0.post3	Jul 11, 2023
0.40.0.post2	Jul 11, 2023
0.40.0.post1	Jul 10, 2023
0.40.0	Jul 10, 2023
0.39.1	Jun 20, 2023
0.39.0	May 24, 2023
0.38.1	Apr 12, 2023
0.38.0.post2	Apr 12, 2023
0.38.0.post1	Apr 11, 2023
0.38.0	Apr 11, 2023
0.37.2	Mar 21, 2023
0.37.1	Mar 12, 2023
0.37.0	Feb 2, 2023
0.36.0.post2	Jan 4, 2023
0.36.0.post1	Jan 4, 2023
0.36.0	Jan 4, 2023
0.35.4	Nov 1, 2022
0.35.3	Oct 27, 2022
0.35.2	Oct 27, 2022
0.35.1	Oct 25, 2022
0.35.0	Oct 10, 2022
0.34.0	Sep 20, 2022
0.33.1	Sep 15, 2022
0.33.0	Sep 11, 2022
0.32.3	Sep 8, 2022
0.32.2	Aug 23, 2022
0.32.1	Aug 17, 2022
0.32.0	Aug 17, 2022
0.31.8	Aug 9, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release. See tutorial on generating distribution archives.

Built Distributions

bitsandbytes-0.46.1-py3-none-win_amd64.whl (72.2 MB)
Uploaded Jul 2, 2025. Tags: Python 3, Windows x86-64

bitsandbytes-0.46.1-py3-none-manylinux_2_24_x86_64.whl (72.9 MB)
Uploaded Jul 2, 2025. Tags: Python 3, manylinux: glibc 2.24+ x86-64

bitsandbytes-0.46.1-py3-none-manylinux_2_24_aarch64.whl (30.7 MB)
Uploaded Jul 2, 2025. Tags: Python 3, manylinux: glibc 2.24+ ARM64
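
The wheel filenames above follow the standard wheel naming layout (distribution, version, optional build tag, Python tag, ABI tag, platform tag). As an illustration of how to read them, here is a small parser; the `WheelTags` type and `parse_wheel_filename` function are hypothetical helpers written for this example, not part of any packaging tool.

```python
from typing import NamedTuple


class WheelTags(NamedTuple):
    distribution: str
    version: str
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_filename(filename):
    """Split a wheel filename into its name, version, and compatibility tags."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel filename: {filename!r}")
    parts = filename[:-len(".whl")].split("-")
    if len(parts) == 6:
        # An optional build tag sits between the version and the Python tag.
        del parts[2]
    elif len(parts) != 5:
        raise ValueError(f"unexpected wheel filename layout: {filename!r}")
    return WheelTags(*parts)


linux_wheel = parse_wheel_filename(
    "bitsandbytes-0.46.1-py3-none-manylinux_2_24_x86_64.whl"
)
```

Here `py3-none` indicates that the wheel is not tied to a specific Python 3 minor version or ABI, while the platform tag selects the operating system and CPU architecture.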

File details

Details for the file bitsandbytes-0.46.1-py3-none-win_amd64.whl.

File metadata

Download URL: bitsandbytes-0.46.1-py3-none-win_amd64.whl
Upload date: Jul 2, 2025
Size: 72.2 MB
Tags: Python 3, Windows x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for bitsandbytes-0.46.1-py3-none-win_amd64.whl
Algorithm	Hash digest
SHA256	`9f6f61376bd0e9780c5dc4ddee7d1f52cb10fe8034a1ea588611f4e8b87eb6a7`
MD5	`b52b5e647bb02813acc66e053acada2b`
BLAKE2b-256	`857d06da01fac23a5032632dd7874b31c1d9b7b9af2314b2b07e5f99641950da`
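
A downloaded wheel can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the standard library; the helper names and the local file path in the usage note are assumptions.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Chunked reads avoid loading a ~70 MB wheel into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_sha256):
    """Return True when the file's digest matches the published one."""
    return sha256_of_file(path) == expected_sha256.strip().lower()
```

For example, `verify_download("bitsandbytes-0.46.1-py3-none-win_amd64.whl", "9f6f61376bd0e9780c5dc4ddee7d1f52cb10fe8034a1ea588611f4e8b87eb6a7")` should return True for an intact copy of the Windows wheel saved under that name in the current directory.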

Provenance

The following attestation bundles were made for bitsandbytes-0.46.1-py3-none-win_amd64.whl:

Publisher: python-package.yml on bitsandbytes-foundation/bitsandbytes

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: bitsandbytes-0.46.1-py3-none-win_amd64.whl
- Subject digest: 9f6f61376bd0e9780c5dc4ddee7d1f52cb10fe8034a1ea588611f4e8b87eb6a7
- Sigstore transparency entry: 260281264
- Sigstore integration time: Jul 2, 2025
Source repository:
- Permalink: bitsandbytes-foundation/bitsandbytes@4bca84499ad194d6c37e77dfcf99201b81dc6981
- Branch / Tag: refs/tags/0.46.1
- Owner: https://github.com/bitsandbytes-foundation
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-package.yml@4bca84499ad194d6c37e77dfcf99201b81dc6981
- Trigger Event: push

File details

Details for the file bitsandbytes-0.46.1-py3-none-manylinux_2_24_x86_64.whl.

File metadata

Download URL: bitsandbytes-0.46.1-py3-none-manylinux_2_24_x86_64.whl
Upload date: Jul 2, 2025
Size: 72.9 MB
Tags: Python 3, manylinux: glibc 2.24+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for bitsandbytes-0.46.1-py3-none-manylinux_2_24_x86_64.whl
Algorithm	Hash digest
SHA256	`b0ee4a204fb926d4eae02bc2f5468ae3c11c011cfa849a4c771d4c6b201f57ae`
MD5	`fcb9dedb90c7c1df7b7f12c4b522ad08`
BLAKE2b-256	`6b1ec26dbcb46cebb49fa6b17ff888966e6d8f306078b095a5df801a583549d0`

Provenance

The following attestation bundles were made for bitsandbytes-0.46.1-py3-none-manylinux_2_24_x86_64.whl:

Publisher: python-package.yml on bitsandbytes-foundation/bitsandbytes

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: bitsandbytes-0.46.1-py3-none-manylinux_2_24_x86_64.whl
- Subject digest: b0ee4a204fb926d4eae02bc2f5468ae3c11c011cfa849a4c771d4c6b201f57ae
- Sigstore transparency entry: 260281280
- Sigstore integration time: Jul 2, 2025
Source repository:
- Permalink: bitsandbytes-foundation/bitsandbytes@4bca84499ad194d6c37e77dfcf99201b81dc6981
- Branch / Tag: refs/tags/0.46.1
- Owner: https://github.com/bitsandbytes-foundation
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-package.yml@4bca84499ad194d6c37e77dfcf99201b81dc6981
- Trigger Event: push

File details

Details for the file bitsandbytes-0.46.1-py3-none-manylinux_2_24_aarch64.whl.

File metadata

Download URL: bitsandbytes-0.46.1-py3-none-manylinux_2_24_aarch64.whl
Upload date: Jul 2, 2025
Size: 30.7 MB
Tags: Python 3, manylinux: glibc 2.24+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for bitsandbytes-0.46.1-py3-none-manylinux_2_24_aarch64.whl
Algorithm	Hash digest
SHA256	`21b349f776d04c6c1380405961081de29c84f49640b79d3d199b6d719818da84`
MD5	`9eff89dbfee8318af2513bb24ad78eab`
BLAKE2b-256	`d2b29dadb4f8dca3948e35c1ebfee75ca82353e41468b41ff785430595f8e6f0`

Provenance

The following attestation bundles were made for bitsandbytes-0.46.1-py3-none-manylinux_2_24_aarch64.whl:

Publisher: python-package.yml on bitsandbytes-foundation/bitsandbytes

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: bitsandbytes-0.46.1-py3-none-manylinux_2_24_aarch64.whl
- Subject digest: 21b349f776d04c6c1380405961081de29c84f49640b79d3d199b6d719818da84
- Sigstore transparency entry: 260281298
- Sigstore integration time: Jul 2, 2025
Source repository:
- Permalink: bitsandbytes-foundation/bitsandbytes@4bca84499ad194d6c37e77dfcf99201b81dc6981
- Branch / Tag: refs/tags/0.46.1
- Owner: https://github.com/bitsandbytes-foundation
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-package.yml@4bca84499ad194d6c37e77dfcf99201b81dc6981
- Trigger Event: push
