An production-ready implementation of 1.58 bit quantization-aware training and inference.

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

bitlinear

This project aims to provide a production-ready implementation of 1.58-bit layers for quantization-aware training and time-, memory-, and energy-efficient inference. It builds on the ideas from The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits.

installation

Installation from PyPI:

pip install bitlinear

Installation from source:

git clone https://github.com/schneiderkamplab/bitlinear
cd bitlinear
pip install .

usage

The usage is best explained by a short example:

from bitlinear import replace_modules
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("HuggingFaceM4/tiny-random-LlamaForCausalLM")
replace_modules(model)

More elaborate examples are available under examples/classifier, including training and evaluating a binary classifer:

pip install -r examples/classifier/requirements.txt
python examples/classifier/train.py
python examples/classifier/eval.py

There is also an MNIST classifier:

pip install -r examples/classifier/requirements.txt
python examples/mnist/train.py

comparison to other work

There are other implementations of bit-linear layers, most of which get at least some of the details wrong at the time of this writing (April 2024).

The focus of this implementation is to develop:

a flexible production-ready drop-in replacemenbt for torch.nn.LinearLayer,
efficient fused kernels for training, and
efficient fused kernels for inference with 2-bit weights and 8-bit activations.

Furthermore, this implementation is meant to serve as a testbed for research on low-bit quantization aware training and inference.

future work

further examples (vision, llm)
efficient fused kernels for GPU/AVX/CPU training
efficient fused kernels for GPU/AVX/CPU inferenc

Project details

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

2.4.6

Feb 16, 2025

2.4.5

Feb 16, 2025

2.4.4

Feb 16, 2025

2.4.3

Feb 16, 2025

2.4.2

Feb 16, 2025

2.4.1

Feb 16, 2025

This version

2.4.0

Dec 10, 2024

2.3.0

Oct 5, 2024

2.2.1

Sep 29, 2024

2.2.0

Aug 14, 2024

2.1.1

Aug 5, 2024

2.1.0

May 22, 2024

2.0.3

May 22, 2024

2.0.2

May 22, 2024

2.0.1

May 22, 2024

2.0.0

May 22, 2024

1.2.5

May 22, 2024

1.2.4

May 17, 2024

1.2.3

May 17, 2024

1.2.2

May 17, 2024

1.2.1

May 17, 2024

1.2.0

May 17, 2024

1.1.0

Apr 25, 2024

1.0.1

Apr 11, 2024

1.0.0

Apr 9, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bitlinear-2.4.0.tar.gz (10.7 kB view details)

Uploaded Dec 10, 2024 Source

File details

Details for the file bitlinear-2.4.0.tar.gz.

File metadata

Download URL: bitlinear-2.4.0.tar.gz
Upload date: Dec 10, 2024
Size: 10.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for bitlinear-2.4.0.tar.gz
Algorithm	Hash digest
SHA256	`b56314dba1db7e9ddea2ccbe878673f7876516a723b4896afb57fd07ef42f751`
MD5	`6bf69cf5e18eae79c0dba7fde61b9a75`
BLAKE2b-256	`7fa4af6361b69c03c150b4af7fcd8654759646de4766a200c08305fe40a004fb`

See more details on using hashes here.

bitlinear 2.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

bitlinear

installation

usage

comparison to other work

future work

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes