An efficent implementation for the paper: "The Era of 1-bit LLMs"

These details have not been verified by PyPI

Project links

Homepage

Project description

BitMat: Improving Matrix Multiplication with Triton

Introduction

BitMat is a Python package designed to optimize matrix multiplication operations by utilizing custom kernels written in Triton. Our package leverages the principles outlined in the "1bit-LLM Era" paper, specifically utilizing packed int8 data to enhance computational efficiency and performance in deep learning and numerical computing tasks.

Features

Custom Triton Kernels: Utilize highly optimized kernels for matrix multiplication, tailored for performance and efficiency. Packed int8 Operations: Follows the methodologies from the "1bit-LLM Era" to use packed int8 data, reducing memory usage and increasing throughput. Ease of Integration: BitMat is designed to be easily integrated into existing PyTorch workflows, providing a seamless user experience. Performance Boost: Significant performance improvements in matrix multiplication, especially beneficial for large-scale deep learning models and high-dimensional data.

Installation

pip install bitmat-tl

At the moment we only support Linux platforms. Windows installation is possible but is not tested.

Quick Start

High-level API (tranformers-compatible)

from transformers import AutoModelForCausalLM
from bitmat import convert_hf_model

# Initialize your model
model= AutoModelForCausalLM.from_pretrained("some-repo/some-model")
# Convert the model to use BitLinear layers
model = convert_hf_model(model)

Low-level API

import torch
from bitmat import BitLinear

layer = BitLinear(in_features=1024, out_features=512, bias=True, eps=1e-5)
# You can use the layer as a normal torch.nn.Linear layer

Contributing

We welcome contributions from the community, whether it's adding new features, improving documentation, or reporting bugs. Please refer to our contribution guidelines before making a pull request.

License

BitMat is open-sourced under the Apache-2.0 license.

Citation

If you use BitMat in your research, please cite it using the following Bibtex entry:

@article{bitmat2024,
  title={BitMat: Improving Matrix Multiplication with Custom Triton Kernels},
  author={AstraMind AI},
  journal={https://github.com/astramind-ai/BitMat},
  year={2024}
}

Support

For questions, issues, or support regarding BitMat, please open an issue on our GitHub repository.

Acknowledgments

Special thanks to the Triton community and the authors of the "1bit-LLM Era" paper for their groundbreaking work and inspiration.

Also thanks to the developer od BitDelta and UnSloth since part of the code is based on their work.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.3.7

Apr 21, 2024

0.3.6

Apr 16, 2024

0.3.5

Apr 16, 2024

0.3.4

Apr 15, 2024

0.3.3

Apr 15, 2024

0.3.1

Apr 11, 2024

0.3.0

Apr 11, 2024

0.2.9

Apr 4, 2024

0.2.8

Apr 3, 2024

0.2.7

Apr 3, 2024

0.2.6

Apr 3, 2024

0.2.5

Apr 3, 2024

0.2.4

Apr 3, 2024

0.2.3

Apr 3, 2024

0.2.2

Apr 3, 2024

0.2.0

Apr 2, 2024

0.1.8

Apr 1, 2024

This version

0.1.7

Apr 1, 2024

0.1.6

Apr 1, 2024

0.1.5

Mar 31, 2024

0.1.4

Mar 31, 2024

0.1.3

Mar 31, 2024

0.1.2

Mar 31, 2024

0.1.1

Mar 31, 2024

0.1.0

Mar 31, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bitmat-tl-0.1.7.tar.gz (17.2 kB view hashes)

Uploaded Apr 1, 2024 Source

Built Distribution

bitmat_tl-0.1.7-py3-none-any.whl (18.3 kB view hashes)

Uploaded Apr 1, 2024 Python 3

Hashes for bitmat-tl-0.1.7.tar.gz

Hashes for bitmat-tl-0.1.7.tar.gz
Algorithm	Hash digest
SHA256	`c89b9f8d7100acd28862f62a44c58a7f4e55a9b1cd21893007f3cfc06a616ac7`
MD5	`ea1e209929c341e50fe5435b165b375f`
BLAKE2b-256	`f9c8b3ddd67a1d7d6a105ca55c97ae674a2d5fedd3880c69baec7e0551e4f378`

Hashes for bitmat_tl-0.1.7-py3-none-any.whl

Hashes for bitmat_tl-0.1.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b11effc56487e68433aacd9e6582a1a15a76bff0f6b19891ce98984c32b93256`
MD5	`632084fc8d80130c9bd06a5ff22d6d91`
BLAKE2b-256	`5680cb804ca58f5832665d523fff082a265d4ef5f73890e25e12ab932a567698`