🔮 GPTQ - Accurate Post-Training Quantization for Generative Pre-trained Transformers

This repo is an extended and polished version of the original code for the paper GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers.

🔥 SOTA in LLM post-training quantization (PTQ)

  • An efficient implementation of the GPTQ algorithm (see the sketch after this list)
  • CUDA kernels for products between 2/3/4/8-bit quantized weight matrices and full-precision vectors
  • Bug fixes for older consumer-grade GPUs
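
For intuition, below is a minimal PyTorch sketch of the column-wise GPTQ update described in the paper: quantize one weight column at a time, then spread that column's quantization error over the not-yet-quantized columns via the inverse Hessian. It is illustrative only and does not call this package's internal API; quantize_rtn is a placeholder round-to-nearest quantizer.

import torch

def quantize_rtn(w, bits=4):
    # Placeholder symmetric round-to-nearest quantizer; returns the
    # dequantized (fake-quantized) column.
    qmax = 2 ** (bits - 1) - 1
    scale = w.abs().max().clamp(min=1e-8) / qmax
    return torch.clamp(torch.round(w / scale), -qmax - 1, qmax) * scale

def gptq_quantize(W, H, bits=4, damp=0.01):
    # W: (rows, cols) layer weights; H: (cols, cols) Hessian proxy
    # accumulated from calibration inputs (H ~ 2 X X^T).
    W = W.clone()
    cols = W.shape[1]
    # Dampen the diagonal so the Cholesky factorization stays stable.
    H = H + damp * torch.diag(H).mean() * torch.eye(cols)
    # Upper Cholesky factor of the inverse Hessian.
    Hinv = torch.linalg.cholesky(torch.linalg.inv(H), upper=True)
    Q = torch.zeros_like(W)
    for i in range(cols):
        w = W[:, i]
        q = quantize_rtn(w, bits)
        Q[:, i] = q
        # Propagate this column's quantization error to later columns.
        err = (w - q) / Hinv[i, i]
        W[:, i:] -= torch.outer(err, Hinv[i, i:])
    return Q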

📥 Installation

pip install gptq

🛟 Install PyTorch

gptq requires PyTorch and a CUDA-capable GPU, and installing PyTorch with the matching CUDA backend can be tricky. The following steps are recommended:

  • Run nvcc --version to get the CUDA compiler version. For example, the following output shows CUDA 11.6, which corresponds to computation backend cu116:
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2022 NVIDIA Corporation
Built on Tue_Mar__8_18:18:20_PST_2022
Cuda compilation tools, release 11.6, V11.6.124
Build cuda_11.6.r11.6/compiler.31057947_0
  • Run pip install light-the-torch to install the ltt command.
  • Run ltt install --pytorch-computation-backend=cu116 torch torchvision torchaudio to install the PyTorch suite. Replace cu116 with the backend matching your CUDA version.
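
After installation, a quick sanity check confirms that PyTorch was built against the expected CUDA backend and can see the GPU (a minimal sketch; the version strings in the comments match the cu116 example above):

import torch

print(torch.__version__)          # should end in "+cu116" for this example
print(torch.version.cuda)         # e.g. "11.6"
print(torch.cuda.is_available())  # must be True for the CUDA kernels to work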

TODO

  • GPTQ for CNNs

Algorithm credits go to the IST Austria Distributed Algorithms and Systems Lab.
