Uni-Quant: CUDA-accelerated quantization/dequantization for TensorFlow models
Project description
Uni-Quant
Small library to quantize/dequantize TensorFlow models using PyTorch CUDA kernels.
Notes
- This package compiles CUDA kernels at runtime using
torch.utils.cpp_extension.load_inline. - Installing and using the CUDA compilation requires a compatible CUDA toolkit on the target machine. (Tested with >=13)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
uni_quant_cuda-0.2.2.tar.gz
(6.3 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file uni_quant_cuda-0.2.2.tar.gz.
File metadata
- Download URL: uni_quant_cuda-0.2.2.tar.gz
- Upload date:
- Size: 6.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d7bc2b5bf3c4855fe66764863c5213a9f45e28ba8196534a63514e62c48da131
|
|
| MD5 |
6b272693a5fe8836a0451e1187c79852
|
|
| BLAKE2b-256 |
c08b676524acf1b9653673a74620ffd977a40306668ec56b809d6d3a3b2fe0c4
|
File details
Details for the file uni_quant_cuda-0.2.2-py3-none-any.whl.
File metadata
- Download URL: uni_quant_cuda-0.2.2-py3-none-any.whl
- Upload date:
- Size: 2.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9a33be1b5c0cf78859396665b73c0941c5658d0196726c8b4f00bee8f86177a7
|
|
| MD5 |
bb1d764c000601ef20a6392d2015e515
|
|
| BLAKE2b-256 |
e2a062e0f7cb8114d01a59966078010b0d588291c3efc86d680d895d4974071b
|