Uni-Quant: CUDA-accelerated quantization/dequantization for Keras and XGBoost models
Project description
Uni-Quant
Small library to quantize/dequantize Keras and XGBoost models using PyTorch CUDA kernels.
Notes
- This package compiles CUDA kernels at runtime using
torch.utils.cpp_extension.load_inline. - Installing and using the CUDA compilation requires a compatible CUDA toolkit on the target machine. (Tested with 13.1)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
uni_quant_cuda-0.2.1.tar.gz
(6.3 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file uni_quant_cuda-0.2.1.tar.gz.
File metadata
- Download URL: uni_quant_cuda-0.2.1.tar.gz
- Upload date:
- Size: 6.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e28597fea172b8738af488b36c9927978464e8b82ece896eb4707ec8b0cbcf0f
|
|
| MD5 |
b308232a46329ca502f12c3898dfefe8
|
|
| BLAKE2b-256 |
bdcfeffef7a1b4780b9db6775e7fb3e85326da6a472e29672bf66bfd0f8a86db
|
File details
Details for the file uni_quant_cuda-0.2.1-py3-none-any.whl.
File metadata
- Download URL: uni_quant_cuda-0.2.1-py3-none-any.whl
- Upload date:
- Size: 2.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
902742b9917a7f6b637407b830b8e21b4098d3f2217c0470eba6021eb32d7cfa
|
|
| MD5 |
8ab55edb7c5ac01d25d638dd230b56f0
|
|
| BLAKE2b-256 |
4f5387cdf665f5f59713d9c91dbeb7de264975de51cf5491a69b560704ea3b9f
|