Skip to main content

Uni-Quant: CUDA-accelerated quantization/dequantization for Keras and XGBoost models

Project description

Uni-Quant

Small library to quantize/dequantize Keras and XGBoost models using PyTorch CUDA kernels.

Notes

  • This package compiles CUDA kernels at runtime using torch.utils.cpp_extension.load_inline.
  • Installing and using the CUDA compilation requires a compatible PyTorch build and CUDA toolkit on the target machine.

Dependencies are listed in requirements.txt and synchronized with pyproject.toml.

Quick publish test

Build a source/wheel and check locally:

python -m pip install --upgrade build twine
python -m build
python -m twine check dist/*

Upload (example):

python -m twine upload dist/*

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

uni_quant_cuda-0.2.0.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

uni_quant_cuda-0.2.0-py3-none-any.whl (2.4 kB view details)

Uploaded Python 3

File details

Details for the file uni_quant_cuda-0.2.0.tar.gz.

File metadata

  • Download URL: uni_quant_cuda-0.2.0.tar.gz
  • Upload date:
  • Size: 6.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for uni_quant_cuda-0.2.0.tar.gz
Algorithm Hash digest
SHA256 b1bdc48be18cafa94fc112ad04aba790658f455f9a658ee9253d0856b77600c1
MD5 51b9da8bd9ab8b97e6256e5a930b922f
BLAKE2b-256 132061a2a121a17ba362eb487d8988332df1218c3bcaa804e26658f04bfc026e

See more details on using hashes here.

File details

Details for the file uni_quant_cuda-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: uni_quant_cuda-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 2.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for uni_quant_cuda-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a47c322a79324e8b402c1525e5a734bc80be46bc9d4bed494bfa23666e4c5499
MD5 a58b7679f7c96556e8f3d269ec0aba17
BLAKE2b-256 52857024e63ab6dc4884f111b904296790d943992263ec76c3b563572aeac436

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page