A tool for measuring and quantizing models in PyTorch.

Project description

The Quantization Toolkit

The Quantization Toolkit (HQT) provides model measurement and quantization capabilities in PyTorch. For full details see: https://docs.habana.ai/en/latest/PyTorch/Inference_on_PyTorch/Inference_Using_FP8.html

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files are available for this release. See the tutorial on generating distribution archives.

Built Distribution

File details

Details for the file habana_quantization_toolkit-1.15.3.5-py3-none-any.whl.

File hashes

Hashes for habana_quantization_toolkit-1.15.3.5-py3-none-any.whl

Algorithm    Hash digest
SHA256       4bc5153a1c3935201654070750b14f992b59f6364d222878f1d7b5ffaeed94c7
MD5          f0e7407ef81799006b7034bf13522158
BLAKE2b-256  4680af1acfc4fe298a39a364c5ab5eb3ab0daa7147497aa6abaa793216d2f5a6
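
The digests above can be used to confirm that a downloaded wheel was not corrupted or altered. The following is a minimal sketch using only the Python standard library; the helper names (sha256_of_file, matches_expected) are illustrative and not part of the toolkit, and the wheel is assumed to sit in the current directory under its published file name.

```python
import hashlib
import hmac

# SHA256 digest published for habana_quantization_toolkit-1.15.3.5-py3-none-any.whl
EXPECTED_SHA256 = "4bc5153a1c3935201654070750b14f992b59f6364d222878f1d7b5ffaeed94c7"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes (end of file).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Compare the file's digest with the expected one, ignoring hex letter case."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.lower())


if __name__ == "__main__":
    # Assumed location: the wheel was downloaded into the current directory.
    wheel = "habana_quantization_toolkit-1.15.3.5-py3-none-any.whl"
    print("hash OK" if matches_expected(wheel, EXPECTED_SHA256) else "hash MISMATCH")
```

pip can perform the same check at install time through its hash-checking mode, where a requirements file lists each package with a --hash=sha256:... entry and installation is run with --require-hashes.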

