Skip to main content

The Qualcomm AI Runtime Development (QAIRT-DEV) package provides a simple python interface for executing ML models on QAIRT runtimes

Project description

Qualcomm AI Runtime Development (QAIRT-DEV) package

[Docs] [Discord Forums]

The Qualcomm AI Runtime (QAIRT) Development Python API provides a simple interface for executing ML models on QAIRT runtimes

It mirrors select capabilities and extends the features of existing QAIRT command line tools, while also providing an intuitive Pythonic API for easy integration into ML workflows

Features

  • Framework Model Conversion

    • Convert ONNX, Pytorch (1.x), TFLite framework models into DLC
    • Includes support for quantization and application of quantization encodings generated from AIMET
  • Compilation

    • Perform AOT compilation on QAIRT backends to generate optimized binaries.
    • Perform compiler optimization using tuning API on HTP
    • Supports compilation on HTP, HTP MCP and AIC backends.
  • Model Execution

    • Execute models on python native targets via Pybind wrappers on QAIRT APIs
    • Execute models on other targets (e.g android) via helper APIs that abstract platform specific details
  • Model Analysis

    • Generate profiling reports on all supported backends
    • Generate Op Trace and Qualcomm Hexagon Analysis Summary (QHAS) reports on HTP
  • Gen AI Model Building and Execution

    • Convert, optimize, and compile Gen AI models for on-device inference using a builder object with a single API call.
    • Perform text generation and obtain metrics via Generative AI Inference Engine
    • Construct Gen AI applications natively in python using simplified python bindings on Genie APIs.

Install

QAIRT Dev is available via pip:

pip install qairt-dev

Getting Started

QAIRT Dev documentation can be found here

Need help?

LICENSE

Qualcomm AI Runtime Development (QAIRT-DEV) package is Proprietary licensed. See LICENSE.pdf for further details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

qairt_dev-0.2.0-py3-none-win_amd64.whl (168.3 kB view details)

Uploaded Python 3Windows x86-64

qairt_dev-0.2.0-py3-none-manylinux2014_x86_64.whl (168.3 kB view details)

Uploaded Python 3

File details

Details for the file qairt_dev-0.2.0-py3-none-win_amd64.whl.

File metadata

  • Download URL: qairt_dev-0.2.0-py3-none-win_amd64.whl
  • Upload date:
  • Size: 168.3 kB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.10.12

File hashes

Hashes for qairt_dev-0.2.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 e11bc7a2d7a321b51fa0ca6bc9e18e7f594b8f5b6907b47b251c949b476cf90b
MD5 20b6990b4b5162c4f06022d5be8deefa
BLAKE2b-256 4f1f5efc4e7785532c2f8a5b220db7c53d7cf9e21368c33071016d1ebbc85b11

See more details on using hashes here.

File details

Details for the file qairt_dev-0.2.0-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for qairt_dev-0.2.0-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 bd81f43d33cf1d99b33cddf6db596c356d84947ab8949404330756b6bc4fcc35
MD5 312ead564d9b15278cac1009f79e25a5
BLAKE2b-256 0dad5033328d9173809f50d98dbcfa308e5282c6d3b4d0c74eafbe0d01a332f2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page