ONNX Runtime generate() API

Project description

Run small and large language models (SLMs/LLMs) and multimodal models on-device and in the cloud with ONNX Runtime.

Model architectures supported so far (and more coming soon): Gemma, Llama, Mistral, Phi (language and vision).

For more details, see the documentation at https://onnxruntime.ai/docs/genai and the repository at https://github.com/microsoft/onnxruntime-genai

Download files

Download the file for your platform.

Source Distributions

No source distribution files are available for this release.

Built Distributions

onnxruntime_genai_cuda-0.5.0-cp312-cp312-win_amd64.whl (14.3 MB)
CPython 3.12, Windows x86-64

onnxruntime_genai_cuda-0.5.0-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (15.0 MB)
CPython 3.12, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64

onnxruntime_genai_cuda-0.5.0-cp311-cp311-win_amd64.whl (14.3 MB)
CPython 3.11, Windows x86-64

onnxruntime_genai_cuda-0.5.0-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (15.0 MB)
CPython 3.11, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64

onnxruntime_genai_cuda-0.5.0-cp310-cp310-win_amd64.whl (14.3 MB)
CPython 3.10, Windows x86-64

onnxruntime_genai_cuda-0.5.0-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (15.0 MB)
CPython 3.10, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
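
The wheel names above follow the standard wheel naming scheme (distribution, version, Python tag, ABI tag, platform tag), which is how an installer decides which file fits a given interpreter. As a rough illustration, the sketch below splits a filename into those parts and picks a match from a list. The helper names parse_wheel_filename and pick_wheel are hypothetical and are not part of pip or of this package; real installers apply fuller compatibility rules than this prefix match.

```python
from typing import NamedTuple, Optional, Sequence, Tuple


class WheelTags(NamedTuple):
    distribution: str
    version: str
    python_tag: str
    abi_tag: str
    platform_tags: Tuple[str, ...]


def parse_wheel_filename(filename: str) -> WheelTags:
    """Split a wheel filename into its name, version, and tag components.

    Hypothetical helper for illustration only.
    """
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel filename: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    # Five parts normally, six when an optional build tag is present.
    if len(parts) not in (5, 6):
        raise ValueError(f"unexpected wheel filename layout: {filename!r}")
    distribution, version = parts[0], parts[1]
    python_tag, abi_tag, platform = parts[-3:]
    # A dot-separated platform field is a compressed set of platform tags.
    return WheelTags(distribution, version, python_tag, abi_tag,
                     tuple(platform.split(".")))


def pick_wheel(filenames: Sequence[str], python_tag: str,
               platform_prefix: str) -> Optional[str]:
    """Return the first wheel matching the Python tag and platform prefix."""
    for name in filenames:
        tags = parse_wheel_filename(name)
        if tags.python_tag == python_tag and any(
            p.startswith(platform_prefix) for p in tags.platform_tags
        ):
            return name
    return None


# Two of the filenames listed on this page.
WHEELS = [
    "onnxruntime_genai_cuda-0.5.0-cp312-cp312-win_amd64.whl",
    "onnxruntime_genai_cuda-0.5.0-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl",
]
```

For example, pick_wheel(WHEELS, "cp311", "manylinux") selects the CPython 3.11 Linux wheel, and a request for an interpreter that is not listed (such as "cp39") returns None.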

File details

Details for the file onnxruntime_genai_cuda-0.5.0-cp312-cp312-win_amd64.whl.

File hashes

Hashes for onnxruntime_genai_cuda-0.5.0-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 519fb645a1d41eac0f441062f083caa4d56c148316b7a118521dbbc52d01cb36
MD5 b52ce6f5e763498deb1351883fa2576c
BLAKE2b-256 b805a5ba70f69c003be3bc55fa1b25743dad4cebe0284ad9f8e7984eefd50468
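
The digests listed for each file can be used to check a download before installing it. The following is a minimal sketch using only the Python standard library; the function names sha256_of_file and verify_sha256 are illustrative, and the local path in the usage note is an assumption about where the wheel was saved.

```python
import hashlib
import hmac


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read fixed-size chunks so a large wheel is never fully in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_sha256(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with the published hex digest."""
    expected = expected_hex.strip().lower()
    return hmac.compare_digest(sha256_of_file(path), expected)
```

Assuming the Windows CPython 3.12 wheel was saved to the current directory, verify_sha256("onnxruntime_genai_cuda-0.5.0-cp312-cp312-win_amd64.whl", "519fb645a1d41eac0f441062f083caa4d56c148316b7a118521dbbc52d01cb36") should return True for an intact download.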

File details

Details for the file onnxruntime_genai_cuda-0.5.0-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File hashes

Hashes for onnxruntime_genai_cuda-0.5.0-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 5d8ad1ae3554a146832bf5068d6b2e5f24d7db42612a748efb3c4642fbe131f9
MD5 b1e9e67f5e5a19424b3c743932ba24fa
BLAKE2b-256 ff0f87e55a1c753e90a8d8122d046327367526a6a785330a01092b29211f3ab7

File details

Details for the file onnxruntime_genai_cuda-0.5.0-cp311-cp311-win_amd64.whl.

File hashes

Hashes for onnxruntime_genai_cuda-0.5.0-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 e2a02bdeb545c0ce7fa6472ed891da3c894fcccf3deada30399c26632e2df939
MD5 522cf061e85a4c19d497202c2f48bc23
BLAKE2b-256 bb7b8d698f4ac90af5e32996267ed4da31001c0348922e2da6b740a7ac70517b

File details

Details for the file onnxruntime_genai_cuda-0.5.0-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File hashes

Hashes for onnxruntime_genai_cuda-0.5.0-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 c17a71e32847027343fd3007d0ca6eadf9926a3733085834464f7f5b791e3615
MD5 9a15c522d75d6aaf43650b1ffcac07ec
BLAKE2b-256 15b2e6cc74939a27738255c61fa95615d2736bdeda12ac95dd594153685d19ee

File details

Details for the file onnxruntime_genai_cuda-0.5.0-cp310-cp310-win_amd64.whl.

File hashes

Hashes for onnxruntime_genai_cuda-0.5.0-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 6c14db1d0ef8cab8c32d7509c3b61f0bff06d8faab03b3ef2dfbe7766a5cc18c
MD5 9b8912851e9a0f6a80b9a68638d5d9ad
BLAKE2b-256 ff28f84f876248103c7fed9a5f514ad1a3adb91793de4ec3214898d246b936fc

File details

Details for the file onnxruntime_genai_cuda-0.5.0-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File hashes

Hashes for onnxruntime_genai_cuda-0.5.0-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 1f85cb7bf7afc3ab7902f06dadf7dc215198212fadd67b28165a0358235f5e9d
MD5 c7175e59ab3dd80f3faeb452ba75f4a5
BLAKE2b-256 3ff7378a15a1e9d97a82e18d1cef2774ea01b40c6cc476d2ebad9abcc2e0aaee
