Skip to main content

ONNX Runtime generate() API

Project description

ONNX Runtime generate() API

Run SLMs/LLMs and multi modal models on-device and in the cloud with ONNX Runtime.

Model architectures supported so far (and more coming soon): Gemma, Llama, Mistral, Phi (language and vision).

For more details, see: docs https://onnxruntime.ai/docs/genai and repo: https://github.com/microsoft/onnxruntime-genai

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

onnxruntime_genai_cuda-0.5.1-cp312-cp312-win_amd64.whl (14.4 MB view details)

Uploaded CPython 3.12 Windows x86-64

onnxruntime_genai_cuda-0.5.1-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (15.1 MB view details)

Uploaded CPython 3.12 manylinux: glibc 2.27+ x86-64 manylinux: glibc 2.28+ x86-64

onnxruntime_genai_cuda-0.5.1-cp311-cp311-win_amd64.whl (14.4 MB view details)

Uploaded CPython 3.11 Windows x86-64

onnxruntime_genai_cuda-0.5.1-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (15.1 MB view details)

Uploaded CPython 3.11 manylinux: glibc 2.27+ x86-64 manylinux: glibc 2.28+ x86-64

onnxruntime_genai_cuda-0.5.1-cp310-cp310-win_amd64.whl (14.4 MB view details)

Uploaded CPython 3.10 Windows x86-64

onnxruntime_genai_cuda-0.5.1-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (15.1 MB view details)

Uploaded CPython 3.10 manylinux: glibc 2.27+ x86-64 manylinux: glibc 2.28+ x86-64

File details

Details for the file onnxruntime_genai_cuda-0.5.1-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for onnxruntime_genai_cuda-0.5.1-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 b42d15c9301f7ab9f8775f57c03eebaa243c6c8c866fdc127d2e320f79d35265
MD5 738f4f434039b53a341f07029df90465
BLAKE2b-256 eeda30cb1276d9613c80ec1890b3c93e8ca01dce21a3319edd72dc9f9b83700c

See more details on using hashes here.

File details

Details for the file onnxruntime_genai_cuda-0.5.1-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for onnxruntime_genai_cuda-0.5.1-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 eb521213e12515172a0b742b1089facd03efb55d31b4eba8af2b55ce223d54dc
MD5 089f4c654e9182516948c881fb8daae9
BLAKE2b-256 81a38f22b451bd8ce913bb93c1a100402d6dd93dc6f623b16f544904cc95eb31

See more details on using hashes here.

File details

Details for the file onnxruntime_genai_cuda-0.5.1-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for onnxruntime_genai_cuda-0.5.1-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 bb7bfb88e02ca4aa2099ae50852c3de19a120e21fe7ec6a44fb49b41b1dde6f7
MD5 52beb375bfceacb78a5dabefdb24c453
BLAKE2b-256 fdc642a19bc84117984a1a4c8bee0fc61e75ff972de1606a9d70d55041fbbf82

See more details on using hashes here.

File details

Details for the file onnxruntime_genai_cuda-0.5.1-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for onnxruntime_genai_cuda-0.5.1-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 03e108bc03e57e033523e4bc0ddaf6df340084a708bf490f25748c1fc4888c2f
MD5 3588d139d8b8fb3bfca9db37d6c9b4d7
BLAKE2b-256 830382d59cdea80847ddc93bd4c5c13a0280cc9334d0f9b860f7a9806af2bb2f

See more details on using hashes here.

File details

Details for the file onnxruntime_genai_cuda-0.5.1-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for onnxruntime_genai_cuda-0.5.1-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 e1b7a5c8fc7fbd22fb9812731d2ba8f8855482d3e9b0ea21cfc2d099526ddc75
MD5 0a97de66fbae89ec11c9c7809ac2a5d2
BLAKE2b-256 d5795035356ed5dc677438842ff47097ae170997c5e2677926d1bc40e56598bc

See more details on using hashes here.

File details

Details for the file onnxruntime_genai_cuda-0.5.1-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for onnxruntime_genai_cuda-0.5.1-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 ae9974ed09a93c051ff87811b39998eb473ed21b9020d740dc4fd5f3ed2625f9
MD5 a987595e2bb50ce7cc94258aad3a91e9
BLAKE2b-256 7a44c3ef5ec3085184f538c68c63a0ac84bd3f2ce5f097d45588d91e16c2fe89

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page