Skip to main content

ONNX Runtime plugin package for the NVIDIA TensorRT RTX execution provider (EP ABI)

Project description

onnxruntime-ep-nv-tensorrt-rtx

NVIDIA TensorRT RTX Execution Provider plugin for ONNX Runtime.

Enables hardware-accelerated inference on NVIDIA RTX GPUs (Ampere / RTX 30xx and later) via the ORT Plugin EP ABI.


About NVIDIA TensorRT for RTX

NVIDIA® TensorRT™ for RTX (TensorRT-RTX) is an inference optimization library dedicated for deploying AI inference on NVIDIA GeForce RTX GPUs. It is a great choice for developers building applications that must run on Windows or Linux PCs, laptops, or workstations.

This package bundles the TensorRT-RTX runtime libraries alongside the ONNX Runtime EP plugin so that no separate TensorRT-RTX installation is required.

For more information about TensorRT-RTX, visit https://developer.nvidia.com/tensorrt-rtx.
Online documentation: https://docs.nvidia.com/deeplearning/tensorrt-rtx/latest/index.html
License agreement: https://docs.nvidia.com/deeplearning/tensorrt-rtx/latest/reference/sla.html


References


Requirements

  • NVIDIA RTX GPU (Ampere or later)
  • NVIDIA GPU driver with CUDA 13 support
  • pip install onnxruntime>=1.24

Installation

pip install onnxruntime>=1.24
pip install onnxruntime-ep-nv-tensorrt-rtx

Usage

import onnxruntime as ort
import onnxruntime_ep_nv_tensorrt_rtx as trt_ep

# Register the EP plugin
ort.register_execution_provider_library(trt_ep.get_ep_name(), trt_ep.get_library_path())

# List available devices
devices = [d for d in ort.get_ep_devices() if d.ep_name == trt_ep.get_ep_name()]
print(f"TensorRT RTX devices: {len(devices)}")

# Create session with EP
so = ort.SessionOptions()
so.add_provider_for_devices(devices, {})
sess = ort.InferenceSession("model.onnx", sess_options=so)

License

Apache 2.0. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

onnxruntime_ep_nv_tensorrt_rtx_cu13-0.3.0.tar.gz (1.8 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

onnxruntime_ep_nv_tensorrt_rtx_cu13-0.3.0-py3-none-win_amd64.whl (97.5 MB view details)

Uploaded Python 3Windows x86-64

onnxruntime_ep_nv_tensorrt_rtx_cu13-0.3.0-py3-none-manylinux_2_34_x86_64.whl (102.7 MB view details)

Uploaded Python 3manylinux: glibc 2.34+ x86-64

File details

Details for the file onnxruntime_ep_nv_tensorrt_rtx_cu13-0.3.0.tar.gz.

File metadata

File hashes

Hashes for onnxruntime_ep_nv_tensorrt_rtx_cu13-0.3.0.tar.gz
Algorithm Hash digest
SHA256 95401832334f794ac22f8c89c120ac803326cf0e5633e07d8c52fdca109ba44a
MD5 ca5c450b1a8dcab685905d503dc6d650
BLAKE2b-256 34a903ca034adc0bab6205881bbb38fd140e06ffd63c80675284b72f1f45bbd4

See more details on using hashes here.

File details

Details for the file onnxruntime_ep_nv_tensorrt_rtx_cu13-0.3.0-py3-none-win_amd64.whl.

File metadata

File hashes

Hashes for onnxruntime_ep_nv_tensorrt_rtx_cu13-0.3.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 a989e04fd7f007f102201595ad08f3d8f3fdb24017720413bfee50383e260fe7
MD5 c6e4c4d3649f567f06d7710df82c7f33
BLAKE2b-256 e91128ebaf6c8543008fa8ca4b46727b46e463c4c625ebbd764b90f6002ef737

See more details on using hashes here.

File details

Details for the file onnxruntime_ep_nv_tensorrt_rtx_cu13-0.3.0-py3-none-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for onnxruntime_ep_nv_tensorrt_rtx_cu13-0.3.0-py3-none-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 bb7a583f5f6d3c1d8019abe170bd1bfb9729d0836ef9eda85365f9c4c92c3d97
MD5 33be5dce9813001b82699c799e55365f
BLAKE2b-256 f0adadee9ec160610b228fb3b5b257937d178978bb54a75cb6f9f391fc24cc4f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page