ONNX Runtime plugin package for the NVIDIA TensorRT RTX execution provider (EP ABI)
Project description
onnxruntime-ep-nv-tensorrt-rtx
NVIDIA TensorRT RTX Execution Provider plugin for ONNX Runtime.
Enables hardware-accelerated inference on NVIDIA RTX GPUs (Ampere / RTX 30xx and later) via the ORT Plugin EP ABI.
About NVIDIA TensorRT for RTX
NVIDIA® TensorRT™ for RTX (TensorRT-RTX) is an inference optimization library dedicated for deploying AI inference on NVIDIA GeForce RTX GPUs. It is a great choice for developers building applications that must run on Windows or Linux PCs, laptops, or workstations.
This package bundles the TensorRT-RTX runtime libraries alongside the ONNX Runtime EP plugin so that no separate TensorRT-RTX installation is required.
For more information about TensorRT-RTX, visit https://developer.nvidia.com/tensorrt-rtx.
Online documentation: https://docs.nvidia.com/deeplearning/tensorrt-rtx/latest/index.html
License agreement: https://docs.nvidia.com/deeplearning/tensorrt-rtx/latest/reference/sla.html
References
- Release Notes: https://docs.nvidia.com/deeplearning/tensorrt-rtx/latest/getting-started/release-notes.html
- Support Matrix: https://docs.nvidia.com/deeplearning/tensorrt-rtx/latest/getting-started/support-matrix.html
- Installation Guide: https://docs.nvidia.com/deeplearning/tensorrt-rtx/latest/installing-tensorrt-rtx/installation-overview.html
- C++ API: https://docs.nvidia.com/deeplearning/tensorrt-rtx/latest/_static/cpp-api/index.html
- Python API: https://docs.nvidia.com/deeplearning/tensorrt-rtx/latest/_static/python-api/index.html
Requirements
- NVIDIA RTX GPU (Ampere or later)
- NVIDIA GPU driver with CUDA 12 support
pip install onnxruntime>=1.24
Installation
pip install onnxruntime>=1.24
pip install onnxruntime-ep-nv-tensorrt-rtx
Usage
import onnxruntime as ort
import onnxruntime_ep_nv_tensorrt_rtx as trt_ep
# Register the EP plugin
ort.register_execution_provider_library(trt_ep.get_ep_name(), trt_ep.get_library_path())
# List available devices
devices = [d for d in ort.get_ep_devices() if d.ep_name == trt_ep.get_ep_name()]
print(f"TensorRT RTX devices: {len(devices)}")
# Create session with EP
so = ort.SessionOptions()
so.add_provider_for_devices(devices, {})
sess = ort.InferenceSession("model.onnx", sess_options=so)
License
Apache 2.0. See LICENSE.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file onnxruntime_ep_nv_tensorrt_rtx_cu12-0.3.0.tar.gz.
File metadata
- Download URL: onnxruntime_ep_nv_tensorrt_rtx_cu12-0.3.0.tar.gz
- Upload date:
- Size: 1.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8fbdfc91438bd0629e1544a406b41e6f1466fb4239d47e538e514ef711293409
|
|
| MD5 |
995811e544c5a586af385a1b3fdbe552
|
|
| BLAKE2b-256 |
66034a4c05e16b23fc37bc683d1836c8422968bfdee5cc05ec8dfe92268d53ed
|
File details
Details for the file onnxruntime_ep_nv_tensorrt_rtx_cu12-0.3.0-py3-none-win_amd64.whl.
File metadata
- Download URL: onnxruntime_ep_nv_tensorrt_rtx_cu12-0.3.0-py3-none-win_amd64.whl
- Upload date:
- Size: 103.3 MB
- Tags: Python 3, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0b22c2bde776664c1796e0e29d0608b8acc30593544bbbe01892f4a02edc2166
|
|
| MD5 |
31153173fd694969a125496e4124af4d
|
|
| BLAKE2b-256 |
efb6edf3ba1551046194f04e65a1ebe7a7d068e26768c2e180d3f7c46972eed6
|