Skip to main content

ONNX Runtime plugin package for the NVIDIA TensorRT RTX execution provider (EP ABI)

Project description

onnxruntime-ep-nv-tensorrt-rtx

NVIDIA TensorRT RTX Execution Provider plugin for ONNX Runtime.

Enables hardware-accelerated inference on NVIDIA RTX GPUs (Ampere / RTX 30xx and later) via the ORT Plugin EP ABI.


About NVIDIA TensorRT for RTX

NVIDIA® TensorRT™ for RTX (TensorRT-RTX) is an inference optimization library dedicated for deploying AI inference on NVIDIA GeForce RTX GPUs. It is a great choice for developers building applications that must run on Windows or Linux PCs, laptops, or workstations.

This package bundles the TensorRT-RTX runtime libraries alongside the ONNX Runtime EP plugin so that no separate TensorRT-RTX installation is required.

For more information about TensorRT-RTX, visit https://developer.nvidia.com/tensorrt-rtx.
Online documentation: https://docs.nvidia.com/deeplearning/tensorrt-rtx/latest/index.html
License agreement: https://docs.nvidia.com/deeplearning/tensorrt-rtx/latest/reference/sla.html


References


Requirements

  • NVIDIA RTX GPU (Ampere or later)
  • NVIDIA GPU driver with CUDA 12 support
  • pip install onnxruntime>=1.24

Installation

pip install onnxruntime>=1.24
pip install onnxruntime-ep-nv-tensorrt-rtx

Usage

import onnxruntime as ort
import onnxruntime_ep_nv_tensorrt_rtx as trt_ep

# Register the EP plugin
ort.register_execution_provider_library(trt_ep.get_ep_name(), trt_ep.get_library_path())

# List available devices
devices = [d for d in ort.get_ep_devices() if d.ep_name == trt_ep.get_ep_name()]
print(f"TensorRT RTX devices: {len(devices)}")

# Create session with EP
so = ort.SessionOptions()
so.add_provider_for_devices(devices, {})
sess = ort.InferenceSession("model.onnx", sess_options=so)

License

Apache 2.0. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

onnxruntime_ep_nv_tensorrt_rtx_cu12-0.3.0.tar.gz (1.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

onnxruntime_ep_nv_tensorrt_rtx_cu12-0.3.0-py3-none-win_amd64.whl (103.3 MB view details)

Uploaded Python 3Windows x86-64

File details

Details for the file onnxruntime_ep_nv_tensorrt_rtx_cu12-0.3.0.tar.gz.

File metadata

File hashes

Hashes for onnxruntime_ep_nv_tensorrt_rtx_cu12-0.3.0.tar.gz
Algorithm Hash digest
SHA256 8fbdfc91438bd0629e1544a406b41e6f1466fb4239d47e538e514ef711293409
MD5 995811e544c5a586af385a1b3fdbe552
BLAKE2b-256 66034a4c05e16b23fc37bc683d1836c8422968bfdee5cc05ec8dfe92268d53ed

See more details on using hashes here.

File details

Details for the file onnxruntime_ep_nv_tensorrt_rtx_cu12-0.3.0-py3-none-win_amd64.whl.

File metadata

File hashes

Hashes for onnxruntime_ep_nv_tensorrt_rtx_cu12-0.3.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 0b22c2bde776664c1796e0e29d0608b8acc30593544bbbe01892f4a02edc2166
MD5 31153173fd694969a125496e4124af4d
BLAKE2b-256 efb6edf3ba1551046194f04e65a1ebe7a7d068e26768c2e180d3f7c46972eed6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page