ONNX Runtime GPU (CUDA) - pre-built wheels for platforms upstream onnxruntime-gpu doesn't ship (e.g. NVIDIA Jetson / aarch64 + CUDA)
onnxruntime-gpu-extended
Pre-built ONNX Runtime CUDA wheels for platforms that upstream
onnxruntime-gpu does not ship — primarily NVIDIA Jetson (aarch64 +
CUDA / L4T) — published under a single package name so installation is the same
everywhere.
There is no source code in this repo. It downloads official CUDA wheels from
PyPI (onnxruntime-gpu) and builds the missing aarch64 Jetson wheels with Nix
(see flake.nix + TARGETS.md). All wheels are renamed to
onnxruntime-gpu-extended and uploaded to PyPI with a unified version, keeping
the internal onnxruntime package intact so import onnxruntime works
unchanged.
Install
pip install onnxruntime-gpu-extended
Works as a drop-in replacement — just import onnxruntime as usual.
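One way to check the drop-in behaviour after installing is to ask ONNX Runtime which execution providers it exposes. The sketch below is illustrative and not part of this project: `pick_providers` is a hypothetical helper, and the import is guarded so the snippet still runs on a machine where no wheel is installed.

```python
def pick_providers(available):
    """Return CUDA first, then CPU, keeping only providers that are available."""
    preferred = ["CUDAExecutionProvider", "CPUExecutionProvider"]
    return [p for p in preferred if p in available]


try:
    import onnxruntime as ort  # same import name as upstream

    available = ort.get_available_providers()
except ImportError:
    # No onnxruntime wheel installed here; treat it as "nothing available".
    available = []

print("available:", available)
print("would use:", pick_providers(available))
```

The resulting list can be passed as the `providers` argument of `onnxruntime.InferenceSession`; if `CUDAExecutionProvider` is missing from it, the installed wheel is not exposing the GPU.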
Status
Placeholder / name reserved (0.0.1). The Jetson wheels build (see CI), but publishing the real wheels is blocked on:
- PyPI 100 MB per-file limit. Upstream onnxruntime-gpu wheels are ~200 MB and the JetPack 6 (CUDA 12.6) wheels are ~150 MB, both over the default limit. A per-project file-size increase request to PyPI is required.
- Version alignment. nixpkgs' onnxruntime (the base for the Jetson build) is currently 1.22.x while upstream onnxruntime-gpu is 1.27.x. The republished set should agree on a version.
- Jetson wheel portability. Nix-built wheels carry /nix/store rpaths and are not yet stock-Jetson portable (repath/bundle against JetPack-provided CUDA is a separate step), and there is no runtime GPU smoke test yet.
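The first two blockers are easy to check mechanically before attempting an upload. The following is a minimal sketch, not the project's tooling: `oversized_wheels` and `same_release_series` are hypothetical helpers, the `dist` directory is an assumed location, and the limit is assumed to be counted in binary megabytes (pass a different `limit_bytes` if that assumption is wrong).

```python
from pathlib import Path

# Assumption: the 100 MB default limit is counted in binary units.
LIMIT_BYTES = 100 * 1024 * 1024


def oversized_wheels(wheel_dir, limit_bytes=LIMIT_BYTES):
    """Return {filename: size_in_bytes} for wheels larger than limit_bytes."""
    return {
        path.name: path.stat().st_size
        for path in sorted(Path(wheel_dir).glob("*.whl"))
        if path.stat().st_size > limit_bytes
    }


def same_release_series(version_a, version_b):
    """True when both versions share major.minor, e.g. 1.22.0 and 1.22.3."""
    return version_a.split(".")[:2] == version_b.split(".")[:2]


if __name__ == "__main__":
    # "dist" is a hypothetical output directory; the versions are sample values
    # in the two series mentioned above.
    print(oversized_wheels("dist"))
    print(same_release_series("1.22.0", "1.27.0"))
```

Neither check addresses the third blocker: rpath portability needs inspection of the shared objects inside the wheel.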
Platform Coverage (target)
| | Linux x86_64 (CUDA) | Windows (CUDA) | Linux aarch64 / Jetson (CUDA) |
|---|---|---|---|
| Python 3.10 | upstream | upstream | nix (JetPack 5/6) |
| Python 3.11 | upstream | upstream | nix (JetPack 5/6) |
| Python 3.12 | upstream | upstream | nix (JetPack 5/6) |
| Python 3.13 | upstream | upstream | nix (best-effort) |
| Python 3.14 | upstream | upstream | nix (best-effort) |
Building & Publishing
- nix build .#jp6-cp312 (on a native aarch64 runner): build a Jetson wheel. See TARGETS.md for the full matrix and .github/workflows/build-jetson-nix.yml.
- run/publish: download upstream wheels, collect local Nix builds, rename all to onnxruntime-gpu-extended, and upload to PyPI. PUBLISH_SKIP_UPLOAD=1 does a dry run (build + rename only).
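To make the rename step concrete, here is a sketch of how a wheel filename could be rewritten for the new distribution name. It is an illustration under stated assumptions, not the contents of run/publish: `renamed_wheel_filename` is a hypothetical helper and the input filename is made up. It covers only the filename; a real rename also has to update the `.dist-info` directory name, `METADATA`, and `RECORD` inside the archive.

```python
import re


def renamed_wheel_filename(filename, new_name, new_version=None):
    """Swap the distribution name (and optionally the version) in a wheel filename."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel: {filename}")
    # Layout: name-version[-build]-python-abi-platform.whl
    parts = filename[: -len(".whl")].split("-")
    if len(parts) not in (5, 6):
        raise ValueError(f"unexpected wheel filename: {filename}")
    # Wheel filenames use the normalized name with runs of -_. turned into "_".
    parts[0] = re.sub(r"[-_.]+", "_", new_name).lower()
    if new_version is not None:
        parts[1] = new_version
    return "-".join(parts) + ".whl"


if __name__ == "__main__":
    # Hypothetical input filename, for illustration only.
    print(
        renamed_wheel_filename(
            "onnxruntime_gpu-1.22.0-cp312-cp312-linux_aarch64.whl",
            "onnxruntime-gpu-extended",
        )
    )
```

Passing `new_version` is how a unified version could be applied across wheels that come from different upstream releases.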
Download files
Source Distribution
File details
Details for the file onnxruntime_gpu_extended-0.0.1.tar.gz.
File metadata
- Download URL: onnxruntime_gpu_extended-0.0.1.tar.gz
- Upload date:
- Size: 3.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 4d30fc14b0c19153cfe8448d3e3023d08d058336f581fa2b8c2e6b5817bfbd6b |
| MD5 | 9a3a65fb38338571f6851ed701161112 |
| BLAKE2b-256 | c3271eddf0958839beab509559ff76e893d69d41556149d8cc73070cc5ade30e |
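A downloaded file can be checked against these digests with Python's standard `hashlib`. The sketch below is an illustration; `file_digests` is a hypothetical helper, and it assumes the file has already been downloaded to a local path.

```python
import hashlib


def file_digests(path, chunk_size=1 << 20):
    """Compute the three digests listed above, reading the file in chunks."""
    hashers = {
        "SHA256": hashlib.sha256(),
        "MD5": hashlib.md5(),
        "BLAKE2b-256": hashlib.blake2b(digest_size=32),  # 32 bytes = 256 bits
    }
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}
```

Comparing `file_digests(path)["SHA256"]` with the SHA256 value in the table is enough to detect a corrupted or substituted download.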