A high-performance deep learning inference library
Project description
NVIDIA TensorRT is an SDK that facilitates high-performance machine learning inference. It is designed to work in a complementary fashion with training frameworks such as TensorFlow, PyTorch, and MXNet. It focuses specifically on running an already-trained network quickly and efficiently on NVIDIA hardware.
IMPORTANT: This is a special release of TensorRT designed to work only with TensorRT-LLM. Please refrain from upgrading to this version if you are not using TensorRT-LLM.
To install, please execute the following:
pip install tensorrt --extra-index-url https://pypi.nvidia.com
Or add the index URL to the (space-separated) PIP_EXTRA_INDEX_URL environment variable:
export PIP_EXTRA_INDEX_URL='https://pypi.nvidia.com'
pip install tensorrt
When the extra index URL does not contain https://pypi.nvidia.com, a nested pip install will run with the proper extra index URL hard-coded.
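After installation, a quick import check confirms that the runtime is visible to Python. This is a minimal sketch; it assumes the wheel exposes the standard tensorrt module (the lean runtime variant may instead be importable as tensorrt_lean):

# Minimal post-install check: import the runtime and print its version.
import tensorrt
print(tensorrt.__version__)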
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file tensorrt_lean-cu12-10.4.0.tar.gz.
File metadata
- Download URL: tensorrt_lean-cu12-10.4.0.tar.gz
- Upload date:
- Size: 18.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.12
File hashes
Algorithm | Hash digest
---|---
SHA256 | 1b5d298293bb1b9cedee44f0cafe35cc2871a85de43b6882e8bb5640f49fc8df
MD5 | 7db740ad848f60c7bafead20e2d7e836
BLAKE2b-256 | 0d55d125c19ad274876fa15254f748a101b15b89dc1788dff9bd6c8c629bee42
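The SHA256 digest above can be used to verify a downloaded copy of the source distribution before installing it. The snippet below is a small sketch; the local file path is an assumption and should be adjusted to wherever the archive was saved:

# Verify the downloaded sdist against the SHA256 digest listed above.
import hashlib

expected = "1b5d298293bb1b9cedee44f0cafe35cc2871a85de43b6882e8bb5640f49fc8df"
with open("tensorrt_lean-cu12-10.4.0.tar.gz", "rb") as f:  # assumed local path
    digest = hashlib.sha256(f.read()).hexdigest()
print("OK" if digest == expected else "hash mismatch")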