CUDNN FrontEnd python library
Project description
cuDNN FrontEnd(FE) API
Introduction
The cuDNN FrontEnd(FE) API is a C++ header-only library that wraps the cuDNN C backend API. Both the FE and backend APIs are entry points to the same set of functionality that is commonly referred to as the "graph API".
While there are two entry points to the graph API (i.e. backend and frontend), it is expected that most users will use the FE API. Reasons being:
- FE API is less verbose without loss of control. All functionality accessible through the backend API is also accessible through the FE API.
- FE API adds functionality on top of the backend API, like errata filters and autotuning.
Also, for those using backend API, FE API source and samples can serve as reference implementation.
In FE v1.0 API, users can describe multiple operations that form subgraph through a persistent cudnn_frontend::graph::Graph
object. Unlike the FE v0.x API, users dont need to worry about specifying shapes and sizes of the intermediate virtual tensors. FE v1.0 API extends the groundwork of earlier versions and introduces a new set of APIs to further simplify the workflow. For detailed information of FE v1.0 API, see README.FE.v1.0.md.
Additionally, FE v1.0 API provides python bindings to all API through pybind11. It is recommended that new users of cuDNN start with the frontend v1.0 API. See samples/cpp
and samples/python
for more details on its usage.
Usage
In order to include the entire library, include the cudnn_frontend header file include/cudnn_frontend.h
into your compilation unit.
Build:
Dependencies
With the release of v1.0, we are bumping up the minimum supported cudnn version to 8.5.0
cuda can be downloaded from the nvidia dev-zone
cudnn can be installed from - [nvidia dev-zone] (https://developer.nvidia.com/cudnn) - [pypi wheels] (https://pypi.org/project/nvidia-cudnn-cu12/)
Minimum python version needed 3.6
The python binding compilation requires development package which can be installed by running apt-get install python-dev
.
To run the python samples, additionally, you will need the following python packages:
- pytest
- torch
- jupyter
Python API
Source installation:
Install FE python API by running:
pip install git+https://github.com/NVIDIA/cudnn-frontend.git
Above command picks cuda and cudnn from default system paths.
To provide a custom CUDA installation path, use environment variable: CUDAToolkit_ROOT
.
To provide a custom CUDNN installation path, use environment variable: CUDNN_PATH
.
pip wheel installation
Download the pip wheel corresponding to your python installation.
pip install nvidia_cudnn_frontend-1.2.0-*.whl
Checking the installation
To test whether installation is successful, run:
pytest tests/python_fe
NOTE: Only v1.0 API is exposed via python bindings.
C++ API
C++ API is header only library.
The root CMakeLists.txt can be used as reference to include the cudnn_frontend in your project's build system.
Building samples
The following compilation steps are only required for building the samples and/or python bindings.
Provide CUDA installation path according to: https://cmake.org/cmake/help/latest/module/FindCUDAToolkit.html
Provide CUDNN installation path using CUDNN_PATH env variable or cmake parameter.
CUDNN_PATH has the cudnn installation:
- Headers are in CUDNN_PATH/include.
- Libraries are in CUDNN_PATH/lib or CUDNN_PATH/lib64 or CUDNN_PATH/lib/x64.
For a in-source build,
mkdir build
cd build
cmake -DCUDNN_PATH=/path/to/cudnn -DCUDAToolkit_ROOT=/path/to/cuda ../
cmake --build . -j16
bin/samples
To skip building samples, use -DCUDNN_FRONTEND_BUILD_SAMPLES=OFF
.
To skip building python bindings, use -DCUDNN_FRONTEND_BUILD_PYTHON_BINDINGS=OFF
.
In case, you have a stale cmake cache and want to update the cudnn/cuda paths, please delete the cmake cache (or build directory and redo the above steps).
Debugging
For initial debugging, we recommend turning on the cudnn FE logging and checking for warnings and errors. cuDNN Frontend API logging records execution flow through cuDNN frontend API. This functionality is disabled by default, and can be enabled through methods described in this section.
Method 1: Using Environment Variables:
Environment variables | CUDNN_FRONTEND_LOG_INFO=0 | CUDNN_FRONTEND_LOG_INFO=1 |
---|---|---|
CUDNN_FRONTEND_LOG_FILE not set | No Logging | No Logging |
CUDNN_FRONTEND_LOG_FILE set to stdout or stderr | No Logging | Logging to cout or cerr |
CUDNN_FRONTEND_LOG_FILE set to filename.txt | No Logging | Logging to the filename |
Method 2: Using API calls:
Calling cudnn_frontend::isLoggingEnabled() = true|false
has same effect of setting the environment variable.
Calling cudnn_frontend::getStream() = stream_name
can be used to assign the output stream directly.
For further debugging, please turn on the cudnn backend logs described here https://docs.nvidia.com/deeplearning/cudnn/developer-guide/index.html#api-logging
Documentation
- See README.FE.1.0.md for v1.0 API documentation.
- See README.FE.0.x.md for v0.x API documentation.
Contributing:
Please refer to our contribution guide
Feedback
Support, resources, and information about cuDNN can be found online at https://developer.nvidia.com/cudnn.
Also, bugs and rfes can be reported in the issues section.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
File details
Details for the file nvidia_cudnn_frontend-1.3.0-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
.
File metadata
- Download URL: nvidia_cudnn_frontend-1.3.0-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 1.1 MB
- Tags: CPython 3.12, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d69e3defef47797ed810fbc89efd80d2de6d050f57262853655c96a38b38e20d |
|
MD5 | 6a17aa89e0aed92cd94b43305db58478 |
|
BLAKE2b-256 | 915faced52696d0334536fccb4173ccc4a3590d59e45e78476ff2606434b940b |
File details
Details for the file nvidia_cudnn_frontend-1.3.0-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
.
File metadata
- Download URL: nvidia_cudnn_frontend-1.3.0-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 1.1 MB
- Tags: CPython 3.11, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 06a14d36e1358e4a189a400f3f46bb0e8863601900242c8f85c206810fb9b101 |
|
MD5 | 3dc7f4c7636a36ecb2968c06f22da073 |
|
BLAKE2b-256 | 800f4675336e8f84c614768501f102346d48e47bbfc97e8a3fe0d178b6c3b953 |
File details
Details for the file nvidia_cudnn_frontend-1.3.0-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
.
File metadata
- Download URL: nvidia_cudnn_frontend-1.3.0-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 1.1 MB
- Tags: CPython 3.10, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | eb59c1d6d6d211581f4c9bd8a8d6f02fb8f62c855fffb88ab363f47cb97fe72a |
|
MD5 | d3a56dac7a7de123c6ab4197645273b7 |
|
BLAKE2b-256 | 68992f6c9bd1f35469ccb0f89721259822bee8a23ddb80ec35a9d2546921d6cc |
File details
Details for the file nvidia_cudnn_frontend-1.3.0-cp39-cp39-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
.
File metadata
- Download URL: nvidia_cudnn_frontend-1.3.0-cp39-cp39-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 1.1 MB
- Tags: CPython 3.9, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f4d9c347cd997daa426e4c908aea454cea81b6f5c9edcfad618f9fff632758bc |
|
MD5 | 2116939a37ae1a88f9905fa40d05eb96 |
|
BLAKE2b-256 | ec52abb7fdbfb59394d89bc487ebc49c057096d6162dde8ba47c080fc6895b97 |
File details
Details for the file nvidia_cudnn_frontend-1.3.0-cp38-cp38-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
.
File metadata
- Download URL: nvidia_cudnn_frontend-1.3.0-cp38-cp38-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 1.1 MB
- Tags: CPython 3.8, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 19726c157395c7426065339e6400f86393a9af382876a39a7ee94f616c71297c |
|
MD5 | d8caa2d04952f4d9bf67ec785bee1b57 |
|
BLAKE2b-256 | f23960727e0717dc70cb4b4555e7a61ea1456f7ce1ff02f98848fd746c703ed2 |