Skip to main content

CUDA Kernel Benchmarking Package

Project description

CUDA Kernel Benchmarking Package

This package provides Python API to CUDA Kernel Benchmarking Library NVBench.

Building

Ensure recent version of CMake

Since nvbench requires a rather new version of CMake (>=3.30.4), either build CMake from sources, or create a conda environment with a recent version of CMake, using

conda create -n build_env --yes  cmake ninja
conda activate build_env

Ensure CUDA compiler

Since building NVBench library requires CUDA compiler, ensure that appropriate environment variables are set. For example, assuming CUDA toolkit is installed system-wide, and assuming Ampere GPU architecture:

export CUDACXX=/usr/local/cuda/bin/nvcc
export CUDAARCHS=86

Build Python project

Now switch to python folder, configure and install NVBench library, and install the package in editable mode:

cd nvbench/python
pip install -e .

Verify that package works

python test/run_1.py

Run examples

# Example benchmarking numba.cuda kernel
python examples/throughput.py
# Example benchmarking kernels authored using cuda.core
python examples/axes.py
# Example benchmarking algorithms from cuda.cccl.parallel
python examples/cccl_parallel_segmented_reduce.py
# Example benchmarking CuPy function
python examples/cupy_extract.py

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

cuda_bench-0.1.0-cp313-cp313-manylinux_2_27_x86_64.whl (2.5 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.27+ x86-64

cuda_bench-0.1.0-cp312-cp312-manylinux_2_27_x86_64.whl (2.5 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.27+ x86-64

cuda_bench-0.1.0-cp311-cp311-manylinux_2_27_x86_64.whl (2.5 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.27+ x86-64

cuda_bench-0.1.0-cp310-cp310-manylinux_2_27_x86_64.whl (2.5 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.27+ x86-64

File details

Details for the file cuda_bench-0.1.0-cp313-cp313-manylinux_2_27_x86_64.whl.

File metadata

File hashes

Hashes for cuda_bench-0.1.0-cp313-cp313-manylinux_2_27_x86_64.whl
Algorithm Hash digest
SHA256 e08853c5f70da2de10501f48acf72c91b3ed1e9522325bd0a04f959da24e3627
MD5 13fbc147115be72a60d7c77b63ab7d84
BLAKE2b-256 3bdab6fa917f3c7ad996f51e95c6a33eed939b22a2ffc775b4ad39aeb7001c79

See more details on using hashes here.

Provenance

The following attestation bundles were made for cuda_bench-0.1.0-cp313-cp313-manylinux_2_27_x86_64.whl:

Publisher: publish-wheels.yml on NVIDIA/nvbench

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file cuda_bench-0.1.0-cp312-cp312-manylinux_2_27_x86_64.whl.

File metadata

File hashes

Hashes for cuda_bench-0.1.0-cp312-cp312-manylinux_2_27_x86_64.whl
Algorithm Hash digest
SHA256 bfc088b642605cd40fa50e7133f5383c7ca2f382df32b3c264a012a70bde2bc6
MD5 48e7c22c4a819b1ccf5fcc947053259b
BLAKE2b-256 6d9fc80bddfb0a14e5bba36d9763b6e76069bf3ce1bc153ff6e971d9f8887346

See more details on using hashes here.

Provenance

The following attestation bundles were made for cuda_bench-0.1.0-cp312-cp312-manylinux_2_27_x86_64.whl:

Publisher: publish-wheels.yml on NVIDIA/nvbench

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file cuda_bench-0.1.0-cp311-cp311-manylinux_2_27_x86_64.whl.

File metadata

File hashes

Hashes for cuda_bench-0.1.0-cp311-cp311-manylinux_2_27_x86_64.whl
Algorithm Hash digest
SHA256 655a214b063454ca5321201d9015b3279e871295e09d89051d37fdf733b960cd
MD5 bb08a69c4da3e15f95a7ce5510b4f428
BLAKE2b-256 c840641135bab990237aee0d5175bbb737f82679afc7eb2ca0eee3652e2f5832

See more details on using hashes here.

Provenance

The following attestation bundles were made for cuda_bench-0.1.0-cp311-cp311-manylinux_2_27_x86_64.whl:

Publisher: publish-wheels.yml on NVIDIA/nvbench

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file cuda_bench-0.1.0-cp310-cp310-manylinux_2_27_x86_64.whl.

File metadata

File hashes

Hashes for cuda_bench-0.1.0-cp310-cp310-manylinux_2_27_x86_64.whl
Algorithm Hash digest
SHA256 fa01f59bd83d6286788affe47e1d3021f9e6d5387229f75f183ab35b4fd8df41
MD5 2d43b426c15052bc35c6e47a265846e9
BLAKE2b-256 c8f1e5fd2610247668c79081231397838fc219177da67d8f9e5c2af7010cf800

See more details on using hashes here.

Provenance

The following attestation bundles were made for cuda_bench-0.1.0-cp310-cp310-manylinux_2_27_x86_64.whl:

Publisher: publish-wheels.yml on NVIDIA/nvbench

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page