kernel-tuner

An easy-to-use CUDA/OpenCL kernel tuner in Python

Project description

Create optimized GPU applications in any mainstream GPU programming language (CUDA, HIP, OpenCL, OpenACC, OpenMP).

What Kernel Tuner does:

- Works as an external tool to benchmark and optimize GPU kernels in isolation
- Can be used directly on existing kernel code without extensive changes
- Can be used with applications in any host programming language
- Blazing fast search space construction
- More than 20 optimization algorithms to speed up tuning
- Energy measurements and optimizations (power capping, clock frequency tuning)
- ... and much more, for example caching, output verification, tuning host and device code, and user-defined metrics; see the full documentation.
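
Conceptually, the search space mentioned in the list above is the set of all combinations of tunable parameter values that satisfy any restrictions. Below is a minimal sketch in plain Python. It does not reflect Kernel Tuner's internals, and `build_search_space`, the `block_size_y` parameter, and the 1024-thread limit are illustrative assumptions.

```python
from itertools import product


def build_search_space(tune_params, restriction=None):
    """Enumerate all parameter combinations, optionally filtered by a predicate."""
    names = list(tune_params)
    configs = []
    for values in product(*(tune_params[name] for name in names)):
        config = dict(zip(names, values))
        # Keep the configuration only if it passes the (optional) restriction
        if restriction is None or restriction(config):
            configs.append(config)
    return configs


tune_params = {
    "block_size_x": [32, 64, 128, 256, 512],
    "block_size_y": [1, 2, 4, 8],  # hypothetical second parameter
}

# Hypothetical restriction: at most 1024 threads per thread block
space = build_search_space(
    tune_params, lambda c: c["block_size_x"] * c["block_size_y"] <= 1024
)
print(len(space), "of", len(build_search_space(tune_params)), "configurations remain")
```

Brute-force enumeration like this grows multiplicatively with every added parameter, which is why fast search space construction and search algorithms that avoid visiting every configuration matter in practice.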

Installation

- First, make sure you have your CUDA, OpenCL, or HIP compiler installed.
- Then run one of: pip install kernel_tuner[cuda], pip install kernel_tuner[opencl], or pip install kernel_tuner[hip]
- Or install all of them at once: pip install kernel_tuner[cuda,opencl,hip]

More information on installation, also for other languages, in the installation guide.
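
After installing, a quick way to see which pieces are importable is to probe for the Python modules without importing them. This is a minimal sketch using only the standard library. The module names `pycuda`, `pyopencl`, and `hip` are assumptions about what the extras install, so check the installation guide for the names that apply to your setup.

```python
from importlib.util import find_spec


def available_modules(names):
    """Map each top-level module name to True if it can be found, else False."""
    return {name: find_spec(name) is not None for name in names}


# Assumed module names; adjust to the backends you actually installed
status = available_modules(["kernel_tuner", "pycuda", "pyopencl", "hip"])
for name, found in status.items():
    print(f"{name}: {'found' if found else 'not found'}")
```

A module being found only shows that the Python package is present. It does not confirm that the matching compiler and driver work.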

Example

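import numpy as np
from kernel_tuner import tune_kernel

# CUDA kernel source. block_size_x is not defined here: Kernel Tuner inserts
# it as a compile-time constant for each configuration it benchmarks.
kernel_string = """
__global__ void vector_add(float *c, float *a, float *b, int n) {
    int i = blockIdx.x * block_size_x + threadIdx.x;
    if (i<n) {
        c[i] = a[i] + b[i];
    }
}
"""

# Problem size, as a 32-bit integer to match the kernel's `int n`
n = np.int32(10000000)

a = np.random.randn(n).astype(np.float32)
b = np.random.randn(n).astype(np.float32)
c = np.zeros_like(a)

# Kernel arguments, in the order of the kernel's parameter list
args = [c, a, b, n]

# Tunable parameters and the values to try for each
tune_params = {"block_size_x": [32, 64, 128, 256, 512]}

# Benchmark the kernel for every value of block_size_x
tune_kernel("vector_add", kernel_string, n, args, tune_params)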

More examples here.
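
To make concrete what each configuration in the example changes: for a one-dimensional problem of size n, the number of thread blocks needed is n divided by block_size_x, rounded up, and the `if (i<n)` guard keeps the surplus threads in the last block from writing out of bounds. The sketch below emulates that indexing on the CPU in plain Python. It illustrates the arithmetic only, not how Kernel Tuner launches kernels, and `grid_size` and `emulate_vector_add` are hypothetical helpers.

```python
def grid_size(n, block_size_x):
    """Number of thread blocks needed to cover n elements (ceiling division)."""
    return (n + block_size_x - 1) // block_size_x


def emulate_vector_add(a, b, block_size_x):
    """CPU emulation of the vector_add kernel's indexing for one block size."""
    n = len(a)
    c = [0.0] * n
    for block_idx in range(grid_size(n, block_size_x)):
        for thread_idx in range(block_size_x):
            i = block_idx * block_size_x + thread_idx
            if i < n:  # same bounds check as in the kernel
                c[i] = a[i] + b[i]
    return c


n = 1000  # deliberately not a multiple of any block size below
a = [float(k) for k in range(n)]
b = [2.0 * k for k in range(n)]
expected = [x + y for x, y in zip(a, b)]

for block_size_x in [32, 64, 128, 256, 512]:
    blocks = grid_size(n, block_size_x)
    idle = blocks * block_size_x - n  # threads that fail the bounds check
    assert emulate_vector_add(a, b, block_size_x) == expected
    print(f"block_size_x={block_size_x}: {blocks} blocks, {idle} idle threads")
```

Every block size produces the same result. What differs on a real GPU is how well each launch configuration uses the hardware, and that is what tuning measures.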

Resources

Full documentation
Guides:
Features & Use cases:
Kernel Tuner Tutorial slides [PDF], hands-on:
- Vector add example [.ipynb]
- Tuning thread block dimensions [.ipynb]
- Search space restrictions & output verification [.ipynb]
- Visualization & search space optimization [.ipynb]
Energy Efficient GPU Computing tutorial slides [PDF], hands-on:
- Kernel Tuner for GPU energy measurements [.ipynb]
- Code optimizations for energy [.ipynb]
- Mixed precision and accuracy tuning [.ipynb]
- Optimizing for time vs. for energy [.ipynb]

Kernel Tuner ecosystem

C++ magic to integrate auto-tuned kernels into C++ applications

C++ data types for mixed-precision CUDA kernel programming

Monitor, analyze, and visualize auto-tuning runs

Communication & Contribution

GitHub Issues: Bug reports, install issues, feature requests, work in progress
GitHub Discussion group: General questions, Q&A, thoughts

Contributions are welcome! For feature requests, bug reports, or usage problems, please feel free to create an issue. For more extensive contributions, check the contribution guide.

Citation

If you use Kernel Tuner in research or research software, please cite the most relevant among the publications on Kernel Tuner. To refer to the project as a whole, please cite:

@article{kerneltuner,
  author  = {Ben van Werkhoven},
  title   = {Kernel Tuner: A search-optimizing GPU code auto-tuner},
  journal = {Future Generation Computer Systems},
  year = {2019},
  volume  = {90},
  pages = {347-358},
  url = {https://www.sciencedirect.com/science/article/pii/S0167739X18313359},
  doi = {https://doi.org/10.1016/j.future.2018.08.004}
}

Release history

Version	Release date
1.3.1 (this version)	Jan 21, 2026
1.3.0	Sep 3, 2025
1.2.0	Jul 17, 2025
1.1.3	May 21, 2025
1.1.2	Apr 8, 2025
1.1.0	Apr 4, 2025
1.0	Apr 4, 2024
1.0.0b6 (pre-release)	Nov 8, 2023
1.0.0b5 (pre-release)	Nov 1, 2023
1.0.0b4 (pre-release)	Oct 22, 2023
1.0.0b3 (pre-release)	Oct 12, 2023
1.0.0b2 (pre-release)	Oct 11, 2023
1.0.0b1 (pre-release)	Oct 11, 2023
0.4.5	Jun 1, 2023
0.4.4	Mar 9, 2023
0.4.3	Oct 19, 2022
0.4.2	May 23, 2022
0.4.1	Sep 10, 2021
0.4.0	Apr 9, 2021
0.3.2	Nov 4, 2020
0.3.1	Jun 18, 2020
0.3.0	Feb 14, 2020
0.2.0	Nov 16, 2018
0.1.9	Apr 18, 2018
0.1.8	Nov 23, 2017
0.1.7	Nov 10, 2017
0.1.6	Aug 24, 2017
0.1.5	Jul 21, 2017
0.1.4	Jun 14, 2017
0.1.3	Apr 6, 2017
0.1.2	Mar 29, 2017
0.1.1	Feb 10, 2017
0.1.0	Nov 2, 2016
0.1.0rc0 (pre-release)	Nov 8, 2016
0.1.0b0 (pre-release)	Nov 2, 2016

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kernel_tuner-1.3.1.tar.gz (191.0 kB)

Uploaded Jan 21, 2026 Source

Built Distribution

kernel_tuner-1.3.1-py3-none-any.whl (180.4 kB)

Uploaded Jan 21, 2026 Python 3

File details

Details for the file kernel_tuner-1.3.1.tar.gz.

File metadata

Download URL: kernel_tuner-1.3.1.tar.gz
Upload date: Jan 21, 2026
Size: 191.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kernel_tuner-1.3.1.tar.gz
Algorithm	Hash digest
SHA256	`b53c9dd1d89d75d0ee1a1e85692bb4e8aa8b80cc52aba20c8800f85f0dfe58a9`
MD5	`eb9c9f4800b6cd505cb69671b46d4d7d`
BLAKE2b-256	`3ceefb93d9d5f777dbdb024e605a47d8759fe636448add9e928cba6564bce1ef`

See more details on using hashes here.
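
A downloaded file can be checked against the SHA256 digest published above with Python's standard `hashlib` module. This is a minimal sketch. The helper names are illustrative, and the usage line assumes the source distribution was saved in the current directory.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# SHA256 digest listed above for kernel_tuner-1.3.1.tar.gz
EXPECTED_SDIST_SHA256 = (
    "b53c9dd1d89d75d0ee1a1e85692bb4e8aa8b80cc52aba20c8800f85f0dfe58a9"
)
```

For example, `matches_published_hash("kernel_tuner-1.3.1.tar.gz", EXPECTED_SDIST_SHA256)` returns True only if the file's contents hash to the published digest.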

Provenance

The following attestation bundles were made for kernel_tuner-1.3.1.tar.gz:

Publisher: publish-python-package.yml on KernelTuner/kernel_tuner

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: kernel_tuner-1.3.1.tar.gz
- Subject digest: b53c9dd1d89d75d0ee1a1e85692bb4e8aa8b80cc52aba20c8800f85f0dfe58a9
- Sigstore transparency entry: 843306144
- Sigstore integration time: Jan 21, 2026
Source repository:
- Permalink: KernelTuner/kernel_tuner@6f9f54f55731f7d73d47075ae04ea4178fc435fe
- Branch / Tag: refs/tags/1.3.1
- Owner: https://github.com/KernelTuner
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-python-package.yml@6f9f54f55731f7d73d47075ae04ea4178fc435fe
- Trigger Event: release

File details

Details for the file kernel_tuner-1.3.1-py3-none-any.whl.

File metadata

Download URL: kernel_tuner-1.3.1-py3-none-any.whl
Upload date: Jan 21, 2026
Size: 180.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kernel_tuner-1.3.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7db130fca35f89c3a33abdf5b2cd8f955e99bb5ec2d80a06c0faaeaa0cc5eb72`
MD5	`cbe03656bfb03105642384d4e280f7bf`
BLAKE2b-256	`7fad8b06dd8f59ffd55c3c2e48036ac2622df04d5c48525699bc06f545d3b3e5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for kernel_tuner-1.3.1-py3-none-any.whl:

Publisher: publish-python-package.yml on KernelTuner/kernel_tuner

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: kernel_tuner-1.3.1-py3-none-any.whl
- Subject digest: 7db130fca35f89c3a33abdf5b2cd8f955e99bb5ec2d80a06c0faaeaa0cc5eb72
- Sigstore transparency entry: 843306184
- Sigstore integration time: Jan 21, 2026
Source repository:
- Permalink: KernelTuner/kernel_tuner@6f9f54f55731f7d73d47075ae04ea4178fc435fe
- Branch / Tag: refs/tags/1.3.1
- Owner: https://github.com/KernelTuner
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-python-package.yml@6f9f54f55731f7d73d47075ae04ea4178fc435fe
- Trigger Event: release
