Skip to main content

Generate PyTorch Custom Operators from Numba-CUDA kernels

Project description

Pytorch-Numba Extension JIT

Documentation | PyPi

Writing custom CUDA operators in C and CPP can make certain operations significantly more efficient, but requires setting up a full C++ project and involves a great deal of boilerplate. Writing CUDA kernels using numba-cuda is significantly easier, but incurs overhead on every call, and still requires some boilerplate to integrate with the tracing systems that underlie torch.compile.

However, many of the CUDA kernels that would be used for deep learning are relatively similar (read from a set of input arrays, write to output arrays). As such, most of the boilerplate and binding code for C++ extensions could be generated automatically.

This project aims to do exactly that: pnex.jit takes a Python function in the form of a Numba CUDA kernel, along with some type annotations, and compiles a user-friendly and highly-performant PyTorch C++ extension.

Additionally, if a convenient wrapper for PyTorch Custom Operators is all that is desired, this library also allows skipping the C++ compilation phase and only generating the boilerplate for a Custom Operator definition.

For an example usage of this package, see my other package pytorch-nd-semiconv

This package is listed on PyPi; it can be installed with

pip install pytorch-numba-extension-jit

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pytorch_numba_extension_jit-0.1.4.tar.gz (56.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pytorch_numba_extension_jit-0.1.4-py3-none-any.whl (18.0 kB view details)

Uploaded Python 3

File details

Details for the file pytorch_numba_extension_jit-0.1.4.tar.gz.

File metadata

File hashes

Hashes for pytorch_numba_extension_jit-0.1.4.tar.gz
Algorithm Hash digest
SHA256 bd6bc6a14dba093075abdf84e51bc436c4d7a327b5a1a38afb3cf437ebac1844
MD5 71af1e750f8e96a5e48f343587124b33
BLAKE2b-256 424764e5d5b74c2c3330343cd94c6f854fde90780a234848d13be84ffa8f48af

See more details on using hashes here.

File details

Details for the file pytorch_numba_extension_jit-0.1.4-py3-none-any.whl.

File metadata

File hashes

Hashes for pytorch_numba_extension_jit-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 77ade53392f477cff05e7e224a6a7ab46ba1a338efa31eb5194c9943c9e0562c
MD5 b04f9c84619fff3cb0bc355d5a1febff
BLAKE2b-256 546d4ebfce01b7c4cfbab14205f801d99cae9641b3d93de973725ddb29720ddb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page