
Generate PyTorch Custom Operators from Numba-CUDA kernels


Pytorch-Numba Extension JIT


Writing custom CUDA operators in C or C++ can make certain operations significantly more efficient, but it requires setting up a full C++ project and involves a great deal of boilerplate. Writing CUDA kernels with numba-cuda is significantly easier, but it incurs overhead on every call and still requires some boilerplate to integrate with the tracing systems that underlie torch.compile.

However, many of the CUDA kernels used in deep learning share the same basic structure: they read from a set of input arrays and write to a set of output arrays. As such, most of the boilerplate and binding code for C++ extensions can be generated automatically.

This project aims to do exactly that: pnex.jit takes a Python function written as a Numba CUDA kernel, along with some type annotations, and compiles it into a user-friendly, high-performance PyTorch C++ extension.

Additionally, if a convenient wrapper for PyTorch Custom Operators is all that is desired, this library also allows skipping the C++ compilation phase and only generating the boilerplate for a Custom Operator definition.

For an example of this package in use, see my other package, pytorch-nd-semiconv.

This package is listed on PyPI; it can be installed with:

pip install pytorch-numba-extension-jit

