Skip to main content

Generate PyTorch Custom Operators from Numba-CUDA kernels

Project description

Pytorch-Numba Extension JIT

Documentation | PyPi

Writing custom CUDA operators in C and CPP can make certain operations significantly more efficient, but requires setting up a full C++ project and involves a great deal of boilerplate. Writing CUDA kernels using numba-cuda is significantly easier, but incurs overhead on every call, and still requires some boilerplate to integrate with the tracing systems that underlie torch.compile.

However, many of the CUDA kernels that would be used for deep learning are relatively similar (read from a set of input arrays, write to output arrays). As such, most of the boilerplate and binding code for C++ extensions could be generated automatically.

This project aims to do exactly that: pnex.jit takes a Python function in the form of a Numba CUDA kernel, along with some type annotations, and compiles a user-friendly and highly-performant PyTorch C++ extension.

Additionally, if a convenient wrapper for PyTorch Custom Operators is all that is desired, this library also allows skipping the C++ compilation phase and only generating the boilerplate for a Custom Operator definition.

For an example usage of this package, see my other package pytorch-nd-semiconv

This package is listed on PyPi; it can be installed with

pip install pytorch-numba-extension-jit

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pytorch_numba_extension_jit-0.1.6.tar.gz (57.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pytorch_numba_extension_jit-0.1.6-py3-none-any.whl (18.4 kB view details)

Uploaded Python 3

File details

Details for the file pytorch_numba_extension_jit-0.1.6.tar.gz.

File metadata

File hashes

Hashes for pytorch_numba_extension_jit-0.1.6.tar.gz
Algorithm Hash digest
SHA256 8b598b44254ae4cc4ca648b0bd54cf399dc4dabd41db3b76fcca1634cfd61c4b
MD5 b8d3179ad3c064ef7f37a5b6f66d65b0
BLAKE2b-256 f8262613ed769bd6a7208d609c2b8e5af3b82c6e818914ee4e9f5d69d0991584

See more details on using hashes here.

File details

Details for the file pytorch_numba_extension_jit-0.1.6-py3-none-any.whl.

File metadata

File hashes

Hashes for pytorch_numba_extension_jit-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 03381722c8b5a99248ca59c9632d498d73e080dbd36210504915b3a4860a8ebc
MD5 d765fc846bedb7722d5b93c874cca40e
BLAKE2b-256 881353857f5fe208497b1c5b37f9ba030724919cdd2a38faeed4384e8907901b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page