Skip to main content

NVIDIA CUTLASS Python DSL

Project description

CUTLASS 4.x provides a Python native interfaces for writing high-performance CUDA kernels based on core CUTLASS and CuTe concepts without any performance compromises. This allows for a much smoother learning curve, orders of magnitude faster compile times, native integration with DL frameworks without writing glue code, and much more intuitive meta-programming that does not require deep C++ expertise.

Overall we envision CUTLASS DSLs as a family of domain-specific languages (DSLs). With the release of 4.0, we are releasing the first of these in CuTe DSL. This is a low level programming model that is fully consistent with CuTe C++ abstractions — exposing core concepts such as layouts, tensors, hardware atoms, and full control over the hardware thread and data hierarchy.

CuTe DSL demonstrates optimal matrix multiply and other linear algebra operations targeting the programmable, high-throughput Tensor Cores implemented by NVIDIA's Ampere, Hopper, and Blackwell architectures.

We believe it will become an indispensable tool for students, researchers, and performance engineers alike — flattening the learning curve of GPU programming, rapidly prototyping kernel designs, and bringing optimized solutions into production.

CuTe DSL is currently in public beta and will graduate out of beta by summer 2026.

For more details please visit CUTLASS Documentation or CUTLASS GitHub.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314t-manylinux_2_28_x86_64.whl (78.8 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314t-manylinux_2_28_aarch64.whl (79.1 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314-manylinux_2_28_x86_64.whl (78.8 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314-manylinux_2_28_aarch64.whl (79.1 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp313-cp313-manylinux_2_28_x86_64.whl (78.8 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp313-cp313-manylinux_2_28_aarch64.whl (79.1 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp312-cp312-manylinux_2_28_x86_64.whl (78.8 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp312-cp312-manylinux_2_28_aarch64.whl (79.1 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp311-cp311-manylinux_2_28_x86_64.whl (78.8 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp311-cp311-manylinux_2_28_aarch64.whl (79.1 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp310-cp310-manylinux_2_28_x86_64.whl (78.8 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.2-cp310-cp310-manylinux_2_28_aarch64.whl (79.1 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314t-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314t-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 1d255f4a308eb0d228d2466a415a8489b8337db1d322f5d8428e60139b41a317
MD5 f85b124e12311d7c3ff60e7abdbd74b7
BLAKE2b-256 36808ced4c7e1ead8d1e3ac6c823db9e387dbcfd41232e11655d5bc94e950c75

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314t-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314t-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 c4d3ea9080c5a92f8f4a69451ef7036f43bfc3d7f8a426dd70258f0e237c05fb
MD5 c3099f7ce3ea622ed0f1b1836c6701e2
BLAKE2b-256 897c2bf50f2649f06a97a935919f71d2d0e40d7648364319b834548ed664d6d3

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 888edad4fe1e9b683fddcbc6969437527ccd0eb8740e60dce8f29f6a3a22c825
MD5 7d5a9ba8a9aff65191dc43a303c12f01
BLAKE2b-256 aa83d335575e1d37f6c436b1e3203ded6f352678937b9f30b900b643f9df0f9d

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 aabd41c980083db94950a4010c2c1ca156d4ab56701605739a3fba388ac9736b
MD5 a6822c3154c3dfac4c139b08a1723dc2
BLAKE2b-256 d165a8e16a9647acef4f43ea2e046cd7eeb3e5779e89089c3939a5d25fe47f57

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 c7a5ce1c01616fc4c3ac492e011c543a79c3dde86aaf20a8af55e9d40ef2b2e6
MD5 51bfe22f922953c47e905f9c4e95f2e6
BLAKE2b-256 5c9fb7928ff505e577c1021c07b206ce32d285aae793763d524023c1800b6dc9

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 64e994554af4da59f75754b9df1a2b1bdfdb96b58c2457802da13d586fb58cde
MD5 8cdbffc9022ff8d1f20bdb9c0abb47ca
BLAKE2b-256 9857bc7248c02c3e4ee2ed03e194ceda9861a46fa23f0da5140bd8060a086b1e

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 80f0cd402e0f1d1571e5aed33bfa17dbc9cb90cc5b1352f0f806b4788558e80e
MD5 0b299b4a471298e8461e7178355dbcb3
BLAKE2b-256 0360443e559139da15ab544761ac14f4206dffb981af48cc9856cd5b5b7cf0e7

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 3032405dff28892340f96b467e744a822079cae454dce534fc17b77e85190e42
MD5 ab9604ffcca8e0553f82c306cd25b931
BLAKE2b-256 21e5aeb570713a7bd6c2cb08102c2ebe6de234ef1bbc276d1af4643266cd71a8

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 df61430d6110eea872acb39257042814bf02dcbb1f8d55ea0c5681bb7ce5836a
MD5 44e8b8eb8694e6785812860f6d76ff9f
BLAKE2b-256 3c3d2153608b1f8f594ccfc67daa45a1d0ff600b9e552b1e5662644e6e3ebec3

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 f4a7b72147c2efdc7963c64475eac4ed67eb1dd5fdf5b0300daf79319fe9a38a
MD5 d78f0fa5881ecb9516d43b66167efdb6
BLAKE2b-256 34244ad875105f8b834ff0a6dce484c8ac124c292368338b087b993b70288385

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 587494d0ab615b805fac86b43a3c1b855182f455681c9cc4ddb1b8973f44a7cc
MD5 134ba57b4de9afa4a30d6e1a002cf248
BLAKE2b-256 3f1e12d1773571cd5f3cb2ff2a7570badfe9ccc1361e9f6684b17f7ff092c188

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.2-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.2-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 696c65ca03995713b6719bc59b7df06f8ec1d263d7eb6ac77aa011201e142bd5
MD5 0cf69235c9283fa87b63d9df8de0d49f
BLAKE2b-256 12961519dc5fb936b2e8d519710a1134ecfd162dfbdb014f15ea4534f52ca221

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page