Skip to main content

NVIDIA CUTLASS Python DSL

Project description

CUTLASS 4.x provides a Python native interfaces for writing high-performance CUDA kernels based on core CUTLASS and CuTe concepts without any performance compromises. This allows for a much smoother learning curve, orders of magnitude faster compile times, native integration with DL frameworks without writing glue code, and much more intuitive meta-programming that does not require deep C++ expertise.

Overall we envision CUTLASS DSLs as a family of domain-specific languages (DSLs). With the release of 4.0, we are releasing the first of these in CuTe DSL. This is a low level programming model that is fully consistent with CuTe C++ abstractions — exposing core concepts such as layouts, tensors, hardware atoms, and full control over the hardware thread and data hierarchy.

CuTe DSL demonstrates optimal matrix multiply and other linear algebra operations targeting the programmable, high-throughput Tensor Cores implemented by NVIDIA's Ampere, Hopper, and Blackwell architectures.

We believe it will become an indispensable tool for students, researchers, and performance engineers alike — flattening the learning curve of GPU programming, rapidly prototyping kernel designs, and bringing optimized solutions into production.

CuTe DSL is currently in public beta and will graduate out of beta by summer 2026.

For more details please visit CUTLASS Documentation or CUTLASS GitHub.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

nvidia_cutlass_dsl_libs_cu13-4.5.0-cp314-cp314-manylinux_2_28_x86_64.whl (78.8 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.0-cp314-cp314-manylinux_2_28_aarch64.whl (79.1 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.0-cp313-cp313-manylinux_2_28_x86_64.whl (78.8 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.0-cp313-cp313-manylinux_2_28_aarch64.whl (79.1 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.0-cp312-cp312-manylinux_2_28_x86_64.whl (78.8 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.0-cp312-cp312-manylinux_2_28_aarch64.whl (79.1 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.0-cp311-cp311-manylinux_2_28_x86_64.whl (78.8 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.0-cp311-cp311-manylinux_2_28_aarch64.whl (79.1 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.0-cp310-cp310-manylinux_2_28_x86_64.whl (78.8 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.0-cp310-cp310-manylinux_2_28_aarch64.whl (79.1 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 3d460d03d6ea0a463e262ee07964ccafffde8eeefe058c9615d77da1ddd6d003
MD5 08845599b8fcf4558ada658698f587e0
BLAKE2b-256 d06e6371065485ec91a75176b6b850a7bee31fd57bcb9045a3e07d7a8fd06c85

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0-cp314-cp314-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 652e34d8a78accceab321da0232157e493b113e417306269d700a297a7d62447
MD5 9343be2e064a2314060c4631af1428f5
BLAKE2b-256 4b68f303a5732763b72cee6f485c0fa1d4972b159770eb869f789a873f98f330

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 554b775069d093f308949a65880bf9c9bfd48b1f4b2fd0e0d97aa2f608e6eea1
MD5 dec42ef36f48d709d397daa9117b034c
BLAKE2b-256 084813386e28bf2b724268d7ac95c41d0c718e91118bf89218b6b0471e5fa595

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 f2b821671add2e69a1377e7fd87e6261995db281b2f8e516ddae2fd6b7a6c1d0
MD5 3087daf1210d15f5289ffd6147e08a7f
BLAKE2b-256 e241f8817a6ef5c93ae1009f7e6abe4e205f0014ae51faf8d0aa5817d7bf64f5

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 fc0b5a81ff591db72489134ca206ae886f0cce43f20863010a7f30fcfe484a7b
MD5 4d1b55e316dc904bc8ec28b399ea707c
BLAKE2b-256 cdf22c976759dff8836e41a8ff3716db4fc72b01969ea6ee062ae22877008030

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 a799eb9d65ba03095444907b8bf617b5a8bd7d03d1b95cec9637c558af6d0b60
MD5 2baf41ef2a6cfc55bf70d446ce88fd06
BLAKE2b-256 a1bd36767220a6ecd4284496708976017ef85053563971a13b95aa8b84d316d7

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 35726dddb6f6e63c767caf7533d583b4462e2c6b90df7fa67738d58b2eede742
MD5 4813e235ffc8694bfb4531a7aeb9a853
BLAKE2b-256 5c0f5ad5f2012fd00d683b2131d270300f45e386e55064a69047ee2989f6d480

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 dadddab5bfe11973f8a0dfa66645a42ccb587287abb0766992e0e69da621dc82
MD5 4b44264b3b336c89f712e2f2cdf1f821
BLAKE2b-256 a9e0d184ab4b764cdab3d2c09aa7726178c6cc294dc908bda80f0bec13c9bebc

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 145cf5563b2d0d347c722d84a3b74541da5a1ec5b46821b54acc202693ce56ae
MD5 09c3176ed4c54a9574cf2716b34254a1
BLAKE2b-256 046cc7c32165d00dfa01d81a49626552192c3f3540d6cd5b8faf6507e161544a

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 b95cd78ca5f3440cd488ea03c30d394ca07e491e5bd6bb274f8f26d4f4713439
MD5 862859f6e9d9ccb7ab0616bb7c8a0521
BLAKE2b-256 50326566f883444d4342f9b196ab4243c0ffa985d62bfdbc08f2d6f54b193a42

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page