NVIDIA CUTLASS Python DSL

Project description

CUTLASS 4.x provides Python-native interfaces for writing high-performance CUDA kernels based on core CUTLASS and CuTe concepts, without any performance compromises. This allows for a much smoother learning curve, orders of magnitude faster compile times, native integration with DL frameworks without writing glue code, and much more intuitive metaprogramming that does not require deep C++ expertise.

Overall, we envision CUTLASS DSLs as a family of domain-specific languages (DSLs). The 4.0 release introduces the first of these, CuTe DSL. It is a low-level programming model that is fully consistent with CuTe C++ abstractions, exposing core concepts such as layouts, tensors, hardware atoms, and full control over the hardware thread and data hierarchy.

CuTe DSL demonstrates optimal matrix multiply and other linear algebra operations targeting the programmable, high-throughput Tensor Cores implemented by NVIDIA's Ampere, Hopper, and Blackwell architectures.

We believe it will become an indispensable tool for students, researchers, and performance engineers alike — flattening the learning curve of GPU programming, rapidly prototyping kernel designs, and bringing optimized solutions into production.

CuTe DSL is currently in public beta and will graduate out of beta by summer 2026.

For more details please visit CUTLASS Documentation or CUTLASS GitHub.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release. See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

nvidia_cutlass_dsl_libs_cu13-4.5.1-cp314-cp314-manylinux_2_28_x86_64.whl (78.8 MB)

CPython 3.14; manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.1-cp314-cp314-manylinux_2_28_aarch64.whl (79.1 MB)

CPython 3.14; manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.1-cp313-cp313-manylinux_2_28_x86_64.whl (78.8 MB)

CPython 3.13; manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.1-cp313-cp313-manylinux_2_28_aarch64.whl (79.1 MB)

CPython 3.13; manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.1-cp312-cp312-manylinux_2_28_x86_64.whl (78.8 MB)

CPython 3.12; manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.1-cp312-cp312-manylinux_2_28_aarch64.whl (79.1 MB)

CPython 3.12; manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.1-cp311-cp311-manylinux_2_28_x86_64.whl (78.8 MB)

CPython 3.11; manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.1-cp311-cp311-manylinux_2_28_aarch64.whl (79.1 MB)

CPython 3.11; manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.1-cp310-cp310-manylinux_2_28_x86_64.whl (78.8 MB)

CPython 3.10; manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.1-cp310-cp310-manylinux_2_28_aarch64.whl (79.1 MB)

CPython 3.10; manylinux: glibc 2.28+ ARM64
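
The file names above follow the standard wheel naming pattern (name, version, Python tag, ABI tag, platform tag). As a rough illustration of how to read them, the sketch below splits a file name into those fields and does a simplified check against a Python version and CPU architecture. The helper names `parse_wheel_filename` and `matches_interpreter` are hypothetical, and the check deliberately ignores the glibc requirement; in practice pip selects a compatible wheel on its own.

```python
import platform
import sys


def parse_wheel_filename(filename):
    """Split a wheel file name into its dash-separated fields."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file: {filename}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build_tag = None
    elif len(parts) == 6:
        # An optional build tag sits between the version and the Python tag.
        name, version, build_tag, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError(f"unexpected wheel file name: {filename}")
    return {
        "name": name,
        "version": version,
        "build_tag": build_tag,
        "python_tag": python_tag,
        "abi_tag": abi_tag,
        "platform_tag": platform_tag,
    }


def matches_interpreter(info, py_version=None, machine=None):
    """Simplified check (assumption: CPython on Linux, glibc version not checked)."""
    py_version = py_version or sys.version_info[:2]
    machine = machine or platform.machine()
    expected_python = f"cp{py_version[0]}{py_version[1]}"
    return (
        info["python_tag"] == expected_python
        and info["platform_tag"].endswith("_" + machine)
    )


info = parse_wheel_filename(
    "nvidia_cutlass_dsl_libs_cu13-4.5.1-cp312-cp312-manylinux_2_28_aarch64.whl"
)
print(info["python_tag"], info["platform_tag"])
print(matches_interpreter(info))  # depends on the interpreter running this
```

Here `cp312` denotes CPython 3.12, and `manylinux_2_28_aarch64` denotes a Linux system with glibc 2.28 or newer on ARM64.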

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.1-cp314-cp314-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.1-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 83c10331be8d9e697a45bd8faf83fe1dc5a2fafe5a2450402404e4a0787c83bd
MD5 71442b3ebf35378916bd429ad8cb68a7
BLAKE2b-256 17e9b38d196c1e4503ea6dbd883b4f8778f0a6ca29b233807d14514049cc6066

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.1-cp314-cp314-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.1-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 9aa7c8a1f42677b40d9a18f3edc32c7546afd0188cbdab07df74cbb20a22c5e7
MD5 c65fba065f85fa400d6dfd5f14d65d26
BLAKE2b-256 d3e3cb1f11a87333d716bf5ce4921cc3b4bee8f235255ef70970620661742783

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.1-cp313-cp313-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.1-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 6e996f117bb3ca448827b2deb7ea0d8126e860593b1b14e47db575b05798670f
MD5 ad42a1c90c4dd10606110b65ccd71f99
BLAKE2b-256 af5c4646abdac23e6a0057ebea2ef1e442ec7f354b886ef28d8432cba88fc4db

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.1-cp313-cp313-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.1-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 478137711acb33f646cffa1ab04372b1911568aac10b01914cb64daafbd3cb48
MD5 f99427dbfe781291d4ab63e0c5f64f4d
BLAKE2b-256 350673482edfdf4697d0408d467e3eaa4489c92b07dd7bd97571c9d8df564428

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.1-cp312-cp312-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.1-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 f3299506a36be561d77e23fe15dcf9cdb9535933e41e29fdba29238deb3cab4f
MD5 50c1bd3cf604cdcf28282ca5949afb63
BLAKE2b-256 e0649d177e852192797bab0ecc7e9455a9c512c2c2b4b472da112cfeece00500

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.1-cp312-cp312-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.1-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 30f210872ff34a3438f02bafc73e49d76f26dd370599cc36cc6cab44176a249e
MD5 6dbfc03e4eee1c767556aa071ba3af74
BLAKE2b-256 424c45deded46e9a173e3137d0ab072a25bc9e906fa8c562d1c346891937c839

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.1-cp311-cp311-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.1-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 494acb5a906c062c7f53a5996309fd038de2841e4d3caffc8d2b8c74b1f07789
MD5 5110b7d36b23c714233b842c2b796a66
BLAKE2b-256 f65c970d5a0706253efa45c50da88b067d6b2a6d9519e5ba5c296a22bb0b11a4

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.1-cp311-cp311-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.1-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 882eb55b5e86e17df68d413e17b031894cae8d8136a5734f3dd854420d3c66c3
MD5 8f55d474ffbccdc83e022f738098799c
BLAKE2b-256 3b3c2e0c67e606a73a2e29785a6b02a607a5e8e4de72a60a0f07f3d8ce08eeb8

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.1-cp310-cp310-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.1-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 22f1e808a742b4d6f0a2937c6ff8c0a1de9acb96bbf91e64a2b93d27ee4bb4e5
MD5 bec205d349e7a364425a94fc516f0809
BLAKE2b-256 85244f6b69cff4645b9d9b947bf926a3848ad2dd23589fb4bd4bc78eaed1929f

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.1-cp310-cp310-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.1-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 9fb821348a38356f0eec7c654a3f7b1db9bb022bdf14faf799b8ffd7f046407b
MD5 38d9dbee850de2bfea92e72c75efa350
BLAKE2b-256 7ceaff1aafabaf39512548a8eee0d5675586f83672ed746b28a67fda2d4d467d
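
The SHA256 digests listed above can be used to check a downloaded wheel before installing it. The following is a minimal sketch using only Python's standard `hashlib` module; the function names `sha256_of_file` and `verify_sha256` are hypothetical, and the path in the usage comment assumes the wheel was saved to the current directory.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_sha256(path, expected_hex):
    """Compare the file's digest with an expected value, ignoring case and whitespace."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Example usage (assumes the wheel is in the current directory):
# ok = verify_sha256(
#     "nvidia_cutlass_dsl_libs_cu13-4.5.1-cp310-cp310-manylinux_2_28_aarch64.whl",
#     "9fb821348a38356f0eec7c654a3f7b1db9bb022bdf14faf799b8ffd7f046407b",
# )
# print("digest matches" if ok else "digest mismatch, do not install")
```

pip can also enforce digests itself when they are pinned in a requirements file and installation is run with `--require-hashes`.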
