NVIDIA CUTLASS Python DSL

Project description

CUTLASS 4.x provides Python-native interfaces for writing high-performance CUDA kernels based on core CUTLASS and CuTe concepts without any performance compromises. This allows for a much smoother learning curve, orders-of-magnitude faster compile times, native integration with DL frameworks without writing glue code, and much more intuitive metaprogramming that does not require deep C++ expertise.

Overall, we envision CUTLASS DSLs as a family of domain-specific languages (DSLs). With the release of 4.0, we are releasing the first of these, CuTe DSL. This is a low-level programming model that is fully consistent with CuTe C++ abstractions, exposing core concepts such as layouts, tensors, hardware atoms, and full control over the hardware thread and data hierarchy.

CuTe DSL demonstrates optimal matrix multiply and other linear algebra operations targeting the programmable, high-throughput Tensor Cores implemented by NVIDIA's Ampere, Hopper, and Blackwell architectures.

We believe it will become an indispensable tool for students, researchers, and performance engineers alike — flattening the learning curve of GPU programming, rapidly prototyping kernel designs, and bringing optimized solutions into production.

CuTe DSL is currently in public beta and will graduate out of beta by summer 2026.

For more details, please visit the CUTLASS documentation or the CUTLASS GitHub repository.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release. See the tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314t-manylinux_2_28_x86_64.whl (90.6 MB)

Uploaded: CPython 3.14t; manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314t-manylinux_2_28_aarch64.whl (89.1 MB)

Uploaded: CPython 3.14t; manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314-manylinux_2_28_x86_64.whl (90.6 MB)

Uploaded: CPython 3.14; manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314-manylinux_2_28_aarch64.whl (89.0 MB)

Uploaded: CPython 3.14; manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp313-cp313-manylinux_2_28_x86_64.whl (90.6 MB)

Uploaded: CPython 3.13; manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp313-cp313-manylinux_2_28_aarch64.whl (89.0 MB)

Uploaded: CPython 3.13; manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp312-cp312-manylinux_2_28_x86_64.whl (90.6 MB)

Uploaded: CPython 3.12; manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp312-cp312-manylinux_2_28_aarch64.whl (89.0 MB)

Uploaded: CPython 3.12; manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp311-cp311-manylinux_2_28_x86_64.whl (90.6 MB)

Uploaded: CPython 3.11; manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp311-cp311-manylinux_2_28_aarch64.whl (89.0 MB)

Uploaded: CPython 3.11; manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp310-cp310-manylinux_2_28_x86_64.whl (90.6 MB)

Uploaded: CPython 3.10; manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp310-cp310-manylinux_2_28_aarch64.whl (89.0 MB)

Uploaded: CPython 3.10; manylinux: glibc 2.28+ ARM64
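
The wheel file names listed here follow the standard wheel naming pattern: distribution, version, an optional build tag, Python tag, ABI tag, and platform tag, separated by dashes. As a rough illustration of how to read them, the sketch below splits a file name into those fields using only the standard library. The `WheelName` type and `parse_wheel_name` function are hypothetical helpers written for this example, not part of CUTLASS or pip, and the parser assumes a well-formed name.

```python
from typing import NamedTuple


class WheelName(NamedTuple):
    """Fields of a wheel file name (the optional build tag is dropped)."""
    distribution: str
    version: str
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_name(filename: str) -> WheelName:
    """Split a wheel file name into its dash-separated fields."""
    suffix = ".whl"
    if not filename.endswith(suffix):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(suffix)].split("-")
    if len(parts) == 6:
        # A build tag sits between the version and the Python tag; skip it.
        parts = parts[:2] + parts[3:]
    if len(parts) != 5:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return WheelName(*parts)


name = parse_wheel_name(
    "nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314t-manylinux_2_28_x86_64.whl"
)
# The "t" suffix in the ABI tag marks the free-threaded CPython 3.14 build.
print(name.python_tag, name.abi_tag, name.platform_tag)
```

In practice pip performs this matching itself and picks the wheel whose tags fit the running interpreter and platform; the sketch is only meant to make the naming scheme easier to read.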

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314t-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314t-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 9226a7b80f2355fce8cc47a7b87c6f7be738b158dddf0a552143bd4d34315ffe
MD5 ef7e34605b964fcb98f9ae5936f737a2
BLAKE2b-256 556a00c5a2a8cfa564b3fd0251ba56c5c0a1ae7b7f08c038e237b2e2c4ceb0bf

See more details on using hashes here.
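
A downloaded wheel can be checked against the SHA256 digest published for it before it is installed. The following is a minimal sketch using only the Python standard library. The helper names `sha256_of_file` and `matches_published_hash` are illustrative, not part of any CUTLASS or PyPI tooling, and the path in the commented usage is assumed to be wherever the file was saved.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest (case-insensitive)."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Example usage (assumes the wheel was downloaded to the current directory):
# ok = matches_published_hash(
#     "nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314t-manylinux_2_28_x86_64.whl",
#     "9226a7b80f2355fce8cc47a7b87c6f7be738b158dddf0a552143bd4d34315ffe",
# )
```

SHA256 is the digest to rely on here; MD5 is listed for completeness but is not collision-resistant. pip can also enforce digests itself through its hash-checking mode, with `--hash=sha256:...` entries in a requirements file.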

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314t-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314t-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 549e25e49ef4e30a2afd66b1b2543b0fc79321419f00c82c458b14850bde578d
MD5 4ec43a67fc90cbfa627043ac83ae0e14
BLAKE2b-256 4b11c8148885a6f7a99c1f8dbf51084475802dbf5bf6f2a9572d18f063de523a

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 5e3dec27807b5d243479e597f2c846b560b89b80d77f252689639e6737313d10
MD5 76c9c362ef816bee917c6b1582662c45
BLAKE2b-256 507d39bc404d85a02e885dace705174ff7739fa2679fd2e64d15b588afe45bc9

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 33fdaf691653af45206c7e4114ad53ebff260b1a837401e549775cd9d0cc268f
MD5 21d776be2d497a0a1442aa0e69a39c31
BLAKE2b-256 004dc1561d0e81600486cce44f88014cfaf9d641240dff739bdd535568a603db

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp313-cp313-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 19de039bb3496baa806cf22c8a68b989dd998a129847d3d8bb830d34ba777f08
MD5 7a9278cbdec4c05fb1acb5ef8e88b7bc
BLAKE2b-256 47f5be8d6d0bf4a5ba7e6a1825cf72150c27a81e041463b907b393f68693b9a9

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp313-cp313-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 fe292f2352e78a5b93f7501811a035f7d8945fc1986426b0be8cee7a8cbb2e1f
MD5 397d427542311c1fee63379584c6c778
BLAKE2b-256 49b0cbb6474af79fcf94b9c6ad84f6656d6f9603a3ab399781ae5c83a8d8bf86

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp312-cp312-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 a33d83905cfca7a2db2034a2f658941024755d75c1b500c93aacbe6164ec0300
MD5 d4602175c56066b9c44c0a43ee286013
BLAKE2b-256 d1f2f23740acf232540576439e2fbb5a231f74739047dbd1db5896cbd2eb6498

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp312-cp312-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 97683ee6231ec57c4f7bfd2ce53171d7fa3805cf8f307ddd238ed75707446cdb
MD5 8afd990b8b2c1c7b8fed7ca80b047c35
BLAKE2b-256 59075747add860c9747b56c2d11eb2831d6f6d831b27f715bdf33573258b0a6c

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp311-cp311-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 218b0cfeaf8c3184933635032f589c6c190513f3efe0b232b05b69d0b5be5c4b
MD5 936ca96589770cb6efd79e0f0816f72e
BLAKE2b-256 0a0b1b2e7b5a1c1f59fed0302fc90f03e3f4b61f81b75607453c6726fe642a36

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp311-cp311-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 d08feb9429bd872022d01fd790a6a9cded57a14f4ea2b2525ed62ee4a32f5f7f
MD5 7bc58f9861b23450304e25424cfadfeb
BLAKE2b-256 9b705d5e8dbb4911e890a98848da23cf624a022e320f93d79b637e50e5ddd0f0

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp310-cp310-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 29c01b026e41afe89db970466c1b6a16e60166e47a76e4c6f88148133c485ab0
MD5 ecb02711e60346ded704fb7d371e69ec
BLAKE2b-256 b70c45b8e0b3c8386ae88b82cd65ca249271f9a6978d60f208665228a358830b

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp310-cp310-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.6.0.dev0-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 516042afd42419aa085120950415ac2a49333a7f821756647ebc2e142049720b
MD5 24bc7476542ab8a7e18e5902cb349f70
BLAKE2b-256 fa4d22e1cd5be088668029e8c53f7b6a92d16e919525fa8535536f53956748ae
