Skip to main content

NVIDIA CUTLASS Python DSL

Project description

CUTLASS 4.x provides a Python native interfaces for writing high-performance CUDA kernels based on core CUTLASS and CuTe concepts without any performance compromises. This allows for a much smoother learning curve, orders of magnitude faster compile times, native integration with DL frameworks without writing glue code, and much more intuitive metaprogramming that does not require deep C++ expertise.

Overall we envision CUTLASS DSLs as a family of domain-specific languages (DSLs). With the release of 4.0, we are releasing the first of these in CuTe DSL. This is a low level programming model that is fully consistent with CuTe C++ abstractions — exposing core concepts such as layouts, tensors, hardware atoms, and full control over the hardware thread and data hierarchy.

CuTe DSL demonstrates optimal matrix multiply and other linear algebra operations targeting the programmable, high-throughput Tensor Cores implemented by NVIDIA's Ampere, Hopper, and Blackwell architectures.

We believe it will become an indispensable tool for students, researchers, and performance engineers alike — flattening the learning curve of GPU programming, rapidly prototyping kernel designs, and bringing optimized solutions into production.

CuTe DSL is currently in public beta and will graduate out of beta by end of summer 2025.

For more details please visit CUTLASS Documentation or CUTLASS Github.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314t-manylinux_2_28_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314t-manylinux_2_28_aarch64.whl (75.6 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314-manylinux_2_28_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314-manylinux_2_28_aarch64.whl (75.6 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.2-cp313-cp313-manylinux_2_28_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.2-cp313-cp313-manylinux_2_28_aarch64.whl (75.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.2-cp312-cp312-manylinux_2_28_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.2-cp312-cp312-manylinux_2_28_aarch64.whl (75.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.2-cp311-cp311-manylinux_2_28_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.2-cp311-cp311-manylinux_2_28_aarch64.whl (75.6 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.2-cp310-cp310-manylinux_2_28_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.2-cp310-cp310-manylinux_2_28_aarch64.whl (75.6 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314t-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314t-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 abab8a0d2f3f5661533c366df78f973052b86a3b52b868d997a95dce5aa8f17b
MD5 15010a7ddf05b26b018404ed46f840f4
BLAKE2b-256 158408d695d2e0fa95891a2e5abd978f359d50125e4d1f056e54697d465fccc3

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314t-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314t-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 5aca392063ffbc7da30442a267928b22d4a2d37f9ea1db32e4487aa31b0fcc33
MD5 7a570286ea1774d35f4defcc1b9f4036
BLAKE2b-256 617c76a9d1ce5ade3f43ab6f10e361a9c1962d02177deeaf46f2c3684a7ae959

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 12c29f7c1f1f82851092ba3869264dafafb035228c0d9827a8db08b884fb80ca
MD5 a1f0b4c8db12eaa398107d09f6651791
BLAKE2b-256 3ebc5f9dd8c05c3e2f435228224f0b0e76e324c1bf0a6dcd3cfb917b5e94bad7

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 216eee6aa8107d35569f9451b66b03a3c53167841d1af9b630b966ef8d966e19
MD5 6931fdf7bd6c97a7c09375cfe6fb9bca
BLAKE2b-256 2eb27a5de500bb74915ab8b3875f4952ae07d562f33d06eef9b2569adf4c09ab

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 e59da7d89e5e4f8514c6530843f910f9d8734d8042dcaa079c9d9c5063eb3514
MD5 25b647a805e08471775bbf0d2e7386f8
BLAKE2b-256 0a6ebfe256ac08e5a6dfb11444809e54c76c3a2f05fff38dd173e2e71b95e4d2

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 395bd77cf642aeef311313453e6582f11c9357a4b81fe620ea3daccd1fccab9b
MD5 27e117fde745a9853161b71be6a0d8c3
BLAKE2b-256 b0f8b192015e273ff023a35741d6d5e4a93e4819160dee3955fc5d3d53534450

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 15ef6a59193667e663934ef4873f8ccad37455e9b7c3c419c3072113b8aedf61
MD5 fd169f6c9f09e3050861eb4e9083369b
BLAKE2b-256 9768c1247ab848f26c4ab56e562eea0e3f31fc14c9aaf0d883afaa92d8f05592

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 d2a3c412287e356fbe48fe9f845d6d33cd35dea5e20d7e4f628c20957967cacd
MD5 ced1a60f3768482f65ca50a528d5d0ae
BLAKE2b-256 b1efe827e3c67d72adbf4e8f680bdf03b1b67723d9e1ae7c3d0a1751f39f69ce

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 9117900cba53d3c21a8dacba6bbf3d6e5f269e427a526c320fb44707a0d57363
MD5 e8ad83ca2377cf0241bbc6c52c1645a1
BLAKE2b-256 0ce078eded54b4478ec01a91c75f1b9bc6dc73a2ec205c4fa2fdc25a456f4089

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 cbb555a95c7011e4b3ca328be407299c77d289660adbea22ed515d4406e6949c
MD5 f02215ebe67255f67dfda75de02378dd
BLAKE2b-256 c0a80cca1d11787128c66c0774374d1bb09313352eee11560dd00f36d6d62f36

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 386e832427e3670479049a1560e4d8d2e565d8c0f37a6852c6d7043d046548f1
MD5 b559205086187abcb87dead3a8f597c4
BLAKE2b-256 8e2b4de80442d33791322aa496e2a7f47ed08a42578bd1c7031ef0602009f8ad

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.2-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.2-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 b62807bc5ea13bbdef648212893fac407ed943f940cece56b880d44af243e075
MD5 bae4d994887730723a889dd89f1e81ab
BLAKE2b-256 fd3e2cca8745885aaba0d835a8be29e516e56930791c01f0806da95d3017a495

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page