NVIDIA CUTLASS Python DSL

Project description

CUTLASS 4.x provides a Python-native interface for writing high-performance CUDA kernels based on core CUTLASS and CuTe concepts without any performance compromises. This allows for a much smoother learning curve, orders-of-magnitude faster compile times, native integration with DL frameworks without writing glue code, and much more intuitive metaprogramming that does not require deep C++ expertise.

Overall, we envision CUTLASS DSLs as a family of domain-specific languages (DSLs). With the release of 4.0, we are releasing the first of these, CuTe DSL. This is a low-level programming model that is fully consistent with CuTe C++ abstractions, exposing core concepts such as layouts, tensors, hardware atoms, and full control over the hardware thread and data hierarchy.

CuTe DSL demonstrates optimal matrix multiply and other linear algebra operations targeting the programmable, high-throughput Tensor Cores implemented by NVIDIA's Ampere, Hopper, and Blackwell architectures.

We believe it will become an indispensable tool for students, researchers, and performance engineers alike — flattening the learning curve of GPU programming, rapidly prototyping kernel designs, and bringing optimized solutions into production.

CuTe DSL is currently in public beta and will graduate out of beta by end of summer 2025.

For more details, please visit the CUTLASS documentation or the CUTLASS GitHub repository.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release. See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

nvidia_cutlass_dsl_libs_cu13-4.4.2-cp314-cp314-manylinux_2_28_x86_64.whl (78.4 MB)
CPython 3.14, manylinux: glibc 2.28+, x86-64

nvidia_cutlass_dsl_libs_cu13-4.4.2-cp314-cp314-manylinux_2_28_aarch64.whl (78.7 MB)
CPython 3.14, manylinux: glibc 2.28+, ARM64

nvidia_cutlass_dsl_libs_cu13-4.4.2-cp313-cp313-manylinux_2_28_x86_64.whl (78.4 MB)
CPython 3.13, manylinux: glibc 2.28+, x86-64

nvidia_cutlass_dsl_libs_cu13-4.4.2-cp313-cp313-manylinux_2_28_aarch64.whl (78.7 MB)
CPython 3.13, manylinux: glibc 2.28+, ARM64

nvidia_cutlass_dsl_libs_cu13-4.4.2-cp312-cp312-manylinux_2_28_x86_64.whl (78.4 MB)
CPython 3.12, manylinux: glibc 2.28+, x86-64

nvidia_cutlass_dsl_libs_cu13-4.4.2-cp312-cp312-manylinux_2_28_aarch64.whl (78.7 MB)
CPython 3.12, manylinux: glibc 2.28+, ARM64

nvidia_cutlass_dsl_libs_cu13-4.4.2-cp311-cp311-manylinux_2_28_x86_64.whl (78.4 MB)
CPython 3.11, manylinux: glibc 2.28+, x86-64

nvidia_cutlass_dsl_libs_cu13-4.4.2-cp311-cp311-manylinux_2_28_aarch64.whl (78.7 MB)
CPython 3.11, manylinux: glibc 2.28+, ARM64

nvidia_cutlass_dsl_libs_cu13-4.4.2-cp310-cp310-manylinux_2_28_x86_64.whl (78.4 MB)
CPython 3.10, manylinux: glibc 2.28+, x86-64

nvidia_cutlass_dsl_libs_cu13-4.4.2-cp310-cp310-manylinux_2_28_aarch64.whl (78.7 MB)
CPython 3.10, manylinux: glibc 2.28+, ARM64
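
The file names listed above follow the standard wheel naming scheme: distribution, version, an optional build tag, Python tag, ABI tag, and platform tag, separated by dashes. As a minimal sketch of how to read them, the hypothetical helper below (`parse_wheel_name` and `WheelName` are illustrative names, not part of this package) splits a file name into those fields using only the standard library.

```python
from typing import NamedTuple


class WheelName(NamedTuple):
    """Fields of a wheel file name (illustrative type, not a packaging API)."""
    distribution: str
    version: str
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_name(filename: str) -> WheelName:
    """Split a wheel file name into its dash-separated components."""
    suffix = ".whl"
    if not filename.endswith(suffix):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(suffix)].split("-")
    # Five fields normally; six when an optional build tag follows the version.
    if len(parts) == 6:
        del parts[2]  # drop the build tag for this simple sketch
    if len(parts) != 5:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return WheelName(*parts)


name = parse_wheel_name(
    "nvidia_cutlass_dsl_libs_cu13-4.4.2-cp314-cp314-manylinux_2_28_x86_64.whl"
)
print(name.python_tag, name.abi_tag, name.platform_tag)
```

Here `cp314` identifies CPython 3.14 for both the Python and ABI tags, and `manylinux_2_28_x86_64` identifies a Linux build requiring glibc 2.28 or newer on x86-64. For production use, the `packaging` library's `packaging.utils.parse_wheel_filename` is likely a better choice than a hand-rolled parser.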

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.4.2-cp314-cp314-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.4.2-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 c7614afdbc03841db51abd84a542b79837d12a10b6f735bc58892fd2a8e5298b
MD5 d9a2758f79aad2a48ac0528e50effe59
BLAKE2b-256 dde5e3951e454c1e3cc39403cd99afd3284eed5cff3b811a7a908ce223bab152

See more details on using hashes here.
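
One way to use the published digests is to recompute the SHA256 of a downloaded wheel and compare it with the value listed above. The sketch below assumes the wheel has already been downloaded into the current directory; `sha256_of` and `matches_published_hash` are hypothetical helper names chosen for illustration.

```python
import hashlib
from pathlib import Path

# File name and SHA256 digest copied from the listing above.
WHEEL_NAME = "nvidia_cutlass_dsl_libs_cu13-4.4.2-cp314-cp314-manylinux_2_28_x86_64.whl"
PUBLISHED_SHA256 = "c7614afdbc03841db51abd84a542b79837d12a10b6f735bc58892fd2a8e5298b"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with an expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    wheel = Path(WHEEL_NAME)  # assumed download location
    if wheel.exists():
        print("hash OK" if matches_published_hash(wheel, PUBLISHED_SHA256) else "hash MISMATCH")
    else:
        print(f"{WHEEL_NAME} not found; download it first")
```

Reading in chunks keeps memory use small for a file of this size (roughly 78 MB). pip can also enforce hashes at install time through its hash-checking mode with `--require-hashes` and a requirements file.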

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.4.2-cp314-cp314-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.4.2-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 9f41ea01b18e4eb16ff45c595480efe6323450b77a8abbcccb208ac7431c3817
MD5 df01c3123a0faf9196f5981ffc585202
BLAKE2b-256 4ce0c60f349ad0a72a591ce98957f3789d49dfac634075823d96c0b04545de13

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.4.2-cp313-cp313-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.4.2-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 1e82c30583b816a587becd61a9e26827fcdad7767d859352ffdb4206155ac879
MD5 a2450407c77c2d0ec553d2851ba710d5
BLAKE2b-256 2bcef29fcd03f57331dd49859c8777b37720af083ee9d487a62da08cf2b7c115

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.4.2-cp313-cp313-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.4.2-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 2fb575c73233a99d3e12d27f7411c9161e8335200c652500e5f79be2bff7dca8
MD5 33efef41ed8b9629137f0d43d1ddd28d
BLAKE2b-256 42a989a2b2421b150beafa41e790c6f1ca777b2fc1ebd291fdfb855aedd809c7

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.4.2-cp312-cp312-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.4.2-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 b67535ce55160207330a877ccb90837088dc79b730e9e2e66450fcc955365529
MD5 c53dc50833afdc6e3b79a88d21810dee
BLAKE2b-256 29b8c70356a84a7bea64a66490097920539f76843a2bc869640ac30efededc8d

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.4.2-cp312-cp312-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.4.2-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 df78bc75efb54ed4e021c41d7b2d5a1fd97723bcec74108392013a2446d12b1a
MD5 e9950a89d635d217a94b20969b72777f
BLAKE2b-256 7b6c3a6c14317e1f56196554c223efdd8f1156a2760bf38e66e0bf69f5c7e4f2

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.4.2-cp311-cp311-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.4.2-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 b3bd6ea3ecc65cf1200a1efd429dd854fcfc27a7e1bc3986f409d5b2f085ff88
MD5 dcd073aff08b2350a8826670441bf798
BLAKE2b-256 6afc2a69feb1bff6aa2f8bff23eaf63e04c37a3535e04119e45191ee66c3f6f5

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.4.2-cp311-cp311-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.4.2-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 1ac533b6131bbf46fac78790c29d68ac40831b1cfe0a63be96ebe1bf9748035e
MD5 2411c19003a31ebab2adb21faccb7c7b
BLAKE2b-256 df871983d9a5141e97ca7c88d2e14397c8bf045633a0a87499a736e3534bd3f1

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.4.2-cp310-cp310-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.4.2-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 b066b9b9ac9cb27050c0a024092fd116cc7c68fcd5b4bb936489e35d2c6e294f
MD5 d2319408d0a784a2de1fcd1d0b2fb77a
BLAKE2b-256 6ce842aaf05cd67b78367433a447fbe9f5d4829af7596211567471eccebf16e3

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.4.2-cp310-cp310-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.4.2-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 f2a7b742600ddd738b2b692f5a2c83abf7d1f863b70d5c382445d1a2cd74e8a7
MD5 a0e9e4ac69f185eacfde399c04de8166
BLAKE2b-256 8964d70aab6b723b1077d7ac7638caba8eda6c17ddf86620eb3181455961d1bc
