Skip to main content

NVIDIA CUTLASS Python DSL

Project description

CUTLASS 4.x provides a Python native interfaces for writing high-performance CUDA kernels based on core CUTLASS and CuTe concepts without any performance compromises. This allows for a much smoother learning curve, orders of magnitude faster compile times, native integration with DL frameworks without writing glue code, and much more intuitive metaprogramming that does not require deep C++ expertise.

Overall we envision CUTLASS DSLs as a family of domain-specific languages (DSLs). With the release of 4.0, we are releasing the first of these in CuTe DSL. This is a low level programming model that is fully consistent with CuTe C++ abstractions — exposing core concepts such as layouts, tensors, hardware atoms, and full control over the hardware thread and data hierarchy.

CuTe DSL demonstrates optimal matrix multiply and other linear algebra operations targeting the programmable, high-throughput Tensor Cores implemented by NVIDIA's Ampere, Hopper, and Blackwell architectures.

We believe it will become an indispensable tool for students, researchers, and performance engineers alike — flattening the learning curve of GPU programming, rapidly prototyping kernel designs, and bringing optimized solutions into production.

CuTe DSL is currently in public beta and will graduate out of beta by end of summer 2025.

For more details please visit CUTLASS Documentation or CUTLASS Github.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp314-cp314-manylinux_2_28_x86_64.whl (78.4 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp314-cp314-manylinux_2_28_aarch64.whl (78.7 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp313-cp313-manylinux_2_28_x86_64.whl (78.4 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp313-cp313-manylinux_2_28_aarch64.whl (78.7 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp312-cp312-manylinux_2_28_x86_64.whl (78.4 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp312-cp312-manylinux_2_28_aarch64.whl (78.7 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp311-cp311-manylinux_2_28_x86_64.whl (78.4 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp311-cp311-manylinux_2_28_aarch64.whl (78.7 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp310-cp310-manylinux_2_28_x86_64.whl (78.4 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp310-cp310-manylinux_2_28_aarch64.whl (78.7 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 89bff91aeaeb1109c13b3729d1b020d0130e7a5d3758fc1f64ac11392642d01b
MD5 014ac6b8df23ae5c8c816c7e071bf70b
BLAKE2b-256 559c11bc59ec158add0baacd572e67a383fb375fc17de4940bbf755108af24f6

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp314-cp314-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 f4455f3471cc39dfff5b72dcc3ae9c21d147b32aa5d03004119b80a2b85b1de5
MD5 09b89a1a4b4f5e816561cef1c230432f
BLAKE2b-256 a2d37ec31543351c9e76194753c55d5299ce3367ec26777d0086d838128e104b

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 4bd6fc7d3e1f792a3761fdf60f627a9901c2e59633997c458d4525ef5d2a9d6e
MD5 f0c35384ac814daea945f8849e5316bb
BLAKE2b-256 9b40d141503c7f7ac6eefc904b1cef1e8f1a3068a2ed29fe0f3488bb5723fe9b

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 7da50c639081e115674f2ad7bd4089e3a7227591769d40468c844ca47cd489aa
MD5 04dee56bc83221e5a09bdcb01f508691
BLAKE2b-256 11a169c6f44d8a0a49c4bd404218a350fa49df96551a81bfdcafb3c43bc8532a

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 a63d95006f1dcddc9fc58117fbb2769dfd5a22bba44a27e7f7d06c28f5fd9878
MD5 12f5fc0dc409e7376ab814509facf619
BLAKE2b-256 d9ebdb64d6b0307a7ff3a93a3cb0a177dc8977b1c6787293ff064dd4cab3172a

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 55f1675091d5158073de7babfe6c2d218282e21013ba848d70cda682fda78868
MD5 9a73f7a9c34c3f149aed0261ff9a5841
BLAKE2b-256 f60d7838d6240112460e520eaf0440a9d30ab4cb5207e5d962b12433688bf638

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 f5f06e26a0d8c0518d339ae2b5cd7e8f0b4733e3bd788ec73a812df5a5c5842c
MD5 716b9b1d225624adea6f5f38d86d591c
BLAKE2b-256 479e72a454f71419a650ebb31a0751f1543c8b489b00f5e779ac0109e66411e3

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 c6975c2d7ea944abbcb290fa433ec57195d17ffad3d2292ca4fb66960d083740
MD5 1d05e1aff10a2f6bc7d415248de31ea7
BLAKE2b-256 318279ec6fa69348d1e027fa3089d7c804a1fb68b4eef58306427f6c62826ceb

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0c2868ccda1d86341d2a134111509c6156387f0f8abb551fe9589a00e86e938e
MD5 d65cec8060493b6e258b4726dea4369f
BLAKE2b-256 a89759d550af459db5ecc7d8f9302b37d74d5835f7d61390f66c1630dbcc4a41

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu13-4.5.0.dev0-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 ed5abbcf11db3f89cef3e5d279e79d976dfd18ef6babd2d5d6c69f4e044738b7
MD5 46001084171a62763b3812aec4cb4aeb
BLAKE2b-256 2a86a4fcef7115a725a491b21b63e356ae8b38883eeff383667cb30b35b0f294

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page