NVIDIA CUTLASS Python DSL

Project description

CUTLASS 4.x provides a Python-native interface for writing high-performance CUDA kernels based on core CUTLASS and CuTe concepts, without compromising performance. This gives a much smoother learning curve, orders-of-magnitude faster compile times, native integration with DL frameworks without glue code, and more intuitive metaprogramming that does not require deep C++ expertise.

Overall, we envision CUTLASS DSLs as a family of domain-specific languages (DSLs). With the 4.0 release, we are introducing the first of these, CuTe DSL. It is a low-level programming model that is fully consistent with CuTe C++ abstractions, exposing core concepts such as layouts, tensors, hardware atoms, and full control over the hardware thread and data hierarchy.
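To make the notion of a layout concrete, here is a minimal pure-Python sketch. It is not the CuTe DSL API: it assumes a flat (non-nested) layout is a pair of shape and stride tuples that maps a logical coordinate to a linear offset through the inner product of coordinate and stride. The names `Layout` and `offset` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Layout:
    """Hypothetical stand-in for a flat layout; not the CuTe DSL class."""

    shape: Tuple[int, ...]
    stride: Tuple[int, ...]

    def offset(self, coord: Tuple[int, ...]) -> int:
        """Map a logical coordinate to a linear offset in memory."""
        if len(coord) != len(self.shape):
            raise ValueError("coordinate rank must match layout rank")
        for c, extent in zip(coord, self.shape):
            if not 0 <= c < extent:
                raise IndexError(f"coordinate {coord} is outside shape {self.shape}")
        # Inner product of the coordinate with the strides.
        return sum(c * d for c, d in zip(coord, self.stride))


# The same 4x8 logical shape with two different memory orders.
row_major = Layout(shape=(4, 8), stride=(8, 1))
col_major = Layout(shape=(4, 8), stride=(1, 4))

print(row_major.offset((2, 3)))  # 2*8 + 3*1 = 19
print(col_major.offset((2, 3)))  # 2*1 + 3*4 = 14
```

Separating the logical shape from the strides is what lets the same indexing code address either memory order; the real CuTe abstractions generalize this to nested shapes and strides.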

CuTe DSL demonstrates optimal matrix multiply and other linear algebra operations targeting the programmable, high-throughput Tensor Cores implemented by NVIDIA's Ampere, Hopper, and Blackwell architectures.

We believe it will become an indispensable tool for students, researchers, and performance engineers alike — flattening the learning curve of GPU programming, rapidly prototyping kernel designs, and bringing optimized solutions into production.

CuTe DSL is currently in public beta and will graduate out of beta by end of summer 2025.

For more details, please visit the CUTLASS documentation or the CUTLASS GitHub repository.

Download files

Download the file for your platform.

Source Distributions

No source distribution files are available for this release.

Built Distributions

nvidia_cutlass_dsl_libs_base-4.5.1-cp314-cp314-manylinux_2_28_x86_64.whl (74.5 MB)
Uploaded: CPython 3.14, manylinux (glibc 2.28+), x86-64

nvidia_cutlass_dsl_libs_base-4.5.1-cp314-cp314-manylinux_2_28_aarch64.whl (75.6 MB)
Uploaded: CPython 3.14, manylinux (glibc 2.28+), ARM64

nvidia_cutlass_dsl_libs_base-4.5.1-cp313-cp313-manylinux_2_28_x86_64.whl (74.5 MB)
Uploaded: CPython 3.13, manylinux (glibc 2.28+), x86-64

nvidia_cutlass_dsl_libs_base-4.5.1-cp313-cp313-manylinux_2_28_aarch64.whl (75.6 MB)
Uploaded: CPython 3.13, manylinux (glibc 2.28+), ARM64

nvidia_cutlass_dsl_libs_base-4.5.1-cp312-cp312-manylinux_2_28_x86_64.whl (74.5 MB)
Uploaded: CPython 3.12, manylinux (glibc 2.28+), x86-64

nvidia_cutlass_dsl_libs_base-4.5.1-cp312-cp312-manylinux_2_28_aarch64.whl (75.6 MB)
Uploaded: CPython 3.12, manylinux (glibc 2.28+), ARM64

nvidia_cutlass_dsl_libs_base-4.5.1-cp311-cp311-manylinux_2_28_x86_64.whl (74.5 MB)
Uploaded: CPython 3.11, manylinux (glibc 2.28+), x86-64

nvidia_cutlass_dsl_libs_base-4.5.1-cp311-cp311-manylinux_2_28_aarch64.whl (75.6 MB)
Uploaded: CPython 3.11, manylinux (glibc 2.28+), ARM64

nvidia_cutlass_dsl_libs_base-4.5.1-cp310-cp310-manylinux_2_28_x86_64.whl (74.5 MB)
Uploaded: CPython 3.10, manylinux (glibc 2.28+), x86-64

nvidia_cutlass_dsl_libs_base-4.5.1-cp310-cp310-manylinux_2_28_aarch64.whl (75.6 MB)
Uploaded: CPython 3.10, manylinux (glibc 2.28+), ARM64
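The wheel names above follow the standard wheel file name convention, `{distribution}-{version}(-{build})?-{python tag}-{abi tag}-{platform tag}.whl`. As a rough sketch of how to read them, the snippet below splits a file name into those fields. The `WheelTags` type and `parse_wheel_filename` helper are hypothetical names introduced here, not part of any packaging library; for production use, a maintained parser such as the one in the `packaging` project is the safer choice.

```python
from typing import NamedTuple, Optional


class WheelTags(NamedTuple):
    """Fields of a wheel file name (hypothetical helper type)."""

    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_filename(filename: str) -> WheelTags:
    """Split a wheel file name into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        # No optional build tag present.
        dist, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        dist, version, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return WheelTags(dist, version, build, py, abi, plat)


tags = parse_wheel_filename(
    "nvidia_cutlass_dsl_libs_base-4.5.1-cp312-cp312-manylinux_2_28_x86_64.whl"
)
print(tags.python_tag, tags.abi_tag, tags.platform_tag)
```

Here `cp312` identifies CPython 3.12 for both the interpreter and ABI tags, and `manylinux_2_28_x86_64` indicates a Linux build that requires glibc 2.28 or newer on x86-64.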

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.1-cp314-cp314-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.1-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 277c034802e110a0c3840053f23d772d0366e1af4b86f0a19d81632d120634c6
MD5 a579ab81c5cec59579d1a6d21529ce72
BLAKE2b-256 f047aab054ab9f2400f803233d1506019f4e5d93ef9c30e8000fc62902a516cb
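A published digest like the SHA256 value above can be checked against a downloaded file before installing it. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of_file` and `matches_published_digest` are hypothetical, and the local path assumes the wheel was downloaded into the current directory.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of_file(path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex: str) -> bool:
    """Compare the file's SHA256 digest with the published hex string."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.strip().lower())


# SHA256 digest copied from the listing for the cp314 x86-64 wheel.
EXPECTED = "277c034802e110a0c3840053f23d772d0366e1af4b86f0a19d81632d120634c6"

# Assumed local download location; adjust to wherever the file was saved.
wheel = Path("nvidia_cutlass_dsl_libs_base-4.5.1-cp314-cp314-manylinux_2_28_x86_64.whl")
if wheel.exists():
    print("OK" if matches_published_digest(wheel, EXPECTED) else "MISMATCH")
else:
    print(f"{wheel} not found; download it first")
```

Reading in chunks keeps memory use flat for wheels of this size (roughly 75 MB). pip can also enforce digests itself through its hash-checking mode with `--require-hashes` and a requirements file.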

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.1-cp314-cp314-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.1-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 b5570b6e657363812916922cfcade91f84eb4f0db66ffbfe4ba4a2b7af246231
MD5 2a6bf95dbe68c5095d69dfca31c5f79f
BLAKE2b-256 12dde39cc5b914dbb3673b0b7ee4f477c598e1dbaa023a5504834d5023e7b27d

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.1-cp313-cp313-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.1-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 4b14ce2921ee9e62426dac6e1efa500f492bbfa05adc7f2f8190ccc87ec2436c
MD5 f33a15badf0715597cd0362621f1f589
BLAKE2b-256 2abfb33e5cecde6ea3cecbb604b8dee8d8b0c07a42c886f2929a709fdcd86f8b

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.1-cp313-cp313-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.1-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 4d6fec19fb0aca24c95731da66c5166f79eb24fe8845f2a76cfb5ae41b5f9fa4
MD5 ca5eec232a3f09675e1056e7a51e9503
BLAKE2b-256 daba49e298aea2882740fa85ab38dea4a826326e69d038889db9353426230508

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.1-cp312-cp312-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.1-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 527cb8b4025b34b39c11ec6d7bfe065c82f4051aafec9d038d43dcaaeb853c22
MD5 4739c751e7cc242da436a735b0462449
BLAKE2b-256 d6baf8c3f7db3de25be75e13527cd88a3019f0036370c9769ceefd1cf8165c76

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.1-cp312-cp312-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.1-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 e9825fc281b72b2b10e5a397c211ce0f16e6df55178230da4b3f141aeffbe6d6
MD5 270cbe6b6738628799cf0b047be8dd0c
BLAKE2b-256 172216bfee73a53195ca06a738d92aefd740f5af5eef32be8aec4f5c18a8e1d2

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.1-cp311-cp311-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.1-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 7581e03f1537ce707732ad561a0555ce6048d2b27380403dfe1b295314fbe7b8
MD5 cee44d74434246a6772fb28878b16366
BLAKE2b-256 1d615f4282b2e0eeb9256acd8cbb2a088c5e75c90e2e180c442c5a7e63c0ac84

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.1-cp311-cp311-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.1-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 e460a8ecccd71b99204e87fc68509beba3d9f3b7a21cff29341cece3c4a05abf
MD5 ea209de893d0df6f43cde6ff6a1bccc0
BLAKE2b-256 cfeda6480ed25d1f0a29f95f9b90c6eb0b3a0500e28d19f4f87bcde342d9f560

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.1-cp310-cp310-manylinux_2_28_x86_64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.1-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 37c1197ddbffa12926ce968fc854ca12edec2d466f8076031222959ac8893e4d
MD5 7dd8f947b4424386a7eae86c58d6da45
BLAKE2b-256 bed59fb56397b16c94ce4a5cc0d632606bd52ca9f68c6beb46990b3e7ba96e57

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.1-cp310-cp310-manylinux_2_28_aarch64.whl.

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.1-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 7833ba3524b1e972e8502499a1b2445ee1971b1fd6ed5a165ef6ce1c7c0e810b
MD5 e67238b1d100364b94fe37cbde49fee8
BLAKE2b-256 8a4a46d0698f30b27350ae96872bb5727b94c6379ac47cd142f1e2b9129ffeb9
