Skip to main content

NVIDIA CUTLASS Python DSL

Project description

CUTLASS 4.x provides a Python native interfaces for writing high-performance CUDA kernels based on core CUTLASS and CuTe concepts without any performance compromises. This allows for a much smoother learning curve, orders of magnitude faster compile times, native integration with DL frameworks without writing glue code, and much more intuitive meta-programming that does not require deep C++ expertise.

Overall we envision CUTLASS DSLs as a family of domain-specific languages (DSLs). With the release of 4.0, we are releasing the first of these in CuTe DSL. This is a low level programming model that is fully consistent with CuTe C++ abstractions — exposing core concepts such as layouts, tensors, hardware atoms, and full control over the hardware thread and data hierarchy.

CuTe DSL demonstrates optimal matrix multiply and other linear algebra operations targeting the programmable, high-throughput Tensor Cores implemented by NVIDIA's Ampere, Hopper, and Blackwell architectures.

We believe it will become an indispensable tool for students, researchers, and performance engineers alike — flattening the learning curve of GPU programming, rapidly prototyping kernel designs, and bringing optimized solutions into production.

CuTe DSL is currently in public beta and will graduate out of beta by summer 2026.

For more details please visit CUTLASS Documentation or CUTLASS GitHub.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314t-manylinux_2_28_x86_64.whl (88.5 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314t-manylinux_2_28_aarch64.whl (87.0 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314-manylinux_2_28_x86_64.whl (88.4 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314-manylinux_2_28_aarch64.whl (87.0 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp313-cp313-manylinux_2_28_x86_64.whl (88.4 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp313-cp313-manylinux_2_28_aarch64.whl (87.0 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp312-cp312-manylinux_2_28_x86_64.whl (88.4 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp312-cp312-manylinux_2_28_aarch64.whl (87.0 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp311-cp311-manylinux_2_28_x86_64.whl (88.4 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp311-cp311-manylinux_2_28_aarch64.whl (87.0 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp310-cp310-manylinux_2_28_x86_64.whl (88.4 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_cu12-4.6.0-cp310-cp310-manylinux_2_28_aarch64.whl (87.0 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314t-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314t-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 e8dabef7651ce49aa9659f4500a8b43c73325ac08ca1d88d75f383ae8711664d
MD5 a5ccf8737ec8e58691fa9e7d321b72ea
BLAKE2b-256 17c3425b2d64c1da0a1a6017e98d3d2aa69145bf77cf3a095aaa45e065abb4ba

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314t-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314t-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 f1dfb31449fe0a24131c53b9d4f957ff6bf3e52ea8709473b0972bf030929e6b
MD5 420b27348aa34386141790928cdabfed
BLAKE2b-256 68919dc39b2f65715ca47d8c084364650be8fbeeac63affba01d5e1d16ff5a77

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 18f2aa81d5eeeadcde520b07ce369dc947abf3e29d6655ace9bdedcafb57ac8b
MD5 bd3e2cf6740e5036de50ced5ffc967e5
BLAKE2b-256 68e1aef0960124c15d7b615e6b0fbd382a6b9650cd4bfb0a1a9eaf2c112fb511

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 af1428709b6cf37b8aed62f065a708acbd51eb2e68b5828bb8b61cd674ce57ed
MD5 45c70b7702615c609c6df3db4cdfb1c2
BLAKE2b-256 113581556560c4e01c0dbb0746b63529d4513db2e9b0e5f74f641580e3674fda

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 22028842dd9c6064a3de7756b301650be77ddebf6f2acd5336bfbcd05aaf4c02
MD5 2daa5cb8307cc8b274fa0b0bcf8db31d
BLAKE2b-256 07c12521ce3d3f46731d0563bf7e5e0fa6b6ea42c31bcb6763cf4bde16d7b3ce

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 45b72e41d343f6b0c1a98669719e03dca4769d7285ae85b7d2a5292168fc73ec
MD5 5214d1b0c41d7af71b9862d683d88e45
BLAKE2b-256 7c92773b79f50ca59ca878a5e6be53d7f407deeef56f0b8000bb8cddc2b66d9e

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 abc341ff0fce40ed0bdadf160f6afac07fb9d01768d4daebd1628c330b3e4210
MD5 e5e386d8669c482f95dfedde559b564e
BLAKE2b-256 bf64f3f8962a9b91dd9368b90e23b2ac81614d6e9df72b55365ec0c216c3f8f9

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 87f132ccc30946949868989f3b1b1adaa714ccdf5c636e5379b54909cc29576c
MD5 a03453d4e35f664b7874a7671ca01547
BLAKE2b-256 113862def848b65bf067f434df7680c7e8c48519b25bbd3f03f9cdff3606353b

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0479904db0736ab912b2f2db7c6a76cf3c9f6e953f94dc5d667f73c27f18d772
MD5 8bae480b201e72240deaca4c6303bd59
BLAKE2b-256 1e984501c0b4053cacbb4e555d306d891f2426ce7edbb148f6e78376418e0356

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 783d5da4aa19a4d11429435419b64264c96d429a1794cbc9df2aa905c1342eee
MD5 7f34c21cfc18a10d0c4dbe585883ec7a
BLAKE2b-256 d1c4ea041a7857b3c7e10e939d787feee59c6c8d2d51a9ae1518684d8894daa1

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 a1f5b27adab0caa8574e9663c4d2db6063b6d2c4a894489f0e1326c8395f7f7e
MD5 5ea243afdc5b77dc0a6970f7fb846700
BLAKE2b-256 d5c882428db415b151c21d4ec3f38f75cfbb88f14056dd0fc684bbfefc94d309

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_cu12-4.6.0-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_cu12-4.6.0-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 27685c36145b23f2c988babe75b4f29b25cd327c6a2189c5ed6edf08f3828508
MD5 eb18650837561e43179f43f226293180
BLAKE2b-256 ec110bcc6e0b58958f13f5e0f0df8c1f1078ffbacee49b8b3a09f660489aa926

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page