Skip to main content

NVIDIA CUTLASS Python DSL

Project description

CUTLASS 4.x provides a Python native interfaces for writing high-performance CUDA kernels based on core CUTLASS and CuTe concepts without any performance compromises. This allows for a much smoother learning curve, orders of magnitude faster compile times, native integration with DL frameworks without writing glue code, and much more intuitive metaprogramming that does not require deep C++ expertise.

Overall we envision CUTLASS DSLs as a family of domain-specific languages (DSLs). With the release of 4.0, we are releasing the first of these in CuTe DSL. This is a low level programming model that is fully consistent with CuTe C++ abstractions — exposing core concepts such as layouts, tensors, hardware atoms, and full control over the hardware thread and data hierarchy.

CuTe DSL demonstrates optimal matrix multiply and other linear algebra operations targeting the programmable, high-throughput Tensor Cores implemented by NVIDIA's Ampere, Hopper, and Blackwell architectures.

We believe it will become an indispensable tool for students, researchers, and performance engineers alike — flattening the learning curve of GPU programming, rapidly prototyping kernel designs, and bringing optimized solutions into production.

CuTe DSL is currently in public beta and will graduate out of beta by end of summer 2025.

For more details please visit CUTLASS Documentation or CUTLASS Github.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp314-cp314-manylinux_2_28_x86_64.whl (74.4 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp314-cp314-manylinux_2_28_aarch64.whl (75.5 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp313-cp313-manylinux_2_28_x86_64.whl (74.4 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp313-cp313-manylinux_2_28_aarch64.whl (75.5 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp312-cp312-manylinux_2_28_x86_64.whl (74.4 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp312-cp312-manylinux_2_28_aarch64.whl (75.5 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp311-cp311-manylinux_2_28_x86_64.whl (74.4 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp311-cp311-manylinux_2_28_aarch64.whl (75.5 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp310-cp310-manylinux_2_28_x86_64.whl (74.4 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp310-cp310-manylinux_2_28_aarch64.whl (75.5 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 30683667646327c69e819adc42b178fc7b4443c5281aa4330e32b9b1b4d33f5f
MD5 ef3ad72a7249dc2f7776a1372f7b41b5
BLAKE2b-256 e7212bcd52763ded21e0f53ee493a13350fdd9000ed60468189bfc93d223cca2

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp314-cp314-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 07dd9bf32eb2f1f49ccb61020c865e2bebb8255ff1c79c67393bd305437a59f8
MD5 04b88618d160b1bd03c089d9ee9c44cc
BLAKE2b-256 1dd6d243c1de628a25f7fe2971d65f59e9b8d5c5963fb6a14c33790f68caf73a

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 91a253dd8258875240713fdf96214777a04e95ec93f24962316c2c59365f952a
MD5 81ea8a9b634c3532e248c7652f78ff57
BLAKE2b-256 4f3b16a553c0e922ddcea3c1b944eb1babc2c8a24fdc720412d44f918b5a1fac

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 bf76915e52b8869961824abeb5d61ea573502b8c273e2423b7096f84ab2bb964
MD5 122832ca3d1cc01def0a637084ad3d44
BLAKE2b-256 b24fbb737dbfb200b415744c5cab1fe67a884f0ef41e66c5edff7946fa55b277

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 38136edf1a1cd0c49fed73dcce966c8ac8b4396bdbd996deada6f73c81f5896e
MD5 f3fec979cf1a1bbc4c9ab354a514a837
BLAKE2b-256 1fcd29d3122f72faab60ecf4dcbc4e0c457f24dff5ef23ccb8ed60a9912f05be

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 b1e710835056384e5735c3c975ac31f8c7c40b993cf71edfcee9715a37e76eb2
MD5 15c17eae2e598a0ad212d0fccc6f3c85
BLAKE2b-256 5251b3448c42e47e3735b1b7d67ecee473290c981acb40c2605a0bd603235601

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 814abf81679a7f00108753d1a514c0cff843e4c18ec7e0975b21cfd22434e9b4
MD5 4b19a3520ed2e77842a5cc1a98d4c621
BLAKE2b-256 f2eadc817711341f5f9bb5822c4600746f77bc9a9f00c5fcc57f6ac72f600d7a

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 add124fbb85c7d48e5c6e0d4c7afb560baab4ba80e37e0be8e8e97cd8f710796
MD5 55b8c3353694257b0f10a1e8a4867dbe
BLAKE2b-256 7d60179f6ea2314c18e77b3312881021bd9de3dc3560c74e732da78535b97a79

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 1dcd6b5fd63257f44b5eecec3e5774fb37b1f8e2539ea27a8c54991bd3b8b97d
MD5 a056a4522f2dc56aa45483ecaf9d1a6e
BLAKE2b-256 43db7c0ec3c12ad85797941d19a217346a6576d73006ac8deccc0eb26ff5e5b6

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0.dev0-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 e83fd779a603ced680f70d26c6bf2eaff43403d6837055d9a33135cf0378e2bc
MD5 bf5dbdc50225baf6ec62573c2a447c5b
BLAKE2b-256 179491b9354df3eab3412b12203c57a9e4adc7b69fdc12f930602bdd53d19730

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page