Skip to main content

NVIDIA cuTENSOR

Project description

cuTENSOR is a high-performance CUDA library for tensor primitives.

Key Features

  • Extensive mixed-precision support:

    • FP64 inputs with FP32 compute.

    • FP32 inputs with FP16, BF16, or TF32 compute.

    • Complex-times-real operations.

    • Conjugate (without transpose) support.

  • Support for up to 64-dimensional tensors.

  • Arbitrary data layouts.

  • Trivially serializable data structures.

  • Main computational routines:

    • Direct (i.e., transpose-free) tensor contractions.

      • Support just-in-time compilation of dedicated kernels.

    • Tensor reductions (including partial reductions).

    • Element-wise tensor operations:

      • Support for various activation functions.

      • Support for padding of the output tensor

      • Arbitrary tensor permutations.

      • Conversion between different data types.

Documentation

Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.

Installation

The cuTENSOR wheel can be installed as follows:

pip install cutensor-cuXX

where XX is the CUDA major version (currently CUDA 12 & 13 are supported). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, please remove it prior to installing cutensor-cuXX.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

cutensor_cu13-2.6.0-py3-none-win_amd64.whl (187.8 MB view details)

Uploaded Python 3Windows x86-64

cutensor_cu13-2.6.0-py3-none-manylinux2014_x86_64.whl (205.2 MB view details)

Uploaded Python 3

cutensor_cu13-2.6.0-py3-none-manylinux2014_aarch64.whl (204.7 MB view details)

Uploaded Python 3

File details

Details for the file cutensor_cu13-2.6.0-py3-none-win_amd64.whl.

File metadata

File hashes

Hashes for cutensor_cu13-2.6.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 2bb9d35480c205d6df80dd6813a4a13988905021b49d82ec08050bd5541ced05
MD5 8bf6cbc234fe6f14bb54716a81f80aea
BLAKE2b-256 c6b0e29b8d87c1eed35e10b97f3d9da92e0f6689468b93cd84aca0fbc990209d

See more details on using hashes here.

File details

Details for the file cutensor_cu13-2.6.0-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for cutensor_cu13-2.6.0-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 2c0c47bf0fa31652dc8e1dd8989b6273e34fb12d68430a50e1cf1dc368a67a24
MD5 3bd3dd4edce5b2f1d8ab7ea1b5a45730
BLAKE2b-256 97cc514b83bad4e5e18ff8c7ff17e5f05f01bb44908e85643d5cce3d3a5efbaf

See more details on using hashes here.

File details

Details for the file cutensor_cu13-2.6.0-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for cutensor_cu13-2.6.0-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 ab3ae0e4b0373d39f3dfef26f968dfe353d3b7ca813d4abc2c98a426bc7ae8f3
MD5 c46678c33a12adb0f8d26b9310eea5db
BLAKE2b-256 f7d3f94addb70feacd027dab0b2a2a116b3385083cebd63c4165ec0eb045ec41

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page