Skip to main content

NVIDIA cuTENSOR

Project description

cuTENSOR is a high-performance CUDA library for tensor primitives.

Key Features

  • Extensive mixed-precision support:

    • FP64 inputs with FP32 compute.

    • FP32 inputs with FP16, BF16, or TF32 compute.

    • Complex-times-real operations.

    • Conjugate (without transpose) support.

  • Support for up to 64-dimensional tensors.

  • Arbitrary data layouts.

  • Trivially serializable data structures.

  • Main computational routines:

    • Direct (i.e., transpose-free) tensor contractions.

      • Support just-in-time compilation of dedicated kernels.

    • Tensor reductions (including partial reductions).

    • Element-wise tensor operations:

      • Support for various activation functions.

      • Support for padding of the output tensor

      • Arbitrary tensor permutations.

      • Conversion between different data types.

Documentation

Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.

Installation

The cuTENSOR wheel can be installed as follows:

pip install cutensor-cuXX

where XX is the CUDA major version (currently CUDA 12 & 13 are supported). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, please remove it prior to installing cutensor-cuXX.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

cutensor_cu13-2.7.0-py3-none-win_amd64.whl (224.8 MB view details)

Uploaded Python 3Windows x86-64

cutensor_cu13-2.7.0-py3-none-manylinux2014_x86_64.whl (242.6 MB view details)

Uploaded Python 3

cutensor_cu13-2.7.0-py3-none-manylinux2014_aarch64.whl (242.3 MB view details)

Uploaded Python 3

File details

Details for the file cutensor_cu13-2.7.0-py3-none-win_amd64.whl.

File metadata

File hashes

Hashes for cutensor_cu13-2.7.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 7834c69b40cce5f6c086624ccb88a70a05461e263e52e368a9eaebca937c1c0f
MD5 0750fed9931c5240df169366ba236de0
BLAKE2b-256 a8c2b76e58bb8beb0ac95cbbfd79659d38b69f38b383f3e179671bcfa9caae6e

See more details on using hashes here.

File details

Details for the file cutensor_cu13-2.7.0-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for cutensor_cu13-2.7.0-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 9a4187a153ddd431dca4313e2efc6f6f8bd4fda4dd0e41d863c9e039705b6f8f
MD5 4fe2f61da51c951641c98446a75571ed
BLAKE2b-256 19da53b58aad8261451ca4e9384791ac8af1fb8b4ca976e8feba56a50d5e53ba

See more details on using hashes here.

File details

Details for the file cutensor_cu13-2.7.0-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for cutensor_cu13-2.7.0-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 293730541bdcc260c2f735ba4aa123cd907ca143a964252d1912a58c1e37c18a
MD5 fce908833d0d77bec7817b6376addc05
BLAKE2b-256 b16573c65022634bac7816bdeaf453a7c37820c4d6fcd79832108e2662d10cd6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page