Skip to main content

NVIDIA cuTENSOR

Project description

cuTENSOR is a high-performance CUDA library for tensor primitives.

Key Features

  • Extensive mixed-precision support:

    • FP64 inputs with FP32 compute.

    • FP32 inputs with FP16, BF16, or TF32 compute.

    • Complex-times-real operations.

    • Conjugate (without transpose) support.

  • Support for up to 64-dimensional tensors.

  • Arbitrary data layouts.

  • Trivially serializable data structures.

  • Main computational routines:

    • Direct (i.e., transpose-free) tensor contractions.

      • Support just-in-time compilation of dedicated kernels.

    • Tensor reductions (including partial reductions).

    • Element-wise tensor operations:

      • Support for various activation functions.

      • Support for padding of the output tensor

      • Arbitrary tensor permutations.

      • Conversion between different data types.

Documentation

Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.

Installation

The cuTENSOR wheel can be installed as follows:

pip install cutensor-cuXX

where XX is the CUDA major version (currently CUDA 12 & 13 are supported). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, please remove it prior to installing cutensor-cuXX.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

cutensor_cu13-2.5.0-py3-none-win_amd64.whl (191.2 MB view details)

Uploaded Python 3Windows x86-64

cutensor_cu13-2.5.0-py3-none-manylinux2014_x86_64.whl (208.5 MB view details)

Uploaded Python 3

cutensor_cu13-2.5.0-py3-none-manylinux2014_aarch64.whl (208.0 MB view details)

Uploaded Python 3

File details

Details for the file cutensor_cu13-2.5.0-py3-none-win_amd64.whl.

File metadata

  • Download URL: cutensor_cu13-2.5.0-py3-none-win_amd64.whl
  • Upload date:
  • Size: 191.2 MB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.19

File hashes

Hashes for cutensor_cu13-2.5.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 30f45d34da48dc6286260518497eecd4714c25e543e89c262d6403b76c24469d
MD5 ae94cd3f5d578efff444904b42523304
BLAKE2b-256 748b287d66a271658b4ad16510a4c5925dc0bcc5921e8e600dcaaee9a3398827

See more details on using hashes here.

File details

Details for the file cutensor_cu13-2.5.0-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for cutensor_cu13-2.5.0-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 f75fd87ddd89c778a7c83dfa4956523e8af00475fc6edd1df4bda4e4a26bfad9
MD5 50c14bc7d726a1d993c12216f176ec91
BLAKE2b-256 efbb1d284473350c49c86a9999eb178b09d448179bcc398f7010e53f51c26ab7

See more details on using hashes here.

File details

Details for the file cutensor_cu13-2.5.0-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for cutensor_cu13-2.5.0-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 519a065abf0f03b6e0ed5fcbf5a37f2aaeba1990c7cd2a188d261e5af05ddc57
MD5 b3b11374eee8538924059db22dccc059
BLAKE2b-256 8f8505f33f7ba28a180a03498483c52a1b94c078c758e9a2fde79b627cc32800

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page