NVIDIA cuTENSOR

Project description

cuTENSOR is a high-performance CUDA library for tensor primitives.

Key Features

  • Extensive mixed-precision support:

    • FP64 inputs with FP32 compute.

    • FP32 inputs with FP16, BF16, or TF32 compute.

    • Complex-times-real operations.

    • Conjugate (without transpose) support.

  • Support for up to 64-dimensional tensors.

  • Arbitrary data layouts.

  • Trivially serializable data structures.

  • Main computational routines:

    • Direct (i.e., transpose-free) tensor contractions.

      • Support for just-in-time compilation of dedicated kernels.

    • Tensor reductions (including partial reductions).

    • Element-wise tensor operations:

      • Support for various activation functions.

      • Support for padding of the output tensor.

      • Arbitrary tensor permutations.

      • Conversion between different data types.
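As a rough illustration of what the contraction and reduction routines listed above compute, here is a pure-Python sketch. It does not call cuTENSOR; the function names `contract_km_nk_to_mn` and `reduce_over_k` are hypothetical and exist only for this example. The contraction reads its operands in their stored mode order (k, m) and (n, k), which is the idea behind a "direct" contraction: no operand is transposed into a matrix-multiply layout first.

```python
def contract_km_nk_to_mn(a, b):
    """Compute C[m][n] = sum_k A[k][m] * B[n][k] without transposing A or B.

    a is indexed as a[k][m], b is indexed as b[n][k] (nested lists).
    """
    dim_k, dim_m = len(a), len(a[0])
    dim_n = len(b)
    c = [[0] * dim_n for _ in range(dim_m)]
    for m in range(dim_m):
        for n in range(dim_n):
            # Index each operand in its own stored mode order.
            c[m][n] = sum(a[k][m] * b[n][k] for k in range(dim_k))
    return c


def reduce_over_k(a):
    """Partial reduction: R[m] = sum_k A[k][m] (mode k is summed, mode m is kept)."""
    dim_m = len(a[0])
    return [sum(row[m] for row in a) for m in range(dim_m)]


if __name__ == "__main__":
    a = [[1, 2], [3, 4], [5, 6]]   # modes (k=3, m=2)
    b = [[1, 0, 1], [0, 1, 0]]     # modes (n=2, k=3)
    print(contract_km_nk_to_mn(a, b))
    print(reduce_over_k(a))
```

The library performs the same kind of index arithmetic on the GPU for tensors with many more modes and arbitrary strides; this sketch only fixes the notation.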

Documentation

Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.

Installation

The cuTENSOR wheel can be installed as follows:

pip install cutensor-cuXX

where XX is the CUDA major version; CUDA 12 and 13 are currently supported (for example, cutensor-cu12). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, remove it before installing cutensor-cuXX.
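One way to check whether the deprecated cutensor package is still present before installing is to query the installed distributions from Python. The sketch below uses only the standard library's `importlib.metadata`; the helper names `installed_version` and `check_cutensor_install` are hypothetical, and the sketch assumes the distribution names match the package names given above.

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def check_cutensor_install(cuda_major=12):
    """Describe the state of the cuTENSOR wheels in the current environment."""
    target = f"cutensor-cu{cuda_major}"
    legacy = installed_version("cutensor")
    current = installed_version(target)
    if legacy is not None:
        # The unsuffixed package is deprecated and should be removed first.
        return f"deprecated 'cutensor' {legacy} found; uninstall it before installing {target}"
    if current is None:
        return f"{target} is not installed"
    return f"{target} {current} is installed"


if __name__ == "__main__":
    print(check_cutensor_install(12))
```

Run it with the same interpreter that pip installs into, since the result reflects that environment only.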

Download files

Download the file for your platform.

Source Distributions

No source distribution files are available for this release.

Built Distributions

cutensor_cu12-2.6.0-py3-none-win_amd64.whl (260.7 MB)

Uploaded: Python 3, Windows x86-64

cutensor_cu12-2.6.0-py3-none-manylinux2014_x86_64.whl (277.1 MB)

Uploaded: Python 3

cutensor_cu12-2.6.0-py3-none-manylinux2014_aarch64.whl (276.9 MB)

Uploaded: Python 3

File details

Details for the file cutensor_cu12-2.6.0-py3-none-win_amd64.whl.

File hashes

Hashes for cutensor_cu12-2.6.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 a3a0930362cb4b3ff784163e956b1a557535a2cd6d3f1a1612ad427a988f46e3
MD5 7aafc751853bdea184ec401730315833
BLAKE2b-256 68684f03d2a23eba24f3e7f9614251d752591a97c5aff43b83f14b9a4fddd238

File details

Details for the file cutensor_cu12-2.6.0-py3-none-manylinux2014_x86_64.whl.

File hashes

Hashes for cutensor_cu12-2.6.0-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 c0baefa562c60e39989c5aed3e3f2d2e655d396b2e4788f6fe1b5410e20fef70
MD5 c2c9ab7c44b223285ab767c68abdf092
BLAKE2b-256 6b6ea0b687567c46234354a9978dd6671b1ac362bddc0b0e52e6d0f2df7047f0

File details

Details for the file cutensor_cu12-2.6.0-py3-none-manylinux2014_aarch64.whl.

File hashes

Hashes for cutensor_cu12-2.6.0-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 99c3f29b4061d097371786cb00d4ca127a731b7568a962d1e069ce11baa8d344
MD5 3fa86dd2b8ebff54c8836da83222549f
BLAKE2b-256 43bb935e85c62ff0f9001ff3d5165daa07258c35848a423c43ae3c0e7d7acf24
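The SHA256 digests listed above can be compared against a downloaded wheel before installing it. The following is a minimal sketch using the standard library's `hashlib`; the helper names `sha256_of_file` and `matches_published_digest` are hypothetical, and the wheel path is whatever location you downloaded the file to.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Wheels are a few hundred MB, so avoid loading the file at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """True if the file's SHA256 equals the published digest (case-insensitive)."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

To use it, pass the path of the downloaded wheel and the SHA256 value listed for that exact file name; a mismatch means the file should not be installed.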
