NVIDIA cuTENSOR

Project description

cuTENSOR is a high-performance CUDA library for tensor primitives.

Key Features

  • Extensive mixed-precision support:

    • FP64 inputs with FP32 compute.

    • FP32 inputs with FP16, BF16, or TF32 compute.

    • Complex-times-real operations.

    • Conjugate (without transpose) support.

  • Support for up to 64-dimensional tensors.

  • Arbitrary data layouts.

  • Trivially serializable data structures.

  • Main computational routines:

    • Direct (i.e., transpose-free) tensor contractions.

      • Support for just-in-time compilation of dedicated kernels.

    • Tensor reductions (including partial reductions).

    • Element-wise tensor operations:

      • Support for various activation functions.

      • Support for padding of the output tensor.

      • Arbitrary tensor permutations.

      • Conversion between different data types.
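
To make the "direct (i.e., transpose-free) tensor contractions" item above concrete, here is a minimal reference sketch in plain Python of what such a contraction computes. It does not call cuTENSOR. The function name `contract`, the mode labels a, b, k, l, and the nested-list layout are all assumptions chosen for illustration. The point is that the contracted modes k and l appear in a different order in the two inputs, and the result is still obtained by indexing alone, without first transposing either tensor.

```python
from itertools import product


def contract(A, B, dims):
    """Reference contraction C[a][b] = sum over k, l of A[a][k][l] * B[l][k][b].

    A is indexed as [a][k][l] and B as [l][k][b]; `dims` maps each
    (hypothetical) mode label to its extent.
    """
    na, nb, nk, nl = dims["a"], dims["b"], dims["k"], dims["l"]
    C = [[0.0] * nb for _ in range(na)]
    # Loop over every combination of free modes (a, b) and contracted modes (k, l).
    for a, b, k, l in product(range(na), range(nb), range(nk), range(nl)):
        C[a][b] += A[a][k][l] * B[l][k][b]
    return C


# Small illustrative inputs: A has extents (a=1, k=2, l=2), B has (l=2, k=2, b=1).
A = [[[1.0, 2.0], [3.0, 4.0]]]
B = [[[1.0], [2.0]], [[3.0], [4.0]]]
C = contract(A, B, {"a": 1, "b": 1, "k": 2, "l": 2})
```

A GPU library would replace the explicit loops with tuned kernels; the sketch only pins down the arithmetic being performed.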

Documentation

Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.

Installation

The cuTENSOR wheel can be installed as follows:

pip install cutensor-cuXX

where XX is the CUDA major version (CUDA 12 and 13 are currently supported). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, remove it before installing cutensor-cuXX.
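
The naming rule and the deprecation note can be checked with a short script. The following is a minimal sketch using only the Python standard library. The helper names (`wheel_name`, `has_deprecated_package`, `installed_distribution_names`) are hypothetical, and the supported versions are taken from the sentence above rather than queried from anywhere.

```python
import re
from importlib import metadata

# CUDA major versions named as supported in the text above (an assumption
# that may go stale as new releases appear).
SUPPORTED_CUDA_MAJORS = (12, 13)


def wheel_name(cuda_major):
    """Build the cutensor-cuXX package name for a CUDA major version."""
    if cuda_major not in SUPPORTED_CUDA_MAJORS:
        raise ValueError(f"unsupported CUDA major version: {cuda_major}")
    return f"cutensor-cu{cuda_major}"


def normalize(name):
    """Normalize a distribution name so '-', '_' and '.' compare equal."""
    return re.sub(r"[-_.]+", "-", name).lower()


def has_deprecated_package(installed_names):
    """Return True if the deprecated, unsuffixed 'cutensor' package is listed."""
    return "cutensor" in {normalize(n) for n in installed_names}


def installed_distribution_names():
    """List the names of distributions visible to the current interpreter."""
    names = (dist.metadata["Name"] for dist in metadata.distributions())
    return [n for n in names if n]


# Example use:
#   if has_deprecated_package(installed_distribution_names()):
#       print("Run 'pip uninstall cutensor' before installing", wheel_name(12))
```

Passing the list of names in as an argument keeps the check easy to test without touching the real environment.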

Download files

Download the file for your platform.

Source Distributions

No source distribution files are available for this release.

Built Distributions

  • cutensor_cu12-2.5.0-py3-none-win_amd64.whl (264.4 MB): Python 3, Windows x86-64
  • cutensor_cu12-2.5.0-py3-none-manylinux2014_x86_64.whl (280.8 MB): Python 3
  • cutensor_cu12-2.5.0-py3-none-manylinux2014_aarch64.whl (280.6 MB): Python 3

File details

Details for the file cutensor_cu12-2.5.0-py3-none-win_amd64.whl.

File metadata

  • Download URL: cutensor_cu12-2.5.0-py3-none-win_amd64.whl
  • Upload date:
  • Size: 264.4 MB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.19

File hashes

Hashes for cutensor_cu12-2.5.0-py3-none-win_amd64.whl
Algorithm    Hash digest
SHA256       094627ed8d6d5dd4d234da10d4cdf85aff38cc2dafbfcbb385b3045736a7cf2f
MD5          af33fb602afc54a0d86fc5bdcebaab88
BLAKE2b-256  c2a4e876d3577f34a710be593bb1d9cd19e8078765f315ff2830ec47ba5a61d1

File details

Details for the file cutensor_cu12-2.5.0-py3-none-manylinux2014_x86_64.whl.

File hashes

Hashes for cutensor_cu12-2.5.0-py3-none-manylinux2014_x86_64.whl
Algorithm    Hash digest
SHA256       6151e26d797a30bff2d283dfe1dd05881aa3f44da53f82caea1409baf0c50223
MD5          b34a01afbed667279c76082581c8312e
BLAKE2b-256  a3fbae730aba0b6da934482dd52bdeaceb32629db30d9a0a29b58a6abfcf68a5

File details

Details for the file cutensor_cu12-2.5.0-py3-none-manylinux2014_aarch64.whl.

File hashes

Hashes for cutensor_cu12-2.5.0-py3-none-manylinux2014_aarch64.whl
Algorithm    Hash digest
SHA256       3cf388c38ac0bd1370482d583b162a4b2db54935a4a6a7b5ae43112c4d57673b
MD5          588d345f4af7b25f9ef5e58f71d56361
BLAKE2b-256  2fae956a5a9dd0e0f1b261946fcdd705dfee674b029d3a12d5d80f40370940cd
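
The digests listed for each file can be used to confirm that a downloaded wheel is intact. Below is a minimal sketch using Python's standard `hashlib` module. The function names and the local file path in the usage comment are hypothetical, and the expected value is the SHA256 digest listed above for the manylinux2014_x86_64 wheel.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read fixed-size chunks so large wheels are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Return True if the file's SHA256 digest equals the published one."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Example use (the path is a hypothetical download location):
#   expected = "6151e26d797a30bff2d283dfe1dd05881aa3f44da53f82caea1409baf0c50223"
#   ok = matches_published_hash(
#       "cutensor_cu12-2.5.0-py3-none-manylinux2014_x86_64.whl", expected)
```

pip can perform the same comparison itself when a requirements file pins hashes and the install is run with `--require-hashes`.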
