Skip to main content

NVIDIA cuTENSOR

Project description

cuTENSOR is a high-performance CUDA library for tensor primitives.

Key Features

  • Extensive mixed-precision support:

    • FP64 inputs with FP32 compute.

    • FP32 inputs with FP16, BF16, or TF32 compute.

    • Complex-times-real operations.

    • Conjugate (without transpose) support.

  • Support for up to 64-dimensional tensors.

  • Arbitrary data layouts.

  • Trivially serializable data structures.

  • Main computational routines:

    • Direct (i.e., transpose-free) tensor contractions.

      • Support just-in-time compilation of dedicated kernels.

    • Tensor reductions (including partial reductions).

    • Element-wise tensor operations:

      • Support for various activation functions.

      • Support for padding of the output tensor

      • Arbitrary tensor permutations.

      • Conversion between different data types.

Documentation

Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.

Installation

The cuTENSOR wheel can be installed as follows:

pip install cutensor-cuXX

where XX is the CUDA major version (currently CUDA 12 & 13 are supported). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, please remove it prior to installing cutensor-cuXX.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

cutensor_cu12-2.7.0-py3-none-win_amd64.whl (316.3 MB view details)

Uploaded Python 3Windows x86-64

cutensor_cu12-2.7.0-py3-none-manylinux2014_x86_64.whl (333.0 MB view details)

Uploaded Python 3

cutensor_cu12-2.7.0-py3-none-manylinux2014_aarch64.whl (333.0 MB view details)

Uploaded Python 3

File details

Details for the file cutensor_cu12-2.7.0-py3-none-win_amd64.whl.

File metadata

File hashes

Hashes for cutensor_cu12-2.7.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 6b5df750a43218a395b0ddc2607623c07f610a10ff2e5a071273d93e568010f0
MD5 ba1c470a5f3bafe8477f7207a859064e
BLAKE2b-256 30bc18bd0789acfb9d746b4542c9a794efe5debb21bf7114fb39f16662805c46

See more details on using hashes here.

File details

Details for the file cutensor_cu12-2.7.0-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for cutensor_cu12-2.7.0-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 ddedf0d94555c457c6c282dd7369c10fd10db22029f3731f1ed9420600188e6d
MD5 37535fa5c4e18ee3e0ef14e6fc34014f
BLAKE2b-256 300bc9a0d71cc847c7b90d21ae1062737dbc16104fc2c2219b51ec64752017c4

See more details on using hashes here.

File details

Details for the file cutensor_cu12-2.7.0-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for cutensor_cu12-2.7.0-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 4518081e13a22c6fe1d7941ee7b0358f4ad71a1484e2a92563f0d1336476f939
MD5 2f20f95c6e4c30a2b0067a861dca80b6
BLAKE2b-256 874f7744b62f5d0c3ec0e4f1442ecd119343462d6f3a0d025508ace4f319d32a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page