Skip to main content

NVIDIA cuTENSOR

Project description

cuTENSOR is a high-performance CUDA library for tensor primitives.

Key Features

  • Extensive mixed-precision support:

    • FP64 inputs with FP32 compute.

    • FP32 inputs with FP16, BF16, or TF32 compute.

    • Complex-times-real operations.

    • Conjugate (without transpose) support.

  • Support for up to 64-dimensional tensors.

  • Arbitrary data layouts.

  • Trivially serializable data structures.

  • Main computational routines:

    • Direct (i.e., transpose-free) tensor contractions.

      • Support just-in-time compilation of dedicated kernels.

    • Tensor reductions (including partial reductions).

    • Element-wise tensor operations:

      • Support for various activation functions.

      • Support for padding of the output tensor

      • Arbitrary tensor permutations.

      • Conversion between different data types.

Documentation

Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.

Installation

The cuTENSOR wheel can be installed as follows:

pip install cutensor-cuXX

where XX is the CUDA major version (currently CUDA 11 & 12 are supported). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, please remove it prior to installing cutensor-cuXX.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

cutensor_cu11-2.2.0-py3-none-win_amd64.whl (123.5 MB view details)

Uploaded Python 3Windows x86-64

cutensor_cu11-2.2.0-py3-none-manylinux2014_x86_64.whl (139.1 MB view details)

Uploaded Python 3

cutensor_cu11-2.2.0-py3-none-manylinux2014_aarch64.whl (138.4 MB view details)

Uploaded Python 3

File details

Details for the file cutensor_cu11-2.2.0-py3-none-win_amd64.whl.

File metadata

  • Download URL: cutensor_cu11-2.2.0-py3-none-win_amd64.whl
  • Upload date:
  • Size: 123.5 MB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for cutensor_cu11-2.2.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 a4e583b69a64aeb7b528005ed47178855aabf55a0b7ed287671cbf70c21613fe
MD5 a9eff8f06e3b05f772c534de6ea6e7b1
BLAKE2b-256 6bb335dd5b41d25f205aee003a31a06d84ec729a8fc616413d04f5d316cdb617

See more details on using hashes here.

File details

Details for the file cutensor_cu11-2.2.0-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for cutensor_cu11-2.2.0-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 d9112abf69902ff97869c328cd28f171485d26154f7e07e9539279d1c05e130b
MD5 114307f48944f7518dadcb1d07125051
BLAKE2b-256 515afad76419b7c4fab6b9dd92b6ed3925ff779f59e265e2841175b71fb6f4ff

See more details on using hashes here.

File details

Details for the file cutensor_cu11-2.2.0-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for cutensor_cu11-2.2.0-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 b2623e556bdb228a7cb3031d1391f17a7ab965430687e9491b4e15dd47f7da8a
MD5 6f48d2b6515d470e200cbda1cac512d2
BLAKE2b-256 41bb1ad37378ae7754fef40af03e714b007aaf1df7a800ecb0a6013c3adcdbf3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page