Skip to main content

NVIDIA cuTENSOR

Project description

cuTENSOR is a high-performance CUDA library for tensor primitives.

Key Features

  • Extensive mixed-precision support:

    • FP64 inputs with FP32 compute.

    • FP32 inputs with FP16, BF16, or TF32 compute.

    • Complex-times-real operations.

    • Conjugate (without transpose) support.

  • Support for up to 64-dimensional tensors.

  • Arbitrary data layouts.

  • Trivially serializable data structures.

  • Main computational routines:

    • Direct (i.e., transpose-free) tensor contractions.

      • Support just-in-time compilation of dedicated kernels.

    • Tensor reductions (including partial reductions).

    • Element-wise tensor operations:

      • Support for various activation functions.

      • Support for padding of the output tensor

      • Arbitrary tensor permutations.

      • Conversion between different data types.

Documentation

Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.

Installation

The cuTENSOR wheel can be installed as follows:

pip install cutensor-cuXX

where XX is the CUDA major version (currently CUDA 11 & 12 are supported). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, please remove it prior to installing cutensor-cuXX.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

cutensor_cu12-2.2.0-py3-none-win_amd64.whl (182.8 MB view details)

Uploaded Python 3Windows x86-64

cutensor_cu12-2.2.0-py3-none-manylinux2014_x86_64.whl (202.3 MB view details)

Uploaded Python 3

cutensor_cu12-2.2.0-py3-none-manylinux2014_aarch64.whl (202.2 MB view details)

Uploaded Python 3

File details

Details for the file cutensor_cu12-2.2.0-py3-none-win_amd64.whl.

File metadata

  • Download URL: cutensor_cu12-2.2.0-py3-none-win_amd64.whl
  • Upload date:
  • Size: 182.8 MB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for cutensor_cu12-2.2.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 445464d39e25d1e67761694a676c21694c37b668a3e7e23df760150957318f91
MD5 c58bec92bbc458bde27fae3c34a33bab
BLAKE2b-256 8d933c6fce7353f20bdf8f760b16d3f180a160ccd2025242d78918d9d704d5fa

See more details on using hashes here.

File details

Details for the file cutensor_cu12-2.2.0-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for cutensor_cu12-2.2.0-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 ecec61eac87a903e74b069dc86e53a273c60f496fd6b7c2203fbc110c8b4819d
MD5 64156d6ff5bc77c8d1ea209e5aa06db7
BLAKE2b-256 ae8816b2528417dc017f3e950f02e4c6f4123cdc1940cae4f2edfd1bfd828e35

See more details on using hashes here.

File details

Details for the file cutensor_cu12-2.2.0-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for cutensor_cu12-2.2.0-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 3b4faf4a036b8b178cb15305e37f9e3263339fc3272c117c568805ce04d39f3e
MD5 5533509360875527d9a383944365a436
BLAKE2b-256 8fc1861f5084c545bf7ad04c7509f8f6a30530ff6d262ceffb3ba248ba88c4d2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page