Skip to main content

NVIDIA cuTENSOR

Project description

cuTENSOR is a high-performance CUDA library for tensor primitives.

Key Features

  • Extensive mixed-precision support:

    • FP64 inputs with FP32 compute.

    • FP32 inputs with FP16, BF16, or TF32 compute.

    • Complex-times-real operations.

    • Conjugate (without transpose) support.

  • Support for up to 64-dimensional tensors.

  • Arbitrary data layouts.

  • Trivially serializable data structures.

  • Main computational routines:

    • Direct (i.e., transpose-free) tensor contractions.

      • Support just-in-time compilation of dedicated kernels.

    • Tensor reductions (including partial reductions).

    • Element-wise tensor operations:

      • Support for various activation functions.

      • Support for padding of the output tensor

      • Arbitrary tensor permutations.

      • Conversion between different data types.

Documentation

Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.

Installation

The cuTENSOR wheel can be installed as follows:

pip install cutensor-cuXX

where XX is the CUDA major version (currently CUDA 12 & 13 are supported). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, please remove it prior to installing cutensor-cuXX.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

cutensor_cu13-2.4.1-py3-none-win_amd64.whl (187.0 MB view details)

Uploaded Python 3Windows x86-64

cutensor_cu13-2.4.1-py3-none-manylinux2014_x86_64.whl (204.1 MB view details)

Uploaded Python 3

cutensor_cu13-2.4.1-py3-none-manylinux2014_aarch64.whl (203.7 MB view details)

Uploaded Python 3

File details

Details for the file cutensor_cu13-2.4.1-py3-none-win_amd64.whl.

File metadata

File hashes

Hashes for cutensor_cu13-2.4.1-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 222d55324ae8fa0c841ba9cf97f384082d5fe9d2a996540cfb58396f2f7a0e39
MD5 c2fc8c2dee1a8659d60490486657e4a9
BLAKE2b-256 5ead4c2569ff00098990ad82c73ae6fd4d7ea3c397cf59d9432c1eb1bd557367

See more details on using hashes here.

File details

Details for the file cutensor_cu13-2.4.1-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for cutensor_cu13-2.4.1-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 1f3d684e04953ae44923b673a2d4156dd99a821400c42fe872aa85f8f166b728
MD5 32903533de081b121bb72c6efecbc36f
BLAKE2b-256 41aef31dde1ef35ac9892c154c8cdf43a80d8dd726d5a38cc22648beaab5ee15

See more details on using hashes here.

File details

Details for the file cutensor_cu13-2.4.1-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for cutensor_cu13-2.4.1-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 0ab20da80b82e663cfafdc2e2f4948143afcb85d840883c4c18c70e453d1cf69
MD5 3ca4c5ee12f3a15a94a28f02ee7dee19
BLAKE2b-256 b7915815375ca78b13cd56d1d034b0fab815056dea90279c65bdd373e46341b2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page