Skip to main content

NVIDIA cuTENSOR

Reason this release was yanked:

placeholder

Project description

cuTENSOR is a high-performance CUDA library for tensor primitives.

Key Features

  • Extensive mixed-precision support:

    • FP64 inputs with FP32 compute.

    • FP32 inputs with FP16, BF16, or TF32 compute.

    • Complex-times-real operations.

    • Conjugate (without transpose) support.

  • Support for up to 64-dimensional tensors.

  • Arbitrary data layouts.

  • Trivially serializable data structures.

  • Main computational routines:

    • Direct (i.e., transpose-free) tensor contractions.

    • Tensor reductions (including partial reductions).

    • Element-wise tensor operations:

      • Support for various activation functions.

      • Arbitrary tensor permutations.

      • Conversion between different data types.

Documentation

Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

File details

Details for the file cutensor_cu11-0.0.1.dev0-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for cutensor_cu11-0.0.1.dev0-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 c3c024ae2f95149ebe004dd407459a6ef165e52812ee806a07393fb699ab6dab
MD5 7903591bb5bbff83c4261d64590c4ae2
BLAKE2b-256 d501a2d14baba77750f173baa9f0a5bee2b5423f27524b3183e68025120ec302

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page