
zentorch: A PyTorch* extension for AMD EPYC™ CPUs.

Project description

The latest ZenDNN Plugin for PyTorch* (zentorch) 5.2.0 is here!

zentorch is the ZenDNN plugin for PyTorch. Combined with PyTorch's torch.compile, zentorch transforms deep learning pipelines into finely tuned, AMD-specific engines, delivering efficiency and speed for large-scale inference workloads.
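As a rough illustration of the torch.compile pairing described above, the sketch below selects the "zentorch" backend when the plugin is importable and otherwise falls back to PyTorch's default "inductor" backend. The helper names pick_compile_backend and compile_for_inference are hypothetical, and the assumption that importing zentorch is what registers the backend name should be checked against the plugin's own documentation.

```python
import importlib.util


def pick_compile_backend() -> str:
    """Return "zentorch" when the plugin is importable, else PyTorch's default."""
    if importlib.util.find_spec("zentorch") is not None:
        return "zentorch"
    return "inductor"  # default backend used by torch.compile


def compile_for_inference(model):
    """Compile a model in eval mode with the backend chosen above (hypothetical helper)."""
    import torch  # imported lazily so pick_compile_backend works without PyTorch

    backend = pick_compile_backend()
    if backend == "zentorch":
        # Assumption: importing the plugin registers the "zentorch" backend name.
        import zentorch  # noqa: F401
    return torch.compile(model.eval(), backend=backend)
```

A typical call would wrap inference in torch.no_grad(), for example compiled = compile_for_inference(model) followed by compiled(inputs).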

This upgrade continues the focus on optimizing inference for Recommender Systems and Large Language Models on AMD EPYC™ CPUs. ZenDNN includes AMD EPYC™ enhancements for bfloat16 performance, expanded support for cutting-edge models such as Llama 3.1 and 3.2 and Microsoft Phi, and support for the INT4 quantized datatype. This includes the advanced Activation-Aware Weight Quantization (AWQ) algorithm for LLMs and quantized support for the DLRM-v2 model with int8 weights.

Under the hood, ZenDNN's AMD-specific optimizations operate at every level. In addition to highly optimized operator microkernels, they include graph optimizations such as pattern identification, graph reordering, and fusions. They also incorporate optimized embedding bag kernels and enhanced zenMatMul matrix splitting strategies, which leverage the AMD EPYC™ microarchitecture to improve throughput and latency.

The vLLM-ZenTorch plugin extends these benefits to the vLLM inference engine, enabling plug-and-play acceleration of large language model inference on AMD EPYC™ CPUs. By integrating ZenTorch with vLLM, users can achieve significant throughput improvements for LLM workloads with zero code changes.

zentorch 5.2.0 plugs in seamlessly with PyTorch versions including 2.10.0 and 2.9.1, offering a high-performance experience for deep learning on AMD EPYC™ platforms.
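Before installing, it can help to check whether the local PyTorch build is one of the two versions named above. The following is a minimal sketch using only the standard library; the helper names are hypothetical, and because the text says "including", a version missing from this tuple is not necessarily unsupported.

```python
from importlib import metadata

# The two PyTorch versions named in the release text (not an exhaustive list).
NAMED_TORCH_VERSIONS = ("2.10.0", "2.9.1")


def base_version(version: str) -> str:
    """Strip a local build suffix such as "+cpu" from a version string."""
    return version.split("+", 1)[0]


def is_named_torch_version(version: str) -> bool:
    """True when the version is one of those named in the release text."""
    return base_version(version) in NAMED_TORCH_VERSIONS


def installed_torch_version():
    """Return the installed torch version string, or None if torch is absent."""
    try:
        return metadata.version("torch")
    except metadata.PackageNotFoundError:
        return None
```

For example, is_named_torch_version(installed_torch_version() or "") reports whether the current environment matches one of the named versions.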

Support

We welcome feedback, suggestions, and bug reports. If you have any of these, please file an issue on the ZenDNN Plugin for PyTorch GitHub page.

License

AMD copyrighted code in ZenDNN is subject to the Apache-2.0, MIT, or BSD-3-Clause licenses; consult the source code file headers for the applicable license. Third party copyrighted code in ZenDNN is subject to the licenses set forth in the source code file headers of such code.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release. See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

zentorch-5.2.0-cp313-cp313-manylinux_2_28_x86_64.whl (25.5 MB)

Uploaded for CPython 3.13, manylinux: glibc 2.28+, x86-64

zentorch-5.2.0-cp312-cp312-manylinux_2_28_x86_64.whl (25.5 MB)

Uploaded for CPython 3.12, manylinux: glibc 2.28+, x86-64

zentorch-5.2.0-cp311-cp311-manylinux_2_28_x86_64.whl (25.5 MB)

Uploaded for CPython 3.11, manylinux: glibc 2.28+, x86-64

zentorch-5.2.0-cp310-cp310-manylinux_2_28_x86_64.whl (25.5 MB)

Uploaded for CPython 3.10, manylinux: glibc 2.28+, x86-64
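To pick the right file from the list above, the wheel file name can be split into its tags. The sketch below follows the standard wheel naming layout (distribution, version, optional build tag, Python tag, ABI tag, platform tag); the WheelTags type and both function names are hypothetical helpers, not part of zentorch or pip.

```python
import sys
from typing import NamedTuple


class WheelTags(NamedTuple):
    distribution: str
    version: str
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_name(filename: str) -> WheelTags:
    """Split a wheel file name into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 6:
        del parts[2]  # drop the optional build tag
    if len(parts) != 5:
        raise ValueError(f"unexpected wheel file name layout: {filename!r}")
    return WheelTags(*parts)


def current_python_tag() -> str:
    """Return the CPython tag of the running interpreter, e.g. "cp312" on 3.12."""
    return f"cp{sys.version_info.major}{sys.version_info.minor}"
```

Comparing parse_wheel_name(name).python_tag with current_python_tag() indicates which of the four wheels matches the running interpreter; in practice pip performs this selection automatically.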

File details

Details for the file zentorch-5.2.0-cp313-cp313-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-5.2.0-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm    Hash digest
SHA256       0d647fb6d3ab08d912628a2b848bdf6596e5519fa591fc1a939a2e4431b891e3
MD5          aea604d248ee9bd10614322940a2dcd4
BLAKE2b-256  0eab726eadc22a521e82c8b69e8f6ca79a40299b23243df2e74000e2ac5e7c17

See more details on using hashes here.
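A downloaded wheel can be checked against its published SHA256 digest before installation. The sketch below uses only the Python standard library; the function names and the local file path in the usage note are assumptions, while the digest constant is the SHA256 value listed for the cp313 wheel.

```python
import hashlib
import hmac

# SHA256 digest listed for zentorch-5.2.0-cp313-cp313-manylinux_2_28_x86_64.whl
EXPECTED_SHA256 = "0d647fb6d3ab08d912628a2b848bdf6596e5519fa591fc1a939a2e4431b891e3"


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large wheels are not read into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected_hex: str) -> bool:
    """Compare the file's SHA256 digest with the expected hex string."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.strip().lower())
```

For a wheel saved in the current directory, matches_expected("zentorch-5.2.0-cp313-cp313-manylinux_2_28_x86_64.whl", EXPECTED_SHA256) should return True for an intact download. pip can enforce the same check through hash-checking mode with a requirements file.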

File details

Details for the file zentorch-5.2.0-cp312-cp312-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-5.2.0-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm    Hash digest
SHA256       3cd714cc8589bc642165e7fc34c5917c0c1d52f713c7633508d51d3db001e5c8
MD5          ebfff58cd56e59572fde4365b8e42f9e
BLAKE2b-256  b2ff8036af9b59e4e62aa29ea78a470558fa4ed73c79442c5cebee32be929d94


File details

Details for the file zentorch-5.2.0-cp311-cp311-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-5.2.0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm    Hash digest
SHA256       339c5f13c5660deb2eee3a1aa68cf677af0c3de890a0f3969350fb2094c60b99
MD5          1ebb8c6cc68d5ee66971c2fd8e6cbfd1
BLAKE2b-256  8bc358a71a7b0ba89edb94cdcde3bbc92282d0a8057856af9ea9737d5f1918e2


File details

Details for the file zentorch-5.2.0-cp310-cp310-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-5.2.0-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm    Hash digest
SHA256       90cb918bfceb32ef9ddb4dc3288f97c0bb0a353eec109060824d44fa048aae98
MD5          282be64148ac469411227fb8fd25ab3f
BLAKE2b-256  b0bbdff6c472e05a2d4c5839e121c4e1fc6ad83c947e7d45a99d2a11694849e2

