zentorch: A PyTorch* extension for AMD EPYC CPUs.

Project description

The latest ZenDNN Plugin for PyTorch* (zentorch) is here!

The ZenDNN PyTorch plugin is called zentorch. Combined with PyTorch's torch.compile, zentorch tunes deep learning pipelines for AMD EPYC CPUs, improving efficiency and speed for large-scale inference workloads.

This upgrade continues the focus on optimizing inference for Recommender Systems and Large Language Models on AMD EPYC™ CPUs. It includes AMD EPYC™ enhancements for bfloat16 performance, expanded support for models such as Llama 3.2 and 3.3, Microsoft Phi, and others, as well as support for a wide variety of quantization configurations. The quantization support includes 4-bit weight-only quantization, INT8 dynamic activation and INT8 weight quantization, and quantized support for the DLRM-v2 model with a mix of 8-bit and 4-bit quantization. The release also supports running generative models with vLLM.

Under the hood, ZenDNN’s AMD-specific optimizations operate at every level. In addition to highly optimized operator microkernels, they include comprehensive graph optimizations such as pattern identification, graph reordering, and fusions. They also incorporate optimized embedding bag kernels and enhanced zenMatMul matrix splitting strategies, which use the AMD EPYC™ microarchitecture to improve throughput and latency.

The vLLM-ZenTorch plugin extends these benefits to the vLLM inference engine, enabling plug-and-play acceleration of large language model inference on AMD EPYC™ CPUs. By integrating ZenTorch with vLLM, users can achieve significant throughput improvements for LLM workloads with zero code changes.

The zentorch plugin seamlessly works with PyTorch 2.12.0 and 2.11.0, offering a high-performance experience for deep learning on AMD EPYC™ platforms.
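
As a rough sketch of how the torch.compile pairing mentioned above might be wired up, the helper below compiles a model with the zentorch backend when both packages are importable and otherwise returns the model unchanged. The backend string "zentorch" and the need to import zentorch before compiling are assumptions drawn from the project's upstream documentation, not from this page. compile_for_epyc and zentorch_available are hypothetical helper names.

```python
import importlib.util


def zentorch_available() -> bool:
    """Return True only if both torch and zentorch can be imported."""
    return all(
        importlib.util.find_spec(name) is not None
        for name in ("torch", "zentorch")
    )


def compile_for_epyc(model):
    """Compile `model` with the zentorch backend when it is available.

    Falls back to returning `model` untouched so the calling code keeps
    working on machines without torch or zentorch installed.
    """
    if not zentorch_available():
        return model

    import torch
    import zentorch  # noqa: F401  (assumed: importing registers the backend)

    # Assumption: the backend is registered under the name "zentorch".
    return torch.compile(model, backend="zentorch")
```

A caller would wrap an existing model once, for example `model = compile_for_epyc(model)`, and then run inference as usual. The fallback keeps the same script usable on development machines that lack the plugin.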

In addition to stable (GA) releases, the zentorch plugin provides weekly releases via the zentorch-weekly package on PyPI.

Support

We welcome feedback, suggestions, and bug reports. If you have any of these, please file an issue on the ZenDNN Plugin for PyTorch GitHub page.

License

AMD copyrighted code in ZenDNN is subject to the Apache-2.0, MIT, or BSD-3-Clause licenses; consult the source code file headers for the applicable license. Third party copyrighted code in ZenDNN is subject to the licenses set forth in the source code file headers of such code.

Build Information

Field            Value
Source Tag       v2.12.0.0
Tag Commit       f2d1555966a492e34562606149f16e4e73555bae
Build Commit     f2d1555966a492e34562606149f16e4e73555bae
PyTorch Version  2.11.0+cpu
Release Type     ga

Built from v2.12.0.0

Download files

Download the file for your platform.

Source Distributions

No source distribution files are available for this release.

Built Distributions

zentorch-2.11.0.0-cp313-cp313-manylinux_2_28_x86_64.whl (24.7 MB)

CPython 3.13, manylinux: glibc 2.28+ x86-64

zentorch-2.11.0.0-cp312-cp312-manylinux_2_28_x86_64.whl (24.7 MB)

CPython 3.12, manylinux: glibc 2.28+ x86-64

zentorch-2.11.0.0-cp311-cp311-manylinux_2_28_x86_64.whl (24.7 MB)

CPython 3.11, manylinux: glibc 2.28+ x86-64

zentorch-2.11.0.0-cp310-cp310-manylinux_2_28_x86_64.whl (24.7 MB)

CPython 3.10, manylinux: glibc 2.28+ x86-64
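
The wheel names above follow the standard name-version-python-abi-platform layout. As an illustrative sketch only, the snippet below splits such a name into its tags and picks the entry that matches a given CPython tag. parse_wheel_name, pick_wheel, and current_python_tag are hypothetical helpers, and the parser assumes no optional build tag is present, which holds for the four files listed here. pip performs this selection automatically, so the code only shows how to read the names.

```python
import sys
from typing import NamedTuple, Optional, Sequence


class WheelTags(NamedTuple):
    distribution: str
    version: str
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_name(filename: str) -> WheelTags:
    """Split 'name-version-python-abi-platform.whl' into its five fields."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) != 5:
        # Wheels with an optional build tag have six fields; not handled here.
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return WheelTags(*parts)


def current_python_tag() -> str:
    """Tag for the running interpreter, assuming it is CPython."""
    return f"cp{sys.version_info.major}{sys.version_info.minor}"


def pick_wheel(filenames: Sequence[str], python_tag: str) -> Optional[str]:
    """Return the first wheel built for `python_tag`, or None if none match."""
    for name in filenames:
        if parse_wheel_name(name).python_tag == python_tag:
            return name
    return None


# File names copied from the listing above.
WHEELS = [
    "zentorch-2.11.0.0-cp313-cp313-manylinux_2_28_x86_64.whl",
    "zentorch-2.11.0.0-cp312-cp312-manylinux_2_28_x86_64.whl",
    "zentorch-2.11.0.0-cp311-cp311-manylinux_2_28_x86_64.whl",
    "zentorch-2.11.0.0-cp310-cp310-manylinux_2_28_x86_64.whl",
]
```

Calling `pick_wheel(WHEELS, current_python_tag())` returns the matching file name on CPython 3.10 through 3.13 and None on other versions.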

File details

Details for the file zentorch-2.11.0.0-cp313-cp313-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-2.11.0.0-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 9a6d802e2206e2b9cb258c6d4b6f6631a6ce6bdd60d0ba385341253828c6d643
MD5 3a5c303f6e96a6a134cc886c39ee98c6
BLAKE2b-256 ab7954d3bf1c61b130c5d9c045f5fa2516a3fffaaf63808abafa442ce57980d9


File details

Details for the file zentorch-2.11.0.0-cp312-cp312-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-2.11.0.0-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 807b836730ff0e694b14ffb1257c45571e6ee6183c72e74884b4a8388a7349f5
MD5 e1e7a108885087ace9958f21eb34e7bf
BLAKE2b-256 57f1fae93ef06079a3c64696867aa1f672938201f38c1e437944ad1dd05033d7


File details

Details for the file zentorch-2.11.0.0-cp311-cp311-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-2.11.0.0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 80ba7d06051bc811c4c3b26cb9d4551d265eeb4da83c3016b48e8419cbc9da8c
MD5 95be24e961b476c53f868ffcfaca623a
BLAKE2b-256 e7a47ecb39928d6db2bee5e23117f0fcda2a7fa1881d61d6d3487b908f59b116


File details

Details for the file zentorch-2.11.0.0-cp310-cp310-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-2.11.0.0-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 065ac51ce58d4ba2b6892360b826bf5761b433eebca7d9f1a01e621958491417
MD5 0ab185503b38ba4f3b5b045d1cf16ceb
BLAKE2b-256 9d5a7523f2915261e7614f51e22421d1bfd7fcc7040287e9f24e50e9972d0889
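
The SHA256 digests listed for each wheel can be checked against a downloaded file. The following is a minimal sketch using only the Python standard library. sha256_of_file and matches_published_hash are hypothetical helper names, and the file path passed in is whatever location the wheel was saved to.

```python
import hashlib
import hmac


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of the file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large wheels are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare the file's SHA256 digest with a published hex digest."""
    expected = expected_hex.strip().lower()
    return hmac.compare_digest(sha256_of_file(path), expected)
```

For the cp310 wheel, the expected value would be the SHA256 entry shown above, 065ac51ce58d4ba2b6892360b826bf5761b433eebca7d9f1a01e621958491417. A False result means the download should not be installed.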

