Skip to main content

zentorch : A PyTorch* extension for AMD EPYC CPUs.

Project description

The latest ZenDNN Plugin for PyTorch* (zentorch) is here!

The ZenDNN PyTorch plugin is called zentorch. Combined with PyTorch's torch.compile, zentorch transforms deep learning pipelines into finely-tuned, AMD-specific engines, delivering unparalleled efficiency and speed for large-scale inference workloads.

This upgrade continues the focus on optimizing inference with Recommender Systems and Large Language Models on AMD EPYC™ CPUs. It includes AMD EPYC™ enhancements for bfloat16 performance, expanded support for cutting-edge models like Llama 3.2 and 3.3, Microsoft Phi, and more as well as support for a wide-variety of quantization configurations. The quantization support included 4-bit weight-only quantization, along with support for INT8 dynamic activation and INT8 weight quantization, and quantized support for the DLRM-v2 model with a mix of 8-bit and 4-bit quantization. This also includes support for running generative models with vLLM.

Under the hood, ZenDNN’s enhanced AMD-specific optimizations operate at every level. In addition to highly optimized operator microkernels, these include comprehensive graph optimizations including pattern identification, graph reordering, and fusions. They also incorporate optimized embedding bag kernels and enhanced zenMatMul matrix splitting strategies which leverage the AMD EPYC™ microarchitecture to deliver enhanced throughput and latency.

The vLLM-ZenTorch plugin extends these benefits to the vLLM inference engine, enabling plug-and-play acceleration of large language model inference on AMD EPYC™ CPUs. By integrating ZenTorch with vLLM, users can achieve significant throughput improvements for LLM workloads with zero code changes.

The zentorch plugin seamlessly works with PyTorch 2.12.0+cpu, offering a high-performance experience for deep learning on AMD EPYC™ platforms.

In addition to stable (GA) releases, the zentorch plugin provides weekly releases via the zentorch-weekly package on PyPI.

Support

We welcome feedback, suggestions, and bug reports. Should you have any of the these, please kindly file an issue on the ZenDNN Plugin for PyTorch GitHub page here

License

AMD copyrighted code in ZenDNN is subject to the Apache-2.0, MIT, or BSD-3-Clause licenses; consult the source code file headers for the applicable license. Third party copyrighted code in ZenDNN is subject to the licenses set forth in the source code file headers of such code.

Build Information

Field Value
Source Tag v2.12.0.1_rc2
Tag Commit ece56c8fd6723fd23d1bc4ae186ff61d5aa958c8
Build Commit ece56c8fd6723fd23d1bc4ae186ff61d5aa958c8
PyTorch Version 2.12.0+cpu
Release Type ga

Built from v2.12.0.1_rc2

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

zentorch-2.12.0.1-cp313-cp313-manylinux_2_28_x86_64.whl (25.3 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

zentorch-2.12.0.1-cp312-cp312-manylinux_2_28_x86_64.whl (25.3 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

zentorch-2.12.0.1-cp311-cp311-manylinux_2_28_x86_64.whl (25.3 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

zentorch-2.12.0.1-cp310-cp310-manylinux_2_28_x86_64.whl (25.3 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file zentorch-2.12.0.1-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for zentorch-2.12.0.1-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 22e2bf255956a12f4cbee857f066ab7019ea764eb46a455e6f81c9f2e91b9c8a
MD5 0adfadc35dfa157bbf82bbced7ed26a0
BLAKE2b-256 f984f9549f08997aefd16d6a85ebbe5417f5856e67b1ef90f4f3e5f0fc4a80a1

See more details on using hashes here.

File details

Details for the file zentorch-2.12.0.1-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for zentorch-2.12.0.1-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 c65c0c901717eb9d7872c94e994004c1ddf85b15dd3e97d3d5dc3aee0597e6ae
MD5 894db6d0063cd4151738c4f0e3652cd5
BLAKE2b-256 8e87f77e96dba20f4d8cdb54e714b8d4974217b86a1523f376ebc54e6f67a0c8

See more details on using hashes here.

File details

Details for the file zentorch-2.12.0.1-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for zentorch-2.12.0.1-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 b4cf4d4ca296b9729b33cfc2dcc91e989ec8bb98bb142dd3038dfa7e56a0d005
MD5 6b5d5db34054b96ef3b4645ec7c41117
BLAKE2b-256 813a2ef966bfdf31b76409d9a070661586ac2e800c2d49770498a61aa6e031e2

See more details on using hashes here.

File details

Details for the file zentorch-2.12.0.1-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for zentorch-2.12.0.1-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 fda6c6fd8cda386aed9275516dfc8277d2db9b2a0fc2db06abd5b22ee178f22a
MD5 d8df50187c4e69e6063f6f1a3243605d
BLAKE2b-256 675045a03ce75ebad119d3345aada33e8ebd265a99fd8568cce16bbc0e5af088

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page