zentorch: A PyTorch* extension for AMD EPYC CPUs.

Project description

The latest ZenDNN Plugin for PyTorch* (zentorch) 5.2.1 is here!

zentorch is the ZenDNN plugin for PyTorch. Used with PyTorch's torch.compile, zentorch tunes deep learning pipelines for AMD EPYC™ CPUs, targeting higher efficiency and speed for large-scale inference workloads.
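
As a rough illustration of the torch.compile pairing described above, the sketch below selects a compile backend by name. It assumes, beyond what this page states, that the plugin registers a torch.compile backend named "zentorch" once `import zentorch` has run. The helpers choose_backend and compile_for_inference are hypothetical names, and "inductor" (PyTorch's default backend) is used here only as a fallback.

```python
import importlib.util


def choose_backend(preferred: str = "zentorch", fallback: str = "inductor") -> str:
    """Return `preferred` if a module of that name is importable, else `fallback`."""
    return preferred if importlib.util.find_spec(preferred) is not None else fallback


def compile_for_inference(model):
    """Compile `model` with torch.compile, using zentorch when it is installed."""
    import torch  # imported lazily so this file loads even without PyTorch

    backend = choose_backend()
    if backend == "zentorch":
        # Assumption: importing the plugin is what registers the backend name.
        import zentorch  # noqa: F401
    return torch.compile(model, backend=backend)
```

A typical call would be compile_for_inference(model.eval()), with the compiled model then run under torch.no_grad(). Consult the zentorch documentation for the options it actually supports.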

This upgrade continues the focus on optimizing inference for Recommender Systems and Large Language Models on AMD EPYC™ CPUs. It includes AMD EPYC™ enhancements for bfloat16 performance; expanded support for models such as Llama 3.2 and 3.3, Microsoft Phi, and more; and support for a wide variety of quantization configurations. The quantization support includes 4-bit weight-only quantization, INT8 dynamic activation and INT8 weight quantization, and quantized support for the DLRM-v2 model with a mix of 8-bit and 4-bit quantization. This release also supports running generative models with vLLM.

Under the hood, ZenDNN’s AMD-specific optimizations operate at every level. In addition to highly optimized operator microkernels, they include graph optimizations such as pattern identification, graph reordering, and fusions. They also incorporate optimized embedding bag kernels and enhanced zenMatMul matrix splitting strategies, which leverage the AMD EPYC™ microarchitecture to improve throughput and latency.

The vLLM-ZenTorch plugin extends these benefits to the vLLM inference engine, enabling plug-and-play acceleration of large language model inference on AMD EPYC™ CPUs. By integrating ZenTorch with vLLM, users can achieve significant throughput improvements for LLM workloads with zero code changes.

The zentorch 5.2.1 plugin works with PyTorch versions including 2.11.0 and 2.10.0 on AMD EPYC™ platforms.

In addition to stable releases, the zentorch plugin provides weekly minor releases that extend support to newer PyTorch versions. The latest minor release plugin supports PyTorch 2.11.0 and 2.10.0. These weekly releases are only available as source builds.

Support

We welcome feedback, suggestions, and bug reports. If you have any of these, please file an issue on the ZenDNN Plugin for PyTorch GitHub page.

License

AMD copyrighted code in ZenDNN is subject to the Apache-2.0, MIT, or BSD-3-Clause licenses; consult the source code file headers for the applicable license. Third party copyrighted code in ZenDNN is subject to the licenses set forth in the source code file headers of such code.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release. See the tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

zentorch-5.2.1-cp313-cp313-manylinux_2_28_x86_64.whl (26.3 MB)

Uploaded CPython 3.13 manylinux: glibc 2.28+ x86-64

zentorch-5.2.1-cp312-cp312-manylinux_2_28_x86_64.whl (26.3 MB)

Uploaded CPython 3.12 manylinux: glibc 2.28+ x86-64

zentorch-5.2.1-cp311-cp311-manylinux_2_28_x86_64.whl (26.3 MB)

Uploaded CPython 3.11 manylinux: glibc 2.28+ x86-64

zentorch-5.2.1-cp310-cp310-manylinux_2_28_x86_64.whl (26.2 MB)

Uploaded CPython 3.10 manylinux: glibc 2.28+ x86-64
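
The built distribution names above follow the standard wheel naming scheme: distribution, version, an optional build tag, Python tag, ABI tag, and platform tag. The following is a minimal sketch of splitting a filename into those fields, for example to pick the wheel that matches an interpreter. WheelName and parse_wheel_name are illustrative names, not part of zentorch or pip.

```python
from typing import NamedTuple, Optional


class WheelName(NamedTuple):
    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_name(filename: str) -> WheelName:
    """Split a wheel filename into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel filename: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        # No build tag present.
        dist, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        dist, version, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return WheelName(dist, version, build, py, abi, plat)


info = parse_wheel_name("zentorch-5.2.1-cp313-cp313-manylinux_2_28_x86_64.whl")
print(info.python_tag, info.platform_tag)
```

Here the Python tag cp313 corresponds to CPython 3.13, and the platform tag manylinux_2_28_x86_64 corresponds to the "glibc 2.28+ x86-64" label in the listing.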

File details

Details for the file zentorch-5.2.1-cp313-cp313-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-5.2.1-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0841789b28e8d71b2e60e19c07e65a98cea38bcd3665af8fc8f209ee2d6ec0e5
MD5 8cd0cc4b7f00d9af80aca5ceafc00446
BLAKE2b-256 2deff0b78b2ce1f8a579445fe00eb39544653dc50738fa6333a83f54634ae83d

See more details on using hashes here.
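
One way to use the digests listed on this page is to recompute the SHA256 of a downloaded wheel and compare it with the published value. The sketch below uses only the Python standard library. The function names are illustrative, and the file path in the usage comment is a hypothetical local download location.

```python
import hashlib
import hmac


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_sha256(path: str, expected_hex: str) -> bool:
    """Compare the file's digest with a published hex digest, ignoring case and padding."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.strip().lower())


# Usage (path is hypothetical; the digest is the SHA256 listed for the cp313 wheel):
# matches_published_sha256(
#     "zentorch-5.2.1-cp313-cp313-manylinux_2_28_x86_64.whl",
#     "0841789b28e8d71b2e60e19c07e65a98cea38bcd3665af8fc8f209ee2d6ec0e5",
# )
```

pip can perform the same check at install time when a requirements file pins hashes with its --hash option.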

File details

Details for the file zentorch-5.2.1-cp312-cp312-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-5.2.1-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 46f7d175b88c75b6d08b6bc84b9737aecf3d17ba4e64a9dbfa2f77692834f192
MD5 d0743d4d2abe12efe234035918d2212f
BLAKE2b-256 87e376d8be9406bff9365feb418903fe3d8db0f0708c2053e23439c2bc9a3145

File details

Details for the file zentorch-5.2.1-cp311-cp311-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-5.2.1-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 f7bcdbf5ea7920aaf7541627758ea910d0ddd57fe6f8860e1ebb992713189aba
MD5 f9c0c0adfbed9ba2ce2121712db10674
BLAKE2b-256 804b8be3e7cd1c3a47f8766b2419a163ee71a8705e1bd811eb183d14a363839b

File details

Details for the file zentorch-5.2.1-cp310-cp310-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-5.2.1-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 ea8c7ea055b25165924e32f6441ede087f3bebb9e9da6b369bb0dc0e2279b759
MD5 ab9bd3929223a5f52cd96915f6b02dfb
BLAKE2b-256 af2addd305b0c3d36badec083de28a5c1d8c098103c31011251e7694d9adac54
