zentorch: A PyTorch* extension for AMD EPYC CPUs.

Project description

The latest ZenDNN Plugin for PyTorch* (zentorch) is here!

The ZenDNN PyTorch plugin is called zentorch. Combined with PyTorch's torch.compile, zentorch turns deep learning pipelines into finely tuned, AMD-specific engines, improving efficiency and speed for large-scale inference workloads.
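As a rough illustration of how zentorch combines with torch.compile, the sketch below wraps a model with the zentorch backend when both packages are installed and otherwise returns the model unchanged. The backend name "zentorch" follows the project's README rather than anything stated on this page, and the helper names zentorch_available and compile_with_zentorch are hypothetical, so treat this as an assumption-laden sketch rather than the official recipe.

```python
import importlib.util


def zentorch_available() -> bool:
    """Return True when both torch and zentorch can be imported."""
    return (
        importlib.util.find_spec("torch") is not None
        and importlib.util.find_spec("zentorch") is not None
    )


def compile_with_zentorch(model):
    """Compile `model` with the zentorch backend if possible.

    Hypothetical helper: falls back to returning the model untouched
    when torch or zentorch is missing, so callers still get a callable.
    """
    if not zentorch_available():
        return model
    import torch
    import zentorch  # noqa: F401  (importing registers the backend with torch.compile)

    # Assumption: the backend is registered under the name "zentorch".
    return torch.compile(model, backend="zentorch")
```

A typical call would be compiled = compile_with_zentorch(my_model), followed by running inference on compiled as usual. Compilation is lazy, so the first forward pass is usually slower than later ones.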

This upgrade continues the focus on optimizing inference with Recommender Systems and Large Language Models on AMD EPYC™ CPUs. It includes AMD EPYC™ enhancements for bfloat16 performance, expanded support for cutting-edge models like Llama 3.2 and 3.3, Microsoft Phi, and more, as well as support for a wide variety of quantization configurations. The quantization support includes 4-bit weight-only quantization, INT8 dynamic activation and INT8 weight quantization, and quantized support for the DLRM-v2 model with a mix of 8-bit and 4-bit quantization. The release also supports running generative models with vLLM.

Under the hood, ZenDNN’s AMD-specific optimizations operate at every level. In addition to highly optimized operator microkernels, they include comprehensive graph optimizations such as pattern identification, graph reordering, and fusions. They also incorporate optimized embedding bag kernels and enhanced zenMatMul matrix splitting strategies, which leverage the AMD EPYC™ microarchitecture to improve throughput and latency.

The vLLM-ZenTorch plugin extends these benefits to the vLLM inference engine, enabling plug-and-play acceleration of large language model inference on AMD EPYC™ CPUs. By integrating ZenTorch with vLLM, users can achieve significant throughput improvements for LLM workloads with zero code changes.

The zentorch plugin seamlessly works with PyTorch 2.12.0 and 2.11.0, offering a high-performance experience for deep learning on AMD EPYC™ platforms.
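Before installing, it can help to confirm that the local PyTorch release is one of the two versions named above. The sketch below is a minimal check under one assumption: "works with" means an exact match on the release number, ignoring a local suffix such as "+cpu". The names SUPPORTED_TORCH_RELEASES, is_supported_torch, and installed_torch_version are hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version

# Assumption: only the releases named in the description are supported.
SUPPORTED_TORCH_RELEASES = {"2.12.0", "2.11.0"}


def is_supported_torch(version_string: str) -> bool:
    """Check a torch version string, ignoring a local suffix like '+cpu'."""
    release = version_string.split("+", 1)[0]
    return release in SUPPORTED_TORCH_RELEASES


def installed_torch_version():
    """Return the installed torch version string, or None if torch is absent."""
    try:
        return version("torch")
    except PackageNotFoundError:
        return None
```

For example, is_supported_torch("2.12.0+cpu") accepts the build listed under Build Information, while any other release is rejected.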

In addition to stable (GA) releases, the zentorch plugin provides weekly releases via the zentorch-weekly package on PyPI.

Support

We welcome feedback, suggestions, and bug reports. If you have any of these, please file an issue on the ZenDNN Plugin for PyTorch GitHub page.

License

AMD copyrighted code in ZenDNN is subject to the Apache-2.0, MIT, or BSD-3-Clause licenses; consult the source code file headers for the applicable license. Third party copyrighted code in ZenDNN is subject to the licenses set forth in the source code file headers of such code.

Build Information

Field            Value
Source Tag       v2.12.0.0
Tag Commit       f2d1555966a492e34562606149f16e4e73555bae
Build Commit     f2d1555966a492e34562606149f16e4e73555bae
PyTorch Version  2.12.0+cpu
Release Type     ga

Built from v2.12.0.0

Download files

Download the file for your platform.

Source Distributions

No source distribution files available for this release.

Built Distributions

zentorch-2.12.0.0-cp313-cp313-manylinux_2_28_x86_64.whl (24.7 MB)
CPython 3.13, manylinux: glibc 2.28+ x86-64

zentorch-2.12.0.0-cp312-cp312-manylinux_2_28_x86_64.whl (24.7 MB)
CPython 3.12, manylinux: glibc 2.28+ x86-64

zentorch-2.12.0.0-cp311-cp311-manylinux_2_28_x86_64.whl (24.7 MB)
CPython 3.11, manylinux: glibc 2.28+ x86-64

zentorch-2.12.0.0-cp310-cp310-manylinux_2_28_x86_64.whl (24.7 MB)
CPython 3.10, manylinux: glibc 2.28+ x86-64
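The wheel file names above follow the standard wheel naming scheme: distribution, version, an optional build tag, then Python, ABI, and platform tags. The sketch below splits a file name into those parts so the right wheel can be matched to an interpreter. WheelName and parse_wheel_filename are hypothetical names for this illustration; for production use, the packaging library's packaging.utils.parse_wheel_filename is the more robust choice.

```python
from typing import NamedTuple


class WheelName(NamedTuple):
    distribution: str
    version: str
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_filename(filename: str) -> WheelName:
    """Split a wheel file name into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 6:
        # Drop the optional build tag, which sits right after the version.
        del parts[2]
    if len(parts) != 5:
        raise ValueError(f"unexpected wheel file name layout: {filename!r}")
    return WheelName(*parts)


info = parse_wheel_filename(
    "zentorch-2.12.0.0-cp313-cp313-manylinux_2_28_x86_64.whl"
)
print(info.python_tag, info.platform_tag)  # cp313 manylinux_2_28_x86_64
```

Here cp313 identifies CPython 3.13 for both the Python and ABI tags, and manylinux_2_28_x86_64 means a Linux x86-64 system with glibc 2.28 or newer.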

File details

Details for the file zentorch-2.12.0.0-cp313-cp313-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-2.12.0.0-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm    Hash digest
SHA256       c012e175c05978a26c7960bf74a8d6ba1b82db40f46ac40648b1df287935f447
MD5          a144fc3396a6b02e62337644fdaaab89
BLAKE2b-256  81cc4ebc13f61d31b52b426a12771408bd9f7924abf794a4de1c4b7e3da3e5c0

File details

Details for the file zentorch-2.12.0.0-cp312-cp312-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-2.12.0.0-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm    Hash digest
SHA256       d95c695fadb9c0e08704d8b0a115b93d2a4885f3a6c96591a854c656722a43bc
MD5          e9cd50b397e06640f1860b31dfeb9428
BLAKE2b-256  a2ac8b5cf1a79c689932ff8ca529183e64ed76443db4feb95f665ecd057ac0f9

File details

Details for the file zentorch-2.12.0.0-cp311-cp311-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-2.12.0.0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm    Hash digest
SHA256       1ed5b5e85083a9fb8dd6f73cea8cc9e616654a42cbe1248eb7f58d426ad202bc
MD5          ed1d79b68a16e480c1374117b2a916b0
BLAKE2b-256  a40d3eb82b01821d4bc993cd3c3df6096cef3e82e11be7cdfad210dfa5dc02fa

File details

Details for the file zentorch-2.12.0.0-cp310-cp310-manylinux_2_28_x86_64.whl.

File hashes

Hashes for zentorch-2.12.0.0-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm    Hash digest
SHA256       55cfbbc8c515f71acd6d85259d30d66d00d7abbc793daf8536f6d5b077ad15c6
MD5          c4ceae81052f5aa76b5701dc816baf02
BLAKE2b-256  13804157b60c9d0d5f3c02c61609ed4468ac245f66d16c5aace2652ca1967e76
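
The published digests can be used to confirm that a downloaded wheel is intact. The sketch below computes a file's SHA256 in chunks and compares it with an expected hex digest copied from the tables above. The function names sha256_of_file and matches_published_sha256 and the chunk size are illustrative choices, not part of zentorch or PyPI tooling.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_sha256(path, expected_hex):
    """Compare a file's SHA256 with a published hex digest."""
    actual = sha256_of_file(path)
    return hmac.compare_digest(actual, expected_hex.strip().lower())
```

For instance, passing the path of the downloaded cp310 wheel together with the SHA256 value listed for it should return True for an uncorrupted download. pip can enforce the same check at install time through its hash-checking mode (--require-hashes with --hash=sha256:... entries in a requirements file).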
