Skip to main content

zentorch : A PyTorch* extension for AMD EPYC CPUs.

Project description

The latest ZenDNN Plugin for PyTorch* (zentorch) is here!

The ZenDNN PyTorch plugin is called zentorch. Combined with PyTorch's torch.compile, zentorch transforms deep learning pipelines into finely-tuned, AMD-specific engines, delivering unparalleled efficiency and speed for large-scale inference workloads.

This upgrade continues the focus on optimizing inference with Recommender Systems and Large Language Models on AMD EPYC™ CPUs. It includes AMD EPYC™ enhancements for bfloat16 performance, expanded support for cutting-edge models like Llama 3.2 and 3.3, Microsoft Phi, and more as well as support for a wide-variety of quantization configurations. The quantization support included 4-bit weight-only quantization, along with support for INT8 dynamic activation and INT8 weight quantization, and quantized support for the DLRM-v2 model with a mix of 8-bit and 4-bit quantization. This also includes support for running generative models with vLLM.

Under the hood, ZenDNN’s enhanced AMD-specific optimizations operate at every level. In addition to highly optimized operator microkernels, these include comprehensive graph optimizations including pattern identification, graph reordering, and fusions. They also incorporate optimized embedding bag kernels and enhanced zenMatMul matrix splitting strategies which leverage the AMD EPYC™ microarchitecture to deliver enhanced throughput and latency.

The vLLM-ZenTorch plugin extends these benefits to the vLLM inference engine, enabling plug-and-play acceleration of large language model inference on AMD EPYC™ CPUs. By integrating ZenTorch with vLLM, users can achieve significant throughput improvements for LLM workloads with zero code changes.

The zentorch plugin seamlessly works with PyTorch 2.11.0+cpu, offering a high-performance experience for deep learning on AMD EPYC™ platforms.

In addition to stable (GA) releases, the zentorch plugin provides weekly releases via the zentorch-weekly package on PyPI.

Support

We welcome feedback, suggestions, and bug reports. Should you have any of the these, please kindly file an issue on the ZenDNN Plugin for PyTorch GitHub page here

License

AMD copyrighted code in ZenDNN is subject to the Apache-2.0, MIT, or BSD-3-Clause licenses; consult the source code file headers for the applicable license. Third party copyrighted code in ZenDNN is subject to the licenses set forth in the source code file headers of such code.

Build Information

Field Value
Source Tag v2.12.0.1_rc2
Tag Commit ece56c8fd6723fd23d1bc4ae186ff61d5aa958c8
Build Commit ece56c8fd6723fd23d1bc4ae186ff61d5aa958c8
PyTorch Version 2.11.0+cpu
Release Type ga

Built from v2.12.0.1_rc2

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

zentorch-2.11.0.1-cp313-cp313-manylinux_2_28_x86_64.whl (25.3 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

zentorch-2.11.0.1-cp312-cp312-manylinux_2_28_x86_64.whl (25.3 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

zentorch-2.11.0.1-cp311-cp311-manylinux_2_28_x86_64.whl (25.3 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

zentorch-2.11.0.1-cp310-cp310-manylinux_2_28_x86_64.whl (25.3 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file zentorch-2.11.0.1-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for zentorch-2.11.0.1-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 296e01ddb2a75a668e7ed71dc6efa8ff1d7ea2367cb196ceeca901796b2d2cf5
MD5 0bb3f87e456b627d249aa45d94fd3c44
BLAKE2b-256 60aac955a0087bd18aa17742e363fdac92ce4ae8390235422a2a63fb9abf5207

See more details on using hashes here.

File details

Details for the file zentorch-2.11.0.1-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for zentorch-2.11.0.1-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 47e4cbaf0f91cad1a1b6dfc6c5def8f32165b2d9b6d075b3081e9b8bfede3a7f
MD5 1a8dd1a778bfcb592e19dc19561e1376
BLAKE2b-256 56b3b9dbda9e07025f0cbe4c26155cf8b4a8f5676d476ef8b8ba9877540efe42

See more details on using hashes here.

File details

Details for the file zentorch-2.11.0.1-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for zentorch-2.11.0.1-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 a4a3294fc19121a2985561308d40ec79a402e63ac3298ad0d9227a17426ce678
MD5 d1dcfb7786173d4a8ec3fcc0a633379a
BLAKE2b-256 eff99b584a10b34cb45f672d2370845ec63c8f79fe2004d147557618a95c4e3d

See more details on using hashes here.

File details

Details for the file zentorch-2.11.0.1-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for zentorch-2.11.0.1-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 e256a2d5b98bb276e250502884a37bcfbb1272e0bc8361d43522c7bef4c7454e
MD5 13ee46a097f9fa79cf437261d25fedeb
BLAKE2b-256 5298add6b14b362575efe4aedabb716225b099fcd27017fb93038e6a9c75d4e1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page