zentorch : A PyTorch* extension for AMD EPYC CPUs.
Project description
The latest ZenDNN Plugin for PyTorch* (zentorch) 5.2.0 is here!
The ZenDNN PyTorch plugin is called zentorch. Combined with PyTorch's torch.compile, zentorch transforms deep learning pipelines into finely-tuned, AMD-specific engines, delivering unparalleled efficiency and speed for large-scale inference workloads.
This upgrade continues the focus on optimizing inference with Recommender Systems and Large Language Models on AMD EPYC™ CPUs. ZenDNN includes AMD EPYC™ enhancements for bfloat16 performance, expanded support for cutting-edge models like Llama 3.1 and 3.2, Microsoft Phi, and more as well as support for INT4 quantized datatype. This includes the advanced Activation-Aware Weight Quantization (AWQ) algorithm for LLMs and quantized support for the DLRM-v2 model with int8 weights.
Under the hood, ZenDNN’s enhanced AMD-specific optimizations operate at every level. In addition to highly optimized operator microkernels, these include comprehensive graph optimizations including pattern identification, graph reordering, and fusions. They also incorporate optimized embedding bag kernels and enhanced zenMatMul matrix splitting strategies which leverage the AMD EPYC™ microarchitecture to deliver enhanced throughput and latency.
The vLLM-ZenTorch plugin extends these benefits to the vLLM inference engine, enabling plug-and-play acceleration of large language model inference on AMD EPYC™ CPUs. By integrating ZenTorch with vLLM, users can achieve significant throughput improvements for LLM workloads with zero code changes.
The zentorch 5.2.0 plugs seamlessly with PyTorch versions including 2.10.0 and 2.9.1, offering a high-performance experience for deep learning on AMD EPYC™ platforms.
Support
We welcome feedback, suggestions, and bug reports. Should you have any of the these, please kindly file an issue on the ZenDNN Plugin for PyTorch Github page here
License
AMD copyrighted code in ZenDNN is subject to the Apache-2.0, MIT, or BSD-3-Clause licenses; consult the source code file headers for the applicable license. Third party copyrighted code in ZenDNN is subject to the licenses set forth in the source code file headers of such code.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file zentorch-5.2.0-cp313-cp313-manylinux_2_28_x86_64.whl.
File metadata
- Download URL: zentorch-5.2.0-cp313-cp313-manylinux_2_28_x86_64.whl
- Upload date:
- Size: 25.5 MB
- Tags: CPython 3.13, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0d647fb6d3ab08d912628a2b848bdf6596e5519fa591fc1a939a2e4431b891e3
|
|
| MD5 |
aea604d248ee9bd10614322940a2dcd4
|
|
| BLAKE2b-256 |
0eab726eadc22a521e82c8b69e8f6ca79a40299b23243df2e74000e2ac5e7c17
|
File details
Details for the file zentorch-5.2.0-cp312-cp312-manylinux_2_28_x86_64.whl.
File metadata
- Download URL: zentorch-5.2.0-cp312-cp312-manylinux_2_28_x86_64.whl
- Upload date:
- Size: 25.5 MB
- Tags: CPython 3.12, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3cd714cc8589bc642165e7fc34c5917c0c1d52f713c7633508d51d3db001e5c8
|
|
| MD5 |
ebfff58cd56e59572fde4365b8e42f9e
|
|
| BLAKE2b-256 |
b2ff8036af9b59e4e62aa29ea78a470558fa4ed73c79442c5cebee32be929d94
|
File details
Details for the file zentorch-5.2.0-cp311-cp311-manylinux_2_28_x86_64.whl.
File metadata
- Download URL: zentorch-5.2.0-cp311-cp311-manylinux_2_28_x86_64.whl
- Upload date:
- Size: 25.5 MB
- Tags: CPython 3.11, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
339c5f13c5660deb2eee3a1aa68cf677af0c3de890a0f3969350fb2094c60b99
|
|
| MD5 |
1ebb8c6cc68d5ee66971c2fd8e6cbfd1
|
|
| BLAKE2b-256 |
8bc358a71a7b0ba89edb94cdcde3bbc92282d0a8057856af9ea9737d5f1918e2
|
File details
Details for the file zentorch-5.2.0-cp310-cp310-manylinux_2_28_x86_64.whl.
File metadata
- Download URL: zentorch-5.2.0-cp310-cp310-manylinux_2_28_x86_64.whl
- Upload date:
- Size: 25.5 MB
- Tags: CPython 3.10, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
90cb918bfceb32ef9ddb4dc3288f97c0bb0a353eec109060824d44fa048aae98
|
|
| MD5 |
282be64148ac469411227fb8fd25ab3f
|
|
| BLAKE2b-256 |
b0bbdff6c472e05a2d4c5839e121c4e1fc6ad83c947e7d45a99d2a11694849e2
|