vLLM plugin for Spyre hardware support
Project description
SenDNN Inference
| Documentation | Users Forum | #sig-spyre |
IBM Spyre is the first production-grade Artificial Intelligence Unit (AIU) accelerator born out of the IBM Research AIU family, and is part of a long-term strategy of developing novel architectures and full-stack technology solutions for the emerging space of generative AI. Spyre builds on the foundation of IBM's internal AIU research and delivers a scalable, efficient architecture for accelerating AI in enterprise environments.
SenDNN Inference (sendnn-inference) is a vLLM plugin that enables seamless integration of IBM Spyre Accelerator with vLLM. It follows the architecture described in vLLM's Plugin System, making it easy to integrate IBM's advanced AI acceleration into existing vLLM workflows.
For more information, check out the following:
- 📚 Meet the IBM Artificial Intelligence Unit
- 📽️ AI Accelerators: Transforming Scalability & Model Efficiency
- 🚀 Spyre Accelerator for IBM Z
- 🚀 Spyre Accelerator for IBM POWER
Getting Started
Visit our documentation:
Contributing
We welcome and value any contributions and collaborations. Please check out Contributing to SenDNN Inference for how to get involved.
Contact
You can reach out for discussion or support in the #sig-spyre channel in the vLLM Slack workspace or by opening an issue.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sendnn_inference-2.0.0rc12.tar.gz.
File metadata
- Download URL: sendnn_inference-2.0.0rc12.tar.gz
- Upload date:
- Size: 1.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0fc05ea30d9d7b0b7e0bf76c5169100049f2cdef1e0673edc5981d2e46b73030
|
|
| MD5 |
0929485a3441eb95f77ad2be6a149544
|
|
| BLAKE2b-256 |
3462a40848bc1432dd297834313ab453338f3539146f607a5173dedbaa37cea5
|
Provenance
The following attestation bundles were made for sendnn_inference-2.0.0rc12.tar.gz:
Publisher:
build_and_publish.yaml on torch-spyre/sendnn-inference
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
sendnn_inference-2.0.0rc12.tar.gz -
Subject digest:
0fc05ea30d9d7b0b7e0bf76c5169100049f2cdef1e0673edc5981d2e46b73030 - Sigstore transparency entry: 1375413782
- Sigstore integration time:
-
Permalink:
torch-spyre/sendnn-inference@5ede22854839df4c47ef61df8a092feb55a78010 -
Branch / Tag:
refs/tags/v2.0.0-rc.12 - Owner: https://github.com/torch-spyre
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build_and_publish.yaml@5ede22854839df4c47ef61df8a092feb55a78010 -
Trigger Event:
release
-
Statement type:
File details
Details for the file sendnn_inference-2.0.0rc12-py3-none-any.whl.
File metadata
- Download URL: sendnn_inference-2.0.0rc12-py3-none-any.whl
- Upload date:
- Size: 109.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5fb79257120c1b793aa011f82136bc041f8da1964ed4e23d962b368d80ef1cbe
|
|
| MD5 |
e31e8e9851b0cf96ced8d5366922eb28
|
|
| BLAKE2b-256 |
1d18e8fd1105aae508c349ea2dbd144a35956cb3e8f45d267eb2f2774be17a10
|
Provenance
The following attestation bundles were made for sendnn_inference-2.0.0rc12-py3-none-any.whl:
Publisher:
build_and_publish.yaml on torch-spyre/sendnn-inference
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
sendnn_inference-2.0.0rc12-py3-none-any.whl -
Subject digest:
5fb79257120c1b793aa011f82136bc041f8da1964ed4e23d962b368d80ef1cbe - Sigstore transparency entry: 1375413887
- Sigstore integration time:
-
Permalink:
torch-spyre/sendnn-inference@5ede22854839df4c47ef61df8a092feb55a78010 -
Branch / Tag:
refs/tags/v2.0.0-rc.12 - Owner: https://github.com/torch-spyre
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build_and_publish.yaml@5ede22854839df4c47ef61df8a092feb55a78010 -
Trigger Event:
release
-
Statement type: