
vLLM plugin for Spyre hardware support

Project description

SenDNN Inference

| Documentation | Users Forum | #sig-spyre |


IBM Spyre is the first production-grade Artificial Intelligence Unit (AIU) accelerator to emerge from the IBM Research AIU family, part of a long-term strategy of developing novel architectures and full-stack technology solutions for the emerging space of generative AI. Spyre builds on IBM's internal AIU research and delivers a scalable, efficient architecture for accelerating AI in enterprise environments.

SenDNN Inference (sendnn-inference) is a vLLM plugin that enables seamless integration of the IBM Spyre accelerator with vLLM. It follows the architecture described in vLLM's Plugin System, making it easy to bring IBM's AI acceleration into existing vLLM workflows.

For more information, check out the following:

Getting Started

Visit our documentation.

Contributing

We welcome and value any contributions and collaborations. Please check out Contributing to SenDNN Inference for how to get involved.

Contact

You can reach out for discussion or support in the #sig-spyre channel in the vLLM Slack workspace or by opening an issue.

Project details


Download files

Download the file for your platform.

Source Distribution

sendnn_inference-2.0.0rc12.tar.gz (1.1 MB)

Uploaded Source

Built Distribution


sendnn_inference-2.0.0rc12-py3-none-any.whl (109.5 kB)

Uploaded Python 3

File details

Details for the file sendnn_inference-2.0.0rc12.tar.gz.

File metadata

  • Download URL: sendnn_inference-2.0.0rc12.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for sendnn_inference-2.0.0rc12.tar.gz
  • SHA256: 0fc05ea30d9d7b0b7e0bf76c5169100049f2cdef1e0673edc5981d2e46b73030
  • MD5: 0929485a3441eb95f77ad2be6a149544
  • BLAKE2b-256: 3462a40848bc1432dd297834313ab453338f3539146f607a5173dedbaa37cea5

See pip's hash-checking documentation for more details on using hashes.
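A quick way to check a downloaded archive against the published digest is to recompute it locally. A minimal sketch using only the Python standard library (the file name assumes the sdist was downloaded into the current directory; the expected digest is the SHA256 value from the table above):

```python
import hashlib

# Published SHA-256 for sendnn_inference-2.0.0rc12.tar.gz (from this page).
EXPECTED_SHA256 = "0fc05ea30d9d7b0b7e0bf76c5169100049f2cdef1e0673edc5981d2e46b73030"


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in 1 MiB chunks so large archives never load fully into memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        while chunk := f.read(chunk_size):
            digest.update(chunk)
    return digest.hexdigest()


# Usage (after downloading the sdist into the current directory):
#   sha256_of("sendnn_inference-2.0.0rc12.tar.gz") == EXPECTED_SHA256
```

A match confirms the bytes on disk are the bytes that were published; a mismatch means the download was corrupted or tampered with.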

Provenance

The following attestation bundles were made for sendnn_inference-2.0.0rc12.tar.gz:

Publisher: build_and_publish.yaml on torch-spyre/sendnn-inference

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file sendnn_inference-2.0.0rc12-py3-none-any.whl.

File metadata

File hashes

Hashes for sendnn_inference-2.0.0rc12-py3-none-any.whl
  • SHA256: 5fb79257120c1b793aa011f82136bc041f8da1964ed4e23d962b368d80ef1cbe
  • MD5: e31e8e9851b0cf96ced8d5366922eb28
  • BLAKE2b-256: 1d18e8fd1105aae508c349ea2dbd144a35956cb3e8f45d267eb2f2774be17a10

See pip's hash-checking documentation for more details on using hashes.
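The published digests can also be pinned in a pip requirements file so that verification happens automatically at install time. A minimal sketch (the file name `requirements.txt` is an assumption; the two hashes are the SHA256 values listed on this page, and providing both lets pip accept either the sdist or the wheel):

```text
sendnn-inference==2.0.0rc12 \
    --hash=sha256:0fc05ea30d9d7b0b7e0bf76c5169100049f2cdef1e0673edc5981d2e46b73030 \
    --hash=sha256:5fb79257120c1b793aa011f82136bc041f8da1964ed4e23d962b368d80ef1cbe
```

Installing with `pip install --require-hashes -r requirements.txt` makes pip refuse any file whose digest does not match; note that hash-checking mode requires hashes for every dependency in the file as well.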

Provenance

The following attestation bundles were made for sendnn_inference-2.0.0rc12-py3-none-any.whl:

Publisher: build_and_publish.yaml on torch-spyre/sendnn-inference

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
