Skip to main content

vLLM plugin for Spyre hardware support

Project description

Spyre Plugin for vLLM

| Documentation | Users Forum | #sig-spyre |


IBM Spyre is the first production-grade Artificial Intelligence Unit (AIU) accelerator born out of the IBM Research AIU family, and is part of a long-term strategy of developing novel architectures and full-stack technology solutions for the emerging space of generative AI. Spyre builds on the foundation of IBM’s internal AIU research and delivers a scalable, efficient architecture for accelerating AI in enterprise environments.

The vLLM Spyre plugin (vllm-spyre) is a dedicated backend extension that enables seamless integration of IBM Spyre Accelerator with vLLM. It follows the architecture described in vLLM's Plugin System, making it easy to integrate IBM's advanced AI acceleration into existing vLLM workflows.

For more information, check out the following:

Getting Started

Visit our documentation:

Contributing

We welcome and value any contributions and collaborations. Please check out Contributing to vLLM Spyre for how to get involved.

Contact

You can reach out for discussion or support in the #sig-spyre channel in the vLLM Slack workspace or by opening an issue.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vllm_spyre-0.8.1.tar.gz (781.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vllm_spyre-0.8.1-py3-none-any.whl (59.7 kB view details)

Uploaded Python 3

File details

Details for the file vllm_spyre-0.8.1.tar.gz.

File metadata

  • Download URL: vllm_spyre-0.8.1.tar.gz
  • Upload date:
  • Size: 781.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for vllm_spyre-0.8.1.tar.gz
Algorithm Hash digest
SHA256 484f0d06a07af09cce08f5a6d5d5fad70ad8effc431b0db4713c8a8d9563f51f
MD5 44fcf2d0bf64ea7419f29cfbe84edbca
BLAKE2b-256 9268e194cc674eb07cc98a78266bd12c63206eddbd0dde7d9aa7a6b8ae4b8233

See more details on using hashes here.

Provenance

The following attestation bundles were made for vllm_spyre-0.8.1.tar.gz:

Publisher: build_and_publish.yaml on vllm-project/vllm-spyre

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file vllm_spyre-0.8.1-py3-none-any.whl.

File metadata

  • Download URL: vllm_spyre-0.8.1-py3-none-any.whl
  • Upload date:
  • Size: 59.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for vllm_spyre-0.8.1-py3-none-any.whl
Algorithm Hash digest
SHA256 9bd581192317855d68d125e2f2c1b9dd58d57f1f4b690a3fdb54f35e4188e178
MD5 3e4bf47893ef62b4f43a153e9307b024
BLAKE2b-256 59d4d1ed2783d8b7ebf22fcb83bb4120e0a21dab164b7121f93888575c3bcb60

See more details on using hashes here.

Provenance

The following attestation bundles were made for vllm_spyre-0.8.1-py3-none-any.whl:

Publisher: build_and_publish.yaml on vllm-project/vllm-spyre

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page