vLLM plugin for RBLN NPU
Project description
vLLM RBLN Plugin
This repository provides the hardware plugin that enables vLLM on RBLN NPUs, including ATOM and REBEL.
Built on top of vLLM’s Plugin System, it allows seamless integration with the vLLM ecosystem and provides high-throughput, low-latency LLM serving on RBLN hardware. Our plugin supports a wide range of popular LLMs and continues to expand to support all features enabled in vLLM, including advanced attention mechanisms.
🚀 Getting Started
📋 Prerequisites
rebel-compileroptimum-rbln
⚙️ Installation
You can install this project using pip or from source.
Install via PyPI
pip install vllm-rbln --extra-index-url https://wheels.vllm.ai/0.13.0/cpu --extra-index-url https://download.pytorch.org/whl/cpu
Or from source
Using uv
git clone https://github.com/rbln-sw/vllm-rbln.git
cd vllm-rbln
uv pip install -e .
Using pip
git clone https://github.com/rbln-sw/vllm-rbln.git
cd vllm-rbln
pip install -e . --extra-index-url https://wheels.vllm.ai/0.13.0/cpu --extra-index-url https://download.pytorch.org/whl/cpu
📚 Documentation
🤝 Contributing
We welcome all contributions! Whether it's reporting issues, proposing enhancements, or improving docs—your input helps make the project better.
See our CONTRIBUTING.md for more information.
📄 License
This project is licensed under the Apache License 2.0.
See the LICENSE file for more information.
📧 Contact
- Join discussions and get answers in our Developer Community
- Contact maintainers at support@rebellions.ai
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vllm_rbln-0.10.2a1-py3-none-any.whl.
File metadata
- Download URL: vllm_rbln-0.10.2a1-py3-none-any.whl
- Upload date:
- Size: 270.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5441b56eb0720131b7e7bbcc1d587ceee01192cad871baf759d36af106242206
|
|
| MD5 |
7994e2e3860252422a61bdaff8bc0ab0
|
|
| BLAKE2b-256 |
22ed17d352b3bbe80c97909f031e3af9c8f0ce1c2de92a444f884cd76c1eea0d
|