Optimum RBLN is the interface between the HuggingFace Transformers and Diffusers libraries and RBLN accelerators. It provides a set of tools enabling easy model loading and inference on single and multiple rbln device settings for different downstream tasks.
Project description
Optimum RBLN
🤗 Optimum RBLN provides an interface between HuggingFace libraries (Transformers, Diffusers) and RBLN NPUs, including ATOM and REBEL.
This library enables seamless integration between the HuggingFace ecosystem and RBLN NPUs through a comprehensive toolkit for model loading and inference across single and multi-NPU environments. While we maintain a list of officially validated models and tasks, users can easily adapt other models and tasks with minimal modifications.
Key Features
🚀 High Performance Inference
- Optimized model execution on RBLN NPUs through RBLN SDK compilation
- Support for both single and multi-NPU inference
- Integrated with RBLN Runtime for optimal performance
🔧 Easy Integration
- Seamless compatibility with HuggingFace Model Hub
- Drop-in replacement for existing HuggingFace pipelines
- Minimal code changes required for NPU acceleration
Seamless Replacement for Existing HuggingFace Code
- from diffusers import StableDiffusionXLPipeline
+ from optimum.rbln import RBLNStableDiffusionXLPipeline
# Load model
model_id = "stabilityai/stable-diffusion-xl-base-1.0"
prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
- pipe = StableDiffusionXLPipeline.from_pretrained(model_id)
+ pipe = RBLNStableDiffusionXLPipeline.from_pretrained(model_id, export=True)
# Generate image
image = pipe(prompt).images[0]
# Save image result
image.save("image.png")
+ # (Optional) Save compiled artifacts to skip the compilation step in future runs
+ pipe.save_pretrained("compiled_sdxl")
Documentation
Check out the documentation of Optimum RBLN for more advanced usage.
Getting Started
Note: The
rebel-compilerlibrary, which is required for runningoptimum-rbln, is only available for approved users. Please refer to the installation guide for instructions on accessing and installingrebel-compiler.
Install from PyPI
To install the latest release of this package:
pip install optimum-rbln
# CPU-only installation (recommended if you don't plan to use CUDA-enabled PyTorch)
pip install optimum-rbln --extra-index-url https://download.pytorch.org/whl/cpu
Install from source
Prerequisites
The below command installs optimum-rbln along with its dependencies.
git clone https://github.com/rebellions-sw/optimum-rbln.git
cd optimum-rbln
./scripts/uv-sync.sh
Need Help?
- Join discussions and get answers in our Developer Community
- Contact maintainers at support@rebellions.ai
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file optimum_rbln-0.7.5rc0.tar.gz.
File metadata
- Download URL: optimum_rbln-0.7.5rc0.tar.gz
- Upload date:
- Size: 321.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.10.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d034134ee03e59bbd2e861f068d1fef125432663328487f49c4c4b2ab8e68721
|
|
| MD5 |
cff6119c239adad66ffb86715adc9a14
|
|
| BLAKE2b-256 |
47c9666f272d008a5fddf99498c1909a5dddae11b5c583c883e5fc76a1efbc31
|
File details
Details for the file optimum_rbln-0.7.5rc0-py3-none-any.whl.
File metadata
- Download URL: optimum_rbln-0.7.5rc0-py3-none-any.whl
- Upload date:
- Size: 324.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.10.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d576dbb64a21bcf639b405ec5dd52a84c7cdb406ff6f015d6d8c2cf213497221
|
|
| MD5 |
f899a0424059351863040e4201f3e205
|
|
| BLAKE2b-256 |
39afc9d2bbe39ec74707d9755c5920406479ffaa61116a6b0ca9fb9097045790
|