AIBrix, the foundational building blocks for constructing your own GenAI inference infrastructure.
Project description
AI Runtime
A versatile sidecar enabling metric standardization, model downloading, and management.
Quick Start
Installation
AI Runtime can be installed by pip
.
pip install aibrix
Model download
The AI Runtime supports model downloading from the following storage backends:
- HuggingFace
- S3
- TOS
For more details on model downloading, please refer to our Runtime docs.
Integrate with inference engines
The AI Runtime hides various implementation details on the inference engine side, providing a universal method to guide model management, as well as expose inference monitoring metrics.
At present, vLLM
engine is supported, and in the future, SGLang
and other inference engines will be supported.
For more details on integrate with vLLM
, please refer to our Runtime docs.
Contributing
We welcome contributions from the community! Check out our contributing guidelines to see how you can make a difference.
Build from source
# This may take several minutes
pip install -e .
Lint, Format and Type Check
Before contribute your code, please run the following commands to ensure that your code passes the tests and linting checks.
# install dependencies
poetry install --no-root --with dev
# linting, formatting and type checking
bash ./scripts/format.sh
License
AI Runtime is licensed under the APACHE License.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file aibrix-0.1.0rc5.tar.gz
.
File metadata
- Download URL: aibrix-0.1.0rc5.tar.gz
- Upload date:
- Size: 24.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.4 CPython/3.11.10 Linux/6.5.0-1025-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8bfaabb7272c274538b2e03afcba0a9d68b05328801091936baafd0a15b80a84 |
|
MD5 | 61e4ce43187f62f4e35168d8f6da0695 |
|
BLAKE2b-256 | 264b0340c45ddf9dbb050442f927aa18955240edbc2eeb2d4cb2bab727045151 |
File details
Details for the file aibrix-0.1.0rc5-py3-none-any.whl
.
File metadata
- Download URL: aibrix-0.1.0rc5-py3-none-any.whl
- Upload date:
- Size: 45.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.4 CPython/3.11.10 Linux/6.5.0-1025-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6bf53b996e7427bc43fe4f644d8457707601b9f4aca6cbd40a37a1311a9fd1b6 |
|
MD5 | b53a33360cc31db2c522d246dc17fdb5 |
|
BLAKE2b-256 | 8645f9d2324e7eec1c34a36e506c72a66efdb5d271f3d18db9a06ee4612cb84d |