Deploy, manage, and monitor vLLM instances across a GPU cluster from a single web dashboard.
Project description
Aquila
GPU inference cluster manager. Deploy, manage, and monitor vLLM instances across a GPU cluster from a single web dashboard.
Quick Start
pip install aquila
On each GPU node (satellite):
aquila client install
aquila client start
On the management server (host):
aquila host install
aquila host start
Then open the dashboard at http://<host-ip>:5173 to deploy and monitor models across your cluster.
Features
- Web dashboard for deploying and managing vLLM models across multiple GPU nodes
- Runs each model in the official
vllm/vllm-openaiDocker container — no per-node CUDA/PyTorch setup; the version maps to an image tag - Live GPU utilization metrics and deployment status monitoring
- Consul-based automatic node discovery
- Support for custom pip packages (via cached derived images) and vLLM plugins
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file aquila-0.3.1.tar.gz.
File metadata
- Download URL: aquila-0.3.1.tar.gz
- Upload date:
- Size: 1.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1a8b10b3c574476e61c390b8402f22e8d5f4f67957bf722fa6ec81c31c6ece97
|
|
| MD5 |
eb32f7b0c8dd5845eb3f8540c54bae38
|
|
| BLAKE2b-256 |
4d54615a4cc8cccbeec68ec2f7c6b7ba7e1c9e78b9cb318fab26f1e6bdc98b8a
|
Provenance
The following attestation bundles were made for aquila-0.3.1.tar.gz:
Publisher:
publish.yml on sisl/aquila
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
aquila-0.3.1.tar.gz -
Subject digest:
1a8b10b3c574476e61c390b8402f22e8d5f4f67957bf722fa6ec81c31c6ece97 - Sigstore transparency entry: 1931851938
- Sigstore integration time:
-
Permalink:
sisl/aquila@bd43688901147f1e07659d04df3b3e8f548fe6cc -
Branch / Tag:
refs/tags/v0.3.1 - Owner: https://github.com/sisl
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@bd43688901147f1e07659d04df3b3e8f548fe6cc -
Trigger Event:
push
-
Statement type:
File details
Details for the file aquila-0.3.1-py3-none-any.whl.
File metadata
- Download URL: aquila-0.3.1-py3-none-any.whl
- Upload date:
- Size: 259.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b4db140606283416d6f38274aa8c6465797fb071638e94ccf8b2f94d3c9a35f7
|
|
| MD5 |
bde130cb2e615fb4ab661ffc9d7d36f8
|
|
| BLAKE2b-256 |
4db75706f2a7708ef736692d5f6678ce381aae382657bd3750232b9916d0cd44
|
Provenance
The following attestation bundles were made for aquila-0.3.1-py3-none-any.whl:
Publisher:
publish.yml on sisl/aquila
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
aquila-0.3.1-py3-none-any.whl -
Subject digest:
b4db140606283416d6f38274aa8c6465797fb071638e94ccf8b2f94d3c9a35f7 - Sigstore transparency entry: 1931851976
- Sigstore integration time:
-
Permalink:
sisl/aquila@bd43688901147f1e07659d04df3b3e8f548fe6cc -
Branch / Tag:
refs/tags/v0.3.1 - Owner: https://github.com/sisl
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@bd43688901147f1e07659d04df3b3e8f548fe6cc -
Trigger Event:
push
-
Statement type: