Skip to main content

Deploy, manage, and monitor vLLM instances across a GPU cluster from a single web dashboard.

Project description

Aquila

GPU inference cluster manager. Deploy, manage, and monitor vLLM instances across a GPU cluster from a single web dashboard.

Quick Start

pip install aquila

On each GPU node (satellite):

aquila client install
aquila client start

On the management server (host):

aquila host install
aquila host start

Then open the dashboard at http://<host-ip>:5173 to deploy and monitor models across your cluster.

Features

  • Web dashboard for deploying and managing vLLM models across multiple GPU nodes
  • Runs each model in the official vllm/vllm-openai Docker container — no per-node CUDA/PyTorch setup; the version maps to an image tag
  • Live GPU utilization metrics and deployment status monitoring
  • Consul-based automatic node discovery
  • Support for custom pip packages (via cached derived images) and vLLM plugins

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aquila-0.3.0.tar.gz (1.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aquila-0.3.0-py3-none-any.whl (257.8 kB view details)

Uploaded Python 3

File details

Details for the file aquila-0.3.0.tar.gz.

File metadata

  • Download URL: aquila-0.3.0.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aquila-0.3.0.tar.gz
Algorithm Hash digest
SHA256 a9dfdded7da4fa37dc120942cf1d46d4590192b0b8176b18565f0e581cef3da5
MD5 f759c715b6f8b0e009189b422bc119a3
BLAKE2b-256 36a870545beae9d7bc6ea2fd7201696eef8aaeef9f291a29b769a2af2c6952a2

See more details on using hashes here.

Provenance

The following attestation bundles were made for aquila-0.3.0.tar.gz:

Publisher: publish.yml on sisl/aquila

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file aquila-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: aquila-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 257.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aquila-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d95335b68eef601e2f7508733cd702120cf65a7a24bf31a49d7ac9b6dc496c62
MD5 20bab99661e79745617e2f9b25b9c71a
BLAKE2b-256 baa1d0b8750f43ec339acf781d9727ddd24c83f62784b530c8e6b1dc2af0bdc0

See more details on using hashes here.

Provenance

The following attestation bundles were made for aquila-0.3.0-py3-none-any.whl:

Publisher: publish.yml on sisl/aquila

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page