Skip to main content

vLLM Semantic Router fleet simulator for capacity planning, SLO validation, and what-if analysis

Project description

vllm-sr-sim

vllm-sr-sim is the maintained fleet simulator for this repository. It sizes heterogeneous GPU fleets, evaluates routing strategies, and exposes a service mode that the dashboard can call across containers.

Repository-maintained docs now live in the website:

Install

cd src/fleet-sim
pip install -e .

Install the service extras when you want to run the simulator API:

pip install -e .[api]

For local development and tests:

pip install -e .[dev]

CLI

vllm-sr-sim --version

vllm-sr-sim optimize \
  --cdf data/azure_cdf.json \
  --lam 200 --slo 500 --b-short 6144 \
  --verify-top 3 --n-sim-req 30000

vllm-sr-sim whatif \
  --cdf data/azure_cdf.json \
  --lam-range 50 100 200 500 1000 \
  --slo 500 --b-short 6144

vllm-sr-sim serve --host 0.0.0.0 --port 8000

vllm-sr serve also starts vllm-sr-sim by default as a sibling container on the shared runtime network so the dashboard can proxy it without rebuilding the router image.

Layout

  • fleet_sim/: simulation engine, optimizers, routing, hardware, workload, and service package
  • run_sim.py: unified CLI entrypoint used by vllm-sr-sim
  • tests/: simulator and service test coverage
  • data/: reference workload traces used by the examples and dashboard integration
  • examples/: sample scripts and multi-pool input files

Docs

Long-form simulator docs are maintained in the repository website. Keep the package README focused on installation, CLI usage, and source layout.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vllm_sr_sim-0.1.0.dev20260407022258.tar.gz (115.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vllm_sr_sim-0.1.0.dev20260407022258-py3-none-any.whl (115.0 kB view details)

Uploaded Python 3

File details

Details for the file vllm_sr_sim-0.1.0.dev20260407022258.tar.gz.

File metadata

File hashes

Hashes for vllm_sr_sim-0.1.0.dev20260407022258.tar.gz
Algorithm Hash digest
SHA256 6b731ca59df9aeaa0e1611f1b0b642711254e0cd158f989219793170790b235e
MD5 bb00266ba26fe6b477b1abca558be061
BLAKE2b-256 535beaa224bcbf56f242672ce2591b1a4e98ec1c61e0d6a9fbec705087b96b59

See more details on using hashes here.

File details

Details for the file vllm_sr_sim-0.1.0.dev20260407022258-py3-none-any.whl.

File metadata

File hashes

Hashes for vllm_sr_sim-0.1.0.dev20260407022258-py3-none-any.whl
Algorithm Hash digest
SHA256 46020a65936a6edee684b5e2a08fe7a65177638da0b7773cea45c3cef3dc2a55
MD5 bea9f08eaea46793e0dc015ad3e2c3d9
BLAKE2b-256 2c2a0af81c9913154f5703b342d1a850786c260d7da60fd888c349fe8abfb014

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page