Skip to main content

vLLM Semantic Router fleet simulator for capacity planning, SLO validation, and what-if analysis

Project description

vllm-sr-sim

vllm-sr-sim is the maintained fleet simulator for this repository. It sizes heterogeneous GPU fleets, evaluates routing strategies, and exposes a service mode that the dashboard can call across containers.

Repository-maintained docs now live in the website:

Install

cd src/fleet-sim
pip install -e .

Install the service extras when you want to run the simulator API:

pip install -e .[api]

For local development and tests:

pip install -e .[dev]

CLI

vllm-sr-sim --version

vllm-sr-sim optimize \
  --cdf data/azure_cdf.json \
  --lam 200 --slo 500 --b-short 6144 \
  --verify-top 3 --n-sim-req 30000

vllm-sr-sim whatif \
  --cdf data/azure_cdf.json \
  --lam-range 50 100 200 500 1000 \
  --slo 500 --b-short 6144

vllm-sr-sim serve --host 0.0.0.0 --port 8000

vllm-sr serve also starts vllm-sr-sim by default as a sibling container on the shared runtime network so the dashboard can proxy it without rebuilding the router image.

Layout

  • fleet_sim/: simulation engine, optimizers, routing, hardware, workload, and service package
  • run_sim.py: unified CLI entrypoint used by vllm-sr-sim
  • tests/: simulator and service test coverage
  • data/: reference workload traces used by the examples and dashboard integration
  • examples/: sample scripts and multi-pool input files

Docs

Long-form simulator docs are maintained in the repository website. Keep the package README focused on installation, CLI usage, and source layout.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vllm_sr_sim-0.1.0.dev20260318114607.tar.gz (115.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vllm_sr_sim-0.1.0.dev20260318114607-py3-none-any.whl (112.5 kB view details)

Uploaded Python 3

File details

Details for the file vllm_sr_sim-0.1.0.dev20260318114607.tar.gz.

File metadata

File hashes

Hashes for vllm_sr_sim-0.1.0.dev20260318114607.tar.gz
Algorithm Hash digest
SHA256 7e6a7568ba4128c58a09f6f9f31e371ee963c08175bcf97ce19f06029e95adfc
MD5 88ed785e0f86b214605e57a226194ea6
BLAKE2b-256 8c79222e7c9c540471ed6310246c03520b485ca616bafa2d6acefd7bab8bba4c

See more details on using hashes here.

File details

Details for the file vllm_sr_sim-0.1.0.dev20260318114607-py3-none-any.whl.

File metadata

File hashes

Hashes for vllm_sr_sim-0.1.0.dev20260318114607-py3-none-any.whl
Algorithm Hash digest
SHA256 9c88bd22f199258ea0dc4e5c5588b3f0f27d08e7104d9b66f717218f872cf6f8
MD5 4845909798d9d13d9c067cf6b6776190
BLAKE2b-256 9983b2752099d356f18ab069072eb53eab837c9025171dde2f6d0b7650731db4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page