vLLM Semantic Router fleet simulator for capacity planning, SLO validation, and what-if analysis
vllm-sr-sim
vllm-sr-sim is the maintained fleet simulator for this repository. It sizes heterogeneous GPU fleets, evaluates routing strategies, and exposes a service mode that the dashboard can call across containers.
Repository-maintained docs now live on the website:
- https://vllm-semantic-router.com/docs/fleet-sim/overview
- https://vllm-semantic-router.com/docs/fleet-sim/getting-started
- https://vllm-semantic-router.com/docs/fleet-sim/use-cases
Install
cd src/fleet-sim
pip install -e .
Install the service extras when you want to run the simulator API:
pip install -e ".[api]"
For local development and tests:
pip install -e ".[dev]"
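To confirm which build ended up in the active environment after installing, a minimal sketch using the standard library's `importlib.metadata` may help. It assumes the distribution is registered under the name `vllm-sr-sim` (matching the CLI name); the helper name `installed_version` is illustrative and not part of the package.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str = "vllm-sr-sim") -> Optional[str]:
    """Return the installed version of dist_name, or None if it is not installed.

    The default distribution name is an assumption based on the CLI name.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    print(found if found else "vllm-sr-sim is not installed in this environment")
```

With an editable install, the reported version comes from the package metadata generated at install time, so re-running the install refreshes it.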
CLI
vllm-sr-sim --version
vllm-sr-sim optimize \
--cdf data/azure_cdf.json \
--lam 200 --slo 500 --b-short 6144 \
--verify-top 3 --n-sim-req 30000
vllm-sr-sim whatif \
--cdf data/azure_cdf.json \
--lam-range 50 100 200 500 1000 \
--slo 500 --b-short 6144
vllm-sr-sim serve --host 0.0.0.0 --port 8000
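When the what-if sweep is driven from a script rather than typed by hand, the command line can be assembled programmatically. The sketch below uses only the `whatif` flags shown above (`--cdf`, `--lam-range`, `--slo`, `--b-short`); the helper name `whatif_argv` and its parameter names are hypothetical, and no additional flags are assumed.

```python
import shlex
from typing import List, Sequence


def whatif_argv(cdf_path: str, lam_values: Sequence[int], slo: int, b_short: int) -> List[str]:
    """Build the argument vector for `vllm-sr-sim whatif` from the documented flags only."""
    if not lam_values:
        raise ValueError("at least one --lam-range value is required")
    return [
        "vllm-sr-sim", "whatif",
        "--cdf", cdf_path,
        "--lam-range", *[str(v) for v in lam_values],
        "--slo", str(slo),
        "--b-short", str(b_short),
    ]


if __name__ == "__main__":
    argv = whatif_argv("data/azure_cdf.json", [50, 100, 200, 500, 1000], slo=500, b_short=6144)
    # Print the shell-quoted command for inspection.
    print(shlex.join(argv))
    # To actually run it: subprocess.run(argv, check=True)
```

Passing the list directly to `subprocess.run` avoids shell quoting issues, which matters if the CDF path contains spaces.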
By default, vllm-sr serve also starts vllm-sr-sim as a sibling container on the shared runtime network, so the dashboard can proxy it without rebuilding the router image.
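Scripts that start the simulator service and then call it usually need to wait until the port accepts connections. The sketch below checks plain TCP reachability only; it deliberately assumes nothing about the service's HTTP routes, which are not described here. The function name `wait_for_port` is illustrative, and port 8000 is taken from the `serve` example above.

```python
import socket
import time


def wait_for_port(host: str, port: int, timeout_s: float = 30.0, interval_s: float = 0.5) -> bool:
    """Return True once a TCP connection to host:port succeeds, False on timeout."""
    deadline = time.monotonic() + timeout_s
    while True:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=interval_s):
                return True
        except OSError:
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval_s)


if __name__ == "__main__":
    # Assumes the service was started locally with --port 8000.
    ready = wait_for_port("127.0.0.1", 8000, timeout_s=10.0)
    print("simulator port reachable" if ready else "simulator port not reachable")
```

A successful connection shows only that a listener exists on the port, not that the simulator has finished initializing.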
Layout
- fleet_sim/: simulation engine, optimizers, routing, hardware, workload, and service package
- run_sim.py: unified CLI entrypoint used by vllm-sr-sim
- tests/: simulator and service test coverage
- data/: reference workload traces used by the examples and dashboard integration
- examples/: sample scripts and multi-pool input files
Docs
Long-form simulator docs are maintained on the repository website. Keep the package README focused on installation, CLI usage, and source layout.
File details
Details for the file vllm_sr_sim-0.1.0.dev20260318114607.tar.gz.
File metadata
- Download URL: vllm_sr_sim-0.1.0.dev20260318114607.tar.gz
- Upload date:
- Size: 115.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.15
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 7e6a7568ba4128c58a09f6f9f31e371ee963c08175bcf97ce19f06029e95adfc |
| MD5 | 88ed785e0f86b214605e57a226194ea6 |
| BLAKE2b-256 | 8c79222e7c9c540471ed6310246c03520b485ca616bafa2d6acefd7bab8bba4c |
File details
Details for the file vllm_sr_sim-0.1.0.dev20260318114607-py3-none-any.whl.
File metadata
- Download URL: vllm_sr_sim-0.1.0.dev20260318114607-py3-none-any.whl
- Upload date:
- Size: 112.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.15
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 9c88bd22f199258ea0dc4e5c5588b3f0f27d08e7104d9b66f717218f872cf6f8 |
| MD5 | 4845909798d9d13d9c067cf6b6776190 |
| BLAKE2b-256 | 9983b2752099d356f18ab069072eb53eab837c9025171dde2f6d0b7650731db4 |