Skip to main content

OpenAI-compatible LLM inference simulator for xPyD

Project description

xPyD-sim

OpenAI-compatible LLM inference simulator for testing and benchmarking.

xPyD-sim simulates prefill and decode nodes with realistic latency behavior, enabling testing of xPyD-proxy and xPyD-bench without real GPU hardware.

Key Features

  • Prefill/Decode simulation — separate modes with configurable latency
  • Full OpenAI API — /v1/completions, /v1/chat/completions, /v1/embeddings, /v1/models
  • vLLM compatible — accepts all vLLM-specific parameters
  • Scheduling simulation — batch formation, decode iteration, queue depth
  • Calibration tool — fit latency curves from real hardware measurements
  • Prometheus metrics — /metrics endpoint for monitoring

Install

pip install xpyd-sim

Or as part of the full xPyD toolkit:

pip install xpyd

Quick Start

# Start dual mode (prefill + decode)
xpyd-sim --mode dual --port 8000

# Start PD disaggregated
xpyd-sim --mode prefill --port 8001
xpyd-sim --mode decode --port 8002

Part of xPyD

Component Description
xpyd-proxy PD-disaggregated proxy
xpyd-sim OpenAI-compatible inference simulator
xpyd-bench Benchmarking & planning tool

📖 Full Guide → | 💡 Examples → | 🏗️ Contributing →

License

Apache 2.0 — see LICENSE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xpyd_sim-0.5.0.tar.gz (62.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

xpyd_sim-0.5.0-py3-none-any.whl (32.6 kB view details)

Uploaded Python 3

File details

Details for the file xpyd_sim-0.5.0.tar.gz.

File metadata

  • Download URL: xpyd_sim-0.5.0.tar.gz
  • Upload date:
  • Size: 62.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for xpyd_sim-0.5.0.tar.gz
Algorithm Hash digest
SHA256 64f4f9731cc18f7d7caa96763f82b96c82f261bff02b6891100a8d3279b2ada2
MD5 3afd8b5418087a8dce24db346716935c
BLAKE2b-256 d7f62ba40fbc4da224e82474ab806544f8e21502baa3de7c075a8ccbde34f67e

See more details on using hashes here.

File details

Details for the file xpyd_sim-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: xpyd_sim-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 32.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for xpyd_sim-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 22789ad2110f59d9656f25709e044f9be80b5d3555faab28f3d80cdba1930853
MD5 89a0fe5c03e30f97ad4cb077d6138739
BLAKE2b-256 47cfdb84557c4bba23242f21777035772ea09fa6c54dae3b0dc163678fbe4aa7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page