Skip to main content

Lightweight Prefill-Decode proxy for disaggregated LLM serving

Project description

xPyD-proxy

Lightweight Prefill-Decode disaggregated proxy for LLM serving.

xPyD-proxy routes inference requests between prefill and decode nodes, enabling PD-disaggregated LLM serving with load balancing, health monitoring, and fault tolerance.

Key Features

  • PD disaggregation — separate prefill and decode nodes for optimal resource utilization
  • Multiple scheduling policies — round-robin, consistent hash, cache-aware, power-of-two
  • Resilience — circuit breaker, health monitoring, automatic failover
  • Multi-model routing — serve multiple models through a single proxy
  • OpenAI-compatible API — drop-in replacement for vLLM/OpenAI endpoints
  • YAML configuration — declarative topology and settings

Install

pip install xpyd-proxy

Or as part of the full xPyD toolkit:

pip install xpyd

Quick Start

# Start with YAML config
xpyd proxy --config proxy.yaml

# Or with CLI args
xpyd proxy --model my-model \
  --prefill 127.0.0.1:8001 \
  --decode 127.0.0.1:8002

Part of xPyD

Component Description
xpyd-proxy PD-disaggregated proxy
xpyd-sim OpenAI-compatible inference simulator
xpyd-bench Benchmarking & planning tool

📖 Full Guide → | 💡 Examples → | 🏗️ Contributing →

License

Apache 2.0 — see LICENSE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xpyd_proxy-1.3.0.tar.gz (51.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

xpyd_proxy-1.3.0-py3-none-any.whl (62.2 kB view details)

Uploaded Python 3

File details

Details for the file xpyd_proxy-1.3.0.tar.gz.

File metadata

  • Download URL: xpyd_proxy-1.3.0.tar.gz
  • Upload date:
  • Size: 51.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for xpyd_proxy-1.3.0.tar.gz
Algorithm Hash digest
SHA256 c4de39babe18268ee1476ae3af1a571f2170236d5f01a9082e4a84098b0da5d7
MD5 2974cc50a693a1b0b0cd5a14c838b0b6
BLAKE2b-256 a5604a766f8c21ea59b6d24551bae1fcb48a319224591da93c858ac12f71116f

See more details on using hashes here.

Provenance

The following attestation bundles were made for xpyd_proxy-1.3.0.tar.gz:

Publisher: release.yml on xPyD-hub/xPyD-proxy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file xpyd_proxy-1.3.0-py3-none-any.whl.

File metadata

  • Download URL: xpyd_proxy-1.3.0-py3-none-any.whl
  • Upload date:
  • Size: 62.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for xpyd_proxy-1.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c7d62200fabc8e9d549c6bcd58948e4be86086cd8bdc9a4f43b55bb2eb4cce07
MD5 c890a7bd0e9fee9747d8a3d0b50397a3
BLAKE2b-256 446600094f8f0c81fa1e4a560e3b5bbcf802372f21c581f056efed9e5792e874

See more details on using hashes here.

Provenance

The following attestation bundles were made for xpyd_proxy-1.3.0-py3-none-any.whl:

Publisher: release.yml on xPyD-hub/xPyD-proxy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page