Lightweight Prefill-Decode proxy for disaggregated LLM serving
Project description
xPyD-proxy
Lightweight Prefill-Decode disaggregated proxy for LLM serving.
xPyD-proxy routes inference requests between prefill and decode nodes, enabling PD-disaggregated LLM serving with load balancing, health monitoring, and fault tolerance.
Key Features
- PD disaggregation — separate prefill and decode nodes for optimal resource utilization
- Multiple scheduling policies — round-robin, consistent hash, cache-aware, power-of-two
- Resilience — circuit breaker, health monitoring, automatic failover
- Multi-model routing — serve multiple models through a single proxy
- OpenAI-compatible API — drop-in replacement for vLLM/OpenAI endpoints
- YAML configuration — declarative topology and settings
Install
pip install xpyd-proxy
Or as part of the full xPyD toolkit:
pip install xpyd
Quick Start
# Start with YAML config
xpyd proxy --config proxy.yaml
# Or with CLI args
xpyd proxy --model my-model \
--prefill 127.0.0.1:8001 \
--decode 127.0.0.1:8002
Part of xPyD
| Component | Description |
|---|---|
| xpyd-proxy | PD-disaggregated proxy |
| xpyd-sim | OpenAI-compatible inference simulator |
| xpyd-bench | Benchmarking & planning tool |
📖 Full Guide → | 💡 Examples → | 🏗️ Contributing →
License
Apache 2.0 — see LICENSE
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file xpyd_proxy-1.3.0.tar.gz.
File metadata
- Download URL: xpyd_proxy-1.3.0.tar.gz
- Upload date:
- Size: 51.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c4de39babe18268ee1476ae3af1a571f2170236d5f01a9082e4a84098b0da5d7
|
|
| MD5 |
2974cc50a693a1b0b0cd5a14c838b0b6
|
|
| BLAKE2b-256 |
a5604a766f8c21ea59b6d24551bae1fcb48a319224591da93c858ac12f71116f
|
Provenance
The following attestation bundles were made for xpyd_proxy-1.3.0.tar.gz:
Publisher:
release.yml on xPyD-hub/xPyD-proxy
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
xpyd_proxy-1.3.0.tar.gz -
Subject digest:
c4de39babe18268ee1476ae3af1a571f2170236d5f01a9082e4a84098b0da5d7 - Sigstore transparency entry: 1243646625
- Sigstore integration time:
-
Permalink:
xPyD-hub/xPyD-proxy@213f2a472debe60120acd4cfa0fbf7956ed36e86 -
Branch / Tag:
refs/tags/v1.3.0 - Owner: https://github.com/xPyD-hub
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@213f2a472debe60120acd4cfa0fbf7956ed36e86 -
Trigger Event:
push
-
Statement type:
File details
Details for the file xpyd_proxy-1.3.0-py3-none-any.whl.
File metadata
- Download URL: xpyd_proxy-1.3.0-py3-none-any.whl
- Upload date:
- Size: 62.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c7d62200fabc8e9d549c6bcd58948e4be86086cd8bdc9a4f43b55bb2eb4cce07
|
|
| MD5 |
c890a7bd0e9fee9747d8a3d0b50397a3
|
|
| BLAKE2b-256 |
446600094f8f0c81fa1e4a560e3b5bbcf802372f21c581f056efed9e5792e874
|
Provenance
The following attestation bundles were made for xpyd_proxy-1.3.0-py3-none-any.whl:
Publisher:
release.yml on xPyD-hub/xPyD-proxy
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
xpyd_proxy-1.3.0-py3-none-any.whl -
Subject digest:
c7d62200fabc8e9d549c6bcd58948e4be86086cd8bdc9a4f43b55bb2eb4cce07 - Sigstore transparency entry: 1243646630
- Sigstore integration time:
-
Permalink:
xPyD-hub/xPyD-proxy@213f2a472debe60120acd4cfa0fbf7956ed36e86 -
Branch / Tag:
refs/tags/v1.3.0 - Owner: https://github.com/xPyD-hub
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@213f2a472debe60120acd4cfa0fbf7956ed36e86 -
Trigger Event:
push
-
Statement type: