Benchmarking & PD ratio planning tool for xPyD proxy
Project description
xPyD-bench
Benchmarking & PD ratio planning tool for xPyD-proxy.
Features
xpyd-bench— Benchmark xPyD proxy with configurable concurrency, request patterns, and both/v1/completionsand/v1/chat/completionsendpoints
For PD ratio planning, see xPyD-plan.
Install
pip install xpyd-bench
Quick Start
Benchmark
# Run benchmark against a running xPyD proxy
xpyd-bench --target http://localhost:8080 \
--endpoint chat \
--concurrency 16 \
--num-requests 200 \
--output results.json
# Use completion endpoint
xpyd-bench --target http://localhost:8080 \
--endpoint completion \
--concurrency 8 \
--num-requests 100
Configuration
See examples/ for sample configs and scenarios.
Output Metrics
- TTFT — Time to first token
- TPS — Tokens per second (per request & aggregate)
- Latency — P50 / P90 / P99 end-to-end latency
- Throughput — Total requests/sec and tokens/sec
- Error rate — Failed requests count and percentage
License
TBD
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
xpyd_bench-0.2.0.tar.gz
(331.1 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
xpyd_bench-0.2.0-py3-none-any.whl
(425.9 kB
view details)
File details
Details for the file xpyd_bench-0.2.0.tar.gz.
File metadata
- Download URL: xpyd_bench-0.2.0.tar.gz
- Upload date:
- Size: 331.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
50327387812ec963ab4e77259471b8655bf6b81ae776bf8869cb332e8b80394e
|
|
| MD5 |
8312d8379757bb460cca80608d4f6724
|
|
| BLAKE2b-256 |
83570dad52aa0e9d58704cc7314ee0efc9379a0eb09e01dd819d48578f3c1ec0
|
File details
Details for the file xpyd_bench-0.2.0-py3-none-any.whl.
File metadata
- Download URL: xpyd_bench-0.2.0-py3-none-any.whl
- Upload date:
- Size: 425.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8dc72973cf0b92d1894d42fab553c5abb5e97caa06711e9d16dfb4c293a11c59
|
|
| MD5 |
4bf337d8606d7f7d7ec080e8b9eac37f
|
|
| BLAKE2b-256 |
df011f5f0ec533af893181f079d2eef23979195f1808144668855039a528d40f
|