A powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.
Project description
Introduction
Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.
It provides detailed insights into model serving performance, offering both a user-friendly CLI and a live UI for real-time progress monitoring.
Features
- 🛠️ CLI Tool: Validates user inputs and initiates benchmarks seamlessly.
- 📊 Live UI Dashboard: Displays current progress, logs, and real-time metrics.
- 📝 Rich Logs: Automatically flushed to both terminal and file upon experiment completion.
- 📈 Experiment Analyzer: Generates comprehensive Excel reports with pricing and raw metrics data, plus flexible plot configurations (default 2x4 grid) that visualize key performance metrics including throughput, latency (TTFT, E2E, TPOT), error rates, and RPS across different traffic scenarios and concurrency levels. Supports custom plot layouts and multi-line comparisons.
Installation
Quick Start: Install with pip install genai-bench.
Alternatively, check Installation Guide for other options.
How to use
Quick Start
-
Run a benchmark against your model:
genai-bench benchmark --api-backend "your-backend" \ --api-base "http://localhost:8080" \ --api-key "your-api-key" \ --api-model-name "your-model" \ --task text-to-text \ --max-time-per-run 5 \ --max-requests-per-run 100
-
Generate Excel reports from your results:
genai-bench excel --experiment-folder ./experiments/your_experiment \ --excel-name results --metric-percentile mean
-
Create visualizations:
genai-bench plot --experiments-folder ./experiments \ --group-key traffic_scenario --preset 2x4_default
Next Steps
If you're new to GenAI Bench, check out the Getting Started page.
For detailed instructions, advanced configuration options, and comprehensive examples, check out the User Guide.
Development
If you are interested in contributing to GenAI-Bench, you can use the Development Guide.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file genai_bench-0.0.3.tar.gz.
File metadata
- Download URL: genai_bench-0.0.3.tar.gz
- Upload date:
- Size: 112.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.5.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ea8857f8aa044241693c9c0b4d5a89093ad94c523294ef431b7397fab7e8bcf5
|
|
| MD5 |
a0886abb40df20983b7b3dfd22a2aeaf
|
|
| BLAKE2b-256 |
ffa32600a5710049c45d91336b33a20a6ecd86ed49194040fc6bb98392d4efa8
|
File details
Details for the file genai_bench-0.0.3-py3-none-any.whl.
File metadata
- Download URL: genai_bench-0.0.3-py3-none-any.whl
- Upload date:
- Size: 165.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.5.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cd5c231284f4027e87aa7d0dc68b257956c0a655349c4480ff69f44ee90a5391
|
|
| MD5 |
1699c8b212bf29b4b029e9ff382ad51b
|
|
| BLAKE2b-256 |
df32af9794c6a2a2003debd11c86b929088d1928d60dbc5ffbdbe1e2981e85c9
|