No project description provided
Project description
Baseten benchmarks
How to install
pip install baseten_benchmarks
How to run
To hit a local OpenAI server running on post 10001
baseten_benchmark --backend generic \
--api_url http://localhost:10001/v1/chat/completions \
--api_key this_does_not_matter \
--model deepseek \
--num_prompts 1 2 4 8 16 \
--concurrency 1 2 4 8 16 \
--random_input 1024 \
--output_len 1024 \
--input_type custom \
--stream \
--tokenizer deepseek-ai/DeepSeek-R1 \
--output_file latency.csv \
--warmup_requests 2 \
--prompt_style messages
For now input_type custom uses a fixed text file.
How to publish
poetry config pypi-token.pypi [your pypi token here]
poetry publish --build
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
baseten_benchmarks-0.3.0.tar.gz
(178.6 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file baseten_benchmarks-0.3.0.tar.gz.
File metadata
- Download URL: baseten_benchmarks-0.3.0.tar.gz
- Upload date:
- Size: 178.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.10.12 Linux/6.5.0-45-generic
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9be55debe8c5e090b211d63b68279f834dde218a0900c6194575178f4c656b3f
|
|
| MD5 |
15abe6e6339380b7c71a16e9df993e9e
|
|
| BLAKE2b-256 |
6a408c1b8843783aff216ad7b554e4def745252d229d1c91d4111f39b67682ef
|
File details
Details for the file baseten_benchmarks-0.3.0-py3-none-any.whl.
File metadata
- Download URL: baseten_benchmarks-0.3.0-py3-none-any.whl
- Upload date:
- Size: 182.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.10.12 Linux/6.5.0-45-generic
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
76ea76ac16ded46c289fa684a1795672d8be5da8cfe12ae203f80d79bc935e83
|
|
| MD5 |
cf7a4a777c306f7d25912327d50fbdd5
|
|
| BLAKE2b-256 |
e16f8b39043436a4fad601c7f5b27ec9ee13c0316d38074a2c5084b16a7570d2
|