No project description provided
Project description
Baseten benchmarks
How to install
pip install baseten_benchmarks
How to run
To hit a local OpenAI server running on post 10001
baseten_benchmark --backend generic \
--api_url http://localhost:10001/v1/chat/completions \
--api_key this_does_not_matter \
--model deepseek \
--num_prompts 1 2 4 8 16 \
--concurrency 1 2 4 8 16 \
--random_input 1024 \
--output_len 1024 \
--input_type custom \
--stream \
--tokenizer deepseek-ai/DeepSeek-R1 \
--output_file latency.csv \
--warmup_requests 2 \
--prompt_style messages
For now input_type custom uses a fixed text file.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
baseten_benchmarks-0.1.0.tar.gz
(177.5 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file baseten_benchmarks-0.1.0.tar.gz.
File metadata
- Download URL: baseten_benchmarks-0.1.0.tar.gz
- Upload date:
- Size: 177.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.10.12 Linux/6.5.0-45-generic
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
611842a33c28b7f6e87336c49f9e1414567e30f0884bfb0d968c85136f9dbf2a
|
|
| MD5 |
35e1c7b47925c351d13711a7e3d45c18
|
|
| BLAKE2b-256 |
2da36038b42185e8ac61bef511743f8501b62b471cdeeedde1afaab71a7df27c
|
File details
Details for the file baseten_benchmarks-0.1.0-py3-none-any.whl.
File metadata
- Download URL: baseten_benchmarks-0.1.0-py3-none-any.whl
- Upload date:
- Size: 178.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.10.12 Linux/6.5.0-45-generic
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7e3065b3a5d4bff92ec4dfd0f32e2a82d7964294fe58f5f43d549c9c47d0821b
|
|
| MD5 |
4ad372005ca7d04cd8e8d8be4785058e
|
|
| BLAKE2b-256 |
f28c26f8fd130ef648a62aaa6cb77197aa959d8033efdbccd08ced004f621000
|