Skip to main content

No project description provided

Project description

Baseten benchmarks

How to install

pip install baseten_benchmarks

How to run

To hit a local OpenAI server running on post 10001

baseten_benchmark --backend generic \
  --api_url http://localhost:10001/v1/chat/completions \
  --api_key this_does_not_matter \
  --model deepseek \
  --num_prompts 1 2 4 8 16 \
  --concurrency 1 2 4 8 16 \
  --random_input 1024 \
  --output_len 1024 \
  --input_type custom \
  --stream \
  --tokenizer deepseek-ai/DeepSeek-R1 \
  --output_file latency.csv \
  --warmup_requests 2 \
  --prompt_style messages

For now input_type custom uses a fixed text file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

baseten_benchmarks-0.1.0.tar.gz (177.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

baseten_benchmarks-0.1.0-py3-none-any.whl (178.8 kB view details)

Uploaded Python 3

File details

Details for the file baseten_benchmarks-0.1.0.tar.gz.

File metadata

  • Download URL: baseten_benchmarks-0.1.0.tar.gz
  • Upload date:
  • Size: 177.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.1 CPython/3.10.12 Linux/6.5.0-45-generic

File hashes

Hashes for baseten_benchmarks-0.1.0.tar.gz
Algorithm Hash digest
SHA256 611842a33c28b7f6e87336c49f9e1414567e30f0884bfb0d968c85136f9dbf2a
MD5 35e1c7b47925c351d13711a7e3d45c18
BLAKE2b-256 2da36038b42185e8ac61bef511743f8501b62b471cdeeedde1afaab71a7df27c

See more details on using hashes here.

File details

Details for the file baseten_benchmarks-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: baseten_benchmarks-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 178.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.1 CPython/3.10.12 Linux/6.5.0-45-generic

File hashes

Hashes for baseten_benchmarks-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7e3065b3a5d4bff92ec4dfd0f32e2a82d7964294fe58f5f43d549c9c47d0821b
MD5 4ad372005ca7d04cd8e8d8be4785058e
BLAKE2b-256 f28c26f8fd130ef648a62aaa6cb77197aa959d8033efdbccd08ced004f621000

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page