LLM benchmarking tools for the LLM CLI

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

LLM Benchmarking Plugin

This is a plugin for the llm tool that adds a benchmark command to compare the performance of different language models.

The commands runs a prompt with optional system prompt for several models and compares the performance between models.

Installation

You can install the plugin using pip:

pip install llm-profile

or using llm

llm install llm-profile

Metrics

Total time - The time taken from the request to the end of the final chunk
Time to First Chunk - The time taken from the request to the first chunk of the response
Length of Response - The length of the response text
Number of Chunks - The number of chunks in the response
Chunks per Second - The number of chunks divided by the total time taken

Benchmark Usage

To run a benchmark, provide the prompt along with any number of models using the llm alias (from llm models):

$ llm benchmark -m azure/ant-grok-3-mini -m azure/ants-gpt-4.1-mini -s "Respond in emoji" "Give me a friendly hello message" --markdown

For a single pass (no repeats) you will get a summary table:

Benchmark	Total Time	Time to First Chunk	Length of Response	Number of Chunks	Chunks per Second
azure/ant-grok-3-mini	7.79	7.76	112	30	3.85
azure/ants-gpt-4.1-mini	2.99	2.80	78	19	6.36

To repeat each benchmark and get an average of times, use the --repeat argument:

Benchmark	Total Time	Time to First Chunk	Length of Response	Number of Chunks	Chunks per Second
azure/ant-grok-3-mini	2.59 <-> 8.39 (x̄=5.49)	2.57 <-> 8.36 (x̄=5.47)	65 <-> 109 (x̄=87.00)	18 <-> 30 (x̄=24.00)	2.15 <-> 11.58 (x̄=6.86)
azure/ants-gpt-4.1-mini	0.54 <-> 2.88 (x̄=1.71)	0.26 <-> 2.69 (x̄=1.47)	76 <-> 78 (x̄=77.00)	19 <-> 19 (x̄=19.00)	6.60 <-> 35.17 (x̄=20.89)

The printout is a range (min <-> max (x̄=mean))

Providing options

You can provide key/value options for all models using the --option flag. This can be useful for setting parameters like temperature, max tokens, etc.

Example:

$ llm benchmark -m gpt-4.1-mini -m gpt-4.1-nano --option temperature 0.7 --option max_tokens 100 "Give me a friendly hello message"

This feature is also helpful for setting the seed option for reproducibility and isolating variances in time to first chunk and time to completion with the same prompt and result.

Markdown formatted results

By default, tables are printed with color showing the fastest and slowest metric in a benchmark:

benchmark screenshot

If you want to customize the output, you can use the --markdown flag to get the results in a Markdown-friendly format.

Non-Streaming models

If you want to benchmark models that do not support streaming, you can use the --no-stream flag. This will disable streaming and provide a single response time.

Graphs

The benchmark tool can produce a PNG graph like this:

benchmark graph

To get a graph, add the --graph file.png with the path to the results graph file. You will need to install matplotlib to generate the graph.

$ pip install matplotlib

matplotlib isn't installed by default to keep the dependencies for this plugin smaller.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

anthonypjshaw

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.6.0

Aug 27, 2025

0.5.0

Aug 23, 2025

This version

0.4.0

Aug 21, 2025

0.3.0

Aug 20, 2025

0.2.0

Aug 20, 2025

0.1.1

Aug 20, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_profile-0.4.0.tar.gz (7.3 kB view details)

Uploaded Aug 21, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llm_profile-0.4.0-py3-none-any.whl (7.6 kB view details)

Uploaded Aug 21, 2025 Python 3

File details

Details for the file llm_profile-0.4.0.tar.gz.

File metadata

Download URL: llm_profile-0.4.0.tar.gz
Upload date: Aug 21, 2025
Size: 7.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for llm_profile-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`2883ed66caca0e07ad33c800b1f9013482c1f9abc63dc857c480ca6fd0fed691`
MD5	`ab335db1b6728b6498205fc33e01fc33`
BLAKE2b-256	`1a1d655c653637421332b17860ad4d0690a868521277331681acea1d1063ce05`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llm_profile-0.4.0.tar.gz:

Publisher: python-publish.yml on tonybaloney/llm-profile

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llm_profile-0.4.0.tar.gz
- Subject digest: 2883ed66caca0e07ad33c800b1f9013482c1f9abc63dc857c480ca6fd0fed691
- Sigstore transparency entry: 415845071
- Sigstore integration time: Aug 21, 2025
Source repository:
- Permalink: tonybaloney/llm-profile@7f9bb7a023e315569fa5d2e686af15e83d5ab793
- Branch / Tag: refs/tags/0.4.0
- Owner: https://github.com/tonybaloney
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@7f9bb7a023e315569fa5d2e686af15e83d5ab793
- Trigger Event: release

File details

Details for the file llm_profile-0.4.0-py3-none-any.whl.

File metadata

Download URL: llm_profile-0.4.0-py3-none-any.whl
Upload date: Aug 21, 2025
Size: 7.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for llm_profile-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8822bdf73582f8b6c37dd2ed58f5ecf40bdddb547456b7258a195ceee7f9393f`
MD5	`0c4bbe3d4e90469db4a517edc1281c19`
BLAKE2b-256	`605212630ce44be096c4b73dfcbac45d29ba2fe2ddab0644163c637836fa5c3b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llm_profile-0.4.0-py3-none-any.whl:

Publisher: python-publish.yml on tonybaloney/llm-profile

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llm_profile-0.4.0-py3-none-any.whl
- Subject digest: 8822bdf73582f8b6c37dd2ed58f5ecf40bdddb547456b7258a195ceee7f9393f
- Sigstore transparency entry: 415845107
- Sigstore integration time: Aug 21, 2025
Source repository:
- Permalink: tonybaloney/llm-profile@7f9bb7a023e315569fa5d2e686af15e83d5ab793
- Branch / Tag: refs/tags/0.4.0
- Owner: https://github.com/tonybaloney
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@7f9bb7a023e315569fa5d2e686af15e83d5ab793
- Trigger Event: release

llm-profile 0.4.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

LLM Benchmarking Plugin

Installation

Metrics

Benchmark Usage

Providing options

Markdown formatted results

Non-Streaming models

Graphs

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance