LLM Benchmarking tool for OLLAMA

These details have not been verified by PyPI

Project links

Homepage

Project description

llm-benchmark (ollama-benchmark)

LLM Benchmark for Throughput via Ollama (Local LLMs)

Installation Steps

pip install llm-bench

Usage for general users directly

llm_bench run

ollama installation with the following models installed

7B model can be run on machines with 8GB of RAM

13B model can be run on machines with 16GB of RAM

Usage explaination

On Windows, Linux, and macOS, it will detect memory RAM size to first download required LLM models.

When memory RAM size is greater than or equal to 4GB, but less than 7GB, it will check if gemma:2b exist. The program implicitly pull the model.

ollama pull gemma:2b

When memory RAM size is greater than 7GB, but less than 15GB, it will check if these models exist. The program implicitly pull these models

ollama pull gemma:2b
ollama pull gemma:7b
ollama pull mistral:7b
ollama pull llama2:7b
ollama pull llava:7b

When memory RAM siz is greater than 15GB, it will check if these models exist. The program implicitly pull these models

ollama pull gemma:2b
ollama pull gemma:7b
ollama pull mistral:7b
ollama pull llama2:7b
ollama pull llama2:13b
ollama pull llava:7b
ollama pull llava:13b

Python Poetry manually(advanced) installation

https://python-poetry.org/docs/#installing-manually

For developers to develop new features on Windows Powershell or on Ubuntu Linux or macOS

python3 -m venv .venv
. ./.venv/bin/activate
pip install -U pip setuptools
pip install poetry

Usage in Python virtual environment

poetry shell
poetry install
llm_benchmark hello jason

Example #1 send systeminfo and benchmark results to a remote server

llm_bench run

Example #2 Do not send systeminfo and benchmark results to a remote server

llm_bench run --no-sendinfo

Example #3 Benchmark run on explicitly given the path to the ollama executable (When you built your own developer version of ollama)

llm_bench run --ollamabin=~/code/ollama/ollama

Reference

Ollama

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.4.32

Jul 23, 2024

0.4.31

Jul 23, 2024

0.4.30

Jul 12, 2024

0.4.29

Jul 9, 2024

This version

0.4.28

Jul 7, 2024

0.4.27

Jul 1, 2024

0.4.26

Jun 28, 2024

0.4.25

Jun 18, 2024

0.4.24

Jun 18, 2024

0.4.23

May 28, 2024

0.4.22

May 28, 2024

0.4.21

May 21, 2024

0.4.20

May 21, 2024

0.4.19

May 15, 2024

0.4.18

May 15, 2024

0.4.17

May 15, 2024

0.4.16

May 14, 2024

0.4.15

May 12, 2024

0.4.14

May 12, 2024

0.4.13

May 12, 2024

0.4.12

May 12, 2024

0.4.11

May 12, 2024

0.4.10

May 12, 2024

0.4.9

May 12, 2024

0.4.8

May 12, 2024

0.4.7

May 12, 2024

0.4.6

May 12, 2024

0.4.5

May 12, 2024

0.4.4

May 12, 2024

0.4.3

May 11, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_bench-0.4.28.tar.gz (12.9 kB view details)

Uploaded Jul 7, 2024 Source

Built Distribution

llm_bench-0.4.28-py3-none-any.whl (19.0 kB view details)

Uploaded Jul 7, 2024 Python 3

File details

Details for the file llm_bench-0.4.28.tar.gz.

File metadata

Download URL: llm_bench-0.4.28.tar.gz
Upload date: Jul 7, 2024
Size: 12.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.10.8

File hashes

Hashes for llm_bench-0.4.28.tar.gz
Algorithm	Hash digest
SHA256	`d7a89038362e78d2535422e99299c9e4b03b17226ba97c1451063b399ea43cd7`
MD5	`c007cd00128e8e6e862c7fb3619c8f71`
BLAKE2b-256	`c3dccfcea1ac8d788214257cce4f826b68e12fb0fac9d31c1b1b26e66399862a`

See more details on using hashes here.

File details

Details for the file llm_bench-0.4.28-py3-none-any.whl.

File metadata

Download URL: llm_bench-0.4.28-py3-none-any.whl
Upload date: Jul 7, 2024
Size: 19.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.10.8

File hashes

Hashes for llm_bench-0.4.28-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2c1bc67786554fda13e619b7649478cdd3e1b45d6a5db060e204209549e52f5a`
MD5	`f630d41e0d306bc36f262b018484a14f`
BLAKE2b-256	`f93a2a92af953b1e66920fdfafda4009648d0e81bbef8c841ea477d845efc03f`