
localbench

localbench Banner

🚧 EARLY DEVELOPMENT WARNING 🚧

This tool is currently about as stable as a house of cards in a wind tunnel.
Very early alpha. Bugs aren't just expected - they've signed a lease.

Status: Proceed with optimism ☕

A benchmarking tool for Local LLMs. Currently keeping an eye on Cortex.cpp but with plans to judge other frameworks equally in the future.

What is this?

localbench measures performance metrics, resource utilization, and stability characteristics of your LLM deployments. Rather comprehensive, really.

Features

  • Model initialization metrics
  • Runtime performance
  • Resource utilization
  • Advanced processing scenarios
  • Workload-specific benchmarks
  • System integration metrics
  • Stability analysis

Installation

Using uv:

uv tool install localbench

Using pip:

pip install localbench

Usage

Basic Benchmarking

# Standard benchmark
localbench "llama3.2:3b-gguf-q2-k"

# With detailed metrics
localbench "llama3.2:3b-gguf-q2-k" --verbose

Specific Benchmarks

# Initialization only
localbench "llama3.2:3b-gguf-q2-k" --type init

# Runtime metrics
localbench "llama3.2:3b-gguf-q2-k" --type runtime

# Long-running stability test
localbench "llama3.2:3b-gguf-q2-k" --type stability --stability-duration 24

Advanced Usage

# Custom benchmark prompts
localbench "llama3.2:3b-gguf-q2-k" --type workload --prompts my_prompts.json
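
The schema of the prompts file isn't documented here, so the sketch below assumes the simplest plausible shape, a plain JSON list of prompt strings; adjust it once you've checked the format localbench actually expects.

```python
# Guessed schema for --prompts: a plain JSON list of prompt strings.
import json

prompts = [
    "Summarize the plot of Hamlet in two sentences.",
    "Write a Python function that reverses a string.",
    "Explain the trade-off between quantization and output quality.",
]

with open("my_prompts.json", "w") as f:
    json.dump(prompts, f, indent=2)

print(f"wrote {len(prompts)} prompts to my_prompts.json")
```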

# Multi-model benchmarking
localbench "llama3.2:3b-gguf-q2-k" --type advanced \
    --secondary-models "tinyllama:1b-gguf-q4" "phi2:3b-gguf-q4"

# Export results
localbench "llama3.2:3b-gguf-q2-k" --json results.json
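
The structure of results.json is likewise undocumented; the sketch below writes a stand-in file with made-up top-level keys, then shows how to list whatever sections a real run actually records.

```python
# Stand-in results file; the real keys depend on which benchmark type you ran.
import json

sample = {"model": "llama3.2:3b-gguf-q2-k", "init": {}, "runtime": {}}
with open("results.json", "w") as f:
    json.dump(sample, f)

# Inspect the top-level sections the run recorded.
with open("results.json") as f:
    results = json.load(f)
print(sorted(results))  # → ['init', 'model', 'runtime']
```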

Status

Under active development. Support for additional frameworks is planned.

Roadmap

  • Framework-agnostic benchmarking
  • Additional performance metrics
  • Enhanced visualizations
  • Extended stability testing
  • Local server

Development

Setup

  1. Clone the repository:

git clone https://github.com/username/localbench.git
cd localbench

  2. Create and activate a virtual environment:

# Using uv (recommended)
uv venv .venv --python 3.12
source .venv/bin/activate

  3. Install development dependencies:

# Install project in editable mode with test dependencies
uv pip install -e ".[test]"

# Install development tools
uv add --dev ruff pytest pytest-cov pytest-asyncio hypothesis

Code Quality

Linting and Formatting

Run Ruff linter:

# Check code
ruff check .

# Auto-fix issues
ruff check --fix .

# Format code
ruff format .

# Check formatting without changes
ruff format --check .

Testing

Run tests:

# All tests
pytest

# With coverage
pytest --cov=localbench --cov-report=html

# Specific test file
pytest src/tests/test_utils.py

# With hypothesis verbose output
pytest -v src/tests/test_utils.py

Pre-commit Checks

Before submitting a PR:

# Format code
ruff format .

# Run linter
ruff check .

# Run tests with coverage
pytest --cov=localbench --cov-report=term-missing

# Show coverage report in browser (optional)
python -m http.server -d htmlcov

Code Style

The project uses:

  • Type hints
  • Some docstrings for public functions and classes
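
As an illustration of that style (this function is hypothetical, not taken from the localbench source):

```python
def tokens_per_second(token_count: int, elapsed_seconds: float) -> float:
    """Return the average token throughput for a benchmark run."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return token_count / elapsed_seconds


print(tokens_per_second(512, 4.0))  # → 128.0
```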

Project Structure

src/
├── localbench/
│   ├── core/
│   │   ├── initialization.py  # Model initialization metrics
│   │   ├── runtime.py         # Runtime performance metrics
│   │   ├── resources.py       # Resource utilization metrics
│   │   ├── integration.py     # System integration metrics
│   │   ├── workloads.py       # Workload-specific metrics
│   │   ├── stability.py       # Stability metrics
│   │   └── utils.py           # Shared utilities
│   ├── cli.py                 # Command-line interface
│   └── __init__.py
└── tests/
    ├── conftest.py            # Shared test fixtures
    ├── test_initialization.py
    ├── test_runtime.py
    ├── test_resources.py
    ├── test_integration.py
    └── test_utils.py

PR Checklist

Before submitting a PR:

  1. Run all tests
  2. Check test coverage
  3. Verify type hints with mypy (coming soon)
  4. Ensure docstrings are up to date

Contributing

Issues and pull requests welcome. Do have a look at the existing ones first, though.
