Benchmark your local LLMs.

Benchllama

An open-source tool to benchmark your local LLMs.

License: MIT

🚀 Installation

$ pip install benchllama

⚙️ Usage

Usage:

$ benchllama [OPTIONS] COMMAND [ARGS]...

Options:

  • --install-completion: Install completion for the current shell.
  • --show-completion: Show completion for the current shell, to copy it or customize the installation.
  • --help: Show this message and exit.

Commands:

  • clean
  • evaluate

benchllama clean

Usage:

$ benchllama clean [OPTIONS]

Options:

  • --run-id TEXT: Run id
  • --output PATH: Output directory [default: /tmp]
  • --help: Show this message and exit.

benchllama evaluate

Usage:

$ benchllama evaluate [OPTIONS]

Options:

  • --models TEXT: Names of models that need to be evaluated. [required]
  • --provider-url TEXT: The endpoint of the model provider. [default: http://localhost:11434]
  • --dataset FILE: By default, bigcode/humanevalpack from Hugging Face will be used. If you want to use your own dataset, specify the path here.
  • --languages [python|js|java|go|cpp]: List of languages to evaluate from bigcode/humanevalpack. Ignore this if you are bringing your own dataset. [default: Language.python]
  • --num-completions INTEGER: Number of completions to be generated for each task. [default: 3]
  • --k INTEGER: The k values for calculating pass@k. No value should exceed --num-completions. [default: 1, 2]
  • --samples INTEGER: Number of dataset samples to evaluate. By default, all the samples get processed. [default: -1]
  • --output PATH: Output directory [default: /tmp]
  • --help: Show this message and exit.
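
The --num-completions and --k options interact through the pass@k metric. As a rough illustration, the sketch below computes the widely used unbiased pass@k estimator, 1 - C(n - c, k) / C(n, k), where n is the number of completions generated for a task and c is the number that pass the tests. The function name pass_at_k and the sample counts are hypothetical, and whether benchllama computes the metric in exactly this way is an assumption; the sketch only shows why k cannot exceed the number of completions.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k as 1 - C(n - c, k) / C(n, k).

    n: completions generated for one task (--num-completions)
    c: completions that passed the task's tests
    k: the k in pass@k (--k)
    """
    if not 0 <= c <= n:
        raise ValueError("c must be between 0 and n")
    if not 1 <= k <= n:
        # Mirrors the rule that k should not exceed the number of completions.
        raise ValueError("k must be between 1 and n")
    # If fewer than k completions failed, every subset of size k contains a pass.
    if n - c < k:
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


# Defaults from the option list: 3 completions per task, k in (1, 2).
# Hypothetical outcome: 1 of the 3 completions passes.
for k in (1, 2):
    print(f"pass@{k} = {pass_at_k(n=3, c=1, k=k):.4f}")
# pass@1 = 0.3333
# pass@2 = 0.6667
```

A per-task value like this would typically be averaged over all evaluated tasks to give the reported score for a model.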
