Skip to main content

Benchmark your local LLMs.

Project description

🧮 Benchllama

An open-source tool to benchmark you local LLMs.

License: MIT PyPI Supported Versions GitHub: Downloads Discord

🚀 Installation

$ pip install benchllama

⚙️ Usage

Usage:

$ benchllama [OPTIONS] COMMAND [ARGS]...

Options:

  • --install-completion: Install completion for the current shell.
  • --show-completion: Show completion for the current shell, to copy it or customize the installation.
  • --help: Show this message and exit.

Commands:

  • clean
  • evaluate

benchllama clean

Usage:

$ benchllama clean [OPTIONS]

Options:

  • --run-id TEXT: Run id
  • --output PATH: Output directory [default: /tmp]
  • --help: Show this message and exit.

benchllama evaluate

Usage:

$ benchllama evaluate [OPTIONS]

Options:

  • --models TEXT: Names of models that need to be evaluated. [required]
  • --provider-url TEXT: The endpoint of the model provider. [default: http://localhost:11434]
  • --dataset FILE: By default, bigcode/humanevalpack from Hugging Face will be used. If you want to use your own dataset, specify the path here.
  • --languages [python|js|java|go|cpp]: List of languages to evaluate from bigcode/humanevalpack. Ignore this if you are brining your own data [default: Language.python]
  • --num-completions INTEGER: Number of completions to be generated for each task. [default: 3]
  • --k INTEGER: The k for calculating pass@k. The values shouldn't exceed num_completions [default: 1, 2]
  • --samples INTEGER: Number of dataset samples to evaluate. By default, all the samples get processed. [default: -1]
  • --output PATH: Output directory [default: /tmp]
  • --help: Show this message and exit.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

benchllama-0.2.2.tar.gz (10.9 kB view details)

Uploaded Source

Built Distribution

benchllama-0.2.2-py3-none-any.whl (16.2 kB view details)

Uploaded Python 3

File details

Details for the file benchllama-0.2.2.tar.gz.

File metadata

  • Download URL: benchllama-0.2.2.tar.gz
  • Upload date:
  • Size: 10.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.1 CPython/3.10.11 Darwin/23.1.0

File hashes

Hashes for benchllama-0.2.2.tar.gz
Algorithm Hash digest
SHA256 9ea9db283bab4be5fda0dcb2811db425ac8f081647d2a96404db23e4af8c837f
MD5 c84dfe925dcdfed6b51f13556c539a0c
BLAKE2b-256 ad68aeeacd52d2ca57b71dddff3012b632a14440e95643d247456d3ba4c3c501

See more details on using hashes here.

File details

Details for the file benchllama-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: benchllama-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 16.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.1 CPython/3.10.11 Darwin/23.1.0

File hashes

Hashes for benchllama-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 26718f5c01d182640e40133cdfcd1892d9172acff673aaac4cf16a61fba20fa3
MD5 47d1a14a29a09d8769505d09bbe2891e
BLAKE2b-256 a292696886bf1fb13c2aee27645d4daf31d49d283498caac06074108e1e83eab

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page