Skip to main content

No project description provided

Project description

👀 See it in action

benchllama

Usage:

$ benchllama [OPTIONS] COMMAND [ARGS]...

Options:

  • --install-completion: Install completion for the current shell.
  • --show-completion: Show completion for the current shell, to copy it or customize the installation.
  • --help: Show this message and exit.

Commands:

  • clean
  • evaluate

benchllama clean

Usage:

$ benchllama clean [OPTIONS]

Options:

  • --run-id TEXT: Run id
  • --output PATH: Output directory [default: /tmp]
  • --help: Show this message and exit.

benchllama evaluate

Usage:

$ benchllama evaluate [OPTIONS]

Options:

  • --models TEXT: Names of models that need to be evaluated. [required]
  • --provider-url TEXT: The endpoint of the model provider. [default: http://localhost:11434]
  • --dataset FILE: By default, bigcode/humanevalpack from Hugging Face will be used. If you want to use your own dataset, specify the path here.
  • --languages [python|js|java|go|cpp]: List of languages to evaluate from bigcode/humanevalpack. Ignore this if you are brining your own data [default: Language.python]
  • --num-completions INTEGER: Number of completions to be generated for each task. [default: 3]
  • --k INTEGER: The k for calculating pass@k. The values shouldn't exceed num_completions [default: 1, 2]
  • --samples INTEGER: Number of dataset samples to evaluate. By default, all the samples get processed. [default: -1]
  • --output PATH: Output directory [default: /tmp]
  • --help: Show this message and exit.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

benchllama-0.2.0.tar.gz (9.6 kB view details)

Uploaded Source

Built Distribution

benchllama-0.2.0-py3-none-any.whl (15.3 kB view details)

Uploaded Python 3

File details

Details for the file benchllama-0.2.0.tar.gz.

File metadata

  • Download URL: benchllama-0.2.0.tar.gz
  • Upload date:
  • Size: 9.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.1 CPython/3.10.11 Darwin/23.1.0

File hashes

Hashes for benchllama-0.2.0.tar.gz
Algorithm Hash digest
SHA256 eef78f74c4c353ca96f36822e46a97adc8c4725053ebbd7c63cd16cf03936a4b
MD5 36e58039a7505b2d6be4afd2873438ea
BLAKE2b-256 8bb85340f047986a579a47c4b99ac71b309916dedd57ca392e435a1dafc248f6

See more details on using hashes here.

File details

Details for the file benchllama-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: benchllama-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 15.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.1 CPython/3.10.11 Darwin/23.1.0

File hashes

Hashes for benchllama-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e675e01d16dd47fe124cc4bcf8ff607d9d1989c2e0d643419c78092f29744fe5
MD5 1dcac8271c6d1feaebef405e3e039350
BLAKE2b-256 fb71bc7ed67272be3cb4294ba057dac60d9cb2a4f09264bbe5c09659bfd30892

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page