Skip to main content

No project description provided

Project description

CommonEval

Code for staging LLM evaluation benchmarks in a variety of standard formats for common evaluation.

The focus of this library is reading and writing benchmark data, but it includes one example benchmark dataset in data/eng for illustration purposes. Please do not use these files for fine-tuning, since that compromises their ability to measure LLM performance fairly.

CommonEval © 2025 by Biblica, Inc is licensed under CC BY 4.0.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

commoneval-0.2.0.tar.gz (10.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

commoneval-0.2.0-py3-none-any.whl (12.3 kB view details)

Uploaded Python 3

File details

Details for the file commoneval-0.2.0.tar.gz.

File metadata

  • Download URL: commoneval-0.2.0.tar.gz
  • Upload date:
  • Size: 10.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.13.7 Darwin/25.0.0

File hashes

Hashes for commoneval-0.2.0.tar.gz
Algorithm Hash digest
SHA256 38f775245eeb14deb5cd3504d71316f13486bebfd57ddc20ba095901e0668c33
MD5 39308e7d100f6613dfc513f53b17c40d
BLAKE2b-256 74fc1ce8d3e2c2121f9ccb95266efb0b5aa1ef59b86eb7f27f025fe2c0121071

See more details on using hashes here.

File details

Details for the file commoneval-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: commoneval-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 12.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.13.7 Darwin/25.0.0

File hashes

Hashes for commoneval-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 768b99bb0d26d28de30f5e791cacf9eebae80a50f422e3aaea6725b9382390a1
MD5 a0255d9109c4a10d74b10ca381a6dab9
BLAKE2b-256 e98e9db69f5fac66265440be59545854b7ecd2417b06c2dc69e17ab0cb00e7bd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page