Skip to main content

No project description provided

Project description

CommonEval

Code for staging LLM evaluation benchmarks in a variety of standard formats for common evaluation.

The focus of this library is reading and writing benchmark data, but it includes one example benchmark dataset in data/eng for illustration purposes. Please do not use these files for fine-tuning, since that compromises their ability to measure LLM performance fairly.

CommonEval © 2025 by Biblica, Inc is licensed under CC BY 4.0.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

commoneval-0.2.1.tar.gz (10.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

commoneval-0.2.1-py3-none-any.whl (12.9 kB view details)

Uploaded Python 3

File details

Details for the file commoneval-0.2.1.tar.gz.

File metadata

  • Download URL: commoneval-0.2.1.tar.gz
  • Upload date:
  • Size: 10.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.13.7 Darwin/25.0.0

File hashes

Hashes for commoneval-0.2.1.tar.gz
Algorithm Hash digest
SHA256 1bc3f0a417366276a632b8695f4b0dab0c65bb41406a2db1791f0efae6ef1c02
MD5 2dae4204d095b56ccedef00e3c3be421
BLAKE2b-256 6d1d7180869bfed43871b542ade74d96c9687bf3bdb8ab3d032c986a6e1ee949

See more details on using hashes here.

File details

Details for the file commoneval-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: commoneval-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 12.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.13.7 Darwin/25.0.0

File hashes

Hashes for commoneval-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ea012dcaff815d2a46af01421abd9df00244c7cc9a26d9b4064a8275872c4a3e
MD5 7dc525a417613ee90baa64bbcf352cc0
BLAKE2b-256 6d224f6b5a6bdd576a930dd9537749a07435fdf9351ca555f83640e2a2cb33b3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page