Skip to main content

Tools and Techniques for Consistency Benchmarking

Project description

ConsistencyBench

Setup

First install the package from PyPI.

pip install consistencybench

Additionally, if you'd like to use the NER metric (consistencybench.metrics.AgreementNER), run the following first.

python -m spacy download en_core_web_sm

To start generation and scoring

Set the arguments from run_eval.py

python run_eval.py --openai_api_key <OPENAI_API_KEY>

Example Output file: consistencybench/result_gpt-3.5-turbo_paraphrasing.csv

Example Jupyter Notebook: example.ipynb

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

consistencybench-0.1.2.tar.gz (15.0 kB view details)

Uploaded Source

Built Distribution

consistencybench-0.1.2-py3-none-any.whl (18.8 kB view details)

Uploaded Python 3

File details

Details for the file consistencybench-0.1.2.tar.gz.

File metadata

  • Download URL: consistencybench-0.1.2.tar.gz
  • Upload date:
  • Size: 15.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.0 CPython/3.11.5 Linux/6.6.10-76060610-generic

File hashes

Hashes for consistencybench-0.1.2.tar.gz
Algorithm Hash digest
SHA256 5c5aeeccd4f52750911049e7d7792552d154c550bdae8cf3587866d68f87ca3c
MD5 8442d824259880883c5c858a77f657b9
BLAKE2b-256 763c3a62f12fcd5fe9eb582a68f198f68a89a9c690fb1a1840d1045d99ee8c2c

See more details on using hashes here.

File details

Details for the file consistencybench-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: consistencybench-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 18.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.0 CPython/3.11.5 Linux/6.6.10-76060610-generic

File hashes

Hashes for consistencybench-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 7e4987ffce46621527d9b50e810791356ce438d0d997b4d7a5db9a0c857620b6
MD5 a786776d777367c7d3fbbd8b4a68f711
BLAKE2b-256 5b34e5169140931fff66309d76b2b9c71f592b02d78cf26a086b691e20342edb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page