Tools and Techniques for Consistency Benchmarking

ConsistencyBench

Setup

First install the package from PyPI.

pip install consistencybench

To use the NER metric (consistencybench.metrics.AgreementNER), also download the spaCy English model before running it.

python -m spacy download en_core_web_sm
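Before using the NER metric, it can help to confirm that the model download succeeded. The sketch below relies only on the fact that spaCy models install as importable Python packages. The helper name is_model_installed is hypothetical and not part of consistencybench.

```python
import importlib.util


def is_model_installed(model_name: str = "en_core_web_sm") -> bool:
    """Return True if a package with this name is importable in the current environment."""
    # find_spec returns None when no top-level package of that name exists.
    return importlib.util.find_spec(model_name) is not None


if not is_model_installed():
    print("Model missing. Run: python -m spacy download en_core_web_sm")
```

The check only looks the package up and does not load the model, so it is cheap to run at the top of a script.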

To start generation and scoring

Set the arguments defined in run_eval.py, then run the script with your OpenAI API key:

python run_eval.py --openai_api_key <OPENAI_API_KEY>
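The command above can also be launched from Python, which keeps the key out of your shell history. This is a minimal sketch, assuming the key is stored in an environment variable named OPENAI_API_KEY (an assumption of this example, not something run_eval.py is known to read) and that run_eval.py is in the current directory. Only the --openai_api_key flag comes from the command above. The helper names are hypothetical.

```python
import os
import subprocess
import sys


def build_eval_command(api_key: str, script: str = "run_eval.py") -> list[str]:
    """Build the argument list for the evaluation script."""
    # sys.executable runs the script with the same interpreter as this process.
    return [sys.executable, script, "--openai_api_key", api_key]


def run_eval() -> int:
    """Run the evaluation script and return its exit code."""
    # Assumption: the key lives in the OPENAI_API_KEY environment variable.
    api_key = os.environ.get("OPENAI_API_KEY")
    if not api_key:
        raise RuntimeError("Set the OPENAI_API_KEY environment variable first.")
    return subprocess.run(build_eval_command(api_key), check=False).returncode
```

Calling run_eval() starts the script. The key is still passed as a command-line argument, so it remains visible in the process list while the script runs.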

Example Output file: consistencybench/result_gpt-3.5-turbo_paraphrasing.csv
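To take a first look at a result file, something like the following may be enough. The column layout of the CSV is not documented here, so the sketch makes no assumption about column names and only reports the header and row count. The helper name summarize_results is hypothetical.

```python
import csv


def summarize_results(path: str) -> tuple[list[str], int]:
    """Return the column names and the number of data rows in a CSV file."""
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        rows = list(reader)  # reading the rows also populates reader.fieldnames
        return list(reader.fieldnames or []), len(rows)
```

For example, summarize_results("consistencybench/result_gpt-3.5-turbo_paraphrasing.csv") lists whatever columns the run produced, which is a reasonable starting point before loading the file into a larger analysis.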

Example Jupyter Notebook: example.ipynb
