Skip to main content

Local CLI and desktop app for turning AI failure notes into reusable eval cases

Project description

Terminal report preview

Eval Case Curator

Eval Case Curator turns messy AI failure notes, support escalations, prompt review comments, and QA findings into reusable eval cases.

It runs locally, redacts common contact/credential patterns, and exports Markdown, JSON, JSONL, or HTML reports that can be dropped into prompt regression suites or review notes.

Install

pip install eval-case-curator

CLI

eval-case-curator examples/failure-notes.txt
eval-case-curator examples/failure-notes.txt --format jsonl --output evals.jsonl
eval-case-curator examples/failure-notes.txt --fail-under 3

Desktop App

eval-case-curator-gui

Paste notes, curate cases, copy JSONL, or save a Markdown report.

What It Extracts

  • Candidate eval input from Input:, Prompt:, User:, or fallback context.
  • Expected behavior from Expected:, Should:, Desired:, or failure wording.
  • Priority from safety, privacy, financial, legal, hallucination, format, and missing-answer clues.
  • Tags such as privacy, format, retrieval, instruction-following, tool-use, and tone.
  • Source line references and risk notes for review traceability.

Example

eval-case-curator examples/failure-notes.txt --format jsonl
{"id":"eval-...","input":"Summarize the renewal risk...","expected":"Mention renewal risk without exposing direct contact details.","metadata":{"priority":"high","tags":["privacy"]}}

Development

PYTHONPATH=src python3 -m unittest discover -s tests
python3 -m build --no-isolation
python3 -m twine check dist/*

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

eval_case_curator-0.1.0.tar.gz (11.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

eval_case_curator-0.1.0-py3-none-any.whl (10.4 kB view details)

Uploaded Python 3

File details

Details for the file eval_case_curator-0.1.0.tar.gz.

File metadata

  • Download URL: eval_case_curator-0.1.0.tar.gz
  • Upload date:
  • Size: 11.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for eval_case_curator-0.1.0.tar.gz
Algorithm Hash digest
SHA256 1884e013965c39ee0ac3cd9773b174e6933f4dc3c8eb9d79b106d5c9ef2e18bb
MD5 03f5c981e9d44327635ef2e1c90aae7a
BLAKE2b-256 72135edc23410ceb43d34bea0b613f58ae29fee9b48a590465c6d850a9cb4340

See more details on using hashes here.

File details

Details for the file eval_case_curator-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for eval_case_curator-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c998ca7e73a7f594d8837426c5cb43e2602404fd7bb7478c5c7b2ad19e24a6fe
MD5 7aa9b3c4a2ab26eeadfbf739e7638aec
BLAKE2b-256 8eded6b2e7778d9059fe1af3063fbf7758af675fd73a4d6002d554ab7b42f87e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page