Local CLI and desktop app for turning AI failure notes into reusable eval cases
Project description
Eval Case Curator
Eval Case Curator turns messy AI failure notes, support escalations, prompt review comments, and QA findings into reusable eval cases.
It runs locally, redacts common contact/credential patterns, and exports Markdown, JSON, JSONL, or HTML reports that can be dropped into prompt regression suites or review notes.
Install
pip install eval-case-curator
CLI
eval-case-curator examples/failure-notes.txt
eval-case-curator examples/failure-notes.txt --format jsonl --output evals.jsonl
eval-case-curator examples/failure-notes.txt --fail-under 3
Desktop App
eval-case-curator-gui
Paste notes, curate cases, copy JSONL, or save a Markdown report.
What It Extracts
- Candidate eval input from
Input:,Prompt:,User:, or fallback context. - Expected behavior from
Expected:,Should:,Desired:, or failure wording. - Priority from safety, privacy, financial, legal, hallucination, format, and missing-answer clues.
- Tags such as
privacy,format,retrieval,instruction-following,tool-use, andtone. - Source line references and risk notes for review traceability.
Example
eval-case-curator examples/failure-notes.txt --format jsonl
{"id":"eval-...","input":"Summarize the renewal risk...","expected":"Mention renewal risk without exposing direct contact details.","metadata":{"priority":"high","tags":["privacy"]}}
Development
PYTHONPATH=src python3 -m unittest discover -s tests
python3 -m build --no-isolation
python3 -m twine check dist/*
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file eval_case_curator-0.1.0.tar.gz.
File metadata
- Download URL: eval_case_curator-0.1.0.tar.gz
- Upload date:
- Size: 11.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1884e013965c39ee0ac3cd9773b174e6933f4dc3c8eb9d79b106d5c9ef2e18bb
|
|
| MD5 |
03f5c981e9d44327635ef2e1c90aae7a
|
|
| BLAKE2b-256 |
72135edc23410ceb43d34bea0b613f58ae29fee9b48a590465c6d850a9cb4340
|
File details
Details for the file eval_case_curator-0.1.0-py3-none-any.whl.
File metadata
- Download URL: eval_case_curator-0.1.0-py3-none-any.whl
- Upload date:
- Size: 10.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c998ca7e73a7f594d8837426c5cb43e2602404fd7bb7478c5c7b2ad19e24a6fe
|
|
| MD5 |
7aa9b3c4a2ab26eeadfbf739e7638aec
|
|
| BLAKE2b-256 |
8eded6b2e7778d9059fe1af3063fbf7758af675fd73a4d6002d554ab7b42f87e
|