Local-first, event-driven evaluation dashboard for DeepEval
Runner View
Local-first, event-driven evaluation dashboard for DeepEval. Inspect trace runs and click through to any node's detail in under 100 ms: no account, no upload, nothing leaves your machine.
Launch (one command)
uvx runner-view
Opens the most recent stored run in the dashboard. If no runs are stored yet, it prints an actionable message and exits cleanly.
Open a specific portable trace file:
uvx runner-view path/to/trace.rvtrace
The installed-tool form is equivalent:
uv tool install runner-view
runner-view [target] # no arg = latest stored run; or a .rvtrace path
Runs are stored append-only at ~/.runner-view/runs.jsonl (one JSON line per run).
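For readers who want to inspect the store directly, here is a minimal sketch of reading and appending to a JSON Lines file of this shape. Only the path and the one-line-per-run layout come from this README; the helper names (`load_runs`, `append_run`, `latest_run`) are hypothetical, the fields inside each record are not documented here, and treating the last line as the latest run is an assumption that follows from append-only writes.

```python
import json
from pathlib import Path
from typing import Any, Optional

# Path taken from the README; the record layout inside each line is not specified there.
RUNS_FILE = Path.home() / ".runner-view" / "runs.jsonl"


def load_runs(path: Path = RUNS_FILE) -> list[dict[str, Any]]:
    """Return every run record in file order (oldest first); empty list if no file."""
    if not path.exists():
        return []
    runs = []
    with path.open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if line:  # tolerate blank lines
                runs.append(json.loads(line))
    return runs


def append_run(run: dict[str, Any], path: Path = RUNS_FILE) -> None:
    """Append one run as a single JSON line without rewriting earlier lines."""
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(run) + "\n")


def latest_run(path: Path = RUNS_FILE) -> Optional[dict[str, Any]]:
    """Return the last appended run, or None when nothing is stored yet."""
    runs = load_runs(path)
    return runs[-1] if runs else None
```

Because each run occupies exactly one line, a reader can stream the file line by line and a writer only ever opens it in append mode.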
Develop
PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 PYTHONPATH=. pytest -q
PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 keeps the suite isolated from the project's own pytest11 entry point and sidesteps the pytest-rerunfailures socket-bind issue in sandboxed shells.
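If you prefer to launch the suite from a script rather than a shell one-liner, the same environment can be set up in Python. This is only a sketch: the function names `isolated_pytest_env` and `run_suite` are hypothetical and not part of the project.

```python
import os
import subprocess
import sys
from typing import Mapping, Optional


def isolated_pytest_env(base: Optional[Mapping[str, str]] = None) -> dict[str, str]:
    """Copy the environment and apply the two variables from the shell command."""
    env = dict(os.environ if base is None else base)
    env["PYTEST_DISABLE_PLUGIN_AUTOLOAD"] = "1"  # skip entry-point plugin discovery
    env["PYTHONPATH"] = "."  # same override as the shell form
    return env


def run_suite() -> int:
    """Run `pytest -q` in the current directory and return its exit code."""
    result = subprocess.run(
        [sys.executable, "-m", "pytest", "-q"],
        env=isolated_pytest_env(),
    )
    return result.returncode
```

Calling `run_suite()` from the repository root should behave like the command above, assuming pytest is installed in the active interpreter.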
Status
Epic 2 (one-command offline launch), Epic 3 (pytest debug launcher: zero-click failure triage), and Epic 7 (Analyst Curation Layer: regression diff views and flag-to-dataset functionality) have shipped.
Analyst Curation Layer Features:
- Regression Diff View: Shown automatically when two or more runs share the same test_id
- Flag-to-Dataset Mapping: Flag spans and save them to a local dataset file
- Visual Highlighting: 🟢 Improved, 🔴 Regressed, ⚪ Unchanged metrics
- Per-Metric Comparison: Detailed comparison of scores across runs
- Group by Test ID: Runs with the same test_id are automatically grouped
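The grouping and comparison described in the list above could be sketched roughly as follows. This is an illustration, not Runner View's actual code: the `metrics` mapping of metric name to score, the rule that a higher score is better, the choice to compare the two most recent runs, and all function names are assumptions. Only `test_id` and the improved/regressed/unchanged labels come from the README.

```python
from collections import defaultdict
from typing import Any


def group_by_test_id(runs: list[dict[str, Any]]) -> dict[str, list[dict[str, Any]]]:
    """Group runs by test_id, preserving the order in which they were stored."""
    groups: dict[str, list[dict[str, Any]]] = defaultdict(list)
    for run in runs:
        groups[run["test_id"]].append(run)
    return dict(groups)


def classify(previous: float, current: float, tolerance: float = 1e-9) -> str:
    """Label a score change, assuming that a higher score is better."""
    delta = current - previous
    if delta > tolerance:
        return "improved"
    if delta < -tolerance:
        return "regressed"
    return "unchanged"


def diff_metrics(baseline: dict[str, Any], candidate: dict[str, Any]) -> dict[str, str]:
    """Compare the metrics that both runs report, one label per metric."""
    shared = baseline["metrics"].keys() & candidate["metrics"].keys()
    return {
        name: classify(baseline["metrics"][name], candidate["metrics"][name])
        for name in sorted(shared)
    }


def regression_diffs(runs: list[dict[str, Any]]) -> dict[str, dict[str, str]]:
    """For every test_id with at least two runs, diff its two most recent runs."""
    return {
        test_id: diff_metrics(group[-2], group[-1])
        for test_id, group in group_by_test_id(runs).items()
        if len(group) >= 2
    }
```

A test_id with a single run produces no entry, which mirrors the rule that the diff view only appears once two or more runs share an ID.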
Decisions and context live in _notes/.
File details
Details for the file runner_view-0.1.0.tar.gz.
File metadata
- Download URL: runner_view-0.1.0.tar.gz
- Upload date:
- Size: 2.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.13
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 9fc5d75e2da76c3716621bf922d99508b2831dd2692d85e39c515eae0b40eb6f |
| MD5 | 88824202973ddd83216b12bcb819dc2b |
| BLAKE2b-256 | b01fedb21c5ba8a83d5a6c7b427762004a1772206a714fb304fa3a4c0772522a |
File details
Details for the file runner_view-0.1.0-py3-none-any.whl.
File metadata
- Download URL: runner_view-0.1.0-py3-none-any.whl
- Upload date:
- Size: 16.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.13
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | e70d4c7b6da5532c9894bb83e877872a4c0446cca9f780c9479b3f062b2adfad |
| MD5 | 1d774d6570fc6f0cec2c7190abd4739a |
| BLAKE2b-256 | 679e3243b97a2abc2cb12d25524e524cc46324f2a8cb72c546f542b94bc69787 |