Runner View

Local-first, event-driven evaluation dashboard for DeepEval. Inspect trace runs and open any node's details in under 100 ms. No account, no upload; nothing leaves your machine.

Launch (one command)

uvx runner-view

Opens the most recent stored run in the dashboard. If no runs have been stored yet, it prints an actionable message and exits cleanly.

Open a specific portable trace file:

uvx runner-view path/to/trace.rvtrace

The installed-tool form is equivalent:

uv tool install runner-view
runner-view [target]        # no arg = latest stored run; or a .rvtrace path

Runs are stored append-only at ~/.runner-view/runs.jsonl (one JSON line per run).
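
The append-only layout described above can be sketched as follows. This is a minimal illustration, not Runner View's actual code: the record fields (`run_id`, `test_id`, `metrics`) and the helper names `append_run` and `latest_run` are assumptions, and the example writes to a temporary directory instead of `~/.runner-view/runs.jsonl`.

```python
import json
import tempfile
from pathlib import Path
from typing import Optional


def append_run(store: Path, run: dict) -> None:
    """Append one run as a single JSON line; existing lines are never rewritten."""
    store.parent.mkdir(parents=True, exist_ok=True)
    with store.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(run) + "\n")


def latest_run(store: Path) -> Optional[dict]:
    """Return the last run in the file, or None if nothing has been stored yet."""
    if not store.exists():
        return None
    last = None
    with store.open(encoding="utf-8") as fh:
        for line in fh:
            if line.strip():  # skip blank lines
                last = json.loads(line)
    return last


# Hypothetical records; the real schema may differ.
with tempfile.TemporaryDirectory() as tmp:
    store = Path(tmp) / "runs.jsonl"
    empty_result = latest_run(store)  # no file yet, so there is no latest run
    append_run(store, {"run_id": "r1", "test_id": "t1", "metrics": {"relevancy": 0.7}})
    append_run(store, {"run_id": "r2", "test_id": "t1", "metrics": {"relevancy": 0.9}})
    newest = latest_run(store)
    line_count = len(store.read_text(encoding="utf-8").splitlines())
```

Because each run is one line, "latest stored run" reduces to "last non-empty line", and a missing file maps naturally onto the no-stored-runs case.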

Develop

PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 PYTHONPATH=. pytest -q

PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 keeps the suite isolated from the project's own pytest11 entry point and sidesteps the pytest-rerunfailures socket-bind issue in sandboxed shells.

Status

Epic 2 (one-command offline launch), Epic 3 (pytest debug launcher with zero-click failure triage), and Epic 7 (Analyst Curation Layer with regression diff views and flag-to-dataset functionality) have shipped.

Analyst Curation Layer Features:

  • Regression Diff View: Shown automatically when two or more runs share the same test_id
  • Flag-to-Dataset Mapping: Flag spans and save them to a local dataset file
  • Visual Highlighting: 🟢 Improved, 🔴 Regressed, ⚪ Unchanged metrics
  • Per-Metric Comparison: Detailed comparison of scores across runs
  • Group by Test ID: Runs with the same test_id are automatically grouped
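
The grouping and per-metric comparison listed above could look roughly like this. It is a sketch under stated assumptions, not the dashboard's implementation: the run shape, the metric names, the `EPSILON` tolerance, the choice to compare the first and last run in a group, and the rule that a higher score is better are all hypothetical.

```python
from collections import defaultdict

EPSILON = 1e-9  # assumed tolerance for treating two scores as equal


def group_by_test_id(runs):
    """Group runs by test_id, preserving the order in which they were stored."""
    groups = defaultdict(list)
    for run in runs:
        groups[run["test_id"]].append(run)
    return dict(groups)


def diff_metrics(baseline, candidate):
    """Label each metric present in both runs (assumes higher scores are better)."""
    labels = {}
    for name in baseline["metrics"].keys() & candidate["metrics"].keys():
        delta = candidate["metrics"][name] - baseline["metrics"][name]
        if delta > EPSILON:
            labels[name] = "improved"
        elif delta < -EPSILON:
            labels[name] = "regressed"
        else:
            labels[name] = "unchanged"
    return labels


# Hypothetical runs; metric names are placeholders.
runs = [
    {"run_id": "r1", "test_id": "t1",
     "metrics": {"relevancy": 0.70, "faithfulness": 0.90, "coherence": 0.50}},
    {"run_id": "r2", "test_id": "t2", "metrics": {"relevancy": 0.50}},
    {"run_id": "r3", "test_id": "t1",
     "metrics": {"relevancy": 0.85, "faithfulness": 0.80, "coherence": 0.50}},
]

groups = group_by_test_id(runs)
# A diff only makes sense when at least two runs share a test_id.
diffs = {
    test_id: diff_metrics(group[0], group[-1])
    for test_id, group in groups.items()
    if len(group) >= 2
}
```

Here `t1` gets a diff because two runs share it, while `t2` has a single run and is skipped, which mirrors the "2+ runs" condition for showing the diff view.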

Decisions and context live in _notes/.

