Skip to main content

Evaluation-Driven Development toolkit for OpenClaw agents

Project description

openclaw-edd logo

openclaw-edd

CI PyPI version npm version License: MIT

Evaluation-Driven Development for OpenClaw agents — save golden cases from real interactions, catch regressions before they reach users.

中文文档

Quick Start

Install the OpenClaw plugin:

openclaw plugins install openclaw-edd

After a good agent interaction, save it as a golden case:

/edd save

After modifying a skill, run all saved cases to check for regressions:

/edd

That's it. Cases are stored as human-readable YAML at <workspace>/skills/<skill>/edd.yaml.

Test Case Format

Cases are saved automatically by /edd save and editable by hand:

cases:
  - id: mysql_slow_query
    message: "Any slow queries in MySQL recently"
    expect_tools:
      - exec
    expect_commands:
      - "check_health"
    forbidden_commands:
      - "rm -rf"
    expect_output_contains:
      - "slow query"
    timeout_s: 30
    tags: [mysql, sre]

For the full field reference (pass_at_k, expect_tool_args, eval_type, expect_plan_contains, etc.), see the User Guide.

CI / CLI Integration

For CI pipelines and local observability, install the Python CLI:

pip install openclaw-edd

# Run cases in CI
openclaw-edd run --cases edd.yaml --output-json report.json

# Watch live tool events
openclaw-edd watch

# Mine golden cases from session history
openclaw-edd edd mine --output mined.yaml

The plugin and CLI share the same edd.yaml format. See the User Guide for the full EDD loop (suggestapplydiffminejudgeexport).

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openclaw_edd-0.5.0.tar.gz (51.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

openclaw_edd-0.5.0-py3-none-any.whl (49.2 kB view details)

Uploaded Python 3

File details

Details for the file openclaw_edd-0.5.0.tar.gz.

File metadata

  • Download URL: openclaw_edd-0.5.0.tar.gz
  • Upload date:
  • Size: 51.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for openclaw_edd-0.5.0.tar.gz
Algorithm Hash digest
SHA256 24cf4e341a2cdee165c8b494ffe29bb86879f343cb148443b124f50951134181
MD5 c4374f7c2293e09890a9fe1564ceea95
BLAKE2b-256 c297a0c2b20bf6b858f61c6e73b81eeaa387631dd74309d354f84671b0a6903e

See more details on using hashes here.

Provenance

The following attestation bundles were made for openclaw_edd-0.5.0.tar.gz:

Publisher: release.yml on Belyenochi/openclaw-edd

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file openclaw_edd-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: openclaw_edd-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 49.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for openclaw_edd-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ed2ce2749b439b4973856e24d4f4505f77f77a09efcbbe480be446722f44e927
MD5 f3431804a4661e4b2ace68383fe167bc
BLAKE2b-256 861f30a5ba5020b619e0427bd96342bb9adc82375ac076b5d17a86c935ddc481

See more details on using hashes here.

Provenance

The following attestation bundles were made for openclaw_edd-0.5.0-py3-none-any.whl:

Publisher: release.yml on Belyenochi/openclaw-edd

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page