Evidence-backed capability packaging layer for cross-runtime agents.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Baekpica

These details have not been verified by PyPI

Project links

Homepage

Project description

oh-my-field logo

oh-my-field

Website

Field-fit agents to real work. oh-my-field turns one-off agent sessions into portable, evidence-backed capability packages.

OMF can be driven by hand from the CLI, but the intended loop is agent-assisted: an agent records its own work as an OMF session, materializes that work into immutable evidence, promotes the repeatable parts into a reviewable capability package, and carries that capability across runtimes, models, and projects.

The OMF CLI is Apache-2.0 licensed. Capability artifacts generated from your work remain owned by you or the project that generated them unless you choose to publish them under separate terms.

oh-my-field is currently published as an alpha CLI. The core package and capability contracts are usable, but release consumers should expect the public surface to keep tightening while feedback lands.

What OMF Is

A capability packaging and verification layer around external agents.
A way to keep good agent work as reviewable, repeatable, portable packages.

What OMF Is Not

Not an agent runtime — Codex, Claude Code, Hermes, Pi, Odysseus, or another agent does the work.
Not a prompt vault — a capability is instructions plus context policy, harness, evidence, eval cases, and integrity metadata.
Not an autonomous shell runner — risky commands are recorded as intent and require explicit approval before they execute.

Why It Exists

Agent work disappears into chat history instead of compounding.
A good run is hard to reproduce without its evidence, context, and checks.
Local/domain tacit knowledge (constraints, preferences, failure history) is rarely captured.
Migrating runtime or model silently loses behavior.
Teams need evidence, harnesses, and reviewable packages — not another hand-written prompt.

The product goal: turn "the agent did this once" into "this team can reuse and verify this capability again."

How OMF Fits Into An Agent Workflow

The agent still does the work. OMF records what happened, preserves evidence, promotes repeatable work into a capability package, and tracks whether that capability actually works on another runtime or project.

external agent runtime
  Codex / Claude Code / Hermes / Pi / Odysseus / local agent
        │
        ▼
OMF session  ──or──  imported run
        │
        ▼
evidence record
        │
        ▼
capability package
        │
        ├── health / verify / eval / review
        ├── learn / reflect / harden
        ▼
canonical .omfcap.tar.gz archive
        │
        ├── optional runtime projection
        ▼
OMF target project import
        │
        ▼
target validation

See docs/concepts.md for the Field, Evidence, Capability, Harness, and Portability definitions, and docs/agent-ux.md for the activation model.

Install

pipx install oh-my-field      # persistent CLI install
omf --help

uvx oh-my-field --help        # try without installing

Development install from source:

git clone https://github.com/Baekpica/oh-my-field.git
cd oh-my-field
uv sync --all-extras --dev
uv run omf --help

Local checks mirror CI — see CONTRIBUTING.md and docs/development.md. The full install guide, including how to verify the install, is in docs/install.md.

Agent Activation

OMF can be used manually, but the intended loop is agent-assisted. Install an OMF meta-skill for the target agent runtime:

omf install skill --runtime codex
omf install skill --runtime claude_code
omf install skill --runtime hermes
omf install skill --runtime pi
omf install skill --runtime odysseus --project /path/to/odysseus
omf install skill --runtime generic --scope export

By default Codex, Claude Code, Hermes, and Pi install into their user-level skill discovery paths (~/.agents/skills, ~/.claude/skills, ~/.hermes/skills, ~/.pi/agent/skills). Odysseus installs into the target checkout's data/skills tree. generic keeps producing reviewable export assets under .omf/agent/omf-skill.

For MCP-capable clients, patch the matching client config and run the stdio server:

omf install mcp --client codex
omf install mcp --client claude_code
omf install mcp --client hermes
omf install mcp --client pi
omf install mcp --client odysseus --project /path/to/odysseus
omf install mcp --client generic --scope export --out .omf/mcp.json
omf mcp serve

omf runtime install <runtime> combines both steps (controller skill + MCP config), and omf runtime conformance <runtime> verifies the adoption surface afterwards: controller skill installed, MCP config present, omf on PATH, capability skills installed as launchers, and imported targets validated. Exported per-capability skills are launcher-style by default -- they direct the agent into the OMF lifecycle, starting with omf capability import for the canonical .omfcap.tar.gz package, instead of restating the task (see docs/portability.md).

Once activated, a human can say /omf or "track this task with OMF" and the agent records its work as an OMF session, materializes that session into immutable evidence, and proposes a reusable capability package. The MCP surface mirrors the same loop (omf_start_session, omf_record_input, omf_record_artifact, omf_record_validation, omf_record_decision, omf_materialize_session, omf_promote_capability, …). See docs/mcp.md and docs/agent-ux.md.

Quickstart A: Track An Agent Session

Use this when an agent can call OMF during the work.

omf init

omf session start \
  --runtime codex \
  --model gpt-5.5 \
  --goal "triage repository issue" \
  --activation-source skill

Create a tiny input, output artifact, and validation result for the example:

mkdir -p output
printf "issue: repository bug report\n" > issue.md
printf '{"status":"triaged"}\n' > output/report.json
printf "pytest passed\n" > output/pytest.txt

Copy the session_id from the JSON output, then record meaningful events. The context, artifact, and validation paths make the materialized evidence strict-ready for promotion:

omf session event <session_id> \
  --type context \
  --summary "Captured issue report" \
  --path issue.md

omf session event <session_id> \
  --type command \
  --summary "Ran the test suite" \
  --command "uv run pytest" \
  --exit-code 0

omf session event <session_id> \
  --type artifact \
  --summary "Produced triage report" \
  --path output/report.json

omf session event <session_id> \
  --type test_result \
  --summary "pytest passed" \
  --path output/pytest.txt \
  --command "uv run pytest" \
  --exit-code 0

omf session finish <session_id> --outcome success
omf session materialize <session_id>
omf session suggest-capability <session_id>

Promote the resulting evidence into a capability and check its health. promote is strict by default; use --no-strict only when intentionally promoting legacy or incomplete evidence:

omf promote <evidence_id> \
  --name repo_issue_triage \
  --description "Repository issue triage capability"

omf health repo_issue_triage

Quickstart B: Import An Existing Run

Use this when the agent already produced logs, diffs, test outputs, or artifacts.

mkdir -p /tmp/omf-smoke
printf "agent run log\n" > /tmp/omf-smoke/codex.log
printf "pytest passed\n" > /tmp/omf-smoke/pytest.txt

omf import-run codex \
  --log /tmp/omf-smoke/codex.log \
  --goal "triage repo issue" \
  --test-result /tmp/omf-smoke/pytest.txt \
  --evidence-dir /tmp/omf-smoke/evidence \
  --outcome success

omf promote <evidence_id> \
  --name repo_issue_triage \
  --description "Repository issue triage capability" \
  --evidence-dir /tmp/omf-smoke/evidence \
  --capabilities-dir /tmp/omf-smoke/capabilities

omf health repo_issue_triage \
  --capabilities-dir /tmp/omf-smoke/capabilities

From a source checkout, prefix each command with uv run. The full walkthrough, including the export/import/validate path, lives in docs/quickstart.md.

What You Get

promote creates a runtime-neutral package under capabilities/<name>/ — the source of truth:

capabilities/<name>/
  capability.yaml     # canonical metadata and provenance
  instructions.md     # runtime-neutral agent instructions
  harness.yaml        # verification and approval boundaries
  README.md           # human-readable capability card
  contracts/
    task_contract.yaml
    artifacts.yaml
    validation.md
    replay_plan.yaml
  validators/
    validate_contract.py

The contract files are generated from hardened evidence and are copied into runtime projections so target agents can see the task, artifact, validation, and replay contract without reconstructing it from prose.

That per-capability README.md is the capability card — purpose, source evidence, harness summary, portability and review status — and is distinct from this repository's README.

omf init sets up the repo-local field — .omf/config.yaml, .omfignore, and the artifact directories. It also creates the top-level capabilities/ directory for reviewable packages:

capabilities/
  <name>/

.omf/
  evidence/      sessions/     exports/      imports/
  evals/         replays/      context/      learning/
  datasets/      reflections/  workflows/    runs/

Runtime-specific files (Codex instructions, Claude Code memory, Hermes/Pi/Odysseus skill projections, and generic runbook projections) are projections of the package, not the source of truth.

.omf/config.yaml records local field defaults and .omf/registry.yaml is local registry metadata. The package files under capabilities/<name>/ remain the authoritative capability source.

Learning And Datasets

Accumulated evidence becomes reviewable learning material, not silent training data. learn and reflect turn evidence and eval results into learning exports and reflection reports; learn-patch records accept/reject decisions on proposed prompt patches.

dataset-export then emits JSONL from those downstream artifacts — learning exports (fine-tuning), patch decisions (preference), and eval results (eval) — not from raw evidence. Review and harness status sit upstream, so unreviewed or failing runs do not silently become a dataset.

omf dataset-export <capability_name> --dataset-type all

Command Map

Area	Commands / options	Purpose
Setup	`init`, `doctor`, `--version`	Create and inspect a repo-local OMF field
Agent activation	`install skill`, `install mcp`, `mcp serve`	Give agents a lower-friction OMF surface
Session tracking	`session start`, `session event`, `session finish`, `session materialize`, `session suggest-capability`	Record active agent work as structured evidence
Evidence ingestion	`import-run`, `capture`	Import existing logs, artifacts, diffs, and test results
Capability build	`promote`, `health`, `harden`, `card`, `registry`	Create and inspect reusable capability packages
Verification	`replay`, `eval`, `verify`, `verify package`, `regression-case`	Check whether a capability still satisfies its harness or package manifest
Review and learning	`review`, `approve`, `reject`, `revise`, `learn`, `learn-patch`, `reflect`	Turn feedback and failures into accepted patches
Pipeline	`run`, `resume`, `rollback`, `status`	Drive and resume the full checkpointed pipeline
Portability	`capability export`, `capability unpack`, `capability import`, `capability validate`, `capability remap`, `capability adapt`, `export`	Move capabilities across runtimes and projects
Operations	`dashboard`, `inspect`, `diff`, `explain` (`why`), `context`, `dataset-export`	Inspect, explain, compare, and export accumulated evidence

Portability Lifecycle

A capability moves across runtimes through four distinct states — keep them separate, because "can be exported" is not "works on the target":

Exported: packaged as a canonical .omfcap.tar.gz archive with optional target runtime projections (omf capability export).
Imported: the archive or directory package has been materialized into a target project's OMF registry (omf capability import).
Validated: an actual target run passed under the recorded target runtime/model/project and no hard blockers remain (omf capability validate --run-command ...). Static import --validate alone leaves the import at needs_validation.
Portable: the capability has at least one validated target import.

omf health reports export_status, import_status, and validation_status separately, and lists each imported target with its own status.

omf capability export repo_issue_triage --target hermes \
  --out .omf/exports/repo_issue_triage-hermes
omf verify package .omf/exports/repo_issue_triage-hermes.omfcap.tar.gz
omf capability import .omf/exports/repo_issue_triage-hermes.omfcap.tar.gz \
  --runtime hermes --validate
omf capability validate repo_issue_triage --target hermes \
  --run-command "hermes-code --profile target --skill repo_issue_triage" \
  --approve-command-risk

Target validation reports separate hard blockers from advisory portability risk. Missing required tools, unresolved context remaps, failed target runs, missing expected artifacts, and opt-in contract validator failures require adaptation. Runtime/model/project differences lower the advisory risk score but do not block validated after the target run and required artifacts pass.

Copying a generated Codex, Claude Code, Hermes, Pi, Odysseus, or generic projection into a runtime only makes that runtime discover a launcher. It is not an import; every target run should enter through omf capability import so OMF can create registry state, import reports, and follow-up commands.

The full cross-runtime walkthrough (Codex/gpt-5.5 → Hermes/qwen3.6-27B, redacted evidence transfer, overlays) is in docs/portability.md and the runtime adapter docs.

Safety Model

OMF is not an arbitrary shell runner. It records command intent and risk. Commands classified as write, destructive, external, credential, production, or paid risk are recorded but not executed unless they receive explicit approval (--approve-command-risk). Capability exports are gated by --approve-export.

Prefer the shell-free --run-argv form (one token per flag) over the legacy shell strings --command / --harness-command / --run-command; --require-cwd-inside-project blocks commands that escape the project root.
Commands run with a minimal environment (PATH, HOME, TMPDIR); secret-bearing vars (OPENAI_API_KEY, ANTHROPIC_API_KEY, AWS_*, GITHUB_TOKEN, …) are stripped and recorded. Opt one back in with --allow-env NAME.
import-run --artifact-root skips .git/, .venv/, node_modules/, .env*, private-key patterns, and symlinks; honors .omfignore/--exclude; and caps traversal via --max-artifact-count/--max-total-artifact-bytes.

Full details: docs/security.md.

Architecture At A Glance

OMF is organized into layers (flat root modules such as oh_my_field.models and oh_my_field.storage remain as compatibility shims while call sites migrate):

Layer	Path	Responsibility
CLI	`src/oh_my_field/cli/`	Typer command surface
Application	`src/oh_my_field/application/`	use-case workflows
Domain	`src/oh_my_field/domain/`	models, rules, lifecycle
Infrastructure	`src/oh_my_field/infrastructure/`	storage, hashing, execution
Adapters	`src/oh_my_field/adapters/`	runtime-specific behavior
Schemas	`schemas/`	artifact JSON Schema contracts

See docs/architecture/overview.md for the dependency direction and per-concept layout.

Learn More

Full product and feature reference: oh-my-field.md
Install guide: docs/install.md
Quickstart (session / import / portability paths): docs/quickstart.md
Agent UX and activation: docs/agent-ux.md
MCP surface: docs/mcp.md
Concepts: docs/concepts.md
Portability: docs/portability.md
Security model: docs/security.md
Architecture overview: docs/architecture/overview.md
Development guide: docs/development.md
Runtime adapters: Codex, Claude Code, Hermes, Generic

Practical Notes

The agent-assisted session loop is the primary path; manual import-run is the fallback when you only have logs after the fact.
Keep generated examples in /private/tmp/... while trying the CLI.
Record failed runs too; they are the raw material for stronger capabilities.
Treat human review as part of the system, not as a failure mode.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Baekpica

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.3.2

Jun 16, 2026

0.3.1

Jun 16, 2026

This version

0.3.0

Jun 15, 2026

0.2.5

Jun 15, 2026

0.2.4

Jun 11, 2026

0.2.3

Jun 11, 2026

0.2.2

Jun 10, 2026

0.2.1

Jun 10, 2026

0.2.0

Jun 10, 2026

0.1.0

Jun 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oh_my_field-0.3.0.tar.gz (735.5 kB view details)

Uploaded Jun 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

oh_my_field-0.3.0-py3-none-any.whl (248.4 kB view details)

Uploaded Jun 15, 2026 Python 3

File details

Details for the file oh_my_field-0.3.0.tar.gz.

File metadata

Download URL: oh_my_field-0.3.0.tar.gz
Upload date: Jun 15, 2026
Size: 735.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for oh_my_field-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`90a658cbd35e9bb10d57c5d917764ec715081ec33b1c59a9cf9a6d1b467639b2`
MD5	`8cd2d0a3e774413770cf1d1f8b017730`
BLAKE2b-256	`b537c2f8d04a1272cc6bac38fa955392d26cd8e30c630dc32178d4c082a1b2ef`

See more details on using hashes here.

Provenance

The following attestation bundles were made for oh_my_field-0.3.0.tar.gz:

Publisher: release.yml on Baekpica/oh-my-field

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: oh_my_field-0.3.0.tar.gz
- Subject digest: 90a658cbd35e9bb10d57c5d917764ec715081ec33b1c59a9cf9a6d1b467639b2
- Sigstore transparency entry: 1826824999
- Sigstore integration time: Jun 15, 2026
Source repository:
- Permalink: Baekpica/oh-my-field@35e28985e4b3a05e219089a6a549f6241007f2a2
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/Baekpica
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@35e28985e4b3a05e219089a6a549f6241007f2a2
- Trigger Event: push

File details

Details for the file oh_my_field-0.3.0-py3-none-any.whl.

File metadata

Download URL: oh_my_field-0.3.0-py3-none-any.whl
Upload date: Jun 15, 2026
Size: 248.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for oh_my_field-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2c46e198354779c41bf6813d33672fcd247a53a7094cbd6c88ef3f3eae484ecb`
MD5	`7da81892ab609012c599792bf2714432`
BLAKE2b-256	`b88278db36985427fc711b1fa71dec92c3ef4d0af906006779041da286075c22`

See more details on using hashes here.

Provenance

The following attestation bundles were made for oh_my_field-0.3.0-py3-none-any.whl:

Publisher: release.yml on Baekpica/oh-my-field

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: oh_my_field-0.3.0-py3-none-any.whl
- Subject digest: 2c46e198354779c41bf6813d33672fcd247a53a7094cbd6c88ef3f3eae484ecb
- Sigstore transparency entry: 1826825172
- Sigstore integration time: Jun 15, 2026
Source repository:
- Permalink: Baekpica/oh-my-field@35e28985e4b3a05e219089a6a549f6241007f2a2
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/Baekpica
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@35e28985e4b3a05e219089a6a549f6241007f2a2
- Trigger Event: push

oh-my-field 0.3.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

oh-my-field

What OMF Is

What OMF Is Not

Why It Exists

How OMF Fits Into An Agent Workflow

Install

Agent Activation

Quickstart A: Track An Agent Session

Quickstart B: Import An Existing Run

What You Get

Learning And Datasets

Command Map

Portability Lifecycle

Safety Model

Architecture At A Glance

Learn More

Practical Notes

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance