Observable trajectory infrastructure — trading domain reference implementation

These details have not been verified by PyPI

Project links

Project description

ewm-core — Executable World Models

Part of the Crucible project

This repository accompanies the Executable World Models essay series published on Substack. It is the companion to the Beyond Tokens series, which established why world models matter. This series builds them.

Executable World Models — release roadmap

Install

pip install ewm-core
pip install ewm-core[llm]   # includes Anthropic SDK for LLM agent

Agent modes

Mode	Requires	Model
Rule-based agent	Nothing — default	—
LLM agent	`ANTHROPIC_API_KEY` env var	`claude-haiku-4-5-20251001`

The LLM agent receives a market observation (price, SMA5, SMA10, volume, position) and returns a buy / sell / hold decision with a reasoning field.

from ewm_core.agents import LLMAgent

agent = LLMAgent()  # reads ANTHROPIC_API_KEY from environment
decision = agent.decide({
    "symbol": "AAPL", "price": 182.5,
    "sma5": 183.1, "sma10": 180.2,
    "volume": 52_000_000, "position": "flat",
})
# {"type": "buy", "qty": 1, "source": "llm", "reasoning": "..."}

Live demo

Dashboard — synthetic market replay, trajectory viewer, artifact inspector

Quick start

from ewm_core.environment import MarketPathEnvironment
from ewm_core.eval import evaluate_artifact
from ewm_core.learning import build_evidence_policy

What this repository builds

Most agent frameworks focus on what agents can do — tools, skills, orchestration. This project focuses on something different: what it takes for agent behavior to be observable, reproducible, and improvable over time.

The architecture is built in layers, each release adding one:

Execution — a deterministic agent runtime deployed on AWS, executing decisions under explicit budget constraints and emitting structured artifacts for every run
Evaluation — a structural validity layer that verifies artifact integrity before runs are treated as evidence
Environments — a stateful world that evolves step by step, so agents interact with something that changes rather than just querying APIs
Learning — an evidence policy that converts validated experiment results into decision guidance for future runs, without retraining the model

The result is a system that produces trajectories, not just outputs. Trajectories can be replayed, validated, compared across experiments, and used as the dataset from which intelligent systems can begin to improve.

Releases

Each release adds one layer to the architecture. The system compounds — it does not pivot.

Release	Layer added	Essay
`v0.8.1-Agent-Runtime`	Execution — agent runtime deployed on AWS	From Theory to Runtime
`v0.8.3`	Evaluation + environments — structural validity and deterministic world	Evaluation Is a Primitive, Not a Report · Tools Return Results. Environments Change the World.
`v0.8.5`	Evidence policy — learning without retraining	Learning Without Retraining
`v0.8.5.1`	Policy-guided agent — evidence feeding future decisions	Learning Without Retraining

Essays

This series follows directly from Beyond Tokens. That series made the case for why world models matter. This series runs that argument as infrastructure.

Essay	What it covers
What the dashboard reveals	Phase 1 bridge — making agent behaviour visible
From Theory to Runtime	The Agent Runtime goes live on AWS — execution, artifacts, persistence, telemetry
Evaluation Is a Primitive, Not a Report	Structural validation that turns runs into trusted evidence
Tools Return Results. Environments Change the World.	Why environments are the missing layer in most agent architectures
Learning Without Retraining	How agent systems improve decisions without changing the model
The Architecture of Intelligent Systems	What both series, taken together, mean for how intelligent systems are built

Recommended entry point: start with From Theory to Runtime, which introduces the runtime and links directly to the code. Read the earlier Beyond Tokens series for the architectural argument that precedes it.

Architecture

The system is a layered experimental stack. Each layer makes agent behavior more observable and improvable.

agents
↓
constraints        — budget: steps · tool calls · model calls · memory
↓
artifacts          — decision.json · trajectory.json · deltas.json
↓
evaluation         — structural validity · integrity checks
↓
experiments        — aggregate across runs · integrity rate · success rate
↓
environments       — stateful · deterministic · step-by-step
↓
evidence policy    — patterns → decision guidance · no retraining required

The upper layers generate behavior. The lower layers make that behavior observable, trustworthy, and learnable from.

The learning loop

Version v0.8.5 completes the learning loop. The architecture now runs end to end:

environment → trajectories → artifacts → evaluation
    → experiments → evidence dataset → evidence policy → future decisions

This is not reinforcement learning. No model weights are updated. No gradient descent occurs. What changes is the decision architecture — past experiment evidence informs future choices, without touching the model.

Setup

make setup
make lint
pytest

Local demo

python3 scripts/demo_learning_loop.py

What you should see

Agent runs interact with the MarketPathEnvironment step by step
Structured artifacts are written for each run — decision.json, trajectory.json, deltas.json
Evaluation verifies structural integrity — valid runs proceed, invalid runs are excluded
Experiments aggregate results across runs
A learning dataset is exported from validated trajectories

Evidence policy demo (v0.8.5)

# Export learning dataset from validated experiments
python3 scripts/export_learning_dataset.py

# Run the learner stub to produce a learning report
python3 scripts/run_learning_stub.py

# Build the evidence policy from the learning report
python3 scripts/build_evidence_policy.py \
  --learning-report outputs/learning/demo_learning_report.json \
  --output outputs/learning/evidence_policy.json

# Run the policy feedback loop demo
python3 scripts/demo_policy_feedback_loop.py

Policy-guided agent demo (v0.8.5.1)

# Run the policy-guided agent
python3 scripts/demo_policy_guided_trading_agent.py

# Run the full end-to-end learning loop
python3 scripts/demo_end_to_end_learning_loop.py

What you should see

Agent loads an evidence policy and consults it for each decision
Symbol-level preferences take priority, then step-level preferences, then the default action
Each decision includes an explanation of which policy source was used
The complete loop from experiments to decisions runs end to end

AWS deployment

make deploy-agentcore-loop

Verify health:

curl https://<your-api-gateway-url>/health

Run integration tests:

pytest tests/integration

Repository structure

ewm_core/environment/        world environments — MarketPathEnvironment
ewm_core/eval/               structural evaluation layer
ewm_core/learning/           evidence policy and learning scaffold
services/cli/                operational CLI

scripts/                     demos, export tools, and policy builders
tests/                       unit and integration tests
infra/cdk/                   AWS infrastructure — API Gateway, Lambda, DynamoDB, S3
docs/                        architecture diagrams
outputs/learning/            experiment datasets and policy outputs

How to evaluate this repository in 10 minutes

Run python3 scripts/demo_learning_loop.py and observe that every run produces structured artifacts
Check outputs/learning/ — trajectories are exported as a clean dataset
Run python3 scripts/demo_policy_guided_trading_agent.py and observe decisions being guided by prior evidence
Open any decision.json artifact — the decision, trajectory, and state deltas are all explicit and inspectable
The model does not change. The system improves through architecture.

Crucible project

ewm-core is the foundation layer of the Crucible research project — three repos, three essays, two domains.

crucible-ewm — this repo
crucible-player-coach — adversarial quality loop (Phase 3)
crucible-autoresearcher — meta-loop improvement (Phase 4)

Project status

Current milestone: v0.8.5.1 — Policy-Guided Agent

The learning loop is complete. Experiments produce evidence. Evidence becomes policy. Policy informs future decisions.

The next step is replacing the deterministic MarketPathEnvironment with a learned world model — at which point the environment itself becomes a predictive system rather than a replay. The experimental architecture remains unchanged.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.0

Apr 20, 2026

0.1.0

Apr 19, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ewm_core-0.2.0.tar.gz (59.4 kB view details)

Uploaded Apr 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ewm_core-0.2.0-py3-none-any.whl (77.1 kB view details)

Uploaded Apr 20, 2026 Python 3

File details

Details for the file ewm_core-0.2.0.tar.gz.

File metadata

Download URL: ewm_core-0.2.0.tar.gz
Upload date: Apr 20, 2026
Size: 59.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for ewm_core-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`f2935c9462372c9f50026290e89f80e4fe61b56ea55dc34016e2f5ccc23d51ef`
MD5	`b2dfbf650d8bf632884933cea6941ae7`
BLAKE2b-256	`fbaa4db779e0bcad559f7dff4ae8d2b70d0de09d57ca6a145e3ec374db4a4acd`

See more details on using hashes here.

File details

Details for the file ewm_core-0.2.0-py3-none-any.whl.

File metadata

Download URL: ewm_core-0.2.0-py3-none-any.whl
Upload date: Apr 20, 2026
Size: 77.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for ewm_core-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`16dc7e2c051363dfc68ba705d69d753cf4e6cb423f5d9df8cdd1081a068f06a0`
MD5	`c3b2bce1cbc99767954eaeab59451fa4`
BLAKE2b-256	`0630c022c5ac67809703d70f22072b538d0b0bc9031fe365e12f1c68c1218074`

See more details on using hashes here.

ewm-core 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ewm-core — Executable World Models

Part of the Crucible project

Install

Agent modes

Live demo

Quick start

What this repository builds

Releases

Essays

Architecture

The learning loop

Setup

Local demo

Evidence policy demo (v0.8.5)

Policy-guided agent demo (v0.8.5.1)

AWS deployment

Repository structure

How to evaluate this repository in 10 minutes

Crucible project

Project status

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes