Decision-contracts library: deterministic verdict/gate/failure/promotion logic for autoresearch loops.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

autoresearch-core

A tiny, pure-Python decision-contracts library for autoresearch / agentic loops: a deterministic verdict (metric / comparator / target), failure classification, gates, and promotion record shapes — the disciplined decision core, with zero runtime dependencies and no I/O.

You bring the loop, the retrieval, the runner, and the storage; you bind them to the library's Protocols and call measure / decide / should_promote_dead_end at your decision points. The verdict logic is parity-tested against the GRD autoresearch loop.

Why

Agentic research loops fail in a predictable way: the model grades its own homework. An LLM proposes a hypothesis, runs an experiment, then judges whether the result supports the hypothesis — and judgment drifts. autoresearch-core removes the judge from the control path:

Every hypothesis must carry a machine-readable contract (MetricSpec: which metric, which comparator, which target).
Experiments report results through a machine-readable line (__RESULT__ {"accuracy": 0.93} on stdout).
The verdict is computed, not judged: metric vs target → supported / refuted / inconclusive.
Only a deterministic refutation may auto-promote a dead-end. Anything judged by an LLM or inferred from an exit code is advisory.

Install

pip install autoresearch-core

Requires Python 3.11+. No runtime dependencies. Fully typed (py.typed).

Quickstart

from autoresearch_core import (
    MetricSpec, ExperimentResult, measure, parse_metrics_line, should_promote_dead_end,
)

spec = MetricSpec(metric_key="recall_at_10", comparator=">=", target=0.8)

# An experiment prints `__RESULT__ {"recall_at_10": 0.83}` on stdout:
metrics = parse_metrics_line(stdout)        # -> {"recall_at_10": 0.83}
verdict = measure(spec, ExperimentResult(metrics=metrics, exit_code=0))

verdict.verdict          # "supported" | "refuted" | "inconclusive"  (deterministic)
verdict.evidence_level   # "deterministic"
should_promote_dead_end(verdict)            # True only for a deterministic refutation

Documentation

QUICKSTART — zero to a working deterministic verdict in five minutes, with a complete runnable script.
TUTORIAL — build a full hypothesis → experiment → measure → learn loop on top of the library: contracts, failure classes, gates, dead-end promotion, infrastructure ports, and custom verdict strategies.
CHANGELOG

What it owns (and what it doesn't)

Owns — the decision discipline:

Module	Public surface	Job
`types`	`MetricSpec`, `ExperimentResult`, `VerdictRecord`, `Hypothesis`, `Takeaway`, `GateState`, `GateCheck`	Frozen dataclasses; pure data, no logic
`contract`	`parse_metrics_line`, `validate_metric_spec`	The `__RESULT__ {json}` experiment-result contract
`verdict`	`compare`, `DeterministicVerdict`, `VerdictStrategy`	Metric vs target → supported / refuted / inconclusive
`failures`	`classify_run_failure`	stderr → `H2` (missing dep) / `H3` (missing file / permission) / `H4` (timeout / runtime) / `none`
`gates`	`resolve_gates`, `check_gate`	Approval gates (`execute`, `kg_write`) resolved from config
`policy`	`measure`, `decide`, `decide_branch`, `should_terminate`, `detect_plateau`, `should_promote_dead_end`	The loop's branch / terminate / promote decisions
`promote`	`DeadEndRecord`, `KnowhowRecord`, `approach_hash`, `build_dead_end_record`, `should_skip`	Promotion record shapes + approach dedupe

Doesn't own — bind these via ports.py Protocols to your own infra: Spawn (LLM call), Retriever, KnowledgeGraph, ExperimentRunner, Store. No implementations ship in this package; the tutorial shows minimal bindings.

Verdict authority

DeterministicVerdict is the default and the reason this package exists. Other strategies (an LLM judge, an exit-code check) can be plugged in via the VerdictStrategy protocol, but only a deterministic refutation auto-promotes a dead-end — non-deterministic verdicts are advisory. Every verdict records its strategy and evidence_level, so the decision trail stays auditable.

Development

pip install -e ".[dev]"
pytest -q --cov=autoresearch_core

The test suite includes a parity suite (tests/test_parity.py) that pins behaviour to the GRD TypeScript implementation.

License

MIT © Cameleon X — see LICENSE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

jokerized

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.4.4

Jun 8, 2026

0.4.3

Jun 6, 2026

This version

0.1.2

Jun 4, 2026

0.1.1

Jun 3, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoresearch_core-0.1.2.tar.gz (20.2 kB view details)

Uploaded Jun 4, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

autoresearch_core-0.1.2-py3-none-any.whl (11.9 kB view details)

Uploaded Jun 4, 2026 Python 3

File details

Details for the file autoresearch_core-0.1.2.tar.gz.

File metadata

Download URL: autoresearch_core-0.1.2.tar.gz
Upload date: Jun 4, 2026
Size: 20.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for autoresearch_core-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`352df1f3c13f9ad9ac94476825b537c674ab6d3e62219f3c8df6709b6f7fc444`
MD5	`47d93a16f904b7c030810adfbbf90b02`
BLAKE2b-256	`9d503552db3d907c957768007653d5b68202ea0850b658a5655c2a5177803fe6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for autoresearch_core-0.1.2.tar.gz:

Publisher: publish.yml on ca1773130n/autoresearch-core

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: autoresearch_core-0.1.2.tar.gz
- Subject digest: 352df1f3c13f9ad9ac94476825b537c674ab6d3e62219f3c8df6709b6f7fc444
- Sigstore transparency entry: 1713349401
- Sigstore integration time: Jun 4, 2026
Source repository:
- Permalink: ca1773130n/autoresearch-core@5cc1924c8b62dcc1e3cf307a1d9d176633eadaa6
- Branch / Tag: refs/tags/v0.1.2
- Owner: https://github.com/ca1773130n
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5cc1924c8b62dcc1e3cf307a1d9d176633eadaa6
- Trigger Event: push

File details

Details for the file autoresearch_core-0.1.2-py3-none-any.whl.

File metadata

Download URL: autoresearch_core-0.1.2-py3-none-any.whl
Upload date: Jun 4, 2026
Size: 11.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for autoresearch_core-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c9c99a3551538e6d789ba25858b5435156af0aa61505dbb5f36ab3a3313207c3`
MD5	`e5f31dda9ad43469f0df826c630d0f13`
BLAKE2b-256	`058288630a3e13b42466cfa18b68bf23f7579e00ae07d969ec1886287a0689c4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for autoresearch_core-0.1.2-py3-none-any.whl:

Publisher: publish.yml on ca1773130n/autoresearch-core

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: autoresearch_core-0.1.2-py3-none-any.whl
- Subject digest: c9c99a3551538e6d789ba25858b5435156af0aa61505dbb5f36ab3a3313207c3
- Sigstore transparency entry: 1713349539
- Sigstore integration time: Jun 4, 2026
Source repository:
- Permalink: ca1773130n/autoresearch-core@5cc1924c8b62dcc1e3cf307a1d9d176633eadaa6
- Branch / Tag: refs/tags/v0.1.2
- Owner: https://github.com/ca1773130n
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5cc1924c8b62dcc1e3cf307a1d9d176633eadaa6
- Trigger Event: push

autoresearch-core 0.1.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

autoresearch-core

Why

Install

Quickstart

Documentation

What it owns (and what it doesn't)

Verdict authority

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance