NormCore is a Python library for deterministic, auditable evaluation of agent speech-act admissibility based on modality and grounding.

NormCore

NormCore implements a deterministic normative admissibility evaluator for agent speech acts.

Given:

an agent utterance
a trajectory that includes externally observed tool results

it produces an admissibility judgment under a fixed set of axioms (A4–A7).

It evaluates participation legitimacy, not semantic truth or task correctness.

Specification

NormCore tracks the IETF Internet-Draft:

Normative Admissibility Framework for Agent Speech Acts

Important:

This is an Internet-Draft (work in progress), not an RFC.
Axiom labels used in this repository (A4, A5, A6, A7) follow that draft.
If draft wording changes in future revisions, repository behavior may be updated accordingly.

Installation

From PyPI:

pip install normcore

From source:

uv sync

or:

pip install -e .

What this is

NormCore is:

deterministic and auditable (no embeddings, no semantic inference)
form-based (statement modality drives the checks)
grounding-based (licensing comes only from observed evidence)
lexicographic (one violation makes the whole act inadmissible)
an operational judgment gate for grounded agent outputs

What this is NOT

NormCore does not:

verify semantic truth
score output quality or usefulness
infer intent, reasoning, or “why”
do ranking / grading / reward modeling
allow agent text to license itself
generate code or assess code quality as such

If you need “is this answer good/correct?”, this is the wrong tool.

Normative boundary

NormCore answers one question only:

Was the agent allowed to speak in this form, given what it observed?

It does not answer whether the statement is semantically true, useful, or optimal. In practice, this targets operational decision statements grounded in observed tool/file evidence, not code-generation capability evaluation.

Why this framework exists

NormCore is intended as part of the control plane for agentic systems: an explicit, deterministic gate on whether an agent is normatively allowed to make an operational claim from observed grounds.

Hard invariants

Grounding is externally observable only

Tool outputs from the trajectory can become knowledge used for licensing.
External grounds from the public API are also allowed (for example file/url evidence from an upstream RAG pipeline).
Grounds are linked only when the assistant text cites their citation_key in [@key] format.
Personalization / memory / preferences / profiles are non-epistemic and must not become grounding.

Grounding semantics in this project:

grounding is not truth verification
grounding is not semantic relevance matching
grounding is admissible observed evidence for normative licensing
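To make the [@key] citation rule above concrete, here is a minimal sketch of how cited keys could be pulled out of assistant text and compared against the keys of available grounds. This is not NormCore's implementation: the regular expression (letters, digits, underscores, and hyphens only) and the helper names `cited_keys` and `split_by_availability` are assumptions made for illustration.

```python
import re

# Assumption: citation keys consist of letters, digits, underscores, and hyphens.
CITATION_PATTERN = re.compile(r"\[@([A-Za-z0-9_-]+)\]")


def cited_keys(text: str) -> list:
    """Return citation keys in order of first appearance, without duplicates."""
    seen = []
    for key in CITATION_PATTERN.findall(text):
        if key not in seen:
            seen.append(key)
    return seen


def split_by_availability(text: str, available_keys: set) -> tuple:
    """Split cited keys into those backed by a known ground and those that are not."""
    cited = cited_keys(text)
    linked = [key for key in cited if key in available_keys]
    dangling = [key for key in cited if key not in available_keys]
    return linked, dangling


text = "Compare today [@callWeatherNYC] and archive [@file_weather_2025]."
linked, dangling = split_by_availability(text, {"file_weather_2025"})
print(linked)    # ['file_weather_2025']
print(dangling)  # ['callWeatherNYC']
```

A dangling key here only means that no ground with that key was supplied; it says nothing about whether the statement is true.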

Current limitations

Language coverage is currently English-first for form detection. Normative indicator extraction and modality heuristics are implemented with English lexical markers (for example should, must, recommend, if ... then, refusal phrases).
Non-English outputs can be under-detected and may return status="no_normative_content" even when the utterance is normatively meaningful.
For now, evaluate in English when you need strict behavior, or extend indicator patterns in src/normcore/normative/statement_extractor.py and src/normcore/normative/modality_detector.py for your target language.
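The under-detection described above follows directly from lexical, form-based matching. The toy detector below is a sketch under stated assumptions, not the code in statement_extractor.py or modality_detector.py: the two patterns and the function name `toy_modality` are hypothetical, and they cover only a handful of the English markers listed above.

```python
import re

# Toy English-only markers (assumed for illustration; NormCore's real patterns may differ).
CONDITIONAL = re.compile(r"\bif\b.*\b(then|should|must)\b", re.IGNORECASE)
ASSERTIVE = re.compile(r"\b(should|must|recommend)\b", re.IGNORECASE)


def toy_modality(text: str):
    """Return a modality label from surface form alone, or None if no marker matches."""
    if CONDITIONAL.search(text):
        return "conditional"
    if ASSERTIVE.search(text):
        return "assertive"
    return None


print(toy_modality("If the deployment is blocked, we should roll back."))  # conditional
print(toy_modality("We should deploy now."))                               # assertive
print(toy_modality("Nous devrions déployer maintenant."))                  # None
```

The French sentence carries the same normative force as the second example, but no English marker matches, so a purely lexical detector sees no normative content. Extending the patterns for a target language closes that gap.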

Entry point (public API)

from normcore import evaluate

judgment = evaluate(
    conversation=trajectory,
)

Implementation: src/normcore/evaluator.py

Normative pipeline: src/normcore/normative/

Inputs

evaluate() consumes:

agent_output (optional): assistant output string
conversation (optional): full chat history as an OpenAI Chat Completions message list; the last message must be from the assistant
grounds (optional): external grounds as OpenAI annotations (file/url citations)

At least one of agent_output or conversation is required. If both are provided, agent_output must exactly match the content of the last assistant message in conversation.

Grounding is built from trajectory tool results plus optional external grounds.
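The input rules above can be checked before calling evaluate(). The following is a minimal sketch of such a pre-check; the helper name `resolve_agent_output` and the choice of ValueError are assumptions for illustration, and NormCore's own validation may report these cases differently.

```python
def resolve_agent_output(agent_output=None, conversation=None):
    """Apply the documented input rules and return the text that would be evaluated."""
    if agent_output is None and conversation is None:
        raise ValueError("provide agent_output, conversation, or both")
    if conversation is None:
        return agent_output
    if not conversation:
        raise ValueError("conversation must contain at least one message")

    last = conversation[-1]
    if last.get("role") != "assistant":
        raise ValueError("the last conversation message must be from the assistant")

    last_content = last.get("content")
    if agent_output is not None and agent_output != last_content:
        raise ValueError("agent_output must exactly match the last assistant content")
    return last_content


conversation = [
    {"role": "user", "content": "Can we ship?"},
    {"role": "assistant", "content": "We should deploy now."},
]
print(resolve_agent_output(conversation=conversation))  # We should deploy now.
```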

Usage

from normcore import evaluate

agent_message = {
    "role": "assistant",
    "content": "Issue 123 is blocked, so we should fix it first.",
}

trajectory = [
    {
        "role": "assistant",
        "tool_calls": [
            {
                "id": "tool_1",
                "type": "function",
                "function": {"name": "get_issue", "arguments": "{\"issue_id\": 123}"},
            }
        ],
    },
    {
        "role": "tool",
        "tool_call_id": "tool_1",
        "content": "{\"issue_id\": 123, \"status\": \"Blocked\"}",
    },
    agent_message,
]

judgment = evaluate(conversation=trajectory)
print(judgment.status)
print(judgment.licensed)
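To show which parts of such a trajectory count as externally observed, the sketch below pairs each tool message with the tool call that produced it. It does not import NormCore and is not its extractor; the helper name `tool_results` and the output shape are assumptions for illustration.

```python
import json


def tool_results(trajectory):
    """Pair each tool message with the name of the call that produced it."""
    call_names = {}
    results = []
    for message in trajectory:
        role = message.get("role")
        if role == "assistant":
            # Remember which function each tool_call id refers to.
            for call in message.get("tool_calls") or []:
                call_names[call["id"]] = call["function"]["name"]
        elif role == "tool":
            raw = message.get("content", "")
            try:
                content = json.loads(raw)
            except (TypeError, ValueError):
                content = raw  # keep non-JSON tool output as-is
            call_id = message.get("tool_call_id")
            results.append(
                {"tool_call_id": call_id, "tool_name": call_names.get(call_id), "content": content}
            )
    return results


trajectory = [
    {
        "role": "assistant",
        "tool_calls": [
            {
                "id": "tool_1",
                "type": "function",
                "function": {"name": "get_issue", "arguments": "{\"issue_id\": 123}"},
            }
        ],
    },
    {"role": "tool", "tool_call_id": "tool_1", "content": "{\"issue_id\": 123, \"status\": \"Blocked\"}"},
    {"role": "assistant", "content": "Issue 123 is blocked, so we should fix it first."},
]

print(tool_results(trajectory))
```

Only the tool message contributes an observed result here; the final assistant message is the utterance being judged and contributes none.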

Canonical examples

Unlicensed assertive (violates_norm):

judgment = evaluate(
    conversation=[{"role": "assistant", "content": "We should deploy now."}]
)
# Expected: status="violates_norm"

Self-licensing attempt (violates_norm):

judgment = evaluate(
    conversation=[{"role": "assistant", "content": "I believe we should deploy now."}]
)
# Expected: status="violates_norm" (agent text alone does not license itself)

Conditional downgrade (conditionally_acceptable):

judgment = evaluate(
    conversation=[{"role": "assistant", "content": "If the deployment is blocked, we should roll back."}]
)
# Expected: status="conditionally_acceptable"

CLI

Quick phrase check from terminal:

normcore evaluate --agent-output "The deployment is blocked, so we should fix it first."

This command prints the AdmissibilityJudgment as JSON.

CLI parameters:

--log-level: enable diagnostics in stderr (CRITICAL|ERROR|WARNING|INFO|DEBUG)
-v, -vv: shorthand verbosity (-v = INFO, -vv = DEBUG)
--agent-output: agent output text (string)
--conversation: conversation history as a JSON array; the last item must be an assistant message
--grounds: grounds payload as JSON array of OpenAI annotations

Sanity rule:

if both --agent-output and --conversation are provided, --agent-output must exactly match the last assistant content in --conversation.

Conversation example:

normcore evaluate --conversation '[{"role":"user","content":"Weather in New York?"},{"role":"assistant","content":"Use umbrella [@callWeatherNYC]."}]'

Conversation + external grounds example:

normcore evaluate \
  --conversation '[{"role":"user","content":"Weather in New York today vs last year?"},{"role":"assistant","content":"Compare today [@callWeatherNYC] and archive [@file_weather_2025]."}]' \
  --grounds '[{"type":"file_citation","file_id":"file_weather_2025","filename":"ny_weather_2025.txt","index":0}]'
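Hand-quoting JSON on the command line is error-prone, so it can help to build the arguments programmatically. The sketch below assembles the same command as above using only the flags documented in this section; the helper name `build_evaluate_argv` is an assumption for illustration.

```python
import json
import shlex


def build_evaluate_argv(conversation, grounds=None):
    """Build the argument list for `normcore evaluate` from Python objects."""
    argv = ["normcore", "evaluate", "--conversation", json.dumps(conversation)]
    if grounds is not None:
        argv += ["--grounds", json.dumps(grounds)]
    return argv


conversation = [
    {"role": "user", "content": "Weather in New York today vs last year?"},
    {"role": "assistant", "content": "Compare today [@callWeatherNYC] and archive [@file_weather_2025]."},
]
grounds = [
    {"type": "file_citation", "file_id": "file_weather_2025", "filename": "ny_weather_2025.txt", "index": 0}
]

argv = build_evaluate_argv(conversation, grounds)
print(shlex.join(argv))  # shell-quoted form, safe to paste into a terminal
```

The list can be passed to subprocess.run(argv, capture_output=True, text=True); because diagnostics go to stderr, stdout can then be parsed with json.loads.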

Version:

normcore --version

Logging:

Library mode is silent by default (no log handlers are configured).
CLI diagnostics go to stderr so JSON in stdout stays machine-parseable.
Use -v / -vv, or --log-level.
NORMCORE_LOG_LEVEL is supported as an environment-variable fallback.

normcore -vv evaluate --agent-output "We should deploy now."
NORMCORE_LOG_LEVEL=INFO normcore evaluate --agent-output "We should deploy now."
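Because library mode configures no handlers, a host application that wants diagnostics has to attach its own. The sketch below shows one way to do that with the standard logging module. It assumes the library logs under a logger named "normcore" (not confirmed by this document), and reading NORMCORE_LOG_LEVEL in library mode is a choice made here, not documented behavior. The helper name `enable_normcore_logs` is hypothetical.

```python
import logging
import os


def enable_normcore_logs(level=None):
    """Attach a stderr handler to the assumed 'normcore' logger and set its level."""
    level_name = (level or os.environ.get("NORMCORE_LOG_LEVEL") or "WARNING").upper()
    logger = logging.getLogger("normcore")  # assumption: the library's logger name
    if not logger.handlers:
        handler = logging.StreamHandler()  # StreamHandler writes to stderr by default
        handler.setFormatter(logging.Formatter("%(levelname)s %(name)s: %(message)s"))
        logger.addHandler(handler)
    logger.setLevel(level_name)  # raises ValueError for an unknown level name
    return logger


logger = enable_normcore_logs("INFO")
logger.info("diagnostics enabled")
```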

Codex smoke workflow (reproducible)

This repository includes a practical smoke path to evaluate a real codex exec conversation with NormCore.

Quick run:

MODEL=gpt-5.3-codex REASONING_EFFORT=medium scripts/smoke_codex_pypi_normcore.sh

What this does:

runs codex exec --json with a release-readiness prompt
saves raw event stream to context/*.jsonl
converts event stream to a Chat Completions-style conversation JSON
runs normcore.evaluate on the converted conversation
writes a final context/*.judgment.json

Required response shape for the Codex review prompt:

the first sentence must be the publish recommendation / judgment
all justification comes after that first sentence

Generated artifacts:

context/<run-id>.jsonl (raw codex events)
context/<run-id>.stderr.log (codex stderr)
context/<run-id>.conversation.json (converted conversation for NormCore)
context/<run-id>.judgment.json (NormCore output)

Manual step-by-step (same flow):

# 1) Run Codex and capture live event JSONL
echo "your prompt" | codex exec \
  --model gpt-5.3-codex \
  --cd . \
  --skip-git-repo-check \
  --json \
  -c 'effort="medium"' \
  > context/run.jsonl 2> context/run.stderr.log

# 2) Convert Codex events to NormCore conversation format
.venv/bin/python scripts/codex_exec_events_to_conversation.py \
  context/run.jsonl \
  -o context/run.conversation.json

# 3) Evaluate with NormCore
scripts/evaluate_history.sh \
  context/run.conversation.json \
  --log-level DEBUG \
  -o context/run.judgment.json

File citation contract for grounding

If you want NormCore to validate file-based claims, request explicit citations in assistant text using [@key].

Recommended key format:

[@file_<hash12>]
hash12 is the first 12 hex chars of sha256(normalized_repo_relative_file_path)

Normalization rules for hashing:

remove leading ./
use / separators
hash repo-relative path

Important:

the model should compute keys via tools (not invent them)
the key in assistant text must match the grounding key exactly
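The key recipe above can be sketched in a few lines, which is also the kind of tool a model could call instead of inventing keys. The helper names below are hypothetical, UTF-8 encoding of the path is an assumption, and reusing the key as the file_id of a file_citation ground is inferred from the --grounds example in the CLI section rather than stated as a rule.

```python
import hashlib


def normalize_repo_path(path: str) -> str:
    """Use / separators and drop any leading ./ from a repo-relative path."""
    normalized = path.replace("\\", "/")
    while normalized.startswith("./"):
        normalized = normalized[2:]
    return normalized


def file_citation_key(path: str) -> str:
    """Return file_<hash12>, where hash12 is the first 12 hex chars of sha256(path)."""
    # Assumption: the normalized path is hashed as UTF-8 bytes.
    digest = hashlib.sha256(normalize_repo_path(path).encode("utf-8")).hexdigest()
    return f"file_{digest[:12]}"


def file_ground(path: str, index: int = 0) -> dict:
    """Build a file_citation annotation whose file_id doubles as the citation key (assumed)."""
    normalized = normalize_repo_path(path)
    return {
        "type": "file_citation",
        "file_id": file_citation_key(path),
        "filename": normalized.rsplit("/", 1)[-1],
        "index": index,
    }


key = file_citation_key("./src/normcore/evaluator.py")
ground = file_ground("./src/normcore/evaluator.py")
print(f"[@{key}]")  # the exact token the assistant text should contain
```

Because normalization runs before hashing, ./src/normcore/evaluator.py and src\normcore\evaluator.py yield the same key.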

Output

evaluate() returns an AdmissibilityJudgment object; the CLI prints it as JSON.

Top-level fields

| Field | Meaning |
| --- | --- |
| `status` | Final verdict for the whole response. |
| `licensed` | Whether grounding permitted the chosen normative form(s). |
| `can_retry` | Whether reformulation is recommended. |
| `statement_evaluations` | Per-statement trace (how each statement was judged). |
| `feedback_hint` | Optional retry hint when reformulation is useful. |
| `violated_axioms` | List of violated axioms at aggregate level. |
| `explanation` | Human-readable summary of final verdict. |
| `num_statements` | Count of evaluated normative statements. |
| `num_acceptable` | Count of statements with acceptable outcomes. |
| `grounds_accepted` | Count of grounds admitted into the evidence pool. |
| `grounds_cited` | Count of admitted grounds actually cited in text (`[@key]`). |

`statement_evaluations[]` fields

| Field | Meaning |
| --- | --- |
| `statement_id` | Stable statement identifier (`final_response` or `refusal`). |
| `statement` | Statement text that was evaluated. |
| `modality` | Detected modality (`assertive`, `conditional`, `refusal`, `descriptive`). |
| `license` | Modalities permitted by current grounding. |
| `status` | Verdict for this statement. |
| `violated_axiom` | Violated axiom for this statement, if any. |
| `explanation` | Human-readable reason for this statement verdict. |
| `grounding_trace` | Evidence nodes considered for this statement. |
| `subject` / `predicate` | Internal normalized statement shape. |

`grounding_trace[]` fields

| Field | Meaning |
| --- | --- |
| `id` | Internal ground node ID. |
| `scope` | Ground scope (currently factual in runtime flow). |
| `source` | Ground source class (for example observed). |
| `status` | Ground node status (for example confirmed). |
| `confidence` | Numeric confidence value attached to node. |
| `strength` | Node strength label used by licensing logic. |
| `semantic_id` | External/semantic ID used for link resolution. |

How to read common outcomes

status="acceptable" + licensed=true + can_retry=false: response is normatively admissible as-is.
status="conditionally_acceptable" + licensed=true: agent used conditional framing and stayed within license.
status="unsupported" + can_retry=true: missing/insufficient grounding; ask for more context or weaken claim form.
status="violates_norm" + can_retry=true: hard normative violation (for example unlicensed assertive claim).
status="no_normative_content": protocol-only response; no normative claim was evaluated.

Pipeline (fixed)

1. Extract tool results from the trajectory
2. Build grounding (KnowledgeStateBuilder)
3. Extract normative participation (protocol filtered)
4. Detect modality (form-based)
5. Match candidate grounds (relevance only)
6. Derive license (sufficiency only)
7. Apply axioms A4–A7
8. Aggregate lexicographically
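The final aggregation step can be illustrated with a worst-status-wins rule: one violating statement makes the whole act inadmissible, regardless of how many other statements are acceptable. The sketch below is an illustration under stated assumptions, not NormCore's aggregator. The severity ordering between the intermediate statuses and the helper name `aggregate` are assumed.

```python
# Assumed severity ordering, from least to most severe.
SEVERITY = {
    "acceptable": 0,
    "conditionally_acceptable": 1,
    "unsupported": 2,
    "violates_norm": 3,
}


def aggregate(statement_statuses: list) -> str:
    """Return the most severe per-statement status; no statements means nothing was judged."""
    if not statement_statuses:
        return "no_normative_content"
    # max() by severity: a single violation outranks any number of acceptable statements.
    # An unknown status raises KeyError rather than being silently ranked.
    return max(statement_statuses, key=SEVERITY.__getitem__)


print(aggregate(["acceptable", "violates_norm", "acceptable"]))  # violates_norm
print(aggregate(["acceptable", "conditionally_acceptable"]))     # conditionally_acceptable
print(aggregate([]))                                             # no_normative_content
```

No averaging or scoring takes place, which matches the statement that NormCore does not rank or grade outputs.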

Project structure

src/normcore/evaluator.py: orchestration + public entrypoint
src/normcore/models/: judgment + message models
src/normcore/normative/: modality, grounding, licensing, axioms
src/normcore/cli.py: command-line interface (normcore)

Release history

0.2.0: Feb 22, 2026
0.1.1: Feb 8, 2026
0.1.0 (this version): Feb 8, 2026

Download files

Source Distribution

normcore-0.1.0.tar.gz (61.9 kB)

Uploaded Feb 8, 2026 Source

Built Distribution

normcore-0.1.0-py3-none-any.whl (59.6 kB)

Uploaded Feb 8, 2026 Python 3

File details

Details for the file normcore-0.1.0.tar.gz.

File metadata

Download URL: normcore-0.1.0.tar.gz
Upload date: Feb 8, 2026
Size: 61.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.17

File hashes

Hashes for normcore-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`e82a5d2f93ca6cb89cc931230ac645bad1c076fce7d248a502c41da086eacccb`
MD5	`a8e26f3be2758b175437df286dfda2f4`
BLAKE2b-256	`ba924f0070b03863b77b01c8861b97a986f8a258781a28e973699b0c6b29906c`

File details

Details for the file normcore-0.1.0-py3-none-any.whl.

File metadata

Download URL: normcore-0.1.0-py3-none-any.whl
Upload date: Feb 8, 2026
Size: 59.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.17

File hashes

Hashes for normcore-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`029eccd1ec378052408fdc0c66bd49827730cb7b6eb3b4a2b8ff7b5e56e5cf08`
MD5	`b8ec3435998a1db46f1b5a8f7dae4fb1`
BLAKE2b-256	`14b915042d9c8f62a487e7c4b819cdc9f760ab5dddff074e712e5bf7653a6ba8`
