Governance patterns for autonomous AI agents in health-insurance / payer operations (aligned to the NAIC Model Bulletin framework)

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

linus10xp

These details have not been verified by PyPI

Project description

payer-agent-audit

The audit record for a UM, prior-auth, or claims/appeals decision when an AI agent touches it — not the medical-necessity call.

coverage 100% tests 156 License: MIT OR Apache-2.0 Python 3.12+

What this is — the audit record for a UM / prior-auth / claims decision: it records whether a human clinician was present and attested, whether the decision was timely under the rule that governs this plan, and whether appeal rights were afforded. It writes every check to a hash-chain ledger that detects tampering within its trust boundary.

What this is not — it makes NO medical-necessity or clinical determination. Not the coverage decision, not a medical device, not FDA-cleared software, not legal advice, not a deployed control. Detection, not prevention. Recordkeeping, not medical necessity. Reference IP to adapt, not a product to install.

Who this is for — a health plan's compliance, model-risk, or engineering lead putting autonomy into UM/PA/claims workflows, and the diligence teams who have to assess one.

30-second tour

Most governance tooling ships a dashboard and a compliance checkbox. This ships a hash-chain evidence ledger, an adversarial probe per primitive, and a written list of the things it deliberately does not do. Five domain-agnostic governance primitives (level-gate · sovereign veto · hash-chain ledger · DEFCON · effective-challenge harness) carry three health-payer controls (UM timeliness · clinician-of-record · appeal/IRO) on top. Funding-type-aware: the same denial routes to CMS-0057-F, ERISA, or a state DOI clock depending on who funds the plan.

156 tests · 100% coverage · 14/14 mutation kill · 5 AL-PROBES · golden corpus of real public matters (Lokken v. UnitedHealth, Kisting-Leung v. Cigna) · mypy --strict · py.typed · zero runtime deps · 4 SHA-pinned security workflows.

Read me first

A UM-timeliness test, the breach not the happy path — an autonomous decision that blows the deadline, and the ledger that records it:

from payer_agent_audit.governance import AuditChain
from payer_agent_audit.payer import UMTimelinessControl, FundingType, RequestCategory
from datetime import datetime, timedelta, UTC

chain = AuditChain(deployer_id="acme-health-prod")          # hardened genesis
received = datetime(2026, 6, 1, 8, 0, tzinfo=UTC)
result = UMTimelinessControl(chain).check(
    funding_type=FundingType.MEDICARE_ADVANTAGE,
    category=RequestCategory.EXPEDITED_URGENT,              # CMS-0057-F 72h
    request_received_at=received,
    decision_made_at=received + timedelta(hours=80),        # 80h > 72h
    case_ref="PA-12345",
)
assert result.met is False                                  # breach, recorded to the chain
assert chain.verify()

WORKED_EXAMPLE.md — the full path end to end: a decision class, an agent acting, the envelope catching the out-of-envelope case, the audit entry, and the demotion. Runnable: python3 examples/worked_example.py.
autonomy-ladder.io — the framework, the whitepaper, and the six-vertical family this library belongs to. Primitive-to-rung mapping: AUTONOMY_LADDER.md.

Install

pip install payer-agent-audit          # zero runtime dependencies
payer-audit info
payer-audit obligations --funding self_funded_erisa --category standard_preservice

Runnable end-to-end: examples/quickstart_um_timeliness.py and examples/worked_example.py.

Why this exists for frontier autonomy stacks

The controls in this library are domain-agnostic. The DEFCON state machine, the non-overridable sovereign veto (a separate-process control the agent cannot switch off), the hash-chain audit ledger (it detects tampering within its trust boundary), the hard envelopes with mechanical escalation, the sampled-review tripwires, and monitor-led promotion were forged in real multi-agent production systems under consequence — and they apply directly to any high-stakes coordinated autonomy (vehicles, robots, agent swarms) where invisible promotion or cascade failure is unacceptable. The decision class is a parameter: this repo encodes it for health-insurance payer — utilization management, prior auth, appeals, but the same A0→A4 deployment-authority structure lifts into any decision class without inheriting financial-services assumptions.

Framework + whitepaper: autonomy-ladder.io
Non-financial demo (under 60s): finserv-agent-audit/examples/agent_coordination — the same veto / envelope / audit-chain / demotion primitives on a generic agent swarm.

For reviewers & safety teams: every control here is falsifiable — the test suite (156 tests · 100% coverage · 14/14 mutation kill) turns each rule into a runnable check, and the veto and ledger are infrastructure with operational properties (separate process boundary, distinct credentials, a gate the agent cannot reach; write-once retention). These are reference implementations for adoption, not deployed production controls.

Part of the Autonomy Ladder™ family

Six co-equal regulated-vertical reference libraries implementing the Autonomy Ladder — a governance framework for autonomous AI in regulated operations (A0→A4, every rung demotable). Framework + whitepaper: autonomy-ladder.io.

Vertical	Library
Cross-vertical financial services	`finserv-agent-audit`
Banking (model risk · ECOA/Reg B · BSA/AML/OFAC)	`banking-agent-audit`
Payments (OFAC · Reg E · rail finality)	`payments-agent-audit`
Health-insurance payer (UM · prior auth · appeals)	`payer-agent-audit`
SEC-registered investment advisers (Advisers Act §206)	`private-capital-agent-audit`
Commercial real estate	`cre-agent-audit`

Why this exists
The five primitives
Health-payer controls
Funding-type obligation routing
Trust boundary — what's in, what's yours
Threat model at a glance
Regulatory mapping
What this is and is not
Testing
Who this is for
Architecture
Author & disclosures
License · Citation

Why this exists

Payers are putting autonomous and AI-assisted systems into utilization management, prior authorization, and claims adjudication support. The algorithmic-UM disputes now in litigation and on regulators' desks turn on the same question a regulator and a plaintiff both ask: can you show, on the record, that a denial which turned on medical judgment had a licensed clinician of record, that the decision was timely under the rule that governs this plan, and that appeal rights were afforded?

Where governance tooling typically ships a dashboard and a compliance checkbox, this ships a hash-chained evidence ledger, an adversarial probe per primitive, and a written list of the things it deliberately does not do.

This framework is the recordkeeping and process-gating answer to that question. It does not decide medical necessity. It refuses to let an autonomous agent issue a medical-judgment denial without an attested clinician of record, checks decision timeliness against the rule that the plan's funding type actually imposes, and writes every check to a hash-chained ledger. These are tested reference patterns — not academic proposals, and not a turnkey product. (Install and a first runnable breach are in Read me first, above.)

The five primitives

Built fresh to a corrected specification — each primitive ships with an adversarial probe (tests/adversarial/) that re-authors the exact failure mode an earlier-generation library admitted, and asserts this one refuses it. The defects are not described; they are tested against.

Primitive	Module	What it does
Autonomy-ladder level-gate	`governance/autonomy_ladder.py`	Refuses A2→A3 promotion when lower-level controls are unmet; requires independent attestation of each input (rejects self-attestation, stale, or evidence-less claims). Labeled advisory.
Sovereign veto	`governance/sovereign_veto.py`	Human kill switch; an agent cannot clear its own veto; a wired `Authorizer` is mandatory in production mode; `operator_id` is bound to an authenticated principal; durable state store documented.
Hash-chain ledger	`governance/audit_chain.py`	Tamper-detecting (within-trust-boundary) append-only chain; the verifier branches the genesis seed so both a deployer-keyed hardened chain and a legacy chain verify; production mode requires an external witness anchor.
DEFCON state machine	`governance/defcon.py`	Graduated autonomy throttle; immediate escalation, hardened de-escalation — a transition-direction guard forbids a one-call `HALT`/`SHUTDOWN → NORMAL`.
Effective-challenge harness	`governance/effective_challenge_harness.py`	Independent model challenge; enforces challenger ≠ primary (a model cannot self-challenge to a clean accept); records an operator independence attestation to the chain.

Health-payer controls

This is v1 — the health-insurance payer vertical. The three controls below sit on top of the five primitives and encode UM / prior-auth / appeals obligations. (P&C and Life & Annuity are a separate vertical on the roadmap — out of scope here, and the library says so rather than implying coverage it does not have.)

Control	Module	Governs
UM timeliness	`payer/um_timeliness.py`	Was the decision made within the deadline the plan's funding type imposes (CMS-0057-F / ERISA / state DOI)?
Clinician-of-record-on-denial	`payer/clinician_of_record.py`	A medical-judgment denial requires an attested, licensed clinician who actually reviewed the case — refused otherwise, and the refusal is itself recorded.
Appeal / IRO pathway	`payer/appeal_iro.py`	Internal-appeal + IRO external-review rights afforded; ERISA full-and-fair-review independence (the appeal reviewer is not the original decision-maker).

P&C / Life & Annuity coverage is on the roadmap and not yet shipped — see LIMITATIONS.md. This README does not claim it.

Funding-type obligation routing

The same denial carries different obligations depending on who funds the plan. The obligation map routes each decision to the correct regime:

Funding type	Primary regulator	UM-decision-timeliness anchor
Medicare Advantage	CMS	CMS-0057-F (72h expedited / 7-day standard, effective 2026-01-01)
Medicaid · CHIP managed care	CMS + State Medicaid	42 CFR 438.210(d) (72h expedited / 7-day standard on/after 2026-01-01; 14d before)
Self-funded (ERISA)	DOL (EBSA)	29 CFR 2560.503-1 (72h / 15d / 30d)
QHP on the FFE	HHS / CMS + state	No CMS-0057-F decision clock (QHP-FFE excluded); 45 CFR 147.136 governs appeals
Fully insured	State Department of Insurance	State UR statute (NAIC Model #073 framework) — deployer-supplied

A deployer may tighten a verified deadline (a stricter internal SLA), never loosen one past the regulatory floor.

Trust boundary — what's in, what's yours

The honesty thesis of this library is the boundary. Shipped-and-tested controls draw it; everything across it is handed back to you, explicitly, by design.

Component	In boundary (shipped & tested)	Across boundary (deployer-owned)
Five governance primitives (P1–P5)	✅
Three health-payer controls (UM / clinician / appeal)	✅
Hash-chain ledger (within-boundary tamper detection)	✅
External witness anchor (regenerated-chain detection)		⬜ deployer
Authorizer / IdP / KMS (the principal behind `operator_id`)		⬜ deployer
Durable veto state store (default is in-memory)		⬜ deployer
Second-line review process (challenger independence)		⬜ deployer

Three responsibilities the deployer must wire — production mode fails closed without them:

Authorizer / IdP / KMS. operator_id is only as strong as the authenticated-principal check behind it.
Durable state store. The default veto store is in-memory and lost on restart; wire your own.
External witness. The chain is internally consistent, but an attacker with write access can regenerate it end-to-end — only an out-of-band witness (OpenTimestamps / Rekor / a regulator log) makes that detectable.

Full diagram and the four-row deployer-responsibility table: ARCHITECTURE.md.

Threat model at a glance

Threat	In-boundary?	What stops it
Ledger edited after write	✅ shipped control	Recompute-on-load + `verify_strict`
Agent clears its own veto (incl. case / Unicode-confusable disguise)	✅ shipped control	Unconditional, normalized self-clear guard
One-call `HALT`/`SHUTDOWN → NORMAL`	✅ shipped control	Transition-direction guard (stepwise + re-authorized)
Whole chain regenerated by an attacker with write access	❌ across boundary	Only an external witness anchor — your wiring
Operator falsely attests an independent challenger	❌ across boundary	Second-line model-risk review — your process

Full 8-row matrix with regulatory mappings: FAILURE-MODES.md.

Regulatory mapping

Reference mappings to help a deployer point qualified counsel at relevant clauses; applicability is a deployer-and-counsel determination. Primary sources, with a verified flag in code:

NAIC Model Bulletin: Use of AI Systems by Insurers (adopted 2023-12-04; at least 24 states adopted, per the reg source verified 2026-06-03)
CMS-0057-F Interoperability and Prior Authorization Final Rule (89 FR 8758, 2024-02-08; RIN 0938-AU87; decision timeframes effective 2026-01-01)
Medicaid / CHIP managed care 42 CFR 438.210(d) (service-authorization timeframes; standard tightened to 7 days on/after 2026-01-01 by CMS-0057-F)
ERISA claims-procedure 29 CFR 2560.503-1 (DOL EBSA)
ACA internal/external review 45 CFR 147.136 (IRO pathway)
State utilization review — NAIC Utilization Review and Benefit Determination Model Act (#073)

What this is and is not

It makes no medical-necessity or clinical determination. The clinician's judgment belongs to the clinician; this framework governs whether that judgment is present, attested, timely, and appealable. A payer coverage decision is a benefit adjudication under insurance law, distinct from FDA medical-device regulation.
It is reference IP for adoption: documented, tested governance patterns with zero runtime dependencies.
It is not a deployed control, a medical device, FDA-cleared software, legal advice, or a guarantee of any regulatory outcome.

The framework is the governed pattern; the production deployment is the deployer's substrate underneath it. This repo gives you the first and refuses to pretend it gives you the second. See LIMITATIONS.md.

Testing

pip install -e ".[dev,test-property]"
pytest --cov=src/payer_agent_audit --cov-fail-under=90      # unit + property + golden + boundary
python3 scripts/mutation_check.py                            # mutation pass (kill score)

The suite includes unit + contract tests, property-based tests (thousands of generated cases per primitive), a golden corpus of public matters of record (each with a primary-source URL — including Estate of Gene B. Lokken v. UnitedHealth Group and Kisting-Leung v. Cigna), the five AL-PROBES under tests/adversarial/, and a payer-not-FDA-SaMD boundary scan. The gate is ≥90%; the suite currently runs at 156 tests, 100% line coverage, and a 14/14 (100%) mutation kill — coverage is a floor, not a finish line (see docs/ASSURANCE-CATALOG.md). The same checks run in CI on every push (badge above is live, not self-asserted).

Who this is for

Health-plan model-risk, compliance, and engineering teams putting autonomy into UM/PA/claims workflows who need an auditable governance substrate a regulator can read and they can adapt — and the diligence teams who have to assess one.

Not for you if you want a turnkey UM engine, a medical-necessity classifier, or a control you can deploy without wiring your own identity provider, durable store, and external witness. This is a substrate to adapt, not a product to install.

Architecture

See ARCHITECTURE.md for the trust-boundary diagram — the five primitives and three controls inside the boundary, and the deployer responsibilities (Authorizer/IdP, durable store, external witness, second-line process) explicitly outside it.

Author & disclosures

Authored by Kunjar Bhaduri through North Texas Capital Investments, an independent research effort. This is independent research; it is not produced on behalf of, and does not represent the views of, any employer or client, and contains no employer- or client-confidential material. The regulatory content is reference mapping, not legal advice — see DISCLAIMER.md.

License · Citation

Dual-licensed MIT OR Apache-2.0. If you use this framework in research or production, please cite it — see CITATION.cff. Trademark posture: docs/TRADEMARK.md.

Patterns are software, not legal advice.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

linus10xp

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.4

Jun 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

payer_agent_audit-0.1.4.tar.gz (88.9 kB view details)

Uploaded Jun 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

payer_agent_audit-0.1.4-py3-none-any.whl (59.2 kB view details)

Uploaded Jun 27, 2026 Python 3

File details

Details for the file payer_agent_audit-0.1.4.tar.gz.

File metadata

Download URL: payer_agent_audit-0.1.4.tar.gz
Upload date: Jun 27, 2026
Size: 88.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for payer_agent_audit-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`c07b8bc10fd3c91e76c2145ca840b77e2105719dc3dc34d232050add76338e00`
MD5	`1e377f484bb7b6d19b758715cda5c7f5`
BLAKE2b-256	`da74b9d026059188b073a17ad41c9424be956f5db8b6d057dac3dda28d792fae`

See more details on using hashes here.

Provenance

The following attestation bundles were made for payer_agent_audit-0.1.4.tar.gz:

Publisher: publish.yml on linus10x/payer-agent-audit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: payer_agent_audit-0.1.4.tar.gz
- Subject digest: c07b8bc10fd3c91e76c2145ca840b77e2105719dc3dc34d232050add76338e00
- Sigstore transparency entry: 1986916983
- Sigstore integration time: Jun 27, 2026
Source repository:
- Permalink: linus10x/payer-agent-audit@01d917221864b25d6e2bc2a669386752ab6dd3fb
- Branch / Tag: refs/tags/v0.1.4
- Owner: https://github.com/linus10x
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@01d917221864b25d6e2bc2a669386752ab6dd3fb
- Trigger Event: push

File details

Details for the file payer_agent_audit-0.1.4-py3-none-any.whl.

File metadata

Download URL: payer_agent_audit-0.1.4-py3-none-any.whl
Upload date: Jun 27, 2026
Size: 59.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for payer_agent_audit-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`77474cc0ad40da91af3d344a1be6bf05a8e7a398181e7dfa89c53150122d8602`
MD5	`03fe6ce7d8e9055ec2430a1736ce5ea4`
BLAKE2b-256	`7f366fd2b28148c31e67f23387c778f71ad3c7b6b75c7c15dbe5cd57776655ba`

See more details on using hashes here.

Provenance

The following attestation bundles were made for payer_agent_audit-0.1.4-py3-none-any.whl:

Publisher: publish.yml on linus10x/payer-agent-audit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: payer_agent_audit-0.1.4-py3-none-any.whl
- Subject digest: 77474cc0ad40da91af3d344a1be6bf05a8e7a398181e7dfa89c53150122d8602
- Sigstore transparency entry: 1986917057
- Sigstore integration time: Jun 27, 2026
Source repository:
- Permalink: linus10x/payer-agent-audit@01d917221864b25d6e2bc2a669386752ab6dd3fb
- Branch / Tag: refs/tags/v0.1.4
- Owner: https://github.com/linus10x
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@01d917221864b25d6e2bc2a669386752ab6dd3fb
- Trigger Event: push

payer-agent-audit 0.1.4

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

payer-agent-audit

30-second tour

Read me first

Install

Why this exists for frontier autonomy stacks

Part of the Autonomy Ladder™ family

Table of Contents

Why this exists

The five primitives

Health-payer controls

Funding-type obligation routing

Trust boundary — what's in, what's yours

Threat model at a glance

Regulatory mapping

What this is and is not

Testing

Who this is for

Architecture

Author & disclosures

License · Citation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance