Codify US federal regulations (FERPA + Title IV) into machine-readable rules with verbatim-text faithfulness gates, risk-tiered AI guardrails, and an MCP server. Proof-of-concept.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Qd9drCBe

These details have not been verified by PyPI

Project description

RegRails

Codify federal regulations into machine-readable rules. Wire them into an AI advisor as a deny-by-default, risk-tiered guardrail. Keep the audit trail honest.

A proof-of-concept for turning institutional-policy and regulatory requirements into machine-readable logic an AI system consults before it answers — the "policy-as-code" pattern for AI-in-the-loop governance. It encodes two U.S. higher-education frameworks and ships a working AI-advisor guardrail demo:

FERPA — 34 CFR Part 99, Subpart D (education-record disclosure)
Title IV — 34 CFR Part 668 subset: Satisfactory Academic Progress (§ 668.34) + student eligibility (§ 668.32)

37 machine-readable rules across 8 sections, each pinned to verbatim CFR text by a faithfulness gate; a deterministic engine that emits a typed GuardrailDecision with a risk tier and a human-gate flag; tamper-evident hash-chained decision provenance; and a 22-scenario golden corpus with a published coverage matrix. Apache-2.0, Python 3.12+, uv-managed. 185 tests; ruff + mypy-strict clean.

This is a proof-of-concept, not a compliance product and not legal advice. It encodes selected provisions with verbatim traceability for the rules included, and routes high-stakes determinations to a human. See Limitations.

Install + verify in a minute

pip install regrails

regrails check faithfulness     # 37/37 rules verbatim-faithful to the bundled CFR text
regrails decide -q "I defaulted; am I eligible for aid?" --topic aid_status --aid-determination --in-default
regrails export oscal           # OSCAL 1.1.2-shaped catalog of the 37 rules
regrails decide -q "What's Jane's GPA?" --data gpa --format sarif   # SARIF 2.1.0
regrails mcp serve              # MCP server: consult_guardrail / list_rules / check_faithfulness

(The held-out benchmark — regrails bench run / regrails bench report — is a repo-dev command; clone the repo to reproduce it. The published results are in docs/EVAL.md and on HuggingFace.)

Beyond the engine, v0.3 adds: an MCP server (the guardrail as an agent tool — consult_guardrail, engine-only, so an AI agent consults it before answering); signed PyPI releases (PEP 740 + SLSA attestations); OSCAL + SARIF export; a reusable GitHub Action that fails a CI job on block / escalate_human_review; a methodology + limitations doc (scope boundary, label provenance, the 6 coverage gaps listed by ID); and a held-out benchmark (24 independently-authored scenarios × GPT‑5.5 / Gemini 3.1 Pro / Grok 4.3 / DeepSeek, labeled by an independent Claude judge): the guardrail allows 7/7 benign asks where the unguarded models over-refuse 2–4/7, and intercepts 15/17 high-stakes — the 2 it allows are FERPA-permitted (emergency disclosure, parental inspection), surfaced honestly rather than hidden.

The idea: what AI may automate vs. what needs a human

Every decision is tagged with a risk tier derived from reversibility and blast radius:

low / reversible → safe to automate (e.g., explaining a rule, an out-of-scope question).
medium → automate with a cited obligation or a quick check (consent, directory opt-out).
high / irreversible → mandatory human gate. Loss of aid eligibility, loan default, a yes/no eligibility determination — the engine refuses to let the AI answer and routes to a human (the financial-aid office).

That boundary — what an AI can automate and what requires human judgment or institutional policy logic — is the whole point. The AI never makes the high-stakes call; it consults the rules, and a human owns the irreversible ones.

What the demo shows

python -m regrails.demo --all runs 12 scenarios across both frameworks. The decision (outcome + risk tier + human gate + citations) is 100% deterministic and made before any LLM call; the LLM only renders the user-facing reply. Full transcripts: demo/recorded-runs/; browsable HTML: docs/decision-report.html; full table: docs/demo-output.md.

#	Framework	Query (abridged)	Outcome	Risk	Human gate	Cite
1	FERPA	"What's Jane Doe's GPA?"	block	medium	—	§ 99.30
2	FERPA	Outsourced AI tutor wants test scores	escalate_consent	medium	—	§ 99.31(a)(1)(i)(B)
3	FERPA	"What's the basketball roster?"	escalate_directory_check	medium	—	§ 99.37
4	FERPA	Aggregate, de-identified grad rates	allow	low	—	§ 99.31(b)(1)
5	FERPA	Health/safety emergency, homeroom addresses	allow	medium	—	§ 99.36 + § 99.32
6	FERPA	Parent asks for the disclosure log	allow	low	—	§ 99.32
7	FERPA	Forward transcripts received elsewhere	block	medium	—	§ 99.33
8	Title IV	"I defaulted — am I still eligible?"	escalate_human_review	high	YES	§ 668.32(g)(1)
9	Title IV	"I failed SAP — will I keep my Pell grant?"	escalate_human_review	high	YES	§ 668.34(a)(7)
10	Title IV	"What does financial aid warning mean?"	allow	medium	—	§ 668.34(a)(8)(i)
11	Title IV	"Will I get aid next semester?" (no facts)	insufficient_facts	low	—	§ 668.34(a)(3)
12	—	"What time does the library close?"	out_of_scope	low	—	—

Seven outcomes: allow · block · escalate_consent · escalate_directory_check · escalate_human_review · insufficient_facts · out_of_scope.

How it works

flowchart TD
  Q[User query] --> C[Advisor LLM builds a typed ConsultationRequest]
  C --> R{topic?}
  R -- other --> OOS[out_of_scope]
  R -- aid_status --> TIV[Title IV cascade<br/>SAP / eligibility]
  R -- disclosure / unknown --> FERPA[FERPA cascade<br/>disclosure]
  FERPA --> O[Outcome + citations]
  TIV --> O
  OOS --> O
  O --> RT[Assign risk tier + human_gate_required]
  RT --> PROV[Append to hash-chained decision log]
  PROV --> LLM[LLM renders the user-facing reply ONLY]

The deterministic engine (guardrail.py) decides; the LLM (llm.py) only phrases the reply. You can see the boundary directly — run the engine with no LLM at all:

regrails decide --query "I defaulted on a loan; am I eligible for aid?" \
  --topic aid_status --aid-determination --in-default
# -> {"outcome": "escalate_human_review", "risk_tier": "high",
#     "human_gate_required": true, "citations_emitted": ["34-CFR-668.32(g)(1)"], ...}

Verifiable by design

Faithfulness gate — every rule's source_quote must appear verbatim in the bundled CFR text (token coverage ≥ 0.85). regrails check faithfulness → 37/37 pass. Combined with a SHA-256 over each bundled section, there's a tamper-evident chain from public CFR text → bundle → encoded rule.
Golden corpus + coverage matrix — 22 scenarios pin (outcome, risk_tier, human_gate, citations) for concrete fact patterns (tests/golden/). regrails coverage report builds a rule→scenario traceability matrix and flags rules no scenario exercises (docs/COVERAGE.md) — coverage and gaps are visible, not hidden.
Hash-chained decision provenance — every decision can be appended to a tamper-evident log; regrails audit verify <log> recomputes the chain and detects any edit, insertion, or deletion. (Echoes the cryptographic evidence-binding in Evidentia, at POC scale: no keys, just a verifiable chain.)

Research-grounded

The encoded rules are grounded by live research streams (Perplexity Sonar deep research + a GPT-5.5 / Gemini 3.1 Pro / Grok 4.3 triangulation), committed to research/snapshots/. They surfaced the real-world hooks the encoding leans on: the FERPA "school official" safe-harbor test for outsourced AI vendors (§ 99.31(a)(1)(i)(B)), the § 99.31(a)(6) studies exception, § 99.37 directory opt-out mechanics, and that policy-as-code with audit trails + human-in-the-loop (commonly via OPA/Rego) is the recommended pattern in regulated student-facing AI.

Quickstart

git clone https://github.com/Polycentric-Labs/regrails.git && cd regrails
uv sync --extra dev

uv run regrails check faithfulness          # 37/37 rules verbatim-faithful
uv run regrails encode list                 # all 37 rules (FERPA + Title IV)
uv run regrails decide -q "What's Jane's GPA?" --data gpa   # pure engine, no LLM
uv run regrails coverage report             # rule -> scenario matrix (+ gaps)
uv run regrails report html                 # build docs/decision-report.html

# Demo (needs OPENROUTER_API_KEY, or a key file at ~/.secrets/openrouter.env):
uv run python -m regrails.demo --all
# Replay the recorded demo WITHOUT an API key:
uv run python -m regrails.demo --replay demo/recorded-runs/
uv run regrails audit verify demo/recorded-runs/decisions.chain.jsonl

uv run pytest -q                            # 185 tests

How this maps to the role

This POC was built for the Gates Foundation Senior Program Officer, AI-Enabled Engagement Systems charter. The mapping is deliberate:

Role asks for	RegRails shows
"codify institutional policy and workflows into machine-readable logic, ensuring alignment with regulatory requirements (e.g., FERPA, Title IV)"	37 machine-readable rules across FERPA and Title IV, faithfulness-pinned to source text
"auditability, AI-in-the-loop governance"	hash-chained decision provenance + `audit verify`; the LLM is a renderer, the engine decides
"the boundary between what AI can automate and what requires human judgment or institutional policy logic"	the risk-tier + human-gate model: irreversible/high-stakes → mandatory human gate
"policy-as-code frameworks"	a working policy-as-code engine with a coverage/traceability matrix
"AI public goods"	Apache-2.0, runnable offline, no license cost — built for lower-resourced institutions

Project layout

regrails/
├── data/{cfr,encoded}/          # verbatim CFR text + encoded YAML rules (FERPA + Title IV)
├── research/snapshots/          # committed Sonar research-stream JSON
├── demo/{queries.yaml,recorded-runs/}   # 12 scenarios + replayable transcripts + hash chain
├── docs/{COVERAGE.md,decision-report.html,demo-output.md}
├── src/regrails/
│   ├── models.py        # 5 Pydantic models + RiskTier + 7 outcomes
│   ├── encode.py        # multi-framework loader
│   ├── faithfulness.py  # verbatim-text gate (Jaccard / coverage)
│   ├── guardrail.py     # decide(): FERPA + Title IV cascades, risk tiering
│   ├── audit.py         # event log + hash-chained provenance
│   ├── llm.py           # advisor renderer (OpenRouter; retry + fallback)
│   ├── coverage.py · report.py · demo.py
│   └── cli/             # regrails {check, encode, decide, coverage, audit, report, research}
└── tests/               # 185 tests incl. golden corpus + tamper-detection

Limitations

A weekend proof-of-concept is not a production system, and saying so plainly is part of the point.

Selected provisions, not full coverage. Two frameworks, eight sections, 37 rules. FERPA has more subparts; Part 668 is far larger. The architecture generalizes; adding more is authoring, not redesign. The coverage matrix shows exactly which rules are and aren't exercised.
Verbatim-text faithfulness only. The gate proves each source_quote matches the CFR text. It does not prove the semantic encoding (rule_type, triggers, risk tier) is a correct legal reading — that needs review by an institutional FERPA/financial-aid officer.
Not "compliant," not legal advice, not production-ready. It creates auditable guardrails and escalation paths; it does not certify compliance or prevent violations. High-stakes cases require human review by design.
The LLM only renders text. It never makes a decision. All decisions come from the deterministic engine and are reproducible without any model.
Synthetic data only. No real student records exist anywhere here (SYNTHETIC_DATA.md).

Built by the author of Evidentia

By Allen Byrd, author of Evidentia — a 446-commit open-source GRC platform with bundled framework catalogs (NIST 800-53, FFIEC, ISO 27001, ISO 42001, NIST AI RMF, OWASP LLM/Agentic Top 10, OSCAL/OCSF/SARIF), Sigstore-signed verifiable AI outputs, an MCP server, and supply-chain-attested PyPI releases under OpenSSF Best Practices. RegRails reuses Evidentia's patterns directly: Pydantic models with extra="forbid", a Jaccard verbatim-faithfulness gate, a string-valued EventAction audit enum, Typer CLI sub-commands, and provider-agnostic, env-file secret loading that never routes a key through tool context.

The thesis: regulations are code, and the primitives for representing them as machine-readable, auditable, AI-consultable artifacts already exist. RegRails is one weekend-sized worked example, extended to two frameworks.

License

Apache-2.0. See LICENSE.

AI assistance

This project was developed alongside AI platforms.

Models used: Claude Opus 4.8, GPT-5.5, Gemini 3.1 Pro, Grok 4.3, DeepSeek, Perplexity Sonar (Deep Research + Pro)

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Qd9drCBe

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.4.0

Jun 2, 2026

This version

0.3.1

Jun 2, 2026

0.3.0

Jun 1, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

regrails-0.3.1.tar.gz (293.1 kB view details)

Uploaded Jun 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

regrails-0.3.1-py3-none-any.whl (86.9 kB view details)

Uploaded Jun 2, 2026 Python 3

File details

Details for the file regrails-0.3.1.tar.gz.

File metadata

Download URL: regrails-0.3.1.tar.gz
Upload date: Jun 2, 2026
Size: 293.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for regrails-0.3.1.tar.gz
Algorithm	Hash digest
SHA256	`136ff424f8d89b2a1ee8646b3c98ea22e4f9a873959343fca9c2961fdc5beeb0`
MD5	`b84a86d4d58df02801661853e47e5ba7`
BLAKE2b-256	`72e55558490715d45694733156a458b65d843fb996a343b17dd65e4689f1c334`

See more details on using hashes here.

Provenance

The following attestation bundles were made for regrails-0.3.1.tar.gz:

Publisher: release.yml on Polycentric-Labs/regrails

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: regrails-0.3.1.tar.gz
- Subject digest: 136ff424f8d89b2a1ee8646b3c98ea22e4f9a873959343fca9c2961fdc5beeb0
- Sigstore transparency entry: 1699623016
- Sigstore integration time: Jun 2, 2026
Source repository:
- Permalink: Polycentric-Labs/regrails@0901b9cb1fc4f9ccefe3a9df124737b6b2c6b962
- Branch / Tag: refs/tags/v0.3.1
- Owner: https://github.com/Polycentric-Labs
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@0901b9cb1fc4f9ccefe3a9df124737b6b2c6b962
- Trigger Event: push

File details

Details for the file regrails-0.3.1-py3-none-any.whl.

File metadata

Download URL: regrails-0.3.1-py3-none-any.whl
Upload date: Jun 2, 2026
Size: 86.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for regrails-0.3.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`89ff08811c244d6b86894df92a3e89e98820d55a67a52fbdd1e1f370f4889be0`
MD5	`6b4e0bc3ff663b27f81ad845e2e5b8e1`
BLAKE2b-256	`98b2f89fdbc24f1f03af43f5c0fbe3471cc9594fed8753789845b5420133c541`

See more details on using hashes here.

Provenance

The following attestation bundles were made for regrails-0.3.1-py3-none-any.whl:

Publisher: release.yml on Polycentric-Labs/regrails

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: regrails-0.3.1-py3-none-any.whl
- Subject digest: 89ff08811c244d6b86894df92a3e89e98820d55a67a52fbdd1e1f370f4889be0
- Sigstore transparency entry: 1699623154
- Sigstore integration time: Jun 2, 2026
Source repository:
- Permalink: Polycentric-Labs/regrails@0901b9cb1fc4f9ccefe3a9df124737b6b2c6b962
- Branch / Tag: refs/tags/v0.3.1
- Owner: https://github.com/Polycentric-Labs
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@0901b9cb1fc4f9ccefe3a9df124737b6b2c6b962
- Trigger Event: push

regrails 0.3.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Project description

RegRails

Install + verify in a minute

The idea: what AI may automate vs. what needs a human

What the demo shows

How it works

Verifiable by design

Research-grounded

Quickstart

How this maps to the role

Project layout

Limitations

Built by the author of Evidentia

License

AI assistance

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance