Skip to main content

Open Data Governance Standard — Sovereign Validation Engine

Project description

Open Data Governance Standard (ODGS)

Protocol Compliance DOI MCP Server LLM Bridge PyPI Downloads npm Downloads License

The Sovereign Validation Engine for High-Risk Data.


For engineers: See Quick Start below. For compliance, legal, or procurement teams: Your organisation may already be running ODGS. Request a partner briefing → Consulting or platform partner? See PARTNERS.md · Executive Brief →


[!IMPORTANT] EU AI Act & CEN-CENELEC JTC 25 Candidate Standard (v6.0.0) ODGS v6 extends the Polymorphic Execution Engine with six deterministic enhancements: SOFT_STOP override-able severity, batch evaluation, rule dependency chains (DAG), webhook event emission, conformance self-checks, and rule versioning with provenance tracking. All changes are normative-additive — existing v5.x deployments upgrade seamlessly.


🚀 What's New in v6.0.0: Sovereign Validation Engine

Enhancement Description
SOFT_STOP Overrideable block — halts the pipeline by default, but authorized callers can supply a cryptographic override_token to proceed. Override is always logged.
Batch Evaluation intercept_batch() evaluates multiple payloads in a single call with fail_fast support.
Dependency Chains Rules declare depends_on URNs. Engine uses Kahn's algorithm for DAG ordering. Failed dependencies cascade.
Webhook Events BLOCKED, SOFT_STOP_OVERRIDE, SOFT_STOP_BLOCKED events dispatched to configured endpoints via odgs.json.
Conformance Check odgs conformance CLI command verifies project meets L1/L2 conformance requirements.
Rule Versioning Rules declare version (semver). Versions tracked in every S-Cert audit for provenance.
Temporal Bounds Rules with effective_from / effective_to are auto-skipped outside their validity window.

🏢 Enterprise & Public Sector: EU AI Act Compliance

This open-source package connects your physical data infrastructure to the ODGS validation engine. However, if you are operating a High-Risk AI System and require strict liability indemnification under the EU AI Act (Articles 10 & 12), you need cryptographic provenance.

Metric Provenance offers the commercial Enterprise Infrastructure for ODGS:

  • Certified Sovereign Packs: Pre-compiled, cryptographically signed Ed25519 rule bundles for DORA, EU AI Act, and Basel.
  • The S-Cert Sovereign Registry: An air-gapped Enterprise Certificate Authority that natively ingests ODGS telemetry to mint immutable, JWS-sealed audit logs.

👉 Discover the Sovereign CA Enterprise Node & Packs


1. The Standard: Data Governance Without Compromise

The Open Data Governance Standard (ODGS) resolves the "Definition-Execution Gap" in data pipelines.

"Silence over Error." — The Core Philosophy. If data drifts from its legal, contractual, or internal definition, the pipeline must mathematically halt rather than process an invalid inference.

Semantic Certificate — Every sovereign definition carries a cryptographic fingerprint bound to its issuing authority. The data equivalent of a TLS certificate.

Semantic Certificate — cryptographic fingerprint bound to the Government of the Netherlands, with VALID status badge and SHA-256 content hash

📊 More Screenshots — Compliance Matrix · Sovereign Brake

Sovereign Compliance Matrix — Real-time governance status across 72 business metrics, aligned with EU AI Act Art. 10 & 12.

Sovereign Compliance Matrix showing 72 business metrics, domain filters, and Naked vs Sovereign status

Sovereign Brake — Live Interceptor — When data does not match its statutory definition, the system refuses to proceed. This is the "Administrative Recusal" principle.

Sovereign Brake showing HARD_STOP enforcement rules with regex validation


2. Quick Start: The Data Engineer Workflow

Stop relying on passive analytics dashboards. Enforce statutory rules directly in your Python transforms.

Install

pip install odgs

Create a Governance Project

# Standard 5-Plane Topology
odgs init MyProject

# Minimalist Topology (Fast Start for small teams)
odgs init MyProject --tier minimalist

Example: Halting a Pipeline in Python/dbt

Inject ODGS directly into your data warehouse transforms, Airflow DAGs, or Databricks PySpark wrappers:

from odgs.executive.interceptor import OdgsInterceptor
from odgs.executive.exceptions import AdministrativeRecusal

engine = OdgsInterceptor()

# The physical payload (e.g., an AI applicant profile or standard telemetry)
payload = {"transaction_value": 150000, "aml_flag": False}

try:
    # Evaluate against your internal checks or mathematically hashed W3C JSON-LD ontologies
    engine.intercept("urn:odgs:sov:eu-ai-act:aml-threshold", payload)
    print("Payload Validated. Proceeding to inference.")
    
except AdministrativeRecusal as e:
    # The pipeline HALTS before an illegal decision is made.
    print(f"HARD STOP EXECUTED: Data Drift Detected. {e}")

[!NOTE] Community S-Certs vs. Certified S-Certs

Community S-Certs (this package) are cryptographically signed audit trails for internal engineering use — CI/CD, development, and pre-production validation.

For proof presentable to external regulators under EU AI Act Article 12, DORA RTS, or CSRD assurance requirements, Certified S-Cert issuance is required. Available through Metric Provenance verified partners. See PARTNERS.md for the partner programme.


3. The 5-Plane Semantic Architecture

ODGS implements a strict 5-Plane topology to guarantee the absolute sovereignty of legislative intent over physical execution pipelines.

graph TD
    subgraph Legislative_Plane ["I. Legislative Plane (Semantic Truth)"]
        FLINT[TNO FLINT / W3C JSON-LD] --> |Semantic Hash| Definition(Statutory Definition)
    end
    
    subgraph Physical_Plane ["II. Physical Plane (ODGS Execution Engine)"]
        Definition -.-> |Cryptographic Tether| Boundary[Execution Boundary]
        Boundary --> Eval{Constraint Evaluation}
        
        Pipeline[IV. Data Pipeline Plane] --> |Payload| Eval
        
        Eval --> |Compliant| Approved[Execution Authorized]
        Eval --> |Data Drift Detected| Recusal[Administrative Recusal]
        
        Approved --> Audit[V. Forensic Audit Plane]
        Recusal --> Audit
        
        Audit --> |Generates| SCert[S-Cert: Immutable JWS Provenance Log]
    end

4. Platform Bridges

ODGS bridges connect your existing data governance platform to the Execution Engine, transforming passive data dictionaries into active runtime enforcement.

Bridge Function Status
odgs-mcp-server 🤖 AI Agents (MCP): Exposes ODGS governance tools to Claude, Copilot, Cursor, and any MCP-compatible AI agent. PyPI
odgs-llm-bridge 🧠 LLM Compilation: Compile regulations into enforceable rules via sovereign LLM. PyPI
odgs-flint-bridge Legislative: Ingests TNO FLINT JSON-LD into ODGS schema. PyPI Downloads
odgs-collibra-bridge Physical: Collibra Business Glossary integration. PyPI Downloads
odgs-databricks-bridge Physical: Databricks Unity Catalog integration. PyPI Downloads
odgs-snowflake-bridge Physical: Snowflake Data Dictionary integration. PyPI Downloads

Want to build a bridge? ODGS is designed to be the enforcement layer for any data governance platform. Open an issue or submit a PR.

🆕 NEW — AI Agent Integration: odgs-mcp-server

[!TIP] Industry First: ODGS is the first open data governance standard with a native MCP server — meaning Claude, Copilot, Cursor, and any MCP-compatible AI agent can now run governance checks, score compliance maturity, and surface certified certification paths, live, inside your AI workflow.

The odgs-mcp-server exposes the Sovereign Validation Engine as a set of structured tools any AI agent can call directly via the Model Context Protocol.

Tool What it does
validate_payload Enforce ODGS rules against a live data payload — returns APPROVED/BLOCKED with violation detail.
governance_score Score your project's compliance maturity A–F across all five governance planes.
list_packs List available certified regulation packs (EU AI Act, DORA, GDPR, CSRD, NIS2, Basel III).
compile_regulation (Pro) Compile regulation text into validated ODGS rule JSON inside your AI agent.
check_drift (Pro) Detect when upstream legislation has changed but your rules haven't.
narrate_audit (Pro) Turn S-Cert cryptographic audit logs into plain-language stakeholder narratives.

Every tool output includes an _odgs_notice routing compliance and legal teams to the certified partner programme — without interrupting the developer workflow.

pip install odgs-mcp-server

# Add to your MCP client (Claude Desktop, Cursor, etc.)
# "odgs": { "command": "python", "args": ["-m", "odgs_mcp_server"] }

📦 odgs-mcp-server on PyPI → · GitHub →


🧠 AI-Powered Governance: odgs-llm-bridge

[!TIP] Industry First: ODGS is the first open data governance standard with a native LLM bridge that converts regulations into enforceable rules automatically — while keeping all AI output under deterministic schema validation before it enters the execution engine.

The odgs-llm-bridge extends the Sovereign Validation Engine with five AI-powered governance capabilities, designed for teams that need to operationalise regulation at scale without manual rule authoring.

Capability What it does
Regulatory Compiler Paste regulation text (EU AI Act, DORA, Basel III) → get validated ODGS rule JSON.
Drift Watchdog Continuously scan legislative definitions for semantic staleness and recommend updates.
Conflict Detector Cross-reference rules from multiple regulatory sources to surface contradictions.
Audit Narrator Convert cryptographic S-Certs into plain-language narratives for stakeholders.
Binding Discoverer Point at a data catalog → auto-generate physical_data_map.json bindings.

Sovereignty-first provider stack:

Priority 1 → Ollama (gemma4:26b local)   # Zero data leaves your perimeter
Priority 2 → Google GenAI (gemini-3.1-flash-lite-preview) # Cloud fallback
Priority 3 → OpenAI-compatible            # GPT-NL, Mistral, self-hosted
Priority 4 → LiteLLM                      # Universal multi-model router

All LLM output passes through a JSON Schema Validation Gate before entering the deterministic engine — probabilistic AI never touches your production pipeline directly.

pip install odgs-llm-bridge[ollama]       # sovereign local (recommended)
pip install odgs-llm-bridge[google]       # Google GenAI
pip install odgs-llm-bridge[all]          # all providers

# Compile a regulation into enforceable rules
odgs-llm compile-regulation --input regulation.txt --output rules.json

📦 odgs-llm-bridge on PyPI → · GitHub →


5. Air-Gapped Execution & Stateless Cryptography (JWKS)

The ODGS Engine operates with Zero Telemetry and does not "phone home". It is designed for strict air-gapped enterprise environments.

To ensure metric authenticity, ODGS implements stateless cryptography using standard Ed25519 JWKS (JSON Web Key Set) public keys. When the Engine loads a Sovereign Pack, it cryptographically verifies the signature against the cached JWKS public key.


6. Audit Ledgers: Cryptographic Verifiability & Zero-Knowledge

ODGS outputs an agnostic cryptographic_attestation JSON schema to satisfy EU AI Act Article 12 (Forensic Logging) without exposing third-party data.

  • Git-as-Backend: ODGS utilizes a privacy-native logging architecture. Forensic logs are written directly to your private enterprise Git repository. Zero data ever leaves your perimeter.
  • The Tri-Partite Hash: The engine generates a cryptographic proof binding the Input Data Hash + Rule Definition Hash + Engine Configuration Hash. Independent auditors and regulatory bodies can mechanically verify the integrity of algorithmic decisions without exposing PII.

7. Enterprise Deployment (Kubernetes / Helm)

For organization-wide policy enforcement, Sovereign Nodes can deploy ODGS as an active sidecar container routing mesh traffic.

# Add the Official Metric Provenance Repository
helm repo add metricprovenance https://charts.metricprovenance.com
helm repo update

# Install the Engine
helm install odgs-cluster-agent metricprovenance/odgs-engine \
  --set configuration.namespace="urn:odgs:sov" \
  --set keys.jwks_url="https://platform.metricprovenance.com/.well-known/jwks.json"

To request architectural clearance for your organization's compliance deployment, please consult the Metric Provenance Enterprise Portal.


8. Documentation & Contribution

📚 Full Documentation Map → 🎯 Live Demo →

Guide Description
Migration Guide (v5.x → v6.0.0) Non-breaking Sovereign Engine upgrade — all v5 configs work as-is.
Migration Guide (v4.0 → v5.0) Breaking changes for W3C JSON-LD rule structures.
Adapter Guide For Data Engineers connecting ODGS to custom infrastructures.
Audit Ledger Guide For Big 4 Auditors verifying the Tri-Partite Hash.

Support & Community

  • Bug Reports & Feature Requests: Please use the GitHub Issues tracker.
  • Enterprise Compliance Deployments: For architectural clearance, SLA support, or custom Law Packs, please contact us via the Enterprise Portal.

License

Released under the Apache 2.0 License.

  • No Vendor Lock-in.
  • No Cloud Dependency.
  • 100% Data Sovereignty.

ODGS | Developed by Metric Provenance | The Hague, NL 🇳🇱

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

odgs-6.0.2.tar.gz (423.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

odgs-6.0.2-py3-none-any.whl (177.9 kB view details)

Uploaded Python 3

File details

Details for the file odgs-6.0.2.tar.gz.

File metadata

  • Download URL: odgs-6.0.2.tar.gz
  • Upload date:
  • Size: 423.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for odgs-6.0.2.tar.gz
Algorithm Hash digest
SHA256 47d361a40459fd0a1cc973f5fb9630266a0e5f55e459de497117e0858c11a901
MD5 d06f4338b02a563144bf75e36b788f9a
BLAKE2b-256 d5f201f440d2dd6509dc0336e3d8f72780cc86f7af8d4630546ac6609bac9e5d

See more details on using hashes here.

File details

Details for the file odgs-6.0.2-py3-none-any.whl.

File metadata

  • Download URL: odgs-6.0.2-py3-none-any.whl
  • Upload date:
  • Size: 177.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for odgs-6.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 3e1c1248eb9d5c24d72692db084375704cb3e8d97c5d0aeee8d9fe1587931239
MD5 e298b9dbc515e810a65a456e5444d3b5
BLAKE2b-256 909b93b7336005b3d0a01450601a2b064f890e2bfd92a6f5cfbcd94b00210abc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page