Skip to main content

Open Data Governance Standard — Sovereign Validation Engine

Project description

Open Data Governance Standard (ODGS)

Protocol Compliance DOI PyPI Downloads npm Downloads License

The Sovereign Validation Engine for High-Risk Data.


For engineers: See Quick Start below. For compliance and procurement teams: Certified Regulatory Law Packs and Sovereign S-Cert Registries are managed by Metric Provenance partners. See PARTNERS.md for certification details.


[!IMPORTANT] EU AI Act & CEN-CENELEC JTC 25 Candidate Standard (v6.0.3) ODGS v6 extends the Polymorphic Execution Engine with six deterministic enhancements: SOFT_STOP override-able severity, batch evaluation, rule dependency chains (DAG), webhook event emission, conformance self-checks, and rule versioning with provenance tracking. All changes are normative-additive — existing v5.x deployments upgrade deterministically.


🚀 What's New in v6.0.3: Maturity Diagnostic + Sovereign Engine

Enhancement Description
📊 odgs-maturity NEW — DAMA DMBOK-aligned maturity scoring across 8 governance pillars with gap analysis and automated charter generation. pip install odgs-maturity
SOFT_STOP Overrideable block — halts the pipeline by default, but authorized callers can supply a cryptographic override_token to proceed. Override is always logged.
Batch Evaluation intercept_batch() evaluates multiple payloads in a single call with fail_fast support.
Dependency Chains Rules declare depends_on URNs. Engine uses Kahn's algorithm for DAG ordering. Failed dependencies cascade.
Webhook Events BLOCKED, SOFT_STOP_OVERRIDE, SOFT_STOP_BLOCKED events dispatched to configured endpoints via odgs.json.
Conformance Check odgs conformance CLI command verifies project meets L1/L2 conformance requirements.
Rule Versioning Rules declare version (semver). Versions tracked in every S-Cert audit for provenance.
Temporal Bounds Rules with effective_from / effective_to are auto-skipped outside their validity window.
Framework Tags Extensible framework_tags dictionary added to all normative schemas — bind ODGS artefacts to your enterprise governance framework (APQC, DAMA DMBOK, BIAN, CDMC).

💡 Maturity Baseline: Most teams score below 60% on first run. Certified sovereign packs map directly to missing conformance requirements to establish deterministic compliance.


🏢 Enterprise & Public Sector: EU AI Act Compliance

This open-source package connects your physical data infrastructure to the ODGS validation engine. However, if you are operating a High-Risk AI System and require strict liability indemnification under the EU AI Act (Articles 10 & 12), you need cryptographic provenance.

Metric Provenance offers the commercial Enterprise Infrastructure for ODGS:

  • Certified Sovereign Packs: Pre-compiled, cryptographically signed Ed25519 rule bundles for DORA, EU AI Act, and Basel.
  • The S-Cert Sovereign Registry: An air-gapped Enterprise Certificate Authority that natively ingests ODGS telemetry to mint immutable, JWS-sealed audit logs.

📊 Industry Benchmark: The European Data Governance Maturity Benchmark 2026 found an average governance maturity of 37.6% across 99 enterprises — a 62.4% enforcement gap against regulatory expectation.

Registry Access: The Sovereign CA Enterprise Node and Packs are available exclusively through the Metric Provenance certification registry.


1. The Standard: Data Governance Without Compromise

The Open Data Governance Standard (ODGS) resolves the "Definition-Execution Gap" in data pipelines.

"Silence over Error." — The Core Philosophy. If data drifts from its legal, contractual, or internal definition, the pipeline must mathematically halt rather than process an invalid inference.

Semantic Certificate — Every sovereign definition carries a cryptographic fingerprint bound to its issuing authority. The data equivalent of a TLS certificate.

Semantic Certificate — cryptographic fingerprint bound to the Government of the Netherlands, with VALID status badge and SHA-256 content hash

📊 More Screenshots — Compliance Matrix · Sovereign Brake

Sovereign Compliance Matrix — Real-time governance status across 72 business metrics, aligned with EU AI Act Art. 10 & 12.

Sovereign Compliance Matrix showing 72 business metrics, domain filters, and Naked vs Sovereign status

Sovereign Brake — Live Interceptor — When data does not match its statutory definition, the system refuses to proceed. This is the "Administrative Recusal" principle.

Sovereign Brake showing HARD_STOP enforcement rules with regex validation


2. Quick Start: The Data Engineer Workflow

Stop relying on passive analytics dashboards. Enforce statutory rules directly in your Python transforms.

Install

pip install odgs

Create a Governance Project

# Standard 5-Plane Topology
odgs init MyProject

# Minimalist Topology (Fast Start for small teams)
odgs init MyProject --tier minimalist

Example: Halting a Pipeline in Python/dbt

Inject ODGS directly into your data warehouse transforms, Airflow DAGs, or Databricks PySpark wrappers:

from odgs.executive.interceptor import OdgsInterceptor
from odgs.executive.exceptions import AdministrativeRecusal

engine = OdgsInterceptor()

# The physical payload (e.g., an AI applicant profile or standard telemetry)
payload = {"transaction_value": 150000, "aml_flag": False}

try:
    # Evaluate against your internal checks or mathematically hashed W3C JSON-LD ontologies
    engine.intercept("urn:odgs:sov:eu-ai-act:aml-threshold", payload)
    print("Payload Validated. Proceeding to inference.")
    
except AdministrativeRecusal as e:
    # The pipeline HALTS before an illegal decision is made.
    print(f"HARD STOP EXECUTED: Data Drift Detected. {e}")

[!NOTE] Community S-Certs vs. Certified S-Certs

Community S-Certs (this package) are cryptographically signed audit trails for internal engineering use — CI/CD, development, and pre-production validation.

For proof presentable to external regulators under EU AI Act Article 12, DORA RTS, or CSRD assurance requirements, Certified S-Cert issuance is required. This is maintained within the Metric Provenance Sovereign S-Cert Registry. See PARTNERS.md for certification details.


3. The 5-Plane Semantic Architecture

ODGS implements a strict 5-Plane topology to guarantee the absolute sovereignty of legislative intent over physical execution pipelines.

graph TD
    subgraph Legislative_Plane ["I. Legislative Plane (Semantic Truth)"]
        FLINT[TNO FLINT / W3C JSON-LD] --> |Semantic Hash| Definition(Statutory Definition)
    end
    
    subgraph Physical_Plane ["II. Physical Plane (ODGS Execution Engine)"]
        Definition -.-> |Cryptographic Tether| Boundary[Execution Boundary]
        Boundary --> Eval{Constraint Evaluation}
        
        Pipeline[IV. Data Pipeline Plane] --> |Payload| Eval
        
        Eval --> |Compliant| Approved[Execution Authorized]
        Eval --> |Data Drift Detected| Recusal[Administrative Recusal]
        
        Approved --> Audit[V. Forensic Audit Plane]
        Recusal --> Audit
        
        Audit --> |Generates| SCert[S-Cert: Immutable JWS Provenance Log]
    end

4. Platform Bridges

ODGS bridges connect your existing data governance platform to the Execution Engine, transforming passive data dictionaries into active runtime enforcement.

Bridge Function Status
odgs-mcp-server 🤖 AI Agents (MCP): Exposes ODGS governance tools to Claude, Copilot, Cursor, and any MCP-compatible AI agent. PyPI
odgs-maturity 📊 Maturity Diagnostic: DAMA DMBOK-aligned maturity scoring with gap analysis and governance charter generation. PyPI
odgs-llm-bridge 🧠 LLM Compilation: Compile regulations into enforceable rules via sovereign LLM. PyPI
odgs-flint-bridge Legislative: Ingests TNO FLINT JSON-LD into ODGS schema. PyPI Downloads
odgs-collibra-bridge Physical: Collibra Business Glossary integration. PyPI Downloads
odgs-databricks-bridge Physical: Databricks Unity Catalog integration. PyPI Downloads
odgs-snowflake-bridge Physical: Snowflake Data Dictionary integration. PyPI Downloads

Want to build a bridge? ODGS is designed to be the enforcement layer for any data governance platform. Open an issue or submit a PR.

🆕 NEW — AI Agent Integration: odgs-mcp-server

[!TIP] Industry First: ODGS is the first open data governance standard with a native MCP server — meaning Claude, Copilot, Cursor, and any MCP-compatible AI agent can now run governance checks, score compliance maturity, and surface certified certification paths, live, inside your AI workflow.

The odgs-mcp-server exposes the Sovereign Validation Engine as a set of structured tools any AI agent can call directly via the Model Context Protocol.

Tool What it does
validate_payload Enforce ODGS rules against a live data payload — returns APPROVED/BLOCKED with violation detail.
governance_score Score your project's compliance maturity A–F across all five governance planes.
list_packs List available certified regulation packs (EU AI Act, DORA, GDPR, CSRD, NIS2, Basel III).
compile_regulation (Pro) Compile regulation text into validated ODGS rule JSON inside your AI agent.
check_drift (Pro) Detect when upstream legislation has changed but your rules haven't.
narrate_audit (Pro) Turn S-Cert cryptographic audit logs into plain-language stakeholder narratives.

Every tool output includes an _odgs_notice routing compliance and legal teams to the certified registry — without interrupting the deterministic execution flow.

pip install odgs-mcp-server

# Add to your MCP client (Claude Desktop, Cursor, etc.)
# "odgs": { "command": "python", "args": ["-m", "odgs_mcp_server"] }

📦 odgs-mcp-server on PyPI → · GitHub →


🧠 AI-Powered Governance: odgs-llm-bridge

[!TIP] Industry First: ODGS is the first open data governance standard with a native LLM bridge that converts regulations into enforceable rules automatically — while keeping all AI output under deterministic schema validation before it enters the execution engine.

The odgs-llm-bridge extends the Sovereign Validation Engine with five AI-powered governance capabilities, designed for teams that need to operationalise regulation at scale without manual rule authoring.

Capability What it does
Regulatory Compiler Paste regulation text (EU AI Act, DORA, Basel III) → get validated ODGS rule JSON.
Drift Watchdog Continuously scan legislative definitions for semantic staleness and recommend updates.
Conflict Detector Cross-reference rules from multiple regulatory sources to surface contradictions.
Audit Narrator Convert cryptographic S-Certs into plain-language narratives for stakeholders.
Binding Discoverer Point at a data catalog → auto-generate physical_data_map.json bindings.

Sovereignty-first provider stack:

Priority 1 → Ollama (gemma4:26b local)   # Zero data leaves your perimeter
Priority 2 → Google GenAI (gemini-3.1-flash-lite-preview) # Cloud fallback
Priority 3 → OpenAI-compatible            # GPT-NL, Mistral, self-hosted
Priority 4 → LiteLLM                      # Universal multi-model router

All LLM output passes through a JSON Schema Validation Gate before entering the deterministic engine — probabilistic AI never touches your production pipeline directly.

pip install odgs-llm-bridge[ollama]       # sovereign local (recommended)
pip install odgs-llm-bridge[google]       # Google GenAI
pip install odgs-llm-bridge[all]          # all providers

# Compile a regulation into enforceable rules
odgs-llm compile-regulation --input regulation.txt --output rules.json

📦 odgs-llm-bridge on PyPI → · GitHub →


5. Air-Gapped Execution & Stateless Cryptography (JWKS)

The ODGS Engine operates with Zero Telemetry and does not "phone home". It is designed for strict air-gapped enterprise environments.

To ensure metric authenticity, ODGS implements stateless cryptography using standard Ed25519 JWKS (JSON Web Key Set) public keys. When the Engine loads a Sovereign Pack, it cryptographically verifies the signature against the cached JWKS public key.


6. Audit Ledgers: Cryptographic Verifiability & Zero-Knowledge

ODGS outputs an agnostic cryptographic_attestation JSON schema to satisfy EU AI Act Article 12 (Forensic Logging) without exposing third-party data.

  • Git-as-Backend: ODGS utilizes a privacy-native logging architecture. Forensic logs are written directly to your private enterprise Git repository. Zero data ever leaves your perimeter.
  • The Tri-Partite Hash: The engine generates a cryptographic proof binding the Input Data Hash + Rule Definition Hash + Engine Configuration Hash. Independent auditors and regulatory bodies can mechanically verify the integrity of algorithmic decisions without exposing PII.

7. Enterprise Deployment (Kubernetes / Helm)

For organization-wide policy enforcement, Sovereign Nodes can deploy ODGS as an active sidecar container routing mesh traffic.

# Add the Official Metric Provenance Repository
helm repo add metricprovenance https://charts.metricprovenance.com
helm repo update

# Install the Engine
helm install odgs-cluster-agent metricprovenance/odgs-engine \
  --set configuration.namespace="urn:odgs:sov" \
  --set keys.jwks_url="https://certificate.metricprovenance.com/.well-known/jwks.json"

Architectural clearance and registry access for compliance deployments are managed via the Metric Provenance Sovereign Registry.


8. Documentation & Contribution

📚 Full Documentation Map →

Guide Description
Migration Guide (v5.x → v6.0.0) Non-breaking Sovereign Engine upgrade — all v5 configs work as-is.
Migration Guide (v4.0 → v5.0) Breaking changes for W3C JSON-LD rule structures.
Adapter Guide For Data Engineers connecting ODGS to custom infrastructures.
Audit Ledger Guide For Big 4 Auditors verifying the Tri-Partite Hash.

Support & Community


License

Released under the Apache 2.0 License.

  • No Vendor Lock-in.
  • No Cloud Dependency.
  • 100% Data Sovereignty.

ODGS | Developed by Metric Provenance | The Hague, NL 🇳🇱

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

odgs-6.0.3.tar.gz (413.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

odgs-6.0.3-py3-none-any.whl (178.4 kB view details)

Uploaded Python 3

File details

Details for the file odgs-6.0.3.tar.gz.

File metadata

  • Download URL: odgs-6.0.3.tar.gz
  • Upload date:
  • Size: 413.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for odgs-6.0.3.tar.gz
Algorithm Hash digest
SHA256 86f059e634936a73d275c821d5f96e320d3fc00aa890757e6b1bad06fd3d1ff7
MD5 6bc47073ee373bd6657efa176ce1adcb
BLAKE2b-256 eee21e114d9f4aa7a39f9a6e4b1c02ee4080775c88b1b65051d71f9b1163e122

See more details on using hashes here.

File details

Details for the file odgs-6.0.3-py3-none-any.whl.

File metadata

  • Download URL: odgs-6.0.3-py3-none-any.whl
  • Upload date:
  • Size: 178.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for odgs-6.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 254c3a50f73e54f5894378f5667046b193f905d22d7d4bfb980fe0d6fe65539e
MD5 f6a8e8bcb7d30e2d14afac3c7c92d9f3
BLAKE2b-256 2eec4755e0d562e3193f7a897fc4fabae179024628f9875fe50bebc601c88415

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page