The Open Data Governance Schema (ODGS) - A vendor-neutral standard for business definitions.

🏛️ The Open Data Governance Schema (ODGS)

The Open Standard for Headless Data Governance
Decouples Business Logic from BI Tools

📉 The Problem: Definition Drift

In the modern data stack, business logic is fragmented. The definition of Gross Margin in dbt often conflicts with the DAX formula in Power BI, which differs from the calculation in Tableau.

Result: Executives don't trust the dashboard, and Data Engineers spend 40% of their time debugging "why the numbers don't match."

🚀 The Solution: Write Once, Sync Everywhere

Headless Data Governance

graph TD
    subgraph PROBLEM ["❌ The Problem: Definition Drift"]
        A[CFO: 'Gross Margin' in Excel] -->|Disconnect| B[dbt: SQL Logic]
        A -->|Disconnect| C[Power BI: DAX Logic]
        B -.-|Mismatch| C
    end

    subgraph SOLUTION ["✅ The Solution: Open Data Governance Schema"]
        D[("JSON Schema (ODGS)
        Single Source of Truth")]
        
        D -->|Auto-Sync| E[dbt / Snowflake]
        D -->|Auto-Sync| F[Power BI / Tableau]
        D -->|Auto-Sync| G[Data Catalog / Collibra]
    end

    style D fill:#f9f,stroke:#333,stroke-width:4px,color:black
    style PROBLEM fill:#ffcccc,stroke:#333,stroke-width:1px
    style SOLUTION fill:#ccffcc,stroke:#333,stroke-width:1px

The Open Data Governance Schema (ODGS) is a vendor-neutral JSON protocol that acts as the "API" for your business definitions. By decoupling the Definition (The "What") from the Tool (The "How"), you achieve Headless Governance.

How it works

// example: standard_metrics.json
{
  "metric_id": "KPI_102",
  "name": "Gross_Margin",
  "domain": "Finance",
  "calculation_logic": {
    "abstract": "Revenue - COGS",
    "sql_standard": "SUM(gross_sales) - SUM(cost_of_goods)",
    "dax_pattern": "[Total Sales] - [Total Cost]"
  },
  "owner": "CFO_Office",
  "quality_threshold": "99.5%"
}
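To illustrate how a consumer might read this record, the sketch below parses the example metric and picks the expression for a given target tool. The `TARGET_KEYS` mapping and the `expression_for` helper are hypothetical names introduced here for illustration; only the JSON field names come from the example above.

```python
import json

# The example record from standard_metrics.json, inlined so the sketch runs on its own.
METRIC_JSON = """
{
  "metric_id": "KPI_102",
  "name": "Gross_Margin",
  "domain": "Finance",
  "calculation_logic": {
    "abstract": "Revenue - COGS",
    "sql_standard": "SUM(gross_sales) - SUM(cost_of_goods)",
    "dax_pattern": "[Total Sales] - [Total Cost]"
  },
  "owner": "CFO_Office",
  "quality_threshold": "99.5%"
}
"""

# Hypothetical mapping from a target tool to the calculation_logic key it consumes.
TARGET_KEYS = {"sql": "sql_standard", "dax": "dax_pattern"}


def expression_for(metric: dict, target: str) -> str:
    """Return the tool-specific expression, falling back to the abstract definition."""
    logic = metric["calculation_logic"]
    return logic.get(TARGET_KEYS.get(target, ""), logic["abstract"])


metric = json.loads(METRIC_JSON)
print(expression_for(metric, "sql"))      # expression for SQL-based tools
print(expression_for(metric, "dax"))      # expression for Power BI
print(expression_for(metric, "tableau"))  # no dedicated key, so the abstract form
```

Falling back to the abstract definition is one possible design choice; a stricter consumer could raise an error for targets without a dedicated expression.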

📂 The Protocol Structure

This repository contains the core schemas that define the "Alphabet" of Data Governance:

| File | Purpose |
| --- | --- |
| standard_metrics.json | The "Golden Record" for KPIs. Define logic, ownership, and sensitivity here. |
| standard_dq_dimensions.json | The 60 industry-standard dimensions of data quality (Accuracy, Timeliness, Completeness, etc.). |
| standard_data_rules.json | Technical validation rules (Regex patterns, null checks, referential integrity). |
| root_cause_factors.json | A standardized taxonomy for why data breaks (e.g., Process_Gap vs Integration_Failure). |
| business_process_maps.json | Maps how data entities flow through the business lifecycle. |

✅ Validation & CI/CD Integration

The repository includes a validator script that enforces the governance schema:

python3 validate_schema.py

Output:

🔍 Running Open Governance Schema Validator...
✅ Loaded 72 metrics.
✅ Loaded 50 data rules.
🎉 All Governance Checks Passed!

CI/CD Integration

Add this to your GitHub Actions workflow to enforce governance standards:

- name: Validate Governance Schema
  run: python3 validate_schema.py

This ensures that all metrics and rules have:

  • Unique IDs
  • Assigned owners
  • Defined domains
  • Clear calculation logic
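The four checks listed above could be expressed roughly as follows. This is a minimal sketch and not the repository's `validate_schema.py`; the `validate_metrics` function and `REQUIRED_FIELDS` tuple are hypothetical, and the field names are assumed from the example metric shown earlier.

```python
from collections import Counter

# Fields the checks call for; names are assumed from the example metric record.
REQUIRED_FIELDS = ("metric_id", "owner", "domain", "calculation_logic")


def validate_metrics(metrics: list) -> list:
    """Return a list of human-readable problems; an empty list means all checks pass."""
    errors = []
    # Unique IDs: any metric_id seen more than once is reported.
    counts = Counter(m.get("metric_id") for m in metrics)
    for metric_id, n in counts.items():
        if metric_id is not None and n > 1:
            errors.append(f"duplicate metric_id: {metric_id}")
    # Owners, domains, and calculation logic must be present and non-empty.
    for index, m in enumerate(metrics):
        label = m.get("metric_id") or f"<entry {index}>"
        for field in REQUIRED_FIELDS:
            if not m.get(field):
                errors.append(f"{label}: missing {field}")
    return errors


good = {"metric_id": "KPI_102", "owner": "CFO_Office", "domain": "Finance",
        "calculation_logic": {"abstract": "Revenue - COGS"}}
bad = {"metric_id": "KPI_102", "owner": "", "domain": "Finance"}

for problem in validate_metrics([good, bad]):
    print(problem)
```

In a CI job, a wrapper script would typically exit with a non-zero status whenever the returned list is non-empty, so that the pipeline step fails.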

📦 Installation

NPM (Node.js)

npm install odgs

Usage:

// ES modules
import { standardMetrics } from 'odgs';
// or CommonJS
const { standardMetrics } = require('odgs');

PyPI (Python)

pip install odgs

Usage:

from odgs import standard_metrics

🛠 Usage & Implementation

Option A: Build your own Sync Engine

Fork this repository. Use these JSON files as the configuration layer in your CI/CD pipeline. Write Python/Node parsers to inject these definitions into your tools (dbt schema.yml, Power BI XMLA, etc.).
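A sync engine along these lines might start as a pair of small renderers, one per target tool. The sketch below is illustrative only: the `render_sql_view` and `render_dax_measure` functions, the `_v` view suffix, and the `finance.sales` table name are assumptions, and a real pipeline would write into dbt or Power BI through their own interfaces rather than print text.

```python
def render_sql_view(metric: dict, source_table: str) -> str:
    """Render a CREATE VIEW statement from the metric's sql_standard expression."""
    column = metric["name"].lower()
    expression = metric["calculation_logic"]["sql_standard"]
    return (
        f"CREATE OR REPLACE VIEW {column}_v AS\n"
        f"SELECT {expression} AS {column}\n"
        f"FROM {source_table};"
    )


def render_dax_measure(metric: dict) -> str:
    """Render a DAX measure definition from the metric's dax_pattern expression."""
    display_name = metric["name"].replace("_", " ")
    return f"{display_name} = {metric['calculation_logic']['dax_pattern']}"


# The example metric, trimmed to the fields the renderers use.
metric = {
    "name": "Gross_Margin",
    "calculation_logic": {
        "sql_standard": "SUM(gross_sales) - SUM(cost_of_goods)",
        "dax_pattern": "[Total Sales] - [Total Cost]",
    },
}

# "finance.sales" is a hypothetical source table used only for this example.
print(render_sql_view(metric, "finance.sales"))
print(render_dax_measure(metric))
```

Because both outputs are generated from the same record, a change to the definition in the JSON file propagates to every target on the next pipeline run.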

Option B: The Reference Implementation

If you prefer a managed "Headless Governance" layer that natively supports ODGS and syncs to Power BI and dbt automatically, see what is possible in these end-state applications built with the same data:

These examples show business users the end result of what the Open Data Governance Schema can do.

Commercial Managed Service Partners

For commercial managed services, please visit:


📄 License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Copyright © 2025 Authentic Intelligence Labs


Contributing: We welcome Pull Requests to expand the dq_dimensions or refine the root_cause_factors taxonomy.
