dbt-cube-sync
A powerful synchronization tool that creates a seamless pipeline from dbt models to Cube.js schemas and BI tools (Superset, Tableau, PowerBI).
Features
- dbt → Cube.js: Auto-generate Cube.js schemas from dbt models with metrics
- Flexible Data Type Source: Get column types from the catalog OR directly from the database via SQLAlchemy
- Model Filtering: Process specific models instead of all models
- Cube.js → BI Tools: Sync schemas to multiple BI platforms
- Extensible Architecture: Plugin-based connector system for easy BI tool integration
- Docker Support: Containerized execution with orchestration support
- CLI Interface: Simple command-line tools for automation
Supported BI Tools
- Apache Superset - Full implementation
- Tableau - Placeholder (coming soon)
- PowerBI - Placeholder (coming soon)
Installation
Using Poetry (Development)
cd dbt-cube-sync
poetry install
poetry run dbt-cube-sync --help
Database Drivers (for SQLAlchemy URI feature)
If you want to use the --sqlalchemy-uri option to fetch column types directly from your database, you'll need to install the appropriate database driver:
# PostgreSQL
poetry add psycopg2-binary
# MySQL
poetry add pymysql
# Snowflake
poetry add snowflake-sqlalchemy
# BigQuery
poetry add sqlalchemy-bigquery
# Redshift
poetry add sqlalchemy-redshift
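If you want to verify a driver and URI before pointing the tool at them, SQLAlchemy's inspector performs the same kind of reflection the --sqlalchemy-uri feature relies on. A minimal sketch (fetch_column_types is illustrative, not the tool's internal API; the demo uses in-memory SQLite so it runs without a server):

```python
from sqlalchemy import create_engine, inspect, text

def fetch_column_types(engine, table, schema=None):
    """Return {column_name: type_string} for one table via SQLAlchemy reflection."""
    inspector = inspect(engine)
    return {c["name"]: str(c["type"]) for c in inspector.get_columns(table, schema=schema)}

# Demo against an in-memory SQLite database; in practice the engine would be
# created from the same URI you pass to --sqlalchemy-uri, e.g.
# postgresql://user:password@localhost:5432/mydb (needs psycopg2-binary).
engine = create_engine("sqlite://")
with engine.begin() as conn:
    conn.execute(text("CREATE TABLE orders (id INTEGER, amount NUMERIC, placed_at TEXT)"))
print(fetch_column_types(engine, "orders"))
# {'id': 'INTEGER', 'amount': 'NUMERIC', 'placed_at': 'TEXT'}
```

If this prints your table's columns and types, the same URI should work with the CLI.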
Using Docker
docker build -t dbt-cube-sync .
docker run --rm dbt-cube-sync --help
Quick Start
1. Generate Cube.js Schemas from dbt
Option A: Using catalog file (traditional method)
dbt-cube-sync dbt-to-cube \
--manifest ./target/manifest.json \
--catalog ./target/catalog.json \
--output ./cube_output
Option B: Using database connection (no catalog needed)
dbt-cube-sync dbt-to-cube \
--manifest ./target/manifest.json \
--sqlalchemy-uri postgresql://user:password@localhost:5432/mydb \
--output ./cube_output
Option C: Filter specific models
dbt-cube-sync dbt-to-cube \
--manifest ./target/manifest.json \
--sqlalchemy-uri postgresql://user:password@localhost:5432/mydb \
--models orders,customers,products \
--output ./cube_output
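The --models filter works against the model entries in manifest.json. A sketch of that selection logic (select_models is a hypothetical helper, not the package's code; the manifest layout shown matches dbt's artifact format):

```python
import json, tempfile

def select_models(manifest_path, models=None):
    """List dbt model names from manifest.json, optionally filtered
    the way --models orders,customers,products filters them."""
    with open(manifest_path) as f:
        manifest = json.load(f)
    wanted = set(models.split(",")) if models else None
    return sorted(
        node["name"]
        for node in manifest["nodes"].values()
        if node["resource_type"] == "model"
        and (wanted is None or node["name"] in wanted)
    )

# Tiny stand-in manifest; a real one is produced by dbt in ./target/
fake = {"nodes": {
    "model.shop.orders":    {"resource_type": "model", "name": "orders"},
    "model.shop.customers": {"resource_type": "model", "name": "customers"},
    "seed.shop.countries":  {"resource_type": "seed",  "name": "countries"},
}}
with tempfile.NamedTemporaryFile("w", suffix=".json", delete=False) as f:
    json.dump(fake, f)
    path = f.name
print(select_models(path))                     # ['customers', 'orders']
print(select_models(path, "orders,products"))  # ['orders'] - unknown names are skipped
```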
2. Sync to BI Tool (Optional)
# Sync to Superset
dbt-cube-sync cube-to-bi superset \
--cube-files ./cube_output \
--url http://localhost:8088 \
--username admin \
--password admin \
--cube-connection-name Cube
Configuration
Sample Configuration (sync-config.yaml)
connectors:
  superset:
    type: superset
    url: http://localhost:8088
    username: admin
    password: admin
    database_name: Cube
  tableau:
    type: tableau
    url: https://your-tableau-server.com
    username: your-username
    password: your-password
  powerbi:
    type: powerbi
    # PowerBI specific configuration
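At runtime a config like this only needs to be parsed and the right connector section picked out. A sketch of that lookup, assuming PyYAML is installed (connector_config is a hypothetical helper, not part of the package):

```python
import yaml  # PyYAML

SAMPLE = """
connectors:
  superset:
    type: superset
    url: http://localhost:8088
    username: admin
    password: admin
    database_name: Cube
"""

def connector_config(raw_yaml, name):
    """Return one connector's settings from a sync-config.yaml document."""
    connectors = yaml.safe_load(raw_yaml)["connectors"]
    if name not in connectors:
        raise KeyError(f"no connector named {name!r}; have {sorted(connectors)}")
    return connectors[name]

print(connector_config(SAMPLE, "superset")["url"])  # http://localhost:8088
```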
CLI Commands
Quick Reference
| Command | Description |
|---|---|
| sync-all | Ultimate command - incremental sync: dbt → Cube.js → Superset → RAG |
| dbt-to-cube | Generate Cube.js schemas from dbt models (with incremental support) |
| cube-to-bi | Sync Cube.js schemas to BI tools (Superset, Tableau, PowerBI) |
sync-all (Recommended)
Ultimate incremental sync command - handles the complete pipeline with state tracking.
# Basic incremental sync (Cube.js only)
dbt-cube-sync sync-all -m manifest.json -c catalog.json -o ./cube_output
# Full pipeline: dbt → Cube.js → Superset
dbt-cube-sync sync-all -m manifest.json -c catalog.json -o ./cube_output \
--superset-url http://localhost:8088 \
--superset-username admin \
--superset-password admin
# Full pipeline: dbt → Cube.js → Superset → RAG embeddings
dbt-cube-sync sync-all -m manifest.json -c catalog.json -o ./cube_output \
--superset-url http://localhost:8088 \
--superset-username admin \
--superset-password admin \
--rag-api-url http://localhost:8000
# Force full rebuild (ignore state)
dbt-cube-sync sync-all -m manifest.json -c catalog.json -o ./cube_output --force-full-sync
Options:
| Option | Required | Description |
|---|---|---|
| --manifest, -m | Yes | Path to dbt manifest.json |
| --catalog, -c | No* | Path to dbt catalog.json |
| --sqlalchemy-uri, -s | No* | Database URI for column types |
| --output, -o | Yes | Output directory for Cube.js files |
| --state-path | No | State file path (default: .dbt-cube-sync-state.json) |
| --force-full-sync | No | Force full rebuild, ignore state |
| --superset-url | No | Superset URL |
| --superset-username | No | Superset username |
| --superset-password | No | Superset password |
| --cube-connection-name | No | Cube database name in Superset (default: Cube) |
| --rag-api-url | No | RAG API URL for embedding updates |
*Either --catalog or --sqlalchemy-uri is required.
How Incremental Sync Works:
- Reads the state file (.dbt-cube-sync-state.json) with model checksums
- Compares against the current manifest to detect changes
- Only processes added or modified models
- Deletes Cube.js files for removed models
- Updates the state file with new checksums
dbt-to-cube
Generate Cube.js schema files from dbt models with incremental support.
Options:
- --manifest/-m: Path to dbt manifest.json file (required)
- --catalog/-c: Path to dbt catalog.json file
- --sqlalchemy-uri/-s: SQLAlchemy database URI for fetching column types
- --models: Comma-separated list of model names to process
- --output/-o: Output directory for Cube.js files (required)
- --template-dir/-t: Directory containing Cube.js templates (default: ./cube/templates)
- --state-path: State file for incremental sync (default: .dbt-cube-sync-state.json)
- --force-full-sync: Force full regeneration, ignore cached state
- --no-state: Disable state tracking (legacy behavior)
Examples:
# Incremental sync (default)
dbt-cube-sync dbt-to-cube -m manifest.json -c catalog.json -o output/
# Force full rebuild
dbt-cube-sync dbt-to-cube -m manifest.json -c catalog.json -o output/ --force-full-sync
# Using database connection (no catalog needed)
dbt-cube-sync dbt-to-cube -m manifest.json -s postgresql://user:pass@localhost/db -o output/
# Filter specific models
dbt-cube-sync dbt-to-cube -m manifest.json -c catalog.json -o output/ --models users,orders
cube-to-bi
Sync Cube.js schemas to BI tool datasets.
Arguments:
- bi_tool: BI tool type (superset, tableau, powerbi)
Options:
- --cube-files/-c: Directory containing Cube.js files (required)
- --url/-u: BI tool URL (required)
- --username/-n: BI tool username (required)
- --password/-p: BI tool password (required)
- --cube-connection-name/-d: Name of the Cube database connection in the BI tool (default: Cube)
Example:
dbt-cube-sync cube-to-bi superset -c cube_output/ -u http://localhost:8088 -n admin -p admin -d Cube
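The connector's input is simply the directory of generated .js schema files. As an illustration of what it has to work with, cube names can be pulled out of the cube(`Name`, …) declarations; this regex-based parsing is a sketch, not how the package actually reads the files:

```python
import re, tempfile
from pathlib import Path

CUBE_NAME = re.compile(r"cube\(\s*[`'\"](\w+)[`'\"]")

def list_cubes(cube_dir):
    """Map each generated schema file to the cube name it declares."""
    out = {}
    for path in sorted(Path(cube_dir).glob("*.js")):
        match = CUBE_NAME.search(path.read_text())
        if match:
            out[path.name] = match.group(1)
    return out

# Demo with a throwaway directory standing in for ./cube_output
tmp = Path(tempfile.mkdtemp())
(tmp / "Users.js").write_text("cube(`Users`, { sql: `SELECT * FROM users` });")
print(list_cubes(tmp))  # {'Users.js': 'Users'}
```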
State File
The state file (.dbt-cube-sync-state.json) tracks:
{
  "version": "1.0",
  "last_sync_timestamp": "2024-01-15T10:30:00Z",
  "manifest_path": "/path/to/manifest.json",
  "models": {
    "model.project.users": {
      "checksum": "abc123...",
      "has_metrics": true,
      "last_generated": "2024-01-15T10:30:00Z",
      "output_file": "./cube_output/Users.js"
    }
  }
}
Delete this file to force a full rebuild, or use --force-full-sync.
Architecture
dbt models (with metrics)
        ↓
dbt-cube-sync dbt-to-cube
        ↓
Cube.js schemas
        ↓
dbt-cube-sync cube-to-bi [connector]
        ↓
BI Tool Datasets (Superset/Tableau/PowerBI)
Project Structure
dbt-cube-sync/
├── dbt_cube_sync/
│   ├── cli.py                # CLI interface
│   ├── config.py             # Configuration management
│   ├── core/
│   │   ├── dbt_parser.py     # dbt manifest parser
│   │   ├── db_inspector.py   # Database column type inspector (SQLAlchemy)
│   │   ├── cube_generator.py # Cube.js generator
│   │   └── models.py         # Pydantic data models
│   └── connectors/
│       ├── base.py           # Abstract base connector
│       ├── superset.py       # Superset implementation
│       ├── tableau.py        # Tableau placeholder
│       └── powerbi.py        # PowerBI placeholder
├── Dockerfile                # Container definition
├── pyproject.toml            # Poetry configuration
└── README.md
Adding New BI Connectors
- Create a new connector class inheriting from BaseConnector
- Implement the required abstract methods
- Register the connector using ConnectorRegistry.register()
Example:
from .base import BaseConnector, ConnectorRegistry

class MyBIConnector(BaseConnector):
    def _validate_config(self):
        # Validation logic
        pass

    def connect(self):
        # Connection logic
        pass

    def sync_cube_schemas(self, cube_dir):
        # Sync implementation
        pass

# Register the connector
ConnectorRegistry.register('mybi', MyBIConnector)
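For illustration, a registry of this shape could look like the following standalone sketch. The real ConnectorRegistry lives in dbt_cube_sync/connectors/base.py; the create method and EchoConnector here are hypothetical and only show the pattern:

```python
class ConnectorRegistry:
    """Minimal class-registry pattern: map a CLI name to a connector class."""
    _connectors = {}

    @classmethod
    def register(cls, name, connector_cls):
        cls._connectors[name] = connector_cls

    @classmethod
    def create(cls, name, **config):  # hypothetical factory method
        try:
            return cls._connectors[name](**config)
        except KeyError:
            raise ValueError(f"unknown connector: {name!r}") from None

class EchoConnector:  # stand-in for a real BaseConnector subclass
    def __init__(self, url):
        self.url = url

ConnectorRegistry.register("echo", EchoConnector)
print(ConnectorRegistry.create("echo", url="http://localhost:8088").url)
# http://localhost:8088
```

Registering by string name is what lets the CLI dispatch `cube-to-bi superset` (or any future tool) without hard-coding connector classes.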
Docker Integration
The tool is designed to work in containerized environments with proper dependency orchestration:
- dbt docs: Runs dbt build, then serves documentation
- dbt-cube-sync: Runs the sync pipeline after dbt and Cube.js are ready
- BI Tools: Receive synced datasets after the sync completes
Contributing
- Fork the repository
- Create a feature branch
- Implement your changes
- Add tests if applicable
- Submit a pull request
License
MIT License - see LICENSE file for details.