
The GizmoSQL adapter plugin for dbt


dbt-gizmosql

A dbt adapter for GizmoSQL


dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

dbt is the T in ELT. Organize, cleanse, denormalize, filter, rename, and pre-aggregate the raw data in your warehouse so that it's ready for analysis.

GizmoSQL

GizmoSQL is an Apache Arrow Flight-based SQL engine for data warehouses. It is designed to be fast, scalable, and easy to use.

It offers DuckDB and SQLite back-ends. More information is available on the GizmoSQL website.

Features

This adapter provides feature parity with dbt-duckdb for all features applicable to a remote Flight SQL connection:

Materializations

  • Table and View (SQL and Python)
  • Incremental with four strategies:
    • append -- simple record additions
    • delete+insert -- key-based upserts with DuckDB's DELETE...USING syntax
    • merge -- uses DuckDB's MERGE with UPDATE BY NAME / INSERT BY NAME
    • microbatch -- time-based batch processing via event_time windows
  • Snapshot (check and timestamp modes) using UPDATE+INSERT pattern
  • Schema change handling: ignore, append_new_columns, sync_all_columns, fail
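For example, a delete+insert incremental model with schema-change handling might look like this (model and column names are illustrative):

```sql
{{ config(
    materialized='incremental',
    incremental_strategy='delete+insert',
    unique_key='order_id',
    on_schema_change='append_new_columns'
) }}

select * from {{ ref('stg_orders') }}
{% if is_incremental() %}
  -- only reprocess rows changed since the last run
  where updated_at > (select max(updated_at) from {{ this }})
{% endif %}
```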

Python Models

Python models execute client-side using a local DuckDB instance for full API compatibility, then ship results to GizmoSQL via ADBC bulk ingest:

def model(dbt, session):
    dbt.config(materialized="table")
    df = dbt.ref("upstream_model")  # fetched from GizmoSQL as a local DuckDB relation
    df = df.filter("amount > 100")  # filtered client-side
    return df
  • Supports DuckDB relations, pandas DataFrames, and PyArrow Tables as return types
  • dbt.ref() and dbt.source() fetch data from GizmoSQL as Arrow and expose it as DuckDB relations
  • Incremental Python models supported (with proper dbt.is_incremental handling)
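A sketch of an incremental Python model (table and column names are hypothetical; it uses session.remote_sql(), described in the next section, to read the high-water mark from the server-side target):

```python
def model(dbt, session):
    dbt.config(materialized="incremental", incremental_strategy="append")
    df = dbt.ref("stg_events")  # hypothetical upstream model
    if dbt.is_incremental:
        # high-water mark comes from the existing target table on the server
        row = session.remote_sql(
            f"select max(event_ts) from {dbt.this}"
        ).fetchone()
        if row and row[0] is not None:
            df = df.filter(f"event_ts > TIMESTAMP '{row[0]}'")
    return df
```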

Server-side pushdown with session.remote_sql()

Because Python models run client-side, dbt.ref('big_table').filter(...) pulls the entire upstream table over the network before filtering locally. When you only need a small slice of a large server-side table, use session.remote_sql(query) to push the query down to the GizmoSQL server — the filter/aggregation runs server-side and only the result crosses the wire:

def model(dbt, session):
    dbt.config(materialized="table")
    schema = dbt.this.schema
    # Runs on the GizmoSQL server; only matching rows come back.
    return session.remote_sql(
        f"select * from {schema}.big_table where name = 'Joe'"
    )

remote_sql() returns a chainable local DuckDB relation, so you can combine it with the usual .filter(), .project(), .df(), pandas, etc. The rest of the session object behaves exactly like a local DuckDB connection (session.sql(...), session.register(...), …) — remote_sql is an additive escape hatch, not a replacement.
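For instance (schema and column names illustrative), a server-side aggregation can be post-processed locally:

```python
def model(dbt, session):
    dbt.config(materialized="table")
    # aggregation runs on the GizmoSQL server; only the small result crosses the wire
    daily = session.remote_sql(
        "select order_date, sum(amount) as revenue "
        "from analytics.orders group by order_date"
    )
    # further refinement happens locally on the returned DuckDB relation
    return daily.filter("revenue > 0")
```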

Seed Loading

Seeds are loaded using DuckDB's CSV reader on the client side with ADBC bulk ingest to the server:

  • Correct null handling (empty CSV fields become SQL NULL, not the string 'null')
  • Proper type inference (dates detected as DATE, integers as BIGINT, etc.)
  • Supports column_types overrides and custom delimiters
  • Significantly faster than dbt's default batch INSERT path
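Seed behavior is configured the usual dbt way; for example (project and column names illustrative), in dbt_project.yml:

```yaml
seeds:
  my_project:
    country_codes:
      +delimiter: ";"
      +column_types:
        iso_code: varchar
        population: bigint
```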

Constraints

All constraint types are enforced: CHECK, NOT NULL, UNIQUE, PRIMARY KEY, FOREIGN KEY.
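Constraints are declared through dbt's standard model contracts, e.g. (model and column names illustrative):

```yaml
models:
  - name: dim_customers
    config:
      contract:
        enforced: true
    columns:
      - name: customer_id
        data_type: bigint
        constraints:
          - type: not_null
          - type: primary_key
      - name: email
        data_type: varchar
        constraints:
          - type: unique
```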

Documentation

  • persist_docs support (COMMENT ON for relations and columns)
  • Full catalog generation with dbt docs generate

Utility Macros

DuckDB-compatible overrides for: dateadd, last_day, listagg, split_part.
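These are dispatched through dbt's standard cross-database macros, so models call them as usual (column names illustrative):

```sql
select
    order_id,
    {{ dbt.dateadd('day', 7, 'order_date') }} as due_date,
    {{ dbt.split_part('sku', "'-'", 1) }} as product_family
from {{ ref('stg_orders') }}
```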

Writing to External Files (server-side)

Because GizmoSQL is essentially a remote, server-side DuckDB, dbt-gizmosql supports the external materialization — models that are backed by Parquet, CSV, or JSON files rather than database tables — with the same config surface as dbt-duckdb.

The important distinction: the COPY runs on the GizmoSQL server, not on your dbt client. That's usually what you want. A production GizmoSQL deployment typically sits on a powerful cloud VM (lots of CPU, lots of RAM, fast local disks, a fat NIC, IAM role for blob storage). Pushing the write to that box is far faster than streaming a multi-GB result set back to the client just to write it out again — and the server's credentials/network topology are exactly what's needed to reach the destination.

{{ config(materialized='external', location='/data/warehouse/fact_orders.parquet') }}
select m.*, s.id is not null as has_source_id
from {{ ref('upstream_model') }} m
left join {{ source('upstream', 'source') }} s using (id)

Configuration

  • location (default: the external_location macro): server-side path (or S3/GCS/Azure URI) to write to; see below.
  • format (default: parquet): one of parquet, csv, json; inferred from the location extension when omitted.
  • delimiter (default: ,): field delimiter for CSV output.
  • options (default: {}): any other options for DuckDB's COPY statement, e.g. compression, partition_by, codec, per_thread_output.
  • parquet_read_options (default: {'union_by_name': False}): options passed to read_parquet() when building the read-side view.
  • csv_read_options (default: {'auto_detect': True}): options passed to read_csv().
  • json_read_options (default: {'auto_detect': True}): options passed to read_json().

If location is omitted, the file is written to {external_root}/{model_name}.{format}. Set external_root in your profile to control the default write location (local path or cloud URI):

my-gizmosql-db:
  target: dev
  outputs:
    dev:
      type: gizmosql
      host: gizmosql.prod.example.com
      port: 31337
      auth_type: external
      use_encryption: True
      external_root: "s3://my-warehouse/dbt-output"

The external_root is resolved on the GizmoSQL server, so any path/URI the server's DuckDB backend can reach works — local filesystem paths, s3://..., gs://..., azure://..., etc. Any credentials needed to write there live on the server, not on your dbt client.

After the write, dbt-gizmosql creates a view over the file via read_parquet / read_csv / read_json, so downstream models can ref() the external model like any other relation.
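Conceptually, for a Parquet model the adapter runs something like the following on the server (identifiers and paths illustrative, not the adapter's literal SQL):

```sql
COPY (select * from my_model__dbt_tmp)
TO 's3://my-warehouse/dbt-output/my_model.parquet' (FORMAT parquet);

CREATE OR REPLACE VIEW my_model AS
SELECT * FROM read_parquet('s3://my-warehouse/dbt-output/my_model.parquet');
```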

Partitioning example

{{ config(
    materialized='external',
    format='parquet',
    options={'partition_by': 'year, month', 'compression': 'zstd'}
) }}
select * from {{ ref('fact_events') }}

Notes and limitations

  • The directory referenced by external_root (or the parent directory of an explicit location) must already exist on the server — DuckDB's COPY ... TO '<file>' does not create parent directories for single-file writes. Cloud URIs like s3://bucket/prefix don't have this constraint.
  • Incremental strategies are not supported on external models — each run fully replaces the file(s).
  • dbt-duckdb's plugin / glue_register options are not supported: those are a client-side feature of dbt-duckdb with no analogue on the server. Setting either will produce a clear compile-time error.

Installation

Option 1 - from PyPI

# Create the virtual environment
python3 -m venv .venv

# Activate the virtual environment
. .venv/bin/activate

pip install --upgrade pip

python -m pip install dbt-core dbt-gizmosql

Option 2 - from source (for development)

git clone https://github.com/gizmodata/dbt-gizmosql

cd dbt-gizmosql

# Create the virtual environment
python3 -m venv .venv

# Activate the virtual environment
. .venv/bin/activate

# Upgrade pip, setuptools, and wheel
pip install --upgrade pip setuptools wheel

# Install the dbt GizmoSQL adapter - in editable mode with dev dependencies
pip install --editable ".[dev]"

Configuration

Profile setup

Add the following to your ~/.dbt/profiles.yml (change values to match your environment):

my-gizmosql-db:
  target: dev
  outputs:
    dev:
      type: gizmosql
      host: localhost
      port: 31337
      database: dbt
      user: [username]
      password: [password]
      use_encryption: True
      tls_skip_verify: True
      threads: 2

OAuth/SSO Authentication

For browser-based OAuth/SSO, use auth_type: external -- no username or password needed:

my-gizmosql-db:
  target: dev
  outputs:
    dev:
      type: gizmosql
      host: gizmosql.example.com
      port: 31337
      auth_type: external
      use_encryption: True
      threads: 2

Architecture

This adapter connects to GizmoSQL via Apache Arrow Flight SQL using the ADBC driver (adbc-driver-gizmosql). Key architectural decisions:

  • Autocommit mode: Each statement auto-commits immediately. Flight SQL's PREPARE phase validates against committed catalog state, so explicit transactions would cause DDL from earlier statements to be invisible to later ones.
  • Client-side DuckDB: Seeds and Python models use a local DuckDB instance for processing, with results shipped to the server via ADBC bulk ingest (Arrow columnar format over gRPC).
  • MERGE BY NAME: Incremental merges use DuckDB's UPDATE BY NAME / INSERT BY NAME syntax, which is resilient to column ordering differences.
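As a sketch, the merge generated for an incremental model resembles (relation names illustrative):

```sql
MERGE INTO fct_orders AS tgt
USING fct_orders__dbt_tmp AS src
  ON tgt.order_id = src.order_id
WHEN MATCHED THEN UPDATE BY NAME
WHEN NOT MATCHED THEN INSERT BY NAME;
```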

Versioning

This adapter follows semantic versioning. The major.minor version tracks dbt-core (e.g., dbt-core 1.11.x -> dbt-gizmosql 1.11.x).

Reporting bugs and contributing code

  • Want to report a bug or request a feature? Open an issue
  • Want to contribute? Pull requests are welcome

Code of Conduct

Everyone interacting in the dbt project's codebases, issue trackers, chat rooms, and mailing lists is expected to follow the dbt Code of Conduct.



Download files


Source Distribution

dbt_gizmosql-1.11.15.tar.gz (30.5 kB, source distribution)

Built Distribution


dbt_gizmosql-1.11.15-py3-none-any.whl (30.3 kB, Python 3 wheel)

File details

Details for the file dbt_gizmosql-1.11.15.tar.gz.

File metadata

  • Download URL: dbt_gizmosql-1.11.15.tar.gz
  • Size: 30.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for dbt_gizmosql-1.11.15.tar.gz
Algorithm Hash digest
SHA256 28cfc7820f0a4ee04b3581dd0fe54c81825a0492ee6ac0d5b0eb51173b8f73cc
MD5 78eadbe7f9415c058a462132f0db5af6
BLAKE2b-256 73566e78602f2a4aaa17cfee15cb56d30058dd343c0812265448be12712bd538


Provenance

The following attestation bundles were made for dbt_gizmosql-1.11.15.tar.gz:

Publisher: ci.yml on gizmodata/dbt-gizmosql

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file dbt_gizmosql-1.11.15-py3-none-any.whl.

File metadata

  • Download URL: dbt_gizmosql-1.11.15-py3-none-any.whl
  • Size: 30.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for dbt_gizmosql-1.11.15-py3-none-any.whl
Algorithm Hash digest
SHA256 818cdd1d44041a5171df04d81f200993feb601b35fbb0161d158df8d6b309635
MD5 ea246539efe6742ee9a9a70847f096c7
BLAKE2b-256 9d1cf4a0883a3d79b3c3d1c1e38db59b51fc349b2e5e39ffef5c0fab040d15b4


Provenance

The following attestation bundles were made for dbt_gizmosql-1.11.15-py3-none-any.whl:

Publisher: ci.yml on gizmodata/dbt-gizmosql

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
