duckrun

A dbt adapter that runs SQL in DuckDB and materializes to Delta Lake (delta_rs).

duckrun is a dbt adapter that runs your model SQL in DuckDB and writes the results to Delta Lake using delta_rs (the deltalake Python package).

It is a thin wrapper around dbt-duckdb. You keep everything dbt-duckdb gives you (views, seeds, sources, tests, snapshots, the full plugin ecosystem) and gain one extra thing: Delta-backed table and incremental materializations that write real Delta tables.

Why a separate adapter instead of a PR to dbt-duckdb?

Writing Delta with delta_rs needs the deltalake package. dbt-duckdb deliberately keeps a minimal dependency footprint and avoids external dependencies like this — for very good reasons — so this doesn't belong upstream. duckrun keeps it isolated here instead.

It's also meant to be a temporary workaround: DuckDB is gaining native Delta write support, and once that matures the delta_rs hop should no longer be needed. Until then, this adapter fills the gap.

0.3.0 is a breaking change

Versions ≤ 0.2.x of duckrun were a Microsoft Fabric / OneLake helper library. From 0.3.0 onward duckrun is a dbt adapter. Need the old library? Pin it with pip install "duckrun<0.3", or use the legacy branch.

How it fits together

DuckDB is a great query engine, Delta Lake is a great open table format, and dbt is the right tool to orchestrate the DAG. duckrun wires the three together:

DuckDB executes · delta_rs materializes · dbt orchestrates.

Install

pip install duckrun

That single install pulls in dbt-duckdb (and therefore duckdb) plus deltalake.

Configure your profile

# ~/.dbt/profiles.yml
my_project:
  target: dev
  outputs:
    dev:
      type: duckrun
      # No `threads:` needed — duckrun always runs single-threaded (see Limitations).
      # DuckDB runs in-memory by default — the Delta tables are the only state.
      # Default Delta location for models that don't set config(location=...).
      root_path: './warehouse'   # local path, or abfss://.../Tables, s3://..., gs://...
      # storage_options: {}      # passed through to deltalake for remote stores

Persisted models are written to <root_path>/<schema>/<model> (e.g. ./warehouse/dbo/orders), or to an explicit config(location=...).
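
As a rough illustration of the path rule above, the following sketch composes a model's Delta location from the profile's root_path, the schema, and the model name, and lets an explicit location take precedence. The helper name resolve_location is hypothetical and is not part of duckrun; the real adapter may normalize paths differently.

```python
from typing import Optional


def resolve_location(root_path: str, schema: str, model: str,
                     location: Optional[str] = None) -> str:
    """Return the Delta path for a model (illustrative helper, not duckrun's API)."""
    if location:
        # An explicit config(location=...) wins over the profile default.
        return location
    # Join with "/" so the same rule works for local paths and for URIs
    # such as abfss://, s3:// or gs://.
    return "/".join([root_path.rstrip("/"), schema, model])


print(resolve_location("./warehouse", "dbo", "orders"))  # ./warehouse/dbo/orders
```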

Remote stores (Fabric OneLake / ADLS / S3 / GCS)

Point root_path at the warehouse location and pass credentials through storage_options — these flow straight to deltalake for writes and merges.

If storage_options carries a bearer_token (or token / access_token), the adapter also auto-creates a matching DuckDB Azure secret, so delta_scan() reads work with no extra config. In a notebook where the storage secret is already provided to DuckDB, you can leave storage_options empty.

    onelake:
      type: duckrun
      schema: dbo
      root_path: "abfss://<workspace>@onelake.dfs.fabric.microsoft.com/<lakehouse>.Lakehouse/Tables"
      storage_options:
        # az account get-access-token --resource https://storage.azure.com
        bearer_token: "{{ env_var('ONELAKE_TOKEN') }}"

Verified end-to-end against real Fabric OneLake: table overwrite, incremental merge, and delta_scan reads / tests.
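
The token lookup described above can be pictured with a small sketch. It only shows which storage_options keys the text names as token carriers; the helper name find_bearer_token and the order in which the keys are checked are assumptions, not duckrun's actual code.

```python
from typing import Mapping, Optional

# Keys the README names as carrying a bearer token.
# The lookup order below is an assumption made for this sketch.
TOKEN_KEYS = ("bearer_token", "token", "access_token")


def find_bearer_token(storage_options: Mapping[str, str]) -> Optional[str]:
    """Return the first non-empty token found, or None (hypothetical helper)."""
    for key in TOKEN_KEYS:
        value = storage_options.get(key)
        if value:
            return value
    return None
```

When the helper returns None, the situation matches the notebook case in the text: no token is supplied, and DuckDB is expected to already hold the storage secret.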

Materializations

materialized	backed by	notes
`table`	Delta (overwrite)	DuckDB runs the SQL; delta_rs writes the table fresh each run.
`incremental`	Delta (merge / append)	First run overwrites; later runs apply `incremental_strategy`.
`view`	in-memory DuckDB	Ephemeral staging within a run (inherited from dbt-duckdb).
`seed`	in-memory DuckDB	CSV fixtures (inherited from dbt-duckdb).
`delta`	Delta	Alias for `table`; honors `incremental=true`. Kept for convenience.

The persisted materializations (table, incremental, delta) register a delta_scan view over the new Delta table, so downstream ref() works.
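
To make the registration step concrete, here is a minimal sketch of the kind of statement that could expose a Delta table to downstream models. delta_scan is the DuckDB table function named above; the helper name and the exact CREATE OR REPLACE VIEW form are assumptions, and the statement duckrun actually issues may differ.

```python
def delta_scan_view_sql(relation: str, delta_path: str) -> str:
    """Build a view definition over a Delta table (illustrative only)."""
    # Escape single quotes so the path stays a valid SQL string literal.
    escaped = delta_path.replace("'", "''")
    # The relation name is used as given; quoting identifiers is left out
    # of this sketch.
    return (
        f"CREATE OR REPLACE VIEW {relation} AS "
        f"SELECT * FROM delta_scan('{escaped}')"
    )


print(delta_scan_view_sql("dbo.orders", "./warehouse/dbo/orders"))
```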

`table`

-- models/orders.sql
{{ config(materialized='table') }}

select status, count(*) as n, sum(amount) as total
from {{ ref('stg_orders') }}
group by status

`incremental`

{{ config(materialized='incremental', unique_key='order_id', incremental_strategy='merge') }}

select * from {{ ref('stg_orders') }}
{% if is_incremental() %}
  where updated_at > (select max(updated_at) from {{ this }})
{% endif %}

The first run (or --full-refresh, or a missing table) overwrites. Later runs apply the incremental_strategy:

`incremental_strategy`	behavior	requires
`merge` (default with `unique_key`)	upsert — update matched, insert new	`unique_key`
`insert`	insert only new keys (idempotent append)	`unique_key`
`append` (default without `unique_key`)	blind append	—
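
The difference between the three strategies is easiest to see on a few rows. The sketch below models the semantics from the table on in-memory lists of dicts; it is not how duckrun or delta_rs implement the write, and the function names are made up for illustration.

```python
from typing import Dict, List, Optional

Row = Dict[str, object]


def default_strategy(unique_key: Optional[str]) -> str:
    # Per the table above: merge when a unique_key is set, append otherwise.
    return "merge" if unique_key else "append"


def apply_strategy(target: List[Row], source: List[Row],
                   strategy: str, unique_key: Optional[str] = None) -> List[Row]:
    """Return the rows the target would hold after one incremental run."""
    if strategy not in ("merge", "insert", "append"):
        raise ValueError(f"unknown incremental_strategy: {strategy!r}")
    if strategy == "append":
        # Blind append: no key check, so duplicates are possible.
        return target + source
    if unique_key is None:
        raise ValueError(f"{strategy!r} requires unique_key")
    result = [dict(row) for row in target]
    index = {row[unique_key]: i for i, row in enumerate(result)}
    for row in source:
        key = row[unique_key]
        if key in index:
            if strategy == "merge":
                # Matched: update the existing row.
                result[index[key]] = dict(row)
            # "insert" leaves matched rows untouched.
        else:
            # Not matched: both merge and insert add the new row.
            index[key] = len(result)
            result.append(dict(row))
    return result


target = [{"order_id": 1, "status": "placed"}]
source = [{"order_id": 1, "status": "shipped"},
          {"order_id": 2, "status": "placed"}]
print(apply_strategy(target, source, "merge", "order_id"))
print(apply_strategy(target, source, "insert", "order_id"))
print(apply_strategy(target, source, "append"))
```

With these rows, merge updates order 1 and adds order 2, insert adds only order 2, and append ends up with three rows, including a duplicate of order 1.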

Config options (`table` / `incremental` / `delta`)

option	description
`location`	Delta path. Defaults to `<root_path>/<schema>/<id>`.
`incremental_strategy`	`merge` | `insert` | `append` (incremental only).
`unique_key`	column(s) to merge on.
`merge_update_columns`	merge: update only these columns on match (others untouched).
`merge_exclude_columns`	merge: update all columns except these on match.
`incremental_predicates`	merge: extra predicates AND-ed into the merge condition (use `target.`/`source.`, or dbt's `DBT_INTERNAL_DEST`/`DBT_INTERNAL_SOURCE`).
`on_schema_change`	`ignore` (default) | `append_new_columns` | `fail`. (`sync_all_columns` only adds columns; delta_rs can't drop them.)
`partition_by`	Delta partition column(s).
`merge_schema`	allow schema evolution on write.
`storage_options`	per-model override forwarded to deltalake.
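
The merge-related options combine in a fairly mechanical way, which the following sketch tries to show: unique_key and incremental_predicates form the merge condition, while merge_update_columns and merge_exclude_columns narrow which columns are updated on a match. Both helper names are hypothetical, the target/source aliases follow the table above, and treating the two column options as mutually exclusive is an assumption. Rewriting dbt's DBT_INTERNAL_DEST / DBT_INTERNAL_SOURCE aliases is left out.

```python
from typing import List, Optional, Sequence, Union


def build_merge_predicate(unique_key: Union[str, Sequence[str]],
                          incremental_predicates: Sequence[str] = ()) -> str:
    """AND together key equality and any extra predicates (illustrative)."""
    keys = [unique_key] if isinstance(unique_key, str) else list(unique_key)
    clauses = [f"target.{k} = source.{k}" for k in keys]
    # Parenthesize each extra predicate so its own AND/OR cannot leak out.
    clauses.extend(f"({p})" for p in incremental_predicates)
    return " AND ".join(clauses)


def columns_to_update(all_columns: Sequence[str],
                      merge_update_columns: Optional[Sequence[str]] = None,
                      merge_exclude_columns: Optional[Sequence[str]] = None) -> List[str]:
    """Pick the columns a matched row would have updated (illustrative)."""
    if merge_update_columns and merge_exclude_columns:
        # Assumption for this sketch: the two options are mutually exclusive.
        raise ValueError("set only one of merge_update_columns / merge_exclude_columns")
    if merge_update_columns:
        return [c for c in all_columns if c in merge_update_columns]
    if merge_exclude_columns:
        return [c for c in all_columns if c not in merge_exclude_columns]
    return list(all_columns)


print(build_merge_predicate("order_id", ["target.updated_at >= '2026-01-01'"]))
print(columns_to_update(["order_id", "status", "amount"],
                        merge_exclude_columns=["order_id"]))
```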

Reading existing Delta tables as sources

sources:
  - name: lake
    tables:
      - name: customers
        meta:
          plugin: duckrun
          delta_table_path: 's3://bucket/lake/customers'

How it works

1. dbt compiles your model SQL.
2. The materialization stages it as a DuckDB view.
3. A dbt-duckdb plugin (a store() hook) hands that relation to deltalake over the Arrow C-stream interface (__arrow_c_stream__), with no pyarrow required; write_deltalake and DeltaTable.merge consume it natively.
4. The model relation becomes a delta_scan view over the new Delta table.

The adapter is a thin subclass of dbt-duckdb declaring dependencies=['duckdb'], so view, seed, tests, and the rest are inherited directly; only table and incremental are overridden to write Delta.

Limitations

Single-threaded (enforced). duckrun's delta_rs write path isn't thread-safe: parallel models would collide on the shared DuckDB connection. The adapter therefore pins the run to one thread, overriding any threads: value set in the profile. There's nothing to configure. This suits duckrun's intended use (incremental Delta builds on DuckDB); large-scale concurrent workloads are out of scope and are better served by Spark.

Development

The integration_tests/ directory is a small dbt project exercised by CI (.github/workflows/integration.yml): dbt build runs twice against a local Delta ./warehouse — a seed, a view, a table, and an incremental model — where the second build exercises the incremental merge. Verified to run with pyarrow not installed, on the minimum supported duckdb and deltalake.

jaffle_shop/ is a self-contained build of the canonical dbt-labs jaffle shop project on duckrun, run by .github/workflows/jaffle.yml as a gating end-to-end test over a local Delta warehouse. It seeds the classic data, builds staging views → a dim_customers Delta table → an incremental fct_orders, then drives a two-pass merge: pass 1 lands the 99 base orders, pass 2 applies a late-arriving batch (a restated order plus two new ones) and singular tests assert the Delta merge upserted correctly (right row count, the existing order UPDATEd, the new orders INSERTed). It's industry-standard and recognisable, and — unlike the conformance report — fails the build on a merge regression. It shares no files with integration_tests/.

tests/conformance/ runs the official dbt adapter test suite (dbt-tests-adapter) against duckrun (.github/workflows/conformance.yml, results card in the job summary). It runs single-threaded (threads: 1) — see Limitations — as is normal for adapter conformance suites (e.g. dbt-iceberg does the same).

License

MIT

Project details

Maintainers

mimoune.djouallah

Release history

0.3.24

Jun 23, 2026

0.3.23

Jun 23, 2026

0.3.22

Jun 23, 2026

0.3.21

Jun 22, 2026

0.3.20

Jun 22, 2026

0.3.19

Jun 21, 2026

0.3.18

Jun 21, 2026

0.3.17

Jun 21, 2026

0.3.17.dev7 pre-release

Jun 18, 2026

0.3.17.dev6 pre-release

Jun 17, 2026

0.3.17.dev5 pre-release

Jun 15, 2026

0.3.17.dev4 pre-release

Jun 15, 2026

0.3.17.dev3 pre-release

Jun 14, 2026

0.3.17.dev2 pre-release

Jun 14, 2026

0.3.17.dev1 pre-release

Jun 14, 2026

0.3.16

Jun 12, 2026

0.3.15

Jun 11, 2026

0.3.14

Jun 10, 2026

0.3.13

Jun 9, 2026

0.3.12

Jun 8, 2026

0.3.11

Jun 8, 2026

0.3.9

Jun 8, 2026

0.3.8

Jun 8, 2026

0.3.7

Jun 7, 2026

0.3.6

Jun 7, 2026

0.3.5

Jun 7, 2026

0.3.4

Jun 7, 2026

0.3.3

Jun 6, 2026

0.3.2

Jun 6, 2026

This version

0.3.1

Jun 6, 2026

0.3.0

Jun 5, 2026

0.2.26

Jan 12, 2026

0.2.25

Jan 12, 2026

0.2.24

Jan 12, 2026

0.2.23

Jan 12, 2026

0.2.22.dev2 pre-release

Nov 24, 2025

0.2.22.dev0 pre-release

Nov 22, 2025

0.2.21

Nov 21, 2025

0.2.21.dev2 pre-release

Nov 21, 2025

0.2.21.dev1 pre-release

Nov 21, 2025

0.2.20

Nov 17, 2025

0.2.20.dev5 pre-release

Nov 16, 2025

0.2.20.dev3 pre-release

Nov 16, 2025

0.2.20.dev2 pre-release

Nov 15, 2025

0.2.20.dev1 pre-release

Nov 15, 2025

0.2.20.dev0 pre-release

Nov 15, 2025

0.2.19

Nov 12, 2025

0.2.19.dev8 pre-release

Nov 11, 2025

0.2.19.dev7 pre-release

Nov 10, 2025

0.2.19.dev6 pre-release

Nov 10, 2025

0.2.19.dev5 pre-release

Nov 10, 2025

0.2.19.dev4 pre-release

Nov 10, 2025

0.2.19.dev3 pre-release

Nov 9, 2025

0.2.19.dev2 pre-release

Nov 9, 2025

0.2.19.dev1 pre-release

Nov 5, 2025

0.2.19.dev0 pre-release

Nov 5, 2025

0.2.18

Nov 4, 2025

0.2.18.dev5 pre-release

Nov 3, 2025

0.2.18.dev4 pre-release

Nov 3, 2025

0.2.18.dev3 pre-release

Nov 2, 2025

0.2.18.dev2 pre-release

Nov 2, 2025

0.2.18.dev1 pre-release

Nov 2, 2025

0.2.17

Nov 1, 2025

0.2.16.dev2 pre-release

Nov 1, 2025

0.2.16.dev1 pre-release

Nov 1, 2025

0.2.16.dev0 pre-release

Nov 1, 2025

0.2.15

Oct 31, 2025

0.2.14.dev40 pre-release

Oct 31, 2025

0.2.14.dev3 pre-release

Oct 30, 2025

0.2.14.dev2 pre-release

Oct 30, 2025

0.2.14.dev1 pre-release

Oct 30, 2025

0.2.14.dev0 pre-release

Oct 30, 2025

0.2.13

Oct 22, 2025

0.2.13.dev0 pre-release

Oct 22, 2025

0.2.12

Oct 21, 2025

0.2.11

Oct 21, 2025

0.2.11.dev0 pre-release

Oct 21, 2025

0.2.10

Oct 16, 2025

0.2.9

Oct 15, 2025

0.2.7

Oct 14, 2025

0.2.6

Oct 14, 2025

0.2.4

Oct 9, 2025

0.2.3

Oct 9, 2025

0.2.2

Oct 8, 2025

0.2.1

Oct 7, 2025

0.2.0

Oct 7, 2025

0.1.9

Oct 6, 2025

0.1.8

Oct 6, 2025

0.1.7

Oct 5, 2025

0.1.6.3

Oct 5, 2025

0.1.6.2

Oct 5, 2025

0.1.6.1

Oct 5, 2025

0.1.6

Oct 5, 2025

0.1.5.6

Oct 5, 2025

0.1.5.5

Oct 5, 2025

0.1.5.4

Oct 5, 2025

0.1.5.3

Oct 5, 2025

0.1.5.2

Oct 4, 2025

0.1.5.1

Oct 4, 2025

0.1.5

Oct 4, 2025

0.1.4

Oct 4, 2025

0.1.3

Oct 4, 2025

0.1.2

Oct 4, 2025

0.1.1

Oct 3, 2025

0.0.0

Oct 3, 2025

Download files

Download the file for your platform.

Source Distribution

duckrun-0.3.1.tar.gz (25.5 kB)

Uploaded Jun 6, 2026 Source

Built Distribution

duckrun-0.3.1-py3-none-any.whl (23.3 kB)

Uploaded Jun 6, 2026 Python 3

File details

Details for the file duckrun-0.3.1.tar.gz.

File metadata

Download URL: duckrun-0.3.1.tar.gz
Upload date: Jun 6, 2026
Size: 25.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for duckrun-0.3.1.tar.gz
Algorithm	Hash digest
SHA256	`6a355debc2ca236c0c6f74461c0b04337b613d55424f56ff4d270c7cd93da68f`
MD5	`2adafd4879b279caa6e47a5f053cab7c`
BLAKE2b-256	`174ad039ba9e8fda91d62fae2f936bb13bb60e6a78ee4393784136c9de755de8`
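
One way to use the digests above is to hash the downloaded file locally and compare the result with the published SHA256 value. The sketch below uses only Python's standard hashlib; the helper names are made up for illustration, and the file path is whatever location you downloaded the archive to.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large archives need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    # Normalize case and whitespace before comparing hex digests.
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For the source distribution, the call would be matches_published_hash("duckrun-0.3.1.tar.gz", "6a355debc2ca236c0c6f74461c0b04337b613d55424f56ff4d270c7cd93da68f"), assuming the file sits in the current directory.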

Provenance

The following attestation bundles were made for duckrun-0.3.1.tar.gz:

Publisher: publish.yml on djouallah/duckrun

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: duckrun-0.3.1.tar.gz
- Subject digest: 6a355debc2ca236c0c6f74461c0b04337b613d55424f56ff4d270c7cd93da68f
- Sigstore transparency entry: 1738841483
- Sigstore integration time: Jun 6, 2026
Source repository:
- Permalink: djouallah/duckrun@b2a1b5e65dac58fa432a69f301faa7d66e41c667
- Branch / Tag: refs/tags/v0.3.1
- Owner: https://github.com/djouallah
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@b2a1b5e65dac58fa432a69f301faa7d66e41c667
- Trigger Event: push

File details

Details for the file duckrun-0.3.1-py3-none-any.whl.

File metadata

Download URL: duckrun-0.3.1-py3-none-any.whl
Upload date: Jun 6, 2026
Size: 23.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for duckrun-0.3.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2d5f4bee0a3a88111d5f112489b6637db923b8ef8a51358c53283913ab9a1d26`
MD5	`cee0618bee1eec8107651b0a46080baf`
BLAKE2b-256	`494d6f5f0574edcd278775fada4caa4fd4e9a7bf28e50b2bd436e2b3b3d4541d`

Provenance

The following attestation bundles were made for duckrun-0.3.1-py3-none-any.whl:

Publisher: publish.yml on djouallah/duckrun

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: duckrun-0.3.1-py3-none-any.whl
- Subject digest: 2d5f4bee0a3a88111d5f112489b6637db923b8ef8a51358c53283913ab9a1d26
- Sigstore transparency entry: 1738841511
- Sigstore integration time: Jun 6, 2026
Source repository:
- Permalink: djouallah/duckrun@b2a1b5e65dac58fa432a69f301faa7d66e41c667
- Branch / Tag: refs/tags/v0.3.1
- Owner: https://github.com/djouallah
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@b2a1b5e65dac58fa432a69f301faa7d66e41c667
- Trigger Event: push
