Skip to main content

A utility to export DuckLake database metadata to Delta Lake transaction logs.

Project description

🦆 DuckLake Delta Exporter

A Python utility to bridge the gap between DuckLake and Delta Lake by generating Delta-compatible transaction logs directly from DuckLake metadata.

This isn’t your typical general-purpose library. It’s mostly battle-tested with OneLake mounted storage, and while it should work with local filesystems, there’s no support for S3, GCS, or ABFSS .

It doesn’t use the deltalake Python package either. The metadata is handcrafted from scratch — because why not reinvent the wheel for fun and learning?

Goal?
Mostly to annoy DuckDB developers into finally shipping a proper Delta Lake metadata exporter 😎

🔗 Source code on GitHub


✨ Features

  • DuckLake → Delta Sync
    Generates Delta Lake _delta_log/*.json transaction files and Parquet checkpoints from the latest DuckLake state.

  • Schema Mapping
    Converts DuckDB types to their Spark SQL equivalents so Delta can understand them without throwing a tantrum.

  • Change Detection
    Detects file-level additions/removals since the last export — keeps things incremental and tidy.

  • Checkpointing
    Automatically writes Delta checkpoints every N versions (configurable), so readers don’t have to replay the entire log from scratch.


⚙️ Installation & Usage

Install via pip:

pip install ducklake-delta-exporter
from ducklake_delta_exporter import generate_latest_delta_log

generate_latest_delta_log('/lakehouse/default/Files/meta.db')

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ducklake_delta_exporter-0.1.1.tar.gz (8.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ducklake_delta_exporter-0.1.1-py3-none-any.whl (8.1 kB view details)

Uploaded Python 3

File details

Details for the file ducklake_delta_exporter-0.1.1.tar.gz.

File metadata

  • Download URL: ducklake_delta_exporter-0.1.1.tar.gz
  • Upload date:
  • Size: 8.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.9

File hashes

Hashes for ducklake_delta_exporter-0.1.1.tar.gz
Algorithm Hash digest
SHA256 24f929455c1492a300a2eacda1f0010a7c440b7326ca7b33f4f3e70b0072ceda
MD5 e62b8cf89a12e46a981b0949011bb7e1
BLAKE2b-256 b9188545d8e026ec08ef865e5cbbcd015931ece2f78e168925598aa3eb623578

See more details on using hashes here.

File details

Details for the file ducklake_delta_exporter-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for ducklake_delta_exporter-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 28956670d5c15371927f6a75a59ed7d807e03b81e2d90f0c582d988d89319320
MD5 6cf44854d06e71fc007adcbbf76ccf84
BLAKE2b-256 d8b7b0e1f8eda8992806fd264bc56147b115b611a5424f158c630decedf206e2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page