Skip to main content

Lakehouse management, orchestration and data processing tool.

Project description

dadude

Lakehouse management and orchestration tool.

Status

  • The ultimate target lakehouse is the unity catalog, which is to provide benefits from both deltalake and iceberg.
  • To support writing data to the lakehouse, we currently use the deltalake-rs python bindings.
  • For testing purposes, we use the minio instance at http://192.168.18.206:9000.

Installation

This package is published to the PyPI repository, so you can install it using pip anywhere:

pip install dadude

Usage

First export the following environment variables:

export STORAGE_ACCESS_KEY_ID=xxx STORAGE_SECRET_ACCESS_KEY=xxx STORAGE_ENDPOINT_URL=http://192.168.18.206:9000

Then you can use the dadude module CLI to interact with the lakehouse:

python -m dadude.cli.read_table staging <table_name> --save_dir <save_dir>
python -m dadude.cli.write_table write_json_table --local_json_file_path data/silver/xxx.json

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dadude-0.3.4-py3-none-any.whl (21.8 kB view details)

Uploaded Python 3

File details

Details for the file dadude-0.3.4-py3-none-any.whl.

File metadata

  • Download URL: dadude-0.3.4-py3-none-any.whl
  • Upload date:
  • Size: 21.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.1

File hashes

Hashes for dadude-0.3.4-py3-none-any.whl
Algorithm Hash digest
SHA256 4b9d8983695079e6f44cd487ad6b16b02051f3c8f6d5605a4c1c481ac662b9c7
MD5 d716b8a7748303b4828dbe04df9b56bb
BLAKE2b-256 bec527ed80555d3e4ef59d6e16643cb7fa67de95aa694316caf045f09139f634

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page