Skip to main content

Advanced Delta-Lake related tooling based on Apache Spark

Reason this release was yanked:

sorry folks, this is the only time this will happen, promise.

Project description

hydro 💧

main

hydro is a collection of Python-based Apache Spark and Delta Lake tooling.

See Key Functionality for concrete use cases.

Installation

pip install delta-hydro

Docs 📖

https://christophergrant.github.io/delta-hydro

Key Functionality 🔑

Contributions ✨

Contributions are welcome.

However, please create an issue before starting work on a feature to make sure that it aligns with the future of the project.

Naming 🤓

Originally this project was going to be hydrologist but that's way too long and pretentious, so we shortened to hydro.

A hydrologist is a person who studies water and its movement. Delta Lake, Data Lake, Lakehouse => water.

ChatGPT and LLMs 🤖

Some of this project's code was generated by a Large Language Model(LLM), namely ChatGPT.

We are proud prompt engineers, so we display the prompt that gave us the code in hydro's source (example).

Our take is that the model is very impressive, but not sophisticated enough to be able to write this whole program (yet). A lot of this stuff is very context-dependent and would be difficult to explain to an AI. Plus, ChatGPT isn't aware of newer APIs as it was trained on an older corpus.

We are excited for the future of humanity given recent advancements in artificial intelligence and hope that the technology is used to liberate, rather than accelerate.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

delta_hydro-0.2.1.tar.gz (9.4 kB view hashes)

Uploaded Source

Built Distribution

delta_hydro-0.2.1-py2.py3-none-any.whl (9.8 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page