6 projects
data-time-machine
A git-like state management system for data pipelines
dag-cost-tracker
A DAG cost tracking plugin for Apache Airflow
audio-transcript-cli
A simple CLI tool to transcribe large audio files using OpenAI Whisper with chunking support.
csv-json-schema-sync
A tool for data engineers to manage continuously evolving CSV/JSON schemas.
pyviz-tutor
A CLI tool to visualize Python code execution locally.
copydata
CLI to compare two tabular datasets and produce a concise markdown report