3 projects
pyspark-data-toolkit
Modular toolkit for Data Engineering with PySpark and Delta Lake — schema management, auditing, profiling, normalization, JSON handling, window functions, and Delta Lake operations.
logging-metrics
Advanced logging utilities for robust, standardized logs in Python projects, including APIs and data engineering pipelines.
file-toolkit
file-toolkit is a suite of utilities for manipulating, organizing, and monitoring files and directories in Python. It offers functions for copying, moving, synchronizing, compressing, hashing, advanced searching, and temporary file management, with a focus on productivity, security, and auditing.