3 projects
fastrecon
High-performance reconciliation engine for SQL tables, queries, CSV, Parquet, JSON, Excel, and fixed-width files using DuckDB, Polars, and Arrow.
informatica-python
Convert Informatica PowerCenter workflow XML to Python/PySpark code
informatica-sparker
Framework to convert Informatica PowerCenter XML exports to PySpark code for Databricks. Auto-detects sources (SQL, CSV, Parquet, XML, JSON, text, DAT, files without extensions) and generates complete deployment packages.