A CSV reconciliation tool and report generator
Reconciliation CLI tool
1. Introduction
This document describes how to use the reconcile command-line tool to compare two CSV files and generate a report highlighting missing records and discrepancies between them. The tool can be helpful for data validation, identifying discrepancies in data pipelines, and ensuring data consistency across multiple sources.
2. Prerequisites
- Python 3.6+
- click package installed (pip install click)
- Pandas library installed (pip install pandas)
- Levenshtein library installed (pip install Levenshtein)
2.1 Installing
pip3 install reconciii
3. Running the Tool
3.1 Required arguments:
- -s or --source: Path to the source CSV file.
- -t or --target: Path to the target CSV file.
- -o or --output: Path to the output CSV file containing the reconciliation report (default: "reconciliation_report.csv").
3.2 Optional arguments:
- -c or --comparison-columns: List of additional columns to compare for discrepancies (default: all columns except "ID").
3.3 Example usage:
reconcile -s data/source.csv -t data/target.csv -o report.csv -c column1,column2
This command compares the source file data/source.csv with the target file data/target.csv, writes the reconciliation report to report.csv, and checks columns column1 and column2 for discrepancies in addition to the default "ID" column.
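To make the argument handling above concrete, here is a minimal sketch of how the same options could be declared and parsed. It is not the package's actual source: it uses the standard library's argparse instead of click so that it runs without extra dependencies, and the helper names parse_columns and build_parser are hypothetical. The comma-separated form of -c is taken from the example command.

```python
import argparse


def parse_columns(raw):
    """Split a comma-separated option value into a list of column names."""
    return [name.strip() for name in raw.split(",") if name.strip()]


def build_parser():
    """Declare the options documented in section 3 (illustrative only)."""
    parser = argparse.ArgumentParser(prog="reconcile")
    parser.add_argument("-s", "--source", required=True,
                        help="Path to the source CSV file.")
    parser.add_argument("-t", "--target", required=True,
                        help="Path to the target CSV file.")
    parser.add_argument("-o", "--output", default="reconciliation_report.csv",
                        help="Path to the output CSV report.")
    # Assumption: None stands for "all columns except ID".
    parser.add_argument("-c", "--comparison-columns", type=parse_columns,
                        default=None,
                        help="Comma-separated columns to compare.")
    return parser


# Parse the arguments from the example command in section 3.
args = build_parser().parse_args(
    ["-s", "data/source.csv", "-t", "data/target.csv",
     "-o", "report.csv", "-c", "column1,column2"]
)
print(args.source, args.target, args.output, args.comparison_columns)
```

Omitting -o in this sketch falls back to reconciliation_report.csv, and omitting -c leaves the column list unset so that a caller can apply the documented default.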
4. Output Report
The generated report will be a CSV file with the following columns:
- Type: Indicates the type of discrepancy (missing in source, missing in target, field discrepancy).
- Record Identifier: ID of the record where the discrepancy was found.
- Field: (optional) Specific field where the discrepancy was found (applicable for field discrepancies).
- Source Value: Value of the field in the source file.
- Target Value: Value of the field in the target file.
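As an illustration of how a report with these columns could be assembled, the sketch below reconciles two small in-memory CSV documents. It is an assumption-laden example rather than the tool's implementation: it relies on the standard library's csv module instead of pandas, the function names load_by_id and reconcile_rows are hypothetical, the sample data is invented, and the exact wording of the Type labels is a guess.

```python
import csv
import io

REPORT_HEADER = ["Type", "Record Identifier", "Field", "Source Value", "Target Value"]


def load_by_id(csv_text, id_column="ID"):
    """Index the rows of a CSV document by the value of their ID column."""
    return {row[id_column]: row for row in csv.DictReader(io.StringIO(csv_text))}


def reconcile_rows(source, target, columns=None, id_column="ID"):
    """Return report rows describing missing records and field discrepancies."""
    report = []
    # Records present in only one of the two files.
    for record_id in sorted(source.keys() - target.keys()):
        report.append(["Missing in Target", record_id, "", "", ""])
    for record_id in sorted(target.keys() - source.keys()):
        report.append(["Missing in Source", record_id, "", "", ""])
    # Records present in both files: compare them field by field.
    for record_id in sorted(source.keys() & target.keys()):
        src, tgt = source[record_id], target[record_id]
        fields = columns or [name for name in src if name != id_column]
        for field in fields:
            if src.get(field) != tgt.get(field):
                report.append(["Field Discrepancy", record_id, field,
                               src.get(field), tgt.get(field)])
    return report


# Invented sample data, for illustration only.
source_csv = "ID,Name,Amount\n1,Alice,100\n2,Bob,200\n3,Carol,300\n"
target_csv = "ID,Name,Amount\n1,Alice,100\n2,Bob,250\n4,Dave,400\n"

report = reconcile_rows(load_by_id(source_csv), load_by_id(target_csv))

# Write the report as CSV text with the documented column names.
buffer = io.StringIO()
writer = csv.writer(buffer)
writer.writerow(REPORT_HEADER)
writer.writerows(report)
print(buffer.getvalue())
```

With this sample data, record 3 is reported as missing in the target, record 4 as missing in the source, and record 2 as a field discrepancy in Amount (200 versus 250). Field, Source Value, and Target Value are left empty for missing records, which matches the note that Field is optional.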