A CSV reconciliation tool and report generator

Reconciliation CLI tools

1. Introduction

This document describes how to use the reconcile command-line tool to compare two CSV files and generate a report of missing records and field-level discrepancies between them. The tool can help with data validation, with finding discrepancies in data pipelines, and with checking data consistency across multiple sources.

2. Prerequisites

  • Python 3.6+
  • click package installed (pip install click)
  • Pandas library installed (pip install pandas)
  • Levenshtein library installed (pip install Levenshtein)

2.1 Installing

    pip3 install reconciii

3. Running the Tool

3.1 Required arguments:

  • -s or --source: Path to the source CSV file.
  • -t or --target: Path to the target CSV file.
  • -o or --output: Path to the output CSV file containing the reconciliation report (default: "reconciliation_report.csv").

3.2 Optional arguments:

  • -c or --comparison-columns: List of additional columns to compare for discrepancies (default: all columns except "ID").
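The default for -c can be easier to follow with a small sketch. The helper below is hypothetical (its name, its parameters, and the validation step are assumptions, not taken from the tool's source); it only illustrates the documented behaviour of falling back to every column except "ID" when no list is given.

```python
def resolve_comparison_columns(header, option_value=None, id_column="ID"):
    """Return the list of columns to compare for discrepancies.

    Hypothetical helper: the real tool may parse -c differently.
    """
    if not option_value:
        # No -c given: fall back to every column except the identifier.
        return [name for name in header if name != id_column]
    # Split "column1,column2" and drop stray whitespace or empty items.
    requested = [name.strip() for name in option_value.split(",") if name.strip()]
    unknown = [name for name in requested if name not in header]
    if unknown:
        raise ValueError(f"Unknown comparison columns: {', '.join(unknown)}")
    return requested


header = ["ID", "column1", "column2", "column3"]
print(resolve_comparison_columns(header))
print(resolve_comparison_columns(header, "column1, column2"))
```

Rejecting unknown column names early is a design choice of this sketch; whether the tool itself validates the list is not stated here.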

3.3 Example usage:

    reconcile -s data/source.csv -t data/target.csv -o report.csv -c column1,column2

This command compares the source file data/source.csv with the target file data/target.csv, writes the reconciliation report to report.csv, and checks columns column1 and column2 for discrepancies, in addition to the default "ID" column.
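To make the comparison concrete, here is a minimal sketch of the kind of logic such a run performs. It is not the tool's implementation: it uses the standard-library csv module instead of pandas, the function names load_by_id and reconcile are hypothetical, the sample data is invented, and the exact "Type" labels are assumptions.

```python
import csv
import io


def load_by_id(csv_text, id_column="ID"):
    """Index the rows of a CSV document by their identifier column."""
    return {row[id_column]: row for row in csv.DictReader(io.StringIO(csv_text))}


def reconcile(source, target, columns):
    """Return report rows describing how the target differs from the source."""
    report = []
    # Records present in only one of the two files.
    for record_id in sorted(source.keys() - target.keys()):
        report.append(("Missing in Target", record_id, "", "", ""))
    for record_id in sorted(target.keys() - source.keys()):
        report.append(("Missing in Source", record_id, "", "", ""))
    # Records present in both files: compare the selected columns.
    for record_id in sorted(source.keys() & target.keys()):
        for column in columns:
            source_value = source[record_id][column]
            target_value = target[record_id][column]
            if source_value != target_value:
                report.append(
                    ("Field Discrepancy", record_id, column, source_value, target_value)
                )
    return report


# Invented sample data standing in for data/source.csv and data/target.csv.
source = load_by_id("ID,column1,column2\n1,a,x\n2,b,y\n3,c,z\n")
target = load_by_id("ID,column1,column2\n1,a,x\n2,B,y\n4,d,w\n")
for row in reconcile(source, target, ["column1", "column2"]):
    print(row)
```

In this sample, record 3 exists only in the source, record 4 exists only in the target, and record 2 differs in column1.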

4. Output Report

The generated report will be a CSV file with the following columns:

  • Type: Indicates the type of discrepancy (missing in source, missing in target, field discrepancy).
  • Record Identifier: ID of the record where the discrepancy was found.
  • Field: (optional) Specific field where the discrepancy was found (applicable for field discrepancies).
  • Source Value: Value of the field in the source file.
  • Target Value: Value of the field in the target file.
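Once a report exists, it can be inspected with a few lines of Python. The sketch below counts rows per discrepancy type. The header names follow the column list above, but the sample rows and the exact "Type" strings are assumptions made for illustration, and summarize_report is a hypothetical helper name.

```python
import csv
import io
from collections import Counter


def summarize_report(report_file):
    """Count report rows per value of the "Type" column."""
    reader = csv.DictReader(report_file)
    return Counter(row["Type"] for row in reader)


# Invented sample report; a real run would pass open("report.csv", newline="").
sample_report = io.StringIO(
    "Type,Record Identifier,Field,Source Value,Target Value\n"
    "Missing in Target,3,,,\n"
    "Missing in Source,4,,,\n"
    "Field Discrepancy,2,column1,b,B\n"
    "Field Discrepancy,5,column2,y,Y\n"
)
for discrepancy_type, count in summarize_report(sample_report).most_common():
    print(f"{discrepancy_type}: {count}")
```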

