Skip to main content

Find difference between two large csv files

Project description

![PyPI](https://img.shields.io/pypi/v/diffcsv.svg) [![Build Status](https://travis-ci.org/ugursogukpinar/diff-csv.svg?branch=master)](https://travis-ci.org/ugursogukpinar/diff-csv)

# diffcsv

It finds differences between two version of a csv file which are built with same structure.

### Requirements
---

- Python 3.x

### Setup
---

```
$ pip install diffcsv
```


### Usage
---

```
diffcsv [-h] [--primary-key PRIMARY_KEY]
[--based-on BASED_ON [BASED_ON ...]] [--delimiter DELIMITER]
old_csv new_csv

positional arguments:
old_csv Path of old csv file
new_csv Path of new csv file

optional arguments:
-h, --help show this help message and exit
--primary-key PRIMARY_KEY
Common key of two csv files
--based-on BASED_ON [BASED_ON ...]
--delimiter DELIMITER
Delimiter of csv files
```

Example:

- version-1.csv
```
id,key,value,created_at
1,key-1,value-1,2018-01-01
2,key-2,value-2,2018-02-01
```

- version-2.csv
```
id,key,value,created_at
1,key-1,value-1-altered,2018-01-01
3,key-3,value-3,2018-02-02
```


Run:

```bash
$ diffcsv /path/of/version-1.csv /path/of/version-2.csv --primary-key id --based-on key value
```

Output:
```
"id","key","value","created_at","DIFF_STATUS"
"2","key-2","value-2","2018-02-01","DELETED"
"3","key-3","value-3","2018-02-02","INSERTED"
"1","key-1","value-1-altered","2018-01-01","UPDATED"
```


### Contributers
[**Kaan ant**](https://github.com/kaanant)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffcsv-1.6.tar.gz (16.6 kB view details)

Uploaded Source

File details

Details for the file diffcsv-1.6.tar.gz.

File metadata

  • Download URL: diffcsv-1.6.tar.gz
  • Upload date:
  • Size: 16.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.18.4 setuptools/40.6.2 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for diffcsv-1.6.tar.gz
Algorithm Hash digest
SHA256 9749da2d55fe48eba7a300bbbf0389c20ed0cd6b7bdf1c07869116dcf14b141b
MD5 d3fb5906e1594c4afe2cb74a3d93c765
BLAKE2b-256 97679cd22e97c50afccbfd90695fb6c7c8756b1f3723159f39f1a8fd697fc9d4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page