Dataframe comparisons in Python
DataComPy
DataComPy is a package for comparing two DataFrames (or tables) across backends such as Pandas, Spark, Polars, and
even Snowflake. It was originally created as something of a replacement
for SAS's PROC COMPARE for Pandas DataFrames, with more functionality than
just Pandas.DataFrame.equals(Pandas.DataFrame): it prints out some stats
and lets you tweak how accurate matches have to be. Supported types include:
- Pandas
- Polars
- Spark
- Snowflake
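To make "tweak how accurate matches have to be" concrete, here is a minimal, backend-free sketch of a tolerance-based comparison on plain Python rows. It is not DataComPy's API or its implementation: the helper names `values_match` and `compare_rows`, the sample data, and the tolerance rule (`|a - b| <= abs_tol + rel_tol * |b|`) are all assumptions made for illustration.

```python
from numbers import Real


def values_match(a, b, abs_tol=0.0, rel_tol=0.0):
    """Hypothetical helper: are two cell values equal within the tolerances?"""
    # Non-numeric values (strings, None, ...) must be exactly equal.
    if not (isinstance(a, Real) and isinstance(b, Real)):
        return a == b
    # Assumed rule: the gap may not exceed abs_tol + rel_tol * |b|.
    return abs(a - b) <= abs_tol + rel_tol * abs(b)


def compare_rows(rows1, rows2, key, abs_tol=0.0, rel_tol=0.0):
    """Hypothetical helper: join two lists of dict rows on `key` and summarize."""
    left = {row[key]: row for row in rows1}
    right = {row[key]: row for row in rows2}
    mismatches = []
    # Only rows present on both sides can be compared value by value.
    for k in sorted(left.keys() & right.keys()):
        for col in sorted(left[k].keys() & right[k].keys()):
            v1, v2 = left[k][col], right[k][col]
            if not values_match(v1, v2, abs_tol, rel_tol):
                mismatches.append((k, col, v1, v2))
    return {
        "only_in_first": sorted(left.keys() - right.keys()),
        "only_in_second": sorted(right.keys() - left.keys()),
        "mismatches": mismatches,
    }


# Made-up sample data, for illustration only.
original = [
    {"acct_id": 1, "balance": 100.00, "name": "Ann"},
    {"acct_id": 2, "balance": 250.50, "name": "Bob"},
    {"acct_id": 3, "balance": 75.25, "name": "Cy"},
]
new = [
    {"acct_id": 1, "balance": 100.004, "name": "Ann"},
    {"acct_id": 2, "balance": 251.00, "name": "Bob"},
    {"acct_id": 4, "balance": 10.00, "name": "Di"},
]

strict = compare_rows(original, new, key="acct_id")
loose = compare_rows(original, new, key="acct_id", abs_tol=0.01)
print(strict)
print(loose)
```

With zero tolerance both changed balances are reported; with an absolute tolerance of 0.01 the small difference on account 1 is accepted and only account 2 remains a mismatch. Rows that exist on one side only are listed separately in both cases.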
[!IMPORTANT] datacompy is progressing towards a v1 release. During this transition, a support/0.19.x branch will be maintained solely for v0.19.x users. This branch will only receive dependency updates and critical bug fixes; no new features will be added. All new feature development should target the v1 branches (develop and eventually main).
Quick Installation
pip install datacompy
or
conda install datacompy
Installing extras
If you would like to use Spark or any other backend, make sure you install it via extras:
pip install datacompy[spark]
pip install datacompy[snowflake]
Supported backends
- Pandas: (See documentation)
- Spark: (See documentation)
- Polars: (See documentation)
- Snowflake/Snowpark: (See documentation)
Contributors
We welcome and appreciate your contributions! Before we can accept any contributions, we ask that you please be sure to sign the Contributor License Agreement (CLA).
This project adheres to the Open Source Code of Conduct. By participating, you are expected to honor this code.
Roadmap
Roadmap details can be found here
Download files
File details
Details for the file datacompy-1.0.0b3.tar.gz.
File metadata
- Download URL: datacompy-1.0.0b3.tar.gz
- Upload date:
- Size: 114.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 329b3d8b338cf35889b81cc54c142161a8b6f8f82f0ac83f53822a7b1b5dec9c |
| MD5 | f8125ab998d5b858109148ca4ce92fea |
| BLAKE2b-256 | 81e59ba3c968fad807a21ea050ac7f7b8e80a38c5ef75352bacd94371a88df97 |
Provenance
The following attestation bundles were made for datacompy-1.0.0b3.tar.gz:
Publisher: publish-package.yml on capitalone/datacompy
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: datacompy-1.0.0b3.tar.gz
- Subject digest: 329b3d8b338cf35889b81cc54c142161a8b6f8f82f0ac83f53822a7b1b5dec9c
- Sigstore transparency entry: 1510587631
- Sigstore integration time:
- Permalink: capitalone/datacompy@6217ef19dba67a72004394fbf65e91e469f1ba10
- Branch / Tag: refs/heads/develop
- Owner: https://github.com/capitalone
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-package.yml@6217ef19dba67a72004394fbf65e91e469f1ba10
- Trigger Event: workflow_dispatch
File details
Details for the file datacompy-1.0.0b3-py3-none-any.whl.
File metadata
- Download URL: datacompy-1.0.0b3-py3-none-any.whl
- Upload date:
- Size: 70.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 475e18cdab1a3dcc8ee7a1f2c7c15c55635d135f7f0f55baeddcc771fd3c4247 |
| MD5 | b09d6583bd35e5e9459287a9509c6a86 |
| BLAKE2b-256 | f23253691e76289b6e50ea7afc7b61910cc149b1e3aa18fd0ef041b7ef3b9ff6 |
Provenance
The following attestation bundles were made for datacompy-1.0.0b3-py3-none-any.whl:
Publisher: publish-package.yml on capitalone/datacompy
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: datacompy-1.0.0b3-py3-none-any.whl
- Subject digest: 475e18cdab1a3dcc8ee7a1f2c7c15c55635d135f7f0f55baeddcc771fd3c4247
- Sigstore transparency entry: 1510587714
- Sigstore integration time:
- Permalink: capitalone/datacompy@6217ef19dba67a72004394fbf65e91e469f1ba10
- Branch / Tag: refs/heads/develop
- Owner: https://github.com/capitalone
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-package.yml@6217ef19dba67a72004394fbf65e91e469f1ba10
- Trigger Event: workflow_dispatch