Skip to main content

Dataframe comparisons in Python

Project description

DataComPy

PyPI - Python Version Ruff PyPI version Anaconda-Server Badge PyPI - Downloads

DataComPy is a package to compare two DataFrames (or tables) such as Pandas, Spark, Polars, and even Snowflake. Originally it was created to be something of a replacement for SAS's PROC COMPARE for Pandas DataFrames with some more functionality than just Pandas.DataFrame.equals(Pandas.DataFrame) (in that it prints out some stats, and lets you tweak how accurate matches have to be). Supported types include:

  • Pandas
  • Polars
  • Spark
  • Snowflake

[!IMPORTANT] datacompy is progressing towards a v1 release. During this transition, a support/0.19.x branch will be maintained solely for v0.19.x users. This branch will only receive dependency updates and critical bug fixes; no new features will be added. All new feature development should target the v1 branches (develop and eventually main).

Quick Installation

pip install datacompy

or

conda install datacompy

Installing extras

If you would like to use Spark or any other backends please make sure you install via extras:

pip install datacompy[spark]
pip install datacompy[snowflake]

Supported backends

Contributors

We welcome and appreciate your contributions! Before we can accept any contributions, we ask that you please be sure to sign the Contributor License Agreement (CLA).

This project adheres to the Open Source Code of Conduct. By participating, you are expected to honor this code.

Roadmap

Roadmap details can be found here

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datacompy-1.0.0b3.tar.gz (114.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datacompy-1.0.0b3-py3-none-any.whl (70.9 kB view details)

Uploaded Python 3

File details

Details for the file datacompy-1.0.0b3.tar.gz.

File metadata

  • Download URL: datacompy-1.0.0b3.tar.gz
  • Upload date:
  • Size: 114.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for datacompy-1.0.0b3.tar.gz
Algorithm Hash digest
SHA256 329b3d8b338cf35889b81cc54c142161a8b6f8f82f0ac83f53822a7b1b5dec9c
MD5 f8125ab998d5b858109148ca4ce92fea
BLAKE2b-256 81e59ba3c968fad807a21ea050ac7f7b8e80a38c5ef75352bacd94371a88df97

See more details on using hashes here.

Provenance

The following attestation bundles were made for datacompy-1.0.0b3.tar.gz:

Publisher: publish-package.yml on capitalone/datacompy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file datacompy-1.0.0b3-py3-none-any.whl.

File metadata

  • Download URL: datacompy-1.0.0b3-py3-none-any.whl
  • Upload date:
  • Size: 70.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for datacompy-1.0.0b3-py3-none-any.whl
Algorithm Hash digest
SHA256 475e18cdab1a3dcc8ee7a1f2c7c15c55635d135f7f0f55baeddcc771fd3c4247
MD5 b09d6583bd35e5e9459287a9509c6a86
BLAKE2b-256 f23253691e76289b6e50ea7afc7b61910cc149b1e3aa18fd0ef041b7ef3b9ff6

See more details on using hashes here.

Provenance

The following attestation bundles were made for datacompy-1.0.0b3-py3-none-any.whl:

Publisher: publish-package.yml on capitalone/datacompy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page