Skip to main content

Utility package for comparing polars dataframes.

Project description


diffly โ€” A utility package for comparing ๐Ÿปโ€โ„๏ธ DataFrames

CI conda-forge pypi-version python-version codecov

๐Ÿ—‚ Table of Contents

๐Ÿ“– Introduction

Diffly is a Python package for comparing Polars DataFrames with detailed analysis capabilities. It identifies differences between datasets including schema differences, row-level mismatches, missing rows, and column value changes.

๐Ÿ’ฟ Installation

You can install diffly using your favorite package manager:

pixi add diffly
conda install diffly
uv add diffly
pip install diffly

๐ŸŽฏ Usage

import polars as pl
from diffly import compare_frames

left = pl.DataFrame({
    "id": ["a", "b", "c"],
    "value": [1.0, 2.0, 3.0],
})

right = pl.DataFrame({
    "id": ["a", "b", "d"],
    "value": [1.0, 2.5, 4.0],
})

comparison = compare_frames(left, right, primary_key="id")

if not comparison.equal():
    summary = comparison.summary(
        top_k_column_changes=1,
        show_sample_primary_key_per_change=True
    )
    print(summary)
โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”“
โ”ƒ                                     Diffly Summary                                     โ”ƒ
โ”—โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”›
   Primary key: id

 Schemas
 โ–”โ–”โ–”โ–”โ–”โ–”โ–”
   Schemas match exactly (column count: 2).

 Rows
 โ–”โ–”โ–”โ–”
   Left count             Right count
       3      (no change)      3

   โ”โ”โ”ฏโ”โ”ฏโ”โ”ฏโ”โ”ฏโ”โ”“
   โ”ƒ-โ”‚-โ”‚-โ”‚-โ”‚-โ”ƒ                1  left only   (33.33%)
   โ” โ”€โ”ผโ”€โ”ผโ”€โ”ผโ”€โ”ผโ”€โ”จโ•Œโ•Œโ•Œโ”โ”โ”ฏโ”โ”ฏโ”โ”ฏโ”โ”ฏโ”โ”“โ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•ฎ
   โ”ƒ โ”‚ โ”‚ โ”‚ โ”‚ โ”ƒ = โ”ƒ โ”‚ โ”‚ โ”‚ โ”‚ โ”ƒ  1  equal       (50.00%)  โ”‚
   โ” โ”€โ”ผโ”€โ”ผโ”€โ”ผโ”€โ”ผโ”€โ”จโ•Œโ•Œโ•Œโ” โ”€โ”ผโ”€โ”ผโ”€โ”ผโ”€โ”ผโ”€โ”จโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ”œโ•ด  2  joined
   โ”ƒ โ”‚ โ”‚ โ”‚ โ”‚ โ”ƒ โ‰  โ”ƒ โ”‚ โ”‚ โ”‚ โ”‚ โ”ƒ  1  unequal     (50.00%)  โ”‚
   โ”—โ”โ”ทโ”โ”ทโ”โ”ทโ”โ”ทโ”โ”›โ•Œโ•Œโ•Œโ” โ”€โ”ผโ”€โ”ผโ”€โ”ผโ”€โ”ผโ”€โ”จโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•Œโ•ฏ
                 โ”ƒ+โ”‚+โ”‚+โ”‚+โ”‚+โ”ƒ  1  right only  (33.33%)
                 โ”—โ”โ”ทโ”โ”ทโ”โ”ทโ”โ”ทโ”โ”›

 Columns
 โ–”โ–”โ–”โ–”โ–”โ–”โ–”
   โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
   โ”‚ value โ”‚ 50.00% โ”‚ 2.0 -> 2.5 (1x, e.g. "b") โ”‚
   โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

See more examples in the documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffly-1.0.2.tar.gz (225.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

diffly-1.0.2-py3-none-any.whl (26.1 kB view details)

Uploaded Python 3

File details

Details for the file diffly-1.0.2.tar.gz.

File metadata

  • Download URL: diffly-1.0.2.tar.gz
  • Upload date:
  • Size: 225.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for diffly-1.0.2.tar.gz
Algorithm Hash digest
SHA256 d50817c974bbe03734a6a35a040a9406f81d35ac7c57f9cb7a57f254243fb6a8
MD5 5981dfb979f98252d28af197ddca90aa
BLAKE2b-256 0f584447dbb272868ca7d45609b16f55e675d43977659689cc08afcca232c7fa

See more details on using hashes here.

Provenance

The following attestation bundles were made for diffly-1.0.2.tar.gz:

Publisher: build.yml on Quantco/diffly

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file diffly-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: diffly-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 26.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for diffly-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 9baea7a128b7d4b0669f8b25fb258a0ef21364352896b17685760f6371b9c0e6
MD5 d1b67b75d8c32daa4572b20e72e2c864
BLAKE2b-256 1215241e383ff4298e81ec92488163f796a34aeaad71d84581b980b510a3a917

See more details on using hashes here.

Provenance

The following attestation bundles were made for diffly-1.0.2-py3-none-any.whl:

Publisher: build.yml on Quantco/diffly

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page