Skip to main content

A tool to find the differences between two tables.

Project description

polars_compare

A tool to compare and find the differences between two Polars DataFrames.

To DO:

  • Linting (Ruff)
  • [] strict MyPy type checking
  • Make into python package
  • Add makefile for easy linting and tests
  • Statistics should indicate which statistics are referencing columns
  • Add all statistics frame to tests
  • Add schema differences to schema summary
  • Make row examples alternate between base only and compare only so that it is more readable.
  • Add limit value to the examples.
  • Updated value differences summary so that Statistic is something that makes sense.
  • Raise error and print examples if duplicates are present.
  • Add difference criterion.
  • [] Add a count of the number of rows that have any differences to the value differences summary.
  • [] Add total number of value differences to the value differences summary.
  • [] Add parameter to hide column differences with 0 differences.
  • [] Update report so that non differences are not displayed.
  • [] Add table name labels that can replace 'base' and 'compare'.
  • [] Change id_columns to be named 'join_on' and add a test that checks that abritrary join conditions work.
  • [] Update code to use a config dataclass that can be passed between the class and functions.
  • Add license
  • Test for large amounts of data
  • Benchmark for different sizes of data.
  • Write up docstrings
  • Write up readme (with code examples)
  • Publish package to pypi
  • Github actions

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pl_compare-0.1.0.tar.gz (7.4 kB view details)

Uploaded Source

Built Distribution

pl_compare-0.1.0-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file pl_compare-0.1.0.tar.gz.

File metadata

  • Download URL: pl_compare-0.1.0.tar.gz
  • Upload date:
  • Size: 7.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.10.12 Linux/6.1.47

File hashes

Hashes for pl_compare-0.1.0.tar.gz
Algorithm Hash digest
SHA256 3c8381e3c3a1879ae24cf7152af05de6bc00c8fe4997debb9cb54e5c05158cd8
MD5 579cd5199bcc0efc2e5eabbd5b7576dd
BLAKE2b-256 70b9e3ca103f2cfa8d963a2e10ba61ba2850d08d91de6d5ac37befbe311d1a51

See more details on using hashes here.

File details

Details for the file pl_compare-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: pl_compare-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 8.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.10.12 Linux/6.1.47

File hashes

Hashes for pl_compare-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 85ed683bd813ae073ec52ea02c26f4aeb863e04357ad7a273562a1ea01d14e27
MD5 191514b0ccf835be1f3452340f1ae7c4
BLAKE2b-256 6e41242433dff788479d3ddbac6d2a937e7b8edb154ce29918da70fb0732baee

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page