A tool to find the differences between two tables.
Project description
polars_compare
A tool to compare and find the differences between two Polars DataFrames.
To DO:
- Linting (Ruff)
- [] strict MyPy type checking
- Make into python package
- Add makefile for easy linting and tests
- Statistics should indicate which statistics are referencing columns
- Add all statistics frame to tests
- Add schema differences to schema summary
- Make row examples alternate between base only and compare only so that it is more readable.
- Add limit value to the examples.
- Updated value differences summary so that Statistic is something that makes sense.
- Raise error and print examples if duplicates are present.
- Add difference criterion.
- [] Add a count of the number of rows that have any differences to the value differences summary.
- [] Add total number of value differences to the value differences summary.
- [] Add parameter to hide column differences with 0 differences.
- [] Update report so that non differences are not displayed.
- [] Add table name labels that can replace 'base' and 'compare'.
- [] Change id_columns to be named 'join_on' and add a test that checks that abritrary join conditions work.
- [] Update code to use a config dataclass that can be passed between the class and functions.
- Add license
- Test for large amounts of data
- Benchmark for different sizes of data.
- Write up docstrings
- Write up readme (with code examples)
- Publish package to pypi
- Github actions
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pl_compare-0.1.0.tar.gz
(7.4 kB
view details)
Built Distribution
File details
Details for the file pl_compare-0.1.0.tar.gz
.
File metadata
- Download URL: pl_compare-0.1.0.tar.gz
- Upload date:
- Size: 7.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.2 CPython/3.10.12 Linux/6.1.47
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3c8381e3c3a1879ae24cf7152af05de6bc00c8fe4997debb9cb54e5c05158cd8 |
|
MD5 | 579cd5199bcc0efc2e5eabbd5b7576dd |
|
BLAKE2b-256 | 70b9e3ca103f2cfa8d963a2e10ba61ba2850d08d91de6d5ac37befbe311d1a51 |
File details
Details for the file pl_compare-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: pl_compare-0.1.0-py3-none-any.whl
- Upload date:
- Size: 8.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.2 CPython/3.10.12 Linux/6.1.47
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 85ed683bd813ae073ec52ea02c26f4aeb863e04357ad7a273562a1ea01d14e27 |
|
MD5 | 191514b0ccf835be1f3452340f1ae7c4 |
|
BLAKE2b-256 | 6e41242433dff788479d3ddbac6d2a937e7b8edb154ce29918da70fb0732baee |