A tool to find the differences between two tables.
polars_compare
A tool to compare and find the differences between two Polars DataFrames.
To do:
- Linting (Ruff)
- Make into Python package
- Add Makefile for easy linting and tests
- Statistics should indicate which statistics are referencing columns
- Add all statistics frame to tests
- Add schema differences to schema summary
- Make row examples alternate between base only and compare only so that it is more readable.
- Add limit value to the examples.
- Update value differences summary so that Statistic is something that makes sense.
- Publish package to PyPI
- Add difference criterion.
- Add license
- [] Make package easy to use (i.e. so you only have to import pl_compare and then you can use pl_compare)
- [] Raise error and print examples if duplicates are present.
- [] Add a count of the number of rows that have any differences to the value differences summary.
- [] Add total number of value differences to the value differences summary.
- [] Add parameter to hide column differences with 0 differences.
- [] Update report so that non differences are not displayed.
- [] Add table name labels that can replace 'base' and 'compare'.
- [] Rename id_columns to 'join_on' and add a test that checks that arbitrary join conditions work.
- [] Update code to use a config dataclass that can be passed between the class and functions.
- [] Test for large amounts of data
- [] Benchmark for different sizes of data.
- [] Write up docstrings
- [] Write up readme (with code examples)
- [] Strict mypy type checking
- [] GitHub Actions for testing
- [] GitHub Actions for linting
- [] GitHub Actions for publishing