The library that compares two dataframes
Compare Dataframes
Description
This module provides functionality to compare two dataframes. It uses various distance functions and provides a tabulated result for easy interpretation.
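The description does not say which distance functions the module uses. As an illustration only, here is a minimal sketch of one common string distance, the Levenshtein (edit) distance, written in plain Python. The function name `levenshtein` is hypothetical and is not taken from this library's API.

```python
# Illustrative sketch only: NOT this library's implementation.
# `levenshtein` is a hypothetical helper name chosen for this example.

def levenshtein(a: str, b: str) -> int:
    """Return the number of single-character insertions, deletions,
    or substitutions needed to turn `a` into `b`."""
    # prev[j] holds the distance between the processed prefix of `a`
    # and the first j characters of `b`.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(
                min(
                    prev[j] + 1,         # delete ca
                    curr[j - 1] + 1,     # insert cb
                    prev[j - 1] + cost,  # substitute (or match)
                )
            )
        prev = curr
    return prev[-1]


# "soccer" and "sucker" differ by two substitutions (o -> u, c -> k).
print(levenshtein("soccer", "sucker"))
```

A distance of 0 means the two values are identical; larger values mean the strings are further apart, which is the kind of signal a per-cell comparison could report.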
Example Usage
import polars as pl
from comparedf import comparedf

# First dataframe: three rows
df = pl.DataFrame(
    {
        "a": ["21-03-2022", "soccer", "cricket"],
        "b": ["21-03-2022", "soccer", "cricket"],
        "c": [1, 2, 3],
    }
)

# Second dataframe: four rows, with differences in columns "b" and "c"
df1 = pl.DataFrame(
    {
        "a": ["21-03-2022", "soccer", "cricket", "baseball"],
        "b": ["21-03-2022", "sucker", "cricket", "man"],
        "c": [4, 2, 3, 4],
    }
)

compared = comparedf.Compare(df, df1)
print(compared)  # prints the tabulated result
compared.save_report("<PATH_TO_SAVE_REPORT>")
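The format of the tabulated result is not shown here. To give a rough idea of what a column-by-column comparison of the two example dataframes involves, the following sketch counts mismatches using plain Python dictionaries. The names `first`, `second`, and `mismatch_summary` are hypothetical, and the layout is an assumption, not the report that `Compare` produces.

```python
# Illustrative sketch only: does not use or reproduce the library's output.
# `first`, `second`, and `mismatch_summary` are hypothetical names.

first = {
    "a": ["21-03-2022", "soccer", "cricket"],
    "b": ["21-03-2022", "soccer", "cricket"],
    "c": [1, 2, 3],
}
second = {
    "a": ["21-03-2022", "soccer", "cricket", "baseball"],
    "b": ["21-03-2022", "sucker", "cricket", "man"],
    "c": [4, 2, 3, 4],
}


def mismatch_summary(left, right):
    """For each shared column, return (name, rows compared,
    mismatched rows, rows without a counterpart)."""
    rows = []
    for name in sorted(left.keys() & right.keys()):
        l_vals, r_vals = left[name], right[name]
        compared = min(len(l_vals), len(r_vals))
        # zip stops at the shorter column, so only overlapping rows count
        mismatched = sum(1 for x, y in zip(l_vals, r_vals) if x != y)
        unmatched = abs(len(l_vals) - len(r_vals))
        rows.append((name, compared, mismatched, unmatched))
    return rows


summary = mismatch_summary(first, second)

print(f"{'column':<8}{'compared':>10}{'mismatched':>12}{'unmatched':>11}")
for name, compared, mismatched, unmatched in summary:
    print(f"{name:<8}{compared:>10}{mismatched:>12}{unmatched:>11}")
```

For the example data, column "a" matches on all three overlapping rows, while "b" and "c" each differ in one row; the fourth row of the second dataframe has no counterpart in the first.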
Download files
Source Distribution
compare_datasets-0.0.0.tar.gz (10.8 kB)
Built Distribution
Hashes for compare_datasets-0.0.0-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 28b72233bce3a30f0dbd28215a6fb05647efe886c655b3ddd428d23a5e612d26
MD5 | 04ab7ecf8501d29de39a9c1e7f5b5e28
BLAKE2b-256 | c5ffb59f41ab92c1979ffd32a018f57f8e3b2485378501ed242f846664aee4cc