A simple package for getting a statistical summary of a pandas DataFrame
Get Started
Source code: https://github.com/fonyango/cleansummary
Installation
Use pip directly
pip install cleansummary
Or install from source
git clone https://github.com/fonyango/cleansummary.git
cd cleansummary
pip install .
Usage
Import the library
from cleansummary import CleanSummary
Instantiate the class with a DataFrame
cs = CleanSummary(df)
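The `df` passed to `CleanSummary` can be any pandas DataFrame. For trying out the calls in this section, a small frame with numerical and categorical columns and a few missing values is enough. The column names and values below are hypothetical and chosen only for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data: column names and values are for illustration only.
df = pd.DataFrame(
    {
        "age": [23, 35, np.nan, 41, 29],
        "income": [32000.0, 58000.0, 61000.0, np.nan, np.nan],
        "city": ["Nairobi", "Kisumu", None, "Nairobi", "Mombasa"],
    }
)

# Two numerical columns and one text column, each with at least one missing value
print(df.dtypes)
print(df.isna().sum())
```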
Get proportion of missing data
cs.percentage_missing()
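To sanity-check the result, the share of missing values per column can also be computed with plain pandas. This is a sketch of the underlying calculation, not the package's implementation; the exact return format of `percentage_missing()` is not documented here, and the sample data is hypothetical.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data with missing values
df = pd.DataFrame(
    {
        "age": [23, 35, np.nan, 41, 29],
        "income": [32000.0, 58000.0, 61000.0, np.nan, np.nan],
    }
)

# isna() flags missing cells; the column-wise mean of those flags is the
# missing fraction, which is scaled to a percentage
missing_pct = df.isna().mean() * 100
print(missing_pct)
```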
Get the plot and skewness coefficient of a variable
cs.check_skewness('variable_name')
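For reference, pandas exposes a sample skewness coefficient through `Series.skew()`. Whether `check_skewness` uses this estimator is an assumption, so treat the sketch below as a way to interpret the sign of the coefficient rather than as the package's method. The plot is omitted and the series are hypothetical.

```python
import pandas as pd

# Hypothetical series: one symmetric, one with a long right tail
symmetric = pd.Series([1, 2, 3, 4, 5])
right_skewed = pd.Series([1, 1, 2, 2, 3, 20])

# A coefficient near zero suggests symmetry; a positive value suggests a right tail
symmetric_coef = symmetric.skew()
right_coef = right_skewed.skew()

print(symmetric_coef)
print(right_coef)
```

A negative coefficient would indicate a long left tail instead.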
Get statistical summary
cs.get_statistical_summary(variableType=None)
cs.get_statistical_summary(variableType='categorical')
cs.get_statistical_summary(variableType='numerical')
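The split between numerical and categorical summaries can be approximated in plain pandas by selecting columns by dtype before calling `describe()`. This is only a rough equivalent under the assumption that `variableType` filters columns by type; the statistics the package actually reports may differ, and the sample data is hypothetical.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data with numerical and text columns
df = pd.DataFrame(
    {
        "age": [23, 35, np.nan, 41, 29],
        "income": [32000.0, 58000.0, 61000.0, np.nan, np.nan],
        "city": ["Nairobi", "Kisumu", None, "Nairobi", "Mombasa"],
    }
)

# Numerical columns: count, mean, std, min, quartiles, max
numerical_summary = df.select_dtypes(include="number").describe()

# Non-numerical columns: count, unique, top (most frequent value), freq
categorical_summary = df.select_dtypes(exclude="number").describe()

print(numerical_summary)
print(categorical_summary)
```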
Change Log
0.0.4 (15/09/2023)