A simple package for getting a statistical summary of a pandas DataFrame
Get Started
Source code: https://github.com/fonyango/cleansummary
Installation
Install from PyPI with pip
pip install cleansummary
Or install from source
git clone https://github.com/fonyango/cleansummary.git
cd cleansummary
pip install .
Usage
Import the library
from cleansummary import CleanSummary
Create a CleanSummary instance from a pandas DataFrame
cs = CleanSummary(df)
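The df passed above can be any pandas DataFrame. A minimal sketch for building one to try the calls in this section follows; the column names and values are hypothetical and chosen only so that the data contains missing values, a skewed numeric column, and a categorical column.

```python
import pandas as pd

# Hypothetical sample data; column names and values are illustrative only.
df = pd.DataFrame(
    {
        "age": [23, 35, float("nan"), 41, 29],              # one missing value
        "income": [32000.0, 58000.0, 61000.0, 45000.0, 250000.0],  # right-skewed
        "city": ["Nairobi", "Kisumu", None, "Nairobi", "Mombasa"],  # categorical
    }
)

# Inspect the inferred column types before passing df to CleanSummary.
print(df.dtypes)
```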
Get the percentage of missing data
cs.percentage_missing()
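The return format of percentage_missing() is not described here. As a point of comparison, and assuming it reports the share of missing values per column, the same quantity can be computed in plain pandas. This is a sketch of the idea, not the library's implementation, and the sample data is hypothetical.

```python
import pandas as pd

# Hypothetical sample data with some missing entries.
df = pd.DataFrame(
    {
        "age": [23, 35, float("nan"), 41, 29],
        "city": ["Nairobi", "Kisumu", None, "Nairobi", "Mombasa"],
        "income": [32000.0, 58000.0, 61000.0, 45000.0, 250000.0],
    }
)

# isna() marks missing cells as True; the column mean of those booleans
# is the fraction missing, and multiplying by 100 gives a percentage.
missing_pct = df.isna().mean() * 100

print(missing_pct)
```

Comparing this output with the result of cs.percentage_missing() on the same DataFrame is a quick way to confirm how the library defines the figure.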
Plot a variable's distribution and get its skewness coefficient
cs.check_skewness('variable_name')
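For orientation on what a skewness coefficient indicates, here is a minimal plain-pandas sketch. It uses Series.skew(), which returns the bias-adjusted sample skewness; whether check_skewness uses the same estimator is an assumption, and the data is hypothetical. The plot is omitted here.

```python
import pandas as pd

# Hypothetical data: one large value pulls the right tail out.
income = pd.Series([32000.0, 58000.0, 61000.0, 45000.0, 250000.0])

# Evenly spaced values are symmetric around their mean.
symmetric = pd.Series([1.0, 2.0, 3.0, 4.0, 5.0])

income_skew = income.skew()        # positive: long right tail
symmetric_skew = symmetric.skew()  # approximately zero: symmetric

print(income_skew, symmetric_skew)
```

A positive coefficient points to a long right tail, a negative one to a long left tail, and a value near zero to a roughly symmetric distribution.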
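Get a statistical summary, optionally restricted by variable type (variableType can be None, 'categorical', or 'numerical')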
cs.get_statistical_summary(variableType=None)
cs.get_statistical_summary(variableType='categorical')
cs.get_statistical_summary(variableType='numerical')
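The exact statistics returned for each variableType are not listed here. As a rough plain-pandas analogue, assuming 'numerical' covers numeric columns and 'categorical' covers the rest, the split can be sketched with select_dtypes and describe. The data is hypothetical and this is not the library's implementation.

```python
import pandas as pd

# Hypothetical sample data with numeric and non-numeric columns.
df = pd.DataFrame(
    {
        "age": [23, 35, float("nan"), 41, 29],
        "income": [32000.0, 58000.0, 61000.0, 45000.0, 250000.0],
        "city": ["Nairobi", "Kisumu", None, "Nairobi", "Mombasa"],
    }
)

# Numeric columns: count, mean, std, min, quartiles, max.
numeric_summary = df.select_dtypes(include="number").describe()

# Non-numeric columns: count, unique, top (most frequent value), freq.
categorical_summary = df.select_dtypes(exclude="number").describe()

print(numeric_summary)
print(categorical_summary)
```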
Change Log
0.0.4 (15/09/2023)