
A simple package for getting a statistical summary of a pandas DataFrame


Get Started

Source code: GitHub Repository

Installation

Install directly with pip

pip install cleansummary

Or install from source

git clone https://github.com/fonyango/cleansummary.git
cd cleansummary
pip install .

Usage

Import the library

from cleansummary import CleanSummary

Instantiate the class with a pandas DataFrame

cs = CleanSummary(df)

Get the percentage of missing data

cs.percentage_missing()
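
This README does not show what percentage_missing() returns. As a rough point of comparison, the sketch below computes a per-column missing percentage with plain pandas. The function name percentage_missing_sketch and the sample data are illustrative assumptions and are not part of cleansummary.

```python
import pandas as pd


def percentage_missing_sketch(df: pd.DataFrame) -> pd.Series:
    """Return the percentage of missing values in each column.

    Hypothetical helper written with plain pandas; cleansummary's own
    output format may differ.
    """
    # isna() marks missing cells as True; the column mean of those booleans
    # is the fraction missing, which is scaled to a percentage.
    return df.isna().mean() * 100


# Illustrative data only.
sample = pd.DataFrame(
    {
        "age": [25, 32, None, 41],
        "city": ["Nairobi", None, None, "Kisumu"],
        "income": [50000.0, 62000.0, 58000.0, 71000.0],
    }
)
missing = percentage_missing_sketch(sample)
print(missing)
```

A result like this is handy for deciding which columns need imputation before a summary is computed.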

Get the plot and skewness coefficient of a variable

cs.check_skewness('variable_name')
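
For orientation, a skewness coefficient can also be computed with plain pandas. The sketch below uses Series.skew(), which returns the bias-corrected sample skewness; whether check_skewness uses the same estimator is not stated in this README, so treat that as an assumption. The helper name skewness_sketch and the sample data are hypothetical, and the plot is left out to keep the example free of plotting dependencies.

```python
import pandas as pd


def skewness_sketch(df: pd.DataFrame, column: str) -> float:
    """Return the sample skewness of one numeric column.

    Hypothetical helper; not cleansummary's implementation.
    """
    # Missing values are dropped explicitly so the intent is visible,
    # although Series.skew() also skips them by default.
    return float(df[column].dropna().skew())


# Illustrative data: one symmetric column and one with a long right tail.
sample = pd.DataFrame(
    {
        "symmetric": [1, 2, 3, 4, 5],
        "right_tailed": [1, 1, 1, 2, 10],
    }
)
symmetric_skew = skewness_sketch(sample, "symmetric")
right_tailed_skew = skewness_sketch(sample, "right_tailed")
print(symmetric_skew, right_tailed_skew)
```

A value near zero suggests a roughly symmetric distribution, a positive value a longer right tail, and a negative value a longer left tail.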

Get a statistical summary, optionally restricted to categorical or numerical variables

cs.get_statistical_summary(variableType=None)

cs.get_statistical_summary(variableType='categorical')

cs.get_statistical_summary(variableType='numerical')
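
To illustrate how a summary can be split by variable type, here is a minimal sketch built on pandas select_dtypes() and describe(). It is not cleansummary's implementation: the function name statistical_summary_sketch, the variable_type parameter, the treatment of every non-numeric column as categorical, and the behaviour for None are all assumptions made for the example.

```python
from typing import Optional

import pandas as pd


def statistical_summary_sketch(
    df: pd.DataFrame, variable_type: Optional[str] = None
) -> pd.DataFrame:
    """Summarise columns by type using plain pandas (hypothetical helper)."""
    if variable_type == "numerical":
        # count, mean, std, min, quartiles and max for numeric columns.
        return df.select_dtypes(include="number").describe()
    if variable_type == "categorical":
        # Assumption: every non-numeric column is treated as categorical.
        # describe() then reports count, unique, top and freq.
        return df.select_dtypes(exclude="number").describe()
    if variable_type is None:
        # Assumption: None means "summarise every column".
        return df.describe(include="all")
    raise ValueError("variable_type must be None, 'categorical' or 'numerical'")


# Illustrative data only.
sample = pd.DataFrame(
    {
        "age": [25, 32, 41],
        "city": ["Nairobi", "Kisumu", "Nairobi"],
    }
)
numerical = statistical_summary_sketch(sample, "numerical")
categorical = statistical_summary_sketch(sample, "categorical")
everything = statistical_summary_sketch(sample)
print(numerical)
print(categorical)
```

Keeping the two kinds of summary separate avoids the mostly empty cells that appear when numeric and non-numeric statistics share one table.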

Change Log

0.0.4 (15/09/2023)
