Generate a quick EDA report of a pandas DataFrame
Project description
Easy Report
Generates EDA reports from a pandas DataFrame.
Features of the package
The pandas df.describe() function is great but a little basic, for serious exploratory data analysis for any classification or regression problem EasyReport helps the user for quick data analysis.
For each column the following statistics (if relevant for the column type) are presented:
- Type inference: detect the types of columns in a dataframe
- Essentials: type, unique values, missing values, outliers
- Quantile statistics: minimum value, Q1, median, Q3, maximum, interquartile range, upper, lower
- Descriptive statistics: mean, variance, standard deviation, coefficient of variation, kurtosis, skewness
- Graphs: Histogram, boxplot, countplot, pie, scatterplot
- Correlations: correlation matrix, heatmap
- Duplicates: total number of duplicate row and number of times they are repated
- Missing value: number of missing values, dendrogram of missing values
Installation
You can install using the pip package manager by running
pip install EasyReport
Examples
For Classification Problem
import pandas as pd
from EasyReport.easy_report import EdaReport
#Read the dataset
df = pd.read_csv("Dataset")
report = EdaReport(data = df,target_column = 'target_column_name')
report.summary()
For Regression Problem
import pandas as pd
from EasyReport.easy_report import EdaReport
#Read the dataset
df = pd.read_csv("Dataset")
report = EdaReport(data = df,target_column = 'target_column_name',regression = True)
report.summary()
Author
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
EasyReport-0.0.7.tar.gz
(5.2 kB
view hashes)
Built Distribution
Close
Hashes for EasyReport-0.0.7-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5272e30269abd8905e0e870971b6aca3cde5414826618223e9d80cb40a9af468 |
|
MD5 | 226b2b0118cdea208413c3dd464d48a7 |
|
BLAKE2b-256 | 59a6bb25f6c4a8986819b1d269d3e42f36caca426a3f1af4c387869241c3bb69 |