Skip to main content

Automate Exploratory Data Analysis

Project description

Plotly Dash EDA

Automate Exploratory Data Analysis
Exploratory Data Analysis (EDA) is used to explore different aspects of the data we are working on.
EDA should be performed in order to find the patterns, visual insights, etc. that the data set is having, before creating a model or 
predicting something through the dataset.
EDA is a general approach of identifying characteristics of the data we are working on by visualizing the dataset.
Analyzing a dataset is a hectic task and takes a lot of time,
according to a study EDA takes around 30% effort of the project but it cannot be eliminated.
in thist project cteate  certain open-source modules that can automate the whole process of EDA and save a lot of time.

Loading the data into the pandas data frame is certainly one of the most important steps in EDA. Note:-Although categorical data is qualitative, it may sometimes take numerical values. in that case graphical presentation (eg.-bar chart, scatter chart) variable not clearly identify. In ordinal data order of data variable clearly name with python object dtype. In nominal data level of data variable clearly name with python object dtype.

User Installation :

If you already have a working installation of numpy and pandas, plolty the easiest way to install PDEDA is using pip

pip install PDEDA

This Package Depend On Other Packages:

#Importing the required libraries for EDA:
pandas
scipy
plotly
numpy
sklearn
jupyter_dash
dash
dash_table
statsmodels
dash_core_components
dash_html_components
dash_bootstrap_components 
plotly
base64
io

Usage

plotly dash EDA

from PDEDA import PD_EDA 
import pandas as pd


df = pd.read_csv('data.csv')


app=PD_EDA(data=df)
app.plotly_dash_eda()

# By default, Dash app run on jupyter
from PDEDA import PD_EDA 
import pandas as pd


df = pd.read_csv('data.csv')

app=PD_EDA(data=df,display='localhost')
app.plotly_dash_eda()

# click on below link  Dash app run on localhost
# restart notebook for reuse application.

#App Structure


Github file source second

Change Log

0.0.1 (19/05/2021)

  • First Release

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PDEDA-0.0.3.tar.gz (12.8 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page