A package that performs Exploratory Data Analysis
Project description
Python Package that performs Exploratory Data Analysis
exploredata is built on Pandas and Numpy. It gives an overview of a csv data file.
- It imports csv file and returns the summary of the data in the file using the Pandas Dataframe.info() function.
- It contains method (null_cat and null_num) that returns the number of missing categorical and numerical features respectively
- It also has method that returns the number of numerical and categorical features as well the number of features containing outliers
- It also contains method (num_viz and cat_viz) that plot histogram and boxplot for each numerical variable as well as the count plot for each categorical variables.
Installing
pip install exploredata
Usage
-
import exploredata as ep #imports the exploredata package
-
new_variable = ep.EDA(file path) #calls the EDA module in the package. The module import your csv file and assign it as a pandas dataframe to the variable 'train'. It also returns the summarry of the data
-
new_variable.df #prints the pandas dataframe
-
new_variable.num_var() #returns a tuple containing the number and the list of numerical variables in the dataframe
-
train.null_var() #returns a tuple containing the number and the list of numerical variables containing null values
-
help(ep.EDA) #provide information on all methods defined in the EDA module
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for exploredata-0.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d87230413ef096641dcd2ba7cc07dafd622b15c4f9786968563e586780b4d2b6 |
|
MD5 | 43164b3d4a3006e2e0ba68b3d2cf735b |
|
BLAKE2b-256 | 0c21f843a4c766b1a3395a8f2a52b687dc6e8c4423a1426fa78cf5976c000a37 |