Skip to main content

Exploratory Data Analysis

Project description

exploratory

Exploratory Data Analysis

Description

This project explortory was created to perform Exploratory Data Analysis on any structured dataset. Dataset can have categorical or numerical data types. This project takes pandas dataframe and gives summary statistics and individual plots having categorical count for catagorical variables and PDF's, CDF's with mean, median and mode for numerical variables. The both the results are stored in PDF and CSV file in your current directory/path.

Installation:

Use the package manager pip to install exploratory

pip install exploratory

Usage:

from exploratory import EDA
EDA(df)
# df --> pandas dataframe
#Please input the DPI value, as DPI value increases runtime would increase. Defualt DPI value: 150

Expected Outputs:

  • CSV File, DataFrame Containing
Column Description
Variable Variable Name in the dataset provided
Cardinality Number of levels/classes in each variable
total_count Count of total records (non null)
unique_rate Cardinality / total_count, Unique Rate of 1 indicates a ID variable
percent_missing Percentage of missing values across each column
mean Average of column (Ignores Object/String variables)
std Standard deviation of column (Ignores Object/String variables)
min Minimum of column (Ignores Object/String variables)
25% 25th percentile value of column (Ignores Object/String variables)
median 50th percentile value of column (Ignores Object/String variables)
75% 75th percentile value of column (Ignores Object/String variables)
max Maximum of column (Ignores Object/String variables)
data_types Data type of column (Int / Float / Object etc)
range Max Value - Min Value (Ignores Object/String variables)
  • PDF with Statistical Summary and variable distribution graphs (categorical & continous)

Contributing

Pull requests are welcome. Please use this 'https://github.com/Abhilash-MS/exploratory' Please feel free to contact authors for any suggestions or issues, Ram kakarlaramcharan@gmail.com, Abhilash abhilashmaspalli1996@gmail.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

exploratory-3.4.10.tar.gz (6.9 kB view details)

Uploaded Source

File details

Details for the file exploratory-3.4.10.tar.gz.

File metadata

  • Download URL: exploratory-3.4.10.tar.gz
  • Upload date:
  • Size: 6.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.12

File hashes

Hashes for exploratory-3.4.10.tar.gz
Algorithm Hash digest
SHA256 2cae4e54cf4f0a160cff12f676abdc0db6d7d7a6c8f00c0ed42e265ce3e33d33
MD5 59c6799e925be62272e3e9d54c848a68
BLAKE2b-256 e5d50ba9066a0660a6948cecd397ad987b8d3d960180b6a32a310e20b276f71e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page