Skip to main content

This module brings different functions to make EDA, data cleaning easier.

Project description

Data Inspector

Author MIT Contributions welcome Stars

Data Inspector is an open-source python library that brings 15 types of different functions to make EDA, data cleaning easier.

Author: Kazi Amit Hasan

Project Description:

Data Inspector brings a total of 15 essential exploratory data analysis, data cleaning automations to make a dataset understandable. This is a perfect tool to get started with you data.

Downloads

Installation:

pip install data-inspector

Package available at https://pypi.org/project/data-inspector/

Available automation:

  1. Line plot : line_plot(data, x_data, y_data, x_label="", y_label="", title="")
  2. Skew feature: plot_skewed_feature(data, column)
  3. Showing data distribution: show_distribution(data, column)
  4. Scatter plot: plot_scatter(data,x_data, y_data)
  5. Correlation plot: plot_correlation(data)
  6. Create histogram: histogram(data,column, x_label, y_label, title)
  7. Create bar plot: plot_bar(data, column, xlabel, ylabel, title)
  8. Create boxplots of all features: box_plot(data)
  9. Checking dataset's shape: datasetShape(data)
  10. Get dataset's diagnostic plots: diagnostic_plots(data, variable)
  11. Divide numerical and categorical features: divideFeatures(data)
  12. Fill NaN values: fillNan(data, column, value)
  13. Get pearson's correlation between two variables: get_correlation(column_1, column_2, data)
  14. Plotting kde plots: plot_cont_kde(data, var)
  15. Automatic calculating the missing values and their percentage along with visualization : calculating_missing_values(data)

Tutorial:

Link: https://github.com/AmitHasanShuvo/data-inspector/blob/main/notebook/example%20notebook.ipynb
Colab link: https://colab.research.google.com/drive/1mj9gz2XyQprSYdKMUKlKkJ9Qi8XmleHW?usp=sharing

Some visualizations:



How to cite:

@online{data-inspector,
title={data-inspector},
url={https://pypi.org/project/data-inspector/},
urldate = {2021-08-21}, 
publisher={Kazi Amit Hasan}
}

Future Works:

  1. Add some automations for time series data.

How to contribute:

Any contribution would be highly appreciated. Kindly go through the guidelines for contributing in github.

Change Log

0.0.1 (20/08/2021)

  • First Release

0.0.2 (20/08/2021)

  • Minor updates

0.0.3 (20/08/2021)

  • Minor updates

0.0.4 (20/08/2021)

  • Minor updates

0.0.5 (20/08/2021)

  • Minor updates

0.0.6 (20/08/2021)

  • Minor updates

0.0.8 (21/08/2021)

  • Minor updates

1.1 (21/08/2021)

  • Minor updates

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data_inspector-1.5.2.tar.gz (6.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

data_inspector-1.5.2-py3-none-any.whl (6.5 kB view details)

Uploaded Python 3

File details

Details for the file data_inspector-1.5.2.tar.gz.

File metadata

  • Download URL: data_inspector-1.5.2.tar.gz
  • Upload date:
  • Size: 6.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.4 pkginfo/1.7.1 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.10

File hashes

Hashes for data_inspector-1.5.2.tar.gz
Algorithm Hash digest
SHA256 5cb367166e383d62383905f13d6025feaf04a892b5c4a447a0700c77904ed166
MD5 cec5f259f572bc0d9ca92242ef60eb2b
BLAKE2b-256 145fe38b823a5c8e29c53a570847563fa49e51cf24627c76a53078a26dbff6e4

See more details on using hashes here.

File details

Details for the file data_inspector-1.5.2-py3-none-any.whl.

File metadata

  • Download URL: data_inspector-1.5.2-py3-none-any.whl
  • Upload date:
  • Size: 6.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.4 pkginfo/1.7.1 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.10

File hashes

Hashes for data_inspector-1.5.2-py3-none-any.whl
Algorithm Hash digest
SHA256 b8950eda645a333599fc5217a5df0fbdf6598128ec6a4dd0711975961c23c5ee
MD5 216309281e182c97d748b43aa23f63db
BLAKE2b-256 39801524b53ec1bbdc3317825417cb684332f2ef66df584146565d01680ccda9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page