A Python library for Automated Exploratory Data Analysis, Automated Data Cleaning and Automated Data Preprocessing For Machine Learning and Natural Language Processing in Python.
Project description
Data-Purifier
A Python library for Automated Exploratory Data Analysis, Automated Data Cleaning and Automated Data Preprocessing For Machine Learning and Natural Language Processing in Python.
Features
-
It gives shape, number of categorical and numerical features, description of the dataset, and also the information about the number of null values and their respective percentage.
-
For understanding the distribution of datasets and getting useful insights, there are many interactive plots generated where the user can select his desired column and the system will automatically plot it. Plot includes
- Count plot
- Correlation plot
- Joint plot
- Pair plot
- Pie plot
Get Started
Install the package
pip install data-purifier
Load the module
from datapurifier import Mleda
Load the dataset and let the magic of automated EDA begin
df = pd.read_csv("./datasets/iris.csv")
ae = Mleda(df)
ae
Python Package: https://pypi.org/project/data-purifier/
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Hashes for data_purifier-0.0.9-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 39b8c4954d23ff7a72b05f5f36d0ca6616c5eeb520e6d06fb59409944029da4d |
|
MD5 | 554a005a287efa94fae9d4cc4c3ba400 |
|
BLAKE2b-256 | 9455b4b83751e66701305a13999cf319c98993c64ed3107b1ff49e3dfab23ad6 |