An easy to use pre-processing utility for machine learning.
Project description
Data PreProcessing
EasyPreProcessing is a Python module that comprises of data pre-processing helper functions mainly for the purpose of data science and machine learning.
Many of the common machine learning activities that are performed during the Feature Engineering can be performed in a single line of code using this library.
What functionalities are currently available?
- Handling missing values
- Encoding categorical variables
- Handling DateTime features
- Handling empty/blank columns
- Display correlation metrics
- Standardize dataset
- Over sampling
- Clustering (KMeans)
Installing
Just a simple
pip install easypreprocessing
For details regarding all the functionality available:
from easypreprocessing import EasyPreProcessing
prep = EasyPreProcessing('filename.csv')
prep.info()
Sample Templete
Below you can see a sample code of preprocessing using this library.
from easypreprocessing import EasyPreProcessing
prep = EasyPreProcessing('filename_here.csv')
prep.output = 'output_variable_here'
prep.remove_blank() #Remove blank or empty columns
prep.missing_values #Display missing values
prep.categorical.impute() #Fill missing values for categorical variables
prep.numerical.impute() #Fill missing values for numerical variables
prep.categorical.encode() #Convert categorical features to numerical
prep.standardize() #Standardize dataset
X_train, X_test, y_train, y_test = prep.split()
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file easypreprocessing-1.0.7.tar.gz
.
File metadata
- Download URL: easypreprocessing-1.0.7.tar.gz
- Upload date:
- Size: 5.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.0 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.6.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c692d586d4c7dbb341f0e5d4976eae0074e7287b3193b6247c7e736780553407 |
|
MD5 | ecbe6100af239e76c79a2883f1ffe4b6 |
|
BLAKE2b-256 | 152602fc47ec0b7d0be03bea570773bf2680acd89032fc6511766f8ee00ae21b |
File details
Details for the file easypreprocessing-1.0.7-py3-none-any.whl
.
File metadata
- Download URL: easypreprocessing-1.0.7-py3-none-any.whl
- Upload date:
- Size: 9.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.0 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.6.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fe81857dda4206e5071ccf74aea94ecd313d9babadb333e26cb0fc9df0af7972 |
|
MD5 | 2e75c8bfdaac34147f2c4e4f81ef6eb4 |
|
BLAKE2b-256 | 1040ee7d75a2f8e243a2f1ccc7c743e26d02d45d9646e4645f61e66306c4acd4 |