AutoPreprocess
Automatic Data Preprocessing Library for Machine Learning
✨ Features
- Automatic column type detection (numeric, categorical, datetime, useless)
- Smart missing value handling (based on missing percentage)
- Outlier detection & capping (IQR method)
- Intelligent encoding (One-hot, Frequency, Target encoding)
- Feature scaling (Standard, MinMax, Robust)
- Feature selection (Variance, Correlation, Importance, Mutual Info)
- Train/Test split (Random, Stratified, Time series)
- Save & load pipeline for production deployment
- Zero data leakage (fit only on training data)
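As an illustration of how two of these features can fit together, here is a minimal sketch of IQR capping where the bounds are learned from the training split only and then reused on the test split. It is not taken from the library's source: the helper names `fit_iqr_bounds` and `cap`, the multiplier `k=1.5`, and the sample values are all assumptions made for the example, and it uses plain Python rather than whatever AutoPreprocess does internally.

```python
from statistics import quantiles


def fit_iqr_bounds(train_values, k=1.5):
    """Learn capping bounds from training values only (hypothetical helper)."""
    # Quartiles with linear interpolation between data points.
    q1, _, q3 = quantiles(train_values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def cap(values, bounds):
    """Clip every value into the [low, high] range learned earlier."""
    low, high = bounds
    return [min(max(v, low), high) for v in values]


# Illustrative data: one obvious outlier in each split.
train = [10.0, 12.0, 11.0, 13.0, 12.0, 95.0]
test = [11.0, 200.0]

bounds = fit_iqr_bounds(train)      # fit on training data only
train_capped = cap(train, bounds)   # apply to train
test_capped = cap(test, bounds)     # reuse the same bounds on test

print(bounds)
print(train_capped)
print(test_capped)
```

Because the test split never contributes to the quartiles, its outliers cannot shift the bounds, which is the idea behind "fit only on training data".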
🚀 Quick Start
```python
from autopreprocess import AutoClean

# One line to preprocess everything
pipeline = AutoClean('data.csv', target='price')
X_train, X_test, y_train, y_test = pipeline.preprocess()

# For new predictions (X_new: unseen rows with the same feature columns as data.csv)
X_new_clean = pipeline.predict_ready_data(X_new)

# Save for later
pipeline.save('my_pipeline.pkl')

# Load and use
loaded = AutoClean.load('my_pipeline.pkl')
```
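To show why saving a fitted pipeline matters, the sketch below persists the statistics learned during fitting and reapplies them to new data after reloading. It is a generic illustration, not AutoPreprocess's actual storage format: the use of `pickle` is only an assumption suggested by the `.pkl` extension, and `fit_scaler`, `apply_scaler`, and the file name `scaler.pkl` are hypothetical.

```python
import pickle
import tempfile
from pathlib import Path
from statistics import mean, pstdev


def fit_scaler(train_values):
    """Learn standard-scaling parameters from training values (hypothetical helper)."""
    # Fall back to 1.0 when the spread is zero to avoid dividing by zero.
    return {"mean": mean(train_values), "std": pstdev(train_values) or 1.0}


def apply_scaler(values, params):
    """Scale values with previously fitted parameters."""
    return [(v - params["mean"]) / params["std"] for v in values]


params = fit_scaler([2.0, 4.0, 6.0])

# Round-trip the fitted parameters through a file, as a deployment would.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "scaler.pkl"
    path.write_bytes(pickle.dumps(params))
    restored = pickle.loads(path.read_bytes())

# New data is transformed with the training-time statistics, not refit.
new_scaled = apply_scaler([4.0, 8.0], restored)
print(new_scaled)
```

Only unpickle files from a source you trust, since loading a pickle can execute arbitrary code.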
File details
Details for the file autopreprocess_lite-0.1.1.tar.gz.
File metadata
- Download URL: autopreprocess_lite-0.1.1.tar.gz
- Upload date:
- Size: 31.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 99d0978102409e4a2577712ace0c7e32249c6872d7811680697b36b9f334b719 |
| MD5 | e0e0490b943efd4081cd5f20a78ff353 |
| BLAKE2b-256 | 6aa08b6025fc678d758bd189bfb8950580b4110c95f66d2573fde429965e2c26 |
File details
Details for the file autopreprocess_lite-0.1.1-py3-none-any.whl.
File metadata
- Download URL: autopreprocess_lite-0.1.1-py3-none-any.whl
- Upload date:
- Size: 30.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 23e851937a2135664b369cc67f40f63b92c67db0ae397431f8a579798d6c6451 |
| MD5 | 1958e566982124e07e87482668a034dd |
| BLAKE2b-256 | 743b72c076442213567c51da31da5bcceb4ad7bdfefc270317f564d9485b957d |