


AutoPreprocess

Automatic Data Preprocessing Library for Machine Learning


✨ Features

  • Automatic column type detection (numeric, categorical, datetime, useless)
  • Smart missing value handling (based on missing percentage)
  • Outlier detection & capping (IQR method)
  • Intelligent encoding (One-hot, Frequency, Target encoding)
  • Feature scaling (Standard, MinMax, Robust)
  • Feature selection (Variance, Correlation, Importance, Mutual Info)
  • Train/Test split (Random, Stratified, Time series)
  • Save & load pipeline for production deployment
  • Zero data leakage (fit only on training data)
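To make two of the items above concrete (IQR capping and fitting on training data only), here is a minimal standard-library sketch. It is not AutoPreprocess's implementation: the helper names `fit_iqr_bounds` and `cap`, the multiplier `k=1.5`, and the sample values are assumptions chosen for illustration.

```python
from statistics import quantiles


def fit_iqr_bounds(train_values, k=1.5):
    """Learn capping bounds from the training values only (hypothetical helper)."""
    q1, _, q3 = quantiles(train_values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def cap(values, bounds):
    """Clip each value into the [low, high] range learned during fitting."""
    low, high = bounds
    return [min(max(v, low), high) for v in values]


train = [10, 12, 11, 13, 12, 11, 95]   # 95 is an obvious outlier
test = [9, 12, 200]                    # unseen data, never used for fitting

bounds = fit_iqr_bounds(train)         # fit on the training split only
train_capped = cap(train, bounds)
test_capped = cap(test, bounds)        # reuse the training bounds unchanged

print(bounds)        # (8.75, 14.75)
print(test_capped)   # [9, 12, 14.75]
```

Because the bounds are computed once from `train` and then reused, nothing about the test values influences the transformation, which is the usual meaning of avoiding leakage.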

🚀 Quick Start

from autopreprocess import AutoClean

# One line to preprocess everything
pipeline = AutoClean('data.csv', target='price')
X_train, X_test, y_train, y_test = pipeline.preprocess()

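# For new predictions
# (X_new is assumed to hold new, unprocessed rows with the same feature columns as data.csv)
X_new_clean = pipeline.predict_ready_data(X_new)

# Save for later
pipeline.save('my_pipeline.pkl')

# Load and use
loaded = AutoClean.load('my_pipeline.pkl')
X_new_clean = loaded.predict_ready_data(X_new)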
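The save and load steps above follow a common pattern: persist only the parameters learned from the training data, then reapply them to new rows later. The sketch below illustrates that idea with the standard `pickle` module and a toy mean imputer. It makes no claim about how `AutoClean.save` or `AutoClean.load` work internally; `fit_mean_imputer` and `apply_mean_imputer` are hypothetical helpers.

```python
import os
import pickle
import tempfile


def fit_mean_imputer(train_values):
    """Learn the fill value from training data, ignoring missing entries."""
    known = [v for v in train_values if v is not None]
    return {"fill_value": sum(known) / len(known)}


def apply_mean_imputer(values, params):
    """Replace missing entries using previously learned parameters."""
    return [params["fill_value"] if v is None else v for v in values]


params = fit_mean_imputer([1.0, None, 3.0])   # learned fill value: 2.0

# Persist the learned state to a temporary file (illustrative path)
path = os.path.join(tempfile.mkdtemp(), "my_pipeline.pkl")
with open(path, "wb") as f:
    pickle.dump(params, f)

# Later, possibly in another process: restore and apply to new data
with open(path, "rb") as f:
    restored = pickle.load(f)

new_clean = apply_mean_imputer([None, 5.0], restored)
print(new_clean)   # [2.0, 5.0]
```

As with any pickle file, only load pipelines from sources you trust, since unpickling can execute arbitrary code.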


