Skip to main content

The complete ML toolkit โ€” EDA, cleaning, training, explainability, deployment

Project description

mlpilot ๐Ÿš€

PyPI version Python 3.9+ License: MIT Tests Downloads

mlpilot is the complete Python ML toolkit โ€” what currently takes 30โ€“40 hours of repetitive boilerplate takes 5โ€“10 minutes. One import. Every tool you need. Full explainability.

import mlpilot as ml

eda    = ml.analyze(df, target='churn')          # 12-section EDA report
clean  = ml.clean(df, target='churn')            # auto null/outlier/dtype fixing
feats  = ml.features(clean.df, target='churn')  # leakage-safe feature pipeline
board  = ml.baseline(X_train, y_train)           # 15+ model leaderboard in 2 min
tuned  = ml.tune('lgbm', X_train, y_train)       # Bayesian hyperparameter search
exp    = ml.explain(tuned.best_model, X_train)   # SHAP global + local explanations
api    = ml.deploy(tuned.best_model)             # FastAPI + Docker in 5 minutes
api.serve(port=8000)                              # โ†’ localhost:8000/predict

Why mlpilot?

Feature mlpilot ydata-profiling sweetviz PyCaret SHAP
Smart EDA report โœ… โœ… โœ… โŒ โŒ
Auto data cleaning โœ… โŒ โŒ Partial โŒ
Multi-model baseline โœ… โŒ โŒ โœ… โŒ
Hyperparameter tuning โœ… โŒ โŒ โœ… โŒ
Model explainability โœ… โŒ โŒ โŒ โœ…
Time series โœ… โŒ โŒ โœ… โŒ
NLP pipeline โœ… โŒ โŒ โœ… โŒ
API deployment โœ… โŒ โŒ โŒ โŒ
AI data analyst โœ… โŒ โŒ โŒ โŒ
Undo / diff reports โœ… โŒ โŒ โŒ โŒ

Installation

# Core (EDA, cleaning, validation, features, training)
pip install mlplt

# With specific extras
pip install mlplt[xgb,lgbm,shap,optuna]

# Everything
pip install mlplt[full]

Available extras: xgb, lgbm, shap, optuna, prophet, nlp, imb, deploy, ai, full

Modules

Module Function Description
SmartEDA ml.analyze(df) 12-section EDA report with plots
AutoCleaner ml.clean(df) Auto null/outlier/dtype fixing with undo
DataValidator ml.validate(df) Schema, leakage, drift detection
FeatureForge ml.features(df) Leakage-safe encoding + scaling pipeline
BaselineBlitz ml.baseline(X, y) 15+ model comparison leaderboard
EvalSuite ml.evaluate(model, X, y) All metrics + diagnostic plots
HyperX ml.tune(model, X, y) Bayesian hyperparameter optimization
Explainer ml.explain(model, X) SHAP global + local + what-if
BalanceKit ml.balance(X, y) Auto SMOTE/ADASYN/class_weight
TimeSense ml.forecast(df) Multi-model time series forecasting
TextML ml.text_classify(df) NLP classification + embeddings
LaunchPad ml.deploy(model) FastAPI + Docker generation
AIAnalyst ml.analyst(df) Ask questions in plain English

Quick Start โ€” Churn Prediction

import mlpilot as ml
import pandas as pd

df = pd.read_csv('churn.csv')

# 1. Understand your data
eda = ml.analyze(df, target='Churn', report_format='html')

# 2. Clean it
df_clean = ml.clean(df, target='Churn').df

# 3. Engineer features (leakage-safe)
feats = ml.features(df_clean, target='Churn')
X_train, X_test, y_train, y_test = ml.split(feats, test_size=0.2, stratify=True)

# 4. Handle imbalance
bal = ml.balance(X_train, y_train)

# 5. Find the best model
board = ml.baseline(bal.X_resampled, bal.y_resampled, X_test=X_test, y_test=y_test)
board.leaderboard.print()

# 6. Tune + evaluate
tuned = ml.tune('lgbm', bal.X_resampled, bal.y_resampled, time_budget=300)
eval_r = ml.evaluate(tuned.best_model, X_test, y_test, optimize_threshold=True)

# 7. Explain
exp = ml.explain(tuned.best_model, X_train, X_test)
exp.feature_importance()

# 8. Deploy
ml.deploy(tuned.best_model, X_sample=X_test.iloc[:10]).serve(port=8000)

Documentation

Full API reference: mlpilot.readthedocs.io

Contributing

  1. Fork the repo
  2. pip install -e ".[dev]"
  3. pre-commit install
  4. Make your changes + add tests
  5. pytest tests/ --cov=mlpilot
  6. Open a pull request

License

MIT โ€” see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlplt-0.1.3.tar.gz (88.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mlplt-0.1.3-py3-none-any.whl (95.4 kB view details)

Uploaded Python 3

File details

Details for the file mlplt-0.1.3.tar.gz.

File metadata

  • Download URL: mlplt-0.1.3.tar.gz
  • Upload date:
  • Size: 88.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for mlplt-0.1.3.tar.gz
Algorithm Hash digest
SHA256 c836ea86d3aa774857a518d72abb23b4b6cd3d805ce60ee386679e4564024c4b
MD5 04c2a7f9e8c4da8a5226f4b933cac331
BLAKE2b-256 669055769f231ef3a52d87f656326e8a1e0d1e3999adc78b647e2071ed8b1ea0

See more details on using hashes here.

File details

Details for the file mlplt-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: mlplt-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 95.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for mlplt-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 a10ffb8093c89f6537123beebf2eab61635d98bd3901ddab14789e2fe7b52fd3
MD5 caa173417b15b94db63c0d06bafa06cf
BLAKE2b-256 23a788f55e5a4aa721fdf724e063596af57a5d657f7fcba8527d29416960fe2e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page