Skip to main content

The complete ML toolkit โ€” EDA, cleaning, training, explainability, deployment

Project description

mlpilot ๐Ÿš€

PyPI version Python 3.9+ License: MIT Tests Downloads

mlpilot is the complete Python ML toolkit โ€” what currently takes 30โ€“40 hours of repetitive boilerplate takes 5โ€“10 minutes. One import. Every tool you need. Full explainability.

import mlpilot as ml

eda    = ml.analyze(df, target='churn')          # 12-section EDA report
clean  = ml.clean(df, target='churn')            # auto null/outlier/dtype fixing
feats  = ml.features(clean.df, target='churn')  # leakage-safe feature pipeline
board  = ml.baseline(X_train, y_train)           # 15+ model leaderboard in 2 min
tuned  = ml.tune('lgbm', X_train, y_train)       # Bayesian hyperparameter search
exp    = ml.explain(tuned.best_model, X_train)   # SHAP global + local explanations
api    = ml.deploy(tuned.best_model)             # FastAPI + Docker in 5 minutes
api.serve(port=8000)                              # โ†’ localhost:8000/predict

Why mlpilot?

Feature mlpilot ydata-profiling sweetviz PyCaret SHAP
Smart EDA report โœ… โœ… โœ… โŒ โŒ
Auto data cleaning โœ… โŒ โŒ Partial โŒ
Multi-model baseline โœ… โŒ โŒ โœ… โŒ
Hyperparameter tuning โœ… โŒ โŒ โœ… โŒ
Model explainability โœ… โŒ โŒ โŒ โœ…
Time series โœ… โŒ โŒ โœ… โŒ
NLP pipeline โœ… โŒ โŒ โœ… โŒ
API deployment โœ… โŒ โŒ โŒ โŒ
AI data analyst โœ… โŒ โŒ โŒ โŒ
Undo / diff reports โœ… โŒ โŒ โŒ โŒ

Installation

# Core (EDA, cleaning, validation, features, training)
pip install mlplt

# With specific extras
pip install mlplt[xgb,lgbm,shap,optuna]

# Everything
pip install mlplt[full]

Available extras: xgb, lgbm, shap, optuna, prophet, nlp, imb, deploy, ai, full

Modules

Module Function Description
SmartEDA ml.analyze(df) 12-section EDA report with plots
AutoCleaner ml.clean(df) Auto null/outlier/dtype fixing with undo
DataValidator ml.validate(df) Schema, leakage, drift detection
FeatureForge ml.features(df) Leakage-safe encoding + scaling pipeline
BaselineBlitz ml.baseline(X, y) 15+ model comparison leaderboard
EvalSuite ml.evaluate(model, X, y) All metrics + diagnostic plots
HyperX ml.tune(model, X, y) Bayesian hyperparameter optimization
Explainer ml.explain(model, X) SHAP global + local + what-if
BalanceKit ml.balance(X, y) Auto SMOTE/ADASYN/class_weight
TimeSense ml.forecast(df) Multi-model time series forecasting
TextML ml.text_classify(df) NLP classification + embeddings
LaunchPad ml.deploy(model) FastAPI + Docker generation
AIAnalyst ml.analyst(df) Ask questions in plain English

Quick Start โ€” Churn Prediction

import mlpilot as ml
import pandas as pd

df = pd.read_csv('churn.csv')

# 1. Understand your data
eda = ml.analyze(df, target='Churn', report_format='html')

# 2. Clean it
df_clean = ml.clean(df, target='Churn').df

# 3. Engineer features (leakage-safe)
feats = ml.features(df_clean, target='Churn')
X_train, X_test, y_train, y_test = ml.split(feats, test_size=0.2, stratify=True)

# 4. Handle imbalance
bal = ml.balance(X_train, y_train)

# 5. Find the best model
board = ml.baseline(bal.X_resampled, bal.y_resampled, X_test=X_test, y_test=y_test)
board.leaderboard.print()

# 6. Tune + evaluate
tuned = ml.tune('lgbm', bal.X_resampled, bal.y_resampled, time_budget=300)
eval_r = ml.evaluate(tuned.best_model, X_test, y_test, optimize_threshold=True)

# 7. Explain
exp = ml.explain(tuned.best_model, X_train, X_test)
exp.feature_importance()

# 8. Deploy
ml.deploy(tuned.best_model, X_sample=X_test.iloc[:10]).serve(port=8000)

Documentation

Full API reference: mlpilot.readthedocs.io

Contributing

  1. Fork the repo
  2. pip install -e ".[dev]"
  3. pre-commit install
  4. Make your changes + add tests
  5. pytest tests/ --cov=mlpilot
  6. Open a pull request

License

MIT โ€” see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlplt-0.2.1.tar.gz (88.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mlplt-0.2.1-py3-none-any.whl (95.6 kB view details)

Uploaded Python 3

File details

Details for the file mlplt-0.2.1.tar.gz.

File metadata

  • Download URL: mlplt-0.2.1.tar.gz
  • Upload date:
  • Size: 88.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for mlplt-0.2.1.tar.gz
Algorithm Hash digest
SHA256 732b0a6113ad990bab13a6c7fddf4be886cd0f5166a2cb61167957054ddbcb3f
MD5 c812005024acfb854b92620afdec5b34
BLAKE2b-256 62dea464899826b5d4b3ff394a572115e94fd5409b1f73edf147843ccd0cd389

See more details on using hashes here.

File details

Details for the file mlplt-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: mlplt-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 95.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for mlplt-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 a2188cd0f8867e363e090f7f6ad8c13da16d83a65bb3c8d7ffbc42fe4a28e1be
MD5 ecd1597b3e1ca6040b6f55ff56919a5e
BLAKE2b-256 fb925a6912678c12d277593943b3660c8e3d84e15cf841120fa77b4dc3d99214

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page