The complete ML toolkit โ EDA, cleaning, training, explainability, deployment
Project description
mlpilot ๐
The Complete Python ML Toolkit โ Killing the Boilerplate.
mlpilot is the definitive "all-in-one" library for machine learning practitioners. What currently takes 40 hours of repetitive coding, mlpilot delivers in 5 minutes. No dependency hell. No boilerplate. Just results.
๐ข A Note from the Author
Hi! I'm Anannya Vyas, a student developer. ๐
I didn't build mlpilot just to add another library to the pile. I built it because I was tired of writing the same 200 lines of null-handling, scaling, and correlation plotting for every single project. I wanted a library that "thinks" like a data scientistโmaking smart defaults while giving you full override control.
This project is my graduation milestone. Itโs a journey from learning how to build a package to implementing local-first AI intelligence. Whether you are a student like me or a professional looking to move faster, I hope mlpilot saves you from the "boilerplate tax."
Special thanks to my teacher Lovnish Verma for the inspiration and guidance.
๐ The mlpilot Advantage
mlpilot isn't just a library; it's a replacement for half your requirements.txt.
| Feature | mlpilot | Profiling | PyCaret | SHAP | Fairlearn |
|---|---|---|---|---|---|
| Smart EDA Report | โ | โ | โ | โ | โ |
| Auto-Cleaning | โ | โ | Partial | โ | โ |
| Model Leaderboard | โ | โ | โ | โ | โ |
| Bias & Fairness Audit | โ | โ | โ | โ | โ |
| AI Data Analyst | โ | โ | โ | โ | โ |
| API Deployment | โ | โ | โ | โ | โ |
๐ ๏ธ Prerequisites & Installation
AI Features (Recommended)
mlpilot uses local-first AI to keep your data private.
- Install Ollama.
- Run
ollama pull llama3.2(orllama3). - Keep the Ollama app running to use
ml.analyst()andml.explain_data().
Installation
# Core installation
pip install mlplt
# Full suite (includes AI, NLP, and Deployment tools)
pip install mlplt[full]
โก Quick Start: The "Instant Analyst" Workflow
Solve a churn prediction problem from scratch in under 60 seconds of coding.
import mlpilot as ml
import pandas as pd
# 1. Load your data
df = pd.read_csv("customer_data.csv")
# 2. Let the AI explain it in 3 sentences
ml.explain_data(df)
# 3. Auto-generate a premium dashboard
viz = ml.visualize(df, target="churn")
viz.to_html("insights.html")
# 4. Clean and Engineer (leakage-safe)
clean = ml.clean(df, target="churn")
feats = ml.features(clean.df, target="churn")
# 5. Training & Auditing
X_train, X_test, y_train, y_test = ml.split(feats, test_size=0.2)
base = ml.baseline(X_train, y_train)
# 6. Audit for Bias & Stability
audit = ml.audit(base.best_model, X_test, y_test)
audit.print_summary()
# 7. Generate your final Project Story
ml.story([df, base, audit])
๐ง The AI Brain: Intelligent ML
๐ AI Analyst (ml.analyst)
Ask questions about your data in plain English. No more searching for pandas syntax.
"Which city has the highest average churn, and is it correlated with revenue?"
mlpilotgenerates the code, shows it to you for review, and runs it instantly.
โ๏ธ MLAudit (ml.audit)
Go beyond accuracy. Audit your models for:
- Technical Stability: How does the model react to noise?
- Social Bias: Check for Demographic Parity and Equalized Odds to ensure your model is fair to all groups.
๐ DataStory (ml.story)
Automatically synthesizes your EDA, Cleaning, Training, and Auditing results into a narrative report. Perfect for presentations!
๐ Complete API Reference
| Module | Function | Description |
|---|---|---|
| EDA | ml.analyze(df) |
12-section comprehensive report. |
| Visualizer | ml.visualize(df) |
Intelligent auto-plotting dashboard. |
| Cleaner | ml.clean(df) |
Auto-fix nulls, outliers, and dtypes. |
| Validator | ml.validate(df) |
Check for leakage, drift, and schema errors. |
| FeatureForge | ml.features(df) |
One-line encoding and scaling pipeline. |
| Baseline | ml.baseline(X,y) |
15+ model comparison leaderboard. |
| Explainer | ml.explain(model,X) |
Global and local SHAP explanations. |
| LaunchPad | ml.deploy(model) |
FastAPI + Docker in 5 minutes. |
๐ Project Structure
mlpilot/
โโโ ai/ # Analyst, Audit, Story (The Brain)
โโโ clean/ # Cleaner, Strategies, Diff
โโโ eda/ # Analyzer, Visualizer, Plots
โโโ train/ # Baseline, HyperX, Evaluate
โโโ validate/ # Schema, Drift, Checks
โโโ deploy/ # LaunchPad, FastAPI
๐ค Contributing
Contributions are welcome! This is a student project, and I'd love to see how you improve it.
- Fork the repo.
- Install dev dependencies:
pip install -e ".[dev]" - Run tests:
pytest tests/
๐ License
Licensed under the MIT License. Use it, fork it, make it yours.
โญ Show Your Support
If mlpilot helped you crush your project boilerplate:
- โญ Star the repo on GitHub.
- ๐ข Share it with your classmates.
- ๐ Report bugs to help me learn and improve.
Built with โค๏ธ for the ML community.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mlplt-0.1.0.tar.gz.
File metadata
- Download URL: mlplt-0.1.0.tar.gz
- Upload date:
- Size: 88.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8cab12ce5bf54d6a5cdf2d074f8ffc259699e1b4c793bf4b8aa8dba2314e02d6
|
|
| MD5 |
0fa18ab47d69ecbbaa1c08b7592cb574
|
|
| BLAKE2b-256 |
fda15fa147395908690e5191251c4fa83aea521927812181d3cdc5e596ad25ea
|
File details
Details for the file mlplt-0.1.0-py3-none-any.whl.
File metadata
- Download URL: mlplt-0.1.0-py3-none-any.whl
- Upload date:
- Size: 94.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d1c2bcc5965cdbd8eef09bf862c139c1ee582f04cb3a184aa05b7ed55840b29a
|
|
| MD5 |
dd07e8b36cc83fcef1fa53f68196efd7
|
|
| BLAKE2b-256 |
8b138a66b8bc5adb32b378913d24682938ca8b01f98f877fbfd4dfb86aab84f2
|