Universal AutoML + Feature Engineering + Explainability Library
Project description
๐ง featuremind v3.1.1
One-line AutoML with Built-in Reliability, Leakage Detection & Explainability
โญ If this project helps you, give it a star โ it really helps!
๐ What is featuremind?
featuremind is a one-line AutoML library that handles the complete machine learning pipeline โ from raw CSV to production-ready model โ with built-in reliability checking, leakage detection, and feature engineering.
import featuremind as fm
fm.analyze("data.csv")
That's it. One line. Full analysis, model selection, feature suggestions, SHAP importance, leakage check, and HTML report โ all automated.
๐งช Tested Datasets
featuremind v3.1 has been verified on:
| Dataset | Type | Score | Notes |
|---|---|---|---|
| Telecom Churn (7,043 rows) | Classification | 85.7% F1 | โ Stable, well-balanced |
| Credit Card Fraud (284,807 rows) | Classification | ~99% F1 | โ ๏ธ High score due to PCA-transformed separable data |
| Heart Failure Medical | Classification | ~80% Accuracy | โ Works |
| House Prices | Regression | Rยฒ reported | โ Works |
| Generic CSVs | Auto-detected | Auto-detected | โ Works |
๐ Key Features
๐ค Auto ML Pipeline
- Loads and cleans any CSV automatically
- Detects target column, task type (classification / regression), and data issues
- Trains 6 models: LogisticRegression, RandomForest, GradientBoosting, XGBoost, LightGBM, CatBoost
- Picks best model using cross-validation
- Auto hyperparameter tuning (RandomizedSearchCV)
๐ก๏ธ Leakage Guard (Core Feature)
- Detects if any feature formula references the target column
- Flags columns with suspiciously high correlation with target (>0.95)
- Smart ID detection (non-generalizable columns)
- Warns user before model training (no silent failures)
๐ Reliability Engine
-
Detects unrealistic scores (>0.98)
-
Adjusts confidence level automatically:
-
0.99 โ Low confidence โ
-
0.98 โ Medium โ ๏ธ
-
-
Highlights possible issues:
- Data leakage
- Overfitting
- Sampling bias
โ๏ธ Class Imbalance Handling
- Detects imbalance automatically
- Applies SMOTE (if available)
- Falls back to class weights
- Switches evaluation metric to F1 when needed
๐ SHAP Explainability
- Computes SHAP values for model explainability
- Displays top features influencing predictions
- Helps identify real business drivers
๐ฌ Feature Engineering (Multi-layer)
- Domain-aware features: Telecom ยท Medical ยท Real Estate ยท Finance ยท HR
- Interactions, ratios, log transforms, polynomial features
- Only surfaces features that improve performance
๐๏ธ Production Pipeline
- Save trained model + preprocessing pipeline
- Load and predict on new/unseen data
- Handles missing columns and unseen categories
๐ Experiment Tracking
- Logs every run automatically
- Leaderboard of models and scores
- Export results to CSV
๐ REST API (Optional)
- FastAPI-based prediction server
- Ready-to-use endpoints for deployment
๐ Why featuremind?
| Capability | featuremind | Typical AutoML Tools |
|---|---|---|
| One-line usage | โ | โ |
| Leakage detection | โ | โ |
| Reliability scoring | โ | โ |
| SHAP explainability | โ | โ ๏ธ |
| Production pipeline | โ | โ |
๐ฆ Installation
pip install featuremind
# (Recommended) Install advanced ML libraries
pip install xgboost lightgbm catboost shap imbalanced-learn
# Optional API support
pip install fastapi uvicorn python-multipart
๐ Quick Start
import featuremind as fm
fm.analyze("data.csv")
fm.check_leakage("data.csv", target="Churn")
pipeline = fm.train("data.csv", target="Churn")
pipeline.save("churn_pipeline")
pipeline = fm.load_pipeline("churn_pipeline")
results = pipeline.predict_df(new_data)
fm.get_tracker().leaderboard()
fm.serve("churn_pipeline/", port=8000)
๐ฌ Example Output
๐ง featuremind v3.1.1 โ Starting Analysis
๐ฏ Best Model : LightGBM
๐ Score : 0.8569 (F1-weighted)
๐ Confidence : High โ
๐ก๏ธ Leakage : None detected
๐ Project Structure
featuremind_project/
โ
โโโ featuremind/
โ โโโ analyzer.py
โ โโโ feature_engineer.py
โ โโโ evaluator.py
โ โโโ leakage_guard.py
โ โโโ importance.py
โ โโโ reporter.py
โ โโโ html_reporter.py
โ โโโ insights.py
โ โโโ pipeline.py
โ โโโ tracker.py
โ โโโ api.py
โ
โโโ setup.py
โโโ requirements.txt
โโโ test.py
โโโ README.md
โ ๏ธ Notes
-
High accuracy (>0.98) may indicate:
- Data leakage
- Highly separable datasets
- Sampling bias
-
Always validate models on unseen data.
๐ Output Files
featuremind_report.htmlโ Full analysis reportfeaturemind_report.pngโ Feature visualizationenhanced_data.csvโ Dataset with engineered featuresfeaturemind_experiments.csvโ Experiment logspipeline/โ Saved production model
๐ก Use Cases
- Telecom churn prediction
- Fraud detection
- Healthcare predictions
- Real estate pricing
- HR analytics
- Any tabular ML problem
๐ฅ Why Developers Love featuremind
- โก Go from raw data โ model in 1 line
- ๐ก๏ธ Built-in leakage detection (rare in AutoML)
- ๐ Explainable AI (SHAP) included by default
- ๐ง Reliability scoring (not just accuracy)
- ๐๏ธ Direct production pipeline export
๐ Not just AutoML โ this is AutoML + Trust Layer
๐ฎ Roadmap
- Time-series support
- Deep learning integration
- Streamlit dashboard
- Cloud deployment
๐ License
MIT License
๐ฉโ๐ป Author
Niveditha โ Data Scientist & ML Engineer
โญ If this project helps you, consider giving it a star!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file featuremind-3.1.1.tar.gz.
File metadata
- Download URL: featuremind-3.1.1.tar.gz
- Upload date:
- Size: 62.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bf015dc14ccad30642624e0a08b71e0321f9df8f9eb372e8e5e34ebbcf8cbee7
|
|
| MD5 |
cd0cd38d45fee67ec5ae13e327d6633a
|
|
| BLAKE2b-256 |
eff9295d94e0110e0295bec7c7350e950a25f692dd105be03b97aed6f81ad1fe
|
File details
Details for the file featuremind-3.1.1-py3-none-any.whl.
File metadata
- Download URL: featuremind-3.1.1-py3-none-any.whl
- Upload date:
- Size: 65.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a306b8d71b9d3548f3b883782840852069ac53172e16e045f412a7fa435fe19e
|
|
| MD5 |
ac3c7ebae7a22a490dd083caf3e952c8
|
|
| BLAKE2b-256 |
975144a7559c9c18d276458ee3cf4a581644c240fdeb4e008cf5745b7b0f8f0d
|