Skip to main content

Beginner-friendly AutoML library for tabular data

Project description

KrishnAutoML 🚀

PyPI version Build Status License

KrishnAutoML is a lightweight, beginner-friendly, and production-ready AutoML library for tabular data.
It automates the end-to-end machine learning workflow with minimal user input, while keeping things modular and extensible.


✨ Features

  • 📂 Load data from CSV or Pandas DataFrame
  • 🔍 Automatic problem type detection (classification or regression)
  • 🧹 Smart preprocessing (missing values, categorical encoding, scaling)
  • 📊 Optional EDA reports for insights
  • 🤖 Train multiple models (LightGBM, XGBoost, CatBoost, Scikit-Learn)
  • 🎯 Automated model selection and hyperparameter tuning (Optuna / GridSearchCV)
  • 📈 Flexible cross-validation (KFold, StratifiedKFold, GroupKFold)
  • 📝 Multiple evaluation metrics dynamically
  • ⚡ Early stopping and GPU support
  • 💾 Save models + reproducible pipeline code
  • 📑 Auto-generated reports in HTML/Markdown

🛠 Installation

From PyPI (after publishing):

pip install krishnautoml

From source:

git clone https://github.com/<your-username>/KrishnAutoML.git
cd KrishnAutoML
pip install -e .[dev]

🚀 Quick Start

Python API

from krishnautoml import KrishnAutoML

# Initialize AutoML
automl = KrishnAutoML(target="Survived", problem_type="auto")

# Full pipeline
(
    automl
    .load_data("data/titanic.csv")
    .preprocess()
    .train_models()
    .evaluate()
    .save_model("best_model.pkl")
)

print("Best model metrics:", automl.best_score)

Command Line Interface (CLI)

krishnautoml fit --data data/titanic.csv --target Survived --report

This will:

  • Train models
  • Save best_model.pkl
  • Generate an HTML performance report

📊 Example Output

Metrics (Classification example):

{'accuracy': 0.8567, 'precision': 0.8421, 'recall': 0.8312, 'f1': 0.8350}

Generated Report:

  • 📈 Confusion matrix
  • 🔑 Feature importance
  • 📊 ROC-AUC curve
  • 📑 Summary of preprocessing steps

⚙️ Advanced Usage

  • 🔄 Custom cross-validation:
automl = KrishnAutoML(target="SalePrice", cv_strategy="KFold", n_splits=10)
  • 🎯 Specify metrics:
automl = KrishnAutoML(target="Survived", metrics=["accuracy", "f1"])
  • 📦 Load trained model:
from joblib import load
model = load("best_model.pkl")

🧑‍💻 Development

Clone and install dev dependencies:

git clone https://github.com/<your-username>/KrishnAutoML.git
cd KrishnAutoML
pip install -e .[dev]

Run tests:

pytest

Lint & format:

flake8 krishnautoml
black krishnautoml

📜 License

MIT License © 2025 [Your Name]


🤝 Contributing

Contributions are welcome!

  • Fork the repo
  • Create a feature branch
  • Submit a PR 🎉

🙌 Acknowledgements

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

krishnautoml-0.1.2.tar.gz (18.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

krishnautoml-0.1.2-py3-none-any.whl (20.2 kB view details)

Uploaded Python 3

File details

Details for the file krishnautoml-0.1.2.tar.gz.

File metadata

  • Download URL: krishnautoml-0.1.2.tar.gz
  • Upload date:
  • Size: 18.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for krishnautoml-0.1.2.tar.gz
Algorithm Hash digest
SHA256 91634ce716249dd90f365a1ae4c41d8b5ef5bf76a96bcb24460f386ee8daab65
MD5 89aa5c2b023245672be5e7417e37a1ef
BLAKE2b-256 9571956cc1292001584644ebd6b458228845e8c8f1456c98202f148c1d9e30f6

See more details on using hashes here.

File details

Details for the file krishnautoml-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: krishnautoml-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 20.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for krishnautoml-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 90203e3cba4da4e8bb3d0478638663ae6f78828f6874adc830d5492ac65e021b
MD5 4a7f6a6a50ae427903a4fcce8bc107e0
BLAKE2b-256 8c567c7682455d1fcb98114110ab992ef7a212aa60f8580c406c883bf3c66306

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page