Skip to main content

Beginner-friendly AutoML library for tabular data

Project description

KrishnAutoML 🚀

PyPI version Build Status License

KrishnAutoML is a lightweight, beginner-friendly, and production-ready AutoML library for tabular data.
It automates the end-to-end machine learning workflow with minimal user input, while keeping things modular and extensible.


✨ Features

  • 📂 Load data from CSV or Pandas DataFrame
  • 🔍 Automatic problem type detection (classification or regression)
  • 🧹 Smart preprocessing (missing values, categorical encoding, scaling)
  • 📊 Optional EDA reports for insights
  • 🤖 Train multiple models (LightGBM, XGBoost, CatBoost, Scikit-Learn)
  • 🎯 Automated model selection and hyperparameter tuning (Optuna / GridSearchCV)
  • 📈 Flexible cross-validation (KFold, StratifiedKFold, GroupKFold)
  • 📝 Multiple evaluation metrics dynamically
  • ⚡ Early stopping and GPU support
  • 💾 Save models + reproducible pipeline code
  • 📑 Auto-generated reports in HTML/Markdown

🛠 Installation

From PyPI (after publishing):

pip install krishnautoml

From source:

git clone https://github.com/<your-username>/KrishnAutoML.git
cd KrishnAutoML
pip install -e .[dev]

🚀 Quick Start

Python API

from krishnautoml import KrishnAutoML

# Initialize AutoML
automl = KrishnAutoML(target="Survived", problem_type="auto")

# Full pipeline
(
    automl
    .load_data("data/titanic.csv")
    .preprocess()
    .train_models()
    .evaluate()
    .save_model("best_model.pkl")
)

print("Best model metrics:", automl.best_score)

Command Line Interface (CLI)

krishnautoml fit --data data/titanic.csv --target Survived --report

This will:

  • Train models
  • Save best_model.pkl
  • Generate an HTML performance report

📊 Example Output

Metrics (Classification example):

{'accuracy': 0.8567, 'precision': 0.8421, 'recall': 0.8312, 'f1': 0.8350}

Generated Report:

  • 📈 Confusion matrix
  • 🔑 Feature importance
  • 📊 ROC-AUC curve
  • 📑 Summary of preprocessing steps

⚙️ Advanced Usage

  • 🔄 Custom cross-validation:
automl = KrishnAutoML(target="SalePrice", cv_strategy="KFold", n_splits=10)
  • 🎯 Specify metrics:
automl = KrishnAutoML(target="Survived", metrics=["accuracy", "f1"])
  • 📦 Load trained model:
from joblib import load
model = load("best_model.pkl")

🧑‍💻 Development

Clone and install dev dependencies:

git clone https://github.com/<your-username>/KrishnAutoML.git
cd KrishnAutoML
pip install -e .[dev]

Run tests:

pytest

Lint & format:

flake8 krishnautoml
black krishnautoml

📜 License

MIT License © 2025 [Your Name]


🤝 Contributing

Contributions are welcome!

  • Fork the repo
  • Create a feature branch
  • Submit a PR 🎉

🙌 Acknowledgements

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

krishnautoml-0.1.1.tar.gz (17.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

krishnautoml-0.1.1-py3-none-any.whl (20.2 kB view details)

Uploaded Python 3

File details

Details for the file krishnautoml-0.1.1.tar.gz.

File metadata

  • Download URL: krishnautoml-0.1.1.tar.gz
  • Upload date:
  • Size: 17.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for krishnautoml-0.1.1.tar.gz
Algorithm Hash digest
SHA256 d2e13453067f39b03975273b1b5224d36401957aa5b965b914db5346fdce1cd0
MD5 d107c4c2fb4f97e1fea464b5d186f5dc
BLAKE2b-256 be1883c4b50e9171a5829f683aba61384f5a5574de46adaede91f0c09f345f58

See more details on using hashes here.

File details

Details for the file krishnautoml-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: krishnautoml-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 20.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for krishnautoml-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 5e70992e33025d984a495132f5d44d2619741beb0de8eb6d25601ed3e2238c06
MD5 ee7c0bfd2fb6acb7e60641444f7ce263
BLAKE2b-256 e6bf4ae70c1b5add379dabf51264e732297348db1c7b5f5634efa6de73e23722

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page