Skip to main content

Beginner-friendly AutoML library for tabular data

Project description

KrishnAutoML 🚀

PyPI version Build Status License

KrishnAutoML is a lightweight, beginner-friendly, and production-ready AutoML library for tabular data.
It automates the end-to-end machine learning workflow with minimal user input, while keeping things modular and extensible.


✨ Features

  • 📂 Load data from CSV or Pandas DataFrame
  • 🔍 Automatic problem type detection (classification or regression)
  • 🧹 Smart preprocessing (missing values, categorical encoding, scaling)
  • 📊 Optional EDA reports for insights
  • 🤖 Train multiple models (LightGBM, XGBoost, CatBoost, Scikit-Learn)
  • 🎯 Automated model selection and hyperparameter tuning (Optuna / GridSearchCV)
  • 📈 Flexible cross-validation (KFold, StratifiedKFold, GroupKFold)
  • 📝 Multiple evaluation metrics dynamically
  • ⚡ Early stopping and GPU support
  • 💾 Save models + reproducible pipeline code
  • 📑 Auto-generated reports in HTML/Markdown

🛠 Installation

From PyPI (after publishing):

pip install krishnautoml

From source:

git clone https://github.com/<your-username>/KrishnAutoML.git
cd KrishnAutoML
pip install -e .[dev]

🚀 Quick Start

Python API

from krishnautoml import KrishnAutoML

# Initialize AutoML
automl = KrishnAutoML(target="Survived", problem_type="auto")

# Full pipeline
(
    automl
    .load_data("data/titanic.csv")
    .preprocess()
    .train_models()
    .evaluate()
    .save_model("best_model.pkl")
)

print("Best model metrics:", automl.best_score)

Command Line Interface (CLI)

krishnautoml fit --data data/titanic.csv --target Survived --report

This will:

  • Train models
  • Save best_model.pkl
  • Generate an HTML performance report

📊 Example Output

Metrics (Classification example):

{'accuracy': 0.8567, 'precision': 0.8421, 'recall': 0.8312, 'f1': 0.8350}

Generated Report:

  • 📈 Confusion matrix
  • 🔑 Feature importance
  • 📊 ROC-AUC curve
  • 📑 Summary of preprocessing steps

⚙️ Advanced Usage

  • 🔄 Custom cross-validation:
automl = KrishnAutoML(target="SalePrice", cv_strategy="KFold", n_splits=10)
  • 🎯 Specify metrics:
automl = KrishnAutoML(target="Survived", metrics=["accuracy", "f1"])
  • 📦 Load trained model:
from joblib import load
model = load("best_model.pkl")

🧑‍💻 Development

Clone and install dev dependencies:

git clone https://github.com/<your-username>/KrishnAutoML.git
cd KrishnAutoML
pip install -e .[dev]

Run tests:

pytest

Lint & format:

flake8 krishnautoml
black krishnautoml

📜 License

MIT License © 2025 [Your Name]


🤝 Contributing

Contributions are welcome!

  • Fork the repo
  • Create a feature branch
  • Submit a PR 🎉

🙌 Acknowledgements

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

krishnautoml-0.1.0.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

krishnautoml-0.1.0-py3-none-any.whl (7.6 kB view details)

Uploaded Python 3

File details

Details for the file krishnautoml-0.1.0.tar.gz.

File metadata

  • Download URL: krishnautoml-0.1.0.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for krishnautoml-0.1.0.tar.gz
Algorithm Hash digest
SHA256 7ba0551b2afb794081f918b16a48ab1e90b0194f1d12106f9506fae0dd740182
MD5 3685d7759627bdc92f11e7f46e6bd5f7
BLAKE2b-256 48ea656af0cb7f5c7c109692cb993a3f3619ee5356ce6427ddcf134090f6f9e5

See more details on using hashes here.

File details

Details for the file krishnautoml-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: krishnautoml-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 7.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for krishnautoml-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d74340bdf7ddfdf5595b949dfe40e0b2d5a85ad3b544fe01fadaa657d2a82f4c
MD5 899d9b8bb59dd78217fe97df7c80e592
BLAKE2b-256 1eec6baebd85b41bc84374ba2a209e63117d877566297aafb6815ffc6140660a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page