A lightweight Python package for Machine Learning utilities
Project description
Package: batabyal
batabyal is a lightweight Python package for Machine Learning utilities that provides:
- cleaning_module - A CSV data cleaning module
- trainer_kit - ML module for classification problems
Installation
Run the following command in a terminal:
pip install batabyal
Importation
Import either a whole module or only the names you need:
from batabyal import cleaning_module as cm
from batabyal.trainer_kit import TransformedTargetClassifier, autofit_classification_model
Usage
1. cleaning_module: provides a single function, clean_csv, for cleaning .csv datasets.
cm.clean_csv('filename.csv', numericData, charData, True, True)
# structure: clean_csv(file, numericData, charData, fill, case_sensitivity=False, dummies=None) -> pd.DataFrame
# If `fill` is True, NaN values in each numeric column are filled with that column's mean.
# If `case_sensitivity` is True, all labelled values are lowercased.
# `dummies` is a list of values to replace with NaN before cleaning.
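To make the three options above concrete, here is a minimal, dependency-free sketch of the behaviour they describe. It is not the package's implementation: the helper name `clean_rows`, the `lowercase` flag (standing in for `case_sensitivity`), the treatment of `numericData` and `charData` as lists of column names, and the order of the steps are all assumptions made for illustration.

```python
import csv
import io
from statistics import mean


def clean_rows(rows, numeric_cols, char_cols, fill, lowercase=False, dummies=None):
    """Hypothetical helper mirroring the documented clean_csv options on a list of dicts."""
    dummies = set(dummies or [])
    cleaned = []
    for row in rows:
        new_row = {}
        for col, value in row.items():
            value = value.strip()
            if value == "" or value in dummies:
                # Placeholder values and empty cells become missing values.
                new_row[col] = None
            elif col in numeric_cols:
                new_row[col] = float(value)
            elif col in char_cols and lowercase:
                new_row[col] = value.lower()
            else:
                new_row[col] = value
        cleaned.append(new_row)

    if fill:
        # Fill missing numeric cells with the mean of the values that are present.
        for col in numeric_cols:
            present = [r[col] for r in cleaned if r[col] is not None]
            if present:
                col_mean = mean(present)
                for r in cleaned:
                    if r[col] is None:
                        r[col] = col_mean
    return cleaned


raw = "age,city\n30,Delhi\n?,MUMBAI\n50,delhi\n"
rows = list(csv.DictReader(io.StringIO(raw)))
result = clean_rows(rows, numeric_cols=["age"], char_cols=["city"],
                    fill=True, lowercase=True, dummies=["?"])
```

In this toy input the `?` placeholder is first treated as missing and then replaced by the mean of the remaining ages, while the city names are normalised to lowercase.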
2. trainer_kit: provides a wrapper class, TransformedTargetClassifier, and a function, autofit_classification_model. The wrapper encodes the target labels before training and inversely transforms predictions back to the original labels. The function automatically fits a classification model, choosing the best algorithm and hyperparameters according to the roc_auc_ovr_weighted score.
model = TransformedTargetClassifier(classifier=svc, transformer=labelEncoder)
# Here, labelEncoder and svc are assumed to come from sklearn.
# You can now call model.fit() and model.predict() with raw label data; the encoding is handled internally for both training and prediction.
# model.predict() returns the original labels by inversely transforming the encoded numbers internally.
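The wrapper pattern described above can be sketched in a few lines. This is an illustration only, not batabyal's code: `LabelRoundTripClassifier`, `ToyEncoder`, and `NearestNeighbour1D` are hypothetical stand-ins written so the example runs without any dependencies. The sketch relies only on the `fit_transform`/`inverse_transform` and `fit`/`predict` method names, which sklearn's `LabelEncoder` and `SVC` also expose.

```python
class LabelRoundTripClassifier:
    """Hypothetical stand-in for the wrapper pattern (not the package's class)."""

    def __init__(self, classifier, transformer):
        self.classifier = classifier
        self.transformer = transformer

    def fit(self, X, y):
        # Encode raw labels before the inner classifier ever sees them.
        encoded = self.transformer.fit_transform(y)
        self.classifier.fit(X, encoded)
        return self

    def predict(self, X):
        # Decode the classifier's numeric output back to the original labels.
        encoded = self.classifier.predict(X)
        return self.transformer.inverse_transform(encoded)


class ToyEncoder:
    """Toy label encoder: maps sorted unique labels to 0, 1, 2, ..."""

    def fit_transform(self, y):
        self.classes_ = sorted(set(y))
        index = {label: i for i, label in enumerate(self.classes_)}
        return [index[label] for label in y]

    def inverse_transform(self, codes):
        return [self.classes_[code] for code in codes]


class NearestNeighbour1D:
    """Toy 1-nearest-neighbour classifier on single-number features."""

    def fit(self, X, y):
        self.points_ = list(zip(X, y))
        return self

    def predict(self, X):
        return [min(self.points_, key=lambda p: abs(p[0] - x))[1] for x in X]


model = LabelRoundTripClassifier(NearestNeighbour1D(), ToyEncoder())
model.fit([1.0, 1.2, 5.0, 5.3], ["cat", "cat", "dog", "dog"])
predictions = model.predict([0.9, 5.1])
```

The inner classifier only ever handles integer codes, while the caller passes and receives the raw string labels.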
model, accuracy, algorithm_name = autofit_classification_model(x, y, "numeric", 3)
# structure:
# autofit_classification_model(
#     x: pd.DataFrame,
#     y: pd.DataFrame,
#     x_type: Literal["numeric", "categorical", "mixed"],
#     n_splits: int,
#     cat_features: list[str] = [],
#     whitelisted_algorithms: list[Literal["LogisticRegression", "DecisionTree", "RandomForest", "GaussianNB", "BernoulliNB", "CategoricalNB", "CatBoost", "XGBoost", "Ripper", "SVC", "KNN"]] | Literal["auto"] = "auto",
#     enable_votingClassifier: bool = True,
#     random_state: int | None = 42,
#     verbosity: bool = True,
# ) -> tuple[ClassifierMixin, float, str]
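For readers unfamiliar with the selection metric, the sketch below works through what a score named roc_auc_ovr_weighted measures, following the usual definition behind scikit-learn's scoring string of the same name: a one-vs-rest ROC AUC per class, averaged with weights proportional to each class's number of true samples. How batabyal computes the score internally is not stated here, so treat this as an assumption; the function names and the toy data are hypothetical.

```python
from collections import Counter


def binary_auc(is_positive, scores):
    """Chance that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for flag, s in zip(is_positive, scores) if flag]
    neg = [s for flag, s in zip(is_positive, scores) if not flag]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def roc_auc_ovr_weighted(y_true, proba, classes):
    """Support-weighted mean of one-vs-rest AUCs.

    proba[i][k] is the predicted probability that sample i belongs to classes[k].
    """
    support = Counter(y_true)
    total = 0.0
    for k, label in enumerate(classes):
        is_positive = [y == label for y in y_true]
        class_scores = [row[k] for row in proba]
        total += support[label] * binary_auc(is_positive, class_scores)
    return total / len(y_true)


y_true = ["a", "a", "b", "c"]
proba = [
    [0.7, 0.2, 0.1],
    [0.4, 0.5, 0.1],
    [0.5, 0.4, 0.1],
    [0.2, 0.2, 0.6],
]
score = roc_auc_ovr_weighted(y_true, proba, classes=["a", "b", "c"])
```

On this toy data the per-class AUCs are 3/4 for "a", 2/3 for "b", and 1 for "c"; weighting by the class counts (2, 1, 1) gives 19/24, roughly 0.79.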
Project details
File details
Details for the file batabyal-1.0.2.tar.gz.
File metadata
- Download URL: batabyal-1.0.2.tar.gz
- Upload date:
- Size: 7.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | b37bdef5bac9cf626efe275923c609a7f778f5a2fa59f10a1a7983af571a2e8e |
| MD5 | b6ae3353e34db5c1868ba5a824d181c6 |
| BLAKE2b-256 | 2cbaf64b8faac627681a3ec197381d0fa4d07c20670d517aae241a220eedbccf |
File details
Details for the file batabyal-1.0.2-py3-none-any.whl.
File metadata
- Download URL: batabyal-1.0.2-py3-none-any.whl
- Upload date:
- Size: 7.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 797bed7ae0ec9753bb02347805a08acae6e1455fb99aa5688c484f96fcccaed5 |
| MD5 | 29a27bb4bc9cac2e458b65d2b7012ae3 |
| BLAKE2b-256 | 30f4102ade206089296b38f9b5bdb801612b6bfcc8f0d76a299557b280182c82 |