A collection of utility functions designed to simplify training machine learning models for Kaggle competitions.

Kaggle Toolbox

Koolbox is a collection of helper functions and utilities that simplify training machine learning models for Kaggle competitions. It abstracts away repetitive boilerplate such as cross-validated training, feature selection, and ensembling, so competitors can spend their time on the modeling itself.

Installation

pip install koolbox

Usage

Trainer

import pandas as pd
from sklearn.model_selection import KFold
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score

from koolbox import Trainer


X = pd.DataFrame(...)
y = pd.Series(...)

trainer = Trainer(
    estimator=RandomForestClassifier(random_state=42),
    cv=KFold(n_splits=5, shuffle=True, random_state=42),
    metric=roc_auc_score,
    task="binary",
    verbose=True
)

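# Run cross-validated training on the training set
trainer.fit(X, y)

# Predict on unseen data with the fitted trainer
X_test = pd.DataFrame(...)
preds = trainer.predict(X_test)

# Results recorded during fit: out-of-fold predictions and CV scores
oof_preds = trainer.oof_preds
overall_score = trainer.overall_score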
fold_scores = trainer.fold_scores
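
For context, the kind of boilerplate that a cross-validation trainer replaces can be sketched in plain scikit-learn. This is not Koolbox's implementation; it is a minimal illustration under assumptions: the data is synthetic, the variable names only mirror the attributes above, and averaging the fold models for test predictions is a common convention rather than something the README states.

```python
import numpy as np
import pandas as pd
from sklearn.base import clone
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import KFold

# Synthetic stand-in data (assumption: the README leaves X and y elided).
X_arr, y_arr = make_classification(n_samples=300, n_features=8, random_state=42)
X = pd.DataFrame(X_arr, columns=[f"f{i}" for i in range(X_arr.shape[1])])
y = pd.Series(y_arr)

cv = KFold(n_splits=5, shuffle=True, random_state=42)
estimator = RandomForestClassifier(n_estimators=50, random_state=42)

oof_preds = np.zeros(len(X))  # one out-of-fold prediction per training row
fold_scores = []
fitted_models = []

for train_idx, valid_idx in cv.split(X, y):
    # Fit a fresh copy of the estimator on this fold's training rows
    model = clone(estimator)
    model.fit(X.iloc[train_idx], y.iloc[train_idx])

    # Store positive-class probabilities for the held-out rows
    oof_preds[valid_idx] = model.predict_proba(X.iloc[valid_idx])[:, 1]
    fold_scores.append(roc_auc_score(y.iloc[valid_idx], oof_preds[valid_idx]))
    fitted_models.append(model)

# Score computed over all out-of-fold predictions at once
overall_score = roc_auc_score(y, oof_preds)

# Hypothetical test set: average the fold models' probabilities
X_test = X.sample(20, random_state=0)
test_preds = np.mean([m.predict_proba(X_test)[:, 1] for m in fitted_models], axis=0)
```

Every row gets exactly one out-of-fold prediction, which is why the overall score can be computed on the full training set without leakage from the model that saw that row.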

SequentialFeatureSelector

from sklearn.linear_model import Ridge
from sklearn.model_selection import KFold
from sklearn.metrics import root_mean_squared_error
import pandas as pd

from koolbox import SequentialFeatureSelector


X = pd.DataFrame(...)
y = pd.Series(...)
X_test = pd.DataFrame(...)

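sfs = SequentialFeatureSelector(
    Ridge(),
    cv=KFold(n_splits=5, random_state=42, shuffle=True),
    objective="minimize",  # RMSE is an error metric, so lower is better
    direction="backward",
    metric=root_mean_squared_error
)

# Fit the selector on the training data, then apply the same selection to the test set
X = sfs.fit_transform(X, y)
X_test = sfs.transform(X_test)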

selected_features = sfs.selected_features
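
The general idea behind backward sequential selection can be sketched with plain scikit-learn. The loop below is an illustration of greedy backward elimination, not Koolbox's code: the synthetic data, the `cv_rmse` helper, and the stopping rule (stop when no single removal improves the score) are all assumptions made for the example.

```python
import pandas as pd
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.model_selection import KFold, cross_val_score

# Synthetic stand-in data (assumption: the README leaves X and y elided).
X_arr, y_arr = make_regression(
    n_samples=200, n_features=6, n_informative=3, noise=5.0, random_state=42
)
X = pd.DataFrame(X_arr, columns=[f"f{i}" for i in range(X_arr.shape[1])])
y = pd.Series(y_arr)
cv = KFold(n_splits=5, shuffle=True, random_state=42)


def cv_rmse(features):
    """Hypothetical helper: mean cross-validated RMSE for a feature subset."""
    scores = cross_val_score(
        Ridge(), X[features], y, cv=cv, scoring="neg_root_mean_squared_error"
    )
    return -scores.mean()  # scikit-learn reports the negated error


selected = list(X.columns)  # backward direction: start from every feature
best_score = cv_rmse(selected)

while len(selected) > 1:
    # Score each subset obtained by dropping exactly one remaining feature
    candidates = {f: cv_rmse([c for c in selected if c != f]) for f in selected}
    drop, score = min(candidates.items(), key=lambda kv: kv[1])
    if score >= best_score:
        break  # no single removal lowers the error, so stop
    selected.remove(drop)
    best_score = score
```

With an objective of "maximize" the comparison would flip; the `min` and `>=` above correspond to minimizing an error metric such as RMSE.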

WeightedEnsembleRegressor and WeightedEnsembleClassifier

from sklearn.metrics import root_mean_squared_error
import pandas as pd

from koolbox import WeightedEnsembleRegressor


X = pd.DataFrame(...)
y = pd.Series(...)
X_test = pd.DataFrame(...)

model = WeightedEnsembleRegressor(
    objective="minimize",
    metric=root_mean_squared_error
)

model.fit(X, y)
preds = model.predict(X_test)
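
To illustrate what a weighted ensemble optimizes, here is a minimal sketch that searches for blending weights minimizing RMSE. It is not Koolbox's implementation, and the README does not say how the weights are found or what the columns of `X` must contain. The example assumes each column holds one base model's predictions, uses synthetic data, and uses SciPy's SLSQP solver with weights constrained to be non-negative and sum to one.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(42)
y = rng.normal(size=500)

# Three hypothetical base models: the target plus noise of increasing size
preds = np.column_stack(
    [y + rng.normal(scale=s, size=500) for s in (0.3, 0.6, 1.0)]
)


def rmse(weights):
    """RMSE of the weighted blend of the base-model predictions."""
    blended = preds @ weights
    return float(np.sqrt(np.mean((y - blended) ** 2)))


n_models = preds.shape[1]
equal_weights = np.full(n_models, 1.0 / n_models)

result = minimize(
    rmse,
    x0=equal_weights,                 # start from a simple average
    method="SLSQP",
    bounds=[(0.0, 1.0)] * n_models,   # each weight stays in [0, 1]
    constraints={"type": "eq", "fun": lambda w: w.sum() - 1.0},  # weights sum to 1
)

weights = result.x
blend_rmse = rmse(weights)
```

In a competition setting the weights would normally be fitted on out-of-fold predictions rather than in-sample ones, so that the blend is not tuned to overfit predictions.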

Download files

Source Distribution

koolbox-0.1.3.tar.gz (15.2 kB view details)

Uploaded Source

Built Distribution

koolbox-0.1.3-py3-none-any.whl (18.5 kB view details)

Uploaded Python 3

File details

Details for the file koolbox-0.1.3.tar.gz.

File metadata

  • Download URL: koolbox-0.1.3.tar.gz
  • Upload date:
  • Size: 15.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for koolbox-0.1.3.tar.gz
Algorithm Hash digest
SHA256 562188495fa395800bcea7ca849aec9a16aad482f3673027067aa9ccf225aeea
MD5 e870194e320a4a41a7eb2ca22904e585
BLAKE2b-256 6e952b1f1787cda2ea4e1cab81f05543d4b14a693eb77ce25daa9ed8a16cb666
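
A downloaded archive can be checked against the SHA256 digest listed above using only the Python standard library. The sketch below is one way to do it: `sha256_of` and `matches` are hypothetical helper names, and the self-check runs on a throwaway file because the real archive is not available here.

```python
import hashlib
import tempfile
from pathlib import Path

# SHA256 digest published above for koolbox-0.1.3.tar.gz
EXPECTED_SHA256 = "562188495fa395800bcea7ca849aec9a16aad482f3673027067aa9ccf225aeea"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected):
    """True when the file's digest equals the expected hex string."""
    return sha256_of(path) == expected.lower()


# Self-check on a temporary file with known content
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "sample.bin"
    sample.write_bytes(b"koolbox")
    sample_digest = sha256_of(sample)
    sample_ok = matches(sample, hashlib.sha256(b"koolbox").hexdigest())
```

For the real file, the call would be `matches("koolbox-0.1.3.tar.gz", EXPECTED_SHA256)`, assuming the archive sits in the current directory.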

Provenance

The following attestation bundles were made for koolbox-0.1.3.tar.gz:

Publisher: publish.yml on ravaghi/koolbox

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file koolbox-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: koolbox-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 18.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for koolbox-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 f654008252fe17463f27548b6a327926aa5451fdba52fe60fbb7639f5afc4bdc
MD5 97e0a83f53445aa6a8da7d5c6326a2b3
BLAKE2b-256 00201f14f43ae9f2945425cf388e72950f0ac2c1305152aacd9307e5f5bb3776

Provenance

The following attestation bundles were made for koolbox-0.1.3-py3-none-any.whl:

Publisher: publish.yml on ravaghi/koolbox
