Skip to main content

A python package for ensembling machine learning predictions using hill climbing optimization

Project description

Hill Climbing

Hill climbing is a mathematical optimization algorithm that belongs to the family of local search techniques. It is commonly used to iteratively improve a solution based on a cost or objective function. This package provides a simple implementation of the hill climbing algorithm and is useful for efficiently blending predictions from multiple machine learning models. The goal is to achieve an ensemble score that is better than that of any single model in the ensemble.

How it works

Hill climbing starts with an initial solution, which is the predictions of one of the base models. It then iteratively explores neighboring solutions by adjusting the weights used to blend predictions from other models. If a new combination results in an improved value of the objective function, it becomes the current solution. This process repeats until no further improvement is possible, i.e. when a local optimum has been reached.

Installation

pip install hill-climbing

Example usage

from hill_climbing import Climber
from sklearn.metrics import root_mean_squared_error


# Running hill climbing
climber = Climber(
    objective="minimize",
    eval_metric=root_mean_squared_error
)
climber.fit(X, y)

print(f"Best score: {climber.best_score}")
print(f"Best predictions: {climber.best_oof_preds}")

# Predicting on unseen data
test_preds = climber.predict(X_test)

Example usage with cross-validation:

from hill_climbing import ClimberCV
from sklearn.metrics import root_mean_squared_error
from sklearn.model_selection import KFold


# Running hill climbing with CV
climber_cv = ClimberCV(
    objective="minimize",
    eval_metric=root_mean_squared_error,
    cv=KFold(n_splits=5, shuffle=True, random_state=42)
)
climber_cv.fit(X, y)

print(f"Best score: {climber_cv.best_score}")
print(f"Best predictions: {climber_cv.best_oof_preds}")

# Predicting on unseen data
test_preds = climber_cv.predict(X_test)

Parameters

Parameter Type Default Description
objective str - Either "maximize" or "minimize" the evaluation metric
eval_metric callable - The evaluation metric function to optimize
allow_negative_weights bool False Whether to allow negative weights. Note that allowing negative weights increases computation time, and in some cases may lead to overfitting
precision float 0.01 Controls the step size when trying new weights. Lower values will lead to higher computation times
starting_model str "best" Starting model selection strategy ("best", "random", or one of the column names in X)
score_decimal_places int 3 Number of decimal places for score display
random_state int 42 Random seed for reproducibility
verbose bool True Whether to output information during hill climbing
n_jobs int -1 Number of parallel jobs (-1 means use all available cores)
cv BaseCrossValidator - Cross-validation splitter from scikit-learn. This parameter is only available in ClimberCV

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hill_climbing-0.1.2.tar.gz (12.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hill_climbing-0.1.2-py3-none-any.whl (12.2 kB view details)

Uploaded Python 3

File details

Details for the file hill_climbing-0.1.2.tar.gz.

File metadata

  • Download URL: hill_climbing-0.1.2.tar.gz
  • Upload date:
  • Size: 12.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for hill_climbing-0.1.2.tar.gz
Algorithm Hash digest
SHA256 4f00a113012eaa7354b57dbf445b305ed55d9e9c249f762f5c01d7e07adf3847
MD5 09dd9f38f91cf214e979fd8fb1e93b27
BLAKE2b-256 62cca999fc696de81c71a01d557f7f43099812c527ca61dcc22c15de984b08b2

See more details on using hashes here.

Provenance

The following attestation bundles were made for hill_climbing-0.1.2.tar.gz:

Publisher: publish.yml on ravaghi/hill-climbing

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file hill_climbing-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: hill_climbing-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 12.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for hill_climbing-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 5d2c25f467b635909d852f7f41fc9baf130aad559ff9b5a4c3fa76b427414697
MD5 b6f1a3208b782695701f5125bc359928
BLAKE2b-256 f211a46e1b478efef3253fe0e97318eb3a08ecd597006d15c2c47fffcba7ac09

See more details on using hashes here.

Provenance

The following attestation bundles were made for hill_climbing-0.1.2-py3-none-any.whl:

Publisher: publish.yml on ravaghi/hill-climbing

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page