Skip to main content

A python package for ensembling machine learning predictions using hill climbing optimization

Project description

Hill Climbing

Hill climbing is a mathematical optimization algorithm that belongs to the family of local search techniques. It is commonly used to iteratively improve a solution based on a cost or objective function. This package provides a simple implementation of the hill climbing algorithm and is useful for efficiently blending predictions from multiple machine learning models. The goal is to achieve an ensemble score that is better than that of any single model in the ensemble.

How it works

Hill climbing starts with an initial solution, which is the predictions of one of the base models. It then iteratively explores neighboring solutions by adjusting the weights used to blend predictions from other models. If a new combination results in an improved value of the objective function, it becomes the current solution. This process repeats until no further improvement is possible, i.e. when a local optimum has been reached.

Installation

pip install hill-climbing

Example usage

from hill_climbing import Climber
from sklearn.metrics import root_mean_squared_error


# Running hill climbing
climber = Climber(
    objective="minimize",
    eval_metric=root_mean_squared_error
)
climber.fit(X, y)

print(f"Best score: {climber.best_score}")
print(f"Best predictions: {climber.best_oof_preds}")

# Predicting on unseen data
test_preds = climber.predict(X_test)

Example usage with cross-validation:

from hill_climbing import ClimberCV
from sklearn.metrics import root_mean_squared_error
from sklearn.model_selection import KFold


# Running hill climbing with CV
climber_cv = ClimberCV(
    objective="minimize",
    eval_metric=root_mean_squared_error,
    cv=KFold(n_splits=5, shuffle=True, random_state=42)
)
climber_cv.fit(X, y)

print(f"Best score: {climber_cv.best_score}")
print(f"Best predictions: {climber_cv.best_oof_preds}")

# Predicting on unseen data
test_preds = climber_cv.predict(X_test)

Parameters

Parameter Type Default Description
objective str - Either "maximize" or "minimize" the evaluation metric
eval_metric callable - The evaluation metric function to optimize
allow_negative_weights bool False Whether to allow negative weights. Note that allowing negative weights increases computation time, and in some cases may lead to overfitting
precision float 0.001 Controls the step size when trying new weights. Lower values will lead to higher computation times
starting_model str "best" Starting model selection strategy ("best", "random", or one of the column names in X)
score_decimal_places int 3 Number of decimal places for score display
random_state int 42 Random seed for reproducibility
verbose bool True Whether to output information during hill climbing
n_jobs int -1 Number of parallel jobs (-1 means use all available cores)
cv BaseCrossValidator - Cross-validation splitter from scikit-learn. This parameter is only available in ClimberCV

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hill_climbing-0.1.0.tar.gz (13.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hill_climbing-0.1.0-py3-none-any.whl (12.2 kB view details)

Uploaded Python 3

File details

Details for the file hill_climbing-0.1.0.tar.gz.

File metadata

  • Download URL: hill_climbing-0.1.0.tar.gz
  • Upload date:
  • Size: 13.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for hill_climbing-0.1.0.tar.gz
Algorithm Hash digest
SHA256 c38a534c4e64865d5f3bb9c28ffc9d3da2be5e4d5a11254109adadd0d7bd24f7
MD5 7d5f6a184374b2f85307cc0ae98541d7
BLAKE2b-256 cb596005faad1438d3444cd9e8e89ebb50d29da97003437e33d714b238d265d5

See more details on using hashes here.

Provenance

The following attestation bundles were made for hill_climbing-0.1.0.tar.gz:

Publisher: publish.yml on ravaghi/hill-climbing

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file hill_climbing-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: hill_climbing-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 12.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for hill_climbing-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 db81a6d2c62b7226fca49d3d90767c038ec381b5e02c6cd57ba4c4292cd91808
MD5 76d3010e070b8641792b986e936838a2
BLAKE2b-256 24c5c1dba25e4f5debaf5ca0d9b3dc6fe10df0797aa8733270cdfb8e4b31b23a

See more details on using hashes here.

Provenance

The following attestation bundles were made for hill_climbing-0.1.0-py3-none-any.whl:

Publisher: publish.yml on ravaghi/hill-climbing

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page