Skip to main content

Implements several boosting algorithms in Python

Project description

KTBoost

This Python package implements several boosting algorithms with different combinations of base learners, optimization algorithms, and loss functions.

Concerning base learners, this includes:

  • Trees
  • Kernel Ridge regression
  • A combination of the two (i.e., the KTBoost algorithm)

Concerning the optimization step for finding the boosting updates, the package supports:

  • Gradient descent
  • Newton-Rahson method
  • A hybrid version of the two for trees as base learners

The package implements the following loss functions:

  • Continuous data ("regression"): quadratic loss (L2 loss), absolute error (L1 loss), Huber loss, quantile regression loss, Gamma regression loss, negative Gaussian likelihood with both the mean and the standard deviation as functions of features
  • Count data ("regression"): Poisson regression loss
  • (Unorderd) Categorical data ("classification"): logistic regression loss (log loss), exponential loss, cross entropy loss with softmax
  • Mixed continuous-categorical data ("censored regression"): negative Tobit likelihood (i.e., the Grabit model)

The package re-uses code from scikit-learn and its workflow is very similar to that of scikit-learn.

It can be installed using pip install KTBoost and then loaded using import KTBoost.KTBoost as KTBoost. The two main classes are KTBoost.BoostingClassifier and KTBoost.BoostingRegressor.

The following code example defines a model, trains it, and makes predictions.

import KTBoost.KTBoost as KTBoost

######################################
## Define models (several examples) ##
######################################
## Standard tree boosting for regression with quadratic loss and hybrid gradient-Newton updates as in Friedman (2001)
model = KTBoost.BoostingRegressor(loss='ls')
## Grabit model as in Sigrist and Hirnschall (2018)
model = KTBoost.BoostingRegressor(loss='tobit')
## KTBoost algorithm for classification with Newton updates
model = KTBoost.BoostingClassifier(loss='deviance',base_learner='combined',update_step='newton')
## Gradient boosting for classification with trees as base learners
model = KTBoost.BoostingClassifier(loss='deviance',update_step='gradient')
## Newton boosting for classification model with trees as base learners
model = KTBoost.BoostingClassifier(loss='deviance',update_step='newton')
## Hybrid gradient-Newton boosting (Friedman, 2001) for classification with trees as base learners
model = KTBoost.BoostingClassifier(loss='deviance',update_step='hybrid')
## Kernel boosting for regression with quadratic loss
model = KTBoost.BoostingRegressor(loss='ls',base_learner='kernel')


##################
## Train models ##
##################
model.fit(Xtrain,ytrain)

######################
## Make predictions ##
######################
model.predict(Xpred)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

KTBoost-0.0.7.tar.gz (46.6 kB view details)

Uploaded Source

Built Distribution

KTBoost-0.0.7-py2-none-any.whl (51.0 kB view details)

Uploaded Python 2

File details

Details for the file KTBoost-0.0.7.tar.gz.

File metadata

  • Download URL: KTBoost-0.0.7.tar.gz
  • Upload date:
  • Size: 46.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.6.2 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/2.7.14

File hashes

Hashes for KTBoost-0.0.7.tar.gz
Algorithm Hash digest
SHA256 16387df705de671713798cc098c3992a74fb34334767dfd3747493da7420414c
MD5 b624eaed0f5cff8bf239c62cbd373a0a
BLAKE2b-256 fb17da982fa4cc1f75a5262d054b5b39bceaf95d511ed2fec74e982095728b07

See more details on using hashes here.

File details

Details for the file KTBoost-0.0.7-py2-none-any.whl.

File metadata

  • Download URL: KTBoost-0.0.7-py2-none-any.whl
  • Upload date:
  • Size: 51.0 kB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.6.2 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/2.7.14

File hashes

Hashes for KTBoost-0.0.7-py2-none-any.whl
Algorithm Hash digest
SHA256 4a0e7000c270a254eff7591ef816435a93367d10ebddc5240757ff33b8519985
MD5 173270adae20b09f4ba07d3e692d7e83
BLAKE2b-256 1fe4a2d8b19a527334f897215a9f9f5884bec53b269bbb8a62e0ff39ce3f09f0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page