Skip to main content

constrained optimization for gradient boosting models with non-decomposable constraints

Project description

Python Documentation Github MIT License LinkedIn


Constrained Gradient Boosting

Constrained Optimization of Gradient Boosting models which is written on top of Sklearn gradient boosting.
Explore the docs »

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Roadmap
  5. Contributing
  6. License
  7. Contact
  8. Acknowledgements

About The Project

This is the companion code for the Master Thesis entitled "". The master thesis is done in Bosch Center of AI research, and licenced by GNU AFFERO GENERAL PUBLIC LICENSE. The code allows the users to apply constrained for one type of error, such as false negative rate to do safe classification using gradient boosting. Besides, one can reproduce the results in the paper as it is provided in the examples.

This library enables user to define their own constraints and apply them on for the gradient boosting. To see how to do this visit here.

Built With

This project language is Python and it is built on top of scikit-learn gradient boosting. For hyper-parameter optimization GPyOpt bayesian optimization is used.

Getting Started

This is an example of how you may give instructions on setting up your project locally. To get a local copy up and running follow these simple example steps.

Prerequisites

To use the constrained_gb library, you need to have scikit-learn>=0.22.0 installed, which is probably installed if you are using Machine Learning algorithm in Python. To do hyper-parameter optimization using .optimize(), you need to have GPyOpt installed. To install GPyOpt simply run

pip install gpyopt

If you have problem with GPyOpt installation visit here.

Installation

  1. Clone the repo
    git clone https://github.com/maryami66/constrained_gb.git
    
  2. Install
    pip install constrained_gb
    

Usage

In this example, we are looking for a classifier to Use this space to show useful examples of how a project can be used. Additional screenshots, code examples and demos work well in this space. You may also link to more resources.

import constrained_gb as gbmco
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.metrics import *

X, y = load_breast_cancer(return_X_y=True)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.5, random_state=2)
 
 
constraints = [gbmco.FalseNegativeRate(0.001)]
 
parms = {'constraints': constraints,
         'multiplier_stepsize': 0.01,
         'learning_rate': 0.1,
         'min_samples_split': 99,
         'min_samples_leaf': 19,
         'max_depth': 8,
         'max_leaf_nodes': None,
         'min_weight_fraction_leaf': 0.0,
         'n_estimators': 300,
         'max_features': 'sqrt',
         'subsample': 0.7,
         'random_state': 2
         }
 
clf = gbmco.ConstrainedClassifier(**parms)
clf.fit(X_train, y_train)
 
test_predictions = clf.predict(X_test)
 
print("Test F1 Measure: {} \n".format(f1_score(y_test, test_predictions)))
print("Test FNR: {} \n".format(1-recall_score(y_test, test_predictions)))

License

Distributed under the GNU AFFERO GENERAL PUBLIC LICENSE License v3 or later (GPLv3+). See LICENSE for more information.

Contact

Maryam Bahrami - maryami_66@yahoo.com

Project Link: https://github.com/maryami66/constrained_gb

Acknowledgements

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

constrained_gb-0.1.2.tar.gz (16.6 kB view hashes)

Uploaded Source

Built Distribution

constrained_gb-0.1.2-py3-none-any.whl (16.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page