scikit-uplift·PyPI

Classic approaches of Uplift modelling in scikit-learn style in python

These details have not been verified by PyPI

Project links

Homepage

Project description

scikit-uplift

scikit-uplift (sklift) is an uplift modeling python package that provides fast sklearn-style models implementation, evaluation metrics and visualization tools.

Uplift modeling estimates a causal effect of treatment and uses it to effectively target customers that are most likely to respond to a marketing campaign.

Use cases for uplift modeling:

Target customers in the marketing campaign. Quite useful in promotion of some popular product where there is a big part of customers who make a target action by themself without any influence. By modeling uplift you can find customers who are likely to make the target action (for instance, install an app) only when treated (for instance, received a push).
Combine a churn model and an uplift model to offer some bonus to a group of customers who are likely to churn.
Select a tiny group of customers in the campaign where a price per customer is high.

Read more about uplift modeling problem in User Guide.

Articles in russian on habr.com: Part 1 , Part 2 and Part 3.

Why sklift

Сomfortable and intuitive scikit-learn-like API;
More uplift metrics than you have ever seen in one place! Include brilliants like Area Under Uplift Curve (AUUC) or Area Under Qini Curve (Qini coefficient) with ideal cases;
Supporting any estimator compatible with scikit-learn (e.g. Xgboost, LightGBM, Catboost, etc.);
All approaches can be used in the sklearn.pipeline. See the example of usage on the Tutorials page;
Also metrics are compatible with the classes from sklearn.model_selection. See the example of usage on the Tutorials page;
Almost all implemented approaches solve classification and regression problems;
Nice and useful viz for analysing a performance model.

Installation

Install the package by the following command from PyPI:

pip install scikit-uplift

Or install from source:

git clone https://github.com/maks-sh/scikit-uplift.git
cd scikit-uplift
python setup.py install

Documentation

The full documentation is available at uplift-modeling.com.

Or you can build the documentation locally using Sphinx 1.4 or later:

cd docs
pip install -r requirements.txt
make html

And if you now point your browser to _build/html/index.html, you should see a documentation site.

Quick Start

See the RetailHero tutorial notebook (EN , RU ) for details.

Train and predict uplift model

Use the intuitive python API to train uplift models with sklift.models.

# import approaches
from sklift.models import SoloModel, ClassTransformation
# import any estimator adheres to scikit-learn conventions.
from lightgbm import LGBMClassifier

# define models
estimator = LGBMClassifier(n_estimators=10)

# define metamodel
slearner = SoloModel(estimator=estimator)

# fit model
slearner.fit(
    X=X_tr,
    y=y_tr,
    treatment=trmnt_tr,
)

# predict uplift
uplift_slearner = slearner.predict(X_val)

Evaluate your uplift model

Uplift model evaluation metrics are available in sklift.metrics.

# import metrics to evaluate your model
from sklift.metrics import (
    uplift_at_k, uplift_auc_score, qini_auc_score, weighted_average_uplift
)


# Uplift@30%
uplift_at_k = uplift_at_k(y_true=y_val, uplift=uplift_slearner,
                          treatment=trmnt_val,
                          strategy='overall', k=0.3)

# Area Under Qini Curve
qini_coef = qini_auc_score(y_true=y_val, uplift=uplift_slearner,
                           treatment=trmnt_val)

# Area Under Uplift Curve
uplift_auc = uplift_auc_score(y_true=y_val, uplift=uplift_slearner,
                              treatment=trmnt_val)

# Weighted average uplift
wau = weighted_average_uplift(y_true=y_val, uplift=uplift_slearner,
                              treatment=trmnt_val)

Vizualize the results

Visualize performance metrics with sklift.viz.

from sklift.viz import plot_qini_curve
import matplotlib.pyplot as plt

fig, ax = plt.subplots(1, 1)
ax.set_title('Qini curves')

plot_qini_curve(
    y_test, uplift_slearner, trmnt_test,
    perfect=True, name='Slearner', ax=ax
);

plot_qini_curve(
    y_test, uplift_revert, trmnt_test,
    perfect=False, name='Revert label', ax=ax
);

Example of some models qini curves, perfect qini curve and random qini curve

Development

We welcome new contributors of all experience levels.

Please see our Contributing Guide for more details.
By participating in this project, you agree to abide by its Code of Conduct.

Thanks to all our contributors!

If you have any questions, please contact us at team@uplift-modeling.com

Important links

Official source code repo: https://github.com/maks-sh/scikit-uplift/
Issue tracker: https://github.com/maks-sh/scikit-uplift/issues
Documentation: https://www.uplift-modeling.com/en/latest/
User Guide: https://www.uplift-modeling.com/en/latest/user_guide/index.html
Contributing guide: https://www.uplift-modeling.com/en/latest/contributing.html
Release History: https://www.uplift-modeling.com/en/latest/changelog.html

Papers and materials

Gutierrez, P., & Gérardy, J. Y.

Causal Inference and Uplift Modelling: A Review of the Literature. In International Conference on Predictive Applications and APIs (pp. 1-13).
Artem Betlei, Criteo Research; Eustache Diemert, Criteo Research; Massih-Reza Amini, Univ. Grenoble Alpes

Dependent and Shared Data Representations improve Uplift Prediction in Imbalanced Treatment Conditions FAIM’18 Workshop on CausalML.
Eustache Diemert, Artem Betlei, Christophe Renaudin, and Massih-Reza Amini. 2018.

A Large Scale Benchmark for Uplift Modeling. In Proceedings of AdKDD & TargetAd (ADKDD’18). ACM, New York, NY, USA, 6 pages.
Athey, Susan, and Imbens, Guido. 2015.

Machine learning methods for estimating heterogeneous causal effects. Preprint, arXiv:1504.01132. Google Scholar.
Oscar Mesalles Naranjo. 2012.

Testing a New Metric for Uplift Models. Dissertation Presented for the Degree of MSc in Statistics and Operational Research.
Kane, K., V. S. Y. Lo, and J. Zheng. 2014.

Mining for the Truly Responsive Customers and Prospects Using True-Lift Modeling: Comparison of New and Existing Methods. Journal of Marketing Analytics 2 (4): 218–238.
Maciej Jaskowski and Szymon Jaroszewicz.

Uplift modeling for clinical trial data. ICML Workshop on Clinical Data Analysis, 2012.
Lo, Victor. 2002.

The True Lift Model - A Novel Data Mining Approach to Response Modeling in Database Marketing. SIGKDD Explorations. 4. 78-86.
Zhao, Yan & Fang, Xiao & Simchi-Levi, David. 2017.

Uplift Modeling with Multiple Treatments and General Response Types. 10.1137/1.9781611974973.66.
Nicholas J Radcliffe. 2007.

Using control groups to target on predicted lift: Building and assessing uplift model. Direct Marketing Analytics Journal, (3):14–21, 2007.
Devriendt, F., Guns, T., & Verbeke, W. 2020.

Learning to rank for uplift modeling. ArXiv, abs/2002.05897.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.5.1

Aug 11, 2022

0.5.0

Aug 9, 2022

0.4.1

Jun 17, 2022

0.4.0

Aug 16, 2021

0.3.2

Apr 30, 2021

0.3.1

Feb 19, 2021

0.3.0

Feb 7, 2021

0.2.0

May 31, 2020

0.1.2

May 2, 2020

0.1.1

Apr 28, 2020

0.1.0

Apr 16, 2020

0.0.3

Jan 11, 2020

0.0.2

Dec 31, 2019

0.0.1

Dec 26, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scikit-uplift-0.5.1.tar.gz (36.4 kB view details)

Uploaded Aug 11, 2022 Source

Built Distribution

scikit_uplift-0.5.1-py3-none-any.whl (42.1 kB view details)

Uploaded Aug 11, 2022 Python 3

File details

Details for the file scikit-uplift-0.5.1.tar.gz.

File metadata

Download URL: scikit-uplift-0.5.1.tar.gz
Upload date: Aug 11, 2022
Size: 36.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.10.6

File hashes

Hashes for scikit-uplift-0.5.1.tar.gz
Algorithm	Hash digest
SHA256	`a182eb731fdbacce8997142d0f30bd539864db8e5236ec2e324ea1ac2bede13f`
MD5	`d31a6304acb7557dc4b7b46105c3eb74`
BLAKE2b-256	`99394fce95a26b4187cb3511cb39dd26b2a0c2d22c645f6e4e7d876bfde745b8`

See more details on using hashes here.

File details

Details for the file scikit_uplift-0.5.1-py3-none-any.whl.

File metadata

Download URL: scikit_uplift-0.5.1-py3-none-any.whl
Upload date: Aug 11, 2022
Size: 42.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.10.6

File hashes

Hashes for scikit_uplift-0.5.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d25c673c33336407811cded3256b12a76797d21a2f49b163a5041103db6c1282`
MD5	`f532188926fb0326365b51774453ba92`
BLAKE2b-256	`c1e164690394f9798935f73be859404cf46b3d63a17d8e3c25793a5b390cb2ae`

See more details on using hashes here.

scikit-uplift 0.5.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

scikit-uplift

Why sklift

Installation

Documentation

Quick Start

Development

Important links

Papers and materials

Tags

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes