Skip to main content

Tools for building scorecard models in python, with a sklearn-compatible API

Project description

skorecard

pytest PyPI - Python Version PyPI PyPI - License GitHub contributors PyPI - Downloads Downloads Code style: black pre-commit

skorecard is a scikit-learn compatible python package that helps streamline the development of credit risk acceptance models (scorecards).

Scorecards are ‘traditional’ models used by banks in the credit decision process. Internally, scorecards are Logistic Regression models that make use of features that are binned into different groups. The process of binning is usually done manually by experts, and skorecard provides tools to makes this process easier. skorecard is built on top of scikit-learn as well as other excellent open source projects like optbinning, dash and plotly.

:point_right: Read the blogpost introducing skorecard

Features ⭐

  • Automate bucketing of features inside scikit-learn pipelines.
  • Dash webapp to help manually tweak bucketing of features with business knowledge
  • Extension to sklearn.linear_model.LogisticRegression that is also able to report p-values
  • Plots and reports to speed up analysis and writing technical documentation.

Quick demo

skorecard offers a range of bucketers:

import pandas as pd
from skorecard.bucketers import EqualWidthBucketer

df = pd.DataFrame({'column' : range(100)})

ewb = EqualWidthBucketer(n_bins=5)
ewb.fit_transform(df)

ewb.bucket_table('column')
#>    bucket                       label  Count  Count (%)
#> 0      -1                     Missing    0.0        0.0
#> 1       0                (-inf, 19.8]   20.0       20.0
#> 2       1                (19.8, 39.6]   20.0       20.0
#> 3       2  (39.6, 59.400000000000006]   20.0       20.0
#> 4       3  (59.400000000000006, 79.2]   20.0       20.0
#> 5       4                 (79.2, inf]   20.0       20.0

That also support a dash app to explore and update bucket boundaries:

ewb.fit_interactive(df)
#> Dash app running on http://127.0.0.1:8050/

Installation

pip3 install skorecard

Documentation

See ing-bank.github.io/skorecard/.

Presentations

Title Host Date Speaker(s)
Skorecard: Making logistic regressions great again ING Data Science Meetup 10 June 2021 Daniel Timbrell, Sandro Bjelogrlic, Tim Vink

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

skorecard-1.6.9.tar.gz (146.1 kB view details)

Uploaded Source

Built Distribution

skorecard-1.6.9-py3-none-any.whl (128.7 kB view details)

Uploaded Python 3

File details

Details for the file skorecard-1.6.9.tar.gz.

File metadata

  • Download URL: skorecard-1.6.9.tar.gz
  • Upload date:
  • Size: 146.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for skorecard-1.6.9.tar.gz
Algorithm Hash digest
SHA256 7633edfa4d0f421a96a719dc6614543bdce0fa58b14d3cb9df24a0c07bdc4f4e
MD5 735b57bed663c1a9ced7d44ecfc84772
BLAKE2b-256 717756e6784e5dedbc883dd5c730c269feae9e2f8a89e94d1fb43bc5e56f4f43

See more details on using hashes here.

File details

Details for the file skorecard-1.6.9-py3-none-any.whl.

File metadata

  • Download URL: skorecard-1.6.9-py3-none-any.whl
  • Upload date:
  • Size: 128.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for skorecard-1.6.9-py3-none-any.whl
Algorithm Hash digest
SHA256 a620d9a707b7a7c7fa8331d168e5bc30acd4b599769b2ecb4b91ee387ffe9a45
MD5 732e1ff03c89f2a4e86d6e60e79344c1
BLAKE2b-256 8ffdb5c8ddf9cd2f7f3969019ed46d6d6c80f538c68ca2ef74cc38497f036153

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page