Skip to main content

Credit Scoring - IESEG School of Management

Project description

drawing
Credit Scoring
Module
Class: 2022 & 2023

Overview

  • Odds based Grouping (OBG)

    • OBGEncoder

      • pred_var: Name of predictor variable. Values can be either continuous or categorical.
      • target_var: Name of binary target variable.

      .fit

      • df: DataFrame containing pred_var and target_var.
      • max_delta: max difference between odds for merging two levels. default: 0.05
      • min_bins: minimum number of bins. default: 3
      • q: number of quantiles when converting continuous variable to categorical. default: 10

      .transform

      • df: Transform pred_var based on fitted bins.
      • impute: Boolean indicating whether to impute missing values. default: False
      • impute_value: Category level to impute missing values with. default: 'Missing' or 'nan'

      .fit_transform

      • df: DataFrame containing pred_var and target_var. Transform pred_var based on fitted bins.

      fit_dict: dictionary containing the matched category levels and fitted bins.

      lookup: dictionary containing cutoff values for continuous variable (empty if pred_var is categorical).

  • Weight of Evidence (WOE)

    • WOEEncoder

      • pred_var: Name of predictor variable.Values can be either continuous or categorical.
      • target_var: Name of binary target variable.
      • target_value: Value indicating event. default: 1.

      .fit

      • df: DataFrame containing pred_var and target_var.
      • stop_limit: Stops WOE based merging of the predictor's classes/levels in case the resulting information value (IV) decreases more than (e.g. 0.05 = 5%) compared to the preceding binning step. stop_limit=0 will skip any WOE based merging. Increasing the stop_limit will simplify the binning solution and may avoid overfitting. Accepted value range: 0 to 0.5. default: 0.1.
      • q: number of quantiles when converting continuous variable to categorical. default: 10

      .transform

      • df: Transform pred_var based on fitted bins
      • impute: Boolean indicating whether to impute missing values. default: False
      • impute_value: Category level to impute missing values with. default: 'Missing' or 'nan'

      .fit_transform

      • df: DataFrame containing pred_var and target_var. Transform pred_var based on fitted bins.

      .test_limit

      • df: DataFrame containing pred_var and target_var to test stop limits at 1%, 2.5%, 5% and 10%.

      fit_dict: dictionary containing the matched category levels and fitted bins.

      lookup: dictionary containing cutoff values for continuous variable (empty if pred_var is categorical).


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ieseg_credscore-0.23.12.tar.gz (12.2 kB view details)

Uploaded Source

File details

Details for the file ieseg_credscore-0.23.12.tar.gz.

File metadata

  • Download URL: ieseg_credscore-0.23.12.tar.gz
  • Upload date:
  • Size: 12.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.3

File hashes

Hashes for ieseg_credscore-0.23.12.tar.gz
Algorithm Hash digest
SHA256 e5267a48aa9837634558d977c6bd130745e2dcd0c8319eae78a4f478ac673a1d
MD5 3236129cf36818fe20c7010710589faa
BLAKE2b-256 f45f1aea2549175421123f37a37777ed2d28c5592ae1ce04150f3efeb4bda9e4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page