Credit Scoring - IESEG School of Management
Project description
Credit Scoring Module Class: 2022 & 2023 |
Overview
-
Odds based Grouping (OBG)
-
OBGEncoder
pred_var: Name of predictor variable. Values can be either continuous or categorical.target_var: Name of binary target variable.
.fit
df: DataFrame containing pred_var and target_var.max_delta: max difference between odds for merging two levels. default: 0.05min_bins: minimum number of bins. default: 3q: number of quantiles when converting continuous variable to categorical. default: 10
.transform
df: Transform pred_var based on fitted bins.impute: Boolean indicating whether to impute missing values. default: Falseimpute_value: Category level to impute missing values with. default: 'Missing' or 'nan'
.fit_transform
df: DataFrame containing pred_var and target_var. Transform pred_var based on fitted bins.
fit_dict: dictionary containing the matched category levels and fitted bins.
lookup: dictionary containing cutoff values for continuous variable (empty if pred_var is categorical).
-
-
Weight of Evidence (WOE)
-
WOEEncoder
pred_var: Name of predictor variable.Values can be either continuous or categorical.target_var: Name of binary target variable.target_value: Value indicating event. default: 1.
.fit
df: DataFrame containing pred_var and target_var.stop_limit: Stops WOE based merging of the predictor's classes/levels in case the resulting information value (IV) decreases more than (e.g. 0.05 = 5%) compared to the preceding binning step. stop_limit=0 will skip any WOE based merging. Increasing the stop_limit will simplify the binning solution and may avoid overfitting. Accepted value range: 0 to 0.5. default: 0.1.q: number of quantiles when converting continuous variable to categorical. default: 10
.transform
df: Transform pred_var based on fitted binsimpute: Boolean indicating whether to impute missing values. default: Falseimpute_value: Category level to impute missing values with. default: 'Missing' or 'nan'
.fit_transform
df: DataFrame containing pred_var and target_var. Transform pred_var based on fitted bins.
.test_limit
df: DataFrame containing pred_var and target_var to test stop limits at 1%, 2.5%, 5% and 10%.
fit_dict: dictionary containing the matched category levels and fitted bins.
lookup: dictionary containing cutoff values for continuous variable (empty if pred_var is categorical).
-
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ieseg_credscore-0.23.12.tar.gz
(12.2 kB
view details)
File details
Details for the file ieseg_credscore-0.23.12.tar.gz.
File metadata
- Download URL: ieseg_credscore-0.23.12.tar.gz
- Upload date:
- Size: 12.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e5267a48aa9837634558d977c6bd130745e2dcd0c8319eae78a4f478ac673a1d
|
|
| MD5 |
3236129cf36818fe20c7010710589faa
|
|
| BLAKE2b-256 |
f45f1aea2549175421123f37a37777ed2d28c5592ae1ce04150f3efeb4bda9e4
|