Skip to main content

a modeling tool that automatically builds scorecards and tree models.

Project description

##自动构建评分卡

## 思想碰撞

微信 | 微信公众号 |
:—: | :—-: |
<img src=”https://github.com/ZhengRyan/autotreemodel/blob/master/images/%E5%B9%B2%E9%A5%AD%E4%BA%BA.png” alt=”RyanZheng.png” width=”50%” border=0/> | <img src=”https://github.com/ZhengRyan/autotreemodel/blob/master/images/%E9%AD%94%E9%83%BD%E6%95%B0%E6%8D%AE%E5%B9%B2%E9%A5%AD%E4%BA%BA.png” alt=”魔都数据干饭人.png” width=”50%” border=0/> |
干饭人 | 魔都数据干饭人 |

> 仓库地址:https://github.com/ZhengRyan/autobmt > > 微信公众号文章:https://mp.weixin.qq.com/s/30-V0p30p1rKSBtcBgLTRA > > pipy包:https://pypi.org/project/autobmt/ > > 实验数据:链接: https://pan.baidu.com/s/1BRIHH9Wcwy2EZaO5xSgH9w?pwd=tdq5 提取码: tdq5

## 一、环境准备 可以不用单独创建虚拟环境,都是日常常用的python依赖包。需要创建虚拟环境,请参考”五、依赖包安装”

### autobmt 安装 pip install(pip安装)

`bash pip install autobmt # to install pip install -U autobmt # to upgrade `

Source code install(源码安装)

`bash python setup.py install `

## 二、使用教程 1、1行代码自动构建评分卡:请查看autobmt/examples/autobmt_lr_tutorial_code.py。里面有例子

2、1步1步拆解自动构建评分卡的步骤:请查看autobmt/examples/tutorial_code.ipynb。里面有详细步骤拆解例子

## 三、训练、自动选变量、自动单调最优分箱、自动构建模型、自动构建评分卡 1、Step 1: EDA,整体数据探索性数据分析

2、Step 2: 特征粗筛选

3、Step 3: 对粗筛选后的变量调用最优分箱

4、Step 4: 对最优分箱后的变量进行woe转换

5、Step 5: 对woe转换后的变量进行stepwise

6、Step 6: 用逻辑回归构建模型

7、Step 7: 构建评分卡

8、Step 8: 持久化模型,分箱点,woe值,评分卡结构

9、Step 9: 持久化建模中间结果到excel,方便复盘

## 四、保存的建模结果相关文件说明 1、all_data_eda.xlsx:整体数据的EDA情况

2、build_model_log_var_jpg文件夹,最终入模变量的分箱画图,在”build_model_log.xlsx”最后1个sheet也有记录

3、build_model_log.xlsx:构建整个模型的过程日志,记录有利复盘

4、fb.pkl、woetf.pkl、lrmodel.pkl、in_model_var.pkl:fb.pkl分箱文件,woetf.pkl转woe文件,lrmodel.pkl模型文件,入模变量文件

5、scorecard.pkl、scorecard.csv、scorecard.json:评分卡的pkl、csv、json格式。在”build_model_log.xlsx”的”scorecard_structure”sheet也有记录

6、var_bin_woe_format.csv、var_bin_woe_format.json、var_bin_woe.csv、var_bin_woe.json、var_split_point_format.csv、var_split_point_format.json、var_split_point.csv、var_split_point.json:分箱文件和转woe文件的csv、json格式

7、lr_auc_ks_psi.csv:模型的auc、ks、psi

8、lr_pred_to_report_data.csv:构建建模报告的数据

9、lr_test_input.csv:用于模型上线后,将次数据喂入模型,对比和lr_pred_to_report_data.csv结果是否一致。验证模型上线的正确性

## 五、依赖包安装(建议先创建虚拟环境,不创建虚拟环境也行,创建虚拟环境是为了不和其它项目有依赖包的冲突,不创建虚拟环境的话在基础python环境执行pip install即可) ####创建虚拟环境 conda create -y –force -n autobmt python=3.7.2 ####激活虚拟环境 conda activate autobmt

### 依赖包安装方式一,执行如下命令安装依赖的包 pip install -r requirements.txt -i https://mirrors.aliyun.com/pypi/simple/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autobmt-0.2.0.tar.gz (2.7 MB view details)

Uploaded Source

Built Distribution

autobmt-0.2.0-py2.py3-none-any.whl (82.2 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file autobmt-0.2.0.tar.gz.

File metadata

  • Download URL: autobmt-0.2.0.tar.gz
  • Upload date:
  • Size: 2.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.7.2

File hashes

Hashes for autobmt-0.2.0.tar.gz
Algorithm Hash digest
SHA256 b5135b1888350c4222d072927d3df289f67c7e3dc13ae5e9bd42862f1c7a2bc2
MD5 f35bbf04d8e2bc98b06d94fbae1d20fd
BLAKE2b-256 d7adf50d99a49e59062a0d733413a788e1eb9d85782e4d3d4dccac2273f5cea1

See more details on using hashes here.

File details

Details for the file autobmt-0.2.0-py2.py3-none-any.whl.

File metadata

  • Download URL: autobmt-0.2.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 82.2 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.7.2

File hashes

Hashes for autobmt-0.2.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 5657bd3efec90689f5a0d71b3989227757dbb55b4d3764d64cbdb0190749c8ba
MD5 c82852e17cb29a9f6db2e497c73b2b04
BLAKE2b-256 c94f964ca4109b4bc0c0da4ab6d906c215a26dcc02ff9480ab1fd1258af987b9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page