Skip to main content

随机递归特征消除。在使用递归特征消除的过程中,加入随机因子,根据概率随机删除特征,更大的重要性具有更小的概率被删除。

Project description

随机递归特征消除

1、概述

为了提高模型性能,我们通常会给数据进行降维,一种有效的方法就是删掉一些无用的或者无效的特征,以达到8。递归特征消除就是这样方法之一,即通过不断训练模型,每次训练都将重要性最小的特征删除,直到满足停止条件。

这是一种贪心策略,不断删除最不重要的特征并不能保证一定能够获得最优子集,一种有效的方式是在删除特征的时候加入一些随机因素。

随机递归特征消除在递归特征消除的过程中加入了随机因子,随机因子用于控制当前特征删除是执行随机删除还是将最不重要的特征删除,当执行随机特征删除时,将随机删除一个特征,每个特征被删除的概率不同,越重要的特征具有越小的被删除概率,其中特征重要性有模型的coef_属性或feature_importances_属性决定,其值做了取绝对值处理以防止权重为负,导致负概率的发生。

2、用法

from feature_selectors.RFE import RandomRFE
import pandas as pd
from sklearn.model_selection import train_test_split

df = pd.read_csv('your_datasets_path.txt')
X = df.drop(columns=['label'])
y = df['label']
X_train,X_test,y_train,y_test = train_test_split(X,y)

# 构建一个选择器实例,其他参数请参考请help一下,代码有注释。 
fs = RandomRFE(percent_of_random=0.1)
selected_feature = fs.auto_select(X_train,y_train)

3、后续

其他特征选择方法,后续陆续加入,争取做个特性选择器,欢迎大神指正,并完善特征选择器,方便你我他!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

random_RFE-0.1.0.tar.gz (4.2 kB view details)

Uploaded Source

Built Distribution

random_RFE-0.1.0-py2.py3-none-any.whl (4.6 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file random_RFE-0.1.0.tar.gz.

File metadata

  • Download URL: random_RFE-0.1.0.tar.gz
  • Upload date:
  • Size: 4.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.0.0.post20200309 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.6

File hashes

Hashes for random_RFE-0.1.0.tar.gz
Algorithm Hash digest
SHA256 c3d908a0b5f8d65343d86d35a85461a6845a53735d90bd0243591035175438e3
MD5 9988408369d876381c864ea706da1b66
BLAKE2b-256 ea46a856ddba5c71585c7bde136c9a6bfa9953eeeda655caf1eae0d339ce436a

See more details on using hashes here.

File details

Details for the file random_RFE-0.1.0-py2.py3-none-any.whl.

File metadata

  • Download URL: random_RFE-0.1.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 4.6 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.0.0.post20200309 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.6

File hashes

Hashes for random_RFE-0.1.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 44a61cf36a9e5d4c168705a1db03dd6b20b0171a1fa8e6b2abaca88741996b07
MD5 61d46019b9cdc3ff9931b5517dbcc814
BLAKE2b-256 e51840d1cecf5442aa14b8a57a2a81873c383d016bb730453e9557f4bb3c8620

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page