A Feature Selection and Feature ranking Package that can be used to select and rank features in datasets
Project description
#Feature Selection and Feature Ranking Algorithms :
A Python package that provides many feature selection and feature ranking algorithms
##Usage
Use the function call like :
fsfr(dataset, fs = 'string_value', fr = 'string_value', ftf = 'string_value')
Parameters :
dataset : pandas dataframe of the original dataset : It must only contain numerical values (categorical, ordinal values are excluded) and the class variable (decisional attribute or variable) should be also of numerical type.
fs : string values : 'gpso' or 'ga' fs means feature selection method can be either : gpso : Geometric Particle Swarm Optimisation ga : Genetic Alogorithm
fr : string values : 'rsm_a','rsm_b','rsm_c','mifsnd','mrmr' fr means feature ranking and can be either : rsm_a : Rough Set Method 1 rsm_b : Rough Set Method 2 rsm_c : Rough Set Method 3 mifsnd : Mutual Information Feature Selection-ND mrmr : Minimum Redundancy Maximum Relevance
ftf : string values : 'ftf_1','ftf_2','ftf_3' ftf means fitness function and if 'fs' is used then, it is mandatory to specify the value of 'ftf' ftf_1 : fitness function = 0.75 * (100/accuracy) + 0.25 * (no of features) ftf_2 : fitness function = 0.75 * accuracy + 0.25 * (1 / no of features) ftf_3 : fitness_function = accuracy * (1 - no of features/total no of features) no of features = no of features that are selected by the algorithm at that instance
Returns : list of features ranked in descending order if both 'fs' and 'fr' are used or only 'fr' is used.
The feature selection and ranking can be used independently of each other by mentioning either fs='' or fr='' but both cannot be '' and it is preferable to use both at the same time in case of larger datasets.
Refrences for algorithms :
gpso with ftf_1 : https://www.researchgate.net/publication/4307926_Gene_selection_in_cancer_
rsm_b : https://ieeexplore.ieee.org/document/7104131
mifsnd : https://www.sciencedirect.com/science/article/pii/S0957417414002164
The rest of the algorithms have been self developed and do not contain any materials from any other sources
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for Feature Selction-Ranking Algorithms-1.0.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8efe356658e8aeb34cb1dd71f31049f40241cb76b4e80bac329bf8a4d05584b1 |
|
MD5 | bd0e9ad2a6bfa5467a677f3794060976 |
|
BLAKE2b-256 | 6f2c7e1b6f8046e73641866bb11b1ea9f7c6022053b530169810544c86bebcdf |
Hashes for Feature_Selction_Ranking_Algorithms-1.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 570fd5703d0b7cb693a5796c0f26cb5deaffb486fed3d03ba9ff785764e067bb |
|
MD5 | 714abcf312d3286ce10d78912982b310 |
|
BLAKE2b-256 | 8efc2cbe13dd01f9555e62cb1e3a98d3cefafbefaecb337f3a7eab078525b37d |