Conformal Recursive Feature Elimination
Project description
CRFE - Conformal Recursive Feature Selection
CRFE is the first feature selection method based on a recursive backward elimintation policy that takes advantage from the Conformal Prediction framework [1]. CRFE´s objective is minimizing the non-conformity of the features [2] using a recursive elimination policy. We also present an automatic stopping criteria for recursive methods.
Requirements
- Python 3.9+
- scikit-learn 1.2.2+
- numpy
Quickstart
You can install this repository locally via pip using
pip install .
For editable/development mode:
pip install -e .
Using pixi (see pixi.toml in this repository):
pixi install
pixi run install-local
Let start with a basic example. This example is coded in Examples/example_2.py.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.svm import LinearSVC
from sklearn.utils import check_random_state
from sklearn.model_selection import train_test_split
from CRFE import CRFE, ParamParada
rng = check_random_state(0)
iris = load_iris()
# Append irrelevant noise features under a fixed seed
X = np.c_[iris.data, rng.normal(size=(len(iris.data), 6))]
Y = iris.target
X_tr, X_test, Y_tr, Y_test = train_test_split(X, Y, test_size=0.2, stratify=Y)
X_tr, X_cal, Y_tr, Y_cal = train_test_split(X_tr, Y_tr, test_size=0.5, stratify=Y_tr)
estimator = LinearSVC(tol=1e-4, loss="squared_hinge", max_iter=300000)
crfe = CRFE(
estimator,
features_to_select=3,
stopping_activated=True,
stopping_params=ParamParada(alpha=0.05, eps=0.02, eta=0.1, paciencia=20),
)
crfe.fit(X_tr, Y_tr, X_cal, Y_cal)
print("Selected features:", crfe.idx_features_)
print("Stopping reason:", crfe.stopping_reason_)
The selected features can be reused with any estimator. The cloned estimator is available through crfe.estimator_.
X_tr_sel = X_tr[:, crfe.idx_features_]
X_test_sel = X_test[:, crfe.idx_features_]
svm_selected = crfe.estimator_.fit(X_tr_sel, Y_tr)
print("Score with selected features:", svm_selected.score(X_test_sel, Y_test))
Folder layout
CRFE_library_pro/
├── Library/
│ ├── CRFE/
│ │ ├── _crfe.py
│ │ ├── _crfe_utils.py
│ │ ├── documentation.txt
│ │ └── stopping.py
│ └── Examples/
│ ├── example_1.py
│ └── example_2.py
├── LICENSE
└── readme.md
References
[1] V. Balasubramanian, S.-S. Ho, and V. Vovk, Conformal Prediction for Reliable Machine Learning: Theory, Adaptations and Applications, 1st ed. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2014.
[2] T. Bellotti, Z. Luo, and A. Gammerman, “Strangeness Minimisation Feature Selection with Confidence Machines,” in Intelligent Data Engineering and Automated Learning IDEAL 2006, ser. Lecture Notes in Computer Science, E. Corchado, H. Yin, V. Botti, and C. Fyfe, Eds. Berlin, Heidelberg: Springer, pp. 978–985, 2006
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file crfe-0.1.0.tar.gz.
File metadata
- Download URL: crfe-0.1.0.tar.gz
- Upload date:
- Size: 16.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
954e559efe4149747e4f6efb6a6c2bdf8c4f5d0e930f84d2c6b5c1bf957be33d
|
|
| MD5 |
623f924bfbeaaf7d7fe4ef5c6b3b546c
|
|
| BLAKE2b-256 |
704a05a6ecd5611aa997fd49d19c7bd7205b97001bb92b298b4561e7e1abf389
|
File details
Details for the file crfe-0.1.0-py3-none-any.whl.
File metadata
- Download URL: crfe-0.1.0-py3-none-any.whl
- Upload date:
- Size: 17.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
847004591c6b603ab0512e1c260356a9ddef29d990751df4b68ec91aae7a51fd
|
|
| MD5 |
45ce294d3f5f5cf403c0fb67a2aad4bf
|
|
| BLAKE2b-256 |
749d4d7d8fc8ef686e016c52baf63a4a64c85e98ea958588f061c3cc2883cc6f
|