An end-to-end feature selection distribution with ~constant runtime(number of features) complexity.
This project has been archived.
The maintainers of this project have marked this project as archived. No new releases are expected.
Project description
GFS Network
Gumbel Feature Selection Network is a deep learning model that can be used to select the most important features from a given dataset. The model is based on the Gumbel-Sigmoid distribution.
Installation
To install the package, you can use pip:
pip install gfs_network
Usage examples
Basic usage
from gfs_network import GFSNetwork
from sklearn.datasets import load_breast_cancer
breast = load_breast_cancer()
X = breast.data
y = breast.target
gfs = GFSNetwork()
X = gfs.fit_transform(X, y)
print(gfs.support_)
print(gfs.scores_)
Performance verification
from gfs_network import GFSNetwork
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import balanced_accuracy_score
DEVICE = "cpu"
breast = load_breast_cancer()
X = breast.data
y = breast.target
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
clf = RandomForestClassifier(random_state=42)
clf.train(X_train, y_train)
orig_score = balanced_accuracy_score(y_test, clf.predict(X_test))
print(f"Original score: {orig_score:.3f}. Original features: {X.shape[1]}")
# Original score: 0.958. Original features: 30
gfs = GFSNetwork(verbose=True, device=DEVICE)
gfs.fit(X_train, y_train)
X_transformed = gfs.transform(X_train)
X_test_transformed = gfs.transform(X_test)
clf.fit(X_transformed, y_train)
y_pred = clf.predict(X_test_transformed)
score = balanced_accuracy_score(y_test, y_pred)
logger.info(f"Score after feature selection: {score}. Selected features: {sum(gfs.support_)}")
# Score after feature selection: 0.958. Selected features: 3
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file gfs_network-0.3.1.tar.gz.
File metadata
- Download URL: gfs_network-0.3.1.tar.gz
- Upload date:
- Size: 4.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.7.1 CPython/3.9.18 Linux/5.10.0-30-amd64
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a0df58fa14f0464784b40829344fe501cb730c3e7b722651a89a166433920ea2
|
|
| MD5 |
91027dacfa39c171eb41f85d25e3d682
|
|
| BLAKE2b-256 |
310860287bb3bb6d89aea9fb3788e09e4a0a604a3b7c26e107ca4e9510e66f70
|
File details
Details for the file gfs_network-0.3.1-py3-none-any.whl.
File metadata
- Download URL: gfs_network-0.3.1-py3-none-any.whl
- Upload date:
- Size: 6.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.7.1 CPython/3.9.18 Linux/5.10.0-30-amd64
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fd7b1d9f2f035c17270ef7f8adfa6ccae50b33deb1855eeb8163aaa45bfea371
|
|
| MD5 |
10e2b92b1fbd2e650c115fcb3fc8fcde
|
|
| BLAKE2b-256 |
c5b0193f4746bfb7750b3b98c30dbf0be2743e6b07fa347de028677ce3ce259b
|