Skip to main content

An end-to-end feature selection distribution with linear runtime(number of features) complexity.

Project description

AutoNFS

AutoNFS is a deep learning model that can be used to select the most important features from a given dataset. The model is based on the Gumbel-Sigmoid distribution.

Installation

To install the package, you can use pip:

pip install autonfs

Usage examples

Basic usage

from autonfs import AutoNFS
from sklearn.datasets import load_breast_cancer

breast = load_breast_cancer()
X = breast.data
y = breast.target

gfs = GFSNetwoAutoNFSrk()
X = gfs.fit_transform(X, y)

print(gfs.support_)
print(gfs.scores_)

Performance verification

from autonfs import AutoNFS
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import balanced_accuracy_score

DEVICE = "cpu"

breast = load_breast_cancer()
X = breast.data
y = breast.target

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
clf = RandomForestClassifier(random_state=42)
clf.train(X_train, y_train)
orig_score = balanced_accuracy_score(y_test, clf.predict(X_test))

print(f"Original score: {orig_score:.3f}. Original features: {X.shape[1]}")
# Original score: 0.958. Original features: 30

gfs = AutoNFS(verbose=True, device=DEVICE)
gfs.fit(X_train, y_train)

X_transformed = gfs.transform(X_train)
X_test_transformed = gfs.transform(X_test)

clf.fit(X_transformed, y_train)
y_pred = clf.predict(X_test_transformed)
score = balanced_accuracy_score(y_test, y_pred)
logger.info(f"Score after feature selection: {score}. Selected features: {sum(gfs.support_)}")
# Score after feature selection: 0.958. Selected features: 3

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autonfs-0.3.1.tar.gz (4.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autonfs-0.3.1-py3-none-any.whl (6.0 kB view details)

Uploaded Python 3

File details

Details for the file autonfs-0.3.1.tar.gz.

File metadata

  • Download URL: autonfs-0.3.1.tar.gz
  • Upload date:
  • Size: 4.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.4 CPython/3.12.3 Linux/6.8.0-64-generic

File hashes

Hashes for autonfs-0.3.1.tar.gz
Algorithm Hash digest
SHA256 b42b967e49f1027df510d2bcd1e27d0d2a9208f5362945703e5de8cf145dfc0f
MD5 4ad9a942ad6254fc025c0658a94c4abc
BLAKE2b-256 f7b17c484940598be42aa9a7ebf5624cc69deedf05cc28b340e1cbc0b4cb136e

See more details on using hashes here.

File details

Details for the file autonfs-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: autonfs-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 6.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.4 CPython/3.12.3 Linux/6.8.0-64-generic

File hashes

Hashes for autonfs-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0a4926139150c6286eb8cfa091bceacb1d05f24180f80b2d36802031570dfa3f
MD5 e77d216d6b74b69c5a0e9ac454db8517
BLAKE2b-256 b23d990101da18aca1649f7b5e2983127865b29106e2ca970a4fe2b509b5019a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page