Evaluation and Benchmark Tool for Feature Selection

FSEval – Feature Selection Evaluation Suite

FSEval is a lightweight, modular Python library designed to benchmark feature selection and feature ranking methods across multiple datasets using both supervised and unsupervised downstream evaluation protocols.

It helps researchers and practitioners answer the question:

"Which feature selection method actually works best for my type of data and task?"

FSEval automates:

Repeated training & evaluation at different feature subset sizes
Stochastic method averaging
Result persistence & incremental updates
Support for both classification and clustering-based evaluation
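
To make the first two items concrete, here is a minimal sketch of evaluating a scoring method at several feature ratios and averaging the repeats of a stochastic method. It is not FSEval's internal code: `top_k_features`, `benchmark`, the toy scoring functions, and the toy evaluation function are all hypothetical names used for illustration, and it assumes that a higher score means a more important feature.

```python
import random
from statistics import mean, pvariance


def top_k_features(scores, ratio):
    """Return sorted indices of the highest-scoring features, keeping `ratio` of them."""
    k = max(1, round(len(scores) * ratio))  # always keep at least one feature
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:k])


def benchmark(method, X, evaluate, ratios, stochastic=False, avg_steps=5):
    """Evaluate `method` at each feature ratio; average the repeats if it is stochastic."""
    repeats = avg_steps if stochastic else 1
    results = {}
    for ratio in ratios:
        runs = []
        for _ in range(repeats):
            scores = method(X)
            selected = top_k_features(scores, ratio)
            X_sub = [[row[j] for j in selected] for row in X]
            runs.append(evaluate(X_sub))
        results[ratio] = mean(runs)
    return results


# Toy stand-ins for a deterministic method, a stochastic method, and an evaluation.
def variance_scores(X):
    return [pvariance(col) for col in zip(*X)]


_rng = random.Random(0)


def random_scores(X):
    return [_rng.random() for _ in X[0]]


def total_variance(X_sub):
    return sum(pvariance(col) for col in zip(*X_sub))


X = [[1, 10, 0], [2, 20, 0], [3, 30, 0], [4, 40, 0]]
print(benchmark(variance_scores, X, total_variance, ratios=[1 / 3, 1.0]))
print(benchmark(random_scores, X, total_variance, ratios=[1 / 3, 1.0], stochastic=True))
```

The deterministic method is run once per ratio, while the stochastic one is run `avg_steps` times and its results are averaged, which mirrors the role of the `stochastic` flag in the method dictionaries.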

📦 Dependencies and Requirements

FSEval requires:

python>=3.8
numpy
pandas
scikit-learn
scipy
fsdem
clustpy (only needed for unsupervised_clustering_accuracy)
pcametric (only needed for AAD)

💡 Installation

Download the source code and import it directly, or install it with pip:

pip install sdufseval

🚀 Quick Example

from sdufseval import FSEVAL
import numpy as np
from sklearn.neighbors import NearestNeighbors

def snn_consistency_k5(X_orig, X_sub, y):
    """
    Calculates the average proportion of shared nearest neighbors (k=5) 
    between the original space and the feature-selected subspace.
    """
    k = 5
    k = min(k, X_orig.shape[0] - 1)
    
    def get_nn_indices(data, n_neighbors):
        nbrs = NearestNeighbors(n_neighbors=n_neighbors + 1, algorithm='auto').fit(data)
        _, indices = nbrs.kneighbors(data)
        return indices[:, 1:]

    nn_orig = get_nn_indices(X_orig, k)
    nn_sub = get_nn_indices(X_sub, k)
    
    intersections = [len(np.intersect1d(nn_orig[i], nn_sub[i])) for i in range(len(nn_orig))]
    return np.mean(intersections) / k

if __name__ == "__main__":

    DATASETS_TO_RUN = ['colon', 'leukemia', 'prostate_GE']

    evaluator = FSEVAL(
        output_dir="benchmark_results", 
        avg_steps=5,
        eval_type=["supervised", "unsupervised", "model_agnostic", "custom"],
        custom_metrics={"SNN_K5": snn_consistency_k5}
    )

    methods_list = [
        {
            'name': 'Random', 
            'stochastic': True, 
            'func': evaluator.random_baseline
        },
        {
            'name': 'Variance_Baseline', 
            'stochastic': False, 
            'func': lambda X: np.var(X, axis=0)
        }
    ]
    
    print(">>> Starting Integrated Evaluation (Global & Local metrics)...")
    evaluator.run(DATASETS_TO_RUN, methods_list)

    print("\n>>> Starting Scalability Analysis...")
    evaluator.timer(
        methods=methods_list, 
        vary_param='both', 
        time_limit=3600 
    )

Data Loading

load_dataset(dataset_name, data_dir="datasets") supports:

Single .mat file with keys 'X' and 'Y'
Two CSV files: {name}_X.csv and {name}_y.csv
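
As a sketch of the two-file CSV layout, the helper below writes a toy dataset as {name}_X.csv and {name}_y.csv. `write_csv_dataset` is a hypothetical helper, not part of FSEval, and the file details are assumptions that the text above does not confirm: comma-separated values, no header row, one sample per row in the X file, and one label per row in the y file.

```python
import csv
import os
import tempfile


def write_csv_dataset(name, X, y, data_dir="datasets"):
    """Write features and labels as {name}_X.csv and {name}_y.csv in data_dir."""
    if len(X) != len(y):
        raise ValueError("X and y must have the same number of rows")
    os.makedirs(data_dir, exist_ok=True)
    x_path = os.path.join(data_dir, f"{name}_X.csv")
    y_path = os.path.join(data_dir, f"{name}_y.csv")
    # Assumption: no header row; check what load_dataset() expects before relying on this.
    with open(x_path, "w", newline="") as fh:
        csv.writer(fh).writerows(X)
    with open(y_path, "w", newline="") as fh:
        csv.writer(fh).writerows([label] for label in y)
    return x_path, y_path


# Write a three-sample toy dataset into a temporary folder.
with tempfile.TemporaryDirectory() as tmp:
    paths = write_csv_dataset("toy", [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]], [0, 1, 0], data_dir=tmp)
    print([os.path.basename(p) for p in paths])
```

If the loader expects a header or a different delimiter, only the two `csv.writer` calls need to change.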

📚 API Reference

🛠️ `FSEval(output_dir="results", cv=5, avg_steps=10, eval_type=["supervised", "unsupervised", "model_agnostic"], metrics=None, experiments=None)`

Initializes the evaluation and benchmark object.

Parameter	Default	Description
`output_dir`	results	Folder where CSV result files are saved.
`cv`	5	Cross-validation folds (supervised only).
`avg_steps`	10	Number of repetitions for stochastic methods.
`supervised_iter`	5	Number of classifier runs with different random seeds.
`unsupervised_iter`	10	Number of clustering runs with different random seeds.
`eval_type`	["supervised", "unsupervised", "model_agnostic"]	Evaluation protocols to run: any of "supervised", "unsupervised", "model_agnostic", and "custom". Include "custom" to enable the user-defined metrics passed in `custom_metrics`.
`metrics`	["CLSACC", "NMI", "ACC", "AUC", "AAD"]	Evaluation metrics to calculate.
`stability`	True	Whether to calculate FSDEM stability for the metrics. True, False, or a list of metrics to calculate stability for.
`custom_metrics`	{}	User-defined custom evaluation metrics.
`experiments`	["10Percent", "100Percent"]	Which feature ratio grids to evaluate.
`save_all`	False	Save the results of all runs of the stochastic methods separately.
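
As an illustration of `custom_metrics`, the sketch below defines a second custom metric with the same three-argument signature as the one in the Quick Example, `(X_orig, X_sub, y)`. The metric itself, `retained_variance`, is a hypothetical example and not something FSEval ships. It is written against plain sequences of rows so that it runs without extra dependencies.

```python
from statistics import pvariance


def retained_variance(X_orig, X_sub, y=None):
    """Fraction of the total per-feature variance that the selected subset keeps."""
    total = sum(pvariance(col) for col in zip(*X_orig))
    kept = sum(pvariance(col) for col in zip(*X_sub))
    # Guard against constant data, where the total variance is zero.
    return kept / total if total > 0 else 0.0


X_orig = [[1, 10, 0], [2, 20, 0], [3, 30, 0], [4, 40, 0]]
X_sub = [[row[1]] for row in X_orig]  # keep only the second feature
print(retained_variance(X_orig, X_sub))
```

Following the pattern in the Quick Example, such a function would be registered as `custom_metrics={"RetainedVar": retained_variance}` together with "custom" in `eval_type`.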

⚙️ `run(datasets, methods, classifier=None)`

Runs the evaluation of the given methods on the given datasets.

Argument	Type	Description
`datasets`	List[str]	Dataset names loadable via load_dataset().
`methods`	List[dict]	`[{"name": str, "func": callable, "stochastic": bool}, ...]`
`classifier`	sklearn classifier	Classifier for supervised eval (default: RandomForestClassifier)
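
Because `methods` is a plain list of dictionaries, it can help to check its shape before starting a long benchmark. The helper below is a hypothetical pre-flight check, not part of FSEval; it only enforces the three keys and types listed in the table above.

```python
def validate_methods(methods):
    """Check that every entry has a str 'name', a callable 'func', and a bool 'stochastic'."""
    for i, method in enumerate(methods):
        for key in ("name", "func", "stochastic"):
            if key not in method:
                raise ValueError(f"methods[{i}] is missing the '{key}' key")
        if not isinstance(method["name"], str):
            raise TypeError(f"methods[{i}]['name'] must be a str")
        if not callable(method["func"]):
            raise TypeError(f"methods[{i}]['func'] must be callable")
        if not isinstance(method["stochastic"], bool):
            raise TypeError(f"methods[{i}]['stochastic'] must be a bool")
    return methods


methods_list = validate_methods([
    {"name": "First_Value", "stochastic": False, "func": lambda X: [row[0] for row in X]},
])
print([m["name"] for m in methods_list])
```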

⚙️ `timer(methods, vary_param='features', time_limit=3600)`

Runs a runtime analysis on the methods.

Argument	Type	Description
`methods`	List[dict]	`[{"name": str, "func": callable, "stochastic": bool}, ...]`
`vary_param`	str	"features", "instances", or "both" (default: "features").
`time_limit`	int	Stops running a method after recording the first run that exceeds this limit (default: 3600).
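
The stopping rule described for `time_limit` can be sketched as follows: time a method on inputs of growing size, record every run, and stop after the first run that exceeds the limit. This is an illustrative sketch rather than FSEval's implementation; `time_method`, `make_matrix`, `column_ranges`, and the injectable `clock` argument are hypothetical names, and the limit is assumed to be in the same unit as the clock (seconds for `time.perf_counter`).

```python
import time


def time_method(func, make_data, sizes, time_limit=3600, clock=time.perf_counter):
    """Time func on data of growing size; stop after the first run over time_limit."""
    timings = []
    for size in sizes:
        X = make_data(size)
        start = clock()
        func(X)
        elapsed = clock() - start
        timings.append((size, elapsed))  # the over-limit run is still recorded
        if elapsed > time_limit:
            break
    return timings


def make_matrix(n_features, n_rows=50):
    """Build a small synthetic matrix with the requested number of features."""
    return [[(i * j) % 7 for j in range(n_features)] for i in range(n_rows)]


def column_ranges(X):
    """Toy scoring method: the range (max minus min) of each feature."""
    return [max(col) - min(col) for col in zip(*X)]


for size, seconds in time_method(column_ranges, make_matrix, sizes=[10, 100, 1000]):
    print(f"{size} features: {seconds:.6f}")
```

Varying the number of rows instead of the number of features only requires a different `make_data` function, which corresponds to the "instances" setting of `vary_param`.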

Dashboard

A Feature Selection Evaluation Dashboard, built on the benchmarks produced by FSEVAL, is available at:

https://fseval.imada.sdu.dk/

The dashboard offers a collection of useful analytic tools to provide comprehensive and comparative insights into the performance of your feature selection method(s).

Citation

If you use FSEVAL in your research, please cite the original paper:

CITATION WILL BE PROVIDED UPON PUBLICATION.
