Implements multiple type of filter methods and heuristics for the feature selection problem in machine learning as well as a new one: tournament in differential evolution

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: BSD License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

TiDE: Tournament in Differential Evolution for Feature Selection

TiDE (Tournament in Differential Evolution) is a Python package that provides a comprehensive benchmark of filter-based, greedy, and metaheuristic wrapper methods for feature selection in machine learning. It also introduces a novel adaptive strategy based on Differential Evolution, capable of adjusting its mutation, initialization, and crossover policies according to the data.

This project was developed as part of a research study evaluating the robustness, performance, and extensibility of feature selection methods under various data conditions (noise, redundancy, imbalance, high-dimensionality). It is particularly suited for binary classification problems in high-dimensional settings.

🚀 Key Features

A unified framework for evaluating feature selection methods.
Integrated filter methods: ANOVA, MRMR, SURF.
Greedy wrappers: Sequential Forward and Backward Floating Selection.
Metaheuristics:
- Local search: Hill Climbing, Tabu Search
- Population-based: Genetic Algorithm, PBIL, DE, MBDE
TiDE: A novel adaptive variant of Differential Evolution.

📦 Installation

You can install TiDE directly from PyPI:

pip install tide-feature-selection

Alternatively, you can clone the repository and install it in editable mode:

git clone https://github.com/thibaultanani/TiDE.git
cd TiDE
pip install -e .

Dependencies are listed in setup.py and will be automatically installed. These include:

numpy
pandas
scikit-learn
scipy
openpyxl
psutil

You can also create a virtual environment beforehand:

python -m venv .venv
source .venv/bin/activate  # or .venv\Scripts\activate on Windows
pip install -e .

🧪 Usage Example

import pandas as pd
from sklearn.datasets import load_breast_cancer
from sklearn.metrics import balanced_accuracy_score
from sklearn.model_selection import train_test_split, KFold
from sklearn.naive_bayes import GaussianNB
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

from feature_selections.heuristics import Tide

if __name__ == '__main__':
    # Load the dataset
    data = load_breast_cancer()
    X, y = pd.DataFrame(data.data, columns=data.feature_names), pd.Series(data.target, name='target')
    # Divide data into training and test sets
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
    # Divide the training set into training and validation sets
    X_train, X_val, y_train, y_val = train_test_split(X_train, y_train, test_size=0.25, random_state=42)
    # Create DataFrames
    train_df, train_df['target'] = X_train.copy(), y_train
    val_df, val_df['target'] = X_val.copy(), y_val
    test_df, test_df['target'] = X_test.copy(), y_test
    # Create scikit-learn pipeline
    model = GaussianNB()
    scoring = balanced_accuracy_score
    pipeline = Pipeline([('scaler', StandardScaler()), ('clf', model)])
    # Example of the use of a feature selection method
    tide = Tide(name="n1", target='target', train=train_df, test=val_df, scoring=scoring, pipeline=pipeline,
                Tmax=60, verbose=True, output="test")
    tide.start(pid=1)
    # It is also possible to only use training data as input for cross validation
    cv = KFold(n_splits=5, shuffle=True, random_state=42)
    tide_kfold = Tide(name="n2", target='target', train=train_df, cv=cv, scoring=scoring, pipeline=pipeline,
                      Tmax=60, verbose=True, output="test")
    tide_kfold.start(pid=2)
    # The results are automatically saved in the output "test" directory

🧠 Scientific Background

This package was developed as part of a research study investigating the robustness and adaptability of feature selection strategies across diverse data challenges. The proposed method TiDE dynamically adapts its mutation and crossover mechanisms according to the data characteristics, making it highly competitive compared to classical DE and other metaheuristics.

The full study is detailed in the accompanying manuscript:

Anani, T., Delbot, F., & Pradat-Peyre, J.-F. (2025). Tournament in Differential Evolution for Robust Feature Selection. Currently being submitted.

📊 Citation

@article{anani2025tide,
  author = {Anani, Thibault and Delbot, François and Pradat-Peyre, Jean-François},
  title = {Tournament in Differential Evolution for Robust Feature Selection},
  journal = { },
  year = {2025},
  note = {Submitted}
}

🛠 Contributing

Contributions, ideas and bug reports are welcome! Please open an issue or a pull request.

📄 License

This project is licensed under the BSD 3-Clause License.

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: BSD License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

1.2.1

Sep 26, 2025

This version

1.2.0

Sep 21, 2025

1.1.5

Sep 19, 2025

1.1.4

Sep 19, 2025

1.1.3

Sep 16, 2025

1.1.2

Sep 16, 2025

1.1.1

Sep 15, 2025

1.1.0

Jul 18, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tide_feature_selection-1.2.0.tar.gz (35.1 kB view details)

Uploaded Sep 21, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

tide_feature_selection-1.2.0-py3-none-any.whl (47.6 kB view details)

Uploaded Sep 21, 2025 Python 3

File details

Details for the file tide_feature_selection-1.2.0.tar.gz.

File metadata

Download URL: tide_feature_selection-1.2.0.tar.gz
Upload date: Sep 21, 2025
Size: 35.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for tide_feature_selection-1.2.0.tar.gz
Algorithm	Hash digest
SHA256	`53fbac2f8099fe0e0e3625e11275fe03e81cd5a3145700db2395bcc6bbcf0f82`
MD5	`91ff76c0288614a6553d879d41f3eff9`
BLAKE2b-256	`d3213fdf2a70afd644ced211ca80117685cb2a93cd4c49e9f778a9fa42d436c9`

See more details on using hashes here.

File details

Details for the file tide_feature_selection-1.2.0-py3-none-any.whl.

File metadata

Download URL: tide_feature_selection-1.2.0-py3-none-any.whl
Upload date: Sep 21, 2025
Size: 47.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for tide_feature_selection-1.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b8939739bb7fc4a32662ad3262091a5649c24b53df23f3f583489f38e53355df`
MD5	`20d9c0f43756e25d9a35528cfde76184`
BLAKE2b-256	`a1d200f3362eacf650766248c490b64211d4f49318bec0f0acbc078aa020571f`

See more details on using hashes here.

tide-feature-selection 1.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TiDE: Tournament in Differential Evolution for Feature Selection

🚀 Key Features

📦 Installation

🧪 Usage Example

🧠 Scientific Background

📊 Citation

🛠 Contributing

📄 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes