Feature Ordering Module from TabSeq (ICPR 2024)
Project description
TabSeq Feature Ordering
This module extracts and packages the feature ordering algorithm used in TabSeq (ICPR 2024) as a standalone utility, enabling integration into any tabular deep learning pipeline.
Key Features
- Variance-based intra-cluster ordering
- KMeans clustering for feature grouping
- Weighted global ordering from local cluster orders
- Minimal dependencies, flexible integration
Installation
pip install tabseq-feature-ordering
Parameters
X_train: Tabular training data aspd.DataFramecluster_size: Number of clusters (e.g., 5)sort_order: Intra-cluster sorting order by variance ('ascending'or'descending')
Output
global_ordering: List of column names in reordered orderX_train_reordered: DataFrame with reordered columns
Example
import pandas as pd
import numpy as np
from tabseq_feature_ordering import reorder_features
# Example input
X = pd.DataFrame(np.random.rand(40, 80), columns=[f"F{i}" for i in range(80)])
# Run feature ordering (descending = high variance first)
order, X_reordered = reorder_features(X, cluster_size=5, ascending=False)
# Print the full reordered feature list
print("Reordered Features:\n", order)
License
MIT License © 2024 Zadid Habib
Citation
If you use this module, please cite our paper:
Habib, Al Zadid Sultan Bin, Kesheng Wang, Mary-Anne Hartley, Gianfranco Doretto, and Donald A. Adjeroh. "TabSeq: A Framework for Deep Learning on Tabular Data via Sequential Ordering." In International Conference on Pattern Recognition, pp. 418-434. Cham: Springer Nature Switzerland, 2024.
Bibtex
@inproceedings{habib2024tabseq,
title={TabSeq: A Framework for Deep Learning on Tabular Data via Sequential Ordering},
author={Habib, Al Zadid Sultan Bin and Wang, Kesheng and Hartley, Mary-Anne and Doretto, Gianfranco and A. Adjeroh, Donald},
booktitle={International Conference on Pattern Recognition},
pages={418--434},
year={2024},
organization={Springer}
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tabseq_feature_ordering-0.1.6.tar.gz.
File metadata
- Download URL: tabseq_feature_ordering-0.1.6.tar.gz
- Upload date:
- Size: 6.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
76a39347f9d01199b4b9976b8b11d2d7fba6b3b5d0810ae1d29187acbe095454
|
|
| MD5 |
fa1825e4f1f1ac7ae81b48cf3530f6dc
|
|
| BLAKE2b-256 |
27cd69c15ea5ea2200b6d6aeb555ae62365f6d06f8dc9456e22310965d7d35ce
|
File details
Details for the file tabseq_feature_ordering-0.1.6-py3-none-any.whl.
File metadata
- Download URL: tabseq_feature_ordering-0.1.6-py3-none-any.whl
- Upload date:
- Size: 5.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a1093b9ba692b4453ed9a51b1c40918eb18ab5279a05e57972a561888e787a94
|
|
| MD5 |
e4d606abeec029d1c83acc329d33f070
|
|
| BLAKE2b-256 |
aeb3ab809d60c32160c3a17913907e475d76212970cc8d7f9f8e045fd8aa8819
|