This is a Python implementation by the authors of the paper 'Online Feature Screening for Data Streams With Concept Drift' from Dr. Mingyuan Wang and Dr. Adrian Barbu. Contain various feature selection methods.
Project description
Online-Feature-Screening-for-Datastream-with-Sparsity-Concept-Drifting
This is a Python implementation by the authors of the paper "Online Feature Screening for Data Streams With Concept Drift" from Dr. Mingyuan Wang and Dr. Adrian Barbu.
Please cite this paper if you use or build on our method. doi.org/10.1109/TKDE.2022.3232752
This project enabled well-known feature screening methods, including gini index, chi-square score, mutual information, fisher-score, T-score to handle streaming data, batch data, data with drifting, and sparse data. It currently only works on binary classification data.
Installation
Prerequisites
Python3.10 or newerpipnumpy2.2.4 or newer
Note
Although the package is designed OS independent, it was only tested on Windows. You might need to use methods listed below other than pip install pyscreeningfs.
For users installing from source (e.g., if no pre-built wheels are available for your system):
You will need a C++ compiler compatible with your Python installation:
- Windows: Microsoft Visual C++ Build Tools (part of Visual Studio, or standalone).
- Linux:
gccandg++(usually included or easily installed via your package manager, e.g.,sudo apt-get install build-essential). - macOS: Xcode Command Line Tools (install with
xcode-select --install).
Install via git clone
- Clone repository
git clone https://github.com/yourusername/repo_name.git
- Navigate into the cloned repository directory
cd repo_name
- Install
pip install .
Install via download
- Download the repository
- Unpack to your own folder your_folder/repo_name
- Navigate into the unpacked repository directory
cd repo_name
- Install
pip install .
Install via pip (Currently unavailable)
If pre-built wheels are available for your system on PyPI (coming soon!), you can install directly:
pip install pyscreeningfs
Data
For .svm sparse data, visit https://www.sysnet.ucsd.edu/projects/url/
Download and put into data/url_svmlight/
For any input data/data files, the Y/label/class vector can only contain numeric value and one of the label must be 1.
Demo
For a demo, see testing.py in the root directory.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pyscreeningfs-0.1.1.tar.gz.
File metadata
- Download URL: pyscreeningfs-0.1.1.tar.gz
- Upload date:
- Size: 64.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7fba58c9f71f599abf01c4cb7779a034618491d86d1c62ea91a03dd3d679c075
|
|
| MD5 |
ba952f513388a98d1e9c9abbbb207a70
|
|
| BLAKE2b-256 |
f263fbc002bc35086667e1985d7636510651c4c620f482bd20e47a2380832659
|
File details
Details for the file pyscreeningfs-0.1.1-cp310-cp310-win_amd64.whl.
File metadata
- Download URL: pyscreeningfs-0.1.1-cp310-cp310-win_amd64.whl
- Upload date:
- Size: 89.3 kB
- Tags: CPython 3.10, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ccd14a877d1d1fe463b42e07882bada17efb6d5610f84dc51bf4239535b042d7
|
|
| MD5 |
75448372721ecfeb4114d3c7d75131d6
|
|
| BLAKE2b-256 |
a747d825b03b60402d7f47ec95ef2d99ba930e1ded3bf6a186a80a4a95e8eeb6
|