Neighborhood Component Analysis Feature Selection
Project description
NCAFS
Neighborhood Component Analysis Feature Selection
NCAFS is a non-parametric algorithm based on k-nearest neighbors (kNN), which learns a feature weighting vector by minimizing the expected leave-one-out error with a regularization term. NCA was originally proposed in [1], which inspired the feature selection method for classification presented in [2]. It was then extended to regression problems as well [3].
NCAFS is a Python package that implements the NCA feature selection method, for both classification and regression problems.
Instalation
pip install ncafs
Getting started
Classification
from ncafs import NCAFSC
from sklearn import datasets
X, y = datasets.make_classification(
n_samples=1000,
n_classes=5,
n_features=20,
n_informative=100,
n_redundant=0,
n_repeated=0,
flip_y=0.1,
class_sep=0.5,
shuffle=False,
random_state=0
)
fs_clf = NCAFSC()
fs_clf.fit(X, y)
w = fs_clf.weights_
Regression
from ncafs import NCAFSR
from sklearn import datasets
X, y, coef = datasets.make_regression(
n_samples=1000,
n_features=100,
n_informative=20,
bias=0,
noise=1e-3,
coef=True,
shuffle=False,
random_state=0
)
fs_reg = NCAFSR()
fs_reg.fit(X, y)
w = fs_reg.weights_
References
- Goldberger, J., Hinton, G., Roweis, S., Salakhutdinov, R. (2005). Neighbourhood Components Analysis. Advances in Neural Information Processing Systems. 17, 513-520.
- Yang, W., Wang, K., & Zuo, W. (2012). Neighborhood component feature selection for high-dimensional data. J. Comput., 7(1), 161-168.
- Amankwaa-Kyeremeh, B., Greet, C., Zanin, M., Skinner, W. and Asamoah, R. K., (2020), Selecting key predictor parameters for regression analysis using modified Neighbourhood Component Analysis (NCA) Algorithm. Proceedings of 6th UMaT Biennial International Mining and Mineral Conference, Tarkwa, Ghana, pp. 320-325.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ncafs-0.2.tar.gz
(6.4 kB
view details)
File details
Details for the file ncafs-0.2.tar.gz
.
File metadata
- Download URL: ncafs-0.2.tar.gz
- Upload date:
- Size: 6.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.1 requests/2.25.1 setuptools/57.4.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.7.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b236abd86396b5ace9d1afffb71847528ebc24cd5aa712a48aebd2c5d9dd7870 |
|
MD5 | 1968501e52f6c81283bad04f99066296 |
|
BLAKE2b-256 | 7c83dbdc28f6babc7f2f1a017d0434c08fdb54a10c02aa6f1bc6b67aeafe17f4 |