Pipeline for analysis of the machine learning applications in Sci-Kit Learn
Project description
machine-learning-comparison-pipeline
Analysis of classification through machine learning is often accomplished with what the researcher is most comfortable using in the analysis. But that does not mean that the most optimal learner was selected for the research question. It is also often that feature selection is performed, but only with minimal processing with variation in the selection process.
During the analysis of a series of acoustic measurement from candidate propellers designed by the United States Air Force Academy it was determined that the 711th Human Performance Wing did not want to fall into these limitations. The wing developed a importance getter function using sensitivity analysis to determine the feature importance. This method was applied to random decision forests, support vector machines, neural networks, logistic regressions, and nearest neighbor machine learners.
This package was developed from that research in effort to canonize the process for future work.
Usage
Define the inputs to the class, including the feature DataFrame, targets Series, the learners and cross-validation
clf1 = nn.KNeighborsClassifier(n_neighbors=5)
clf2 = nn.KNeighborsClassifier(n_neighbors=5, weights='distance')
learners = list([clf1, clf2])
cv = ms.KFold(n_splits=10)
dataset = pd.read_csv(str(pathlib.Path(__file__).parents[1]) + '/data/features.csv')
features = dataset.iloc[:, 1:74]
targets = dataset['PROPELLER']
pipe = pipeline.ProcessingPipeline(learners, cv, features, targets)
pipe.process(72, verbose=True)
Cleared for public release on 14 November 2024 with case number AFRL-2024-6348.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file machinelearningcomparisonpipeline-0.2.4.tar.gz
.
File metadata
- Download URL: machinelearningcomparisonpipeline-0.2.4.tar.gz
- Upload date:
- Size: 11.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f8fa53aa39fc4d57be7b65b89675d5c30d48990f83f1cb300e4c4c5264da95c2 |
|
MD5 | 640a9c298508407e1efdee547d5c7988 |
|
BLAKE2b-256 | df5e88cdf568a3fc200b3aa451dba4f6b4de7a11595674e41db3cec09adb99ea |
File details
Details for the file MachineLearningComparisonPipeline-0.2.4-py3-none-any.whl
.
File metadata
- Download URL: MachineLearningComparisonPipeline-0.2.4-py3-none-any.whl
- Upload date:
- Size: 11.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 22244a85c6d9190ca7e83fad1150be82d6fc5abd139fb870309fa1591e6f55fa |
|
MD5 | 1026d956256947932762518d5ed47d42 |
|
BLAKE2b-256 | 99f7f26438eb64d7117de6cef0ddb5a8f20fe75c29e12176a3865ff67dd6189b |