Skip to main content

Pipeline for analysis of the machine learning applications in Sci-Kit Learn

Project description

Pipeline Logo

machine-learning-comparison-pipeline

Analysis of classification through machine learning is often accomplished with what the researcher is most comfortable using in the analysis. But that does not mean that the most optimal learner was selected for the research question. It is also often that feature selection is performed, but only with minimal processing with variation in the selection process.

During the analysis of a series of acoustic measurement from candidate propellers designed by the United States Air Force Academy it was determined that the 711th Human Performance Wing did not want to fall into these limitations. The wing developed a importance getter function using sensitivity analysis to determine the feature importance. This method was applied to random decision forests, support vector machines, neural networks, logistic regressions, and nearest neighbor machine learners.

This package was developed from that research in effort to canonize the process for future work.

Usage

Define the inputs to the class, including the feature DataFrame, targets Series, the learners and cross-validation

clf1 = nn.KNeighborsClassifier(n_neighbors=5)
clf2 = nn.KNeighborsClassifier(n_neighbors=5, weights='distance')
learners = list([clf1, clf2])
cv = ms.KFold(n_splits=10)
dataset = pd.read_csv(str(pathlib.Path(__file__).parents[1]) + '/data/features.csv')
features = dataset.iloc[:, 1:74]
targets = dataset['PROPELLER']
pipe = pipeline.ProcessingPipeline(learners, cv, features, targets)

pipe.process(72, verbose=True)

Cleared for public release on 14 November 2024 with case number AFRL-2024-6348.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

machinelearningcomparisonpipeline-0.2.4.tar.gz (11.1 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file machinelearningcomparisonpipeline-0.2.4.tar.gz.

File metadata

File hashes

Hashes for machinelearningcomparisonpipeline-0.2.4.tar.gz
Algorithm Hash digest
SHA256 f8fa53aa39fc4d57be7b65b89675d5c30d48990f83f1cb300e4c4c5264da95c2
MD5 640a9c298508407e1efdee547d5c7988
BLAKE2b-256 df5e88cdf568a3fc200b3aa451dba4f6b4de7a11595674e41db3cec09adb99ea

See more details on using hashes here.

File details

Details for the file MachineLearningComparisonPipeline-0.2.4-py3-none-any.whl.

File metadata

File hashes

Hashes for MachineLearningComparisonPipeline-0.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 22244a85c6d9190ca7e83fad1150be82d6fc5abd139fb870309fa1591e6f55fa
MD5 1026d956256947932762518d5ed47d42
BLAKE2b-256 99f7f26438eb64d7117de6cef0ddb5a8f20fe75c29e12176a3865ff67dd6189b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page