Advanced benchmarking for machine learning models.

These details have not been verified by PyPI

Project description

Benchmark-Adv-ML

Benchmark-Adv-ML is a Python package designed to facilitate advanced benchmarking of machine learning models. It provides a comprehensive pipeline to evaluate model stability, generate predictions, and visualize results through various plots, including AUC curves, feature importance, and radar charts.

Features

Model Stability Evaluation: Automatically runs multiple machine learning models (Logistic Regression, SVC, RandomForestClassifier) across multiple runs.
Prediction and Metrics: Generates and saves predictions, feature importance, and various metrics for each model and run.
Aggregation of Results: Aggregates results across runs and models for comprehensive analysis.
Visualization: Generates plots including AUC curves, AUC box plots, feature importance plots, and radar charts to compare model performance.

Installation

You can install the package directly from PyPI:

pip install benchmark-adv-ml

Usage in Unix

benchmark-adv-ml --data ./your_dataset.csv --output ./final_results --prelim_output ./prelim_results --n_runs 10 --seed 42

Useage in python

python -m benchmark_adv_ml benchmark --data ./Raisin_Dataset.data --output ./final_results --prelim_output ./prelim_results --n_runs 10 --seed 42

Train Autoencoder Model

python -m benchmark_adv_ml autoencoder --data ./Raisin_Dataset.data --epochs 10 --output_dir ./final_results/ --prelim_output ./prelim_results/ --latent_dim 10 --batch_size 32 --validation_split 0.1 --test_size 0.2 --seed 42

Command-Line Arguments

--data: Path to the existing CSV file containing the dataset. --output: Directory to save the final results and plots. --target : Target column name in the dataset. ( default : 'label') --prelim_output: Directory to save the preliminary results (predictions). --n_runs: Number of runs for model stability evaluation (default is 20). --seed: Seed for random state (default is 42).

Example run : Benchmark Code

benchmark-adv-ml --data ./your_dataset.csv --output ./final_results --prelim_output ./prelim_results --n_runs 10 --seed 42

Dependencies

Python 3.11+ seaborn scikit-learn pandas numpy matplotlib

License

This project is licensed under the MIT License - see the LICENSE file for details.

Author

Vatsal Pate - VatsalPatel18

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.2.6

Oct 17, 2024

0.2.5

Sep 13, 2024

0.2.2

Sep 13, 2024

0.2.0

Sep 12, 2024

0.1.24

Sep 12, 2024

0.1.23

Sep 12, 2024

0.1.22

Sep 12, 2024

0.1.21

Sep 12, 2024

This version

0.1.20

Sep 12, 2024

0.1.19

Sep 12, 2024

0.1.18

Sep 12, 2024

0.1.17

Sep 12, 2024

0.1.16

Sep 12, 2024

0.1.15

Sep 12, 2024

0.1.14

Sep 12, 2024

0.1.13

Sep 12, 2024

0.1.12

Sep 12, 2024

0.1.11

Sep 11, 2024

0.1.10

Sep 11, 2024

0.1.9

Sep 11, 2024

0.1.8

Sep 11, 2024

0.1.7

Sep 11, 2024

0.1.6

Aug 21, 2024

0.1.4

Aug 21, 2024

0.1.2

Aug 21, 2024

0.1.1

Aug 19, 2024

0.1.0

Aug 19, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

benchmark_adv_ml-0.1.20.tar.gz (69.5 kB view hashes)

Uploaded Sep 12, 2024 Source

Built Distribution

benchmark_adv_ml-0.1.20-py3-none-any.whl (76.3 kB view hashes)

Uploaded Sep 12, 2024 Python 3

Hashes for benchmark_adv_ml-0.1.20.tar.gz

Hashes for benchmark_adv_ml-0.1.20.tar.gz
Algorithm	Hash digest
SHA256	`5b5917e5bb2bc0cb30386a702835d18252b113e52cce8e81aef036c1889a1bf1`
MD5	`2f48acf895751bc3fb5bf4a8e8c66465`
BLAKE2b-256	`8cf8e864b477281b443100d8f72bf51597ce4ef6c9cb67e6422df3d13ccbc17c`

Hashes for benchmark_adv_ml-0.1.20-py3-none-any.whl

Hashes for benchmark_adv_ml-0.1.20-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ea97bac601c1d1c704003a35ccb87e7405e4ce56b5d85cf1265a2a0b1078b9d2`
MD5	`337e322899964031ca7ddf38fdc6ed8b`
BLAKE2b-256	`39583f7d0e5b660fc5a4fc168bb7278eae58dc68db7f8eda421195e2ec0a74a1`