Skip to main content

Advanced benchmarking for machine learning models.

Project description

Benchmark-Adv-ML

Benchmark-Adv-ML is a Python package designed to facilitate advanced benchmarking of machine learning models. It provides a comprehensive pipeline to evaluate model stability, generate predictions, and visualize results through various plots, including AUC curves, feature importance, and radar charts.

Features

  • Model Stability Evaluation: Automatically runs multiple machine learning models (Logistic Regression, SVC, RandomForestClassifier) across multiple runs.
  • Prediction and Metrics: Generates and saves predictions, feature importance, and various metrics for each model and run.
  • Aggregation of Results: Aggregates results across runs and models for comprehensive analysis.
  • Visualization: Generates plots including AUC curves, AUC box plots, feature importance plots, and radar charts to compare model performance.

Installation

You can install the package directly from PyPI:

pip install benchmark-adv-ml

Usage in Unix

benchmark-adv-ml --data ./your_dataset.csv --output ./final_results --prelim_output ./prelim_results --n_runs 10 --seed 42

Useage in python

python -m benchmark_adv_ml benchmark --data ./Raisin_Dataset.data --output ./final_results --prelim_output ./prelim_results --n_runs 10 --seed 42

Train Autoencoder Model

python -m benchmark_adv_ml autoencoder --data ./Raisin_Dataset.data --epochs 10 --output_dir ./final_results/ --prelim_output ./prelim_results/ --latent_dim 10 --batch_size 32 --validation_split 0.1 --test_size 0.2 --seed 42

Command-Line Arguments

--data: Path to the existing CSV file containing the dataset. --output: Directory to save the final results and plots. --target : Target column name in the dataset. ( default : 'label') --prelim_output: Directory to save the preliminary results (predictions). --n_runs: Number of runs for model stability evaluation (default is 20). --seed: Seed for random state (default is 42).

Example run : Benchmark Code

benchmark-adv-ml --data ./your_dataset.csv --output ./final_results --prelim_output ./prelim_results --n_runs 10 --seed 42

Dependencies

Python 3.11+ seaborn scikit-learn pandas numpy matplotlib

License

This project is licensed under the MIT License - see the LICENSE file for details.

Author

Vatsal Pate - VatsalPatel18

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

benchmark_adv_ml-0.1.20.tar.gz (69.5 kB view hashes)

Uploaded Source

Built Distribution

benchmark_adv_ml-0.1.20-py3-none-any.whl (76.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page