mlbench-lite
A simple machine learning benchmarking library that provides an easy way to compare multiple ML models on your dataset. Built with scikit-learn and pandas for seamless integration into your ML workflow.
🚀 Features
- Simple API: One function call to benchmark multiple models
- Built-in Models: Includes Logistic Regression, Random Forest, and SVM
- Comprehensive Metrics: Returns Accuracy, Precision, Recall, and F1 scores
- Custom Dataset: Includes the load_clover dataset for testing
- Easy Integration: Works seamlessly with scikit-learn datasets
- Pandas Output: Results returned as a clean pandas DataFrame
- Reproducible: Consistent results with random state control
📦 Installation
pip install mlbench-lite
🎯 Quick Start
from mlbench_lite import benchmark, load_clover
# Load the clover dataset
X, y = load_clover(return_X_y=True)
# Benchmark multiple models
results = benchmark(X, y)
print(results)
Output:
Model Accuracy Precision Recall F1
0 Random Forest 0.9500 0.9565 0.9512 0.9505
1 SVM 0.9250 0.9337 0.9255 0.9254
2 Logistic Regression 0.9125 0.9131 0.9117 0.9115
📚 API Reference
benchmark(X, y, test_size=0.2, random_state=42)
Benchmark multiple machine learning models on a dataset.
Parameters:
- X (array-like): Training vectors of shape (n_samples, n_features)
- y (array-like): Target values of shape (n_samples,)
- test_size (float, optional): Proportion of the dataset used for testing (default: 0.2)
- random_state (int, optional): Random seed for reproducibility (default: 42)
Returns:
pandas.DataFrame: One row per model, sorted by Accuracy in descending order, with columns:
- Model: Name of the model
- Accuracy: Accuracy score
- Precision: Precision score (macro-averaged)
- Recall: Recall score (macro-averaged)
- F1: F1 score (macro-averaged)
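For orientation, here is a minimal sketch of what a benchmark call does conceptually: split the data, fit each of the default models, score it with macro-averaged metrics, and return the results sorted by accuracy. The function name benchmark_sketch and the exact estimator settings are assumptions for illustration, not the library's actual implementation.

from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score
import pandas as pd

def benchmark_sketch(X, y, test_size=0.2, random_state=42):
    # Hold out a test split, mirroring benchmark's documented defaults
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=random_state
    )
    models = {
        "Logistic Regression": LogisticRegression(random_state=random_state),
        "Random Forest": RandomForestClassifier(random_state=random_state),
        "SVM": SVC(random_state=random_state),
    }
    rows = []
    for name, model in models.items():
        model.fit(X_train, y_train)
        y_pred = model.predict(X_test)
        rows.append({
            "Model": name,
            "Accuracy": accuracy_score(y_test, y_pred),
            "Precision": precision_score(y_test, y_pred, average="macro"),
            "Recall": recall_score(y_test, y_pred, average="macro"),
            "F1": f1_score(y_test, y_pred, average="macro"),
        })
    # Best model first, matching the DataFrame described above
    return pd.DataFrame(rows).sort_values("Accuracy", ascending=False).reset_index(drop=True)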
load_clover(return_X_y=False)
Load the custom clover dataset.
Parameters:
- return_X_y (bool, default=False): If True, returns a (data, target) tuple instead of a Bunch object
Returns:
Bunch or tuple: A Bunch object with data, target, feature_names, target_names, and DESCR attributes, or a (data, target) tuple when return_X_y=True
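A quick illustration of the two return styles, assuming the Bunch exposes the attributes listed above:

from mlbench_lite import load_clover

# Bunch-style access
clover = load_clover()
print(clover.data.shape)        # expected: (400, 4)
print(clover.feature_names)
print(clover.target_names)

# Tuple-style access
X, y = load_clover(return_X_y=True)
print(X.shape, y.shape)         # expected: (400, 4) (400,)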
💡 Code Examples
1. Basic Usage with Clover Dataset
from mlbench_lite import benchmark, load_clover
# Load the clover dataset
X, y = load_clover(return_X_y=True)
print(f"Dataset shape: {X.shape}")
print(f"Number of classes: {len(set(y))}")
# Benchmark models
results = benchmark(X, y)
print("\nBenchmark Results:")
print(results)
# Get the best model
best_model = results.iloc[0]
print(f"\n🏆 Best Model: {best_model['Model']} (Accuracy: {best_model['Accuracy']:.4f})")
2. Using with Scikit-learn Datasets
from mlbench_lite import benchmark
from sklearn.datasets import load_wine, load_breast_cancer
# Test with Wine dataset
print("=== Wine Dataset ===")
X, y = load_wine(return_X_y=True)
results = benchmark(X, y)
print(results)
# Test with Breast Cancer dataset
print("\n=== Breast Cancer Dataset ===")
X, y = load_breast_cancer(return_X_y=True)
results = benchmark(X, y)
print(results)
3. Custom Test Size
from mlbench_lite import benchmark, load_clover
X, y = load_clover(return_X_y=True)
# Use 30% of data for testing
results = benchmark(X, y, test_size=0.3)
print("Results with 30% test size:")
print(results)
# Use 10% of data for testing
results = benchmark(X, y, test_size=0.1)
print("\nResults with 10% test size:")
print(results)
4. Reproducible Results
from mlbench_lite import benchmark, load_clover
X, y = load_clover(return_X_y=True)
# Set random seed for reproducible results
results1 = benchmark(X, y, random_state=123)
results2 = benchmark(X, y, random_state=123)
print("Results with random_state=123:")
print(results1)
print(f"\nResults are identical: {results1.equals(results2)}")
# Different random state produces different results
results3 = benchmark(X, y, random_state=456)
print(f"\nDifferent random state produces different results: {not results1.equals(results3)}")
5. Working with Synthetic Data
from mlbench_lite import benchmark
from sklearn.datasets import make_classification
# Create synthetic dataset
X, y = make_classification(
    n_samples=1000,
    n_features=20,
    n_informative=15,
    n_classes=4,
    random_state=42
)
print(f"Synthetic dataset shape: {X.shape}")
print(f"Number of classes: {len(set(y))}")
results = benchmark(X, y)
print("\nBenchmark Results:")
print(results)
6. Analyzing Results
from mlbench_lite import benchmark, load_clover
import pandas as pd
X, y = load_clover(return_X_y=True)
results = benchmark(X, y)
# Display results with better formatting
print("Detailed Results:")
print("=" * 60)
for idx, row in results.iterrows():
    print(f"{row['Model']:20} | Acc: {row['Accuracy']:.4f} | "
          f"Prec: {row['Precision']:.4f} | Rec: {row['Recall']:.4f} | "
          f"F1: {row['F1']:.4f}")
# Find models with accuracy > 0.9
high_accuracy = results[results['Accuracy'] > 0.9]
print(f"\nModels with accuracy > 0.9: {len(high_accuracy)}")
# Calculate average metrics
avg_metrics = results[['Accuracy', 'Precision', 'Recall', 'F1']].mean()
print(f"\nAverage metrics across all models:")
for metric, value in avg_metrics.items():
    print(f"  {metric}: {value:.4f}")
7. Comparing Different Datasets
from mlbench_lite import benchmark, load_clover
from sklearn.datasets import load_wine, load_breast_cancer
datasets = [
    ("Clover", load_clover(return_X_y=True)),
    ("Wine", load_wine(return_X_y=True)),
    ("Breast Cancer", load_breast_cancer(return_X_y=True))
]
print("Dataset Comparison:")
print("=" * 80)
for name, (X, y) in datasets:
    print(f"\n{name} Dataset:")
    print(f"  Shape: {X.shape}, Classes: {len(set(y))}")
    results = benchmark(X, y)
    best_acc = results.iloc[0]['Accuracy']
    best_model = results.iloc[0]['Model']
    print(f"  Best Model: {best_model} (Accuracy: {best_acc:.4f})")
    # Show top 2 models
    print("  Top 2 Models:")
    for idx, row in results.head(2).iterrows():
        print(f"    {row['Model']}: {row['Accuracy']:.4f}")
🔬 Models Included
The library benchmarks the following models by default:
- Logistic Regression: Linear model for classification
  - Uses default scikit-learn parameters
  - Good for linear relationships
- Random Forest: Ensemble of decision trees
  - Uses default scikit-learn parameters
  - Good for non-linear relationships and feature importance
- SVM: Support Vector Machine with RBF kernel
  - Uses default scikit-learn parameters
  - Good for complex decision boundaries
All models use their default scikit-learn parameters with appropriate random seeds for reproducibility.
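To drill into a single model beyond the summary table, you can refit it yourself with the same split settings and inspect a full classification report. This is a usage sketch that mirrors benchmark's documented defaults (20% test split, random_state=42); since benchmark returns only the metrics table, refitting is needed to get at the estimator itself.

from mlbench_lite import load_clover
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report

X, y = load_clover(return_X_y=True)
# Mirror the benchmark defaults so numbers are comparable
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
clf = RandomForestClassifier(random_state=42).fit(X_train, y_train)
print(classification_report(y_test, clf.predict(X_test)))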
📊 Clover Dataset Details
The load_clover function provides a custom synthetic dataset:
- Samples: 400
- Features: 4
- Classes: 4
Features:
- leaf_length: Length of the leaf in cm
- leaf_width: Width of the leaf in cm
- petiole_length: Length of the petiole in cm
- leaflet_count: Number of leaflets per leaf
Classes:
- white_clover: Trifolium repens
- red_clover: Trifolium pratense
- crimson_clover: Trifolium incarnatum
- alsike_clover: Trifolium hybridum
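To eyeball the data with the feature and class names above, you can wrap it in a pandas DataFrame (this relies on the Bunch attributes listed in the API reference):

from mlbench_lite import load_clover
import pandas as pd

clover = load_clover()
df = pd.DataFrame(clover.data, columns=clover.feature_names)
df["species"] = [clover.target_names[i] for i in clover.target]
print(df["species"].value_counts())   # samples per class
print(df.groupby("species").mean())   # per-class feature averages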
🛠️ Requirements
- Python >= 3.8
- scikit-learn >= 1.0.0
- pandas >= 1.3.0
- numpy >= 1.20.0
🧪 Testing
Run the test suite to verify everything works:
# Run all tests
python -m pytest tests/ -v
# Run with coverage
python -m pytest tests/ --cov=mlbench_lite
# Quick functionality test
python -c "from mlbench_lite import benchmark, load_clover; X, y = load_clover(return_X_y=True); results = benchmark(X, y); print(results)"
🚀 Development
Setup Development Environment
git clone https://github.com/yourusername/mlbench-lite.git
cd mlbench-lite
pip install -e ".[dev]"
Code Quality
# Format code
black mlbench_lite tests
# Lint code
flake8 mlbench_lite tests
# Type checking
mypy mlbench_lite
Building for Distribution
# Build package
python -m build
# Upload to PyPI
twine upload dist/*
🤝 Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (git checkout -b feature/amazing-feature)
- Commit your changes (git commit -m 'Add some amazing feature')
- Push to the branch (git push origin feature/amazing-feature)
- Open a Pull Request
📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
📈 Changelog
0.1.0 (2024-01-XX)
- Initial release
- Basic benchmarking functionality
- Support for Logistic Regression, Random Forest, and SVM
- Comprehensive metrics (Accuracy, Precision, Recall, F1)
- Custom clover dataset
- Full test coverage
- PyPI ready
🆘 Support
If you encounter any issues or have questions:
- Check the Issues page
- Create a new issue with detailed information
- Include code examples and error messages
🙏 Acknowledgments
- Built with scikit-learn
- Uses pandas for data handling
- Inspired by the need for simple ML benchmarking tools
File details
Details for the file mlbench_lite-0.1.0.tar.gz.
File metadata
- Download URL: mlbench_lite-0.1.0.tar.gz
- Upload date:
- Size: 9.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.9
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | f3810532682dfca01c30c45cdffa7ada433ea55311a365d7ae7ecf605e29dede |
| MD5 | 2515ecdb5c08300e918dc3be38b32813 |
| BLAKE2b-256 | 7a5dc359721067ca166f419f0101a51638faeb4e7c25ccd3eb7afb3e6883ec12 |
File details
Details for the file mlbench_lite-0.1.0-py3-none-any.whl.
File metadata
- Download URL: mlbench_lite-0.1.0-py3-none-any.whl
- Upload date:
- Size: 8.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.9
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | ce0510cdfc4435f16dbe48241fb645f5afd0e486dd60b1705463d0e6929a47a0 |
| MD5 | 41f99fc0a416acb7f74e261d9d70c1ca |
| BLAKE2b-256 | c76ffa3faa71017f516cf2f0298e1d82154c6b827c4e9008795c61480d8600b1 |