# extended-sklearn-metrics
A comprehensive Python library for evaluating both regression and classification models with advanced metrics, custom thresholds, and beautiful visualizations.
## Features

### 🔧 Core Functionality

- **Regression & Classification Support** - Comprehensive evaluation for both problem types
- **Cross-Validation Based Evaluation** - Robust performance estimation
- **Full sklearn Pipeline Support** - Works seamlessly with preprocessing pipelines
- **Advanced Input Validation** - Comprehensive error checking and helpful warnings
- **Type Hints** - Full typing support for better IDE integration
### 📊 Metrics & Evaluation

- **Regression**: RMSE, MAE, R², Explained Variance with percentage calculations
- **Classification**: Accuracy, Precision, Recall, F1-Score, ROC AUC (binary)
- **Custom Performance Thresholds** - Define your own success criteria
- **Performance Classification** - Automatic categorization (Excellent, Good, etc.)
- **Multi-class Support** - Flexible averaging strategies (micro, macro, weighted)
### 📈 Visualization & Reporting

- **Interactive Plots** - Performance summaries, model comparisons, radar charts
- **Rich Console Reports** - Formatted reports with recommendations and emojis
- **Model Comparison** - Side-by-side comparison of multiple models
- **Pipeline Compatibility** - All features work with complex preprocessing pipelines
## Installation

### From PyPI

```bash
pip install extended-sklearn-metrics
```

### From Source

1. Clone this repository:

   ```bash
   git clone https://github.com/SubaashNair/extended-sklearn-metrics.git
   cd extended-sklearn-metrics
   ```

2. Install dependencies:

   ```bash
   pip install -r requirements.txt
   ```
## Usage

Here's a simple example using the California Housing dataset:

```python
from sklearn.datasets import fetch_california_housing
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LinearRegression
from extended_sklearn_metrics import evaluate_model_with_cross_validation

# Load and prepare data
housing = fetch_california_housing(as_frame=True)
X_train, X_test, y_train, y_test = train_test_split(
    housing.data, housing.target, test_size=0.2, random_state=42
)

# Scale features
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)

# Create the model and compute the target range
# (used to express RMSE/MAE as a percentage of the range)
model = LinearRegression()
target_range = y_train.max() - y_train.min()

# Get performance metrics
performance_table = evaluate_model_with_cross_validation(
    model=model,
    X=X_train_scaled,
    y=y_train,
    cv=5,
    target_range=target_range
)
print(performance_table)
```
## Pipeline Support

The library works seamlessly with sklearn pipelines, including complex preprocessing:

```python
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler, OneHotEncoder
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LinearRegression, Ridge
from extended_sklearn_metrics import evaluate_model_with_cross_validation

# Basic pipeline
pipeline = Pipeline([
    ('scaler', StandardScaler()),
    ('regressor', LinearRegression())
])

# Complex pipeline with mixed data types
# (numeric_columns / categorical_columns are lists of column names in X_train)
preprocessor = ColumnTransformer([
    ('num', StandardScaler(), numeric_columns),
    ('cat', OneHotEncoder(drop='first'), categorical_columns)
])
complex_pipeline = Pipeline([
    ('preprocessor', preprocessor),
    ('regressor', Ridge(alpha=1.0))
])

# Evaluate any pipeline the same way
result = evaluate_model_with_cross_validation(
    model=pipeline,  # or complex_pipeline
    X=X_train,
    y=y_train,
    cv=5
)
```
## Classification Support

Evaluate classification models with comprehensive metrics:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from extended_sklearn_metrics import evaluate_classification_model_with_cross_validation

# Load data
iris = load_iris(as_frame=True)
X_train, X_test, y_train, y_test = train_test_split(iris.data, iris.target, test_size=0.3)

# Evaluate classifier
model = RandomForestClassifier(n_estimators=100, random_state=42)
result = evaluate_classification_model_with_cross_validation(
    model=model,
    X=X_train,
    y=y_train,
    cv=5,
    average='weighted'  # Options: 'micro', 'macro', 'weighted'
)
```
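The `average` parameter follows scikit-learn's standard averaging conventions. A minimal sketch of how the three strategies differ, using `sklearn.metrics.f1_score` directly on a made-up imbalanced toy problem (the labels here are purely illustrative):

```python
from sklearn.metrics import f1_score

# Imbalanced toy labels: class 0 dominates, class 2 has one sample
y_true = [0, 0, 0, 0, 1, 1, 2]
y_pred = [0, 0, 0, 1, 1, 2, 2]

micro = f1_score(y_true, y_pred, average='micro')        # pooled global counts
macro = f1_score(y_true, y_pred, average='macro')        # unweighted mean over classes
weighted = f1_score(y_true, y_pred, average='weighted')  # mean weighted by class support

print(micro, macro, weighted)
```

`macro` treats all classes equally regardless of size, while `weighted` lets large classes dominate, so the choice matters most on imbalanced data.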
## Custom Performance Thresholds

Define your own success criteria:

```python
from extended_sklearn_metrics import CustomThresholds

# Create custom thresholds
custom_thresholds = CustomThresholds(
    error_thresholds=(5, 15, 25),  # RMSE/MAE: < 5% = Excellent, 5-15% = Good, etc.
    score_thresholds=(0.6, 0.8)    # R²/Accuracy: > 0.8 = Good, 0.6-0.8 = Acceptable
)

# Use with any evaluation function
result = evaluate_model_with_cross_validation(
    model=model,
    X=X_train,
    y=y_train,
    cv=5,
    custom_thresholds=custom_thresholds
)
```
## Visualizations & Rich Reports

Create visualizations and formatted reports:

```python
from extended_sklearn_metrics import (
    create_performance_summary_plot,
    create_model_comparison_plot,
    create_performance_radar_chart,
    print_performance_report
)

# Rich console report with recommendations
print_performance_report(result)

# Interactive plots (requires matplotlib)
create_performance_summary_plot(result, title="Model Performance")
create_performance_radar_chart(result)

# Compare multiple models
models_results = {
    'LinearRegression': result1,
    'RandomForest': result2,
    'SVM': result3
}
create_model_comparison_plot(models_results)
```
## Output Format

The library returns a pandas DataFrame with the following columns:

| Column | Description |
|---|---|
| Metric | Name of the metric (RMSE, MAE, R², etc.) |
| Value | Computed value of the metric |
| Threshold | Thresholds used for performance classification |
| Calculation | Formula/method used to compute the metric |
| Performance | Classification (Excellent, Good, Moderate, Poor) |
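Since the result is an ordinary DataFrame, it can be filtered with standard pandas operations, e.g. to flag weak metrics. A small sketch; only the column names come from the table above, the row values are fabricated for illustration:

```python
import pandas as pd

# Mock of the evaluation output; real values come from the library
performance_table = pd.DataFrame({
    'Metric': ['RMSE', 'MAE', 'R²'],
    'Value': [0.72, 0.53, 0.48],
    'Threshold': ['< 10% of range', '< 10% of range', '> 0.7'],
    'Calculation': ['sqrt(mean((y - y_hat)^2))', 'mean(|y - y_hat|)', '1 - SS_res / SS_tot'],
    'Performance': ['Good', 'Excellent', 'Poor'],
})

# Select metrics classified as Moderate or Poor
needs_attention = performance_table[
    performance_table['Performance'].isin(['Moderate', 'Poor'])
]
print(needs_attention['Metric'].tolist())
```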
## Performance Thresholds

### RMSE and MAE

- < 10% of range: Excellent
- 10%-20% of range: Good
- 20%-30% of range: Moderate
- \> 30% of range: Poor

### R² and Explained Variance

- \> 0.7: Good
- 0.5-0.7: Acceptable
- < 0.5: Poor
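In plain Python, the default rules above amount to the following. This is a sketch of the classification logic, not the library's actual implementation:

```python
def classify_error(error_pct: float) -> str:
    """Classify RMSE/MAE expressed as a percentage of the target range."""
    if error_pct < 10:
        return 'Excellent'
    if error_pct < 20:
        return 'Good'
    if error_pct < 30:
        return 'Moderate'
    return 'Poor'

def classify_score(score: float) -> str:
    """Classify R² or explained variance."""
    if score > 0.7:
        return 'Good'
    if score >= 0.5:
        return 'Acceptable'
    return 'Poor'

print(classify_error(12.5), classify_score(0.65))
```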
## Requirements

### Core Dependencies

- Python 3.8+
- pandas >= 2.0.0
- scikit-learn >= 1.3.0
- numpy >= 1.24.0

### Optional Dependencies

- matplotlib >= 3.0.0 (for visualizations)

Install with visualization support:

```bash
pip install extended-sklearn-metrics matplotlib
```
## API Reference

### Core Functions

`evaluate_model_with_cross_validation(model, X, y, cv=5, target_range=None, custom_thresholds=None)`
Evaluate regression models with comprehensive metrics.

`evaluate_classification_model_with_cross_validation(model, X, y, cv=5, average='weighted')`
Evaluate classification models with comprehensive metrics.

`CustomThresholds(error_thresholds=(10, 20, 30), score_thresholds=(0.5, 0.7))`
Define custom performance thresholds for evaluation.

### Visualization Functions

`print_performance_report(results)`
Print a formatted console report with recommendations.

`create_performance_summary_plot(results, title=None, figsize=(10, 6))`
Create bar charts showing metric values and performance categories.

`create_model_comparison_plot(results_dict, figsize=(12, 8))`
Compare multiple models side-by-side.

`create_performance_radar_chart(results, title=None, figsize=(8, 8))`
Create a radar/spider chart visualization.
## Examples

The library includes comprehensive examples in the `examples/` directory:

- `california_housing_example.py` - Basic regression usage
- `pipeline_examples.py` - Pipeline integration examples
- `classification_examples.py` - Classification evaluation examples
- `custom_thresholds_example.py` - Custom threshold usage
- `visualization_examples.py` - Visualization capabilities
## License
This project is licensed under the MIT License - see the LICENSE file for details.