Skip to main content

A package for EDA and Sci-Kit Learn visualisations and utilities

Project description

modelviz - Python package to make visualizations a breeze

image

GitHub Actions PyPI version Python 3.9 Python 3.10 Python 3.11 Python 3.12

modelviz is a Python package designed for comprehensive and customizable data visualization and model evaluation. With modules for visualizing relationships, confusion matrices, ROC curves, data distributions, and handling missing values, modelviz simplifies exploratory data analysis (EDA) and model performance evaluation.

Installation

Install modelviz via pip:

pip install modelviz

Features

1. Confusion Matrix (confusion_matrix.py)

  • Visualize Confusion Matrices:
    • Supports both binary and multi-class confusion matrices.
    • Displays proportions, TP, FP, FN, and TN labels.
    • Includes detailed metrics like Accuracy, Precision, Recall, F1 Score, MCC, and Cohen's Kappa.
    • Option to normalize the confusion matrix.

Example Usage:

from modelviz.confusion_matrix import plot_confusion_matrix
import numpy as np

cm = np.array([[50, 10], [5, 35]])  # Binary confusion matrix
classes = ["Negative", "Positive"]
plot_confusion_matrix(cm, classes, "Logistic Regression")

2. Histogram (histogram.py)

  • Feature Histograms:
    • Automatically generate histograms for all numeric columns in a pandas DataFrame.
    • Skip binary columns for cleaner visualizations.
    • Customize bins, colors, and titles.

Example Usage:

from modelviz.histogram import plot_feature_histograms
import pandas as pd

df = pd.DataFrame({
    'Age': [25, 30, 35, 40],
    'Income': [40000, 50000, 60000, 70000],
    'Gender': [0, 1, 0, 1]
})
plot_feature_histograms(df, exclude_binary=True, bins=10, color='blue')

3. ROC Curve (roc.py)

  • ROC Curve Visualization:
    • Plot Receiver Operating Characteristic (ROC) curves.
    • Highlight thresholds like Youden's J and adjusted thresholds.
    • Display key metrics like AUC (Area Under Curve).

Example Usage:

from modelviz.roc import plot_roc_curve_with_youdens_thresholds

fpr = [0.0, 0.1, 0.2, 0.3]
tpr = [0.0, 0.4, 0.6, 1.0]
thresholds = [1.0, 0.8, 0.5, 0.2]
plot_roc_curve_with_youdens_thresholds(fpr, tpr, thresholds, roc_auc=0.85, model_name="My Model")

4. Relationships (relationships.py)

  • Correlation Matrix:
    • Generate and visualize correlation matrices for numeric features.
    • Customize heatmaps with annotations, colormap, and figure size.

Example Usage:

from modelviz.relationships import plot_correlation_matrix
import pandas as pd

df = pd.DataFrame({
    'A': [1, 2, 3, 4],
    'B': [4, 3, 2, 1],
    'C': [5, 6, 7, 8]
})
plot_correlation_matrix(df, method='pearson')

5. K-Fold Visualization (kfold.py)

  • Visualize K-Fold Splits:
    • Display data distribution across training and validation sets for K-Fold Cross-Validation.
    • Easy visualization for understanding fold assignments.

6. Handling Missing Values (missvals.py)

  • Missing Value Analysis:
    • Visualize missing data in a DataFrame.
    • Quickly identify patterns and percentage of missing values.

7. Model Evaluation (model_eval.py)

  • Aggregate Model Metrics:
    • Summarize key evaluation metrics for multiple models.
    • Compare performance across models.

Importing the Package

Each module in the package is designed to be imported separately. For example:

from modelviz.confusion_matrix import plot_confusion_matrix
from modelviz.histogram import plot_feature_histograms
from modelviz.roc import plot_roc_curve_with_youdens_thresholds

Contributing

Contributions are welcome! If you have suggestions or new feature ideas, feel free to open an issue or create a pull request on GitHub.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

modelviz-2.2.0.tar.gz (26.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

modelviz-2.2.0-py3-none-any.whl (33.3 kB view details)

Uploaded Python 3

File details

Details for the file modelviz-2.2.0.tar.gz.

File metadata

  • Download URL: modelviz-2.2.0.tar.gz
  • Upload date:
  • Size: 26.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for modelviz-2.2.0.tar.gz
Algorithm Hash digest
SHA256 6f17876b21eb9ff4f269965b6e38680196940094f24082f3faf671c8a299bf54
MD5 46b64bec80e64add0a4cc6b8c863e98d
BLAKE2b-256 a868787257806104a8f25be188e3820daefef173c1cbee33e64c8ffcfd5e1eb8

See more details on using hashes here.

File details

Details for the file modelviz-2.2.0-py3-none-any.whl.

File metadata

  • Download URL: modelviz-2.2.0-py3-none-any.whl
  • Upload date:
  • Size: 33.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for modelviz-2.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 da64320349bdef3cb0401dba430f17184af012fa689098208d2faa9abdd15962
MD5 e87d7cbc7b08db92ca556396def50579
BLAKE2b-256 30585366ae420fccb4ef5140559ed9bcc4ed074d5f2d1e3785c2a6200ef07fbf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page