Skip to main content

A glass-box machine learning toolbox for interpretable pipelines

Project description

Glazzbocks

A transparent, interpretable machine learning framework.

Glazzbocks (pronounced "glass box") provides a modular and auditable pipeline for building, diagnosing, and interpreting machine learning models. Designed with interpretability and traceability at its core, Glazzbocks enables practitioners to go beyond accuracy and deliver insights that are explainable, defensible, and production-ready.


Why Glass Box ML?

Modern machine learning offers unprecedented predictive power, but too often at the cost of transparency. In high-stakes or regulatory environments, this trade-off is unacceptable.

Glazzbocks is built on the principle that powerful models should also be interpretable. Every component—from preprocessing to diagnostics and interpretation—is designed to remain visible, explainable, and auditable.

This framework promotes transparent, modular ML development where every decision and output can be inspected, traced, and justified.


Key Advantages of Glazzbocks

  • Full Interpretability: Native support for feature importances, coefficients, SHAP values, PDPs, and permutation importances
  • Auditable Pipelines: Clear step-by-step ML workflows using modular, scikit-learn-compatible structures
  • Diagnostic Depth: Includes error distributions, lift charts, cumulative gain, VIF analysis, skewness, normality tests, and more
  • Human-Centric Development: Designed for data scientists, analysts, and auditors who need to understand and explain model behavior—not just optimize accuracy

Components

ML_pipeline.py

End-to-end automation for classification and regression tasks.

  • Handles preprocessing of numerical and categorical features, with optional manual feature transformations
  • Supports any scikit-learn-compatible model
  • Allows user-defined transformation strategies for specific features (e.g., log, sqrt, yeo-johnson)
  • Includes train/test split and pipeline building
  • Performs cross-validation with detailed fold-wise metrics
  • Includes optional VIF analysis during cross-validation (linear models only)
  • Stores ROC, precision-recall, and threshold analysis (for classifiers)
  • Summarizes cross-validated performance across models

diagnostics.py

Automated visual diagnostics after training.

  • Classification: ROC, Confusion Matrix, F1 vs Threshold, Lift Chart, Gain Chart
  • Regression: Predicted vs Actual, Residual Plot, Error Distribution, Q-Q Plot
  • Auto-detects task type and generates all relevant visuals

modelinterpreter.py

Model interpretation & explainability utilities.

  • Tree-based models: Feature importances
  • Linear models: Coefficients (with plot support)
  • SHAP summary plots (with pipeline support)
  • Partial Dependence Plots (PDP)
  • Permutation Importance

data_explorer.py

Exploratory Data Analysis (EDA) for modeling decisions.

  • Auto-detects task type (regression/classification)
  • Displays shape, dtypes, missing values (via missingno matrix)
  • Visualizes target distribution and correlation heatmaps
  • Supports PDF report generation with visuals and curated tables
  • VIF for multicollinearity detection
  • Skewness and normality testing
  • Entropy calculation (for classification)
  • Automatically extracts datetime features (year, month, day, weekday)
  • Includes curated numeric and categorical summaries

Example Usage

from glazzbocks.ML_pipeline import MLPipeline
from glazzbocks.diagnostics import ModelDiagnostics
from glazzbocks.modelinterpreter import ModelInterpreter
from glazzbocks.data_explorer import DataExplorer

Notes

  • All components are sklearn-compatible and integrate seamlessly.
  • Visualizations are built using matplotlib, seaborn, missingno, or shap.
  • Pipelines auto-handle transformed features for compatibility with SHAP/PDP.
  • Designed to work in Jupyter Notebooks or production scripts.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

glazzbocks-0.1.7.tar.gz (24.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

glazzbocks-0.1.7-py3-none-any.whl (25.7 kB view details)

Uploaded Python 3

File details

Details for the file glazzbocks-0.1.7.tar.gz.

File metadata

  • Download URL: glazzbocks-0.1.7.tar.gz
  • Upload date:
  • Size: 24.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for glazzbocks-0.1.7.tar.gz
Algorithm Hash digest
SHA256 ef4565047376531d67b4e4a70940de217adabce5b6a475016a5e542aed06c5d2
MD5 3fd1db475ca4280566b7a4f69b333fbb
BLAKE2b-256 e160c65b5092652e192cfa76dcd5d715b7f481617b43d5eadf846ff3ad9bd449

See more details on using hashes here.

File details

Details for the file glazzbocks-0.1.7-py3-none-any.whl.

File metadata

  • Download URL: glazzbocks-0.1.7-py3-none-any.whl
  • Upload date:
  • Size: 25.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for glazzbocks-0.1.7-py3-none-any.whl
Algorithm Hash digest
SHA256 63111cc856cb7e97264407a80b32aae9dfa6d59c4fca40757367f82313fb87b4
MD5 2641c7658abc906391f93985758ae6aa
BLAKE2b-256 941a3139dfe1e7fa414f60d0f98b3d4463773f92e35c6362604f2e219e1e1271

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page