Skip to main content

Glassbox ML with EDA, diagnostics, and AI-powered reporting

Project description

Glazzbocks

A transparent, interpretable machine learning framework.

Glazzbocks (pronounced "glass box") provides a modular and auditable pipeline for building, diagnosing, and interpreting machine learning models. Designed with interpretability and traceability at its core, Glazzbocks enables practitioners to go beyond accuracy and deliver insights that are explainable, defensible, and production-ready.


Why Glass Box ML?

Modern machine learning offers unprecedented predictive power, but too often at the cost of transparency. In high-stakes or regulatory environments, this trade-off is unacceptable.

Glazzbocks is built on the principle that powerful models should also be interpretable. Every component—from preprocessing to diagnostics and interpretation—is designed to remain visible, explainable, and auditable.

This framework promotes transparent, modular ML development where every decision and output can be inspected, traced, and justified.


Key Advantages of Glazzbocks

  • Full Interpretability: Native support for feature importances, coefficients, SHAP values, PDPs, and permutation importances
  • Auditable Pipelines: Clear step-by-step ML workflows using modular, scikit-learn-compatible structures
  • Diagnostic Depth: Includes error distributions, lift charts, cumulative gain, VIF analysis, skewness, normality tests, and more
  • Human-Centric Development: Designed for data scientists, analysts, and auditors who need to understand and explain model behavior—not just optimize accuracy

Components

ML_pipeline.py

End-to-end automation for classification and regression tasks.

  • Handles preprocessing of numerical and categorical features, with optional manual feature transformations
  • Supports any scikit-learn-compatible model
  • Allows user-defined transformation strategies for specific features (e.g., log, sqrt, yeo-johnson)
  • Includes train/test split and pipeline building
  • Performs cross-validation with detailed fold-wise metrics
  • Includes optional VIF analysis during cross-validation (linear models only)
  • Stores ROC, precision-recall, and threshold analysis (for classifiers)
  • Summarizes cross-validated performance across models

diagnostics.py

Automated visual diagnostics after training.

  • Classification: ROC, Confusion Matrix, F1 vs Threshold, Lift Chart, Gain Chart
  • Regression: Predicted vs Actual, Residual Plot, Error Distribution, Q-Q Plot
  • Auto-detects task type and generates all relevant visuals

modelinterpreter.py

Model interpretation & explainability utilities.

  • Tree-based models: Feature importances
  • Linear models: Coefficients (with plot support)
  • SHAP summary plots (with pipeline support)
  • Partial Dependence Plots (PDP)
  • Permutation Importance

data_explorer.py

Exploratory Data Analysis (EDA) for modeling decisions.

  • Auto-detects task type (regression/classification)
  • Displays shape, dtypes, missing values (via missingno matrix)
  • Visualizes target distribution and correlation heatmaps
  • Supports PDF report generation with visuals and curated tables
  • VIF for multicollinearity detection
  • Skewness and normality testing
  • Entropy calculation (for classification)
  • Automatically extracts datetime features (year, month, day, weekday)
  • Includes curated numeric and categorical summaries

Example Usage

from glazzbocks.ML_pipeline import MLPipeline
from glazzbocks.diagnostics import ModelDiagnostics
from glazzbocks.modelinterpreter import ModelInterpreter
from glazzbocks.data_explorer import DataExplorer

Notes

  • All components are sklearn-compatible and integrate seamlessly.
  • Visualizations are built using matplotlib, seaborn, missingno, or shap.
  • Pipelines auto-handle transformed features for compatibility with SHAP/PDP.
  • Designed to work in Jupyter Notebooks or production scripts.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

glazzbocks-0.2.2.tar.gz (61.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

glazzbocks-0.2.2-py3-none-any.whl (50.4 kB view details)

Uploaded Python 3

File details

Details for the file glazzbocks-0.2.2.tar.gz.

File metadata

  • Download URL: glazzbocks-0.2.2.tar.gz
  • Upload date:
  • Size: 61.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for glazzbocks-0.2.2.tar.gz
Algorithm Hash digest
SHA256 fc9ec3b79ec31d28e502c233b92ff1d01b971be935094951ae58f64b49fb124c
MD5 419fd658f90d2bd67818d7ca09fceda3
BLAKE2b-256 15543b81d6e96a2de5ae9d77867fcb13c10155906f93eb42ebc01919ec8db639

See more details on using hashes here.

File details

Details for the file glazzbocks-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: glazzbocks-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 50.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for glazzbocks-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 e17d9ab5393479c0ece8656319afbfb723b44fb05f18baffb3fc86d99b601eae
MD5 7b98974a77eab36b846967099d880620
BLAKE2b-256 ee71de75aaa70d4b2f3eece346b6b84ef43c902dc4ff1b98c8e0908562359dbf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page