Skip to main content

Glassbox ML with EDA, diagnostics, and AI-powered reporting

Project description

Glazzbocks

A transparent, interpretable machine learning framework.

Glazzbocks (pronounced "glass box") provides a modular and auditable pipeline for building, diagnosing, and interpreting machine learning models. Designed with interpretability and traceability at its core, Glazzbocks enables practitioners to go beyond accuracy and deliver insights that are explainable, defensible, and production-ready.


Why Glass Box ML?

Modern machine learning offers unprecedented predictive power, but too often at the cost of transparency. In high-stakes or regulatory environments, this trade-off is unacceptable.

Glazzbocks is built on the principle that powerful models should also be interpretable. Every component—from preprocessing to diagnostics and interpretation—is designed to remain visible, explainable, and auditable.

This framework promotes transparent, modular ML development where every decision and output can be inspected, traced, and justified.


Key Advantages of Glazzbocks

  • Full Interpretability: Native support for feature importances, coefficients, SHAP values, PDPs, and permutation importances
  • Auditable Pipelines: Clear step-by-step ML workflows using modular, scikit-learn-compatible structures
  • Diagnostic Depth: Includes error distributions, lift charts, cumulative gain, VIF analysis, skewness, normality tests, and more
  • Human-Centric Development: Designed for data scientists, analysts, and auditors who need to understand and explain model behavior—not just optimize accuracy

Components

ML_pipeline.py

End-to-end automation for classification and regression tasks.

  • Handles preprocessing of numerical and categorical features, with optional manual feature transformations
  • Supports any scikit-learn-compatible model
  • Allows user-defined transformation strategies for specific features (e.g., log, sqrt, yeo-johnson)
  • Includes train/test split and pipeline building
  • Performs cross-validation with detailed fold-wise metrics
  • Includes optional VIF analysis during cross-validation (linear models only)
  • Stores ROC, precision-recall, and threshold analysis (for classifiers)
  • Summarizes cross-validated performance across models

diagnostics.py

Automated visual diagnostics after training.

  • Classification: ROC, Confusion Matrix, F1 vs Threshold, Lift Chart, Gain Chart
  • Regression: Predicted vs Actual, Residual Plot, Error Distribution, Q-Q Plot
  • Auto-detects task type and generates all relevant visuals

modelinterpreter.py

Model interpretation & explainability utilities.

  • Tree-based models: Feature importances
  • Linear models: Coefficients (with plot support)
  • SHAP summary plots (with pipeline support)
  • Partial Dependence Plots (PDP)
  • Permutation Importance

data_explorer.py

Exploratory Data Analysis (EDA) for modeling decisions.

  • Auto-detects task type (regression/classification)
  • Displays shape, dtypes, missing values (via missingno matrix)
  • Visualizes target distribution and correlation heatmaps
  • Supports PDF report generation with visuals and curated tables
  • VIF for multicollinearity detection
  • Skewness and normality testing
  • Entropy calculation (for classification)
  • Automatically extracts datetime features (year, month, day, weekday)
  • Includes curated numeric and categorical summaries

Example Usage

from glazzbocks.ML_pipeline import MLPipeline
from glazzbocks.diagnostics import ModelDiagnostics
from glazzbocks.modelinterpreter import ModelInterpreter
from glazzbocks.data_explorer import DataExplorer

Notes

  • All components are sklearn-compatible and integrate seamlessly.
  • Visualizations are built using matplotlib, seaborn, missingno, or shap.
  • Pipelines auto-handle transformed features for compatibility with SHAP/PDP.
  • Designed to work in Jupyter Notebooks or production scripts.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

glazzbocks-0.2.0.tar.gz (56.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

glazzbocks-0.2.0-py3-none-any.whl (45.2 kB view details)

Uploaded Python 3

File details

Details for the file glazzbocks-0.2.0.tar.gz.

File metadata

  • Download URL: glazzbocks-0.2.0.tar.gz
  • Upload date:
  • Size: 56.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for glazzbocks-0.2.0.tar.gz
Algorithm Hash digest
SHA256 4090fb51c74e7d5240a245e4b2e952dd079015b858099f409ff80a24f3d7cc22
MD5 e13dcdd88fcb662c5b5b706a4d7c05ea
BLAKE2b-256 bf5ce5a241b3149ed827eca82fcfa9130ad63282b163150b46a237aca10f8efa

See more details on using hashes here.

File details

Details for the file glazzbocks-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: glazzbocks-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 45.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for glazzbocks-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7872a4eb286c2f6fe304da5e9bf5c9349db1f3d35bc28e2766158220559c2b34
MD5 ad66c0a2210d792761faf3fa5281d859
BLAKE2b-256 672e296daabe0ded7ce58ab9ef460a4502302bc3e8eafbebff718e5e87a685c8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page