Glassbox ML with EDA, diagnostics, and AI-powered reporting
Project description
Glazzbocks
A transparent, interpretable machine learning framework.
Glazzbocks (pronounced "glass box") provides a modular and auditable pipeline for building, diagnosing, and interpreting machine learning models. Designed with interpretability and traceability at its core, Glazzbocks enables practitioners to go beyond accuracy and deliver insights that are explainable, defensible, and production-ready.
Why Glass Box ML?
Modern machine learning offers unprecedented predictive power, but too often at the cost of transparency. In high-stakes or regulatory environments, this trade-off is unacceptable.
Glazzbocks is built on the principle that powerful models should also be interpretable. Every component—from preprocessing to diagnostics and interpretation—is designed to remain visible, explainable, and auditable.
This framework promotes transparent, modular ML development where every decision and output can be inspected, traced, and justified.
Key Advantages of Glazzbocks
- Full Interpretability: Native support for feature importances, coefficients, SHAP values, PDPs, and permutation importances
- Auditable Pipelines: Clear step-by-step ML workflows using modular, scikit-learn-compatible structures
- Diagnostic Depth: Includes error distributions, lift charts, cumulative gain, VIF analysis, skewness, normality tests, and more
- Human-Centric Development: Designed for data scientists, analysts, and auditors who need to understand and explain model behavior—not just optimize accuracy
Components
ML_pipeline.py
End-to-end automation for classification and regression tasks.
- Handles preprocessing of numerical and categorical features, with optional manual feature transformations
- Supports any scikit-learn-compatible model
- Allows user-defined transformation strategies for specific features (e.g., log, sqrt, yeo-johnson)
- Includes train/test split and pipeline building
- Performs cross-validation with detailed fold-wise metrics
- Includes optional VIF analysis during cross-validation (linear models only)
- Stores ROC, precision-recall, and threshold analysis (for classifiers)
- Summarizes cross-validated performance across models
diagnostics.py
Automated visual diagnostics after training.
- Classification: ROC, Confusion Matrix, F1 vs Threshold, Lift Chart, Gain Chart
- Regression: Predicted vs Actual, Residual Plot, Error Distribution, Q-Q Plot
- Auto-detects task type and generates all relevant visuals
modelinterpreter.py
Model interpretation & explainability utilities.
- Tree-based models: Feature importances
- Linear models: Coefficients (with plot support)
- SHAP summary plots (with pipeline support)
- Partial Dependence Plots (PDP)
- Permutation Importance
data_explorer.py
Exploratory Data Analysis (EDA) for modeling decisions.
- Auto-detects task type (regression/classification)
- Displays shape, dtypes, missing values (via
missingnomatrix) - Visualizes target distribution and correlation heatmaps
- Supports PDF report generation with visuals and curated tables
- VIF for multicollinearity detection
- Skewness and normality testing
- Entropy calculation (for classification)
- Automatically extracts datetime features (year, month, day, weekday)
- Includes curated numeric and categorical summaries
Example Usage
from glazzbocks.ML_pipeline import MLPipeline
from glazzbocks.diagnostics import ModelDiagnostics
from glazzbocks.modelinterpreter import ModelInterpreter
from glazzbocks.data_explorer import DataExplorer
Notes
- All components are sklearn-compatible and integrate seamlessly.
- Visualizations are built using
matplotlib,seaborn,missingno, orshap. - Pipelines auto-handle transformed features for compatibility with SHAP/PDP.
- Designed to work in Jupyter Notebooks or production scripts.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file glazzbocks-0.2.2.tar.gz.
File metadata
- Download URL: glazzbocks-0.2.2.tar.gz
- Upload date:
- Size: 61.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fc9ec3b79ec31d28e502c233b92ff1d01b971be935094951ae58f64b49fb124c
|
|
| MD5 |
419fd658f90d2bd67818d7ca09fceda3
|
|
| BLAKE2b-256 |
15543b81d6e96a2de5ae9d77867fcb13c10155906f93eb42ebc01919ec8db639
|
File details
Details for the file glazzbocks-0.2.2-py3-none-any.whl.
File metadata
- Download URL: glazzbocks-0.2.2-py3-none-any.whl
- Upload date:
- Size: 50.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e17d9ab5393479c0ece8656319afbfb723b44fb05f18baffb3fc86d99b601eae
|
|
| MD5 |
7b98974a77eab36b846967099d880620
|
|
| BLAKE2b-256 |
ee71de75aaa70d4b2f3eece346b6b84ef43c902dc4ff1b98c8e0908562359dbf
|