Config-driven ML analysis library for regression and classification

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

LizyML

Config-driven ML library that unifies tune / fit / predict / evaluate / export for regression, binary classification, and multiclass classification.

Key Features

One config, full pipeline -- A single dict/YAML/JSON drives splitting, training, tuning, evaluation, and export. No boilerplate orchestration code.
Reproducibility by default -- Seed, split indices, params, library versions, and data fingerprint are captured automatically in every run.
Leakage-aware CV and calibration -- OOF predictions never see their own training rows. Calibration uses cross-fit on the same outer splits. Time and group constraints propagate to inner validation.
8 CV strategies -- KFold, Stratified, Group, StratifiedGroup, TimeSeries, Purged TimeSeries, Group TimeSeries, and 2-axis Blocked Group KFold.
Stable result contracts -- FitResult, PredictionResult, and artifact formats have fixed schemas. Downstream code never breaks on shape changes.
Codegen export -- Generate standalone train.py + predict.py that run without LizyML installed.
Optional extras -- Tuning (Optuna), SHAP explanations, Plotly visualizations, and Beta calibration (scipy) are all opt-in.

Installation

pip install lizyml

Extras

pip install 'lizyml[tuning]'        # Optuna hyperparameter search
pip install 'lizyml[explain]'        # SHAP explanations
pip install 'lizyml[plots]'          # Plotly visualizations
pip install 'lizyml[calibration]'    # Beta calibrator (scipy)
pip install 'lizyml[tuning,explain,plots,calibration]'  # all extras

Development install

git clone https://github.com/nbx-liz/LizyML.git
cd LizyML
uv sync --group dev

Quick Start

import numpy as np
import pandas as pd
from lizyml import Model

# --- Synthetic data ---
rng = np.random.default_rng(42)
n = 500
df = pd.DataFrame({
    "feat_a": rng.normal(size=n),
    "feat_b": rng.normal(size=n),
    "cat_col": rng.choice(["x", "y", "z"], size=n),
    "target": rng.normal(size=n),
})

# --- Config ---
config = {
    "config_version": 1,
    "task": "regression",
    "data": {"target": "target"},
    "features": {"categorical": ["cat_col"]},
    "split": {"method": "kfold", "n_splits": 5},
    "model": {"name": "lgbm"},
    "evaluation": {"metrics": ["rmse", "mae"]},
}

# --- Train, evaluate, predict ---
model = Model(config=config)
fit_result = model.fit(data=df)
metrics = model.evaluate()
print(metrics)  # {"raw": {"oof": {"rmse": ..., "mae": ...}, ...}}

pred = model.predict(df.drop(columns=["target"]))
print(pred.pred[:5])

# --- Save and reload ---
model.export("my_model")
loaded = Model.load("my_model")
loaded.predict(df.drop(columns=["target"]))

Configuration

LizyML accepts configs as Python dicts, JSON files, or YAML files. Environment variables override any key using the LIZYML__ prefix (e.g., LIZYML__training__seed=99).

See Config Reference for all keys, defaults, split method guides, and tuning space definitions.

Codegen Export

Generate LizyML-free scripts for production deployment:

model.export_code("deploy/my_model")

Output:

train.py -- retrain on new data with python train.py data.csv
predict.py -- run inference with python predict.py test.csv -o out.csv
config.json -- all hyperparameters and feature definitions
test_equivalence.py -- verify codegen matches Model.predict()
artifacts/ -- model files in human-readable formats

Dependencies: only lightgbm, numpy, pandas, scikit-learn.

Architecture

LizyML uses a 5-layer architecture where dependencies flow strictly downward:

Layer 4  Facade       Model (orchestration only, no logic)
           |
Layer 3  Optional     explain / plots / persistence / codegen
           |
Layer 2  Composition  training / evaluation / tuning
           |
Layer 1  Leaf         config / data / splitters / features / estimators / metrics / calibration
           |
Layer 0  Foundation   types (FitResult, PredictionResult, ...) / exceptions / logging

Key rules:

Downward-only dependencies (no circular imports)
Layer 2 references Layer 1 through abstract interfaces only
Only the Facade (Layer 4) assembles concrete classes

See ARCHITECTURE.md for full diagrams and module layout.

Design Priorities

Reproducibility -- Same config + seed = same splits, same OOF predictions, same metrics. Every run captures seed, split indices, params, library versions, and a data fingerprint.

Leakage prevention -- OOF rows are never seen during training. Calibration cross-fit reuses outer CV splits. Time and group constraints propagate to inner validation (early stopping) and calibration.

Contract stability -- FitResult, PredictionResult, and artifact formats have fixed schemas. Breaking changes require a format_version bump and migration path.

Result Objects

Object	Key fields
`FitResult`	`oof_pred`, `if_pred_per_fold`, `metrics`, `models`, `splits`, `run_meta`
`PredictionResult`	`pred`, `proba` (binary), `shap_values` (optional), `warnings`
`Model Artifact`	Trained models, pipeline state, calibrator, config, `format_version`

model.evaluate() returns structured metrics:

{
    "raw": {
        "oof": {"rmse": 0.42, "mae": 0.33},
        "if_mean": {"rmse": 0.40, "mae": 0.31},
        "if_per_fold": [...],
        "oof_coverage": 1.0,
    },
    "calibrated": {"oof": {"logloss": 0.35}},  # binary only
}

See BLUEPRINT.md for full schemas and invariants.

Roadmap

Broader scikit-learn estimator support
DNN backend (PyTorch)
Multiclass calibration
Ranking tasks
Additional export formats (ONNX, TorchScript)

Documentation

Config Reference -- all config keys, defaults, and split guides
BLUEPRINT.md -- implementation specification (source of truth)
ARCHITECTURE.md -- 5-layer architecture diagrams
CHANGELOG.md -- release history
HISTORY.md -- proposal and decision records

Contributing

Fork the repo and create a branch from develop
Run quality gates: uv run ruff check . && uv run mypy lizyml/ && uv run pytest
Open a PR against develop

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

nbx

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.9.0

Apr 11, 2026

0.8.1

Apr 10, 2026

0.8.0

Apr 2, 2026

0.7.3

Apr 2, 2026

0.7.2

Apr 1, 2026

0.7.1

Apr 1, 2026

0.7.0

Mar 28, 2026

0.6.1

Mar 28, 2026

0.6.0

Mar 28, 2026

This version

0.5.0

Mar 28, 2026

0.4.2

Mar 28, 2026

0.4.1

Mar 28, 2026

0.4.0

Mar 20, 2026

0.3.0

Mar 19, 2026

0.2.0

Mar 17, 2026

0.1.4

Mar 14, 2026

0.1.3

Mar 14, 2026

0.1.2

Mar 9, 2026

0.1.1

Mar 7, 2026

0.1.0

Mar 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lizyml-0.5.0.tar.gz (554.5 kB view details)

Uploaded Mar 28, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lizyml-0.5.0-py3-none-any.whl (137.9 kB view details)

Uploaded Mar 28, 2026 Python 3

File details

Details for the file lizyml-0.5.0.tar.gz.

File metadata

Download URL: lizyml-0.5.0.tar.gz
Upload date: Mar 28, 2026
Size: 554.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for lizyml-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`245b7fc87245c74ac32565473ad1c7c44369814c1b2f2dadaef409b24df41227`
MD5	`f88b6bc2eaebf58ccca9cdf6fc3ad8b2`
BLAKE2b-256	`258a690e989ae6abf33382a16035390be7878b2f1f59ba58b7a0e760855f7a84`

See more details on using hashes here.

Provenance

The following attestation bundles were made for lizyml-0.5.0.tar.gz:

Publisher: release.yml on nbx-liz/LizyML

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lizyml-0.5.0.tar.gz
- Subject digest: 245b7fc87245c74ac32565473ad1c7c44369814c1b2f2dadaef409b24df41227
- Sigstore transparency entry: 1190670119
- Sigstore integration time: Mar 28, 2026
Source repository:
- Permalink: nbx-liz/LizyML@11d8398f897fe032123aa849f34c78076671bd50
- Branch / Tag: refs/heads/main
- Owner: https://github.com/nbx-liz
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@11d8398f897fe032123aa849f34c78076671bd50
- Trigger Event: workflow_dispatch

File details

Details for the file lizyml-0.5.0-py3-none-any.whl.

File metadata

Download URL: lizyml-0.5.0-py3-none-any.whl
Upload date: Mar 28, 2026
Size: 137.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for lizyml-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`81a2437eb31863c398d29486a2b8f340c4c77322ca7dc1d04b103c0d31a639de`
MD5	`2dd49f15a78512e7115b9ec3456b3ba6`
BLAKE2b-256	`8cdf6522fc37c9166b5b5d1669dd288d76cfacf7a5655ef5391a204a876ea30a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for lizyml-0.5.0-py3-none-any.whl:

Publisher: release.yml on nbx-liz/LizyML

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lizyml-0.5.0-py3-none-any.whl
- Subject digest: 81a2437eb31863c398d29486a2b8f340c4c77322ca7dc1d04b103c0d31a639de
- Sigstore transparency entry: 1190670132
- Sigstore integration time: Mar 28, 2026
Source repository:
- Permalink: nbx-liz/LizyML@11d8398f897fe032123aa849f34c78076671bd50
- Branch / Tag: refs/heads/main
- Owner: https://github.com/nbx-liz
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@11d8398f897fe032123aa849f34c78076671bd50
- Trigger Event: workflow_dispatch

lizyml 0.5.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

LizyML

Key Features

Installation

Extras

Development install

Quick Start

Configuration

Codegen Export

Architecture

Design Priorities

Result Objects

Roadmap

Documentation

Contributing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance