Skip to main content

No project description provided

Project description

ml-labs

A structured machine learning experimentation framework for building, managing, and evaluating ML pipelines with cross-validation, caching, and multi-framework support.

Installation

pip install ml-labs

With optional dependencies:

pip install ml-labs[xgboost]    # XGBoost support
pip install ml-labs[lightgbm]   # LightGBM support
pip install ml-labs[catboost]   # CatBoost support
pip install ml-labs[shap]       # SHAP value analysis
pip install ml-labs[polars]     # Polars DataFrame support
pip install ml-labs[all]        # All optional dependencies

Key Features

  • Pipeline: DAG-based node graph for defining ML workflows with stages (data transformation) and heads (model prediction)
  • Experimenter: Experiment execution engine with LRU caching, state management, and error resilience
  • Trainer: Cross-validation training pipeline with split management
  • Collectors: Extensible data collection — metrics, stacking outputs, model attributes, SHAP values, raw outputs
  • Adapters: Unified interface for scikit-learn, XGBoost, LightGBM, CatBoost, and Keras
  • Data Flexibility: Support for pandas, polars, cuDF, and NumPy arrays

Architecture Overview

Pipeline          Define node graphs (stages + heads) with groups and edges
    │
Experimenter      Execute pipelines, manage cache and state
    │
  ├── ExpObj      Per-node build/experiment objects (StageObj, HeadObj)
  ├── Trainer     Cross-validation training with split management
  └── Collector   Collect metrics, predictions, model attributes, SHAP values

Node State Model:

init ──→ built ──→ finalized
  │
  └──→ error ──→ (reset) ──→ init

Quick Start

from mllabs import Experimenter, Connector, MetricCollector

exp = Experimenter(data=df, path="exp/my_experiment")

p = exp.pipeline
p.set_grp("scale", role="stage", processor="StandardScaler")
p.set_grp("model", role="head", processor="RandomForestClassifier",
          parent="scale", edges={"X": [(None, None)], "y": [(None, "target")]})

p.set_node("rf_default", grp="model")

exp.build(["rf_default"])
exp.exp(["rf_default"])

mc = MetricCollector("accuracy", Connector(), output_var="prediction",
                     metric_func=lambda y, pred: (y == pred).mean())
exp.add_collector(mc)
exp.collect(mc)

print(mc.get_metrics(["rf_default"]))

Requirements

  • Python >= 3.10
  • pandas >= 1.5
  • numpy >= 1.23
  • scikit-learn >= 1.2
  • cachetools >= 5.0

License

PolyForm Noncommercial 1.0.0 — free for non-commercial use.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ml_labs-0.6.0.tar.gz (97.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ml_labs-0.6.0-py3-none-any.whl (93.5 kB view details)

Uploaded Python 3

File details

Details for the file ml_labs-0.6.0.tar.gz.

File metadata

  • Download URL: ml_labs-0.6.0.tar.gz
  • Upload date:
  • Size: 97.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ml_labs-0.6.0.tar.gz
Algorithm Hash digest
SHA256 68b599b7b85085a8066ce57d1673399da8414fa5acb896a4b5a5aa3596aeb40b
MD5 f1b7633be03adbbd1b8c05013b9ea912
BLAKE2b-256 5b00585a0fa6fe47d0c59cc2f3c05a052a5e5707fdd47685f7502950f03cc2c4

See more details on using hashes here.

Provenance

The following attestation bundles were made for ml_labs-0.6.0.tar.gz:

Publisher: publish.yml on sunkusun9/ml-labs

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file ml_labs-0.6.0-py3-none-any.whl.

File metadata

  • Download URL: ml_labs-0.6.0-py3-none-any.whl
  • Upload date:
  • Size: 93.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ml_labs-0.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b50e62c064d11b157e695552d7c2dd3dcdb3d58cb8e9952c8d5e46ed46bf5211
MD5 592f8e90c03495399f543a15bfdeb166
BLAKE2b-256 7d0aa325956ba8a8cbc737312172901a45aabfbb1e40019dc6cc8c7b5cbcc0a4

See more details on using hashes here.

Provenance

The following attestation bundles were made for ml_labs-0.6.0-py3-none-any.whl:

Publisher: publish.yml on sunkusun9/ml-labs

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page