Machine learning lifecycle for the Kailash ecosystem

Maintainers

esperie

kailash-ml

Classical and deep learning lifecycle for the Kailash ecosystem. Train models, store features, serve predictions, monitor drift, and optionally augment with AI agents.

Part of the Kailash Python SDK by the Terrene Foundation.

Installation

pip install kailash-ml              # Base (~195 MB): sklearn, LightGBM, polars
pip install "kailash-ml[dl]"        # + PyTorch, Lightning
pip install "kailash-ml[rl]"        # + Stable Baselines3, Gymnasium
pip install "kailash-ml[agents]"    # + Kaizen (LLM-augmented ML)
pip install "kailash-ml[full]"      # Everything (quotes keep shells such as zsh from expanding the brackets)

Quick Start

import asyncio

import polars as pl
from kailash_ml.engines.feature_store import FeatureStore
from kailash_ml.engines.model_registry import ModelRegistry
from kailash_ml.engines.training_pipeline import TrainingPipeline, ModelSpec, EvalSpec
from kailash_ml.engines.inference_server import InferenceServer
from kailash_ml._types import FeatureSchema, FeatureField
from dataflow import DataFlow


async def main():
    # Initialize the database and the engines that share it
    db = DataFlow("sqlite:///ml.db")
    await db.initialize()

    feature_store = FeatureStore(db)
    registry = ModelRegistry(db)
    pipeline = TrainingPipeline(feature_store, registry)

    # Define schema
    schema = FeatureSchema("customers", [
        FeatureField("age", "float64"),
        FeatureField("income", "float64"),
    ], entity_id_column="customer_id")

    # Train on a small example dataset
    data = pl.DataFrame({
        "customer_id": ["c1", "c2", "c3"],
        "age": [25.0, 35.0, 45.0],
        "income": [50000.0, 75000.0, 90000.0],
        "churn": [0, 1, 0],
    })
    result = await pipeline.train(
        data, schema,
        ModelSpec("sklearn.ensemble.RandomForestClassifier", {"n_estimators": 100}, "sklearn"),
        EvalSpec(metrics=["accuracy", "f1"]),
        "churn_predictor",
    )
    print(f"Accuracy: {result.metrics['accuracy']:.2f}")

    # Serve predictions
    server = InferenceServer(registry)
    predictions = await server.predict("churn_predictor", data)


# The calls above are awaited, so they must run inside an event loop.
asyncio.run(main())

Engines

kailash-ml provides 9 engines organized in 3 quality tiers:

P0: Production (stable API, full test coverage)

Engine	Purpose
FeatureStore	Compute, version, and serve features with point-in-time correct retrieval
ModelRegistry	Model lifecycle management (staging -> shadow -> production -> archived)
TrainingPipeline	Full training lifecycle: data prep -> train -> evaluate -> register
InferenceServer	Load, cache, and serve predictions; auto-expose via Nexus
DriftMonitor	Feature drift (PSI, KS-test) and performance degradation detection
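To make the PSI drift metric named in the table concrete, here is a minimal pure-Python sketch of the Population Stability Index. It is not DriftMonitor's implementation; the function name `psi`, the equal-width binning, and the `eps` floor for empty bins are all assumptions made for illustration.

```python
import math


def psi(expected, actual, n_bins=10, eps=1e-6):
    """Population Stability Index between a reference sample and a new sample."""
    lo, hi = min(expected), max(expected)
    # Equal-width bins over the reference range; fall back to 1.0 for a constant feature.
    width = (hi - lo) / n_bins or 1.0

    def bin_fractions(values):
        counts = [0] * n_bins
        for v in values:
            idx = int((v - lo) / width)
            # Clamp out-of-range values into the first or last bin.
            counts[min(max(idx, 0), n_bins - 1)] += 1
        total = len(values)
        # eps keeps empty bins from producing log(0) or a division by zero.
        return [max(c / total, eps) for c in counts]

    e = bin_fractions(expected)
    a = bin_fractions(actual)
    # PSI = sum over bins of (actual - expected) * ln(actual / expected)
    return sum((ai - ei) * math.log(ai / ei) for ei, ai in zip(e, a))


reference = [float(i % 100) for i in range(1000)]
shifted = [x + 30.0 for x in reference]

psi_same = psi(reference, reference)
psi_shifted = psi(reference, shifted)
print(f"identical: {psi_same:.4f}, shifted: {psi_shifted:.4f}")
```

Every term of the sum is non-negative, so the index is zero only when the binned distributions match and grows as they diverge.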

P1: Production with Caveats (tested, API may evolve)

Engine	Purpose
HyperparameterSearch	Grid, random, Bayesian (optuna), successive halving
AutoMLEngine	Automated model selection with optional LLM augmentation
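Of the search strategies listed for HyperparameterSearch, successive halving is the least self-explanatory. The sketch below shows the general idea in pure Python, under stated assumptions: it does not use the kailash-ml API, and `successive_halving`, `toy_score`, `min_budget`, and `eta` are hypothetical names chosen for illustration.

```python
import random


def successive_halving(configs, evaluate, min_budget=1, eta=2):
    """Keep the best 1/eta of the configs each round while multiplying the budget by eta."""
    survivors = list(configs)
    budget = min_budget
    while len(survivors) > 1:
        # Score every surviving config at the current budget (higher is better).
        scored = sorted(survivors, key=lambda c: evaluate(c, budget), reverse=True)
        keep = max(1, len(scored) // eta)
        survivors = scored[:keep]
        budget *= eta
    return survivors[0]


# Toy objective (hypothetical): stands in for "train with `budget` resources, then validate".
def toy_score(config, budget):
    return -((config["lr"] - 0.1) ** 2) * budget


rng = random.Random(0)
candidates = [{"lr": rng.uniform(0.001, 1.0)} for _ in range(16)]
best = successive_halving(candidates, toy_score)
print(best)
```

Weak configurations are discarded after a cheap evaluation, so most of the budget goes to the few candidates that survive the early rounds.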

P2: Experimental (functional, API may change)

Engine	Purpose
DataExplorer	Statistical profiling with polars, plotly visualizations
FeatureEngineer	Automated feature generation and selection

Quality Tier Promotion

P2 -> P1: 3 integration tests, 2 real-world user validations, no open bugs above LOW
P1 -> P0: No API changes for 3+ minor releases, complete documentation, performance benchmarks
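The promotion criteria above can be read as a simple rule check. The following sketch encodes them under some assumptions: the counts are treated as minimums, the bug severity scale is invented for illustration, and `EngineStatus` and `next_tier` are hypothetical names that are not part of kailash-ml.

```python
from dataclasses import dataclass

# Hypothetical severity scale, ordered from least to most severe.
SEVERITY = ["NONE", "LOW", "MEDIUM", "HIGH", "CRITICAL"]


@dataclass
class EngineStatus:
    tier: str
    integration_tests: int = 0
    user_validations: int = 0
    worst_open_bug: str = "NONE"
    stable_minor_releases: int = 0
    docs_complete: bool = False
    has_benchmarks: bool = False


def next_tier(status):
    """Return the tier the engine qualifies for after one promotion check."""
    if status.tier == "P2":
        # P2 -> P1: 3 integration tests, 2 user validations, no open bugs above LOW.
        ok = (
            status.integration_tests >= 3
            and status.user_validations >= 2
            and SEVERITY.index(status.worst_open_bug) <= SEVERITY.index("LOW")
        )
        return "P1" if ok else "P2"
    if status.tier == "P1":
        # P1 -> P0: stable API for 3+ minor releases, complete docs, benchmarks.
        ok = (
            status.stable_minor_releases >= 3
            and status.docs_complete
            and status.has_benchmarks
        )
        return "P0" if ok else "P1"
    return status.tier


print(next_tier(EngineStatus("P2", integration_tests=3, user_validations=2)))
```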

Agents (Optional)

Six Kaizen agents provide LLM-augmented ML workflows. Agents are double opt-in: you must install kailash-ml[agents] and also pass agent=True.

Agent	Purpose	Tier
DataScientistAgent	Data analysis + ML strategy	P0
RetrainingDecisionAgent	Retrain/monitor/rollback decision	P0
ModelSelectorAgent	Model family + config recommendation	P1
ExperimentInterpreterAgent	Trial result analysis + next steps	P1
FeatureEngineerAgent	Feature design and pruning	P2
DriftAnalystAgent	Drift interpretation + action decision	P2

All agents emit self_assessed_confidence (0-1), enforce cost budgets, require human approval by default, show baseline comparison, and log decisions to DataFlow.
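The double opt-in described above amounts to requiring two independent conditions before any agent runs. A minimal sketch of that pattern follows; it is not the library's code, and the function name `agents_enabled` and the module name `kaizen` are assumptions.

```python
import importlib.util


def agents_enabled(agent: bool, module_name: str = "kaizen") -> bool:
    """Return True only if the caller opted in AND the optional package is importable."""
    # find_spec returns None when a top-level module is not installed.
    installed = importlib.util.find_spec(module_name) is not None
    return agent and installed


# The flag alone is not enough, and neither is the installed extra.
print(agents_enabled(agent=True))
print(agents_enabled(agent=False, module_name="json"))
```

Checking both conditions in one place means an installed extra never enables LLM calls unless the caller also asks for them.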

ONNX Bridge

Models registered in ModelRegistry are automatically exported to ONNX format when supported:

sklearn: ~90% success rate (all standard estimators)
LightGBM: ~95% success rate
PyTorch: 70-85% (feedforward networks)

Failed ONNX exports are non-fatal -- the model falls back to native Python inference. Use model.onnx_status to check export status.
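The non-fatal export behaviour can be pictured as a try/except around the exporter. This is a sketch under assumptions, not the ModelRegistry implementation: `export_with_fallback`, the `exporter` callable, and the status strings "exported" and "failed" are hypothetical; only the onnx_status attribute name comes from the text above.

```python
def export_with_fallback(model, exporter):
    """Try the ONNX export; on failure, record the status and keep native inference."""
    try:
        artifact = exporter(model)
        return {"onnx_status": "exported", "artifact": artifact}
    except Exception as exc:
        # A failed export must not abort registration; the caller falls back
        # to native Python inference when no artifact is available.
        return {"onnx_status": "failed", "artifact": None, "error": str(exc)}


def unsupported_exporter(model):
    # Hypothetical exporter that simulates an unsupported operator.
    raise RuntimeError("unsupported operator")


record = export_with_fallback(object(), unsupported_exporter)
print(record["onnx_status"], "-", record["error"])
```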

polars-native

All data operations use polars DataFrames internally. The kailash_ml.interop module provides converters for ecosystem integration:

from kailash_ml.interop import to_sklearn_input, to_pandas, from_pandas

X, y = to_sklearn_input(polars_df, label_col="target")
pandas_df = to_pandas(polars_df)
polars_df = from_pandas(pandas_df)

DataFlow Integration

All engines persist data through DataFlow -- no additional infrastructure needed. Features, model metadata, drift reports, and audit logs use the same database as your application.

License

Apache-2.0

Release history

Version	Date
1.7.2	May 6, 2026
1.7.1	May 1, 2026
1.7.0	Apr 30, 2026
1.6.0	Apr 29, 2026
1.5.1	Apr 28, 2026
1.5.0	Apr 27, 2026
1.4.2	Apr 27, 2026
1.1.1	Apr 24, 2026
1.1.0	Apr 24, 2026
0.17.0	Apr 21, 2026
0.15.2	Apr 20, 2026
0.15.1	Apr 20, 2026
0.15.0	Apr 20, 2026
0.14.0	Apr 20, 2026
0.13.0	Apr 19, 2026
0.12.1	Apr 19, 2026
0.12.0	Apr 19, 2026
0.11.1	Apr 19, 2026
0.11.0	Apr 19, 2026
0.10.0	Apr 19, 2026
0.9.0	Apr 12, 2026
0.7.0	Apr 7, 2026
0.6.0	Apr 6, 2026
0.5.0	Apr 6, 2026
0.4.0	Apr 5, 2026
0.3.0	Apr 5, 2026
0.2.0 (this version)	Apr 1, 2026
0.1.0	Apr 1, 2026

Download files

Source Distributions

No source distribution files are available for this release.

Built Distribution

kailash_ml-0.2.0-py3-none-any.whl (123.2 kB)

Uploaded Apr 1, 2026 Python 3

File details

Details for the file kailash_ml-0.2.0-py3-none-any.whl.

File metadata

Download URL: kailash_ml-0.2.0-py3-none-any.whl
Upload date: Apr 1, 2026
Size: 123.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kailash_ml-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`49444ff48fa37c69374360d2e66471f64aec4a6e6bd92b229a8c3867cd5ee5b4`
MD5	`825a8fec9f42364638083d2a269d695a`
BLAKE2b-256	`110e0ab4517f295730576f78ba49e0e0f907dbb845fe4b24de965466913b5f20`
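A downloaded wheel can be checked against the SHA256 digest in the table with Python's standard hashlib module. The sketch below assumes the wheel sits in the current directory under its published file name; the helper names `sha256_of` and `verify` are illustrative.

```python
import hashlib
from pathlib import Path

# SHA256 digest published for kailash_ml-0.2.0-py3-none-any.whl.
EXPECTED_SHA256 = "49444ff48fa37c69374360d2e66471f64aec4a6e6bd92b229a8c3867cd5ee5b4"


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large files are never loaded into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_SHA256):
    """Return True when the file's SHA256 digest matches the expected hex string."""
    return sha256_of(path) == expected.lower()


wheel = Path("kailash_ml-0.2.0-py3-none-any.whl")
if wheel.exists():
    print("hash OK" if verify(wheel) else "hash MISMATCH")
else:
    print(f"{wheel} not found; download it first")
```

pip can perform the same check at install time when the digest is pinned in a requirements file and `--require-hashes` is used.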

Provenance

The following attestation bundles were made for kailash_ml-0.2.0-py3-none-any.whl:

Publisher: publish-pypi.yml on terrene-foundation/kailash-py

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: kailash_ml-0.2.0-py3-none-any.whl
- Subject digest: 49444ff48fa37c69374360d2e66471f64aec4a6e6bd92b229a8c3867cd5ee5b4
- Sigstore transparency entry: 1206594580
- Sigstore integration time: Apr 1, 2026
Source repository:
- Permalink: terrene-foundation/kailash-py@924bae3db4f883c55b3e2264a7d352d63529946a
- Branch / Tag: refs/tags/ml-v0.2.0
- Owner: https://github.com/terrene-foundation
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@924bae3db4f883c55b3e2264a7d352d63529946a
- Trigger Event: push
