Skip to main content

COGFlow — modular machine learning workflow management system

Project description

CogFlow

CogFlow is a modular, SDK-first machine learning workflow management system built on Kubeflow Pipelines, MLflow, Kubernetes, and MinIO.

It provides a clean Python API for:

  • building production-grade ML pipelines
  • managing datasets and components
  • orchestrating federated learning workflows
  • enforcing consistent error handling and validation

CogFlow is designed for real infrastructure, not notebooks only.


Why CogFlow?

Modern ML platforms are powerful but fragmented:

  • Kubeflow Pipelines → great orchestration, weak ergonomics
  • MLflow → experiment tracking, limited workflow control
  • Kubernetes → powerful, but verbose and error-prone
  • Federated learning → no standard orchestration layer

CogFlow bridges these gaps by providing:

  • a stable Python SDK
  • safe lazy-loading of heavy dependencies
  • unified error handling
  • infrastructure-aware abstractions
  • zero circular imports

Core Features

🧩 Pipeline Orchestration

  • Lazy-loaded Kubeflow Pipelines client
  • Safe pipeline compilation and submission
  • Runtime environment injection
  • Kubernetes service lifecycle management

📦 Component Management

  • YAML-based component registry
  • MinIO-backed component storage
  • Automatic component registration
  • Runtime-safe component loading

📊 Dataset Management

  • Dataset registration and metadata retrieval
  • Secure dataset downloads
  • Silent deletes with strict error semantics
  • Pluggable storage backends

🤝 Federated Learning

  • Auto-generated FL pipelines
  • Dynamic pipeline signatures
  • Connector-based and dataspace-based workflows
  • Region-aware scheduling with node selectors

🧠 Unified Error Handling

  • Strongly typed error hierarchy
  • Context-aware exception wrapping
  • API-ready error serialization
  • Zero silent failures

Architecture Overview

CogFlow follows a layered SDK architecture:

cogflow/ ├── core/ │ ├── pipelines/ # Kubeflow orchestration & FL pipelines │ ├── datasets/ # Dataset lifecycle management │ ├── components/ # Component registry & YAML handling │ └── models/ # (future extension) │ ├── utils/ │ ├── common.py # UUIDs, paths, Kubernetes helpers │ ├── network.py # HTTP utilities with retry & streaming │ ├── storage.py # MinIO client abstraction │ └── exceptions.py # Unified error framework │ ├── config.py └── api.py

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cogflow-2.0.1b9.tar.gz (255.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cogflow-2.0.1b9-py3-none-any.whl (301.7 kB view details)

Uploaded Python 3

File details

Details for the file cogflow-2.0.1b9.tar.gz.

File metadata

  • Download URL: cogflow-2.0.1b9.tar.gz
  • Upload date:
  • Size: 255.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for cogflow-2.0.1b9.tar.gz
Algorithm Hash digest
SHA256 0de587c9b2f18ec68cdba7b056c392f50c8d22eb5e949053ca6aaa857a2ac31d
MD5 198f932785d5fd0e8dfa162c2989310a
BLAKE2b-256 fc0f68387f4881b7dd45ceb0250d9e7d3f1f5783a692ac8ad2645c2ba8b3c5fa

See more details on using hashes here.

Provenance

The following attestation bundles were made for cogflow-2.0.1b9.tar.gz:

Publisher: release.yml on HIRO-MicroDataCenters-BV/cogflow

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file cogflow-2.0.1b9-py3-none-any.whl.

File metadata

  • Download URL: cogflow-2.0.1b9-py3-none-any.whl
  • Upload date:
  • Size: 301.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for cogflow-2.0.1b9-py3-none-any.whl
Algorithm Hash digest
SHA256 0098c1cf857ecfec5a30faf7ca65ff3acff63e0c399a41fd6e761215c39c531d
MD5 644314b2c0eceef8f71668e3e77c7d74
BLAKE2b-256 e0ec0a9e38822cb35b57327b76dd985d655b3cf0afccc12a46569a7ba22617c4

See more details on using hashes here.

Provenance

The following attestation bundles were made for cogflow-2.0.1b9-py3-none-any.whl:

Publisher: release.yml on HIRO-MicroDataCenters-BV/cogflow

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page