COGFlow — modular machine learning workflow management system
Project description
CogFlow
CogFlow is a modular, SDK-first machine learning workflow management system built on Kubeflow Pipelines, MLflow, Kubernetes, and MinIO.
It provides a clean Python API for:
- building production-grade ML pipelines
- managing datasets and components
- orchestrating federated learning workflows
- enforcing consistent error handling and validation
CogFlow is designed for real infrastructure, not notebooks only.
Why CogFlow?
Modern ML platforms are powerful but fragmented:
- Kubeflow Pipelines → great orchestration, weak ergonomics
- MLflow → experiment tracking, limited workflow control
- Kubernetes → powerful, but verbose and error-prone
- Federated learning → no standard orchestration layer
CogFlow bridges these gaps by providing:
- a stable Python SDK
- safe lazy-loading of heavy dependencies
- unified error handling
- infrastructure-aware abstractions
- zero circular imports
Core Features
🧩 Pipeline Orchestration
- Lazy-loaded Kubeflow Pipelines client
- Safe pipeline compilation and submission
- Runtime environment injection
- Kubernetes service lifecycle management
📦 Component Management
- YAML-based component registry
- MinIO-backed component storage
- Automatic component registration
- Runtime-safe component loading
📊 Dataset Management
- Dataset registration and metadata retrieval
- Secure dataset downloads
- Silent deletes with strict error semantics
- Pluggable storage backends
🤝 Federated Learning
- Auto-generated FL pipelines
- Dynamic pipeline signatures
- Connector-based and dataspace-based workflows
- Region-aware scheduling with node selectors
🧠 Unified Error Handling
- Strongly typed error hierarchy
- Context-aware exception wrapping
- API-ready error serialization
- Zero silent failures
Architecture Overview
CogFlow follows a layered SDK architecture:
cogflow/ ├── core/ │ ├── pipelines/ # Kubeflow orchestration & FL pipelines │ ├── datasets/ # Dataset lifecycle management │ ├── components/ # Component registry & YAML handling │ └── models/ # (future extension) │ ├── utils/ │ ├── common.py # UUIDs, paths, Kubernetes helpers │ ├── network.py # HTTP utilities with retry & streaming │ ├── storage.py # MinIO client abstraction │ └── exceptions.py # Unified error framework │ ├── config.py └── api.py
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cogflow-2.0.1b3.tar.gz.
File metadata
- Download URL: cogflow-2.0.1b3.tar.gz
- Upload date:
- Size: 244.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
386b288811ab163a6721760965dce87312fc04e85acab9dce8060f3dbc3d6a0e
|
|
| MD5 |
7db2d9ce17e3554ece3ab3c168b8d796
|
|
| BLAKE2b-256 |
48d0a3889029e7862baca9e9b56fc362a393d44616b4f4df7ac3941f58598860
|
Provenance
The following attestation bundles were made for cogflow-2.0.1b3.tar.gz:
Publisher:
release.yml on HIRO-MicroDataCenters-BV/cogflow
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
cogflow-2.0.1b3.tar.gz -
Subject digest:
386b288811ab163a6721760965dce87312fc04e85acab9dce8060f3dbc3d6a0e - Sigstore transparency entry: 1313017667
- Sigstore integration time:
-
Permalink:
HIRO-MicroDataCenters-BV/cogflow@e2b42c0074aac9ff381ee734fed48915df535f28 -
Branch / Tag:
refs/tags/v2.0.1b3 - Owner: https://github.com/HIRO-MicroDataCenters-BV
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@e2b42c0074aac9ff381ee734fed48915df535f28 -
Trigger Event:
push
-
Statement type:
File details
Details for the file cogflow-2.0.1b3-py3-none-any.whl.
File metadata
- Download URL: cogflow-2.0.1b3-py3-none-any.whl
- Upload date:
- Size: 295.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d783a4aa6cc4a2c75fa9790ac3e1c749e67d4923d0066d3d6377e82f538ce715
|
|
| MD5 |
8ffb5e5ff5ae2f1a39db432b003092fa
|
|
| BLAKE2b-256 |
de2c26d86eef7ecc01abd209dc375d253d866b96770430eb1d9e42a4d2441849
|
Provenance
The following attestation bundles were made for cogflow-2.0.1b3-py3-none-any.whl:
Publisher:
release.yml on HIRO-MicroDataCenters-BV/cogflow
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
cogflow-2.0.1b3-py3-none-any.whl -
Subject digest:
d783a4aa6cc4a2c75fa9790ac3e1c749e67d4923d0066d3d6377e82f538ce715 - Sigstore transparency entry: 1313017773
- Sigstore integration time:
-
Permalink:
HIRO-MicroDataCenters-BV/cogflow@e2b42c0074aac9ff381ee734fed48915df535f28 -
Branch / Tag:
refs/tags/v2.0.1b3 - Owner: https://github.com/HIRO-MicroDataCenters-BV
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@e2b42c0074aac9ff381ee734fed48915df535f28 -
Trigger Event:
push
-
Statement type: