PyLabFlow is a lightweight framework that simplifies experiment management, reducing setup time with reusable components for training, logging, and checkpointing. It streamlines workflows, making it ideal for fast and efficient experimentation.

PyLabFlow

Domain-Independent, Secure, and Offline Computational Research Management

PyLabFlow is a self-contained Python framework for managing computational research experiments. It is designed to work with various domains, from machine learning and data processing to simulation and numerical analysis. Built on the principles of flexibility, reproducibility, and data privacy, PyLabFlow allows researchers to define, run, track, and transfer entire custom workflows offline.

PyLabFlow is domain-agnostic, leveraging abstract Component and WorkFlow classes that users can customize to fit their needs. This makes it ideal for use with frameworks like PyTorch, TensorFlow, or any other complex pipeline structures used in scientific computing.


✨ Key Capabilities

  • Offline-First: All experiment tracking and data management are handled locally using file systems and SQLite databases, ensuring complete data privacy and uninterrupted work, even without an internet connection.
  • 100% Customization: Built on abstract Component and WorkFlow classes, users can define their own logic for data loading, processing, simulation, and analysis.
  • Pipeline Tracking (PPLs): Centralized management of experiment configurations, status (e.g., init, running, frozen, cleaned), and history.
  • Reproducibility Guarantee: Each pipeline (PPL) is uniquely identified by the cryptographic hash of its entire configuration, preventing configuration drift and ensuring exact reproducibility.
  • Seamless Transfer: Easily archive experiments, or transfer project setups (including configurations and output artifacts) between local directories, cloud storage, or high-performance computing (HPC) nodes without path reconfiguration.
  • Session Logging: Automatic tracking of the session origin (e.g., Jupyter session, script filename) and execution time for enhanced auditability.
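
The reproducibility guarantee can be illustrated with a minimal sketch: hash a canonically serialized configuration so that logically identical configurations always map to the same identifier. This shows the general technique only, not PyLabFlow's internal implementation; the helper name config_hash is hypothetical.

```python
import hashlib
import json

def config_hash(config: dict) -> str:
    """Hash a pipeline configuration deterministically.

    Serializing with sorted keys makes the digest independent of key
    insertion order, so two dicts describing the same configuration
    always produce the same identifier.
    """
    canonical = json.dumps(config, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

# Same configuration written in two different key orders hashes identically
cfg_a = {"workflow": "GenericDataWorkflow", "args": {"initial_value": 42}}
cfg_b = {"args": {"initial_value": 42}, "workflow": "GenericDataWorkflow"}
assert config_hash(cfg_a) == config_hash(cfg_b)
```

Because any change to the configuration changes the digest, the hash doubles as a drift detector: a re-run only matches an earlier pipeline if its configuration is byte-for-byte equivalent after canonicalization.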

🛠 Installation

You can install PyLabFlow using pip:


pip install PyLabFlow

Alternatively, clone the repository for development purposes:


git clone https://github.com/ExperQuick/PyLabFlow.git
cd PyLabFlow
pip install -e .


🚀 Getting Started: Running an Experiment Pipeline

PyLabFlow structures work around Labs (your project environment) and Pipelines (PPLs) (customizable experiment runs).

1. Setting Up the Lab Environment

The Lab manages file paths and databases. You only need to run this once per project.

import os
from plf.lab import create_project, lab_setup

# Define mandatory project settings
SETTINGS = {
    "project_name": "General_Research_Lab",
    "project_dir": "/path/to/my/research_projects",  # Parent directory for the project
    "component_dir": "/path/to/my/custom_components",  # Directory for custom components and workflows
}

# Create project structure, databases, and settings file
settings_path = create_project(SETTINGS)
print(f"Project structure and settings created at: {settings_path}")

# Set up the lab environment for the current Python session
lab_setup(settings_path)

2. Defining a Custom WorkFlow

Since PyLabFlow is domain-independent, you define your experiment logic by subclassing WorkFlow and Component.

import os
from plf.utils import WorkFlow, Component
from typing import Dict, Any

# A generic computational component
class MyComputationalComponent(Component):
    def _setup(self, args: Dict[str, Any]):
        print(f"Setting up component with args: {args}")
        self.data = args.get("initial_value", 0)

# Define the flow that combines components and executes the run logic
class GenericDataWorkflow(WorkFlow):
    
    # Initialize the pipeline run configuration
    def new(self, args: Dict[str, Any]):
        # Ensure all required configuration keys are provided
        if not self.template.issubset(set(args.keys())):
            missing = ", ".join(self.template - set(args.keys()))
            raise ValueError(f"args is missing required keys: {missing}")
        # Keep the validated configuration so prepare() can access it
        self.args = args

    # Perform setup (e.g., loading large datasets/models into memory)
    def prepare(self):
        print("Preparing workflow: loading external data or setting up environment...")
        self.data_source = self.load_component(**self.args['data_source'])
        self.algorithm = self.load_component(**self.args['algorithm'])
        return True

    # Main execution logic (e.g., training loop, simulation run)
    def run(self):
        print(f"Running PPL: {self.P.pplid}")
        result = self.data_source.data + 10
        print(f"Final result: {result}")

    # Define standardized paths for saving artifacts
    def get_path(self, of: str, pplid: str, args: Dict) -> str:
        if of == 'results':
            return os.path.join(self.P.settings['data_path'], 'Results', f'{pplid}_output.txt')
        raise NotImplementedError(f"Path for artifact type '{of}' is not defined in GenericDataWorkflow.")

    # Clean up temporary files or output artifacts
    def clean(self):
        print(f"Cleaning artifacts for {self.P.pplid}...")
        # Add logic to delete files here

    # Return execution status or key metrics
    def status(self):
        return {"last_result": 100, "status_detail": "Completed successfully"}

3. Creating and Executing a Pipeline

Once your components are defined, you can create and execute the pipeline using the PipeLine class.

from plf.experiment import PipeLine

# Define the full configuration with fully qualified paths to components
pipeline_config = {
    "workflow": {
        "loc": "my_workflows.GenericDataWorkflow",
        "args": {}
    },
    "args": {
        "data_source": {"loc": "my_workflows.MyComputationalComponent", "args": {"initial_value": 42}},
        "algorithm": {"loc": "my_workflows.MyComputationalComponent", "args": {"param_b": 5}},
    }
}

# Create a new pipeline. Configuration is hashed and logged
P = PipeLine()
P.new(pplid="ppl_data_run_001", args=pipeline_config)

# Prepare the environment
P.prepare()

# Run the workflow
P.run()
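
The "loc" entries in the configuration are dotted module paths. A minimal sketch of how such a path can be resolved into a class instance uses importlib; this illustrates the general mechanism only, and load_by_loc is a hypothetical helper, not PyLabFlow's actual loader (which resolves components relative to component_dir).

```python
import importlib

def load_by_loc(loc: str, args: dict):
    # Split "package.module.ClassName" into module path and class name,
    # import the module, look up the class, and instantiate it.
    module_path, class_name = loc.rsplit(".", 1)
    module = importlib.import_module(module_path)
    cls = getattr(module, class_name)
    return cls(**args)

# Stdlib example: resolve collections.OrderedDict by its dotted path
od = load_by_loc("collections.OrderedDict", {})
print(type(od).__name__)  # → OrderedDict
```

The same pattern is why the configuration uses fully qualified paths: any class reachable on the Python path can be named in the config without being imported by the framework ahead of time.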

💾 Experiment Management Tools

The plf.experiment module provides powerful tools for managing your PPL database:

  • get_ppls(): List all active pipeline IDs in the current Lab.
  • get_ppl_status(): Return a DataFrame summarizing the status, last run, and key metrics for all PPLs.
  • filter_ppls(query): Filter PPLs by configuration arguments (e.g., filter_ppls("data_source=my_workflows.MyComponent")).
  • archive_ppl(ppls): Archive pipelines, moving their configurations and artifacts to an archived folder for safe storage.
  • archive_ppl(ppls, reverse=True): Unarchive pipelines, returning them to the active environment.
  • delete_ppl(ppls): Permanently delete pipelines from the archive.
  • stop_running(): Gracefully stop a currently running pipeline after its current iteration completes.
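
As an illustration of working with such a status summary once retrieved, the records below are hypothetical rows shaped like what get_ppl_status() might report; the column names here are assumptions, not PyLabFlow's actual schema.

```python
# Hypothetical status rows; the real get_ppl_status() returns a
# DataFrame, and its column names may differ from these.
records = [
    {"pplid": "ppl_data_run_001", "status": "frozen", "last_result": 100},
    {"pplid": "ppl_data_run_002", "status": "running", "last_result": None},
]

# Select only pipelines whose runs have completed and been frozen
frozen = [r["pplid"] for r in records if r["status"] == "frozen"]
print(frozen)  # → ['ppl_data_run_001']
```

Frozen pipelines selected this way are natural candidates for archive_ppl(), keeping the active Lab limited to work in progress.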

📜 License

This project is licensed under the Apache License 2.0. © 2025 BBEK Anand
