
Sinagot



Source Code: https://gitlab.com/YannBeauxis/sinagot


Sinagot is a lightweight Python workflow management framework that uses Ray as its distributed computing engine.

The key features are:

  • Easy to use: Design workflows with simple Python classes and functions, without external configuration files.
  • Data exploration: Access computed data directly as object attributes, including complex types such as pandas DataFrames.
  • Scalable: The Ray engine enables seamless scaling of workflows to external clusters (see the sketch below).
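
For cluster execution, here is a minimal sketch of connecting to a remote Ray cluster before using a workspace. It assumes Sinagot dispatches steps through the ambient Ray runtime initialized by ray.init; the cluster address is a placeholder, and local execution is used when no cluster is configured.

import ray

# Assumption: Sinagot submits its steps through the already-initialized Ray
# runtime; without an explicit ray.init(), execution stays local.
ray.init(address="ray://head-node.example.com:10001")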

Installation

pip install sinagot

Getting started

import pandas as pd
import sinagot as sg

# Decorate functions to use them as workflow steps
@sg.step
def multiply(df: pd.DataFrame, factor: int) -> pd.DataFrame:
    return df * factor


@sg.step
def get_single_data(df: pd.DataFrame) -> int:
    return int(df.iloc[0, 0])


# Design a workflow
class TestWorkflow(sg.Workflow):
    raw_data: pd.DataFrame = sg.seed() # seed is input data
    factor: int = sg.seed()
    multiplied_data: pd.DataFrame = multiply.step(raw_data, factor=factor)
    final_data: int = get_single_data.step(multiplied_data)


# Define a workspace on top of the workflow to set the storage policy for produced data
class TestWorkspace(sg.Workspace[TestWorkflow]):
    raw_data = sg.LocalStorage("raw_data/data-{workflow_id}.csv")
    factor = sg.LocalStorage("params/factor")
    multiplied_data = sg.LocalStorage(
        "computed/multiplied_data-{workflow_id}.csv", write_kwargs={"index": False}
    )
    # In this example, final_data is not stored and is computed on demand


# Instantiate the workspace with the local storage root path
ws = TestWorkspace("/path/to/local_storage")

# Access a single workflow by its ID
wf = ws["001"]

# Access item data; it is computed automatically if it does not exist in storage
print(wf.multiplied_data)
print(wf.final_data)

In this example, the storage dataset is structured as follows:

├── params/
│   └── factor
├── raw_data/
│   ├── data-{workflow_id}.csv
│   └── ...
└── computed/
    ├── multiplied_data-{workflow_id}.csv
    └── ...

And the workflow dependency graph is: raw_data and factor feed multiplied_data, which feeds final_data.
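
To run the example end to end, the seed files must exist in local storage before the computed attributes are accessed. Here is a minimal sketch of seeding workflow "001", assuming raw_data is a pandas-readable CSV and factor is stored as plain text; the exact serialization used by sg.LocalStorage may differ.

from pathlib import Path

import pandas as pd

root = Path("/path/to/local_storage")
(root / "raw_data").mkdir(parents=True, exist_ok=True)
(root / "params").mkdir(parents=True, exist_ok=True)

# Seed data for workflow "001" (assumed CSV and plain-text formats)
pd.DataFrame({"value": [2, 3, 5]}).to_csv(root / "raw_data" / "data-001.csv", index=False)
(root / "params" / "factor").write_text("10")

ws = TestWorkspace(str(root))
wf = ws["001"]
print(wf.final_data)  # computes multiplied_data, then final_data, on demand

Note that the factor path has no {workflow_id} placeholder, so the same seed value is shared by every workflow in the workspace.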

Development Roadmap

Sinagot is at an early development stage, but it is ready to be tested on actual datasets for workflow prototyping.

The feature roadmap will be prioritized based on usage feedback, so feel free to open an issue if you have any requirements.
