Running dynamic DAG workflows

Project description

Adage - A DAG Executor

This is a small experimental package to see how one could describe workflows that are not completely known at definition time. Tasks should be runnable in a multiprocessing pool, on a number of Celery workers, or on an IPython cluster.

Example

example image

Problem

Workflows can be comfortably represented by directed acyclic graphs (DAGs). But sometimes the precise structure of the graph is not known before processing starts. Instead, one often has only partial information about which kinds of edges are possible, and depending on the result of a node, the DAG might be extended with more nodes and edges.

For example, one node (call it "node A") could download a list of files that can then be processed in parallel. The DAG would therefore have one node for each file-processing task (let's call them "node_file_1" to "node_file_n"), each depending on "node A". Since the exact number of files is not known until run time, we cannot map out the DAG beforehand. After this "map" step one might also want a "reduce" step that merges the individual results. This step, too, can only be scheduled once the number of "map" nodes is known.
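The map/reduce case can be sketched in a few lines of plain Python. This is a minimal illustration and not adage's API: the dictionary-based `dag`, the helpers `download_file_list` and `extend_with_map_reduce`, and the file names are all assumptions made for the example.

```python
def download_file_list():
    # Hypothetical stand-in for "node A": the number of files is only
    # known once this has actually run.
    return ["a.txt", "b.txt", "c.txt"]


def extend_with_map_reduce(dag, files):
    """Append one "map" node per file and one "reduce" node that depends on all of them."""
    map_nodes = []
    for index, filename in enumerate(files, start=1):
        name = f"node_file_{index}"
        dag[name] = {"depends_on": ["node A"], "input": filename}
        map_nodes.append(name)
    # The reduce node can only be described now that the map nodes are known.
    dag["reduce"] = {"depends_on": map_nodes, "input": None}
    return dag


# Before node A has run, this is all we can write down.
dag = {"node A": {"depends_on": [], "input": None}}

files = download_file_list()        # result of node A, available at run time
extend_with_map_reduce(dag, files)  # the DAG grows by len(files) + 1 nodes
```

Note that the extension only appends nodes downstream of "node A"; nothing that existed before is modified.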

As another example, one might have a whole set of nodes that run a certain kind of task (e.g. producing a PDF file). One could imagine wanting a "reduce"-type task that merges all these individual PDF files. While any given node does not know where else PDF-generating tasks are scheduled, one can wait until no more edges to PDF-generating tasks are possible and then append a PDF-merging node to the DAG.
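One way to picture the "wait until no more PDF tasks are possible" condition is sketched below. It assumes, purely for illustration, that we know globally which task types may append which other task types (the hypothetical `CAN_SPAWN` table) and that a node can only append tasks before it is marked as done. Only direct spawning is considered, not chains of several steps. None of these names come from adage.

```python
# Assumed global knowledge: which task type may append which other types.
CAN_SPAWN = {"prepare": {"pdf"}, "pdf": set(), "merge": set()}


def pdf_tasks_still_possible(nodes):
    """True while some unfinished node could still append a PDF-generating task."""
    return any(
        "pdf" in CAN_SPAWN[node["type"]] and not node["done"]
        for node in nodes.values()
    )


def maybe_append_merge(nodes):
    """Append the merge node once no further PDF tasks can appear.

    Returns True if the merge node was appended by this call.
    """
    if pdf_tasks_still_possible(nodes) or "merge_pdfs" in nodes:
        return False
    pdf_nodes = [name for name, node in nodes.items() if node["type"] == "pdf"]
    nodes["merge_pdfs"] = {"type": "merge", "done": False, "depends_on": pdf_nodes}
    return True


nodes = {
    "prep_a": {"type": "prepare", "done": True, "depends_on": []},
    "prep_b": {"type": "prepare", "done": False, "depends_on": []},
    "pdf_1": {"type": "pdf", "done": False, "depends_on": ["prep_a"]},
}

first_try = maybe_append_merge(nodes)   # prep_b might still add PDF tasks

# prep_b finishes and appends its own PDF task.
nodes["prep_b"]["done"] = True
nodes["pdf_2"] = {"type": "pdf", "done": False, "depends_on": ["prep_b"]}

second_try = maybe_append_merge(nodes)  # no node can add PDF tasks any more
```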

Solution

Generically, we want individual nodes to have a limited set of operations they can perform on the DAG that they are part of. Specifically, we allow only queries on the structure of the DAG and append operations; nodes must not be able to remove nodes. We implement this with an append-only record of scheduled rules. A rule is a pair of functions (predicate, body) that operate on the DAG. The predicate is a query function that inspects the graph to decide whether the DAG has enough information to apply the body (e.g. are edges of a certain type still possible to append or not?). If the DAG does have enough information, the body, which is an append-only operation on the DAG, is applied, i.e. nodes are added. Periodically, the list of rules is iterated to extend the DAG where possible.
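The (predicate, body) idea can be illustrated with a small self-contained sketch. The `Workflow` class and its methods are hypothetical names chosen for this example and should not be read as adage's actual interface; the sketch only mirrors the description above: an append-only list of rules, a predicate that queries the graph, and a body that appends nodes.

```python
class Workflow:
    """Toy DAG with an append-only record of (predicate, body) rules."""

    def __init__(self):
        self.nodes = {}       # node name -> list of names it depends on
        self.rules = []       # append-only record of (predicate, body) pairs
        self.applied = set()  # indices of rules whose body has already run

    def add_node(self, name, depends_on=()):
        # Append-only: an existing node is never replaced or removed.
        if name in self.nodes:
            raise ValueError(f"node {name!r} already exists")
        self.nodes[name] = list(depends_on)

    def add_rule(self, predicate, body):
        self.rules.append((predicate, body))

    def apply_rules(self):
        """Run one pass over the rule record; return how many bodies were applied."""
        count = 0
        for index, (predicate, body) in enumerate(self.rules):
            if index not in self.applied and predicate(self):
                body(self)
                self.applied.add(index)
                count += 1
        return count


wf = Workflow()
# Rule: once node "A" exists, append node "B" downstream of it.
wf.add_rule(
    lambda w: "A" in w.nodes,
    lambda w: w.add_node("B", depends_on=["A"]),
)

before = wf.apply_rules()  # "A" is missing, so the predicate fails
wf.add_node("A")
after = wf.apply_rules()   # the predicate holds now and the body appends "B"
```

In a real executor the call to `apply_rules` would be made periodically while tasks are running, which is what lets the graph grow as results come in.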

Rules for Rules

There are a couple of rules that the rules themselves need to obey:

- It is the responsibility of the predicate to signal that all necessary nodes for the body are present in the DAG. Examples are:
  - wait until no nodes of a particular type could possibly be added to the DAG. This requires us to know what kinds of edges are valid on a global level.
  - wait until a certain number of nodes are present in the DAG
  - select a certain set of nodes by their unique id (useful to attach a sub-DAG to an existing node from within that node)
- The only valid edges that can be added dynamically are ones that point away from existing nodes to new nodes. Edges directed towards existing nodes would introduce new dependencies that were not present before, and the job in question might have already run or be currently running.
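The edge restriction can be made concrete with a small guard function. This is a sketch under assumptions: the DAG is a plain dictionary mapping node names to sets of dependencies, and `append_subdag` is a hypothetical helper, not part of adage. New nodes must be listed in dependency order, so each one may depend only on nodes that are already known at that point.

```python
def append_subdag(dag, new_nodes):
    """Append (name, dependencies) pairs to `dag` without touching existing nodes.

    Edges may only point from known nodes to new nodes. Everything is
    validated first, so a rejected call leaves `dag` unchanged.
    """
    known = set(dag)
    for name, deps in new_nodes:
        if name in known:
            # Redefining a node would give an existing job new dependencies.
            raise ValueError(f"node {name!r} already exists")
        missing = set(deps) - known
        if missing:
            raise ValueError(f"unknown dependencies for {name!r}: {sorted(missing)}")
        known.add(name)
    for name, deps in new_nodes:
        dag[name] = set(deps)


dag = {"A": set()}

# Valid: edges run from the existing node "A" to the new nodes "B" and "C".
append_subdag(dag, [("B", ["A"]), ("C", ["A", "B"])])

# Invalid: this would add a dependency to the existing node "A".
try:
    append_subdag(dag, [("A", ["C"])])
    rejected = False
except ValueError:
    rejected = True
```

Because every new node depends only on nodes that were known before it, appending in this way cannot introduce a cycle.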

Project details

Release history

Version	Release date
0.11.0 (this version)	Feb 8, 2023
0.10.3	Feb 3, 2022
0.10.2	Dec 17, 2021
0.10.1	Sep 30, 2019
0.10.0	Jun 22, 2019
0.9.3	Jun 20, 2019
0.9.2	Jun 1, 2019
0.9.0	Jul 15, 2018
0.8.7	Jul 11, 2018
0.8.6	Mar 21, 2018
0.8.5	Dec 19, 2017
0.8.4	Dec 18, 2017
0.8.3	Oct 9, 2017
0.8.2	Sep 27, 2017
0.8.1	Aug 3, 2017
0.8.0	Aug 1, 2017
0.7.3	Jul 25, 2017
0.7.2	Jul 22, 2017
0.7.1	Apr 10, 2017
0.7.0	Apr 3, 2017
0.6.1	Apr 1, 2017
0.6.0	Mar 29, 2017
0.5.6	Dec 18, 2016
0.5.5	Nov 18, 2016
0.5.4	Oct 15, 2016
0.5.3	Aug 28, 2016
0.5.2	Jul 21, 2016
0.5.1	Jul 21, 2016
0.5.0	Jul 13, 2016
0.4.0	Jul 13, 2016
0.3.0	May 5, 2016
0.2.0	Mar 5, 2016
0.1.7.1	Dec 11, 2015
0.1.7	Aug 14, 2015
0.1.6	May 1, 2015
0.1.5	Apr 25, 2015

Download files

Source Distribution

adage-0.11.0.tar.gz (18.1 kB)

Uploaded Feb 8, 2023 Source

Built Distribution

adage-0.11.0-py3-none-any.whl (18.8 kB)

Uploaded Feb 8, 2023 Python 3

File details

Details for the file adage-0.11.0.tar.gz.

File metadata

Download URL: adage-0.11.0.tar.gz
Upload date: Feb 8, 2023
Size: 18.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.11.1

File hashes

Hashes for adage-0.11.0.tar.gz
Algorithm	Hash digest
SHA256	`cd93f2682a1003560ac7fdf7df2d1d6de960143cd0c50d862c794d5cfe907a03`
MD5	`7f0ad217267c5d13dd493b7c933787f0`
BLAKE2b-256	`d98e9273a34dcf8db672e884520891fce1d084396a4256c80d3a5cb80c7cc5db`
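
A downloaded copy of the source distribution can be checked against the SHA256 digest listed above using only the Python standard library. The helper names `sha256_of` and `matches_expected` are illustrative, and the sketch assumes the file has already been saved locally under whatever path is passed in.

```python
import hashlib

# SHA256 digest of adage-0.11.0.tar.gz as listed in the table above.
EXPECTED_SHA256 = "cd93f2682a1003560ac7fdf7df2d1d6de960143cd0c50d862c794d5cfe907a03"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected.lower()


# Example call, assuming the archive sits in the current directory:
# matches_expected("adage-0.11.0.tar.gz")
```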

File details

Details for the file adage-0.11.0-py3-none-any.whl.

File metadata

Download URL: adage-0.11.0-py3-none-any.whl
Upload date: Feb 8, 2023
Size: 18.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.11.1

File hashes

Hashes for adage-0.11.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4d7c132c6c94b8cefbc01f914e3b55e0ffffcf874ba80f7cd59e09a7daf04cd5`
MD5	`e489317ee692aac331963c1506ac7621`
BLAKE2b-256	`aaf2f20e5551bba33a076c9f83cb8934eb7a9d813e25a41374d8a1f6cb9c4a81`
