pypipegraph

A workflow (job) engine/pipeline for bioinformatics and scientific computing.

These details have not been verified by PyPI

Project links

Homepage

Project description

# pypipegraph

| Build status: | [![Build Status](https://travis-ci.com/TyberiusPrime/pypipegraph.svg?branch=master)](https://travis-ci.com/TyberiusPrime/pypipegraph)|
|---------------|-----------------------------------------------------------------------------|
| Documentation | https://pypipegraph.readthedocs.io/en/latest/
| Code style | ![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/ambv/black)

## Introduction

[pypipegraph](https://github.com/IMTMarburg/pypipegraph): is an
MIT-licensed library for constructing a workflow piece by piece and
executing just the parts of it that need to be (re-)done. It supports
using multiple cores (SMP) and (eventually, alpha code right now)
machines (cluster) and is a hybrid between a dependency tracker (think
'make') and a cluster engine.

More specifically, you construct Jobs, which encapsulate output (i.e.
stuff that needs to be done), invariants (which force re-evaluation of
output jobs if they change), and stuff inbetween (e.g. load data from
disk).

From your point of view, you create a pypipegraph, you create jobs,
chain them together, then ask the pypipegraph to run. It examines all
jobs for their need to run (either because the have not been finished,
or because they have been invalidated), distributes them across multiple
python instances, and get's them executed in a sensible order.

It is robust against jobs dying for whatever reason (only the failed job
and everything 'downstream' will be affected, independend jobs will
continue running), allows you to resume at any point 'in between' jobs,
and isolates jobs against each other.

pypipegraph supports Python 3 only.

## 30 second summary

```python
pypipegraph.new_pipeline()
output_filenameA = 'sampleA.txt'
def do_the_work():
op = open(output_filename, 'wb').write("hello world")
jobA = pypipegraph.FileGeneratingJob(output_filenameA, do_the_work)
output_filenameB = 'sampleB.txt'
def do_the_work():
op = open(output_filenameB, 'wb').write(open(output_filenameA, 'rb').read() + ", once again")
jobB = pypipegraph.FileGeneratingJob(output_filenameB, do_the_work)
jobB.depends_on(jobA)
pypipegraph.run()
print('the pipegraph is done and has returned control to you.')
print('sampleA.txt contains "hello world"')
print('sampleB.txt contains "hello world, once again")
```

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.197

Jul 12, 2021

0.196

Jul 12, 2021

0.195

Apr 14, 2021

0.194

Apr 14, 2021

0.193

Oct 15, 2020

0.192

Oct 13, 2020

0.191

Aug 14, 2020

0.190

Jul 27, 2020

0.189

Nov 22, 2019

0.188

Nov 13, 2019

0.187

Aug 28, 2019

0.185

May 7, 2019

0.184

May 7, 2019

0.183

Mar 13, 2019

0.182

Mar 13, 2019

This version

0.181

Mar 11, 2019

0.180

Jan 23, 2019

0.178

Jan 11, 2019

0.177

Jan 9, 2019

0.175

Jan 4, 2019

0.174

Nov 23, 2018

0.173

Nov 23, 2018

0.172

Nov 23, 2018

0.171

Nov 23, 2018

0.170

Nov 15, 2018

0.160

May 14, 2018

0.159

Dec 11, 2017

0.158

Nov 22, 2017

0.157

May 22, 2012

0.156

Apr 20, 2012

0.151

Apr 12, 2012

0.128

Jan 30, 2012

0.126

Jan 30, 2012

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pypipegraph-0.181.tar.gz (131.3 kB view hashes)

Uploaded Mar 11, 2019 Source

Hashes for pypipegraph-0.181.tar.gz

Hashes for pypipegraph-0.181.tar.gz
Algorithm	Hash digest
SHA256	`58b99288b815149216e83836f7d0a1e81c50f4f2bdc00620132b52df2f8f43a5`
MD5	`03b9205fb5d990a27c806d28663c978d`
BLAKE2b-256	`2867feab534fe6f4384fbf8aa1772076af37f2dcfaa48ae62cf8ee5a360118cc`