StreamFlow framework
Project description
StreamFlow
The StreamFlow framework is a container-native Workflow Management System (WMS) written in Python 3. It has been designed around two main principles:
- Allow the execution of tasks in multi-container environments, in order to support concurrent execution of multiple communicating tasks in a multi-agent ecosystem.
- Relax the requirement of a single shared data space, in order to allow for hybrid workflow executions on top of multi-cloud or hybrid cloud/HPC infrastructures.
Use StreamFlow
PyPI
The StreamFlow module is available on PyPI, so you can install it using pip.
pip install streamflow
Please note that StreamFlow requires python >= 3.8. Then you can execute it directly from the CLI
streamflow run /path/to/streamflow.yml
Docker
StreamFlow Docker images are available on Docker Hub. In order to run a workflow inside the StreaFlow image
- A StreamFlow project, containing a
streamflow.ymlfile and all the other relevant dependencies (e.g. a CWL description of the workflow steps and a Helm description of the execution environment) need to be mounted as a volume inside the container, for example in the/streamflow/projectfolder - Workflow outputs, if any, will be stored in the
/streamflow/resultsfolder. Therefore, it is necessary to mount such location as a volume in order to persist the results - StreamFlow will save all its temporary files inside the
/tmp/streamflowlocation. For debugging purposes, or in order to improve I/O performances in case of huge files, it could be useful to mount also such location as a volume - The path of the
streamflow.ymlfile inside the container (e.g./streamflow/project/streamflow.yml) must be passed as an argument to the Docker container
The script below gives an example of StreamFlow execution in a Docker container
docker run -d \
--mount type=bind,source="$(pwd)"/my-project,target=/streamflow/project \
--mount type=bind,source="$(pwd)"/results,target=/streamflow/results \
--mount type=bind,source="$(pwd)"/tmp,target=/tmp/streamflow \
alphaunito/streamflow run /streamflow/project/streamflow.yml
Kubernetes
It is also possible to execute the StreamFlow container as a Job in Kubernetes.
In this case, StreamFlow is able to deploy Helm models directly on the parent cluster through the
ServiceAccount credentials. In order to do that, the inCluster option must be set to true for each
involved module on the streamflow.yml file
models:
helm-model:
type: helm
config:
inCluster: true
...
A Helm template of a StreamFlow Job can be found in the helm/chart folder.
Please note that, in case RBAC is active on the
Kubernetes cluster, a proper RoleBinding must be attached to the ServiceAccount object, in order to give
StreamFlow the permissions to manage deployments of pods and executions of tasks.
CWL Compatibility
StreamFlow relies on the Common Workflow Language (CWL) standard to design workflow models. CWL conformance badges for StreamFlow are reported below.
CWL v1.0
Classes
Required features
Optional features
CWL v1.1
Classes
Required features
Optional features
CWL v1.2
Classes
Required features
Optional features
Contribute to StreamFlow
As a first step, get StreamFlow from GitHub
git clone git@github.com:alpha-unito/streamflow.git
Then you can install all the requred packages using the pip install command
cd streamflow
pip install .
StreamFlow relies on GitHub Actions for PyPI and Docker Hub distributions. Therefore, in order to publish a
new version of the software, you only have to augment the version number in version.py file.
StreamFlow Team
Iacopo Colonnelli iacopo.colonnelli@unito.it (creator and maintainer)
Barbara Cantalupo barbara.cantalupo@unito.it (maintainer)
Marco Aldinucci aldinuc@di.unito.it (maintainer)
Gaetano Saitta gaetano.saitta@edu.unito.it (contributor)
Alberto Mulone alberto.mulone@edu.unito.it (contributor)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file streamflow-0.1.6.tar.gz.
File metadata
- Download URL: streamflow-0.1.6.tar.gz
- Upload date:
- Size: 150.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0e0356323ecf910dd8ced3ceff1450ee30ec4b8a17b4a6cf39fdc8812659bc01
|
|
| MD5 |
4b6b177f4218fc1c35dd0f533d570d99
|
|
| BLAKE2b-256 |
130f2ae2f50517922e2736bdb75f92802f6037bfa32efca56fee8f76f77221c7
|
File details
Details for the file streamflow-0.1.6-py2.py3-none-any.whl.
File metadata
- Download URL: streamflow-0.1.6-py2.py3-none-any.whl
- Upload date:
- Size: 175.1 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0ed03ddaa1008a2de967b5dcbe1c8d7b36ca538a11704790a8fac16fd236b077
|
|
| MD5 |
287243176716aa7ca11bce2d27edb19d
|
|
| BLAKE2b-256 |
be7e762f0495fdbf0a454eaa4dfbf167754f9a9d76a43f0976b66a250d493890
|