FlowCept is a runtime data integration system that empowers any data processing system to capture and query workflow provenance data using data observability, requiring minimal or no changes in the target system code. It seamlessly integrates data from multiple workflows, enabling users to comprehend complex, heterogeneous, and large-scale data from various sources in federated environments.

FlowCept

FlowCept is intended for scenarios where multiple workflows in a science campaign or an enterprise run and generate important data that must be analyzed in an integrated manner. Because these workflows may use different data manipulation tools (e.g., provenance or lineage capture tools, database systems, performance profiling tools) or run within different parallel computing systems (e.g., Dask, Spark, Workflow Management Systems), FlowCept's key differentiator is its ability to seamlessly and automatically integrate data from various workflows using data observability. It builds an integrated data view at runtime, enabling end-to-end exploratory data analysis and monitoring. It follows W3C PROV recommendations for its data schema. It does not require changes to user code or systems (i.e., instrumentation). Users only need to create an adapter for their system or tool if one is not yet available.

Currently, FlowCept provides adapters for: Dask, MLFlow, TensorBoard, and Zambeze.
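
The adapter idea can be illustrated with a small sketch. This is not FlowCept's API: the TaskRecord class, the two adapter functions, the event field names, and the integrate helper are all hypothetical, chosen only to show how tool-specific events might be translated into one common record shape, loosely following the W3C PROV notion of an activity that used inputs and generated outputs.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List, Tuple


@dataclass
class TaskRecord:
    """Hypothetical common record: a task (activity) with what it used and generated."""
    task_id: str
    source: str  # which kind of tool produced the raw event
    used: Dict[str, Any] = field(default_factory=dict)
    generated: Dict[str, Any] = field(default_factory=dict)


# Hypothetical adapters: each one translates a tool's raw event into a TaskRecord.
# The event field names below are invented for illustration.
def from_scheduler_event(event: Dict[str, Any]) -> TaskRecord:
    return TaskRecord(
        task_id=event["key"],
        source="scheduler",
        used=event.get("args", {}),
        generated=event.get("result", {}),
    )


def from_tracking_event(event: Dict[str, Any]) -> TaskRecord:
    return TaskRecord(
        task_id=event["run_id"],
        source="tracker",
        used=event.get("params", {}),
        generated=event.get("metrics", {}),
    )


ADAPTERS: Dict[str, Callable[[Dict[str, Any]], TaskRecord]] = {
    "scheduler": from_scheduler_event,
    "tracker": from_tracking_event,
}


def integrate(events: List[Tuple[str, Dict[str, Any]]]) -> List[TaskRecord]:
    """Route each (kind, payload) pair through its adapter into one unified list."""
    return [ADAPTERS[kind](payload) for kind, payload in events]
```

The point of the sketch is that the observed tools stay unchanged; only the adapter knows the tool-specific event format.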

See the Jupyter Notebooks for utilization examples.

See the Contributing file for guidelines on contributing new adapters. Note that the codebase may use the term 'plugin' as a synonym for adapter. Future releases should standardize the terminology on 'adapter'.

Install and Setup:

Install FlowCept:

Run pip install .[full] in this directory (or pip install flowcept[full]).

For convenience, this installs the dependencies for all adapters, including adapters you may not use. To install only what you need, run pip install .[adapter_key1,adapter_key2] with the keys of the adapters we have implemented, e.g., pip install .[dask]. See extra_requirements if you want to install the dependencies individually.
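
After a selective install, it can be useful to confirm which optional packages are actually importable. The sketch below uses only the standard library. The module names passed in the example call (dask, mlflow) are assumptions about what the corresponding extras provide, not something stated in this README.

```python
import importlib.util
from typing import Iterable, List


def missing_modules(module_names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in module_names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumed module names; adjust to the adapters you installed.
    print("Not installed:", missing_modules(["dask", "mlflow"]))
```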

Start MongoDB and Redis:

To take full advantage of FlowCept, the user needs to run Redis, which serves as FlowCept's message queue system, and MongoDB, which serves as FlowCept's main database system. The easiest way to start both services is to use the docker-compose file provided for these dependent services. You only need RabbitMQ if you also want to observe Zambeze messages.

Define the settings (e.g., routes and ports) accordingly in the settings.yaml file.
Start the observation using the Controller API, as shown in the Jupyter Notebooks.
To use FlowCept's Query API, see utilization examples in the notebooks.
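
Before starting the observation, a quick connectivity check on the two services can save debugging time. The following is a minimal sketch that is independent of FlowCept. It assumes Redis and MongoDB listen on localhost at their well-known default ports (6379 and 27017); use whatever hosts and ports your settings.yaml actually defines.

```python
import socket


def is_reachable(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to (host, port) can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Assumed hosts and ports; replace with the values from your settings.yaml.
    services = {"Redis": ("localhost", 6379), "MongoDB": ("localhost", 27017)}
    for name, (host, port) in services.items():
        status = "up" if is_reachable(host, port) else "not reachable"
        print(f"{name} at {host}:{port} is {status}")
```

This only verifies that a port accepts TCP connections; it does not confirm that the service behind it is healthy or correctly configured.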

Performance Tuning for Performance Evaluation

In the settings.yaml file, the following variables might impact interception performance:

main_redis:
  buffer_size: 50
  insertion_buffer_time_secs: 5

plugin:
  enrich_messages: false

Other variables may also matter, depending on the plugin. For instance, in Dask, timestamp creation by workers adds interception overhead.
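
To build intuition for why buffer_size and insertion_buffer_time_secs can affect performance, here is an illustrative sketch of a size-or-time flushing buffer. It is not FlowCept's implementation: the TimedBuffer class and its parameters are hypothetical, and it assumes the two settings mean "flush after this many messages" and "flush after this many seconds", which the README does not spell out.

```python
import time
from typing import Any, Callable, List


class TimedBuffer:
    """Illustrative buffer: flushes when it holds `buffer_size` items or when
    `max_wait_secs` have passed since the last flush, whichever comes first.
    The time condition is only checked when an item is added."""

    def __init__(
        self,
        flush: Callable[[List[Any]], None],
        buffer_size: int = 50,
        max_wait_secs: float = 5.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._flush = flush
        self._buffer_size = buffer_size
        self._max_wait_secs = max_wait_secs
        self._clock = clock
        self._items: List[Any] = []
        self._last_flush = clock()

    def add(self, item: Any) -> None:
        self._items.append(item)
        full = len(self._items) >= self._buffer_size
        stale = self._clock() - self._last_flush >= self._max_wait_secs
        if full or stale:
            self.flush()

    def flush(self) -> None:
        if self._items:
            self._flush(self._items)  # hand the whole batch to the sink
            self._items = []          # start a fresh batch
        self._last_flush = self._clock()
```

Under these assumptions, a larger buffer or a longer wait means fewer, bigger writes to the database at the cost of data becoming visible later; smaller values do the opposite.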

Acknowledgement

This research uses resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.

flowcept 0.1.9
