Lightweight Declarative Data Framework
Project description
DeFlow
A Lightweight Declarative Data Framework that allow you to run data pipelines by YAML config template.
[!NOTE] I want to use this project is the real-world use-case for my Workflow package that able to handle production data pipeline with the DataOps strategy.
[!WARNING] This framework does not allow you to custom your pipeline yet. If you want to create your workflow, you can implement it by your custom template reference this package.
In my opinion, I think it should not create duplicate workflow codes if I can write with dynamic input parameters on the one template workflow that just change the input parameters per use-case instead. This way I can handle a lot of logical workflows in our orgs with only metadata configuration. It called Metadata Driven Data Workflow.
📦 Installation
pip install -U deflow
:dart: Usage
Version 1
[!NOTE] This project will create the data framework Version 1 first.
After initialize your data framework project with Version 1, your data pipeline config files will store with this file structure:
conf/
├─ conn/
│ ├─ c_conn_01.yml
│ ╰─ c_conn_02.yml
├─ routes/
│ ╰─ routing.yml
╰─ stream/
╰─ s_stream_01/
├─ g_group_01.tier.priority/
│ ├─ p_proces_01.yml
│ ╰─ p_proces_02.yml
├─ g_group_02.tier.priority/
│ ├─ p_proces_01.yml
│ ╰─ p_proces_02.yml
╰─ config.yml
You can run the data flow by:
from deflow.flow import Flow
from ddeutil.workflow import Result
flow: Result = (
Flow(name="s_stream_01")
.option("conf_paths", ["./data/conf"])
.run(mode="N")
)
Version 2
[!NOTE] This version is the same DAG and Task strategy like Airflow.
After initialize your data framework project with Version 2, your data pipeline config files will store with this file structure:
conf/
├─ conn/
│ ├─ c_conn_01.yml
│ ╰─ c_conn_02.yml
├─ routes/
│ ╰─ routing.yml
├─ pipeline/
│ ╰─ p_pipe_01/
│ ├─ config.yml
│ ├─ n_node_01.yml
│ ╰─ n_node_02.yml
╰─ .configore
:cookie: Configuration
| Name | Component | Default | Description |
|---|---|---|---|
| DEFLOW_CORE_CONF_PATH | CORE | ./conf |
A config path to get data framework configuration. |
| DEFLOW_CORE_VERSION | CORE | v1 |
A specific data framework version. |
Support data framework version:
| Version | Supported | Description |
|---|---|---|
| 1 | Progress | A data framework that base on stream, group, and process. |
| 2 | Progress | A data framework that base on pipeline, and node. |
💬 Contribute
I do not think this project will go around the world because it has specific propose, and you can create by your coding without this project dependency for long term solution. So, on this time, you can open the GitHub issue on this project 🙌 for fix bug or request new feature if you want it.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file deflow-0.0.3.tar.gz.
File metadata
- Download URL: deflow-0.0.3.tar.gz
- Upload date:
- Size: 27.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dea297954af9e27a77e22908b2cab8302e4056ed463fcae9f8be82da4b7355a5
|
|
| MD5 |
917ac59e4e6dfe880a5f4e8d8ce5428a
|
|
| BLAKE2b-256 |
811e3b39124c42d7630ce91288c8dd660f64435c4a68cfb807969f2dfaea7d6c
|
Provenance
The following attestation bundles were made for deflow-0.0.3.tar.gz:
Publisher:
publish.yml on ddeutils/deflow
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
deflow-0.0.3.tar.gz -
Subject digest:
dea297954af9e27a77e22908b2cab8302e4056ed463fcae9f8be82da4b7355a5 - Sigstore transparency entry: 234249263
- Sigstore integration time:
-
Permalink:
ddeutils/deflow@fa2e3776c0c25f94b748bfdd2d6f0f6c1095f986 -
Branch / Tag:
refs/tags/v0.0.3 - Owner: https://github.com/ddeutils
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@fa2e3776c0c25f94b748bfdd2d6f0f6c1095f986 -
Trigger Event:
release
-
Statement type:
File details
Details for the file deflow-0.0.3-py3-none-any.whl.
File metadata
- Download URL: deflow-0.0.3-py3-none-any.whl
- Upload date:
- Size: 22.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
50c99b8444a4c4d5a605ba6f6c8930873de4f9fd682bce8a126dae025a1cd2ba
|
|
| MD5 |
651dfe9941daeeb6229270257ab881a6
|
|
| BLAKE2b-256 |
03ac882e958316da233b351228221873ee7abd536c07abbfb1428f21b8c6e19e
|
Provenance
The following attestation bundles were made for deflow-0.0.3-py3-none-any.whl:
Publisher:
publish.yml on ddeutils/deflow
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
deflow-0.0.3-py3-none-any.whl -
Subject digest:
50c99b8444a4c4d5a605ba6f6c8930873de4f9fd682bce8a126dae025a1cd2ba - Sigstore transparency entry: 234249266
- Sigstore integration time:
-
Permalink:
ddeutils/deflow@fa2e3776c0c25f94b748bfdd2d6f0f6c1095f986 -
Branch / Tag:
refs/tags/v0.0.3 - Owner: https://github.com/ddeutils
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@fa2e3776c0c25f94b748bfdd2d6f0f6c1095f986 -
Trigger Event:
release
-
Statement type: