Skip to main content

A workflow dendency graph compiler and automation enabler

Project description

bioblueprint

bioblueprint is a python library designed to enable workflow language-interchangeable dependency graph compilation and development automation. It operates by compiling workflows, their dependencies, Git diff between branches, and using the modified files to trace testing paths within the dependency graph.

NOTE: bioblueprint operates on local branches


Install

python3 -m pip install bioblueprint


Usage

Run bioblueprint after a development iteration or to automate testing, then edit the "DESCRIPTION" field in the IO TSVs. You can also run after PRs have been generated.

Please see the help menu for a comprehensive list of input options.

bioblueprint -i <REPO_BASE_DIR> -d <DEVELOPMENT_BRANCH>

DEVELOPMENT: -d is the dev branch; -s is main (default)

VALIDATION: -d is main; -s is the previous release tag

PULL REQUESTS: A blank pull request is generated by default, but append -pr # to pull and use an existing PR.


Outputs

An output directory bioblueprint_YYYYmmdd/ will be generated containing the following files:

<REPO>.pr.md

A populated pull request template with I/O modifications, WF modifications, and testing paths. If -pr is specified, the PR will be downloaded and relevant fields populated with I/O and testing information - existing testing data will be retained and unmodified if formatted as a checklist with exact workflow name matches that are the first entry following the markdown checkbox (links are permitted). This function is tailored for accounted repositories:

<REPO>_inputs.tsv & <REPO>_outputs.tsv

Updated inputs/outputs tables for Public Health Bioinformatics

testing/<WORKFLOW>.testing.json

A JSON formatted with testing parameters (designed for bioforklift integration) for hosted workflows:

{
  "<WF_NAME>": {
    "path": "<PATH_RELATIVE_TO_REPO>",
    "modified": 
        "<TASK/WF_1>",
        ..
    ],
    "workflow_name": "<HOSTED_NAME>",
    "branch": "<DERIVED_BRANCH>",
    "repository": "<OWNER/REPOSITORY>",
    "table": "<WF_NAME>_<TESTING_SUFFIX>",
    "comment": "PR: <PR#>",
    "input_json": "<REPOSITORY_INPUTS_JSON>",
    "output_json": "<AVAILABLE_OUTPUTS>"
  },
  ..
}

The input_json is derived from an inputs JSON file hosted in the repository that corresponds to the testing table that is hosted in the Terra workspace.

<REPO>.io.json

A JSON formatted to convey inputs and outputs, including defaults and types, for workflows:

{
  <WF_NAME_1>: {
    "path": <PATH_RELATIVE_TO_REPO>,
    "inputs": {
        <INPUT_1>:
        {
            "type": <WF_LANGUAGE_TYPE>,
            "default": <DEFAULT_VAL>
        },
        ..
    },
    "outputs": {
        <OUTPUT_1>: <WF_LANGUAGE_TYPE>,
        ..
    }
  },
  ..
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bioblueprint-1.2.0.tar.gz (31.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bioblueprint-1.2.0-py3-none-any.whl (34.3 kB view details)

Uploaded Python 3

File details

Details for the file bioblueprint-1.2.0.tar.gz.

File metadata

  • Download URL: bioblueprint-1.2.0.tar.gz
  • Upload date:
  • Size: 31.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for bioblueprint-1.2.0.tar.gz
Algorithm Hash digest
SHA256 c4c9185abbeec95b6fdcbae47321b436222b5930080e46152364a7ae82e98171
MD5 ecfcbb556baed77249a16f6199d1505f
BLAKE2b-256 fb2b5a73061228164ba39e2da738548935c02f2ad9a846ae91e916bfb173e98c

See more details on using hashes here.

File details

Details for the file bioblueprint-1.2.0-py3-none-any.whl.

File metadata

  • Download URL: bioblueprint-1.2.0-py3-none-any.whl
  • Upload date:
  • Size: 34.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for bioblueprint-1.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1c3f007492a71e6f799de00ac895b1a2bfbdbd115537ad4cc6d1cb11def92509
MD5 4054f38d7db7950fa9c5f00f4fa4a009
BLAKE2b-256 1c66c312c40a6e07fc7680adb78b0d624423905763f1c20823b2a5b8275b594f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page