
A workflow dependency graph compiler and automation enabler


bioblueprint

bioblueprint is a Python library for workflow-language-agnostic dependency graph compilation and development automation. It compiles workflows and their dependencies, computes the Git diff between branches, and uses the modified files to trace testing paths within the dependency graph.
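The tracing idea described above can be illustrated with a small sketch. This is not bioblueprint's implementation; it only assumes the dependency graph can be represented as a mapping from each workflow or task to the set of items it imports. The function name `affected_by` and the node names are hypothetical.

```python
from collections import deque


def affected_by(dependencies, modified):
    """Return every node that is modified or depends, directly or
    transitively, on a modified node.

    dependencies: dict mapping a node to the set of nodes it imports.
    modified: iterable of nodes reported as changed by the Git diff.
    """
    # Invert the graph so each node points at the nodes that import it.
    reverse = {}
    for node, deps in dependencies.items():
        reverse.setdefault(node, set())
        for dep in deps:
            reverse.setdefault(dep, set()).add(node)

    # Breadth-first walk upward from the modified nodes.
    seen = set(modified)
    queue = deque(seen)
    while queue:
        current = queue.popleft()
        for parent in reverse.get(current, ()):
            if parent not in seen:
                seen.add(parent)
                queue.append(parent)
    return seen


# Hypothetical graph: wf_c imports wf_b, which imports task_y.
graph = {
    "wf_a": {"task_x", "task_y"},
    "wf_b": {"task_y"},
    "wf_c": {"wf_b"},
}
```

In this sketch, a change to `task_y` would flag `wf_a`, `wf_b`, and `wf_c` for testing, while a change to `task_x` would flag only `wf_a`.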

NOTE: bioblueprint operates on local branches


Install

python3 -m pip install bioblueprint


Usage

Run bioblueprint after a development iteration or to automate testing, then edit the "DESCRIPTION" field in the I/O TSVs. You can also run it after PRs have been generated.

Please see the help menu for a comprehensive list of input options.

bioblueprint -i <REPO_BASE_DIR> -d <DEVELOPMENT_BRANCH>

DEVELOPMENT: -d is the dev branch; -s is main (default)

VALIDATION: -d is main; -s is the previous release tag

PULL REQUESTS: A blank pull request is generated by default; append -pr # to pull and use an existing PR instead.
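The three modes above differ only in which flags are passed. The sketch below assembles the argument list for each mode using only the flags named in this section (`-i`, `-d`, `-s`, `-pr`). The helper `build_command` is hypothetical, and it assumes that `-s` takes a branch or tag name and `-pr` takes a PR number; consult the help menu for the authoritative options.

```python
def build_command(repo_dir, dev_branch, source_branch=None, pr_number=None):
    """Assemble a bioblueprint command line as a list of arguments."""
    cmd = ["bioblueprint", "-i", str(repo_dir), "-d", dev_branch]
    if source_branch is not None:
        # Assumption: -s accepts a branch or tag name; main is the default.
        cmd += ["-s", source_branch]
    if pr_number is not None:
        # Assumption: -pr accepts the number of an existing PR.
        cmd += ["-pr", str(pr_number)]
    return cmd


# Development: compare a dev branch against main (the default for -s).
development = build_command("path/to/repo", "my-feature")

# Validation: compare main against the previous release tag (hypothetical tag).
validation = build_command("path/to/repo", "main", source_branch="v1.0.0")

# Pull request: reuse an existing PR (hypothetical PR number).
pull_request = build_command("path/to/repo", "my-feature", pr_number=123)
```

Each list can be passed to `subprocess.run(cmd, check=True)` from an automation script.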


Outputs

An output directory bioblueprint_YYYYmmdd/ will be generated containing the following files:

<REPO>.pr.md

A populated pull request template with I/O modifications, WF modifications, and testing paths. If -pr is specified, the PR is downloaded and the relevant fields are populated with I/O and testing information. Existing testing data is retained unmodified if it is formatted as a checklist in which the exact workflow name is the first entry following the markdown checkbox (links are permitted). This function is tailored for accounted repositories.
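To make the checklist rule concrete, the sketch below shows one way to recognise checklist lines whose first entry after the checkbox is an exact workflow name, either bare or wrapped in a markdown link. It is an illustration of the formatting requirement only, not bioblueprint's parser; the pattern, the function name `retained_results`, and the workflow names are assumptions.

```python
import re

# "- [x] Name ..." or "- [ ] [Name](url) ..."; illustrative pattern only.
CHECKBOX = re.compile(
    r"^\s*[-*]\s+\[([ xX])\]\s+(?:\[([^\]]+)\]\([^)]*\)|([\w.-]+))"
)


def retained_results(markdown, workflow_names):
    """Map each recognised workflow name to whether its box is checked."""
    results = {}
    for line in markdown.splitlines():
        match = CHECKBOX.match(line)
        if not match:
            continue
        name = match.group(2) or match.group(3)
        # Only exact workflow-name matches in first position are kept.
        if name in workflow_names:
            results[name] = match.group(1).lower() == "x"
    return results


# Hypothetical PR body fragment.
pr_body = """\
- [x] Workflow_A: passed
- [ ] [Workflow_B](https://example.org/run)
- [x] rerun Workflow_C
Some prose that is not a checklist item.
"""
names = {"Workflow_A", "Workflow_B", "Workflow_C"}
```

Here `Workflow_C` is not recognised because its name is not the first entry after the checkbox.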

<REPO>_inputs.tsv & <REPO>_outputs.tsv

Updated inputs/outputs tables for Public Health Bioinformatics

testing/<WORKFLOW>.testing.json

A JSON file with testing parameters (designed for bioforklift integration) for hosted workflows:

{
  "<WF_NAME>": {
    "path": "<PATH_RELATIVE_TO_REPO>",
    "modified": [
        "<TASK/WF_1>",
        ..
    ],
    "workflow_name": "<HOSTED_NAME>",
    "branch": "<DERIVED_BRANCH>",
    "repository": "<OWNER/REPOSITORY>",
    "table": "<WF_NAME>_<TESTING_SUFFIX>",
    "comment": "PR: <PR#>",
    "input_json": "<REPOSITORY_INPUTS_JSON>",
    "output_json": "<AVAILABLE_OUTPUTS>"
  },
  ..
}

The input_json is derived from an inputs JSON file hosted in the repository; that file corresponds to the testing table hosted in the Terra workspace.
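A downstream script might read the testing JSON along these lines. This is a minimal sketch that assumes only the keys shown in the template above; the function name `summarize_testing` and all example values are hypothetical, and the placeholder strings for `input_json` and `output_json` are left as in the template because their concrete form is not specified here.

```python
import json


def summarize_testing(testing):
    """List each workflow with its hosted name, table, and change count."""
    rows = []
    for wf_name, entry in testing.items():
        rows.append({
            "workflow": wf_name,
            "hosted_name": entry["workflow_name"],
            "table": entry["table"],
            "modified_count": len(entry.get("modified", [])),
        })
    return rows


# Hypothetical file content following the template's structure.
example = json.loads("""
{
  "Example_WF": {
    "path": "workflows/example_wf",
    "modified": ["task_x", "task_y"],
    "workflow_name": "Example_WF_Hosted",
    "branch": "my-feature",
    "repository": "owner/repository",
    "table": "Example_WF_testing",
    "comment": "PR: 123",
    "input_json": "<REPOSITORY_INPUTS_JSON>",
    "output_json": "<AVAILABLE_OUTPUTS>"
  }
}
""")

summary = summarize_testing(example)
```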

<REPO>.io.json

A JSON file that conveys inputs and outputs, including defaults and types, for workflows:

{
  "<WF_NAME_1>": {
    "path": "<PATH_RELATIVE_TO_REPO>",
    "inputs": {
        "<INPUT_1>": {
            "type": "<WF_LANGUAGE_TYPE>",
            "default": <DEFAULT_VAL>
        },
        ..
    },
    "outputs": {
        "<OUTPUT_1>": "<WF_LANGUAGE_TYPE>",
        ..
    }
  },
  ..
}
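The nested structure above can be flattened into rows, for example to compare against the inputs and outputs TSVs. The sketch below assumes only the keys shown in the template (`path`, `inputs`, `outputs`, `type`, `default`); the function names, workflow name, and type strings are hypothetical, and a missing `default` is simply reported as `None`.

```python
def flatten_inputs(io):
    """Return (workflow, input, type, default) tuples for every input."""
    rows = []
    for wf_name, wf in io.items():
        for input_name, spec in wf.get("inputs", {}).items():
            rows.append(
                (wf_name, input_name, spec.get("type"), spec.get("default"))
            )
    return rows


def flatten_outputs(io):
    """Return (workflow, output, type) tuples for every output."""
    rows = []
    for wf_name, wf in io.items():
        for output_name, output_type in wf.get("outputs", {}).items():
            rows.append((wf_name, output_name, output_type))
    return rows


# Hypothetical content following the template's structure.
io_example = {
    "Example_WF": {
        "path": "workflows/example_wf",
        "inputs": {
            "sample_name": {"type": "String", "default": None},
            "cpu": {"type": "Int", "default": 4},
        },
        "outputs": {"report": "File"},
    }
}
```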

