Skip to main content

A workflow dendency graph compiler and automation enabler

Project description

bioblueprint

bioblueprint is a python library designed to enable workflow language-interchangeable dependency graph compilation and development automation. It operates by compiling workflows, their dependencies, Git diff between branches, and using the modified files to trace testing paths within the dependency graph.

NOTE: bioblueprint operates on local branches


Install

python3 -m pip install bioblueprint


Usage

Run bioblueprint after a development iteration or to automate testing, then edit the "DESCRIPTION" field in the IO TSVs. You can also run after PRs have been generated.

Please see the help menu for a comprehensive list of input options.

bioblueprint -i <REPO_BASE_DIR> -d <DEVELOPMENT_BRANCH>

DEVELOPMENT: -d is the dev branch; -s is main (default)

VALIDATION: -d is main; -s is the previous release tag

PULL REQUESTS: A blank pull request is generated by default, but append -pr # to pull and use an existing PR.


Outputs

An output directory bioblueprint_YYYYmmdd/ will be generated containing the following files:

<REPO>.pr.md

A populated pull request template with I/O modifications, WF modifications, and testing paths. If -pr is specified, the PR will be downloaded and relevant fields populated with I/O and testing information - existing testing data will be retained and unmodified if formatted as a checklist with exact workflow name matches that are the first entry following the markdown checkbox (links are permitted). This function is tailored for accounted repositories:

<REPO>_inputs.tsv & <REPO>_outputs.tsv

Updated inputs/outputs tables for Public Health Bioinformatics

testing/<WORKFLOW>.testing.json

A JSON formatted with testing parameters (designed for bioforklift integration) for hosted workflows:

{
  "<WF_NAME>": {
    "path": "<PATH_RELATIVE_TO_REPO>",
    "modified": 
        "<TASK/WF_1>",
        ..
    ],
    "workflow_name": "<HOSTED_NAME>",
    "branch": "<DERIVED_BRANCH>",
    "repository": "<OWNER/REPOSITORY>",
    "table": "<WF_NAME>_<TESTING_SUFFIX>",
    "comment": "PR: <PR#>",
    "input_json": "<REPOSITORY_INPUTS_JSON>",
    "output_json": "<AVAILABLE_OUTPUTS>"
  },
  ..
}

The input_json is derived from an inputs JSON file hosted in the repository that corresponds to the testing table that is hosted in the Terra workspace.

<REPO>.io.json

A JSON formatted to convey inputs and outputs, including defaults and types, for workflows:

{
  <WF_NAME_1>: {
    "path": <PATH_RELATIVE_TO_REPO>,
    "inputs": {
        <INPUT_1>:
        {
            "type": <WF_LANGUAGE_TYPE>,
            "default": <DEFAULT_VAL>
        },
        ..
    },
    "outputs": {
        <OUTPUT_1>: <WF_LANGUAGE_TYPE>,
        ..
    }
  },
  ..
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bioblueprint-1.2.4.tar.gz (32.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bioblueprint-1.2.4-py3-none-any.whl (34.5 kB view details)

Uploaded Python 3

File details

Details for the file bioblueprint-1.2.4.tar.gz.

File metadata

  • Download URL: bioblueprint-1.2.4.tar.gz
  • Upload date:
  • Size: 32.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for bioblueprint-1.2.4.tar.gz
Algorithm Hash digest
SHA256 26bcd8b7eb963e4eefdf31ffc9fa75ab978eccfe942912f6c924f4960dc085ea
MD5 b45c912c6bc3d8eabcde1700ee57a679
BLAKE2b-256 c39eaf8e51594fc287635e6434e01320f2a69c715bf817b9763de4694e37f63c

See more details on using hashes here.

File details

Details for the file bioblueprint-1.2.4-py3-none-any.whl.

File metadata

  • Download URL: bioblueprint-1.2.4-py3-none-any.whl
  • Upload date:
  • Size: 34.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for bioblueprint-1.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 2eeb50ebadd28387036c5e54bf84e5b943e943c8a0634a5fb225df5f52c9f1fb
MD5 65bfaf377f6e4114074054c9a39adad5
BLAKE2b-256 52a26b3f8bdf54eec2d01058704631a5dba082f2fd8b6cacf92adbcd4ddb3457

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page