Skip to main content

Bioinformatics tools for Janis; the Pipeline creation helper

Project description

Janis - Example Pipelines

PyPI version

This repository contains workflows written using Janis.

These workflows are installed by default, but are available on PIP with:

pip3 install janis-pipelines.pipelines

Documentation

These pipelines are documented with run instructions here: https://janis.readthedocs.io/en/latest/pipelines/index.html

More information

These pipelines are made available through the entrypoint: janis.extension=pipelines.

They can be imported in Python with the following:

from janis_pipelines import WGSGermlineGATK

WGSGermlineGATK().translate("wdl")

Pipelines

Whole genome sequencing (WGS) pipelines:

Other:

Reference files:

WGS Germline pipeline

See the - germline folder for more information, the workflow and CWL / WDL translations.

The WGS germline pipeline takes a FASTQ pair, aligns, sorts, marks duplicates and calls variants across GATK4, Strelka and VarDict. These variants are combined and sorted at the end.

These variants were validated against the Genome in a Bottle data sets to achieve:

  • Recall: 99.25%
  • Precision: 92.02%

These results were identical across 3 research institutes (combination of Slurm / PBS / Torque) and Google Cloud platform. The pipeline took approximately 27-30 hours to run at a 30x coverage, depending on the resource constraint.

WGS Somatic pipeline for tumor-normal variant discovery

See the somatic folder for more information, the workflow and CWL / WDL translations.

The WGS somatic pipeline takes normal and tumor FASTQ pairs, aligns, sorts and marks duplicates separately, and then performs tumor-normal variant discovery across GATK4, Strelka and VarDict (in somatic modes). These variants are combined and sorted at the end.

This pipeline was run successfully across a similar set of research institutes (as germline) and the cloud, however these variants have not been validated yet.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

janis-pipelines.pipelines-0.13.0.tar.gz (35.5 kB view details)

Uploaded Source

Built Distribution

janis_pipelines.pipelines-0.13.0-py3-none-any.whl (52.1 kB view details)

Uploaded Python 3

File details

Details for the file janis-pipelines.pipelines-0.13.0.tar.gz.

File metadata

File hashes

Hashes for janis-pipelines.pipelines-0.13.0.tar.gz
Algorithm Hash digest
SHA256 d558584fc7aca1c392ef8e59f3b75b40c164e69cb3fbc7f4f8927f88dada7d6d
MD5 9d79a9e98b7e9326e3f1671b21c77fa7
BLAKE2b-256 bd31a240cac10d3719628aac5b3eba79a05bffb2271d7475eda387f76b7e59d1

See more details on using hashes here.

File details

Details for the file janis_pipelines.pipelines-0.13.0-py3-none-any.whl.

File metadata

File hashes

Hashes for janis_pipelines.pipelines-0.13.0-py3-none-any.whl
Algorithm Hash digest
SHA256 730144568d1a1bf613b48e01cdc46b74d3a251d6bf497516f75628f6954dc1ce
MD5 f95c3db3bb0a81d5085201fb0a974d4c
BLAKE2b-256 b6c97c35f767f6444f572723688b6991026c4d8fb76922ba04835c4a2afad04e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page