Bioinformatics tools for Janis; the Pipeline creation helper
Project description
Janis - Example Pipelines
This repository contains workflows written using Janis.
It can be installed through PIP with:
pip3 install janis-pipelines.pipelines
This allows Janis to find these installed pipelines from the CLI (through entrypoint: janis.extension=pipelines
) and means they can be imported from any Python file.
from janis_pipelines import WGSGermlineGATK
WGSGermlineGATK().translate("wdl")
Pipelines
Whole genome sequencing (WGS) pipelines:
Other:
Quickstart
To use these pipelines, you will need to have Janis installed:
pip3 install janis-pipelines
Then you can run the pipeline through Janis:
WORKFLOW="https://raw.githubusercontent.com/PMCC-BioinformaticsCore/janis-pipelines/master/workflows/alignment/alignment.py"
# Generate inputs file
janis inputs WGSGermlineGATK > myinps.yml
# Run workflow
janis run --inputs myinps.yml WGSGermlineGATK
WGS Germline pipeline
See the - germline folder for more information, the workflow and CWL / WDL translations.
The WGS germline pipeline takes a FASTQ pair, aligns, sorts, marks duplicates and calls variants across GATK4, Strelka and VarDict. These variants are combined and sorted at the end.
These variants were validated against the Genome in a Bottle data sets to achieve:
- Recall: 99.25%
- Precision: 92.02%
These results were identical across 3 research institutes (combination of Slurm / PBS / Torque) and Google Cloud platform. The pipeline took approximately 27-30 hours to run at a 30x coverage, depending on the resource constraint.
WGS Somatic pipeline for tumor-normal variant discovery
See the somatic folder for more information, the workflow and CWL / WDL translations.
The WGS somatic pipeline takes normal and tumor FASTQ pairs, aligns, sorts and marks duplicates separately, and then performs tumor-normal variant discovery across GATK4, Strelka and VarDict (in somatic modes). These variants are combined and sorted at the end.
This pipeline was run successfully across a similar set of research institutes (as germline) and the cloud, however these variants have not been validated yet.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for janis-pipelines.pipelines-0.9.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | b81b56262a570bccc43b320b0176759abcf200910c7dd187eb067f37506b14f0 |
|
MD5 | 74732e7dbe4316823556202a31666711 |
|
BLAKE2b-256 | 9e530ba5adcf291fd8ca7b73ed254fb720498d30c10452d0f4faf4e275d49441 |
Hashes for janis_pipelines.pipelines-0.9.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | dd3768b990ab899f7e636d9e9f2c3acb3158c7d93c1bbcc867537e2a1c70fad1 |
|
MD5 | 2c38da3a06ff968b1b4f4b33ae3070c3 |
|
BLAKE2b-256 | 4cc49fc9e9e5441267f55f691ad96786e740fa5d951850e21a3cab0ff9b85c50 |