sequana-variant-calling·PyPI

A variant calling pipeline to analyse sequencing Illumina data

These details have not been verified by PyPI

Project links

Homepage

Project description

This is is the variant_calling pipeline from the Sequana projet

Overview:: Variant calling from FASTQ files
Input:: FASTQ files from Illumina Sequencing instrument
Output:: VCF and HTML files
Status:: production
Citation:: Cokelaer et al, (2017), ‘Sequana’: a Set of Snakemake NGS pipelines, Journal of Open Source Software, 2(16), 352, JOSS DOI https://doi:10.21105/joss.00352

Installation

You must install Sequana first:

pip install sequana

Then, just install this package:

pip install sequana_variant_calling

Usage

sequana_pipelines_variant_calling --help
sequana_pipelines_variant_calling --input-directory DATAPATH --run-mode local --reference measles.fa
sequana_pipelines_variant_calling --input-directory DATAPATH --run-mode slurm --reference measles.fa

This creates a directory variant_calling. You just need to execute the pipeline:

cd variant_calling
snakemake -s variant_calling.rules --stats stats.txt

This launch a snakemake pipeline. If you are familiar with snakemake, you can retrieve the fastqc.rules and config.yaml files and then execute the pipeline yourself with specific parameters:

snakemake -s variant_calling.rules --cores 4 --stats stats.txt

Or use sequanix interface.

Requirements

This pipelines requires the following executable(s):

bwa
freebayes
picard (picard-tools)
sambamba
samtools
snpEff

https://raw.githubusercontent.com/sequana/sequana_variant_calling/master/sequana_pipelines/variant_calling/dag.png

Details

Snakemake variant calling pipeline is based on tutorial written by Erik Garrison. Input reads (paired or single) are mapped using bwa and sorted with sambamba-sort. PCR duplicates are marked with sambamba-markdup. Freebayes is used to detect SNPs and short INDELs. The INDEL realignment and base quality recalibration are not necessary with Freebayes. For more information, please refer to a post by Brad Chapman on minimal BAM preprocessing methods.

The pipeline provides an analysis of the mapping coverage using sequana coverage. It detects and characterises automatically low and high genome coverage regions.

Detected variants are annotated with SnpEff if a GenBank file is provided. The pipeline does the database building automatically. Although most of the species should be handled automatically, some special cases such as particular codon table will required edition of the snpeff configuration file.

Finally, joint calling is also available and can be switch on if desired.

Rules and configuration details

Mapping

This pipeline uses the following rule from Sequana to perform the mapping and marking duplicates.

snpeff_add_locus_in_fasta
bwa_mem_dynamic
sambamba_markdup
sambamba_filter

Variant Calling

The variant calling itself depends on those rules:

freebayes
freebayes_vcf_filter

Joint variants calling

joint_freebayes
joint_freebayes_vcf_filter

Annotation

snpeff

Coverage analysis

samtools_depth
sequana_coverage

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.3.0

Feb 28, 2025

1.2.0

Jan 31, 2024

1.1.2

Jan 11, 2024

1.1.1

Dec 5, 2023

1.1.0

Dec 5, 2023

1.0.2

Oct 10, 2023

1.0.1

Jul 7, 2023

1.0.0

May 24, 2023

0.12.0

Nov 24, 2022

0.11.0

Aug 31, 2022

0.10.0

Mar 11, 2022

0.9.5

Nov 24, 2020

0.9.4

May 19, 2020

0.9.3

May 15, 2020

0.9.2

Mar 9, 2020

0.9.1

Feb 12, 2020

0.9.0

Dec 11, 2019

0.8.7

Dec 7, 2019

0.8.6

Dec 7, 2019

0.8.5

Dec 2, 2019

0.8.4

Dec 2, 2019

This version

0.8.3

Nov 29, 2019

0.8.2

Nov 29, 2019

0.8.1

Nov 29, 2019

0.8.0

Nov 29, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

sequana_variant_calling-0.8.3-py3.5.egg (421.4 kB view details)

Uploaded Nov 29, 2019 Egg

File details

Details for the file sequana_variant_calling-0.8.3-py3.5.egg.

File metadata

Download URL: sequana_variant_calling-0.8.3-py3.5.egg
Upload date: Nov 29, 2019
Size: 421.4 kB
Tags: Egg
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.4.3 requests-toolbelt/0.9.1 tqdm/4.39.0 CPython/3.5.5

File hashes

Hashes for sequana_variant_calling-0.8.3-py3.5.egg
Algorithm	Hash digest
SHA256	`fc7c209f8e56119239012fef93e6680e2476a14cdbce0d4e2c0171979403672f`
MD5	`132feec5df47bc5e56edb7384b016da6`
BLAKE2b-256	`423fb11c629fbd7ed8937b1c827db65017aa348fd3073c2a6aee048ddef7630b`

See more details on using hashes here.

sequana-variant-calling 0.8.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Installation

Usage

Requirements

Details

Rules and configuration details

Mapping

Variant Calling

Joint variants calling

Annotation

Coverage analysis

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes