Skip to main content

Exo-C Data Quality Check, Mapping, and Variant Calling, part of ExoClasma Suite

Project description

exoclasma-pipe

Description

exoclasma-pipe is a tool for reads quality control, adapters removing, mapping, and variant calling, a part of upcoming ExoClasma Suite.

Features:

  • Pool FastQ quality control via FastQC
  • Mapping pipeline: cutadapt + BWA + GATK MarkDuplicates
  • Variant calling pipeline: GATK BQSR + GATK HaplotypeCaller

This is a pre-release. Use it at your own risk!

Installation

python3 -m pip install exoclasma-pipe

Command-line dependencies

First five are available at Ubuntu repos:

apt install samtools samtools bwa bedtools fastqc cutadapt

GATK should be installed into Miniconda environment as described by the developer.

Usage

FastQC

exoclasma-pipe FastQC -f ${file_1} ${mask_1} ${file_2} -d ${output_folder}

Align

exoclasma-pipe Align -u ${unit_json_file}

Unit JSON describes a job for pipeline. It must have the following format:

{
	"ID": "Scc3-WXS",
	"Description": "Extensive sampling of Saccharomyces cerevisiae in Taiwan reveals ecology and evolution of predomesticated lineages (SRX11913458)",
	"ParentDir": "testoutput",
	"Input": [
		{
			"Files": {
				"R1": "testdata/test_scc3_R1.fastq.gz",
				"R2": "testdata/test_scc3_R2.fastq.gz",
				"Unpaired": null
			},
			"RG": {
				"Sample": "Scc3",
				"Library": "LY0200-I",
				"Platform": "ILLUMINA",
				"Instrument": "IlluminaHiSeq2500",
				"Lane": "1",
				"Barcode": "ACGTAT"
			},
			"Adapter": "illumina"
		}
	],
	"Reference": {
		"GenomeInfo": "testdata/testReference/testReference.info.json",
		"Capture": "testCapture"
	},
	"Config": {
		"Threads": 4,
		"RemoveTempFiles": true
	}
}

Reference with GenomeInfo file and Captures can be prepared with exoclasma-index.

Call

exoclasma-pipe Call -u ${unit_json_file} -d ${dbSNP_vcf_gz}

Here you will need a Unit JSON file which was created with exoclasma-pipe Align. dbSNP will be used for BQSR stage.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

exoclasma_pipe-0.9.1-py3-none-any.whl (22.0 kB view details)

Uploaded Python 3

File details

Details for the file exoclasma_pipe-0.9.1-py3-none-any.whl.

File metadata

  • Download URL: exoclasma_pipe-0.9.1-py3-none-any.whl
  • Upload date:
  • Size: 22.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.6

File hashes

Hashes for exoclasma_pipe-0.9.1-py3-none-any.whl
Algorithm Hash digest
SHA256 2834893267813d1d93a3f4c2d6def03813d09e2716457476ff19e2729679a274
MD5 3c71e4ab36feee24bcfc954e36fe74c4
BLAKE2b-256 852a33efa2094e4e8d324cfc8a9810f7530551a363c10c1f5cab0b226eb4dc3f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page