GVC4BAM is a data processing pipeline developed by Genome Wisdom Inc. It detects germline and somatic mutations (SNV, InDel, SV) from BAM files.

Project description

GVC_VCF Pipeline:

GVC_VCF is a pipeline developed at Genowis for detecting germline and somatic mutations (SNV, InDel, SV) in next-generation sequencing data.

Basic Command-Line Options:

Positional Arguments:
	input_json: A JSON file that stores the names and paths of the normal (N) and tumor (T) sample BAM files.
			eg: {"N": ["/disk/N.sort.dup.bam"], "T": ["/disk/T.sort.dup.bam"]}
	reference: The reference fasta file.
	outpath: The output folder.
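
A minimal sketch of producing the input_json content shown above. The helper name build_input_json is hypothetical and not part of the pipeline; it only assumes the "N"/"T" layout from the example.

```python
import json


def build_input_json(normal_bams, tumor_bams):
    """Return JSON text mapping "N" and "T" to lists of BAM paths.

    Hypothetical helper: it only mirrors the layout of the example above.
    """
    return json.dumps({"N": list(normal_bams), "T": list(tumor_bams)},
                      sort_keys=True)


text = build_input_json(["/disk/N.sort.dup.bam"], ["/disk/T.sort.dup.bam"])
print(text)
```

Write the returned text to a file (for example bam.json) and pass that file as the input_json argument.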
Optional Arguments:
	-h: Print help messages.
	--dbsnp: The Single Nucleotide Polymorphism Database (dbSNP) file. It has three tab-separated columns (chr, position, rsID).
	--bed: The BED file of target regions, required for WES data. It has at least three tab-separated columns (chr, start, end).
	--gvc_lib: The library folder, which contains the configuration file.
		The Docker volume file needs to be modified. It is a dictionary that configures the volumes mounted inside the container. Each key is either a host path or a volume name, and each value is a dictionary with the keys:
				bind: The path at which to mount the volume inside the container (the host path must be the same as the container path).
				mode: rw to mount the volume read/write.
				eg: {"/disk": {"bind": "/disk","mode": "rw"}}
	--strategy: The sequencing strategy, WES or WGS.
	--mutantType: The mutation type to call, Germline or Somatic.
	--sample_name: The sample name.
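
The volume dictionary described under --gvc_lib can be sketched as follows. Both function names are hypothetical, and the source does not say which file inside the library folder holds this dictionary; the sketch only builds the structure and checks the rule that the bind path equals the host path.

```python
import json


def make_volume_config(host_paths, mode="rw"):
    """Map each host path to an identical path inside the container."""
    return {path: {"bind": path, "mode": mode} for path in host_paths}


def mismatched_binds(volumes):
    """Return the keys whose bind path differs from the host path."""
    return [host for host, spec in volumes.items()
            if spec.get("bind") != host]


volumes = make_volume_config(["/disk"])
print(json.dumps(volumes, sort_keys=True))
print(mismatched_binds(volumes))
```

An empty list from mismatched_binds means every volume is mounted at the same path on the host and in the container.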

Note: The pipeline is written with Toil, so when running the program you need to provide a jobStore as the first argument (a directory name, eg: first_gvc_run or /home/first_gvc_run). A job store holds persistent information about the jobs and files in a workflow.

Eg:
	python gvc_vcf_pipeline.py first_gvc_run \
		/disk/gvc_vcf_pipeline/bam.json \
		/disk/db/ref/human.fa \
		/disk/gvc_vcf_pipeline/outpath/ \
		--dbsnp /disk/db/dbsnp/dbsnp_frequency \
		--bed /disk/data/no_ref.ccd \
		--gvc_lib /disk/gvc_vcf_pipeline/gvc_lib/ \
		--mutantType Somatic --mutantType Germline \
		--sample_name test_data --strategy WES
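
For scripted runs, the same argument list can be assembled in Python. This is a sketch only: build_command is a hypothetical helper, it places the options in a different order than the example above (which assumes the script accepts options in any order), and it repeats --mutantType once per requested type as the example does.

```python
def build_command(job_store, input_json, reference, outpath,
                  dbsnp=None, bed=None, gvc_lib=None, sample_name=None,
                  strategy=None, mutant_types=(),
                  script="gvc_vcf_pipeline.py"):
    """Return the pipeline invocation as a list of arguments."""
    # jobStore comes first, followed by the three positional arguments.
    cmd = ["python", script, job_store, input_json, reference, outpath]
    options = (("--dbsnp", dbsnp), ("--bed", bed), ("--gvc_lib", gvc_lib),
               ("--sample_name", sample_name), ("--strategy", strategy))
    for flag, value in options:
        if value is not None:
            cmd += [flag, value]
    # One --mutantType flag per requested type, as in the example command.
    for mutant_type in mutant_types:
        cmd += ["--mutantType", mutant_type]
    return cmd


cmd = build_command(
    "first_gvc_run",
    "/disk/gvc_vcf_pipeline/bam.json",
    "/disk/db/ref/human.fa",
    "/disk/gvc_vcf_pipeline/outpath/",
    dbsnp="/disk/db/dbsnp/dbsnp_frequency",
    bed="/disk/data/no_ref.ccd",
    gvc_lib="/disk/gvc_vcf_pipeline/gvc_lib/",
    sample_name="test_data",
    strategy="WES",
    mutant_types=("Somatic", "Germline"),
)
print(" ".join(cmd))
```

The resulting list can be passed to subprocess.call(cmd) to start the run.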

Demo: The demo data is located at /disk/chenfs/gvc_vcf_pipeline/demo on 192.168.75.200. The demo took 4 minutes to run.
	python /disk/chenfs/gvc_vcf_pipeline/src/gvc_vcf_pipeline.py \
		first_gvc_run \
		/disk/chenfs/gvc_vcf_pipeline/demo/demo.json \
		/disk/db/ref/human.fa \
		/disk/chenfs/gvc_vcf_pipeline/demo/test/ \
		--dbsnp /disk/db/dbsnp/dbsnp_frequency \
		--bed /disk/chenfs/gvc_vcf_pipeline/demo/demo.bed \
		--gvc_lib /disk/chenfs/gvc_vcf_pipeline/gvc_lib/ \
		--mutantType Somatic --mutantType Germline --strategy WES

Download files

Source Distribution

gvc4bam-1.0.10.tar.gz (20.5 kB)

Uploaded Source

Built Distribution

gvc4bam-1.0.10-py2-none-any.whl (28.9 kB)

Uploaded Python 2
