Skip to main content

GVC4BAM is a data processing pipeline developed by Genome Wisdom Inc. GVC4FASTQ detects germline and somatic mutations (SNV, InDel, SV) from BAM files.

Project description

GVC_VCF Pipeline:

GVC_VCF is a pipeline developed at Genowis for germline and somatic mutation(snv, indel, sv) in next generation sequencing data.

Basic Command Line Parameters Options:

Positional Arguments:
	input_json: The json file stores names and paths of both normal and tumor samples. 
			eg: {"N": ["/disk/N.sort.dup.bam"], "T": ["/disk/T.sort.dup.bam"]}
	reference: The reference fasta file.
	outpath: The output folder.
Optional Arguments:
	-h: Print help messages.
	--dbsnp: The Single Nucleotide Polymorphism Database(dbSNP) file has three columns(chr, position, rsID), Values on each line of the file are separated by tab.
	--bed: The WES file need to provide bed region, The bed region has at least three columns(chr,start,end), Values on each line of the file are separated by tab.
	--gvc_lib: The library folder has configuration file.
		The docker volume file needs to be modified. A dictionary to configure volumes mounted inside the container. The key is either the host path or a volume name, and the value is a dictionary with the keys: 
				bind: The path to mount the volume inside the container(the host path needs same with the container path).
				mode: rw to mount the volume read/write.
				eg: {"/disk": {"bind": "/disk","mode": "rw"}}
	--strategy: choose WES or WGS.
	--mutantType: Getting Germline mutation or Somatic mutaion.
	--sample_name: The Sample name.

Note: the pipeline was writed by toil, so when running the programe, you need to provide jobStore(one dir name, eg: first_gvc_run or /home/first_gvc_run). A job store holds persistent information about the jobs and files in a workflow. Eg: python gvc_vcf_pipeline.py first_gvc_run /disk/gvc_vcf_pipeline/bam.json /disk/db/ref/human.fa /disk/gvc_vcf_pipeline/outpath/ --dbsnp /disk/db/dbsnp/dbsnp_frequency --bed /disk/data/no_ref.ccd --gvc_lib /disk/gvc_vcf_pipeline/gvc_lib/ --mutantType Somatic --mutantType Germline --sample_name test_data --strategy WES

demo: 192.168.75.200 /disk/chenfs/gvc_vcf_pipeline/demo, the demo taked 4 minutes. python /disk/chenfs/gvc_vcf_pipeline/src/gvc_vcf_pipeline.py
first_gvc_run
/disk/chenfs/gvc_vcf_pipeline/demo/demo.json
/disk/db/ref/human.fa
/disk/chenfs/gvc_vcf_pipeline/demo/test/
--dbsnp /disk/db/dbsnp/dbsnp_frequency
--bed /disk/chenfs/gvc_vcf_pipeline/demo/demo.bed
--gvc_lib /disk/chenfs/gvc_vcf_pipeline/gvc_lib/
--mutantType Somatic --mutantType Germline --strategy WES

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gvc4bam-1.0.10.tar.gz (20.5 kB view details)

Uploaded Source

Built Distribution

gvc4bam-1.0.10-py2-none-any.whl (28.9 kB view details)

Uploaded Python 2

File details

Details for the file gvc4bam-1.0.10.tar.gz.

File metadata

  • Download URL: gvc4bam-1.0.10.tar.gz
  • Upload date:
  • Size: 20.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/2.7.5

File hashes

Hashes for gvc4bam-1.0.10.tar.gz
Algorithm Hash digest
SHA256 0320d68376a7cc5138ab1a3d7383de1b39be46150e22c98bd410e34f11fc9fca
MD5 4d381a7830bbd382f6ec3aa24102b346
BLAKE2b-256 e20db98c9a8358220212645e6051ed9a2f0f6b3c0ce2950c8ff73fbc7dca0700

See more details on using hashes here.

File details

Details for the file gvc4bam-1.0.10-py2-none-any.whl.

File metadata

  • Download URL: gvc4bam-1.0.10-py2-none-any.whl
  • Upload date:
  • Size: 28.9 kB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/2.7.5

File hashes

Hashes for gvc4bam-1.0.10-py2-none-any.whl
Algorithm Hash digest
SHA256 d6263a876ef4c37141ef7cd3fa4d81d33593bdae6d10aa9d93b2cd773dc11c69
MD5 77254ac605124db6bb8e44d43fae451f
BLAKE2b-256 ca7a205ae2b96e889fdab85c51d769ced84eb54f11a2bbe6a00c472e1066aaa1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page