GVC4BAM is a data processing pipeline developed by Genome Wisdom Inc. It detects germline and somatic mutations (SNV, InDel, SV) from BAM files.
Project description
GVC_VCF Pipeline:
GVC_VCF is a pipeline developed at Genowis for detecting germline and somatic mutations (SNV, InDel, SV) in next-generation sequencing data.
Basic Command-Line Options:
Positional Arguments:
input_json: A JSON file that stores the names and paths of the normal (N) and tumor (T) samples.
e.g. {"N": ["/disk/N.sort.dup.bam"], "T": ["/disk/T.sort.dup.bam"]}
reference: The reference fasta file.
outpath: The output folder.
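As an illustration, the input_json content could be produced with a few lines of Python. This is a minimal sketch and not part of GVC4BAM: the helper name build_input_json is hypothetical, and the BAM paths are simply the ones from the example above.

```python
import json


def build_input_json(normal_bams, tumor_bams):
    """Return the sample mapping as JSON text in the {"N": [...], "T": [...]} layout."""
    samples = {"N": list(normal_bams), "T": list(tumor_bams)}
    # sort_keys keeps the key order stable ("N" before "T").
    return json.dumps(samples, sort_keys=True)


# Example paths taken from the input_json description; replace with your own BAM files.
text = build_input_json(["/disk/N.sort.dup.bam"], ["/disk/T.sort.dup.bam"])
print(text)
```

The printed text can be saved to a file and passed as the input_json argument.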
Optional Arguments:
-h: Print help messages.
--dbsnp: The Single Nucleotide Polymorphism Database (dbSNP) file. It has three columns (chr, position, rsID), and the values on each line are separated by tabs.
--bed: The BED region file, which must be provided for WES data. It has at least three columns (chr, start, end), and the values on each line are separated by tabs.
--gvc_lib: The library folder containing the configuration file.
The Docker volume file must be edited to match your system. It is a dictionary that configures the volumes mounted inside the container. Each key is either a host path or a volume name, and each value is a dictionary with the keys:
bind: The path at which to mount the volume inside the container (the host path must be the same as the container path).
mode: Set to rw to mount the volume read/write.
e.g. {"/disk": {"bind": "/disk", "mode": "rw"}}
--strategy: The sequencing strategy, WES or WGS.
--mutantType: The mutation type to detect, Germline or Somatic.
--sample_name: The sample name.
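The file-format requirements listed above (tab-separated dbSNP and BED files, and a volume entry whose bind path equals the host path) can be checked before a run. The following is a minimal sketch under stated assumptions: the function names check_tab_separated and mismatched_binds are hypothetical and not part of GVC4BAM, and the volume check assumes every key is a host path rather than a volume name.

```python
def check_tab_separated(lines, min_columns, max_columns=None):
    """Return (line_number, text) pairs whose tab-separated column count is out of range."""
    bad = []
    for number, line in enumerate(lines, start=1):
        text = line.rstrip("\r\n")
        if not text:
            continue  # skip blank lines
        count = len(text.split("\t"))
        if count < min_columns or (max_columns is not None and count > max_columns):
            bad.append((number, text))
    return bad


def mismatched_binds(volumes):
    """Return host paths whose "bind" value differs from the host path itself."""
    return [host for host, spec in volumes.items() if spec.get("bind") != host]


# BED: at least three columns. The second line uses spaces, so it is reported.
bed_lines = ["chr1\t100\t200\n", "chr1 300 400\n"]
print(check_tab_separated(bed_lines, 3))

# dbSNP: three columns (chr, position, rsID); "rs0000" is a made-up placeholder ID.
print(check_tab_separated(["1\t100\trs0000\n"], 3, 3))

# Volume entry copied from the example above.
print(mismatched_binds({"/disk": {"bind": "/disk", "mode": "rw"}}))
```

To check real files, pass an open file object as the lines argument, since iterating over a text file yields its lines.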
Note: The pipeline is written with Toil, so you must provide a jobStore when running the program (a directory name, e.g. first_gvc_run or /home/first_gvc_run). A job store holds persistent information about the jobs and files in a workflow.
e.g. python gvc_vcf_pipeline.py first_gvc_run /disk/gvc_vcf_pipeline/bam.json /disk/db/ref/human.fa /disk/gvc_vcf_pipeline/outpath/ --dbsnp /disk/db/dbsnp/dbsnp_frequency --bed /disk/data/no_ref.ccd --gvc_lib /disk/gvc_vcf_pipeline/gvc_lib/ --mutantType Somatic --mutantType Germline --sample_name test_data --strategy WES
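When the pipeline is launched from another script, the command line above can be assembled as an argument list. This is a rough sketch rather than an official wrapper: the function build_command and its parameter names are hypothetical, and only the positional arguments and options documented above are used, in the same order as the example.

```python
def build_command(job_store, input_json, reference, outpath,
                  dbsnp=None, bed=None, gvc_lib=None, mutant_types=(),
                  sample_name=None, strategy=None,
                  script="gvc_vcf_pipeline.py"):
    """Assemble the argument list for one pipeline run (jobStore comes first)."""
    if strategy == "WES" and bed is None:
        # The option list states that WES data needs a BED region file.
        raise ValueError("--bed is required when --strategy is WES")
    cmd = ["python", script, job_store, input_json, reference, outpath]
    for flag, value in (("--dbsnp", dbsnp), ("--bed", bed), ("--gvc_lib", gvc_lib)):
        if value is not None:
            cmd += [flag, value]
    for mutant_type in mutant_types:
        # --mutantType is repeated once per type, as in the example command.
        cmd += ["--mutantType", mutant_type]
    for flag, value in (("--sample_name", sample_name), ("--strategy", strategy)):
        if value is not None:
            cmd += [flag, value]
    return cmd


cmd = build_command(
    "first_gvc_run",
    "/disk/gvc_vcf_pipeline/bam.json",
    "/disk/db/ref/human.fa",
    "/disk/gvc_vcf_pipeline/outpath/",
    dbsnp="/disk/db/dbsnp/dbsnp_frequency",
    bed="/disk/data/no_ref.ccd",
    gvc_lib="/disk/gvc_vcf_pipeline/gvc_lib/",
    mutant_types=("Somatic", "Germline"),
    sample_name="test_data",
    strategy="WES",
)
print(" ".join(cmd))
```

The resulting list can be handed to subprocess.check_call(cmd) to start the run; passing a list avoids shell quoting problems with paths.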
Demo: on host 192.168.75.200, in /disk/chenfs/gvc_vcf_pipeline/demo. The demo took 4 minutes to run.
python /disk/chenfs/gvc_vcf_pipeline/src/gvc_vcf_pipeline.py \
    first_gvc_run \
    /disk/chenfs/gvc_vcf_pipeline/demo/demo.json \
    /disk/db/ref/human.fa \
    /disk/chenfs/gvc_vcf_pipeline/demo/test/ \
    --dbsnp /disk/db/dbsnp/dbsnp_frequency \
    --bed /disk/chenfs/gvc_vcf_pipeline/demo/demo.bed \
    --gvc_lib /disk/chenfs/gvc_vcf_pipeline/gvc_lib/ \
    --mutantType Somatic --mutantType Germline --strategy WES
Source Distribution
Built Distribution
File details
Details for the file gvc4bam-1.0.10.tar.gz.
File metadata
- Download URL: gvc4bam-1.0.10.tar.gz
- Upload date:
- Size: 20.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/2.7.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | 0320d68376a7cc5138ab1a3d7383de1b39be46150e22c98bd410e34f11fc9fca
MD5 | 4d381a7830bbd382f6ec3aa24102b346
BLAKE2b-256 | e20db98c9a8358220212645e6051ed9a2f0f6b3c0ce2950c8ff73fbc7dca0700
File details
Details for the file gvc4bam-1.0.10-py2-none-any.whl.
File metadata
- Download URL: gvc4bam-1.0.10-py2-none-any.whl
- Upload date:
- Size: 28.9 kB
- Tags: Python 2
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/2.7.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | d6263a876ef4c37141ef7cd3fa4d81d33593bdae6d10aa9d93b2cd773dc11c69
MD5 | 77254ac605124db6bb8e44d43fae451f
BLAKE2b-256 | ca7a205ae2b96e889fdab85c51d769ced84eb54f11a2bbe6a00c472e1066aaa1