A pipeline for analyzing fungal genomic data

Project description

vbtools: a variant benchmark tool to compare VCFs with a consensus set

Install

Before installing, make sure conda is on your PATH. If conda is not installed yet, follow the Conda installation instructions first.

# clone this repo
git clone git@github.com:broadinstitute/vbtools.git

# setup conda environment
cd vbtools

conda env create -f env.yml  # this will take a few minutes
conda env list  # verify the new environment was created correctly

# activate environment
conda activate vbtools

# deactivate the environment when done
conda deactivate

# completely remove the virtual environment
conda remove --name vbtools --all

Usage

Use the following command to benchmark a VCF against a reference/consensus VCF:

vcfbench.py -v <input.vcf> -b <reference.vcf>

Use the --prefix option to set a prefix for the output file names.

Currently, only haploid VCFs are supported in the analysis; a diploid VCF is converted to haploid before comparison. The input VCF should follow the VCF specification v4.2.

The following pre-processing steps are performed on the input VCF before the analysis:

  1. remove unused alleles
  2. remove monomorphic sites
  3. remove sites with heterozygous genotypes
  4. remove non-SNP sites
  5. remove sites with asterisk marks
  6. change diploid to haploid VCF
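
As an illustration only, the six steps above could be sketched for a single VCF data line as follows. This is not the code used by vcfbench.py: the function name preprocess_record is hypothetical, and the sketch assumes that the line has genotype columns, that a site is monomorphic when no sample carries an ALT allele, that half-missing calls such as ./1 count as heterozygous, and that only the GT field is kept in the output (INFO values such as AC or AF are left untouched and may be stale).

```python
from typing import Optional

BASES = {"A", "C", "G", "T"}


def preprocess_record(line: str) -> Optional[str]:
    """Return a haploid, SNP-only version of one VCF data line, or None if the site is dropped."""
    fields = line.rstrip("\n").split("\t")
    ref, alts = fields[3], fields[4].split(",")

    # Step 6: collapse each genotype to a single allele index.
    calls = []
    for sample in fields[9:]:
        gt = sample.split(":")[0]  # GT is the first FORMAT key per the VCF spec
        alleles = set(gt.replace("|", "/").split("/"))
        if len(alleles) > 1:  # step 3: heterozygous (or half-missing) call
            return None
        calls.append(alleles.pop())

    # Step 1: keep only ALT alleles that some sample actually carries.
    used = sorted({int(c) for c in calls if c not in (".", "0")})
    if not used:  # step 2: no ALT allele left, so the site is monomorphic
        return None
    kept_alts = [alts[i - 1] for i in used]

    # Steps 4 and 5: require single-base REF and ALT, which also rejects "*".
    if ref not in BASES or any(a not in BASES for a in kept_alts):
        return None

    # Renumber genotypes against the trimmed ALT list.
    remap = {".": ".", "0": "0"}
    remap.update({str(old): str(new) for new, old in enumerate(used, start=1)})
    fields[4] = ",".join(kept_alts)
    fields[8] = "GT"  # other FORMAT keys are dropped in this sketch
    fields[9:] = [remap[c] for c in calls]
    return "\t".join(fields)


# Example: the unused ALT allele G is trimmed and both samples become haploid.
example = "chr1\t100\t.\tA\tG,T\t50\tPASS\t.\tGT:DP\t0/0:10\t2/2:8"
print(preprocess_record(example))
```

The sketch checks heterozygosity first because the later steps need one allele per sample; the end result for a kept site is the same as applying the steps in the listed order.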

The script will output:

  1. Site-level comparison:
    • a TSV file with the numbers of unique and shared sites.
  2. Sample-level comparison:
    • Sample-level comparison will be added to the script soon.
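
To make the site-level comparison concrete, here is a minimal sketch of counting shared and unique sites between two sets of VCF lines. It is not taken from vcfbench.py: the function names, the output column names, and the choice to identify a site by (CHROM, POS, REF, ALT) are all assumptions made for illustration, and the actual tool may key sites differently.

```python
import csv
import sys
from typing import Dict, Iterable, Set, Tuple

Site = Tuple[str, int, str, str]  # (CHROM, POS, REF, ALT)


def read_sites(lines: Iterable[str]) -> Set[Site]:
    """Collect site keys from VCF lines, skipping header and blank lines."""
    sites = set()
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue
        chrom, pos, _id, ref, alt = line.rstrip("\n").split("\t")[:5]
        sites.add((chrom, int(pos), ref, alt))
    return sites


def compare_sites(query: Set[Site], baseline: Set[Site]) -> Dict[str, int]:
    """Count sites shared by both call sets and sites unique to each."""
    return {
        "shared": len(query & baseline),
        "unique_to_input": len(query - baseline),
        "unique_to_reference": len(baseline - query),
    }


def write_summary(counts: Dict[str, int], handle) -> None:
    """Write the counts as a two-column TSV (hypothetical layout)."""
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    writer.writerow(["category", "n_sites"])
    writer.writerows(counts.items())


# Example with in-memory lines; real use would pass open file handles.
query_lines = ["#CHROM\tPOS\tID\tREF\tALT\n", "chr1\t10\t.\tA\tG\n", "chr1\t20\t.\tC\tT\n"]
baseline_lines = ["chr1\t20\t.\tC\tT\n", "chr1\t30\t.\tG\tA\n"]
write_summary(compare_sites(read_sites(query_lines), read_sites(baseline_lines)), sys.stdout)
```

Using sets keeps the comparison independent of record order, so the two VCFs do not need to be sorted the same way.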

Download files

Source Distribution

vbtools-0.1.0.tar.gz (5.1 kB)

Uploaded Source

Built Distribution

vbtools-0.1.0-py3-none-any.whl (7.8 kB)

Uploaded Python 3

File details

Details for the file vbtools-0.1.0.tar.gz.

File metadata

  • Download URL: vbtools-0.1.0.tar.gz
  • Upload date:
  • Size: 5.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.7.3

File hashes

Hashes for vbtools-0.1.0.tar.gz

  • SHA256: 6efcd364cf07c0e6d5ce9979c03d71b3a1f8254fdda38dcff5ce327a67b9dc76
  • MD5: 80fb9e1892dd10196d62dc28285c088c
  • BLAKE2b-256: 916360807d208790a81613d1cb12e995822e9866b9574de346bb171809405d75

File details

Details for the file vbtools-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: vbtools-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 7.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.7.3

File hashes

Hashes for vbtools-0.1.0-py3-none-any.whl

  • SHA256: 105e1c88db2dbab344a1fed078e7f715eda9ae9a26b54a48dfbf88284dfab69b
  • MD5: da219a04d9ff72e526a21f1004624d31
  • BLAKE2b-256: d3b234d2693223978c037b912111c0a670835c12b99e4877831b1f13bacbbec6
