Skip to main content

Tools for working with Variant Call Format files.

Project description

https://img.shields.io/pypi/v/vcftoolz.svg http://joss.theoj.org/papers/10.21105/joss.01144/status.svg

Tools for working with Variant Call Format files.

VCF Toolz was developed by the United States Food and Drug Administration, Center for Food Safety and Applied Nutrition.

Features

  • Compares the snps in two or more VCF files.

  • Lists the snps that are unique to each VCF file with full genotype information per snp.

  • Lists the snps that are missing from each VCF file if present in at least two other VCF files.

  • Generates Venn diagrams of positions and snps in the VCF files.

  • Reports precision, recall, and F1 score when the truth is known.

  • Reports the effectiveness of filtered variants when the truth is known.

  • Reformat the VCF file in a tall-narrow format for easy viewing and diffs.

  • Count samples, positions, calls, snps, indels, other variants, missing calls, and filter reasons.

  • Plot calls along the length of the genome and show the location of filtered calls.

Citing VCF Toolz

To cite VCF Toolz, please reference the VCF Toolz paper:

https://doi.org/10.21105/joss.01144

License

See the LICENSE file included in the VCF Toolz distribution.

History

1.2.3 (2023-06-02)

  • Fixed imports in our fork of pyvenn

1.2.2 (2023-06-01)

  • Fixed broken Pyvenn dependency

1.2.1 (2023-05-09)

  • Update PyVCF to PyVCF3

1.2.0 (2019-04-04)

  • Fix defect in narrow command wrongly printing ALT=. when GT=.

  • Add the count command to count samples, positions, calls, snps, indels, other variants, filtered calls, missing calls, and filter reasons.

  • Add the plot command to plot calls along the length of the genome and show the location of filtered calls.

  • Change the text of the compare report to refer to “Calls”, not “Sample snps”.

  • Drop support for Python 3.4, which is not supported by matplotlib.

  • Add support for Python 3.7.

1.1.1 (2019-03-26)

  • Replace None with ‘.’ when printing call data.

  • Support VCF files with multiple alternate alleles per position.

1.1.0 (2019-02-06)

  • Support reading gzip compressed vcf files.

1.0.0 (2018-11-20)

  • First public release.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vcftoolz-1.2.3.tar.gz (37.4 kB view details)

Uploaded Source

Built Distribution

vcftoolz-1.2.3-py2.py3-none-any.whl (25.5 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file vcftoolz-1.2.3.tar.gz.

File metadata

  • Download URL: vcftoolz-1.2.3.tar.gz
  • Upload date:
  • Size: 37.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for vcftoolz-1.2.3.tar.gz
Algorithm Hash digest
SHA256 cb6ebbc2241c15825fbc7db755878d3ccfaffa54b84b6a9b989dbed6a9d89391
MD5 70e0036722d58a342a7ae6dca64e3144
BLAKE2b-256 e9bacea71114855dc18312669f5f6b1dea2282e18db75c32598af8d8cf1c05c1

See more details on using hashes here.

File details

Details for the file vcftoolz-1.2.3-py2.py3-none-any.whl.

File metadata

  • Download URL: vcftoolz-1.2.3-py2.py3-none-any.whl
  • Upload date:
  • Size: 25.5 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for vcftoolz-1.2.3-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 2887a0432912deed55efadb5c7fc489a9bcfc8bf514568c0b26307594e201a4f
MD5 3042283ad0e4a8dce2054d30546eefca
BLAKE2b-256 c4b0e811a74258cb9479a70cd2db51afd701e9e54c73dd04e5043de9ec103ac9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page