Convert gVCF into BED

Project description

gvcf2bed is a small tool that converts a gVCF file to BED format. It is useful for extracting the regions that meet a minimum genotype quality (GQ) threshold.
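The idea of threshold-based conversion can be illustrated with a minimal pure-Python sketch. This is not gvcf2bed's own code: the function name `gvcf_line_to_bed`, the handling of records without GQ (skipped here), and the treatment of `<NON_REF>` and `<*>` as non-variant alleles are all assumptions made for illustration. The sketch reads one tab-separated gVCF record with a single sample column and returns a 0-based, half-open BED interval when the GQ value meets the applicable minimum.

```python
def gvcf_line_to_bed(line, quality=20, non_variant_quality=20):
    """Return (chrom, start, end) for a passing record, else None.

    Hypothetical helper for illustration; not part of gvcf2bed's API.
    """
    fields = line.rstrip("\n").split("\t")
    chrom, pos, ref, alt, info = (
        fields[0], int(fields[1]), fields[3], fields[4], fields[7]
    )

    # Pair the FORMAT keys with the values of the first sample column.
    sample = dict(zip(fields[8].split(":"), fields[9].split(":")))
    gq = sample.get("GQ", ".")
    if gq == ".":
        return None  # assumption: records without a GQ value are skipped

    # Assumption: a record is a variant if it has a real alternate allele.
    real_alts = [a for a in alt.split(",") if a not in (".", "<NON_REF>", "<*>")]
    threshold = quality if real_alts else non_variant_quality
    if float(gq) < threshold:
        return None

    # BED is 0-based and half-open; VCF POS and END are 1-based and inclusive.
    end = pos - 1 + len(ref)
    for entry in info.split(";"):
        if entry.startswith("END="):
            end = int(entry[4:])
    return chrom, pos - 1, end


block = "chr1\t100\t.\tA\t<NON_REF>\t.\t.\tEND=150\tGT:DP:GQ\t0/0:30:45"
print(gvcf_line_to_bed(block))
```

A reference block spanning positions 100 to 150 becomes the BED interval 99 to 150, since BED starts are 0-based while the end coordinate is exclusive.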

Installation

gvcf2bed is available from PyPI: pip install gvcf2bed

Requirements

  • Python 3.4+

  • pyvcf

  • cyvcf2

For developers

  • pytest

  • pytest-cov

Changelog

0.3.1

  • Fix a bug for variants where GQ is not defined.

0.3

  • Use cyvcf2 by default in the tool. This results in a speed-up of approximately 8-10x. The existing API has not changed and still works with pyvcf.

  • Add separate filter for non-variants, as GQ scores may have a different distribution on non-variant records than on variant records.

Usage

usage: gvcf2bed [-h] -I INPUT -O OUTPUT [-s SAMPLE] [-q QUALITY]
                [-nq NON_VARIANT_QUALITY] [-b]

Create a BED file from a gVCF. Regions are based on a minimum genotype
quality. The gVCF file must contain a GQ field in its FORMAT fields. GQ scores
of non-variants records have a different distribution from the GQ score
distribution of variant records. Hence, an option is provided to set a
different threshold for non-variant positions.

optional arguments:
  -h, --help            show this help message and exit
  -I INPUT, --input INPUT
                        Input gVCF
  -O OUTPUT, --output OUTPUT
                        Output bed file
  -s SAMPLE, --sample SAMPLE
                        Sample name in VCF file to use. Will default to first
                        sample (alphabetically) if not supplied
  -q QUALITY, --quality QUALITY
                        Minimum genotype quality (default 20)
  -nq NON_VARIANT_QUALITY, --non-variant-quality NON_VARIANT_QUALITY
                        Minimum genotype quality for non-variant records
                        (default 20)
  -b, --bedgraph        Output in bedgraph mode
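
The options above can be combined into a command from a Python script. The following is a minimal sketch: the helper `build_gvcf2bed_command` and the file names `sample.g.vcf.gz` and `sample.bed` are hypothetical, and only the flags listed in the help text are used. It assumes gvcf2bed is installed and on the PATH when the command is actually run.

```python
import shlex


def build_gvcf2bed_command(input_path, output_path, sample=None,
                           quality=20, non_variant_quality=20,
                           bedgraph=False):
    """Assemble a gvcf2bed argument list (hypothetical helper)."""
    cmd = [
        "gvcf2bed",
        "-I", input_path,
        "-O", output_path,
        "-q", str(quality),
        "-nq", str(non_variant_quality),
    ]
    if sample is not None:
        cmd += ["-s", sample]  # otherwise the tool picks its default sample
    if bedgraph:
        cmd.append("-b")
    return cmd


# Stricter threshold for variant records than for non-variant records.
cmd = build_gvcf2bed_command("sample.g.vcf.gz", "sample.bed",
                             quality=30, non_variant_quality=20)
print(shlex.join(cmd))  # requires Python 3.8+

# To execute the command:
# import subprocess
# subprocess.run(cmd, check=True)
```

Passing the arguments as a list rather than a single string avoids shell quoting problems with unusual file or sample names.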
