Skip to main content
Donate to the Python Software Foundation or Purchase a PyCharm License to Benefit the PSF! Donate Now

Convert gVCF into BED

Project description

This is a small tool to convert a gVCF file to BED. This is useful for extracting regions that pass a certain genotype quality threshold.

Installation

gvcf2bed is now available through pypi with: pip install gvcf2bed

Requirements

  • Python 3.4+
  • pyvcf
  • cyvcf2

For developers

  • pytest
  • pytest-cov

Changelog

0.3.1

  • Fix a bug for variants where GQ is not defined.

0.3

  • Use cyvcf2 by default in tool. This results in a speed-up of approximately 8-10x. Existing API has not changed, and will still work with pyvcf.
  • Add separate filter for non-variants, as GQ scores may have a different distribution on non-variant records than on variant records.

Usage

usage: gvcf2bed [-h] -I INPUT -O OUTPUT [-s SAMPLE] [-q QUALITY]
                [-nq NON_VARIANT_QUALITY] [-b]

Create a BED file from a gVCF. Regions are based on a minimum genotype
quality. The gVCF file must contain a GQ field in its FORMAT fields. GQ scores
of non-variants records have a different distribution from the GQ score
distribution of variant records. Hence, an option is provided to set a
different threshold for non-variant positions.

optional arguments:
  -h, --help            show this help message and exit
  -I INPUT, --input INPUT
                        Input gVCF
  -O OUTPUT, --output OUTPUT
                        Output bed file
  -s SAMPLE, --sample SAMPLE
                        Sample name in VCF file to use. Will default to first
                        sample (alphabetically) if not supplied
  -q QUALITY, --quality QUALITY
                        Minimum genotype quality (default 20)
  -nq NON_VARIANT_QUALITY, --non-variant-quality NON_VARIANT_QUALITY
                        Minimum genotype quality for non-variant records
                        (default 20)
  -b, --bedgraph        Output in bedgraph mode

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
gvcf2bed-0.3.1-py3-none-any.whl (5.2 kB) Copy SHA256 hash SHA256 Wheel py3
gvcf2bed-0.3.1.tar.gz (5.2 kB) Copy SHA256 hash SHA256 Source None

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page