Skip to main content

Convert gVCF into BED

Project description

This is a small tool to convert a gVCF file to BED. This is useful for extracting regions that pass a certain genotype quality threshold.

Installation

gvcf2bed is now available through pypi with: pip install gvcf2bed

Requirements

  • Python 3.4+
  • pyvcf
  • cyvcf2

For developers

  • pytest
  • pytest-cov

Changelog

0.3.1

  • Fix a bug for variants where GQ is not defined.

0.3

  • Use cyvcf2 by default in tool. This results in a speed-up of approximately 8-10x. Existing API has not changed, and will still work with pyvcf.
  • Add separate filter for non-variants, as GQ scores may have a different distribution on non-variant records than on variant records.

Usage

usage: gvcf2bed [-h] -I INPUT -O OUTPUT [-s SAMPLE] [-q QUALITY]
                [-nq NON_VARIANT_QUALITY] [-b]

Create a BED file from a gVCF. Regions are based on a minimum genotype
quality. The gVCF file must contain a GQ field in its FORMAT fields. GQ scores
of non-variants records have a different distribution from the GQ score
distribution of variant records. Hence, an option is provided to set a
different threshold for non-variant positions.

optional arguments:
  -h, --help            show this help message and exit
  -I INPUT, --input INPUT
                        Input gVCF
  -O OUTPUT, --output OUTPUT
                        Output bed file
  -s SAMPLE, --sample SAMPLE
                        Sample name in VCF file to use. Will default to first
                        sample (alphabetically) if not supplied
  -q QUALITY, --quality QUALITY
                        Minimum genotype quality (default 20)
  -nq NON_VARIANT_QUALITY, --non-variant-quality NON_VARIANT_QUALITY
                        Minimum genotype quality for non-variant records
                        (default 20)
  -b, --bedgraph        Output in bedgraph mode

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for gvcf2bed, version 0.3.1
Filename, size File type Python version Upload date Hashes
Filename, size gvcf2bed-0.3.1-py3-none-any.whl (5.2 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size gvcf2bed-0.3.1.tar.gz (5.2 kB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page