Skip to main content

Description of your package

Project description

gene4mVCF

Introduction

gene4mVCF is a Python package that allows you to extract variant entries for specific genes or a list of genes from a VCF (Variant Call Format) file. It utilizes tools like bcftools, tabix, and Python libraries like pysam, pandas, pybedtools, tqdm, and gffutils to efficiently parse and extract variants.

Installation

You can install gene4mVCF via pip:

$ pip install gene4mVCF

After installation please download the four required bed files and place inside the folder /gene4mVCF
'hg19.ensGene.bed' --> https://hgdownload.soe.ucsc.edu/goldenPath/hg19/bigZips/genes/hg19.ensGene.gtf.gz

'hg19.ncbiRefSeq.bed' --> https://hgdownload.soe.ucsc.edu/goldenPath/hg19/bigZips/genes/hg19.ncbiRefSeq.gtf.gz

'hg38.ensGene.bed' --> https://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/genes/hg38.ensGene.gtf.gz

'hg38.ncbiRefSeq.bed'--> https://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/genes/hg38.ncbiRefSeq.gtf.gz

Usage

usage: $ gene4mVCF [-h] -i INPUT -g GENE

Extract variant entries for a specific gene or list of genes from a VCF file.

optional arguments: -h, --help show this help message and exit -i INPUT, --input INPUT Input bgzip compressed VCF file -g GENES, --genes GENES Gene name, Ensembl gene ID, or path to a gene list file

Examples

Extract variants for a single gene using gene name: $ gene4mVCF -i input.vcf.gz -g EGFR

Extract variants for a single using ensembl gene id: $ gene4mVCF -i input.vcf.gz -g ENSG00000168878

Extract variants for multiple genes listed in a file: $ gene4mVCF -i input.vcf.gz -g genes.txt

For more options and details, refer to the help message.

Support

For any issues or inquiries, please open an issue on the GitHub repository https://github.com/VJ-Ulaganathan/gene4mVCF

Installation

Installation via pip:

$ pip install gene4mVCF

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gene4mvcf-1.1.3.tar.gz (4.0 kB view details)

Uploaded Source

Built Distribution

gene4mVCF-1.1.3-py3-none-any.whl (4.1 kB view details)

Uploaded Python 3

File details

Details for the file gene4mvcf-1.1.3.tar.gz.

File metadata

  • Download URL: gene4mvcf-1.1.3.tar.gz
  • Upload date:
  • Size: 4.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.11.6

File hashes

Hashes for gene4mvcf-1.1.3.tar.gz
Algorithm Hash digest
SHA256 9ab0c82684cb8bfc848dc42fef72b806f02afa9e4259615c8d24c608207fdcbc
MD5 a71948c059a33dd2951a97d5b51dc4b0
BLAKE2b-256 b0e118863109e7b80cc572fde626aff55d8c24b57e75451c0f983d49791a8b93

See more details on using hashes here.

File details

Details for the file gene4mVCF-1.1.3-py3-none-any.whl.

File metadata

  • Download URL: gene4mVCF-1.1.3-py3-none-any.whl
  • Upload date:
  • Size: 4.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.11.6

File hashes

Hashes for gene4mVCF-1.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 57b17f286ab94aed83bacb86252000959b35f33fbeacfa47bebb19bc6a521ede
MD5 2b0cdde488f7d6a01b417e64331edfc0
BLAKE2b-256 4706e3c518ed5c3059b4be56e25efed7cf12fb287910e5a7991c0c885c80f4d2

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page