Skip to main content

Fast and effective functional annotation of SVs

Project description

FunnSV

Efficient and reliable annotation of Structural Variants (SVs) from a VCF file and a GFF3 annotation source.

Installation

FunnSV can be installed using pip:

pip install funnSV

Requirements

  • Python >= 3.7
  • pysam
  • variant-extractor

Tested on:

  • Python 3.10
  • pysam 0.21.0

Common use cases

Gene only annotation

To annotate gene elements from a GFF3 file in the form of their fields Name and Symbol:

funnSV -i <INPUT_VCF> -g <INPUT_GFF3> -r <REFERENCE_GENOME> -o <OUTPUT_VCF> -f ID,Name

Gene, Transcript, and transcript element annotations

To annotate all possible common elements in a GFF3 file, including all fields:

funnSV -i <INPUT_VCF> -g <INPUT_GFF3> -r <REFERENCE_GENOME> -o <OUTPUT_VCF> -m balanced

All types annotations

This mode is not recommended but can be useful if there are pseudogenes or other uncommon types, but it is prone to errors due to inconsistencies in GFF3 files:

funnSV -i <INPUT_VCF> -g <INPUT_GFF3> -r <REFERENCE_GENOME> -o <OUTPUT_VCF> -m complete -f ID,Name

Specific use case: Protein coding genes

This mode allows to only annotate protein coding genes, if gene records contain the biotype=protein_coding field in the attributes column (This has been tested using the Human GRCh37 Release 87 GFF3):

funnSV -i <INPUT_VCF> -g <INPUT_GFF3> -r <REFERENCE_GENOME> -o <OUTPUT_VCF> -m minimal_pc -f ID,Name

Finally, this software was implemented by using the GFF3 specification from https://github.com/The-Sequence-Ontology/Specifications/blob/master/gff3.md. Possible errors or bugs coming from deviations in the format are to be expected. Please refer any such occurrances to the GitHub issues section. We are open to support other annotation source formats in the future.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

funnsv-0.0.2.tar.gz (10.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

funnsv-0.0.2-py3-none-any.whl (12.4 kB view details)

Uploaded Python 3

File details

Details for the file funnsv-0.0.2.tar.gz.

File metadata

  • Download URL: funnsv-0.0.2.tar.gz
  • Upload date:
  • Size: 10.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.0 CPython/3.10.12 Linux/6.2.0-39-generic

File hashes

Hashes for funnsv-0.0.2.tar.gz
Algorithm Hash digest
SHA256 7eaca0f1c0257380bb0be47fa71a8fadd1b0452db5211fedd7e8181abbb375c6
MD5 b44e7df29ce2d10efb38854fc042ae4d
BLAKE2b-256 696a1085666784d2a548451861b8f87c690c23b94b68d974aa853fcf467b8cd2

See more details on using hashes here.

File details

Details for the file funnsv-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: funnsv-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 12.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.0 CPython/3.10.12 Linux/6.2.0-39-generic

File hashes

Hashes for funnsv-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 a86e018f488e9e40de16fc5e333fae44fe06fab228a80394a272fa172a9fb313
MD5 10263366b88f3a0bf18d33ce4d7eaec9
BLAKE2b-256 bd43f393b8ee93875d42310e6ec2ebeb8db5d4270a2b0c0d4e3dbf788f91be2f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page