Skip to main content

VIRA: By-Reference Exon and CDS Viral Genome Annotation

Project description

VIRA: By-Reference Exon and CDS Viral Genome Annotation

PyPI version GitHub Downloads License

Introduction

VIRA is a fully-automated protocol for transfering annotations from reference to target genomes, optimized for viral genomes and primarily developed and tested on HIV and SIV genomes.

The method uses both nucleotide and protein sequence information to search for correct alignments between genomes with high degrees of sequence divergence. VIRA is tailored to take advantage of guide protein annotations to further improve the accuracy of alignments and final annotations.

Publications

Coming soon...

Documentation

Installation

Via PyPI

The easiest way to install VIRA is through PyPI:

$ pip install vira-av
$ vira --help

To uninstall VIRA:

$ pip uninstall vira-av

Building from source

To build from source, clone the git repository:

$ git clone https://github.com/alevar/vira.git --recursive
$ cd vira
$ pip install -r requirements.txt
$ pip install .

Requirements

Requirement Details
Language support Python ≥ 3.6
Dependencies - gffread
- minimap2
- miniprot
- snapper

Getting started

Usage

vira [-h] -a ANNOTATION -g GENOME -t TARGET [-q GUIDE] [-o OUTPUT] [--force-cds] 
     [--gffread GFFREAD] [--minimap2 MINIMAP2] [--miniprot MINIPROT] [--snapper SNAPPER] 
     [--keep-tmp] [--tmp-dir TMP_DIR]

Options

Option Description
-a, --annotation Path to the query GTF/GFF annotation file.
-g, --genome Path to the query genome FASTA file.
-t, --target Path to the target genome FASTA file.
-q, --guide Optional path to the guide annotation file for the target genome. Transcripts and CDS from the guide will be used to validate the annotation.
-o, --output Path to the output GTF file.
--force-cds Force the CDS from the guide onto the transcript chain, even if that means merging adjacent exons together (can fix alignment artifacts such as spurious introns). If the CDS does not fit the transcript chain, the transcript will be skipped.
--gffread Path to the gffread executable.
--minimap2 Path to the minimap2 executable.
--miniprot Path to the miniprot executable. If not set - minimap2 will be used to align nucleotide sequence of the CDS instead.
--snapper Path to the snapper executable.
--keep-tmp Keep temporary files.
--tmp-dir Directory to store temporary files.

Help Options

Option Description
-h, --help Prints help message.

Example Data

Sample datasets are provided in the "example" directory to test and get familiar with VIRA.

The included example can be run with the following command from the root directory of the repository:

vira --annotation example/query.gtf --output example/output.gtf --genome example/query.fasta --target example/target.fasta --guide example/guide.gtf

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vira_av-1.0.0.tar.gz (58.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vira_av-1.0.0-py3-none-any.whl (60.2 kB view details)

Uploaded Python 3

File details

Details for the file vira_av-1.0.0.tar.gz.

File metadata

  • Download URL: vira_av-1.0.0.tar.gz
  • Upload date:
  • Size: 58.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.6

File hashes

Hashes for vira_av-1.0.0.tar.gz
Algorithm Hash digest
SHA256 498fb897ce65f5b66f31a31ae48ed502035ec348511417bca364a807ddd47022
MD5 4b0b83c5a4b845553af71ca59dddda8e
BLAKE2b-256 e7ac72150ab1a4fcfcd9fb308c30d8d43124d7ce89ca5a86a0c397a844e37478

See more details on using hashes here.

File details

Details for the file vira_av-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: vira_av-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 60.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.6

File hashes

Hashes for vira_av-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 850869bf57e6b157f74ba5bd7e8896056c9e02d15a75ed785ffc7d81081806d3
MD5 6ef4460ff052bf92da895cc80ae443a4
BLAKE2b-256 31d0165d4fb72833c30e29e1b785e721a8166867f98ee45d96b71bcd60e72c72

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page