Skip to main content

find highly divergent DNA and RNA viruses in microbiomes

Project description

https://travis-ci.org/USDA-ARS-GBRU/vica.svg?branch=master https://codecov.io/gh/USDA-ARS-GBRU/vica/branch/master/graph/badge.svg Documentation Status https://api.codacy.com/project/badge/Grade/f39e8359ea334739842bba35e596cfdc

Authors

  • Adam R. Rivers, US Department of Agriculture, Agricultural Research Service

  • Qingpeng Zhang, US Department of Energy, Joint Genome Institute

  • Susannah G. Tringe, US Department of Energy, Joint Genome Institute

Introduction

Vica is designed to identify highly divergent viruses and phage representing new families or orders in assembled metagenomic and metatranscriptomic data. Vica does this by combining information from across the spectrum of composition to homology. The current version of Vica uses three feature sets (5-mers, codon usage in all three frames, and minhash sketches from long kmers (k=24,31). The classifier uses a jointly trained deep neural network and logistic model implemented in Tensorflow. The software is designed to identify both DNA and RNA viruses and phage in metagenomes and metatranscriptomes.

Models

The current leases does not include trained models but we will be adding them in the future to allow for the rapid identification of viruses without model training.

Usage

This package can classify assembled data and train new classification models. Most users will only use the classification functionality in Vica. We will provide trained models for classifying contigs in future releases. classification can be easily invoked with the command:

vica classify -infile contigs.fasta -out classifications.txt -modeldir modeldir

The package also has a suite of tools to prepare data, train and evaluate new classification models. Many of the workflows for doing this can be evoked with the same sub-command interface:

vica split
vica get_features
vica train
vica evaluate

For details see the Tutorial.

Requirements

The package relies on a number of python dependencies that are resolved when the package is installed with PIP.

The non-python dependencies are:

Documentation

Documentation for the package is at http://vica.readthedocs.io/en/latest/

Package availability

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vica-0.1.5.tar.gz (70.4 kB view details)

Uploaded Source

Built Distribution

vica-0.1.5-py3-none-any.whl (66.6 kB view details)

Uploaded Python 3

File details

Details for the file vica-0.1.5.tar.gz.

File metadata

  • Download URL: vica-0.1.5.tar.gz
  • Upload date:
  • Size: 70.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for vica-0.1.5.tar.gz
Algorithm Hash digest
SHA256 011a5b16f97cddd2a926d1baf6ea66d625436bfc31fca9a461390c0c7eb4f25f
MD5 dd3f103cbb076bf04e64a2c356f4009d
BLAKE2b-256 aa844f6404c068355d2e6e88ecad7464ed3a74d100ffc8470730b2e582ceface

See more details on using hashes here.

File details

Details for the file vica-0.1.5-py3-none-any.whl.

File metadata

File hashes

Hashes for vica-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 14721ebdfeff07ee6db967a50b601548e9d7d40430fcf63c7d06cd2bcc11ae9e
MD5 68e16afb4971b2646886d8ba6a1a3530
BLAKE2b-256 d7b21553cbabc0e96d0b93117c5e295b13b389b19f79c5dd3302f78ee3ce9e6e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page