viralVerify rewrite/refactor for PyPI packaging and distribution

viral_verify

A rewrite and refactor of viralVerify for PyPI packaging and distribution, maintainability, and clarity.

NOTE: The BLAST+ search option has been removed. The results output table differs from that of the original viralVerify. The Naive Bayes classifier training script has not been ported yet.

Features

  • Gene prediction with Prodigal in metagenomic mode

  • HMMer3 hmmsearch for protein domains in predicted genes

  • Naive Bayes classification of contigs as viral/not viral based on HMMer3 results

  • Output of detailed contig classification results table in CSV format

  • Output of contigs based on classification into separate FASTA files
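
The classification step can be illustrated with a minimal sketch. It assumes a summed natural-log likelihood ratio over the protein domains found on a contig, with scores inside the uncertainty threshold left unclassified. The function names, the two-class layout and the frequency values are hypothetical and are not taken from the packaged classifier_table.txt or from the viral_verify source.

```python
import math

# Hypothetical per-domain frequencies for illustration only;
# these are NOT the values shipped in classifier_table.txt.
DOMAIN_FREQS = {
    "Phage_capsid": {"viral": 0.90, "nonviral": 0.10},
    "Ribosomal_S2": {"viral": 0.05, "nonviral": 0.95},
}


def log_odds(domains, freqs=DOMAIN_FREQS):
    """Sum ln(P(domain | viral) / P(domain | nonviral)) over known domains."""
    total = 0.0
    for domain in domains:
        if domain in freqs:  # domains missing from the table are ignored
            f = freqs[domain]
            total += math.log(f["viral"]) - math.log(f["nonviral"])
    return total


def classify(domains, uncertainty_threshold=3.0):
    """Label a contig from its domain hits (assumed decision rule)."""
    score = log_odds(domains)
    if score > uncertainty_threshold:
        return "viral"
    if score < -uncertainty_threshold:
        return "nonviral"
    return "uncertain"
```

Under these assumed frequencies a single capsid hit is not enough to cross the default threshold of 3.0, while two such hits are. This is the kind of behaviour the --uncertainty-threshold option is meant to control.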

Requirements

An HMMer3 HMM database is required, for example the latest version of the Pfam-A HMM database.

NOTE: Extract any compressed HMM database before use (e.g. $ gunzip Pfam-A.hmm.gz).
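
Where gunzip is not available, the same extraction can be sketched with the Python standard library. The helper name gunzip is hypothetical and the file name is only an example. Unlike the command-line tool, this version keeps the compressed file in place.

```python
import gzip
import shutil
from pathlib import Path


def gunzip(src, dest=None):
    """Decompress a .gz file and return the path of the extracted file."""
    src = Path(src)
    # Default destination drops the trailing ".gz" (Pfam-A.hmm.gz -> Pfam-A.hmm)
    dest = Path(dest) if dest else src.with_suffix("")
    with gzip.open(src, "rb") as fin, open(dest, "wb") as fout:
        shutil.copyfileobj(fin, fout)  # streams in chunks, so large DBs are fine
    return dest
```

Calling gunzip("Pfam-A.hmm.gz") would write Pfam-A.hmm next to the archive, and that path can then be passed to --hmm-db.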

Software dependencies:

  • Prodigal

  • HMMer3 (hmmsearch)

Python dependencies:

Installation

Conda

It’s recommended that you use Conda to install the required software (Prodigal and HMMer3) and Python dependencies.

$ conda env create -f environment.yml

Pip

If you have Prodigal and HMMer3 installed in your $PATH, and Python 3.6 or greater, you can use pip to install viral_verify:

$ pip install viral_verify
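
Before a pip-based install it can help to confirm that the prerequisites above are met. The following is a small sketch with hypothetical helper names. It assumes the executables are named prodigal and hmmsearch, which are the usual names for those tools.

```python
import shutil
import sys

# Executables expected on $PATH for a pip-based install
REQUIRED_TOOLS = ("prodigal", "hmmsearch")


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the names of the tools that cannot be found on $PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


def python_ok(min_version=(3, 6)):
    """True if the running interpreter is at least min_version."""
    return sys.version_info >= min_version
```

An empty list from missing_tools() together with a True result from python_ok() suggests the environment is ready for pip install viral_verify.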

Usage

$ viral_verify --help
Usage: viral_verify [OPTIONS]

  HMM and Naive Bayes classification of contig sequences as either viral,
  plasmid or chromosomal.

  Requires Prodigal for gene prediction and hmmsearch from HMMer3 for
  searching for Pfam HMM profiles.

Options:
  -i, --input-fasta PATH          Input fasta file  [required]
  -o, --outdir PATH               Output directory  [required]
  -H, --hmm-db PATH               Path to Pfam-A HMM database  [required]
  -t, --threads INTEGER           Number of threads (default=16)
  -p, --output-plasmids-separately
                                  Output predicted plasmids separately?
  --prefix TEXT                   Output file prefix (default: None)
  --uncertainty-threshold FLOAT   Uncertainty threshold (Natural log
                                  probability) (default=3.0)

  --naive-bayes-classifier-table PATH
                                  Table of protein domain frequencies to use
                                  for Naive Bayes classification (default="/ho
                                  me/pkruczkiewicz/repos/viral_verify/viral_ve
                                  rify/data/classifier_table.txt")

  -v, --verbose                   Logging verbosity
  --version                       Show the version and exit.
  --help                          Show this message and exit.
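
For scripted runs, the options listed above can be assembled into a command line from Python. This is a sketch only: build_command is a hypothetical helper, the file names in the usage note are placeholders, and only options shown in the help text are used.

```python
def build_command(input_fasta, outdir, hmm_db, threads=16,
                  prefix=None, plasmids_separately=False):
    """Build a viral_verify argument list from the documented options."""
    cmd = [
        "viral_verify",
        "--input-fasta", str(input_fasta),
        "--outdir", str(outdir),
        "--hmm-db", str(hmm_db),
        "--threads", str(threads),
    ]
    if plasmids_separately:
        # Flag option: present or absent, takes no value
        cmd.append("--output-plasmids-separately")
    if prefix is not None:
        cmd += ["--prefix", prefix]
    return cmd
```

The resulting list can be passed to subprocess.run(cmd, check=True), for example with build_command("contigs.fasta", "results", "Pfam-A.hmm", threads=4). Passing a list rather than a shell string avoids quoting problems with paths that contain spaces.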

Credits

The original source code, design and conception can be found at viralVerify. This is merely a rewrite for easier packaging via PyPI, adding some CI with Travis-CI and organizing the code for maintainability and clarity.

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

History

0.1.1 (2020-06-04)

  • Fix PyPI release (include classifier_table.txt in package)

0.1.0 (2020-06-03)

  • First release on PyPI.
