Tool for identifying endogenous retrovirus like regions in a set of sequences
Project description
ERVsearch
Full documentation is available via ReadTheDocs.
ERVsearch is a pipeline for identification of endogenous retrovirus like regions in a host genome, based on sequence similarity to known retroviruses.
ERVsearch screens for endogenous retrovirus (ERV) like regions in any FASTA file using the Exonerate algorithm (Slater and Birney, 2005, doi:10.1186/1471-2105-6-31).
- In the Screen section, open reading frames (ORFs) resembling retroviral gag, pol and env genes are identified based on their level of similarity to a database of known complete or partial retroviral ORFs.
- In the Classify section, these ORFs are classified into groups based on a database of currently classified retroviruses and phylogenetic trees are built.
- In the ERVRegions section, regions with ORFs resembling more than one retroviral gene are identified.
This is a updated and expanded version of the pipeline used to identify ERVs in Brown and Tarlinton 2017 (doi: 10.1111/mam.12079), Brown et al. 2014 (doi: 10.1128/JVI.00966-14), Brown et al. 2012 (doi: j.virol.2012.07.010) and Tarlinton et al. 2012 (doi: 10.1016/j.tvjl.2012.08.011). The original version is available here as a Perl pipeline and was written by Dr Richard Emes.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for ervsearch-1.0.11-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 429bc9663908f1c41c4089add11e1d183529c2aa6361bdc61278149ef1013526 |
|
MD5 | 2cfc519e15e9546f20ebed35bf7b027d |
|
BLAKE2b-256 | 671c3ce2e670f1b6fa55e5697c0a5e1d3d5c894a28d808d2e607e88c3a0a23f5 |