Skip to main content

An efficient pipeline for identification and annotation of endogenous retroviruses (ERVs)

Project description

ERVdetective: an efficient pipeline for identification and annotation of endogenous retroviruses (ERVs)

1. Download and install

ervdetective is a command-line-interface program developed based on Python 3, and you can download and install the ervdetective in a variety of ways.

1.1. pip method

ervdetective has been distributed to the standard library of PyPI (https://pypi.org/project/ervdetective/), and can be easily installed by the tool pip.

Firstly, download Python3 (https://www.python.org/), and install Python3 and pip tool, then,

pip install ervdetective
ervdetective -h

1.2. Or local installation

In addition, ervdetective can also be installed manually using the file using the file setup.py.

Firstly, download this repository, then, run:

python setup.py install
ervdetective -h

1.3. Or run the source code directly

you can also directly run the source code of ervdetective without installation. First, download this repository, then, install the required python environment of ervdetective:

pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple

finally, run ervdetective using the file main.py. Please view the help documentation by python main.py -h.

2. Software dependencies

The running of ervdetective relies on these softwares:

  • blast (version >=2.9.0+), has to contain makeblastdb and tblastn.

  • genometools (version >=1.6.1), has to contain ltrharvest.

  • hmmer (version >=3.0), has to contain hmmpress and hmmscan.

Note, these dependencies need to be installed and added to environment variables of system (or user) beforehand, because ervdetective call them from the environment variables directly.

3. Getting help

The help documentation can be get by entering ervdetective -h or ervdetective --help.

Parameter Description
-h, --help show this help message and exit
-i HOST The file-path of host genome sequence, the suffix is generally *.fna, *.fas, *.fasta.
-eb EBLAST Specify threshold of e-value for BLAST search, default: 1e-5.
-f FLANK The length of extended flank sequence on either side of the blast hit-site, default: 15000.
-l1 MINLTR Specify minimum length of LTR, default: 100.
-l2 MAXLTR Specify maximum length of LTR, default: 1000.
-s LTRSIMILAR Specify threshold(%) of the similarity of paired LTRs, default: 80.
-d1 MINDISTLTR The minimum interval of paired-LTRs start-positions, default: 1000.
-d2 MAXDISTLTR The maximum interval of paired-LTRs start-positions, default: 15000.
-t1 MINTSD The minimum length for each TSD site, default: 4.
-t2 MAXTSD The maximum length for each TSD site, default: 6.
-motif MOTIF Specify start-motif (2 nucleotides) and end-motif (2 nucleotides), default string: TGCA.
-mis MISMOTIF The maximum number of mismatches nucleotides in motif, default: 1.
-ed EHMMER The threshold of e-value using for HMMER search, default: 1e-6.
-n THREAD The the number of threads used, default: 1.
-p PREFIX The the prefix of output file, default character: 'host'.
-o OUTPUT The path of output folder to store all the results.
--gag GAG_LENGTH The threshold of length of GAG protein in HMMER search, default: 250 aa.
--pro PRO_LENGTH The threshold of length of PRO protein in HMMER search, default: 50 aa.
--rt RT_LENGTH The threshold of length of RT protein in HMMER search, default: 150 aa.
--rh RNASEH_LENGTH The threshold of length of RNaseH protein in HMMER search, default: 65 aa.
--int INT_LENGTH The threshold of length of INT protein in HMMER search, default: 150 aa.
--env ENV_LENGTH The threshold of length of ENV protein in HMMER search, default: 250 aa.

4. Example of usage

The bat Myotis myotis (GCA_004026985.1) , and its genome data was downloaded from https://www.ncbi.nlm.nih.gov/datasets/taxonomy/51298/

Then, run:

ervdetective -i GCA_004026985.1_MyoMyo_v1_BIUU_genomic.fna -p myotis_myotis -o output

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ervdetective-1.0.8.tar.gz (999.1 kB view details)

Uploaded Source

Built Distribution

ervdetective-1.0.8-py3-none-any.whl (1.0 MB view details)

Uploaded Python 3

File details

Details for the file ervdetective-1.0.8.tar.gz.

File metadata

  • Download URL: ervdetective-1.0.8.tar.gz
  • Upload date:
  • Size: 999.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.7.9

File hashes

Hashes for ervdetective-1.0.8.tar.gz
Algorithm Hash digest
SHA256 3438eaf219dc78c51f7711dc420b6f7cc7d71ec4b6608573a11df1dac8f407d4
MD5 99bd67a847d5f48dfdc74ed52f8bf5c6
BLAKE2b-256 511cf82a4ef583a48799e029cfe0d671866cce847c251572afcb8cef93782dde

See more details on using hashes here.

File details

Details for the file ervdetective-1.0.8-py3-none-any.whl.

File metadata

File hashes

Hashes for ervdetective-1.0.8-py3-none-any.whl
Algorithm Hash digest
SHA256 6c4b543b03e387705637e829ebf38fa480cd5fba3b26bd22fb7c2c84c4a21672
MD5 2412df0d5c30b1101b14f7c7ae170067
BLAKE2b-256 6de4d44645cd746b07130ecb86acff485f0851ab426673535c9e9d4b0e349843

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page