An efficient pipeline for identification and annotation of endogenous retroviruses (ERVs)
Project description
ERVdetective: an efficient pipeline for identification and annotation of endogenous retroviruses (ERVs)
1. Download and install
ervdetective is a command-line-interface program developed based on Python 3, and you can download and install the ervdetective in a variety of ways.
1.1. pip method
ervdetective has been distributed to the standard library of PyPI (https://pypi.org/project/ervdetective/), and can be easily installed by the tool pip.
Firstly, download Python3 (https://www.python.org/), and install Python3 and pip tool, then,
pip install ervdetective
ervdetective -h
1.2. Or local installation
In addition, ervdetective can also be installed manually using the file using the file setup.py.
Firstly, download this repository, then, run:
python setup.py install
ervdetective -h
2. Software dependencies
The running of ervdetective relies on these softwares:
-
blast (version >=2.9.0+), has to contain
makeblastdbandtblastn. -
genometools (version >=1.6.1), has to contain
ltrharvest. -
hmmer (version >=3.0), has to contain
hmmpressandhmmscan.
Note, these dependencies need to be installed and added to environment variables of system (or user) beforehand, because ervdetective call them from the environment variables directly.
3. Getting help
The help documentation can be get by entering ervdetective -h or ervdetective --help.
| Parameter | Description |
|---|---|
| -h, --help | show this help message and exit |
| -i HOST | The file-path of host genome sequence, the suffix is generally *.fna, *.fas, *.fasta. |
| -eb EBLAST | Specify threshold of e-value for BLAST search, default: 1e-5. |
| -f FLANK | The length of extended flank sequence on either side of the blast hit-site, default: 15000. |
| -l1 MINLTR | Specify minimum length of LTR, default: 100. |
| -l2 MAXLTR | Specify maximum length of LTR, default: 1000. |
| -s LTRSIMILAR | Specify threshold(%) of the similarity of paired LTRs, default: 80. |
| -d1 MINDISTLTR | The minimum interval of paired-LTRs start-positions, default: 1000. |
| -d2 MAXDISTLTR | The maximum interval of paired-LTRs start-positions, default: 15000. |
| -t1 MINTSD | The minimum length for each TSD site, default: 4. |
| -t2 MAXTSD | The maximum length for each TSD site, default: 6. |
| -motif MOTIF | Specify start-motif (2 nucleotides) and end-motif (2 nucleotides), default string: TGCA. |
| -mis MISMOTIF | The maximum number of mismatches nucleotides in motif, default: 1. |
| -ed EHMMER | The threshold of e-value using for HMMER search, default: 1e-6. |
| -n THREAD | The the number of threads used, default: 1. |
| -p PREFIX | The the prefix of output file, default character: 'host'. |
| -o OUTPUT | The path of output folder to store all the results. |
| --gag GAG_LENGTH | The threshold of length of GAG protein in HMMER search, default: 250 aa. |
| --pro PRO_LENGTH | The threshold of length of PRO protein in HMMER search, default: 50 aa. |
| --rt RT_LENGTH | The threshold of length of RT protein in HMMER search, default: 150 aa. |
| --rh RNASEH_LENGTH | The threshold of length of RNaseH protein in HMMER search, default: 65 aa. |
| --int INT_LENGTH | The threshold of length of INT protein in HMMER search, default: 150 aa. |
| --env ENV_LENGTH | The threshold of length of ENV protein in HMMER search, default: 250 aa. |
4. Example of usage
The bat Myotis myotis (GCA_004026985.1) , and its genome data was downloaded from https://www.ncbi.nlm.nih.gov/datasets/taxonomy/51298/
Then, run:
ervdetective -i GCA_004026985.1_MyoMyo_v1_BIUU_genomic.fna -p myotis_myotis -n 10 -o output
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ervdetective-1.0.9.tar.gz.
File metadata
- Download URL: ervdetective-1.0.9.tar.gz
- Upload date:
- Size: 998.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.7.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
163a414f36f3f1587d65e7f451d5c50316f790a1867fa06a3f9616bbdc8733b7
|
|
| MD5 |
84758ed1f10db910139981ff0b468a17
|
|
| BLAKE2b-256 |
40ccbb017112be0106dba71d438d1610637b4a4bb94280b1114f13b9c8dad299
|
File details
Details for the file ervdetective-1.0.9-py3-none-any.whl.
File metadata
- Download URL: ervdetective-1.0.9-py3-none-any.whl
- Upload date:
- Size: 999.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.7.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
355de8cae3ffac5a7ef712b24d8ed68c460a80b97fbcce0bffacde21d6f2c17a
|
|
| MD5 |
d4da5f07a72769d3b30789cf71a38582
|
|
| BLAKE2b-256 |
265cbb8676238ebb08055616ba8fa49c403f5c562443dd4e1d4a9a2f88b17981
|