tree-based orthology inference
Project description
PhyloPyPruner is a Python package for refining the output of a graph-based orthology inference approach such as [OrthoMCL](https://www.ncbi.nlm.nih.gov/pubmed/12952885), [OrthoFinder](https://www.ncbi.nlm.nih.gov/pubmed/26243257) or [HaMStR](https://www.ncbi.nlm.nih.gov/pubmed/19586527). Similar to other tree-based orthology inference methods (e.g., [PhyloTreePruner](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3825643/), [UPhO](https://academic.oup.com/mbe/article/33/8/2117/2578877), [Agalma](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3840672/) and [Phylogenomic Dataset Reconstruction](https://www.ncbi.nlm.nih.gov/pubmed/25158799)), it uses phylogenetic trees in order to obtain genes that are 1:1 orthologous. In addition to filters and algorithms seen in pre-existing tools, this package provides new methods for differentiating [contamination-like sequences](https://gitlab.com/fethalen/phylopypruner/wikis/About-PhyloPyPruner#contamination-like-issues-) from paralogs.
PhyloPyPruner is under active development and I would appreciate it if you try this software on your own data and [leave feedback](mailto:felix.thalen@uni-goettingen.de).
![proteomes2orthologs](https://gitlab.com/fethalen/phylopypruner/raw/master/doc/images/proteomes2orthologs.png)
Figure 1. An overview of a tree-based orthology inference approach. First, sequences from different species are clustered together based on an all-versus-all BLAST, followed by Markov clustering. Each node in the cluster corresponds to a sequence and each edge corresponds to a similarity score. Homologous groups are then aligned and a phylogenetic tree is inferred from the alignment. From this tree, orthologous groups can be identified and paralogs are pruned away.
## Quick installation
The easiest way to install PhyloPyPruner is by using the package manager for Python, [pip](https://pypi.org/project/pip/):
`bash pip install phylopypruner # install for all users pip install --user phylopypruner # install for the current user only `
Once installed, the program is located within $HOME/.local/bin. Depending on your OS, you might have to add the directory to your $PATH to avoid typing the entire path. Once in your path, you run the program like this:
`bash phylopypruner `
## [Documentation](https://gitlab.com/fethalen/phylopypruner/wikis)
[About PhyloPyPruner](https://gitlab.com/fethalen/phylopypruner/wikis/about-phylopypruner)
Tutorial
[Installation](https://gitlab.com/fethalen/phylopypruner/wikis/installation)
[Input data](https://gitlab.com/fethalen/phylopypruner/wikis/input-data)
[Output files](https://gitlab.com/fethalen/phylopypruner/wikis/output-files)
[Methods](https://gitlab.com/fethalen/phylopypruner/wikis/methods)
[Options](https://gitlab.com/fethalen/phylopypruner/wikis/options)
## Cite
Our manuscript is still in preparation, it will be posted here once a preprint of the article is available.
© [Kocot Lab](https://www.kocotlab.com/) 2018
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file phylopypruner-0.5.2.tar.gz
.
File metadata
- Download URL: phylopypruner-0.5.2.tar.gz
- Upload date:
- Size: 36.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/40.8.0 requests-toolbelt/0.8.0 tqdm/4.23.4 CPython/3.6.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d9026e994439cc4578b4dc5f2f034ede74047926f04c0b2cd472c853903b9cfe |
|
MD5 | 3354c137540c0f95b68557b72d1b9026 |
|
BLAKE2b-256 | a076f5279848a7e704923615ecf29891acb702668cdfa4f45752401883b920d7 |
File details
Details for the file phylopypruner-0.5.2-py3-none-any.whl
.
File metadata
- Download URL: phylopypruner-0.5.2-py3-none-any.whl
- Upload date:
- Size: 47.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/40.8.0 requests-toolbelt/0.8.0 tqdm/4.23.4 CPython/3.6.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d79ff0638ac27eda755c62d8e08e8138fd135f754596f6ccf92e9366d66102c9 |
|
MD5 | 09b09edc731d05c48eba030290262cbe |
|
BLAKE2b-256 | 715173937a47b2ba0f5f4f0b388907d1dcc592cbb4e2417e0541bf89dbdbe869 |