Skip to main content

tree-based orthology inference

Project description

PhyloPyPruner is a Python package for refining the output of a graph-based orthology inference approach such as [OrthoMCL](https://www.ncbi.nlm.nih.gov/pubmed/12952885), [OrthoFinder](https://www.ncbi.nlm.nih.gov/pubmed/26243257) or [HaMStR](https://www.ncbi.nlm.nih.gov/pubmed/19586527). Similar to other tree-based orthology inference methods (e.g., [PhyloTreePruner](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3825643/), [UPhO](https://academic.oup.com/mbe/article/33/8/2117/2578877), [Agalma](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3840672/) and [Phylogenomic Dataset Reconstruction](https://www.ncbi.nlm.nih.gov/pubmed/25158799)), it uses phylogenetic trees in order to obtain genes that are 1:1 orthologous. In addition to filters and algorithms seen in pre-existing tools, this package provides new methods for differentiating [contamination-like sequences](https://gitlab.com/fethalen/phylopypruner/wikis/About-PhyloPyPruner#contamination-like-issues-) from paralogs.

PhyloPyPruner is under active development and I would appreciate it if you try this software on your own data and [leave feedback](mailto:felix.thalen@uni-goettingen.de).

![proteomes2orthologs](https://gitlab.com/fethalen/phylopypruner/raw/master/doc/images/proteomes2orthologs.png)

Figure 1. An overview of a tree-based orthology inference approach. First, sequences from different species are clustered together based on an all-versus-all BLAST, followed by Markov clustering. Each node in the cluster corresponds to a sequence and each edge corresponds to a similarity score. Homologous groups are then aligned and a phylogenetic tree is inferred from the alignment. From this tree, orthologous groups can be identified and paralogs are pruned away.

## Quick installation

The easiest way to install PhyloPyPruner is by using the package manager for Python, [pip](https://pypi.org/project/pip/):

`bash pip install phylopypruner # install for all users pip install --user phylopypruner # install for the current user only `

Once installed, the program is located within $HOME/.local/bin. Depending on your OS, you might have to add the directory to your $PATH to avoid typing the entire path. Once in your path, you run the program like this:

`bash phylopypruner `

## [Documentation](https://gitlab.com/fethalen/phylopypruner/wikis)

## Cite

Our manuscript is still in preparation, it will be posted here once a preprint of the article is available.

© [Kocot Lab](https://www.kocotlab.com/) 2018

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

phylopypruner-0.5.2.tar.gz (36.9 kB view details)

Uploaded Source

Built Distribution

phylopypruner-0.5.2-py3-none-any.whl (47.5 kB view details)

Uploaded Python 3

File details

Details for the file phylopypruner-0.5.2.tar.gz.

File metadata

  • Download URL: phylopypruner-0.5.2.tar.gz
  • Upload date:
  • Size: 36.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/40.8.0 requests-toolbelt/0.8.0 tqdm/4.23.4 CPython/3.6.7

File hashes

Hashes for phylopypruner-0.5.2.tar.gz
Algorithm Hash digest
SHA256 d9026e994439cc4578b4dc5f2f034ede74047926f04c0b2cd472c853903b9cfe
MD5 3354c137540c0f95b68557b72d1b9026
BLAKE2b-256 a076f5279848a7e704923615ecf29891acb702668cdfa4f45752401883b920d7

See more details on using hashes here.

File details

Details for the file phylopypruner-0.5.2-py3-none-any.whl.

File metadata

  • Download URL: phylopypruner-0.5.2-py3-none-any.whl
  • Upload date:
  • Size: 47.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/40.8.0 requests-toolbelt/0.8.0 tqdm/4.23.4 CPython/3.6.7

File hashes

Hashes for phylopypruner-0.5.2-py3-none-any.whl
Algorithm Hash digest
SHA256 d79ff0638ac27eda755c62d8e08e8138fd135f754596f6ccf92e9366d66102c9
MD5 09b09edc731d05c48eba030290262cbe
BLAKE2b-256 715173937a47b2ba0f5f4f0b388907d1dcc592cbb4e2417e0541bf89dbdbe869

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page