Skip to main content

Tree-based orthology inference

Project description

ppp_logotype

About

Orthologs are genes that are related through a speciation event, while paralogs are genes that are related through a gene duplication event. Accurate identification of orthologs is a prerequisite for phylogenomics, since including genes that diverged because of a gene duplication event for species tree inference can cause an erroneous inference of speciation nodes, due to disparencies between individual gene trees and the species tree. Unfortunately, contaminants present in even a single taxon can cause a tree-based orthology inference method to erroneuosly infer paralogy and unnecessarily exclude sequences.

PhyloPyPruner is a Python package for phylogenetic tree-based orthology inference, using the species overlap method. It uses trees and alignments inferred from the output of a graph-based orthology inference approach, such as OrthoMCL, OrthoFinder or HaMStR, in order to obtain sets of sequences that are 1:1 orthologous. In addition to algorithms seen in pre-existing tree-based tools (for example, PhyloTreePruner, UPhO, Agalma or Phylogenomic Dataset Reconstruction), this package provides new methods for reducing potential contamination.

proteomes2orthologs

Figure 1. A rough overview of a tree-based orthology inference approach.

Quick installation

The easiest way to install PhyloPyPruner is by using the package manager for Python, pip:

pip install phylopypruner # install for all users
pip install --user phylopypruner # install for the current user only

Once installed, the program is located within $HOME/.local/bin. Depending on your OS, you might have to add the directory to your $PATH to avoid typing the entire path. Once in your path, you run the program like this:

phylopypruner

Documentation

  1. About PhyloPyPruner
  2. Tutorial
  3. Installation
  4. Input data
  5. Output files
  6. Methods
  7. Options

Cite

Our manuscript is still in preparation, it will be posted here once a preprint of the article is available.

© Kocot Lab 2019

Project details


Release history Release notifications | RSS feed

This version

1.2.7

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

phylopypruner-1.2.7.tar.gz (58.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

phylopypruner-1.2.7-py3-none-any.whl (65.6 kB view details)

Uploaded Python 3

File details

Details for the file phylopypruner-1.2.7.tar.gz.

File metadata

  • Download URL: phylopypruner-1.2.7.tar.gz
  • Upload date:
  • Size: 58.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for phylopypruner-1.2.7.tar.gz
Algorithm Hash digest
SHA256 9ccaa460a82649c4183476b5860da4c0d3118e9d73c4fd13221f62950a1b9b5d
MD5 e7999272ab3de6bf201db75e8709cb7f
BLAKE2b-256 4c808381e3948de094135eeb11d5c1efa260151e7b8fd581fd721b71edc8b583

See more details on using hashes here.

File details

Details for the file phylopypruner-1.2.7-py3-none-any.whl.

File metadata

  • Download URL: phylopypruner-1.2.7-py3-none-any.whl
  • Upload date:
  • Size: 65.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for phylopypruner-1.2.7-py3-none-any.whl
Algorithm Hash digest
SHA256 d50184bc2325bf670cb4fe7ee4edd6fd13ea96ce691582e2220a822ad3242d5f
MD5 1ecd036f3d57669146f8c65db32c74e4
BLAKE2b-256 f5f753e947c17716f9745663fc6f0e44f8546af776fefe97e0a542b6fa3aac7b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page