Skip to main content

tree-based orthology inference

Project description

ppp_logotype

About

Orthologs are genes that are related through a speciation event, while paralogs are genes that are related through a gene duplication event. Accurate identification of orthologs is a prerequisite for phylogenomics, since including genes that diverged because of a gene duplication event for species tree inference can cause an erroneous inference of speciation nodes, due to disparencies between individual gene trees and the species tree. Unfortunately, contaminants present in even a single taxon can cause a tree-based orthology inference method to erroneuosly infer paralogy and unnecessarily exclude sequences.

PhyloPyPruner is a Python package for phylogenetic tree-based orthology inference, using the species overlap method. It uses trees and alignments inferred from the output of a graph-based orthology inference approach, such as OrthoMCL, OrthoFinder or HaMStR, in order to obtain sets of sequences that are 1:1 orthologous. In addition to algorithms seen in pre-existing tree-based tools (for example, PhyloTreePruner, UPhO, Agalma or Phylogenomic Dataset Reconstruction), this package provides new methods for reducing potential contamination.

proteomes2orthologs

Figure 1. A rough overview of a tree-based orthology inference approach.

Quick installation

The easiest way to install PhyloPyPruner is by using the package manager for Python, pip:

pip install phylopypruner # install for all users
pip install --user phylopypruner # install for the current user only

Once installed, the program is located within $HOME/.local/bin. Depending on your OS, you might have to add the directory to your $PATH to avoid typing the entire path. Once in your path, you run the program like this:

phylopypruner

Documentation

  1. About PhyloPyPruner
  2. Tutorial
  3. Installation
  4. Input data
  5. Output files
  6. Methods
  7. Options

Cite

Our manuscript is still in preparation, it will be posted here once a preprint of the article is available.

© Kocot Lab 2019

Project details


Release history Release notifications | RSS feed

This version

1.2.2

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

phylopypruner-1.2.2.tar.gz (1.8 MB view details)

Uploaded Source

Built Distribution

phylopypruner-1.2.2-py3-none-any.whl (65.4 kB view details)

Uploaded Python 3

File details

Details for the file phylopypruner-1.2.2.tar.gz.

File metadata

  • Download URL: phylopypruner-1.2.2.tar.gz
  • Upload date:
  • Size: 1.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.5

File hashes

Hashes for phylopypruner-1.2.2.tar.gz
Algorithm Hash digest
SHA256 b9e823d9e909d0778a065cd778a023ddd09178b2e2fc1b934ef54574bf7fbb33
MD5 641d7b6fbc12dd4cb762251f38ea9e35
BLAKE2b-256 9ca27b5fcc05ee20e1c53995591567c429f6ffc53741a27a5f345179d5d05537

See more details on using hashes here.

File details

Details for the file phylopypruner-1.2.2-py3-none-any.whl.

File metadata

  • Download URL: phylopypruner-1.2.2-py3-none-any.whl
  • Upload date:
  • Size: 65.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.5

File hashes

Hashes for phylopypruner-1.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 00caf1d8ee8bfff591fbd90d933ed61c67275f6c8e086fbb037f784c77dbec43
MD5 cdc0b32d1caabbdddfc26745d151fdac
BLAKE2b-256 130ed8c57250b55256d9a5020c0350edac3e802a101f418a59d4676d4889c3ec

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page