Skip to main content

a python package for automated generation of heuristic phylogenetic trees from genbank files

Project description

getphylo: GEnbank To PHYLOgeny

a python package for automated generation of heuristic phylogenetic trees from genbank files

Description

getphylo was designed to automatically build multi-locus phylogenetic trees from GenBank files. The workflow consists of the following steps: i) extract protein coding sequences; ii) screen for suitable markers; iii) align individual marker sequences and create a combined alignment; and iv) produce a tree from the combined alignment. Please see the 'parameters' section below for a full list of parameters.

Installation

The easiest way to install getphylo is using the command:

pip install getphylo

This will fetch and install the latest version from: https://pypi.org/project/getphylo/

For full installation instructions, please see the getphylo wiki.

Important: getphylo requires DIAMOND, MUSCLE and FastTree2 to be installed to work correctly. These must be installed manually. Further instructions are availiable on the wiki.

Quick-start

This package has been designed to be as easy to run as possible. Simply navigate to a working directory containing .gbk files and input:

getphylo

This will run the software with default settings.

A full list of options and flags can be viewed with:

getphylo -h

A full list of parameters and further usage examples are availiable on the wiki.

Example Analysis and Datasets

Example outputs and benchmarking data can be found in the getphylo benchmarking repository. The example data includes:

  1. A phylogeny of bacterial genomes,
  2. A phylogeny of a biosynthetic gene cluster,
  3. A phylogeny of primate genomes,
  4. A phylogeny of Eurotiomycete fungi.

Citation

If you use getphylo, please cite:

Booth, T. J., Shaw, S., & Weber, T. (2023). getphylo: rapid and automatic generation of multi-locus phylogenetic trees. BioRxiv, 2023.07.26.550493.

DOI: https://doi.org/10.1101/2023.07.26.550493

Patch Notes

Version 0

  • 0.1.0
    • beta version initial release
  • 0.1.1
    • added support for MUSCLE5
  • 0.1.2
    • now raises an error if translations are present but empty
    • error messages from the extract module are now more informative
    • fixed a fatal issue with --build-all
  • 0.2.0
    • now supports iqtree using the --method parameter
  • 0.2.1
    • now able to provide custom paths for binary dependencies
    • parser now has argument groups and is more readable
    • file exists error message more informative
  • 0.2.2
    • added error message when users attempt to input directory instead of a search string

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

getphylo-0.2.2.tar.gz (31.0 kB view details)

Uploaded Source

Built Distribution

getphylo-0.2.2-py3-none-any.whl (35.0 kB view details)

Uploaded Python 3

File details

Details for the file getphylo-0.2.2.tar.gz.

File metadata

  • Download URL: getphylo-0.2.2.tar.gz
  • Upload date:
  • Size: 31.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.12

File hashes

Hashes for getphylo-0.2.2.tar.gz
Algorithm Hash digest
SHA256 ab1cc57dfae2f79f1aaa4040c06462105b08b4bdfea674800920f516f283f0c9
MD5 ad24b9c94d2795338d1288f49df610d4
BLAKE2b-256 aaeace3c5ccf8d22d0d14675ecb4382e53fb43f2007a4d9d72914ad8921abfea

See more details on using hashes here.

File details

Details for the file getphylo-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: getphylo-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 35.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.12

File hashes

Hashes for getphylo-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 50a85b956ffe59a6f8e93bc6e40c93c0b69dff11b228419dcbdbadc68adf36fc
MD5 28987e4001932df4b142cf60fd9c62aa
BLAKE2b-256 e3ec3f8a50f4d35dba7cac3624c1e6708f0a554ff00328d5c31029e1e009f316

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page