Skip to main content

A toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes.

Project description

GTDB-Tk

PyPI PyPI Downloads Bioconda BioConda Downloads Docker Image Version (latest by date) Docker Pulls

GTDB-Tk is a software toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes based on the Genome Database Taxonomy (GTDB). It is designed to work with recent advances that allow hundreds or thousands of metagenome-assembled genomes (MAGs) to be obtained directly from environmental samples. It can also be applied to isolate and single-cell genomes. The GTDB-Tk is open source and released under the GNU General Public License (Version 3).

Notifications about GTDB-Tk releases will be available through the GTDB Twitter account and the GTDB Announcements Forum.

Please post questions and issues related to GTDB-Tk on the Issues section of the GitHub repository. Questions related to the GTDB can be posted on the GTDB Forum or sent to the GTDB team.

🚀 Getting started

Be sure to check the hardware requirements, then choose your preferred method:

📖 Documentation

Documentation for GTDB-Tk can be found here.

✨ New Features

GTDB-Tk v2.4.0+ includes the following new features:

  • FastANI has been replaced by skani as the primary tool for computing Average Nucleotide Identity (ANI).Users may notice slight variations in the results compared to those obtained using FastANI.

📈 Performance

Using ANI screen "can" reduce computation by >50%, although it depends on the set of input genomes. A set of input genomes consisting primarily of new species will not benefit from ANI screen as much as a set of genomes that are largely assigned to GTDB species clusters. In the latter case, the ANI screen will reduce the number of genomes that need to be classified by pplacer which reduces computation time substantially (between 25% and 60% in our testing).

📚 References

GTDB-Tk is described in:

The Genome Taxonomy Database (GTDB) is described in:

We strongly encourage you to cite the following 3rd party dependencies:

© Copyright

Copyright 2017 Pierre-Alain Chaumeil. See LICENSE for further details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gtdbtk-2.4.0.tar.gz (1.7 MB view details)

Uploaded Source

Built Distribution

gtdbtk-2.4.0-py3-none-any.whl (1.8 MB view details)

Uploaded Python 3

File details

Details for the file gtdbtk-2.4.0.tar.gz.

File metadata

  • Download URL: gtdbtk-2.4.0.tar.gz
  • Upload date:
  • Size: 1.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.8.19

File hashes

Hashes for gtdbtk-2.4.0.tar.gz
Algorithm Hash digest
SHA256 e67bab2c8f3e47c7242c70236c78e85bb9dc4721636bbf5044b171f18f22b1f7
MD5 4cb391517c2f98c253f9a003e5a64fd3
BLAKE2b-256 c6bdb66510b12e1b143fcd6fd45a78a8c50cf8750e95d98ebcb5b496421ab279

See more details on using hashes here.

File details

Details for the file gtdbtk-2.4.0-py3-none-any.whl.

File metadata

  • Download URL: gtdbtk-2.4.0-py3-none-any.whl
  • Upload date:
  • Size: 1.8 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.8.19

File hashes

Hashes for gtdbtk-2.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 420223c61860a2cdcbb345717d4d9509c3a7cb1bd56edf22b698344605d94b61
MD5 b54983ccb9cceebfe0423f2da8672bb9
BLAKE2b-256 3edf7409603fc08d111a48ed3399f1f895e7c6929e295d5b1eac44fab61c8990

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page