TreeSAPP is a functional and taxonomic annotation tool for genomes and metagenomes.
TreeSAPP: Tree-based Sensitive and Accurate Phylogenetic Profiler
Overview
TreeSAPP is a Python package for functional and taxonomic annotation of proteins from genomes and metagenomes using phylogenetic placement.
Quick start
We recommend installing TreeSAPP into its own conda environment with the following command:
conda create -n treesapp_cenv -c bioconda -c conda-forge treesapp
conda activate treesapp_cenv
To list all the sub-commands, run treesapp.
To test the assign workflow, run:
treesapp assign -i TreeSAPP/tests/test_data/marker_test_suite.faa -m prot --trim_align -o assign_test -t McrA,DsrAB
To classify sequences in your genome of interest:
treesapp assign -i my.fasta -o ~/path/to/output/directory/
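For scripted runs, the two assign invocations above could be assembled from Python. This is a minimal sketch that uses only the flags shown in this section (-i, -o, -m, --trim_align, -t); the helper names build_assign_command and run_assign are hypothetical and not part of TreeSAPP.

```python
import shutil
import subprocess
from typing import List, Optional, Sequence


def build_assign_command(
    fasta: str,
    output_dir: str,
    molecule: Optional[str] = None,
    targets: Optional[Sequence[str]] = None,
    trim_align: bool = False,
) -> List[str]:
    """Assemble a `treesapp assign` argument list (hypothetical helper)."""
    cmd = ["treesapp", "assign", "-i", fasta, "-o", output_dir]
    if molecule is not None:
        cmd += ["-m", molecule]  # e.g. "prot", as in the test command above
    if trim_align:
        cmd.append("--trim_align")
    if targets:
        # Reference packages are passed as one comma-separated value.
        cmd += ["-t", ",".join(targets)]
    return cmd


def run_assign(cmd: List[str]) -> int:
    """Run the command if the executable is on PATH and return its exit code."""
    if shutil.which(cmd[0]) is None:
        raise FileNotFoundError(
            f"{cmd[0]} not found on PATH; is the conda environment active?"
        )
    return subprocess.run(cmd, check=False).returncode


if __name__ == "__main__":
    # Rebuilds the test command from this section and prints it without running it.
    test_cmd = build_assign_command(
        "TreeSAPP/tests/test_data/marker_test_suite.faa",
        "assign_test",
        molecule="prot",
        targets=["McrA", "DsrAB"],
        trim_align=True,
    )
    print(" ".join(test_cmd))
```

Keeping command construction separate from execution makes it possible to inspect or log the exact command line before anything is run.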
TreeSAPP comes installed with 33 reference packages involved in a variety of biogeochemical and cellular processes. We also have many more reference packages available on our RefPkgs repository and you can view the complete list here.
Tutorials
All of our tutorials are available on the GitHub wiki page. Here are some specific tutorial examples:
If we do not yet have a reference package for a gene you are interested in, please try building a new reference package. If you run into any problems or would like to collaborate on building many reference packages, don't hesitate to email us or create a new issue with an 'enhancement' label.
To determine whether the sequences used to build your new reference package are what you think they are, and whether it might unexpectedly annotate homologous sequences, see the purity tutorial.
If you are working with a particularly complex reference package (from an orthologous group, for example) or have extra phylogenetic information you'd like to include in your classifications, try annotating extra features with treesapp layer.
Citation
If you found TreeSAPP useful in your work, please cite the following paper:
Morgan-Lang, C., McLaughlin, R., Armstrong, Z., Zhang, G., Chan, K., & Hallam, S. J. (2020). TreeSAPP: The Tree-based Sensitive and Accurate Phylogenetic Profiler. Bioinformatics, 1–8.
This was brought to you by the team:
- Connor Morgan-Lang (cmorganl, maintainer)
- Ryan McLaughlin (McGlock)
- Grace Zhang (grace72)
- Kevin Chan (kevinxchan)
- Zachary Armstrong
- Steven J. Hallam
References
If you're feeling extra citation-happy, please consider citing the following works as well:
- Eddy, S. R. (1998). Profile hidden Markov models. Bioinformatics (Oxford, England), 14(9), 755–763.
- Criscuolo, A., & Gribaldo, S. (2010). BMGE (Block Mapping and Gathering with Entropy): A new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evolutionary Biology, 10(1).
- Kozlov, A. M., Darriba, D., Flouri, T., Morel, B., & Stamatakis, A. (2019). RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics, 35(21), 4453–4455.
- Barbera, P., Kozlov, A. M., Czech, L., Morel, B., & Stamatakis, A. (2018). EPA-ng: Massively Parallel Evolutionary Placement of Genetic Sequences. Systematic Biology, 0(0), 291658.
Download files
File details
Details for the file treesapp-0.11.4.tar.gz.
File metadata
- Download URL: treesapp-0.11.4.tar.gz
- Upload date:
- Size: 11.2 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest
---|---
SHA256 | b7f8353339b1a449d1cf9d31e425b37490653a9b854a7bd3fe1ceceeb87e8eee
MD5 | 4df92183839db81339792d2533908711
BLAKE2b-256 | c922768e33057257389a7d4e234e7910768a8ab3a44017e76990be5cca8712a4
File details
Details for the file treesapp-0.11.4-cp38-cp38-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.
File metadata
- Download URL: treesapp-0.11.4-cp38-cp38-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 11.7 MB
- Tags: CPython 3.8, manylinux: glibc 2.12+ x86-64, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest
---|---
SHA256 | 8afdb1bf3c012c29753d98ad2b6c484cbf1978a276e0dc2c3621fd092d1a531c
MD5 | d9b69f45d55e251f9893d3da5f5d7c38
BLAKE2b-256 | 5df9dc60da8fa76fcfa4775aad1f9c19595e884936ec23c9f44c1bf50c2d4a79
File details
Details for the file treesapp-0.11.4-cp38-cp38-macosx_10_15_x86_64.whl.
File metadata
- Download URL: treesapp-0.11.4-cp38-cp38-macosx_10_15_x86_64.whl
- Upload date:
- Size: 11.6 MB
- Tags: CPython 3.8, macOS 10.15+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest
---|---
SHA256 | 2a56a39a690769df9aa9ab036543d3cc86e1443cd81b23748e9ccd5c657212b7
MD5 | 34c2200472ddee25113b0754947c7458
BLAKE2b-256 | 152d5cb5ea596a242bd78062fd97e72e1e6eaca355e50318e6da44b77fd5f1fe
File details
Details for the file treesapp-0.11.4-cp37-cp37m-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.
File metadata
- Download URL: treesapp-0.11.4-cp37-cp37m-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 11.7 MB
- Tags: CPython 3.7m, manylinux: glibc 2.12+ x86-64, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest
---|---
SHA256 | 8e020b9af00f5ff2b06c93a740afc746a59cd12109ea10dfc98bbcb1dcc4396d
MD5 | 396683ca1d3245bc1b521a1522677484
BLAKE2b-256 | 710453b73c8b470e8f2328c0dc06a0ca3b873102fa188e2dbab1480d71d7f650
File details
Details for the file treesapp-0.11.4-cp37-cp37m-macosx_10_15_x86_64.whl.
File metadata
- Download URL: treesapp-0.11.4-cp37-cp37m-macosx_10_15_x86_64.whl
- Upload date:
- Size: 11.6 MB
- Tags: CPython 3.7m, macOS 10.15+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest
---|---
SHA256 | 58f8a8b6cd5cc413e4cd279e2276a3c70938e35da3c72036e7ad1a1d7aa88389
MD5 | f452b3383797765c0aaeb6f413601c66
BLAKE2b-256 | 67879be582b7886e5985df11e15c67207b6a05ee4839d97c0004c4013359e65c
File details
Details for the file treesapp-0.11.4-cp36-cp36m-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.
File metadata
- Download URL: treesapp-0.11.4-cp36-cp36m-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 11.7 MB
- Tags: CPython 3.6m, manylinux: glibc 2.12+ x86-64, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest
---|---
SHA256 | 12bc12d91fe329b6b8e4ea50dd48c42c1ea04437e731c06a68cfdeec962fa53c
MD5 | a27db996a19a928fab68150121f1bbd8
BLAKE2b-256 | 82ca03bc92c0210e0af776899b8caa5a536b55eb54a1e050ba68b75e85b4b7fb
File details
Details for the file treesapp-0.11.4-cp36-cp36m-macosx_10_14_x86_64.whl.
File metadata
- Download URL: treesapp-0.11.4-cp36-cp36m-macosx_10_14_x86_64.whl
- Upload date:
- Size: 11.6 MB
- Tags: CPython 3.6m, macOS 10.14+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest
---|---
SHA256 | dd1cd88a80d0bb9940aecd872587069804eed2a0c36a044a7a0e2eeb6b90d11f
MD5 | 65c5d17953dfd01ec686dfb2c580ad6c
BLAKE2b-256 | b466d95281da17aebc080a80471feecb73915b55474249edfc62ffafed279121
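A downloaded distribution file can be checked against the digests listed above. The sketch below uses only Python's standard hashlib; the helper names file_digest and matches are hypothetical, and it assumes the BLAKE2b-256 column is a BLAKE2b digest computed with a 32-byte output (digest_size=32).

```python
import hashlib


def file_digest(path: str, algorithm: str = "sha256", chunk_size: int = 1 << 20) -> str:
    """Return the hex digest of a file, reading it in chunks (hypothetical helper)."""
    if algorithm == "blake2b-256":
        # Assumption: the BLAKE2b-256 column means BLAKE2b with a 32-byte digest.
        hasher = hashlib.blake2b(digest_size=32)
    else:
        hasher = hashlib.new(algorithm)  # e.g. "sha256" or "md5"
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def matches(path: str, expected_hex: str, algorithm: str = "sha256") -> bool:
    """Compare a file's digest with an expected hex string, ignoring case."""
    return file_digest(path, algorithm) == expected_hex.strip().lower()


# Example call, assuming the source tarball has been downloaded to the
# current directory (SHA256 value copied from the listing above):
# matches(
#     "treesapp-0.11.4.tar.gz",
#     "b7f8353339b1a449d1cf9d31e425b37490653a9b854a7bd3fe1ceceeb87e8eee",
# )
```

Reading in chunks keeps memory use flat for the roughly 11 MB files listed here, and SHA256 is the sensible default since MD5 is no longer considered collision-resistant.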