TreeSAPP is a functional and taxonomic annotation tool for genomes and metagenomes.
TreeSAPP: Tree-based Sensitive and Accurate Phylogenetic Profiler
Connor Morgan-Lang, Ryan McLaughlin, Grace Zhang, Kevin Chan, Zachary Armstrong, and Steven J. Hallam
Overview
TreeSAPP is a Python package for phylogenetically annotating genomes and metagenomes.
[Figure: diagram of the TreeSAPP workflow]
Installation
TreeSAPP supports Python versions 3.5, 3.6, 3.7 and 3.8.
Conda
TreeSAPP and most of its dependencies can be installed in its own environment using conda.
conda create -n treesapp_cenv -c bioconda -c conda-forge treesapp
conda activate treesapp_cenv
If you plan on building your own reference packages you will also require USEARCH.
Singularity
If you're working in an HPC environment and don't have conda installed, we also have a singularity container available:
singularity pull library://cmorganl/default/treesapp
singularity exec treesapp.sif
PyPI
The most recent version of TreeSAPP is hosted on the Python Package Index (PyPI) and can be installed using pip install treesapp.
Alternatively you can install the latest development version of TreeSAPP locally with git clone.
In either case we recommend installing within a virtual environment using the Python package virtualenv.
cd ~/bin
virtualenv ~/bin/treesapp_venv
source ~/bin/treesapp_venv/bin/activate
git clone https://github.com/hallamlab/TreeSAPP.git
cd TreeSAPP/
python setup.py sdist
pip install dist/treesapp*.tar.gz
If you installed TreeSAPP using pip or by cloning the development version from GitHub, you will need to install any dependencies that you do not already have (i.e. their executables will need to be in your environment's path).
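A quick way to see which dependencies are still missing is to test whether each executable can be found on your PATH. The sketch below is a minimal shell helper written for illustration; the function name check_tools is hypothetical, and the executable names in the example comment are assumptions rather than an official list of TreeSAPP's dependencies.

```shell
# Report which of the given executables cannot be found on PATH.
# check_tools is a hypothetical helper, not part of TreeSAPP.
check_tools() {
    missing=0
    for tool in "$@"; do
        if command -v "$tool" >/dev/null 2>&1; then
            echo "found:   $tool"
        else
            echo "missing: $tool"
            missing=1
        fi
    done
    # Non-zero exit status if at least one executable was not found.
    return "$missing"
}

# Example (executable names are assumptions):
#   check_tools treesapp usearch
```

The function exits with a non-zero status when anything is missing, so it can also be used as a guard at the top of a script.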
Running TreeSAPP
To list all the sub-commands run treesapp.
To test the assign workflow, run:
treesapp assign -i ~/bin/TreeSAPP/test_data/marker_test_suite.faa -m prot --trim_align -o assign_test -t M0701,M0702,M0705
To assign sequences in your genome of interest:
treesapp assign -i Any.fasta -o ~/path/to/output/directory/
As in the test command above, we recommend using the --trim_align flag, and increasing the number of threads to use with -n.
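If you have several genomes to annotate, the same command can be wrapped in a loop. The following is a minimal sketch, not an official TreeSAPP script: the function name assign_all, its arguments, the default of 4 threads, the TREESAPP override variable and the one-directory-per-sample output layout are all assumptions made for illustration. Only the flags -i, -o, --trim_align and -n are taken from the commands shown here.

```shell
# Run `treesapp assign` on every .fasta file in a directory.
# assign_all and its arguments are hypothetical; only the flags
# -i, -o, --trim_align and -n come from the TreeSAPP README.
assign_all() {
    in_dir=$1         # directory containing *.fasta files
    out_dir=$2        # parent directory for per-sample outputs
    threads=${3:-4}   # value passed to -n; 4 is an arbitrary default
    for fasta in "$in_dir"/*.fasta; do
        [ -e "$fasta" ] || continue   # skip if the glob matched nothing
        sample=$(basename "$fasta" .fasta)
        # TREESAPP can be overridden (e.g. TREESAPP=echo) for a dry run.
        ${TREESAPP:-treesapp} assign -i "$fasta" -o "$out_dir/$sample/" \
            --trim_align -n "$threads" || return 1
    done
}
```

Setting TREESAPP=echo before calling assign_all ./genomes ./treesapp_out 8 prints the arguments that would be passed to treesapp instead of running it, which is a cheap way to check paths before starting a long job.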
Tutorials
If we do not yet have a reference package for a gene you are interested in, please try building a new reference package. Of course, if you run into any problems or would like to collaborate on building many reference packages, don't hesitate to email us or create a new issue with an 'enhancement' label.
To determine whether the sequences used to build your new reference package are what you think they are, and whether it might unexpectedly annotate homologous sequences, see the purity tutorial.
If you are working with a particularly complex reference package, from an orthologous group for example, or have extra phylogenetic information you'd like to include in your classifications, try annotating extra features with treesapp layer.
Yet to come
Download files
Source Distribution
Built Distributions
Hashes for treesapp-0.6.7-cp38-cp38-manylinux2010_x86_64.whl
Algorithm | Hash digest
---|---
SHA256 | d8b62892eee16a080f5cff9f5410f7f296e3d13efd64d304fdde385750d69a57
MD5 | fa21b24afc8bddd78b0da2b2014391ff
BLAKE2b-256 | 65668ad697153f519167d43347900a9384b94448e69e6bb4f98af8561f0fc5f7
Hashes for treesapp-0.6.7-cp38-cp38-macosx_10_9_x86_64.whl
Algorithm | Hash digest
---|---
SHA256 | 30b80dbac58b114c43168f0bf6b3363659a0f023d39b88dfced1f28e91876eb2
MD5 | a6d2ba79274b54641a1f12ef7c596f37
BLAKE2b-256 | 08aa45bfdc58df7589b388033e65b60f0020a1e5ffad172c951398e2cd2bb0c4
Hashes for treesapp-0.6.7-cp37-cp37m-manylinux2010_x86_64.whl
Algorithm | Hash digest
---|---
SHA256 | 380109b3ef7fe0a1ee17edb57e77877ad5c3f653be2c575a60a9ee6e32a678d0
MD5 | f0aa17fb6bd16422d21334f432e22d53
BLAKE2b-256 | 6d142befd60fabfa217b452cf44a03233cf10cee6000c22df56d6602733b9a83
Hashes for treesapp-0.6.7-cp37-cp37m-macosx_10_6_intel.whl
Algorithm | Hash digest
---|---
SHA256 | f9277aa36ce77db4262ac109ffdbd56ef5fa84bb059996b650068cf2bd353356
MD5 | 084571603d32b8441976fa641e919a31
BLAKE2b-256 | d11bf5d2c6d3107de273953b5e2f22437676969764f2b835a6ebd0909a309af2
Hashes for treesapp-0.6.7-cp36-cp36m-manylinux2010_x86_64.whl
Algorithm | Hash digest
---|---
SHA256 | 83be64dd233741851f9690e7001b89238c78f87423966569c5b6186d8286bcc5
MD5 | 0b476dc7eb859f27522112a3be85d435
BLAKE2b-256 | fb237878e5e0a5508736e2ad3a60aa04a00a0e6cc21d7269d50ac133413450d0
Hashes for treesapp-0.6.7-cp36-cp36m-macosx_10_6_intel.whl
Algorithm | Hash digest
---|---
SHA256 | 1926f5d2dabe95b98a8d2c33a3fcc52b73302d69bcf33fbb49eae201c64d4a83
MD5 | 8e930254546930072a68e52482f9879c
BLAKE2b-256 | e19fd42ab941fcd0155eac9e59071a736295038c564e89c28a1ca41af3acb8fa
Hashes for treesapp-0.6.7-cp35-cp35m-manylinux2010_x86_64.whl
Algorithm | Hash digest
---|---
SHA256 | 1ae58b6634ef8860580134e3ede3df4eb6c82b1086d845cd6b841d78f986f0c2
MD5 | a5a348ca60636a292acb001248cabd04
BLAKE2b-256 | e9bbfd48a6c5d783a8d1285bf03040c2d9cc9c5d12d7ad292a8637874682ef96
Hashes for treesapp-0.6.7-cp35-cp35m-macosx_10_6_intel.whl
Algorithm | Hash digest
---|---
SHA256 | 1bf7a519d6bb030cab6d94517df4fa4c655dc81885ca845575a9cb9c6224b3d0
MD5 | 1f297c6ec011c82a221dbe75e740e3e1
BLAKE2b-256 | 34902c03042bdd65ee0c75410bf6fab6c24e2ff5af1c6fb0b6170e9fa8e70a05
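The digests listed for each file can be used to check a download before installing it. The helper below is a minimal sketch: verify_sha256 is a hypothetical function name, and it assumes that either sha256sum (common on Linux) or shasum (common on macOS) is available on your system.

```shell
# Compare a file's SHA256 digest with an expected value.
# verify_sha256 is a hypothetical helper, not part of TreeSAPP or pip.
verify_sha256() {
    file=$1
    expected=$2
    if command -v sha256sum >/dev/null 2>&1; then
        actual=$(sha256sum "$file" | cut -d ' ' -f 1)
    else
        # Fall back to shasum where sha256sum is not installed.
        actual=$(shasum -a 256 "$file" | cut -d ' ' -f 1)
    fi
    if [ "$actual" = "$expected" ]; then
        echo "OK: $file"
    else
        echo "MISMATCH: $file"
        return 1
    fi
}

# Example, using the SHA256 digest listed for the cp38 manylinux wheel:
#   verify_sha256 treesapp-0.6.7-cp38-cp38-manylinux2010_x86_64.whl \
#       d8b62892eee16a080f5cff9f5410f7f296e3d13efd64d304fdde385750d69a57
```

SHA256 is used here rather than MD5 because MD5 is no longer considered collision resistant.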