ProPhyle metagenomic classifier
Project description
Getting started
Prerequisities
GCC 4.8+
ZLib
Python 3 with ete3 library
SamTools
Recommended way of installation using Conda
Environment installation:
conda create -y --name prophyle \
-c etetoolkit -c bioconda \
python==3.4 ete3 bitarray \
parallel blast samtools=1.3.1
Environment activation:
source activate prophyle
Compile all programs
make -C src
Custom taxonomic trees
Use [bin/build_taxonomic_tree.py](bin/build_taxonomic_tree.py) to build custom taxonomic trees starting from your database’s fasta indexes and taxonomy files ([library/Taxonomy](library/Taxonomy) for more information). Taxonomic identifiers are assigned to the sequences first, and then the tree is built using [ETE Toolkit](http://etetoolkit.org/) and saved as newick format 1. Necessary node attributes are:
name: unique node name (format n[0-9]*)
taxid: unique taxonomic identifier
seqname: names of the sequences sharing the same taxid, separated by @
fastapath: paths of the sequences’ fasta files, separated by @ (absolute or relative from the main directory of the repository)
infasta_offset: positions where each sequence starts inside the corresponding fasta files, separated by @
base_len: length of each sequence, separated by @
Other optional attributes are sci_name, named_lineage, lineage, rank (more info [here](http://etetoolkit.org/docs/latest/tutorial/tutorial_ncbitaxonomy.html#automatic-tree-annotation-using-ncbi-taxonomy)).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for prophyle-0.1.0.12-cp35-cp35m-macosx_10_7_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c68368bef48ab3f80405b76e4bb85c3a57722c2b4c90fa8c2f813b9d7cb6890e |
|
MD5 | 26671f419b25d2ab37ae71bb0a5bebed |
|
BLAKE2b-256 | 858daf9c6a732565295bcdf086468eac4c8e7ab4c417d0f3f2c1fbe6fbe3c9f7 |