ProPhyle metagenomic classifier
Getting started
Prerequisites
GCC 4.8+
ZLib
Python 3 with ete3 library
SamTools
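A quick way to see which of the prerequisites above are already visible on the current machine is a short check like the sketch below. The function name `check_prerequisites` is hypothetical and not part of ProPhyle. The check is also deliberately shallow: it only looks for the `gcc` and `samtools` executables on `PATH`, the major version of the running Python interpreter, and an importable `ete3` module. It does not verify the GCC version (4.8+) or the presence of the zlib development headers.

```python
import importlib.util
import shutil
import sys


def check_prerequisites():
    """Return a dict mapping each checked prerequisite to True/False.

    Hypothetical helper, not part of ProPhyle. It does not check the
    GCC version or whether the zlib headers are installed.
    """
    return {
        # Executables are looked up on PATH only.
        "gcc": shutil.which("gcc") is not None,
        "samtools": shutil.which("samtools") is not None,
        # The interpreter running this script must be Python 3.
        "python3": sys.version_info[0] == 3,
        # find_spec returns None when a top-level module is not importable.
        "ete3": importlib.util.find_spec("ete3") is not None,
    }


if __name__ == "__main__":
    for name, found in check_prerequisites().items():
        print("{:10s} {}".format(name, "found" if found else "MISSING"))
```

Anything reported as missing can be installed through the Conda environment described next.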
Recommended installation using Conda
Environment installation:
conda create -y --name prophyle \
-c etetoolkit -c bioconda \
python==3.4 ete3 bitarray \
parallel blast samtools=1.3.1
Environment activation:
source activate prophyle
Compile all programs
make -C src
Custom taxonomic trees
Use [bin/build_taxonomic_tree.py](bin/build_taxonomic_tree.py) to build custom taxonomic trees from your database’s FASTA indexes and taxonomy files (see [library/Taxonomy](library/Taxonomy) for more information). Taxonomic identifiers are first assigned to the sequences; the tree is then built using [ETE Toolkit](http://etetoolkit.org/) and saved in Newick format 1. The required node attributes are:
name: unique node name (format n[0-9]*)
taxid: unique taxonomic identifier
seqname: names of the sequences sharing the same taxid, separated by @
fastapath: paths of the sequences’ FASTA files, separated by @ (absolute, or relative to the main directory of the repository)
infasta_offset: positions where each sequence starts inside the corresponding fasta files, separated by @
base_len: length of each sequence, separated by @
Other optional attributes are sci_name, named_lineage, lineage, rank (more info [here](http://etetoolkit.org/docs/latest/tutorial/tutorial_ncbitaxonomy.html#automatic-tree-annotation-using-ncbi-taxonomy)).
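To make the `@`-separated attributes concrete, the sketch below splits the required attributes of one node into per-sequence records. It uses only the Python standard library rather than ETE Toolkit, and it is not taken from `bin/build_taxonomic_tree.py`. It rests on two assumptions that the description above suggests but does not state outright: the four lists (`seqname`, `fastapath`, `infasta_offset`, `base_len`) are parallel, and offsets and lengths are integers. The helper name `split_node_attributes`, the `SequenceRecord` type, and all example values are hypothetical.

```python
import re
from collections import namedtuple

# One record per sequence attached to a tree node (hypothetical type).
SequenceRecord = namedtuple(
    "SequenceRecord", ["seqname", "fastapath", "infasta_offset", "base_len"]
)

# Node names follow the documented format n[0-9]*.
NODE_NAME_RE = re.compile(r"n[0-9]*")


def split_node_attributes(attrs):
    """Split the '@'-separated node attributes into per-sequence records.

    Assumes the four lists are parallel (same length, same order) and
    that offsets and lengths are integers.
    """
    if not NODE_NAME_RE.fullmatch(attrs["name"]):
        raise ValueError("invalid node name: {!r}".format(attrs["name"]))

    names = attrs["seqname"].split("@")
    paths = attrs["fastapath"].split("@")
    offsets = [int(x) for x in attrs["infasta_offset"].split("@")]
    lengths = [int(x) for x in attrs["base_len"].split("@")]

    if not (len(names) == len(paths) == len(offsets) == len(lengths)):
        raise ValueError("attribute lists differ in length")

    return [SequenceRecord(*fields) for fields in zip(names, paths, offsets, lengths)]


# Made-up example node carrying two sequences that share one taxid.
example_node = {
    "name": "n12",
    "taxid": "562",
    "seqname": "seqA@seqB",
    "fastapath": "library/example/a.fa@library/example/b.fa",
    "infasta_offset": "0@5120",
    "base_len": "5000@1200",
}

for record in split_node_attributes(example_node):
    print(record)
```

Keeping the four lists in the same order is what lets a consumer pair each sequence name with its file, offset, and length by position.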