ProPhyle metagenomic classifier
Project description
Getting started
Prerequisities
GCC 4.8+
ZLib
Python 3 with ete3 library
SamTools
Recommended way of installation using Conda
Environment installation:
conda create -y --name prophyle \
-c etetoolkit -c bioconda \
python==3.4 ete3 bitarray \
parallel blast samtools=1.3.1
Environment activation:
source activate prophyle
Compile all programs
make -C src
Custom taxonomic trees
Use bin/build_taxonomic_tree.py to build custom taxonomic trees starting from your database’s fasta indexes and taxonomy files (library/Taxonomy for more information). Taxonomic identifiers are assigned to the sequences first, and then the tree is built using ETE Toolkit and saved with newick format 1. Necessary node attributes are:
name: unique node name
taxid: unique taxonomic identifier
seqname: names of the sequences sharing the same taxid, separated by @
fastapath: paths of the sequences’ fasta files, separated by @ (relative paths from their library directory)
infasta_offset: positions where each sequence starts inside the corresponding fasta files, separated by @
base_len: length of each sequence, separated by @
seqname, infasta_offset and base_len can be found in samtools’ fasta index. Other optional attributes are sci_name, named_lineage, lineage, rank (more info here).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for prophyle-0.1.0.17-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0cd5f8233ee24f27f28650312055d07015fd0fc7916746ad5130c21fcb3095e7 |
|
MD5 | 70a4f288ea3843e3d41d77da24d4ff52 |
|
BLAKE2b-256 | 8d867d67d307317c416b792cd1040aae3cac9451f973f2715a2b4c71c29fec6a |