tapir

Tally Approximations of Phylogenetic Informativeness Rapidly (TAPIR)

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Environment
- Console
Intended Audience
- Science/Research
License
- OSI Approved :: BSD License
Operating System
- OS Independent
Programming Language
- Python
Topic
- Scientific/Engineering :: Bio-Informatics

Project description

Purpose

tapir contains programs to estimate and plot phylogenetic informativeness for large datasets.

Citing tapir

When using tapir, please cite:

Faircloth BC, Chang J, Alfaro ME: tapir enables high throughput analysis of phylogenetic informativeness. https://github.com/faircloth-lab/tapir
Townsend JP: Profiling phylogenetic informativeness. Systematic Biol. 2007, 56:222-231.
Pond SLK, Frost SDW, Muse SV: HyPhy: hypothesis testing using phylogenies. Bioinformatics 2005, 21:676-679.

Dependencies

hyphy2 (please download or build a single-threaded hyphy2)
Python 2.6
numpy
scipy
dendropy

Installation

For ALL platforms, you must download a hyphy binary for your platform (osx or linux) and place that within your $PATH:

wget https://github.com/downloads/faircloth-lab/tapir/hyphy2.osx.gz
gunzip hyphy2.*.gz
chmod 0700 hyphy2.*
mv hyphy2.* ~/Bin/hyphy2

To install the other dependencies (numpy, scipy), you may need to install a Fortran compiler on linux/osx:

Linux

On linux (ubuntu/debian), use:

apt-get install gfortran libatlas-base-dev liblapack-dev

Install tapir and dependencies, which include numpy and scipy (the reason we installed the dependencies above):

pip install tapir

To plot results, you will also need to:

apt-get install r-base r-base-dev
pip install rpy2

OSX

It is easiest just to install the scipy superpack. This will install the dependencies that tapir needs. After installing the superpack, using pip, install tapir:

pip install tapir

Alternatively, you can simply try to install tapir using:

pip install tapir

To plot results, you need to install R and then install rpy2:

pip install rpy2

Other OSs

Install numpy, scipy, and dendropy for your platform. Then:

wget http://pypi.python.org/packages/source/t/tapir/tapir-1.0.tar.gz
tar -xzvf tapir-1.0.tar.gz
cd tapir*
python setup.py build
python setup.py test
python setup.py install

Plotting

Plotting is optional. To install the plotting dependencies, see Installation, above.

Testing

If you didn’t run the tests using python setup.py test above, you can also:

import tapir
tapir.test()

Use

The estimate_p_i.py code calls a batch file for hyphy that is in templates/. This file needs to be in the same position relative to wherever you put estimate_p_i.py. If you install thins as above, you’ll be fine, for the moment.

To run:

cd /path/to/tapir/

python tapir_compute.py Input_Folder_of_Nexus_Files/ Input.tree \
    --output Output_Directory \
    --epochs=32-42,88-98,95-105,164-174 \
    --times=37,93,100,170 \
    --multiprocessing

–multiprocessing is optional, without it, each locus will be run consecutively.

If you have already run the above and saved results to your output folder (see below), you can use the pre-existing site-rate records rather than estimating those again with:

python tapir_compute.py Input_Folder_of_Site_Rate_JSON_Files/ Input.tree \
   --output Output_Directory \
   --epochs=32-42,88-98,95-105,164-174 \
   --times=37,93,100,170 \
   --multiprocessing \
   --site-rates

Results

tapir writes results to a sqlite database in the output directory of your choosing. This directory also holds site rate files in JSON format for each locus passed through tapir_compute.py.

You can access the results in the database as follows. For more examples, including plotting, see the documentation

crank up sqlite:

sqlite3  Output_Directory/phylogenetic-informativeness.sqlite

get integral data for all epochs:

select locus, interval, pi from loci, interval where loci.id = interval.id

get integral data for a specific epoch:

select locus, interval, pi from loci, interval
where interval = '95-105' and loci.id = interval.id;

get the count of loci having max(PI) at different epochs:

create temporary table max as select id, max(pi) as max from interval group by id;

create temporary table t as select interval.id, interval, max from interval, max
where interval.pi = max.max;

select interval, count(*) from t group by interval;

Plotting Results

tapir contains plotting scripts to help you plot data within a results database and compare data between different databases. tapir uses RPY and R to do this. You can also plot data directly in R. Until we finish the documentation, please see the wiki for examples.

Acknowledgements

BCF thanks SP Hubbell, PA Gowaty, RT Brumfield, TC Glenn, NG Crawford, JE McCormack, and M Reasel. JHLC and MEA thank J Eastman and J Brown for thoughtful comments about PI. We thank Francesc Lopez-Giraldez and Jeffrey Townsend for providing us with a copy of their web-application source code and helpful discussion.

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Environment
- Console
Intended Audience
- Science/Research
License
- OSI Approved :: BSD License
Operating System
- OS Independent
Programming Language
- Python
Topic
- Scientific/Engineering :: Bio-Informatics

Release history Release notifications | RSS feed

This version

1.0

Nov 7, 2011

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tapir-1.0.tar.gz (121.7 kB view details)

Uploaded Nov 7, 2011 Source

File details

Details for the file tapir-1.0.tar.gz.

File metadata

Download URL: tapir-1.0.tar.gz
Upload date: Nov 7, 2011
Size: 121.7 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for tapir-1.0.tar.gz
Algorithm	Hash digest
SHA256	`e89853f117def8ed2e31efcc6207116a1922cc12462d79ccbc525cb15c072151`
MD5	`1f35210873e6d487318d103189dba0b4`
BLAKE2b-256	`81f6f4d869c79a52122c97a195671a59bc1a6b86295d66def4313fa03e1812bd`

See more details on using hashes here.

tapir 1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Purpose

Citing tapir

Dependencies

Installation

Linux

OSX

Other OSs

Plotting

Testing

Use

Results

Plotting Results

Acknowledgements

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes