No project description provided
Project description
Ancestral Cost
Ancestral Cost is a tool for validating multiple sequence alignments prior to performing ancestral sequence reconstruction.
It checks for each position in a given ancestor that the presence of ancestral content implied to be there by a given alignment and tree is not substantially less parsimonious then the alternative of not having ancestral content there.
Installation
Using Pip
$ pip install ancestral_cost
Manual
$ git clone https://github.com/gabefoley/ancestral_cost
$ cd ancestral_cost
$ python setup.py install
Usage
$ ancestral_cost -a <alignment> -t <tree>
Workflow
Before performing ancestral sequence reconstruction (ASR) we can recognise that a multiple sequence alignment implies that every aligned column should have a common ancestor.
Ancestral Cost checks that for every ancestral position that is implied by a given alignment and tree the parsimony cost of having ancestral content there isn't far greater than not having ancestral content.
Ancestral Cost is intended to be run before ASR in order to validate alignments and trees. It highlights positions that may be erroneously aligned.
If an alignment suggests two positions should be aligned but they are only present in distant clades then they shouldn't be one column but split into two columns. Failing to do this will influence ancestors that are predicted at these positions.
First Ancestral Cost calculates all of the positions required to be there. In the example this is done by simply looking at the highest ancestral position implied by each column. From the example, N3 is the only ancestral node that has content at each of the four alignment positions, all of the other nodes have content at three alignment positions.
It then calculates the parsimony cost for each implied position and reports on the cost of content being present and cost of content being absent.
This allows users to filter on particularly informative sites or particularly large discrepencies in parsimony scores.
The intention is to look at the positions identified by Ancestral Cost and potentially amend the multiple sequence alignment as a result.
All commands
-a Path to alignment
-t Path to phylogenetic tree
-n Node to return cost for (default is root)
-p Just return the positions required to be there
-f Return all ancestors as a FASTA file
-to Write out the ancestor tree
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file ancestralcost-1.0.0.tar.gz
.
File metadata
- Download URL: ancestralcost-1.0.0.tar.gz
- Upload date:
- Size: 30.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.9.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bb2d60625b7aacd035ccc1efabca7d176eab87a118d0d99bdd9230d45d46c9dd |
|
MD5 | e180600ea928836b6f9b9b822d4eb4a8 |
|
BLAKE2b-256 | 768acfdda0c17c4d6e27d56e7edcb4e5e777373dc99965b33a146cf65813e098 |
File details
Details for the file ancestralcost-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: ancestralcost-1.0.0-py3-none-any.whl
- Upload date:
- Size: 44.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.9.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 18f30e11f9e2143b971b67190bbb67f4e16fed6dd5e393616e22be7a2404a745 |
|
MD5 | d54ff3f9160a1bb8405f55a756408c55 |
|
BLAKE2b-256 | 5583ec693db71b04d1096dfcf393c915a95c1d947183c24cfec8a9b6fd347047 |