pastml

Python wrapper for PASTML.

These details have not been verified by PyPI

Project links

Project description

# PASTML

__PASTML__ infers ancestral states on a phylogenetical tree with annotated tips, using maximum likelihood.
The tree with reconstructed ancestral states can then be visualised as a zoomable html map with __cytopast__.

# Run PASTML

There are 3 alternative ways to run PASTML: with [docker](https://hub.docker.com/), in python3, or in C (without visualisation).

## Run PASTML with docker
As an input, one needs to provide a phylogenetical tree in [newick](https://en.wikipedia.org/wiki/Newick_format) format,
and a table containing tip states,
in tab-delimited (by default) or csv format (to be specified with *--data_sep ,* option).

### Basic usage
```bash
docker run -v <path_to_the_folder_containing_the_tree_and_the_annotations>:/data:rw -t evolbioinfo/pastml --tree /data/<tree_file> --data /data/<annotation_file> --columns <one_or_more_column_names> --html_compressed /data/<map_name>
```

### Example
Let's assume that the tree and annotation files are in the Downloads folder,
and are named respectively tree.nwk and states.csv.

The states.csv is a comma-separated file, containing tip ids in the first column,
and several named columns, including *Location*, i.e.:

Tip_id | ... | Location | ...
----- | ----- | ----- | -----
1 | ... | Africa | ...
2 | ... | Asia | ...
3 | ... | Africa | ...
... | ... | ... | ...

To reconstruct and visualise the ancestral Location states,
one needs to run the following command:

```bash
docker run -v ~/Downloads:/data:rw -t evolbioinfo/pastml --tree /data/tree.nwk --data /data/states.csv --data_sep , --columns Location --html_compressed /data/location_map.html
```

This will produce a file location_map.html in the Downloads folder,
that can be viewed with a browser.

### Help

To see advanced options, run
```bash
docker run -t evolbioinfo/pastml -h
```

### Options

```
optional arguments:
-h, --help show the help message and exit
-v, --verbose print information on the progress of the analysis

annotation-related arguments:
-d DATA, --data DATA the annotation file in tab/csv format with the first
row containing the column names.
-s DATA_SEP, --data_sep DATA_SEP
the column separator for the data table. By default is
set to tab, i.e. for tab file. Set it to ',' if your
file is csv.
-i ID_INDEX, --id_index ID_INDEX
the index of the column in the data table that
contains the tree tip names, indices start from zero
(by default is set to 0).
-c [COLUMNS [COLUMNS ...]], --columns [COLUMNS [COLUMNS ...]]
names of the data table columns that contain states to
be analysed with PASTML. If neither columns nor
copy_columns are specified, then all columns will be
considered for PASTMl analysis.
--copy_columns [COPY_COLUMNS [COPY_COLUMNS ...]]
names of the data table columns that contain states to
be copied as-is, without applying PASTML (the missing
states will stay unresolved).

tree-related arguments:
-t TREE, --tree TREE the input tree in newick format.

ancestral-state inference-related arguments:
-m {JC,F81}, --model {JC,F81}
the evolutionary model to be used by PASTML, by
default JC.
--prediction_method {marginal_approx,marginal,max_posteriori,joint,downpass,acctran,deltran}
the ancestral state prediction method to be used by
PASTML, by default marginal_approx.
--work_dir WORK_DIR the working dir for PASTML to put intermediate files
into (if not specified a temporary dir will be
created).

visualisation-related arguments:
-n NAME_COLUMN, --name_column NAME_COLUMN
name of the data table column to be used for node
names in the compressed map visualisation(must be one
of those specified in columns or copy_columns if they
are specified).If the data table contains only one
column it will be used by default.
--tip_size_threshold TIP_SIZE_THRESHOLD
Remove the tips of size less than the threshold-th
from the compressed map (set to inf to keep all tips).

output-related arguments:
-o OUT_DATA, --out_data OUT_DATA
the output annotation file with the states inferred by
PASTML.
-p HTML_COMPRESSED, --html_compressed HTML_COMPRESSED
the output summary map visualisation file (html).
-l HTML, --html HTML the output tree visualisation file (html).
```

## Run PASTML in python3

### Installation

First install [GNU GSL](https://www.gnu.org/software/gsl/).
Then run:

```bash
pip3 install cytopast
```

### Basic usage in a command line
```bash
cytopast --tree <path/to/tree_file.nwk> --data <path/to/annotation_file.tab> --columns <one_or_more_column_names> --html_compressed <path/to/output/map.html>
```

To see advanced options, run
```bash
cytopast -h
```

### Basic usage in python3
```python
from cytopast.pastml_analyser import pastml_pipeline

# Path to the table containing tip/node annotations, in csv or tab format
data = "/path/to/the/table/eg/data.csv"

# Path to the tree in newick format
tree = "/path/to/the/tree/eg/tree.nwk"

# Columns present in the annotation table,
# for which we want to reconstruct ancestral states
columns = ['Location', 'Resistant_or_not']

# Columns present in the annotation table,
# for which we want to copy existing annotations from the annotation table,
# without inferring ancestral states
copy_columns = ['Sex']

# Path to the output compressed map visualisation
html_compressed = "/path/to/the/future/map/eg/map.html"

# Path to the output tree visualisation
html = "/path/to/the/future/tree/visualisation/eg/tree.html"

pastml_pipeline(data=data, data_sep=',', columns=columns, name_column='Location',
tree=tree,
html_compressed=html_compressed, html=html,
verbose=True)
```

## Run PASTML in C (without visualisation)

### Installation

First install [GNU GSL](https://www.gnu.org/software/gsl/).
Then run:
```bash
cmake
make
```

### Basic usage in a command line
```bash
pastml -t <path/to/tree_file.nwk> -a <path/to/annotation_file.csv>
```

To see advanced options, run
```bash
cytopast -h
```

### Options
```
required arguments:
-a ANNOTATION_FILE path to the annotation file containing tip states (in csv format: tip_id,state.)
-t TREE_NWK path to the tree file (in newick format)

optional arguments:
-o OUTPUT_ANNOTATION_CSV path where the output annotation file containing node states will be created (in csv format)
-n OUTPUT_TREE_NWK path where the output tree file will be created (in newick format)
-r OUTPUT_PARAMETERS_CSV path where the output parameters file will be created (in csv format)
-m MODEL state evolution model for max likelihood prediction methods: "JC" (default) or "F81"
-p PREDICTION_METHOD ancestral state prediction method: "marginal_approx" (default), "marginal", "max_posteriori", "joint", "downpass", "acctran", or "deltran"
("marginal_approx", "marginal", "max_posteriori", and "joint" are max likelihood methods, while "downpass", "acctran", and "deltran" are parsimonious ones)
-q quiet, do not print progress information
```

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.9.49

Sep 18, 2024

1.9.48

Sep 11, 2024

1.9.47

Sep 11, 2024

1.9.46

Jul 12, 2024

1.9.45

Jul 12, 2024

1.9.44

Jul 12, 2024

1.9.43

Feb 26, 2024

1.9.42

Jul 10, 2023

1.9.41

Jul 10, 2023

1.9.40

Aug 24, 2022

1.9.39

Aug 18, 2022

1.9.38

Aug 17, 2022

1.9.37

Aug 17, 2022

1.9.36

Aug 17, 2022

1.9.35

Aug 16, 2022

1.9.34

Jul 13, 2021

1.9.33

Mar 2, 2021

1.9.32

Jan 8, 2021

1.9.31

Dec 8, 2020

1.9.30

Aug 31, 2020

1.9.29.9

Aug 10, 2020

1.9.29.8

Aug 10, 2020

1.9.29.7

Aug 10, 2020

1.9.29.6

Jul 29, 2020

1.9.29.5

Jul 6, 2020

1.9.29.4

May 29, 2020

1.9.29.3

May 26, 2020

1.9.29.2

Apr 30, 2020

1.9.29.1

Apr 27, 2020

1.9.29

Apr 27, 2020

1.9.28.1

Apr 21, 2020

1.9.28

Apr 21, 2020

1.9.27

Apr 20, 2020

1.9.26

Apr 16, 2020

1.9.25

Apr 15, 2020

1.9.24

Jan 27, 2020

1.9.23

Jan 17, 2020

1.9.22

Jan 17, 2020

1.9.20

Aug 26, 2019

1.9.19

Aug 12, 2019

1.9.18

Aug 8, 2019

1.9.17

Aug 8, 2019

1.9.16

Aug 8, 2019

1.9.15

Apr 15, 2019

1.9.14

Apr 10, 2019

1.9.13

Apr 10, 2019

1.9.12

Mar 20, 2019

1.9.11

Mar 20, 2019

1.9.10

Mar 12, 2019

1.9.9

Mar 12, 2019

1.9.8

Mar 4, 2019

1.9.7

Feb 27, 2019

1.9.6

Feb 27, 2019

1.9.5

Feb 26, 2019

1.9.4

Feb 25, 2019

1.9.3

Feb 13, 2019

1.9.2

Feb 13, 2019

1.9.1

Dec 7, 2018

1.8.1

Nov 27, 2018

1.8

Nov 27, 2018

1.7

Nov 22, 2018

1.6.1

Nov 21, 2018

1.6

Nov 20, 2018

1.5

Nov 15, 2018

1.1.1

Oct 17, 2018

1.0.9

Oct 17, 2018

1.0.8

Oct 16, 2018

1.0.7

Oct 9, 2018

1.0.6

Oct 9, 2018

1.0.4

Oct 9, 2018

1.0.3

Oct 3, 2018

1.0.2

Oct 2, 2018

1.0.1

Oct 2, 2018

1.0

Jul 26, 2018

0.9.2

Jul 24, 2018

0.9.1

Jul 24, 2018

0.9

Jul 21, 2018

0.8

Jun 22, 2018

0.7.4.5

May 31, 2018

0.7.4.4

May 31, 2018

0.7.4.3

May 31, 2018

0.7.4.2

May 31, 2018

0.7.4.1

May 31, 2018

This version

0.7.4

May 31, 2018

0.7.3

May 31, 2018

0.7.2

May 30, 2018

0.7.1

May 30, 2018

0.7

May 30, 2018

0.6.8

May 16, 2018

0.6.6

Apr 16, 2018

0.6.5

Apr 16, 2018

0.6.4

Apr 3, 2018

0.6.3

Mar 26, 2018

0.6.2

Mar 26, 2018

0.6.1

Mar 20, 2018

0.5.9.2

Mar 20, 2018

0.5.9.1

Mar 20, 2018

0.5.9

Mar 19, 2018

0.5.8

Mar 19, 2018

0.5.7

Mar 19, 2018

0.5.6

Mar 5, 2018

0.5.5

Mar 5, 2018

0.5.4

Mar 5, 2018

0.5.3

Mar 5, 2018

0.5.2

Mar 5, 2018

0.5.1

Mar 5, 2018

0.5

Feb 27, 2018

0.3.5

Feb 7, 2018

0.3.3

Feb 2, 2018

0.3.2

Feb 1, 2018

0.3.1

Jan 29, 2018

0.3

Jan 26, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pastml-0.7.4.tar.gz (27.3 kB view hashes)

Uploaded May 31, 2018 Source

Hashes for pastml-0.7.4.tar.gz

Hashes for pastml-0.7.4.tar.gz
Algorithm	Hash digest
SHA256	`95a496c9cb638b488c38db66541639f79633228b79e6f92d8604457acab26d48`
MD5	`66f01f829d75823e76199b1668a3dcf3`
BLAKE2b-256	`c58aa230a3ef543a1d7654a3699d7c13ae34077f92cd1c6f44da2b74e7581344`