A phylogenetic and geographic analysis tool
Project description
aPhyloGeo
🌳 Multi-platform application for analyze phylogenetic trees with climatic parameters
Table of Contents
📝 About the project
aPhyloGeo
is a powerful and versatile bioinformatics pipeline specifically designed for the analysis of phylogeography. Developed by the dedicated team led by Professor Nadia Tahiri at the University of Sherbrooke in Quebec, Canada, this open-source multi-platform application is implemented in Python. It serves as a valuable tool for researchers and scientists interested in unraveling the intricate relationships between genetic sequences and geographic locations.
💡 If you are using our algorithm in your research, please cite our recent papers:
1️⃣ Li, W. & Tahiri, N. (2023). aPhyloGeo-Covid: A Web Interface for Reproducible Phylogeographic Analysis of SARS-CoV-2 Variation using Neo4j and Snakemake. Proceeding in SciPy 2023, Auxtin, TX, USA
2️⃣ Koshkarov, A., Li, W., Luu, M. L., & Tahiri, N. (2022). Phylogeography: Analysis of genetic and climatic data of SARS-CoV-2. Proceeding in SciPy 2022, Auxtin, TX, USA
⚒️ Installation
Linux UNIX, Mac OS & Windows versions
aPhyloGeo
is available as a Python script.
Prerequisites
This package use Poetry
dependency management and packaging tool for Python. Poetry installation guide can be found here: Poetry Install
⚠️ For windows installation it's recommended to launch powershell in Administrator mode.
Once Poetry is installed, you can clone the repository and install the package using the following commands:
poetry install
Usage
Poetry will handle the virtual environment automatically. if you want to use the virtual environment manually, you can use the following command:
poetry shell
⚠️ Assuming Python 3.8 or higher is installed on the machine, these scripts should run well with the libraries installed.
You can also launch the package using the make
command from your terminal when you are in the root
. This command will use the Makefile
to run the script. If you use the command make clean
, it will erase the output.csv
file previously created with the first command.
🚀 Settings
The aPhyloGeo
software can be encapsulated in other applications and applied to other data by providing a YAML file. This file will include a set of parameters for easy handling.
- Bootstrap threshold: Number of replicates threshold to be generated for each sub-MSA (each position of the sliding window)
- Window length: Size of the sliding window
- Step: Sliding window advancement step
- Distance choice: distance selection
- '1' for Least Square (LS) distance (version 1.0)
- '2' for Robinson and Foulds (RF) distance
- '3' for Euclidean distance
- Distance threshold: LS distance threshold at which the results are most significant
- Alignment method: algorithm selection for sequence alignment
- '1' for pairwise2
- '2' for MUSCLE
- '3' for CLUSTALW
- '4' for MAFFT
- Fit method: gap selection elongation
- '1' for Wider Fit by elongating with Gap (starAlignment)
- '2' for Narrow-fit prevent elongation with gap when possible
✔️ References
1️⃣ Calculation of distance between phylogenetic tree: Least Square metric
- Cavalli-Sforza, L. L., & Edwards, A. W. (1967). Phylogenetic analysis. Models and estimation procedures. American journal of human genetics, 19(3 Pt 1), 233.
- Felsenstein, J. (1997). An alternating least squares approach to inferring phylogenies from pairwise distances. Systematic biology, 46(1), 101-111.
- Makarenkov, V., & Lapointe, F. J. (2004). A weighted least-squares approach for inferring phylogenies from incomplete distance matrices. Bioinformatics, 20(13), 2113-2121.
2️⃣ Calculation of distance between phylogenetic tree: Robinson-Foulds metric
3️⃣ Dataset full description: Analysis of genetic and climatic data of SARS-CoV-2
- Koshkarov, A., Li, W., Luu, M. L., & Tahiri, N. (2022). Phylogeography: Analysis of genetic and climatic data of SARS-CoV-2.
- Li, W., & Tahiri, N. (2023). aPhyloGeo-Covid: A Web Interface for Reproducible Phylogeographic Analysis of SARS-CoV-2 Variation using Neo4j and Snakemake.
📧 Contact
Please email us at: Nadia.Tahiri@USherbrooke.ca for any questions or feedback.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file aphylogeo-0.3.2.tar.gz
.
File metadata
- Download URL: aphylogeo-0.3.2.tar.gz
- Upload date:
- Size: 25.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.6.1 CPython/3.11.6 Linux/6.2.0-1015-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a132f98ae5f12460defa3d1b74b083ea95fd29af11d3d1efcb5179d90794265f |
|
MD5 | 1e3bd013e15ef61c69bbf92bf8071eb7 |
|
BLAKE2b-256 | d0fdd8286c965f52fb25dea65c5ce3ee9e91a0afa7c23eaf7baa531bbbfa5457 |
File details
Details for the file aphylogeo-0.3.2-py3-none-any.whl
.
File metadata
- Download URL: aphylogeo-0.3.2-py3-none-any.whl
- Upload date:
- Size: 24.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.6.1 CPython/3.11.6 Linux/6.2.0-1015-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3f1c16ce4fdf3590e39cb3403cedbcb1458a9a8f3fe6982d0b53c8cf8ae46162 |
|
MD5 | 06dd81337dcd89d688d4fc6b30366e07 |
|
BLAKE2b-256 | 8e8b8feb3eccb1ffcd0ce391e73b9fef9570f193af86027a270f2cdca08180ae |