Reconstruct haplotypes and produce genealogy graphs from population data
Project description
Hapsolutely
Reconstruct haplotypes and produce genealogy graphs from population data.
- Phase sequences: Reconstruct haplotypes from sequence data
- Haplotype visualization: Generate haplotype networks, genealogies and haplowebs
- Haplotype statistics: Detect fields for recombination and subset overlap
Hapsolutely is a comprehensive program that integrates ConvPhase, haplodemo, popart_networks and haplostats.
Input files can be in TSV, FASTA or SPART/XML format.
Executables
Download and run the standalone executables without installing Python.
Installation
Hapsolutely is available on PyPI. You can install it through pip
:
pip install itaxotools-hapsolutely
hapsolutely
Usage
Please refer to the Hapsolutely manual for information on how to use the program.
Citations
Hapsolutely was developed in the framework of the iTaxoTools project:
Vences M. et al. (2021): iTaxoTools 0.1: Kickstarting a specimen-based software toolkit for taxonomists. - Megataxa 6: 77-92.
Sequences are phased using PHASE and SeqPHASE:
Stephens, M., Smith, N., and Donnelly, P. (2001). A new statistical method for haplotype reconstruction from population data. American Journal of Human Genetics, 68, 978--989.
Stephens, M., and Donnelly, P. (2003). A comparison of Bayesian methods for haplotype reconstruction from population genotype data. American Journal of Human Genetics, 73:1162-1169.
Flot, J.F. (2010) seqphase: a web tool for interconverting phase input/output files and fasta sequence alignments. Mol. Ecol. Resour., 10, 162–166.
Networks are generated using either of Fitchi or popart_networks:
Matschiner M (2015) Fitchi: Haplotype genealogy graphs based on the Fitch algorithm. Bioinformatics, 32:1250-252.
Leigh, JW, Bryant D (2015). PopART: Full-feature software for haplotype network construction. Methods Ecol Evol 6(9):1110-1116.
Bandelt H, Forster P, Röhl A (1999). Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 16(1):37-48.
Clement M, Snell Q, Walke P, Posada D, Crandall, K (2002). TCS: estimating gene genealogies. Proc 16th Int Parallel Distrib Process Symp 2:184.
Doyle, J. J. (1995) The irrelevance of allele tree topologies for species delimitation, and a non-topological alternative. Syst. Bot., 20, 574-588.
BioPython is used to create ML/NJ trees if needed. networkx is used for laying out the initial graph.
Cock, P.J. et al., 2009. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics, 25(11), pp.1422-1423.
Hagberg, A., Swart, P. & S Chult, D., 2008. Exploring network structure, dynamics, and function using NetworkX.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for itaxotools-hapsolutely-0.2.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4e757258026c12da3408f62b5a5659e2401282420a08058a68a6e4d5facba33e |
|
MD5 | d18205eb9f2b73447c9eececb5ee25b8 |
|
BLAKE2b-256 | 8f684b069d89fd415a6d1004800263c5f2ed978017786afe94d81d832858bfff |
Hashes for itaxotools_hapsolutely-0.2.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 368dce8d842bc14edf7bc322b2ee607a35c15e9bf749aaa09b25253e7029aabe |
|
MD5 | 07544b2943738e32bd832254ac1e7dbf |
|
BLAKE2b-256 | eb066d44c845b9572f4a963a3d111f44b94d1ae5e9bd9c6cb8c5a49797f4da32 |