Reconstruct haplotypes and produce genealogy graphs from population data
Hapsolutely
- Phase sequences: Reconstruct haplotypes from sequence data
- Haplotype visualization: Generate haplotype networks, genealogies and haplowebs
- Haplotype statistics: Detect fields for recombination (FFRs) and subset overlap
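To make the idea of a field for recombination more concrete, the following is a minimal sketch, not the haplostats implementation. It assumes the common definition in which haplotypes belong to the same field when a chain of individuals links them through co-occurrence. The function name `fields_for_recombination` and the sample data are hypothetical.

```python
from collections import defaultdict


def fields_for_recombination(individuals):
    """Group haplotypes into fields for recombination (FFRs).

    `individuals` maps an individual name to the haplotypes it carries.
    Haplotypes are merged into one group whenever they co-occur in the
    same individual, directly or through a chain of individuals.
    """
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    for haplotypes in individuals.values():
        haplotypes = list(haplotypes)
        for haplotype in haplotypes:
            find(haplotype)  # register every haplotype, even homozygous ones
        for other in haplotypes[1:]:
            union(haplotypes[0], other)

    groups = defaultdict(set)
    for haplotype in list(parent):
        groups[find(haplotype)].add(haplotype)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical phased data: each individual carries two haplotypes.
individuals = {
    "ind1": ("hapA", "hapB"),
    "ind2": ("hapB", "hapC"),
    "ind3": ("hapD", "hapD"),
    "ind4": ("hapE", "hapF"),
}
ffrs = fields_for_recombination(individuals)
print(ffrs)
```

In this toy data set, `ind1` and `ind2` share `hapB`, so their haplotypes end up in one group, while the homozygous `ind3` forms a group of its own.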
Hapsolutely is a comprehensive program that integrates ConvPhase, haplodemo, popart_networks and haplostats.
Input files can be in TSV, FASTA or SPART/XML format.
Executables
Download and run the standalone executables without installing Python.
Installation
Hapsolutely is available on PyPI. You can install it with pip and then launch it from the command line:

    pip install itaxotools-hapsolutely
    hapsolutely
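If you want to confirm from Python that the package was installed into the active environment, a small check along these lines may help. This is a sketch using only the standard library (Python 3.8 or later); the helper name `installed_version` is hypothetical and is not part of Hapsolutely.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(distribution="itaxotools-hapsolutely"):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


found = installed_version()
print(found or "itaxotools-hapsolutely is not installed in this environment")
```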
Usage
Please refer to the Hapsolutely manual for information on how to use the program.
Citations
Hapsolutely was developed in the framework of the iTaxoTools project:
Vences M. et al. (2021): iTaxoTools 0.1: Kickstarting a specimen-based software toolkit for taxonomists. - Megataxa 6: 77-92.
Sequences are phased using PHASE and SeqPHASE:
Stephens, M., Smith, N., and Donnelly, P. (2001). A new statistical method for haplotype reconstruction from population data. American Journal of Human Genetics, 68, 978-989.
Stephens, M., and Donnelly, P. (2003). A comparison of Bayesian methods for haplotype reconstruction from population genotype data. American Journal of Human Genetics, 73, 1162-1169.
Flot, J.F. (2010). SeqPHASE: a web tool for interconverting PHASE input/output files and FASTA sequence alignments. Mol. Ecol. Resour., 10, 162-166.
Networks are generated using either Fitchi or popart_networks:
Matschiner M (2015) Fitchi: Haplotype genealogy graphs based on the Fitch algorithm. Bioinformatics, 32:1250-1252.
Leigh, JW, Bryant D (2015). PopART: Full-feature software for haplotype network construction. Methods Ecol Evol 6(9):1110-1116.
Bandelt H, Forster P, Röhl A (1999). Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 16(1):37-48.
Clement M, Snell Q, Walker P, Posada D, Crandall K (2002). TCS: estimating gene genealogies. Proc 16th Int Parallel Distrib Process Symp 2:184.
Biopython is used to build neighbor-joining (NJ) trees when needed, and NetworkX is used to lay out the initial graph.
Cock, P.J. et al., 2009. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics, 25(11), pp.1422-1423.
Hagberg, A., Swart, P. & Schult, D., 2008. Exploring network structure, dynamics, and function using NetworkX.
Download files
Source Distribution
Hashes for itaxotools-hapsolutely-0.2.1.tar.gz
Algorithm | Hash digest
---|---
SHA256 | dad2904eb44e2e6aca6bd349ba5ff853263f8555f1eb3aea026b6f8e4d276f72
MD5 | d31c34e5912169ff42449b85cbc22e03
BLAKE2b-256 | 0d5b7be05ac1ca2308ba58c377c1b8888ce2f505aa16b9793bf9ac1308273581
Built Distribution
Hashes for itaxotools_hapsolutely-0.2.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 4a2959bb50d8816347b150ec42065bcda2634cfdb96240af3cceec86575edd3d
MD5 | f4989c7d1e251fd15ea88db246f9302e
BLAKE2b-256 | 80fc9c44ae2e8cd0c5f46e15d9231e64e684b3cd3fdbeeb1eba8d505eee785e5
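The SHA256 digests listed above can be used to check that a downloaded file is intact. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `sha256_of_file` and `matches_published_digest` are hypothetical, and the file path you pass in is whatever location you saved the download to.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal SHA256 digest of the file at `path`."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Compare a file's SHA256 digest with a published hexadecimal digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, calling `matches_published_digest` with the path of the downloaded `.tar.gz` file and the SHA256 value from the corresponding table should return `True` for an unmodified download.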