Skip to main content

Reconstruct haplotypes and produce genealogy graphs from population data

Project description

Hapsolutely

PyPI - Version PyPI - Python Version GitHub - Tests GitHub - Windows GitHub - macOS

Reconstruct haplotypes and produce genealogy graphs from population data.

  • Phase sequences: Reconstruct haplotypes from sequence data
  • Haplotype visualization: Generate haplotype networks, genealogies and haplowebs
  • Haplotype statistics: Detect fields for recombination and subset overlap

Screenshot

Hapsolutely is a comprehensive program that integrates ConvPhase, haplodemo, popart_networks and haplostats.

Input files can be in TSV, FASTA or SPART/XML format.

Executables

Download and run the standalone executables without installing Python.

Release Windows MacOS

Installation

Hapsolutely is available on PyPI. You can install it through pip:

pip install itaxotools-hapsolutely
hapsolutely

Usage

Please refer to the Hapsolutely manual for information on how to use the program.

Citations

Hapsolutely was developed in the framework of the iTaxoTools project:

Vences M. et al. (2021): iTaxoTools 0.1: Kickstarting a specimen-based software toolkit for taxonomists. - Megataxa 6: 77-92.

Sequences are phased using PHASE and SeqPHASE:

Stephens, M., Smith, N., and Donnelly, P. (2001). A new statistical method for haplotype reconstruction from population data. American Journal of Human Genetics, 68, 978--989.

Stephens, M., and Donnelly, P. (2003). A comparison of Bayesian methods for haplotype reconstruction from population genotype data. American Journal of Human Genetics, 73:1162-1169.

Flot, J.F. (2010) seqphase: a web tool for interconverting phase input/output files and fasta sequence alignments. Mol. Ecol. Resour., 10, 162–166.

Networks are generated using either of Fitchi or popart_networks:

Matschiner M (2015) Fitchi: Haplotype genealogy graphs based on the Fitch algorithm. Bioinformatics, 32:1250-252.

Leigh, JW, Bryant D (2015). PopART: Full-feature software for haplotype network construction. Methods Ecol Evol 6(9):1110-1116.

Bandelt H, Forster P, Röhl A (1999). Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 16(1):37-48.

Clement M, Snell Q, Walke P, Posada D, Crandall, K (2002). TCS: estimating gene genealogies. Proc 16th Int Parallel Distrib Process Symp 2:184.

Doyle, J. J. (1995) The irrelevance of allele tree topologies for species delimitation, and a non-topological alternative. Syst. Bot., 20, 574-588.

BioPython is used to create ML/NJ trees if needed. networkx is used for laying out the initial graph.

Cock, P.J. et al., 2009. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics, 25(11), pp.1422-1423.

Hagberg, A., Swart, P. & S Chult, D., 2008. Exploring network structure, dynamics, and function using NetworkX.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

itaxotools-hapsolutely-0.2.2.tar.gz (505.0 kB view hashes)

Uploaded Source

Built Distribution

itaxotools_hapsolutely-0.2.2-py3-none-any.whl (106.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page