
Whole Genome Duplication Identification




Author Pengchuan Sun (sunpengchuan)
License BSD


WGDI (Whole-Genome Duplication Integrated analysis) is a Python-based command-line tool that facilitates comprehensive analysis of recursive polyploidizations and cross-species genome alignments.

WGDI supports three main workflows (polyploid inference, hierarchical inference of genomic homology, and ancestral chromosomal karyotyping) that can improve detection of WGD and characterization of related events. It incorporates a more sensitive and accurate collinearity detection algorithm than previous software and can accelerate WGD-related karyotype research.

WGDI outperforms similar tools in terms of efficiency, flexibility and scalability.


WGDI is a Python package and command-line interface (IDLE) for the analysis of whole-genome duplications. It can be deployed on Windows, Linux, and macOS and can be installed via pip or conda.


conda install -c bioconda wgdi


pip3 install wgdi
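
After either install route, one way to confirm that the package is visible to the current Python environment is to query its installed metadata. This is a minimal sketch using only the standard library; the helper name installed_version is hypothetical and is not part of WGDI.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str = "wgdi") -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed.

    Hypothetical helper for illustration; not part of WGDI itself.
    """
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version()
    if version is None:
        print("wgdi is not installed in this environment")
    else:
        print(f"wgdi {version} is installed")
```

If the function returns None, the install most likely went into a different environment than the one running the check.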

Documentation for installation, along with a user tutorial, a default parameter file, and test data, is provided. Please consult the docs at


Here are some videos with simple examples of WGDI.


QQ chat group: 966612552

Citing WGDI

If you use WGDI in your work, please cite:

Sun P, Jiao B, Yang Y, et al. WGDI: A user-friendly toolkit for evolutionary analyses of whole-genome duplications and ancestral karyotypes[J]. bioRxiv, 2021. doi:



  • Fixed issue with alignment (-a). Only version 0.6.0 has this bug.


  • Fixed issue with improved collinearity (-icl).
  • Added a parameter 'tandem_ratio' to blockinfo (-bi).


  • Updated the improved collinearity (-icl). It is faster than before, but still slower than MCScanX and JCVI.
  • Fixed issue with ancestral karyotype repertoire (-akr).


  • Fixed issue with gene names (-ks).


  • Fixed issue with chromosome order (-ak).
  • Fixed issue with gene names (-ks). The issue is not fully fixed in this version; please install the latest version.

0.5.5 and 0.5.6

  • Added ancestral karyotype (-ak).
  • Added ancestral karyotype repertoire (-akr).


  • Improved the karyotype_mapping (-km) effect.
  • Minor change to alignmenttrees (-at).


  • Fixed legend issue in ksfigure (-kf).
  • Fixed Ks calculation issue (-ks).
  • Improved the karyotype_mapping (-km) effect.
  • Improved the alignmenttrees (-at) effect.


  • Fixed some bugs.


  • Fixed the error of the command (-conf).
  • Improved the karyotype_mapping (-km) effect.
  • Added a new type of input data set for alignmenttrees (-at): low-copy data sets (for example, single-copy_groups.tsv from the sonicparanoid2 software).


  • The latest version adds karyotype_mapping (-km) and karyotype (-k) display.
  • The latest version changes how pvalue is calculated when extracting blocks with improved collinearity (-icl), making this parameter more sensitive. It is therefore recommended to set pvalue to 0.2 instead of 0.05.
  • The latest version has also changed the drawing display of ksfigure (-kf) to make it more beautiful.
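
The pvalue recommendation above is applied through the configuration file passed to -icl. The following is a minimal sketch that updates that value with Python's standard configparser. The section name [collinearity], the example file contents, and the helper name set_pvalue are assumptions for illustration; check them against the parameter template generated by your WGDI version.

```python
import configparser
import os
import tempfile


def set_pvalue(conf_path: str, pvalue: float = 0.2,
               section: str = "collinearity") -> None:
    """Set `pvalue` in one section of an INI-style configuration file.

    Hypothetical helper; the default section name is an assumption.
    Note: configparser drops comments and lowercases keys when rewriting.
    """
    parser = configparser.ConfigParser(interpolation=None)
    parser.read(conf_path)
    if not parser.has_section(section):
        parser.add_section(section)
    parser.set(section, "pvalue", str(pvalue))
    with open(conf_path, "w") as handle:
        parser.write(handle)


if __name__ == "__main__":
    # Demonstrate on a throwaway file with assumed contents.
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "example.conf")
        with open(path, "w") as handle:
            handle.write("[collinearity]\npvalue = 0.05\n")
        set_pvalue(path)
        with open(path) as handle:
            print(handle.read())
```

Editing the file by hand works equally well; the sketch is mainly useful when the same change must be applied to many configuration files.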

Articles citing WGDI

  1. Genomic analysis of Medicago ruthenica provides insights into its tolerance to abiotic stress and demographic history 《Molecular Ecology Resources》
  2. Chromosomal‐scale genome assembly of Eleutherococcus senticosus provides insights into chromosome evolution in Araliaceae 《Molecular Ecology Resources》
  3. The Corylus mandshurica genome provides insights into the evolution of Betulaceae genomes and hazelnut breeding 《Horticulture Research》
  4. An ancient whole-genome duplication event and its contribution to flavor compounds in the tea plant (Camellia sinensis) 《Horticulture Research》
  5. A tetraploidization event shaped the Aquilaria sinensis genome and contributed to the ability of sesquiterpenes synthesis 《BMC Genomics》
  6. High-quality genome assembly of Cinnamomum burmannii (chvar. Borneol) provides insights into the natural borneol biosynthesis 《BioRxiv》
  7. The genome sequence provides insights into salt tolerance of Achnatherum splendens (Gramineae), a constructive species of alkaline grassland 《Plant Biotechnology Journal》
  8. Chromosome-level assembly of the common vetch (Vicia sativa) reference genome 《Gigabyte》
  9. A chromosome-level genome assembly of an alpine plant Crucihimalaya lasiocarpa provides insights into high-altitude adaptation 《DNA Research》
  10. Chromosome-scale genome assembly of the diploid oat Avena longiglumis reveals the landscape of repetitive sequences, genes and chromosome evolution in grasses 《BioRxiv》
  11. The chromosome-level rambutan genome reveals a significant role of segmental duplication in the expansion of resistance genes 《Horticulture Research》
  12. A chromosome-level genome assembly for the tertiary relict plant Tetracentron sinense Oliv. (Trochodendraceae) 《Molecular Ecology Resources》
  13. Multi-omics reveal differentiation and maintenance of dimorphic flowers in an alpine plant on the Qinghai-Tibet Plateau 《Authorea》
  14. The Chromosome-Level Genome of Miracle Fruit (Synsepalum dulcificum) Provides New Insights Into the Evolution and Function of Miraculin 《Frontiers in Plant Science》
  15. A chromosome-level reference genome of Ensete glaucum gives insight into diversity, chromosomal and repetitive sequence evolution in the Musaceae 《BioRxiv》
  16. High-quality genome assembly of Cinnamomum burmannii (chvar. Borneol) provides insights into the natural borneol biosynthesis 《BioRxiv》
  17. The Chloranthus sessilifolius genome provides insight into early diversification of angiosperms 《Nature Communications》
  18. Chromosome‐level pepino genome provides insights into genome evolution and anthocyanin biosynthesis in Solanaceae 《The Plant Journal》
  19. The genome of Hibiscus hamabo reveals its adaptation to saline and waterlogged habitat 《Horticulture Research》
  20. Chromosome-Level Genome Assembly of the Rare and Endangered Tropical Plant Speranskia yunnanensis (Euphorbiaceae) 《Frontiers in Genetics》
  21. The chromosome-level genome assembly of Gentiana dahurica (Gentianaceae) provides insights into gentiopicroside biosynthesis 《DNA Research》
  22. Genomic insights into present local adaptation and future climate change vulnerability of a keystone forest tree species in East Asian 《BioRxiv》
  23. PolyReco: A Method to Automatically Label Collinear Regions and Recognize Polyploidy Events Based on the Ks Dotplot 《Frontiers in Genetics》
  24. Reshuffling of the ancestral core-eudicot genome shaped chromatin topology and epigenetic modification in Panax 《Nature Communications》
  25. Deletion and tandem duplications of biosynthetic genes drive the diversity of triterpenoids in Aralia elata 《Nature Communications》

