A hierarchical domain caller for Hi-C data based on a modified version of Directionality Index
3C-based techniques(5C, Hi-C) have revealed the existence of topologically associating domains(TADs), a pervasive sub-megabase scale structure of chromosome. TADs are contiguous regions in which loci interact much more frequently with each other than with loci out of the region. Visually, TADs appear as square blocks along the diagonal on a heatmap.
There are various methods for TAD identification , . Most methods apply a two-step scheme: First, transform TAD or boundary signal into 1d profile using some statistic(e.g. Directionality Index, DI); Then, use the 1d profile to identify potential boundaries and produce a set of discrete non-overlapping TADs. However, the organization of chromosome structure is always intricate and hierarchical. Phillips-Cremins JE et al.  utilized a modified DI of multiple scales subdivided TADs into smaller subtopologies (sub-TADs) using 5C data. Here, I extend their algorithm to the whole genome and develop this software.
Please check the file “INSTALL.rst” in the distribution.
Open a terminal, type calTADs -h for help information.
calTADs contains a process management system, so you can submit the same command repeatedly to utilize the parallel power as much as possible.
|||Dixon JR, Selvaraj S, Yue F et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature, 2012, 485: 376-380.|
|||Sexton T, Yaffe E, Kenigsberg E et al. Three-dimensional folding and functional organization principles of the Drosophila genome. Cell, 2012, 148: 458-472.|
|||Phillips-Cremins JE, Sauria ME, Sanyal A et al. Architectural protein subclasses shape 3D organization of genomes during lineage commitment. Cell, 2013, 153(6):1281-95.|
|||Lieberman-Aiden E, van Berkum NL, Williams L et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science, 2009, 326: 289-293.|
|||Rao SS, Huntley MH, Durand NC. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell, 2014, 159(7):1665-80.|