This is a pre-production deployment of Warehouse. Changes made here affect the production instance of PyPI (pypi.python.org).
Help us improve Python packaging - Donate today!

A hierarchical domain caller for Hi-C data based on a modified version of Directionality Index

Project Description

Introduction

3C-based techniques(5C, Hi-C) have revealed the existence of topologically associating domains(TADs), a pervasive sub-megabase scale structure of chromosome. TADs are contiguous regions in which loci interact much more frequently with each other than with loci out of the region. Visually, TADs appear as square blocks along the diagonal on a heatmap.

There are various methods for TAD identification [1], [2]. Most methods apply a two-step scheme: First, transform TAD or boundary signal into 1d profile using some statistic(e.g. Directionality Index, DI); Then, use the 1d profile to identify potential boundaries and produce a set of discrete non-overlapping TADs. However, the organization of chromosome structure is always intricate and hierarchical. Phillips-Cremins JE et al. [3] utilized a modified DI of multiple scales subdividing TADs into smaller subtopologies (sub-TADs) using 5C data. Here, I extend their algorithm to the whole genome and develop this software.

calTADs are tested on traditional [4] and in-situ [5] Hi-C data, both generating reasonable results.

Installation

Please check the file “INSTALL.rst” in the distribution.

Usage

Open a terminal, type calTADs -h for help information.

calTADs contains a process management system, so you can submit the same command repeatedly to utilize the parallel power as much as possible.

Reference

[1]Dixon JR, Selvaraj S, Yue F et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature, 2012, 485: 376-380.
[2]Sexton T, Yaffe E, Kenigsberg E et al. Three-dimensional folding and functional organization principles of the Drosophila genome. Cell, 2012, 148: 458-472.
[3]Phillips-Cremins JE, Sauria ME, Sanyal A et al. Architectural protein subclasses shape 3D organization of genomes during lineage commitment. Cell, 2013, 153(6):1281-95.
[4]Lieberman-Aiden E, van Berkum NL, Williams L et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science, 2009, 326: 289-293.
[5]Rao SS, Huntley MH, Durand NC. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell, 2014, 159(7):1665-80.
Release History

Release History

This version
History Node

0.1.0-dev3

History Node

0.1.0-dev2

History Node

0.1.0-dev1

Download Files

Download Files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

File Name & Checksum SHA256 Checksum Help Version File Type Upload Date
calTADs-0.1.0-dev3.tar.gz (1.0 MB) Copy SHA256 Checksum SHA256 Source Feb 25, 2016

Supported By

WebFaction WebFaction Technical Writing Elastic Elastic Search Pingdom Pingdom Monitoring Dyn Dyn DNS Sentry Sentry Error Logging CloudAMQP CloudAMQP RabbitMQ Heroku Heroku PaaS Kabu Creative Kabu Creative UX & Design Fastly Fastly CDN DigiCert DigiCert EV Certificate Rackspace Rackspace Cloud Servers DreamHost DreamHost Log Hosting