Skip to main content

JOnTADS: a unified caller for TADs and stripes in Hi-C data

Project description

JOnTADS: a unified caller for TADs and stripes in Hi-C data

JOnTADS is a versatile tool for identifying Topologically Associating Domains (TADs) and stripes in various chromatin conformation capture data, including population Hi-C, single-109 cell Hi-C and micro-C. It allows for easy analysis of Hi-C data across multiple samples and outputs results in a structured format.

Dependencies

pip install numba==0.56.4  
pip install numpy==1.23.5  
pip install scipy==1.12.0  
pip install qpsolvers==2.7.3

Installation

pip install JOnTADS

Usage

Suppose you are at the folder of this README file.

Test Run

Single Sample Run

To identify TADs from a single sample, use the following command:

python JOnTADS.py -F ./data/chr18.csv -O ./results/chr18.csv.tad

Multiple Samples Run

To analyze multiple samples simultaneously, use:

python JOnTADS.py -F ./data/ES_rep1.chr18 ./data/ES_rep2.chr18 ./data/ME_rep1.chr18 ./data/ME_rep2.chr18 ./data/MS_rep1.chr18 ./data/MS_rep2.chr18 ./data/NP_rep1.chr18 ./data/NP_rep2.chr18 ./data/TP_rep1.chr18 ./data/TP_rep2.chr18 -O ./results/ES_rep1.chr18.tad ./results/ES_rep2.chr18.tad ./results/ME_rep1.chr18.tad ./results/ME_rep2.chr18.tad ./results/MS_rep1.chr18.tad ./results/MS_rep2.chr18.tad ./results/NP_rep1.chr18.tad ./results/NP_rep2.chr18.tad ./results/TP_rep1.chr18.tad ./results/TP_rep2.chr18.tad

Stripe Calling

To call stripes in addition to TADs:

python JOnTADS.py -F ./data/chr18.csv -O ./results/chr18.csv.tad --stripe_output ./results/chr18.csv.stripe -C 18 --stripe True

Input and Output Format

Input Format

The input files should be Hi-C contact matrices separated by spaces or commas.

In progress: we are working on supporting supporting additional input formats.

Output Format

The output files contain information about the identified TADs or stripes.

For TAD calling, the output contains four columns:

start, end, TAD score, TAD size

For stripe calling, the output contains six columns

chr, x1, x2, chr, y1, y2

where the stripe extends from (x1, y1) to (x2, y2).

Parameters

  • -F: Input file(s) with Hi-C data.
  • -O: Output file(s) for the detected TADs.
  • -MAXSZ: Maximum size of TADs allowed, default 200.
  • -MINSZ: Minimum size of TADs allowed, default 7.
  • --stripe: Set to True to enable stripe detection.
  • -C: (When `stripe' is set to True) Chromosome number for stripe calling, e.g. 18.
  • --stripe_output: (When `stripe' is set to True) Output file for stripe calling results.

Contact

Feel free to contribute to the project by opening issues or pull requests. Any feedback or suggestions are highly appreciated. Correspondence should be addressed to qunhua.li@psu.edu. You can also contact the maintainer qiuhai.stat@gmail.com.

Happy analyzing your Hi-C data with JOnTADS!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

JOnTADS-0.2.tar.gz (3.2 kB view details)

Uploaded Source

Built Distribution

JOnTADS-0.2-py3-none-any.whl (3.3 kB view details)

Uploaded Python 3

File details

Details for the file JOnTADS-0.2.tar.gz.

File metadata

  • Download URL: JOnTADS-0.2.tar.gz
  • Upload date:
  • Size: 3.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.5

File hashes

Hashes for JOnTADS-0.2.tar.gz
Algorithm Hash digest
SHA256 3e2c3ab1d71034a8d17a58161ea5a5847c453a7ec311e1b1bf57bcf02d42515f
MD5 46d20d12fadd0d852c702a37dbed1dda
BLAKE2b-256 a91027cfc4fd042d054edfe03ae26d497331a1af4267690fc626ba60d2a81fdf

See more details on using hashes here.

File details

Details for the file JOnTADS-0.2-py3-none-any.whl.

File metadata

  • Download URL: JOnTADS-0.2-py3-none-any.whl
  • Upload date:
  • Size: 3.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.5

File hashes

Hashes for JOnTADS-0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 b4d6eb994a8aa87a6368e7cafd5653aef8786d9d066b2dc7db19ba1c8a445402
MD5 c6fb73d872642791f2bce6cc2c9ece93
BLAKE2b-256 7529a4a000a7ce33e51d2bde482e58fa6f98cc70fc391b2b5e84ef58c443e13f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page