JOnTADS: a unified caller for TADs and stripes in Hi-C data
Project description
JOnTADS: a unified caller for TADs and stripes in Hi-C data
JOnTADS is a versatile tool for identifying Topologically Associating Domains (TADs) and stripes in various chromatin conformation capture data, including population Hi-C, single-cell Hi-C and micro-C. It allows for easy analysis of Hi-C data across multiple samples.
Dependencies
pip install numba==0.56.4
pip install numpy==1.23.5
pip install scipy==1.12.0
pip install qpsolvers==2.7.3
Installation
pip install JOnTADS
Test Run
Suppose you are at the folder of this README file.
Single Sample Run
To identify TADs from a single sample, use the following command:
JOnTADS -F ./data/chr18.csv -O ./results/chr18.csv.tad
Multiple Samples Run
To analyze multiple samples simultaneously, use:
JOnTADS -F ./data/ES_rep1.chr18 ./data/ES_rep2.chr18 ./data/ME_rep1.chr18 ./data/ME_rep2.chr18 ./data/MS_rep1.chr18 ./data/MS_rep2.chr18 ./data/NP_rep1.chr18 ./data/NP_rep2.chr18 ./data/TP_rep1.chr18 ./data/TP_rep2.chr18 -O ./results/ES_rep1.chr18.tad ./results/ES_rep2.chr18.tad ./results/ME_rep1.chr18.tad ./results/ME_rep2.chr18.tad ./results/MS_rep1.chr18.tad ./results/MS_rep2.chr18.tad ./results/NP_rep1.chr18.tad ./results/NP_rep2.chr18.tad ./results/TP_rep1.chr18.tad ./results/TP_rep2.chr18.tad
Stripe Calling
To call stripes in addition to TADs:
JOnTADS -F ./data/chr18.csv -O ./results/chr18.csv.tad --stripe True --stripe_output ./results/chr18.csv.stripe --chr 18
Usage
You can run JOnTADS in your terminal via
JOnTADS -F ./data/chr18.csv -O ./results/chr18.csv.tad --stripe True --stripe_output ./results/chr18.csv.stripe --chr 18
or
python JOnTADS.py -F ./data/chr18.csv -O ./results/chr18.csv.tad --stripe True --stripe_output ./results/chr18.csv.stripe --chr 18
or in your python file via
from JOnTADS import JOnTADS, get_stripe
tads = JOnTADS(file_names, outputs)
stripes = get_stripe(file_name, chr_num)
Input and Output Format
Input Format
The input files should be Hi-C contact matrices separated by spaces or commas.
In progress: we are working on supporting supporting additional input formats.
Output Format
The output files contain information about the identified TADs or stripes.
For TAD calling, the output contains four columns:
start, end, TAD score, TAD size
For stripe calling, the output contains six columns
chr, x1, x2, chr, y1, y2
where the stripe extends from (x1, y1)
to (x2, y2)
.
Parameters
-F
: Input file(s) with Hi-C data.-O
: Output file(s) for the detected TADs.-MAXSZ
: Maximum size of TADs allowed, default 200.-MINSZ
: Minimum size of TADs allowed, default 7.--stripe
: Set toTrue
to enable stripe detection.--chr
: (When `stripe' is set to True) Chromosome number for stripe calling, e.g. 18.--stripe_output
: (When `stripe' is set to True) Output file for stripe calling results.
Contact
Feel free to contribute to the project by opening issues or pull requests. Any feedback or suggestions are highly appreciated. Correspondence should be addressed to qunhua.li@psu.edu. You can also contact the maintainer qiuhai.stat@gmail.com.
Happy analyzing your Hi-C data with JOnTADS!
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file JOnTADS-0.9.tar.gz
.
File metadata
- Download URL: JOnTADS-0.9.tar.gz
- Upload date:
- Size: 12.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 54935fafd5c2a8699fe6ac8a2fb103089c00170fb118c21dd6a03f179c721b92 |
|
MD5 | 7c19c76daa031fc2ea8fc8407f980a09 |
|
BLAKE2b-256 | bd104e2397f60f80e364bf9771dc3c7edb9e9ce3cc43746488d240d55f0e7ac2 |
File details
Details for the file JOnTADS-0.9-py3-none-any.whl
.
File metadata
- Download URL: JOnTADS-0.9-py3-none-any.whl
- Upload date:
- Size: 14.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ae896274e751685d82314f43d993d07d9f5172378a8d3697a97b5e4132d7e207 |
|
MD5 | 2dbe2e575a76204bb0c017153299a82f |
|
BLAKE2b-256 | 9866298fe10e674b10efcc86fded16065a6bc0083cfbc321206d9ed5390dc8aa |