JOnTADS: a unified caller for TADs and stripes in Hi-C data
JOnTADS is a versatile tool for identifying Topologically Associating Domains (TADs) and stripes in several kinds of chromatin conformation capture data, including population Hi-C, single-cell Hi-C and micro-C. It can analyze Hi-C data from multiple samples in one run and writes its results in a structured, column-based format.
Dependencies
pip install numba==0.56.4
pip install numpy==1.23.5
pip install scipy==1.12.0
pip install qpsolvers==2.7.3
Installation
pip install JOnTADS
Usage
The commands below assume that your working directory is the folder containing this README file.
Test Run
Single Sample Run
To identify TADs from a single sample, use the following command:
python JOnTADS.py -F ./data/chr18.csv -O ./results/chr18.csv.tad
Multiple Samples Run
To analyze multiple samples simultaneously, use:
python JOnTADS.py -F ./data/ES_rep1.chr18 ./data/ES_rep2.chr18 ./data/ME_rep1.chr18 ./data/ME_rep2.chr18 ./data/MS_rep1.chr18 ./data/MS_rep2.chr18 ./data/NP_rep1.chr18 ./data/NP_rep2.chr18 ./data/TP_rep1.chr18 ./data/TP_rep2.chr18 -O ./results/ES_rep1.chr18.tad ./results/ES_rep2.chr18.tad ./results/ME_rep1.chr18.tad ./results/ME_rep2.chr18.tad ./results/MS_rep1.chr18.tad ./results/MS_rep2.chr18.tad ./results/NP_rep1.chr18.tad ./results/NP_rep2.chr18.tad ./results/TP_rep1.chr18.tad ./results/TP_rep2.chr18.tad
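The multi-sample command above is long enough that it is easy to mistype. As a minimal sketch, it can be assembled programmatically. The helper name `build_jontads_command` is hypothetical, and the sketch assumes, as the example suggests, that the i-th path after -O is the output for the i-th path after -F.

```python
def build_jontads_command(sample_names, data_dir="./data", results_dir="./results"):
    """Return the argument list for a multi-sample JOnTADS run.

    Assumption: outputs are listed in the same order as inputs,
    mirroring the example command in this README.
    """
    inputs = [f"{data_dir}/{name}" for name in sample_names]
    outputs = [f"{results_dir}/{name}.tad" for name in sample_names]
    return ["python", "JOnTADS.py", "-F", *inputs, "-O", *outputs]


# Sample names taken from the example command: five cell types, two replicates each.
cell_types = ["ES", "ME", "MS", "NP", "TP"]
sample_names = [f"{cell}_rep{rep}.chr18" for cell in cell_types for rep in (1, 2)]

command = build_jontads_command(sample_names)
print(" ".join(command))
```

To execute the command rather than print it, pass the list to `subprocess.run(command, check=True)` from the folder that contains JOnTADS.py.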
Stripe Calling
To call stripes in addition to TADs:
python JOnTADS.py -F ./data/chr18.csv -O ./results/chr18.csv.tad --stripe_output ./results/chr18.csv.stripe -C 18 --stripe True
Input and Output Format
Input Format
Each input file should be a Hi-C contact matrix in plain text, with entries separated by spaces or commas.
In progress: we are working on supporting additional input formats.
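To check that a file matches this format before running JOnTADS, a small reader like the one below may help. It is only a sketch: the function name `parse_contact_matrix` is hypothetical, and the check that the matrix is square is an assumption about single-chromosome contact matrices rather than something JOnTADS is documented to enforce.

```python
import numpy as np


def parse_contact_matrix(lines):
    """Parse text rows whose entries are separated by spaces and/or commas."""
    rows = [
        [float(value) for value in line.replace(",", " ").split()]
        for line in lines
        if line.strip()  # ignore blank lines
    ]
    matrix = np.array(rows, dtype=float)
    # Assumption: a contact matrix for one chromosome is square.
    if matrix.ndim != 2 or matrix.shape[0] != matrix.shape[1]:
        raise ValueError(f"expected a square matrix, got shape {matrix.shape}")
    return matrix


# Toy rows (not real Hi-C data), one file style per example.
space_separated = ["0 2 1", "2 0 3", "1 3 0"]
comma_separated = ["0,2,1", "2,0,3", "1,3,0"]

matrix_a = parse_contact_matrix(space_separated)
matrix_b = parse_contact_matrix(comma_separated)
print(matrix_a.shape, np.array_equal(matrix_a, matrix_b))
```

For a file on disk, pass the open file handle, for example `with open("./data/chr18.csv") as handle: matrix = parse_contact_matrix(handle)`.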
Output Format
The output files contain information about the identified TADs or stripes.
For TAD calling, the output contains four columns:
start, end, TAD score, TAD size
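The four-column TAD output can be read back for downstream analysis with a few lines of Python. This is a sketch under assumptions: the README does not state the delimiter or whether a header line is present, so the reader below accepts spaces or commas and skips any line that is not four numeric fields. The names `Tad` and `parse_tad_lines` are hypothetical.

```python
import re
from typing import NamedTuple


class Tad(NamedTuple):
    start: float
    end: float
    score: float
    size: float


def parse_tad_lines(lines):
    """Collect rows with exactly four numeric fields: start, end, score, size."""
    tads = []
    for line in lines:
        fields = re.split(r"[,\s]+", line.strip())
        if len(fields) != 4:
            continue  # blank or malformed line
        try:
            tads.append(Tad(*(float(field) for field in fields)))
        except ValueError:
            continue  # non-numeric line, e.g. a header (assumed possible)
    return tads


# Made-up rows for illustration only, not real JOnTADS output.
toy_lines = ["start,end,score,size", "10 25 0.5 15", "", "30,40,1.25,10"]
tads = parse_tad_lines(toy_lines)
print(len(tads), tads[1].score)
```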
For stripe calling, the output contains six columns:
chr, x1, x2, chr, y1, y2
where the stripe extends from (x1, y1) to (x2, y2).
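A stripe record can be turned into its two corner points as described above. The sketch below assumes space- or comma-separated fields and tolerates a possible header line. Neither detail is stated in this README, and the names `Stripe` and `parse_stripe_lines` are hypothetical.

```python
import re
from typing import NamedTuple


class Stripe(NamedTuple):
    chrom_x: str
    x1: float
    x2: float
    chrom_y: str
    y1: float
    y2: float

    def corners(self):
        """Return ((x1, y1), (x2, y2)), the two ends of the stripe."""
        return (self.x1, self.y1), (self.x2, self.y2)


def parse_stripe_lines(lines):
    """Collect rows with six fields: chr, x1, x2, chr, y1, y2."""
    stripes = []
    for line in lines:
        fields = re.split(r"[,\s]+", line.strip())
        if len(fields) != 6:
            continue  # blank or malformed line
        chrom_x, x1, x2, chrom_y, y1, y2 = fields
        try:
            stripes.append(
                Stripe(chrom_x, float(x1), float(x2), chrom_y, float(y1), float(y2))
            )
        except ValueError:
            continue  # coordinates are not numeric, e.g. a header (assumed possible)
    return stripes


# Made-up row for illustration only, not real JOnTADS output.
stripes = parse_stripe_lines(["chr x1 x2 chr y1 y2", "chr18 100 100 chr18 100 180"])
print(stripes[0].corners())
```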
Parameters
-F: Input file(s) with Hi-C data.
-O: Output file(s) for the detected TADs.
-MAXSZ: Maximum size of TADs allowed, default 200.
-MINSZ: Minimum size of TADs allowed, default 7.
--stripe: Set to True to enable stripe detection.
-C: (When --stripe is set to True) Chromosome number for stripe calling, e.g. 18.
--stripe_output: (When --stripe is set to True) Output file for stripe calling results.
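The dependency between the stripe options can be checked before launching a run. The following is a sketch of a hypothetical wrapper, `jontads_args`, which is not part of JOnTADS. It uses only the flags and defaults listed above and refuses to build a stripe-calling command unless both -C and --stripe_output are supplied. The rule that -MINSZ should not exceed -MAXSZ is an assumption made for this example.

```python
def jontads_args(input_path, output_path, max_size=200, min_size=7,
                 stripe_output=None, chromosome=None):
    """Build a single-sample argument list from the documented parameters."""
    if min_size > max_size:
        # Assumption: a minimum above the maximum is never intended.
        raise ValueError("min_size must not exceed max_size")
    args = ["python", "JOnTADS.py", "-F", input_path, "-O", output_path,
            "-MAXSZ", str(max_size), "-MINSZ", str(min_size)]
    # Stripe calling needs both the chromosome and the stripe output file.
    if stripe_output is not None or chromosome is not None:
        if stripe_output is None or chromosome is None:
            raise ValueError("stripe calling needs both stripe_output and chromosome")
        args += ["--stripe_output", stripe_output, "-C", str(chromosome),
                 "--stripe", "True"]
    return args


tad_only = jontads_args("./data/chr18.csv", "./results/chr18.csv.tad")
with_stripes = jontads_args("./data/chr18.csv", "./results/chr18.csv.tad",
                            stripe_output="./results/chr18.csv.stripe",
                            chromosome=18)
print(" ".join(with_stripes))
```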
Contact
Feel free to contribute to the project by opening issues or pull requests. Any feedback or suggestions are highly appreciated. Correspondence should be addressed to qunhua.li@psu.edu. You can also contact the maintainer qiuhai.stat@gmail.com.
Happy analyzing your Hi-C data with JOnTADS!