A tool for analyzing SSRs in genomic data
Project description
Crossroad fastAPI and CLI
A comprehensive tool for analyzing Simple Sequence Repeats (SSRs) in genomic data,
Features
- SSR comparative analysis of genomic data
- Relative abundance and Relative density
- Conserved, shared, and unique SSR motifs
- Gene-based SSR analysis
- Mutational hotspot detection
- Evolutionary analysis:SSR length variation with respect to time point
- Both Reference free and Reference-based comparison
- REST API support
Installation
Using pip
Contributors
- Dr. Preeti Agarwal (PhD & Senior Research Fellow)
- Dr. Jitendra Narayan (Principal Investigator)
- Pranjal Pruthi (Project Scientist, CSIR IGIB)
Institution
CSIR-Institute of Genomics and Integrative Biology Lab of Bioinformatics and Big Data analysis Mall Road, New Delhi - 110007, India
License
MIT License
Citation
If you use Crossroad in your research, please cite: [Citation information coming soon]
| Output File | With Reference ID | Without Reference ID |
|---|---|---|
| hssr_data.csv | 1. find_different_repeats() finds differences from reference 2. group_ssr_records_from_excluded() groups these differences 3. filter_hotspot_records() filters for variations 4. process_hssr_data() creates final HSSR data |
1. group_ssr_records() groups all SSRs 2. filter_hotspot_records() filters for variations 3. process_hssr_data() creates final HSSR data |
| ref_ssr_genecombo.csv (excluded_repeats) |
1. find_different_repeats() identifies differences from reference 2. Saves directly to file 3. Only created when reference ID is given |
Not created |
| all_variations.csv | 1. find_different_repeats() finds differences 2. group_ssr_records_from_excluded() processes these 3. Saves to file before filtering |
1. group_ssr_records() groups all SSRs 2. Saves to file before filtering |
| mutational_hotspots.csv | 1. Uses grouped excluded repeats 2. filter_hotspot_records() finds variations 3. Applies min_repeat_count and min_genome_count filters |
1. Uses grouped all SSRs 2. filter_hotspot_records() finds variations 3. Applies min_repeat_count and min_genome_count filters |
Each file in simple terms:
hssr_data.csv:
- Final output containing hotspot SSRs
- With reference: Only includes variations different from reference
- Without reference: Includes all variations meeting criteria
ref_ssr_genecombo.csv (excluded_repeats):
- Only created when using reference ID
- Lists all SSRs that are different from reference genome
- Raw differences before processing
all_variations.csv:
- With reference: All variations of different SSRs from reference
- Without reference: All variations of all SSRs
- Intermediate file before filtering
mutational_hotspots.csv:
-
Contains filtered hotspots meeting criteria
-
Uses min_repeat_count and min_genome_count
-
With reference: Only from differences
-
Without reference: From all SSRs
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file crossroad_cli-0.2.4.tar.gz.
File metadata
- Download URL: crossroad_cli-0.2.4.tar.gz
- Upload date:
- Size: 67.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6c1dba3efc604a96117a61320f7ba246fdd6c278b1f4d13492f2fe65ac938cf0
|
|
| MD5 |
8a9cb4f55d9a084ee3725dd2e123566a
|
|
| BLAKE2b-256 |
10d7af811bc0f4678b0ae4cfd3b8ce2216efe8fb04f18514b519ec465f9afe5d
|
File details
Details for the file crossroad_cli-0.2.4-py3-none-any.whl.
File metadata
- Download URL: crossroad_cli-0.2.4-py3-none-any.whl
- Upload date:
- Size: 95.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3ff2cb470befd2ee0d39615f6d8229023b28ca00a3bdc69334730d7fb75348f3
|
|
| MD5 |
f5baa3fa42216778767a34786a9b9434
|
|
| BLAKE2b-256 |
7cad031b89338d0a1f0992866365a5f4c5d418efa946c34ce3de2b1bf06e377c
|