Skip to main content

A tool for genome-wide prediction of double-stranded RNA structures

Project description

dsRNAscan

CI Tests Python Platforms License: GPL v3

dsRNAscan is a bioinformatics tool for genome-wide identification of double-stranded RNA (dsRNA) structures. It uses a sliding window approach to detect inverted repeats that can form dsRNA secondary structures, with support for G-U wobble base pairing and ML-based scoring.

Browse human genome results at dsrna.chpc.utah.edu

Install

pip install dsrnascan

Platforms: Linux (Python 3.8+), macOS (Python 3.9+). Windows not supported (use WSL).

Dependencies (auto-installed): biopython, numpy, pandas, ViennaRNA

If ViennaRNA fails via pip, install with conda: conda install -c bioconda viennarna

Usage

# Basic scan (defaults: -w 10000 -s 500 --score 75 -c 4)
dsrnascan input.fasta

# Specific chromosome with 8 CPUs
dsrnascan genome.fasta --only_seq chr21 -c 8

# Scan a specific region
dsrnascan genome.fasta --only_seq chr21 --start 33455482 --end 33655482

# Faster scan with larger step size (less overlap between windows)
dsrnascan genome.fasta -s 5000 -c 16

# Sensitive scan for shorter dsRNAs
dsrnascan sequence.fasta -w 5000 --min_bp 15 --paired_cutoff 60

Parameters

Parameter Default Description
-w 10000 Window size (bp)
-s/--step 500 Step size between windows
-c/--cpus 4 Number of CPUs
--min_bp 25 Minimum base pairs required
--score 75 Minimum einverted score
--paired_cutoff 70 Minimum % paired bases
--only_seq None Specific chromosome(s) to scan
--start/--end Full seq Region coordinates
--forward-only False Forward strand only
--reverse-only False Reverse strand only
--no-ml False Disable ML scoring
--output-dir Auto Output directory

Run dsrnascan --help for advanced options (scoring parameters, repeat length limits, folding temperature, etc.).

Output

Results are written to the output directory:

*_merged_results.txt - Tab-delimited predictions with columns:

  • Coordinates: Chromosome, Strand, i_start, i_end, j_start, j_end
  • einverted: Score, RawMatch, PercMatch, Gaps
  • RNAduplex: dG(kcal/mol), base_pairs, percent_paired, longest_helix, eff_i/j_start/end, i_seq, j_seq, structure
  • ML scores: stability_model_score, probing_model_score, likely_edited (Yes/No), likely_forms (Yes/No)

*.bp - IGV arc visualization file

*.gff3 - GFF3 with mRNA/exon types so IGV renders the loop between arms as a thin connecting line

*.bedpe - BEDPE format with paired coordinates (one line per dsRNA)

Browse Results

# Interactive viewer with Forna RNA structure visualization
dsrna-browse results_directory/

# With RNA editing site annotations (BED or GFF3)
dsrna-browse results_directory/ --editing-file editing_sites.bed

Citation

If you use dsRNAscan, please cite:

Comprehensive mapping of human dsRNAome reveals conservation, neuronal enrichment, and intermolecular interactions https://doi.org/10.1101/2025.01.24.634786

Additional Tools

overlap_analyzer - Statistical enrichment analysis for genomic features overlapping dsRNA predictions. See overlap_analyzer/README.md. Not included in PyPI package; clone the repo to access.

License

GNU General Public License v3.0 - see LICENSE.

Issues: GitHub Issues

Acknowledgments

  • EMBOSS team for the einverted algorithm
  • ViennaRNA team for RNA folding algorithms

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dsrnascan-0.5.0.tar.gz (2.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dsrnascan-0.5.0-py3-none-any.whl (2.2 MB view details)

Uploaded Python 3

File details

Details for the file dsrnascan-0.5.0.tar.gz.

File metadata

  • Download URL: dsrnascan-0.5.0.tar.gz
  • Upload date:
  • Size: 2.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for dsrnascan-0.5.0.tar.gz
Algorithm Hash digest
SHA256 17f53c81817b7c4538587c31d0500c67121fcacffc5d416cfb05db20650f61c6
MD5 ce0c1eefc9c0f7f485e18640464cbef0
BLAKE2b-256 978e352dee5ea92935083509ca71db0e297a391d08d31c524116ee473c33bde2

See more details on using hashes here.

File details

Details for the file dsrnascan-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: dsrnascan-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 2.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for dsrnascan-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c7bc536ba0569fc1f168caf8c5d8fa9a0d65829eb331fd6dbcb47ad74349dad2
MD5 05e70a8208419768ad4449f1d48daa07
BLAKE2b-256 382ca2285dc0e9bf8b2dacfd7a0c7c3e203ea5b5540e72609c390ffa2c7c6660

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page