Skip to main content

A tool for genome-wide prediction of double-stranded RNA structures

Project description

dsRNAscan

CI Tests Python Platforms License: GPL v3

dsRNAscan is a bioinformatics tool for genome-wide identification of double-stranded RNA (dsRNA) structures. It uses a sliding window approach to detect inverted repeats that can form dsRNA secondary structures, with support for G-U wobble base pairing and ML-based scoring.

Browse human genome results at dsrna.chpc.utah.edu

Install

pip install dsrnascan

Platforms: Linux (Python 3.8+), macOS (Python 3.9+). Windows not supported (use WSL).

Dependencies (auto-installed): biopython, numpy, pandas, ViennaRNA

If ViennaRNA fails via pip, install with conda: conda install -c bioconda viennarna

Usage

# Basic scan (defaults: -w 10000 -s 150 --score 75 -c 4)
dsrnascan input.fasta

# Specific chromosome with 8 CPUs
dsrnascan genome.fasta --only_seq chr21 -c 8

# Scan a specific region
dsrnascan genome.fasta --only_seq chr21 --start 33455482 --end 33655482

# Faster scan with larger step size (less overlap between windows)
dsrnascan genome.fasta -s 5000 -c 16

# Sensitive scan for shorter dsRNAs
dsrnascan sequence.fasta -w 5000 --min_bp 15 --paired_cutoff 60

Parameters

Parameter Default Description
-w 10000 Window size (bp)
-s/--step 150 Step size between windows
-c/--cpus 4 Number of CPUs
--min_bp 25 Minimum base pairs required
--score 75 Minimum einverted score
--paired_cutoff 70 Minimum % paired bases
--only_seq None Specific chromosome(s) to scan
--start/--end Full seq Region coordinates
--forward-only False Forward strand only
--reverse-only False Reverse strand only
--no-ml False Disable ML scoring
--output-dir Auto Output directory

Run dsrnascan --help for advanced options (scoring parameters, repeat length limits, folding temperature, etc.).

Output

Results are written to the output directory:

*_merged_results.txt - Tab-delimited predictions with columns:

  • Coordinates: Chromosome, Strand, i_start, i_end, j_start, j_end
  • einverted: Score, RawMatch, PercMatch, Gaps
  • RNAduplex: dG(kcal/mol), base_pairs, percent_paired, longest_helix, eff_i/j_start/end, i_seq, j_seq, structure
  • ML scores: stability_model_score, probing_model_score, likely_edited (Yes/No), likely_forms (Yes/No)

*.bp - IGV arc visualization file

*.gff3 - GFF3 with mRNA/exon types so IGV renders the loop between arms as a thin connecting line

*.bedpe - BEDPE format with paired coordinates (one line per dsRNA)

Browse Results

# Interactive viewer with Forna RNA structure visualization
dsrna-browse results_directory/

# With RNA editing site annotations (BED or GFF3)
dsrna-browse results_directory/ --editing-file editing_sites.bed

Citation

If you use dsRNAscan, please cite:

Comprehensive mapping of human dsRNAome reveals conservation, neuronal enrichment, and intermolecular interactions https://doi.org/10.1101/2025.01.24.634786

Additional Tools

overlap_analyzer - Statistical enrichment analysis for genomic features overlapping dsRNA predictions. See overlap_analyzer/README.md. Not included in PyPI package; clone the repo to access.

License

GNU General Public License v3.0 - see LICENSE.

Issues: GitHub Issues

Acknowledgments

  • EMBOSS team for the einverted algorithm
  • ViennaRNA team for RNA folding algorithms

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dsrnascan-0.4.9.tar.gz (2.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dsrnascan-0.4.9-py3-none-any.whl (2.2 MB view details)

Uploaded Python 3

File details

Details for the file dsrnascan-0.4.9.tar.gz.

File metadata

  • Download URL: dsrnascan-0.4.9.tar.gz
  • Upload date:
  • Size: 2.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for dsrnascan-0.4.9.tar.gz
Algorithm Hash digest
SHA256 686a9340e8465552061409575aa0f518f1ae95dd3a504348a669103e5363d31d
MD5 c94668c448b72fa22c0524ef2681ed0d
BLAKE2b-256 0191d97f90dbe9a9f7b47aea017cf410db1e5d97dd677a1ca10bfe1ee5d38cde

See more details on using hashes here.

File details

Details for the file dsrnascan-0.4.9-py3-none-any.whl.

File metadata

  • Download URL: dsrnascan-0.4.9-py3-none-any.whl
  • Upload date:
  • Size: 2.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for dsrnascan-0.4.9-py3-none-any.whl
Algorithm Hash digest
SHA256 2c019e098c36a4ec1c34689f08784a399a830ba7c0929207b5b6ffa9a7fd6d2f
MD5 ee61b80e69059d7ce6b22927f652f7d2
BLAKE2b-256 4ed731e15c4e3d3c3a8b825d4a175abd839207faae31e56b89b30850df25a4a6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page