Skip to main content

A tool for genome-wide prediction of double-stranded RNA structures

Project description

dsRNAscan

CI Tests Python Platforms License: GPL v3

dsRNAscan is a bioinformatics tool for genome-wide identification of double-stranded RNA (dsRNA) structures. It uses a sliding window approach to detect inverted repeats that can form dsRNA secondary structures, with support for G-U wobble base pairing and ML-based scoring.

Browse human genome results at dsrna.chpc.utah.edu

Install

pip install dsrnascan
# No EMBOSS installation required - einverted binary is included

Platforms: Linux (Python 3.8+), macOS (Python 3.9+). Windows not supported (use WSL).

Dependencies (auto-installed): biopython, numpy, pandas, ViennaRNA

Optional (for ML scoring): pip install dsrnascan[scoring]

If ViennaRNA fails via pip, install with conda: conda install -c bioconda viennarna

Usage

# Basic scan (defaults: -w 10000 -s 150 --score 75 -c 4)
dsrnascan input.fasta

# Specific chromosome with 8 CPUs
dsrnascan genome.fasta --only_seq chr21 -c 8

# Scan a specific region
dsrnascan genome.fasta --only_seq chr21 --start 33455482 --end 33655482

# Faster scan with larger step size (less overlap between windows)
dsrnascan genome.fasta -s 5000 -c 16

# Sensitive scan for shorter dsRNAs
dsrnascan sequence.fasta -w 5000 --min_bp 15 --paired_cutoff 60

Parameters

Parameter Default Description
-w 10000 Window size (bp)
-s/--step 150 Step size between windows
-c/--cpus 4 Number of CPUs
--min_bp 25 Minimum base pairs required
--score 75 Minimum einverted score
--paired_cutoff 70 Minimum % paired bases
--only_seq None Specific chromosome(s) to scan
--start/--end Full seq Region coordinates
--forward-only False Forward strand only
--reverse-only False Reverse strand only
--no-ml False Disable ML scoring
--output-dir Auto Output directory

Run dsrnascan --help for advanced options (scoring parameters, repeat length limits, folding temperature, etc.).

Output

Results are written to the output directory:

*_merged_results.txt - Tab-delimited predictions with columns:

  • Coordinates: Chromosome, Strand, i_start, i_end, j_start, j_end
  • einverted: Score, RawMatch, PercMatch, Gaps
  • RNAduplex: dG(kcal/mol), base_pairs, percent_paired, longest_helix, eff_i/j_start/end, i_seq, j_seq, structure
  • ML scores (if enabled): stability_score, probing_score

*.dsRNApredictions.bp - IGV-compatible visualization file

Browse Results

# Interactive viewer with Forna RNA structure visualization
dsrna-browse results_directory/

# With RNA editing site annotations (BED or GFF3)
dsrna-browse results_directory/ --editing-file editing_sites.bed

Citation

If you use dsRNAscan, please cite:

Comprehensive mapping of human dsRNAome reveals conservation, neuronal enrichment, and intermolecular interactions https://doi.org/10.1101/2025.01.24.634786

Additional Tools

overlap_analyzer - Statistical enrichment analysis for genomic features overlapping dsRNA predictions. See overlap_analyzer/README.md. Not included in PyPI package; clone the repo to access.

License

GNU General Public License v3.0 - see LICENSE.

Issues: GitHub Issues

Acknowledgments

  • EMBOSS team for the einverted algorithm
  • ViennaRNA team for RNA folding algorithms

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dsrnascan-0.4.7.tar.gz (944.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dsrnascan-0.4.7-py3-none-any.whl (933.5 kB view details)

Uploaded Python 3

File details

Details for the file dsrnascan-0.4.7.tar.gz.

File metadata

  • Download URL: dsrnascan-0.4.7.tar.gz
  • Upload date:
  • Size: 944.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for dsrnascan-0.4.7.tar.gz
Algorithm Hash digest
SHA256 d16cf51547ce07917751e6c7eb170edf5b7ad51c2d365365e785316622159109
MD5 d8a696a6ca09e9844c8d02a8efd25356
BLAKE2b-256 88ba85105dfeca44bc28457a77dc405d11d2b07a66c2913b629d2805b7089d0b

See more details on using hashes here.

File details

Details for the file dsrnascan-0.4.7-py3-none-any.whl.

File metadata

  • Download URL: dsrnascan-0.4.7-py3-none-any.whl
  • Upload date:
  • Size: 933.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for dsrnascan-0.4.7-py3-none-any.whl
Algorithm Hash digest
SHA256 a71d65f51280711b7ca23addbae5fa90c97234c6cd78bd50e826aa819022af22
MD5 d9ad58616c7254d3eb12403ad3486aea
BLAKE2b-256 363b8304f853bba860e35c75e9a1ae033fc58c469ddd60d57ed52403dc5f090e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page