Skip to main content

A tool for genome-wide prediction of double-stranded RNA structures

Project description

dsRNAscan

CI Tests Python Platforms License: GPL v3

dsRNAscan is a bioinformatics tool for genome-wide identification of double-stranded RNA (dsRNA) structures. It uses a sliding window approach to detect inverted repeats that can form dsRNA secondary structures, with support for G-U wobble base pairing and ML-based scoring.

Browse human genome results at dsrna.chpc.utah.edu

Install

pip install dsrnascan

Platforms: Linux (Python 3.8+), macOS (Python 3.9+). Windows not supported (use WSL).

Dependencies (auto-installed): biopython, numpy, pandas, ViennaRNA

If ViennaRNA fails via pip, install with conda: conda install -c bioconda viennarna

Usage

# Basic scan (defaults: -w 10000 -s 150 --score 75 -c 4)
dsrnascan input.fasta

# Specific chromosome with 8 CPUs
dsrnascan genome.fasta --only_seq chr21 -c 8

# Scan a specific region
dsrnascan genome.fasta --only_seq chr21 --start 33455482 --end 33655482

# Faster scan with larger step size (less overlap between windows)
dsrnascan genome.fasta -s 5000 -c 16

# Sensitive scan for shorter dsRNAs
dsrnascan sequence.fasta -w 5000 --min_bp 15 --paired_cutoff 60

Parameters

Parameter Default Description
-w 10000 Window size (bp)
-s/--step 150 Step size between windows
-c/--cpus 4 Number of CPUs
--min_bp 25 Minimum base pairs required
--score 75 Minimum einverted score
--paired_cutoff 70 Minimum % paired bases
--only_seq None Specific chromosome(s) to scan
--start/--end Full seq Region coordinates
--forward-only False Forward strand only
--reverse-only False Reverse strand only
--no-ml False Disable ML scoring
--output-dir Auto Output directory

Run dsrnascan --help for advanced options (scoring parameters, repeat length limits, folding temperature, etc.).

Output

Results are written to the output directory:

*_merged_results.txt - Tab-delimited predictions with columns:

  • Coordinates: Chromosome, Strand, i_start, i_end, j_start, j_end
  • einverted: Score, RawMatch, PercMatch, Gaps
  • RNAduplex: dG(kcal/mol), base_pairs, percent_paired, longest_helix, eff_i/j_start/end, i_seq, j_seq, structure
  • ML scores: stability_model_score, probing_model_score, likely_edited (Yes/No), likely_forms (Yes/No)

*.bp - IGV arc visualization file

*.gff3 - GFF3 with mRNA/exon types so IGV renders the loop between arms as a thin connecting line

*.bedpe - BEDPE format with paired coordinates (one line per dsRNA)

Browse Results

# Interactive viewer with Forna RNA structure visualization
dsrna-browse results_directory/

# With RNA editing site annotations (BED or GFF3)
dsrna-browse results_directory/ --editing-file editing_sites.bed

Citation

If you use dsRNAscan, please cite:

Comprehensive mapping of human dsRNAome reveals conservation, neuronal enrichment, and intermolecular interactions https://doi.org/10.1101/2025.01.24.634786

Additional Tools

overlap_analyzer - Statistical enrichment analysis for genomic features overlapping dsRNA predictions. See overlap_analyzer/README.md. Not included in PyPI package; clone the repo to access.

License

GNU General Public License v3.0 - see LICENSE.

Issues: GitHub Issues

Acknowledgments

  • EMBOSS team for the einverted algorithm
  • ViennaRNA team for RNA folding algorithms

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dsrnascan-0.4.8.tar.gz (2.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dsrnascan-0.4.8-py3-none-any.whl (2.2 MB view details)

Uploaded Python 3

File details

Details for the file dsrnascan-0.4.8.tar.gz.

File metadata

  • Download URL: dsrnascan-0.4.8.tar.gz
  • Upload date:
  • Size: 2.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for dsrnascan-0.4.8.tar.gz
Algorithm Hash digest
SHA256 d12cec5fe7bc7913382d3619cec3a204a18730f00b5e3d6af1ba70cdfd44ffd4
MD5 9a8ea2c64e8fe6b9e9065a28ca68c274
BLAKE2b-256 dfbc6a5478bcfabfe5f91ec42343cc616242a4d52bddf7c559b4e6955a4fdfa2

See more details on using hashes here.

File details

Details for the file dsrnascan-0.4.8-py3-none-any.whl.

File metadata

  • Download URL: dsrnascan-0.4.8-py3-none-any.whl
  • Upload date:
  • Size: 2.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for dsrnascan-0.4.8-py3-none-any.whl
Algorithm Hash digest
SHA256 2179536f8904096641a89bff404b8c5c0233299a2f296bac6af44f4f69007830
MD5 c758b57287ad5716e6c12fa676499f34
BLAKE2b-256 ddb816e6a1027ef327c11a3efe8569ee89b8d1a580aa51f568a127ccf4b3f855

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page