Skip to main content

A package for miRNA protein prediction with SeqFinder and validator modules

Project description

miRNAProtPred

A Python package for miRNA protein prediction with SeqFinder and validator modules.

Overview

miRNAProtPred is a bioinformatics tool designed to identify miRNA target sequences in DNA, RNA, or protein sequences. It uses the Boyer-Moore string matching algorithm combined with BLAST and ViennaRNA for comprehensive miRNA target prediction and validation.

Features

  • SeqFinder: Find miRNA target sequences in DNA, RNA, or protein sequences

    • Automatic sequence type detection (DNA/RNA/Protein)
    • BLAST integration for protein sequence analysis
    • Boyer-Moore pattern matching for efficient sequence searching
    • ViennaRNA integration for minimum free energy (MFE) calculation
    • Probability scoring (High/Medium/Low) based on MFE values
    • Results export to CSV format
  • Validator: Validate miRNA-mRNA interactions (experimental module)

Installation

Prerequisites

  • Python >= 3.7
  • Required Python packages:
    • pandas >= 1.3.0
    • openpyxl >= 3.0.0
    • biopython
    • ViennaRNA
    • pyfiglet

Install from source

git clone https://github.com/somenath-combio/mirnaprotpred.git
cd mirnaprotpred
pip install -e .

Usage

SeqFinder

Find miRNA target sequences in your input sequence:

SeqFinder <sequence>

Examples:

# DNA sequence
SeqFinder "ATGCATGCATGCATGC"

# RNA sequence
SeqFinder "AUGCAUGCAUGCAUGC"

# Protein sequence
SeqFinder "MKKLAVSLLLFLSSLA"

The tool will:

  1. Automatically detect the sequence type
  2. Search for miRNA seed sequences from the database
  3. Calculate minimum free energy (MFE) using ViennaRNA
  4. Assign probability scores (High: MFE ≤ -15, Medium: -15 < MFE ≤ -10, Low: MFE > -10)
  5. Display results sorted by MFE
  6. Optionally save results to CSV

Validator

Validate miRNA-mRNA interactions:

validator <miRNA_sequence> <mRNA_sequence>

Example:

validator "AUGCAUGC" "GCAUGCAU"

Data Requirements

The SeqFinder module requires a data file located at data/data.xlsx containing:

  • miRNA descriptions
  • Human miRNA IDs
  • Accession numbers
  • Sequences
  • Seed sequences (seed1, seed2, seed3 columns)

Output

SeqFinder generates results with the following columns:

  • Description: miRNA description
  • Human miRNA ID: Identifier for the miRNA
  • Accession: Accession number
  • Sequence: Full miRNA sequence
  • Seed: Matched seed sequence
  • Position: Position of the match in the input sequence
  • CTS: Complementary target site sequence
  • MFE: Minimum free energy
  • Prob: Probability score (High/Medium/Low)

How It Works

  1. Sequence Type Detection: Automatically identifies whether the input is DNA, RNA, or protein
  2. Protein Processing: For protein sequences, uses BLAST to retrieve the corresponding nucleotide sequence
  3. Pattern Matching: Uses the Boyer-Moore algorithm to find miRNA seed sequences
  4. Energy Calculation: Calculates duplex formation energy using ViennaRNA
  5. Scoring: Assigns probability based on MFE thresholds

License

MIT License

Author

Sudipta Sardar (sudipta@pusan.ac.kr)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Citation

If you use this tool in your research, please cite appropriately.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mirnaprotpred-0.1.5.tar.gz (413.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mirnaprotpred-0.1.5-py3-none-any.whl (410.5 kB view details)

Uploaded Python 3

File details

Details for the file mirnaprotpred-0.1.5.tar.gz.

File metadata

  • Download URL: mirnaprotpred-0.1.5.tar.gz
  • Upload date:
  • Size: 413.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for mirnaprotpred-0.1.5.tar.gz
Algorithm Hash digest
SHA256 898a332d1f90e721fdb5da5531c3893847396cf26bd9a004d487e5b0d0aa2860
MD5 d7f964255049fcd54f105196b2e4aabe
BLAKE2b-256 10c04f0a6a9159af35f99b2588711870b44a9a59634c58f87a9d1a4db8afe569

See more details on using hashes here.

File details

Details for the file mirnaprotpred-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: mirnaprotpred-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 410.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for mirnaprotpred-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 cdbb7dc9639cdb0171c524d5fef7721046a8b6609733ccfc27f424b0b3cf1885
MD5 931acb8f982599561466b8cb6f5b2400
BLAKE2b-256 d377fe81fb57cfe96aa66810b8c4cbdfe0c95b9d04c7de384dc11d8acf15e21a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page