A package for miRNA protein prediction with SeqFinder and validator modules

miRNAProtPred

A Python package for miRNA protein prediction with SeqFinder and validator modules.

Overview

miRNAProtPred is a bioinformatics tool for finding miRNA target sites in DNA, RNA, or protein input sequences. It combines Boyer-Moore string matching with BLAST (used for protein input) and ViennaRNA (used for minimum free energy calculation) to predict and validate miRNA targets.

Features

  • SeqFinder: Find miRNA target sequences in DNA, RNA, or protein sequences

    • Automatic sequence type detection (DNA/RNA/Protein)
    • BLAST integration for protein sequence analysis
    • Boyer-Moore pattern matching for efficient sequence searching
    • ViennaRNA integration for minimum free energy (MFE) calculation
    • Probability scoring (High/Medium/Low) based on MFE values
    • Results export to CSV format
  • Validator: Validate miRNA-mRNA interactions (experimental module)

Installation

Prerequisites

  • Python >= 3.7
  • Required Python packages:
    • pandas >= 1.3.0
    • openpyxl >= 3.0.0
    • biopython
    • ViennaRNA
    • pyfiglet

Install from source

git clone https://github.com/somenath-combio/mirnaprotpred.git
cd mirnaprotpred
pip install -e .

Usage

SeqFinder

Find miRNA target sequences in your input sequence:

SeqFinder <sequence>

Examples:

# DNA sequence
SeqFinder "ATGCATGCATGCATGC"

# RNA sequence
SeqFinder "AUGCAUGCAUGCAUGC"

# Protein sequence
SeqFinder "MKKLAVSLLLFLSSLA"

The tool will:

  1. Automatically detect the sequence type
  2. Search for miRNA seed sequences from the database
  3. Calculate minimum free energy (MFE) using ViennaRNA
  4. Assign probability scores from the MFE, which ViennaRNA reports in kcal/mol (High: MFE ≤ -15, Medium: -15 < MFE ≤ -10, Low: MFE > -10)
  5. Display results sorted by MFE
  6. Optionally save results to CSV
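
The scoring and sorting steps can be sketched in a few lines. This is a minimal illustration of the thresholds listed above, not the package's own code: the function name `classify_mfe`, the example hits, and the sort direction (most negative MFE first) are assumptions.

```python
def classify_mfe(mfe):
    """Map an MFE value (kcal/mol) to High/Medium/Low using the listed thresholds."""
    if mfe <= -15:
        return "High"
    if mfe <= -10:
        return "Medium"
    return "Low"


# Hypothetical hits: (seed, MFE) pairs invented for illustration only.
hits = [("GCAUGCA", -9.4), ("AUGCAUG", -16.2), ("UGCAUGC", -12.0)]

# Most negative (most stable) duplex first; the sort direction is an assumption.
ranked = sorted(hits, key=lambda hit: hit[1])
labels = [(seed, mfe, classify_mfe(mfe)) for seed, mfe in ranked]

for seed, mfe, prob in labels:
    print(f"{seed}\t{mfe:.1f}\t{prob}")
```

Note that the boundary values fall into the stronger class: an MFE of exactly -15 is High and exactly -10 is Medium.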

Validator

Validate miRNA-mRNA interactions:

validator <miRNA_sequence> <mRNA_sequence>

Example:

validator "AUGCAUGC" "GCAUGCAU"

Data Requirements

The SeqFinder module requires a data file located at data/data.xlsx containing:

  • miRNA descriptions
  • Human miRNA IDs
  • Accession numbers
  • Sequences
  • Seed sequences (seed1, seed2, seed3 columns)
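
As a rough sketch of how the seed columns might be consumed, the snippet below gathers every non-empty seed from a list of row dictionaries. Only the `seed1`, `seed2`, and `seed3` column names come from the list above; the ID header `Human miRNA ID`, the helper name `collect_seeds`, and the sample rows are assumptions. Rows of this shape could be obtained with `pandas.read_excel("data/data.xlsx").to_dict("records")`.

```python
SEED_COLUMNS = ("seed1", "seed2", "seed3")  # column names from the list above


def collect_seeds(rows, id_column="Human miRNA ID"):
    """Return (miRNA ID, seed) pairs for every non-empty seed cell.

    `id_column` is an assumed header name; adjust it to match the real file.
    """
    pairs = []
    for row in rows:
        for column in SEED_COLUMNS:
            seed = row.get(column)
            # Skip missing cells: None, NaN floats, or blank strings.
            if isinstance(seed, str) and seed.strip():
                pairs.append((row.get(id_column), seed.strip().upper()))
    return pairs


# Made-up rows standing in for spreadsheet records.
rows = [
    {"Human miRNA ID": "example-miR-1", "seed1": "GCAUGCA", "seed2": "caugcau", "seed3": None},
    {"Human miRNA ID": "example-miR-2", "seed1": "AUGCAUG", "seed2": "", "seed3": float("nan")},
]
seed_pairs = collect_seeds(rows)
```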

Output

SeqFinder generates results with the following columns:

  • Description: miRNA description
  • Human miRNA ID: Identifier for the miRNA
  • Accession: Accession number
  • Sequence: Full miRNA sequence
  • Seed: Matched seed sequence
  • Position: Position of the match in the input sequence
  • CTS: Complementary target site sequence
  • MFE: Minimum free energy of duplex formation, as calculated by ViennaRNA
  • Prob: Probability score (High/Medium/Low)
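
For orientation, here is a small sketch of a CSV file with these columns, written with Python's standard `csv` module. The column names are taken from the list above; the helper `write_results` and every value in the example row are invented for illustration and do not come from a real SeqFinder run.

```python
import csv
import io

COLUMNS = ["Description", "Human miRNA ID", "Accession", "Sequence",
           "Seed", "Position", "CTS", "MFE", "Prob"]


def write_results(results, handle):
    """Write result dicts to `handle` as CSV, using the column order above."""
    writer = csv.DictWriter(handle, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(results)


# A single made-up row; none of these values come from a real run.
example = [{
    "Description": "example miRNA", "Human miRNA ID": "example-miR-1",
    "Accession": "EX0000001", "Sequence": "UGCAUGCAUGCAUGCAUGCAUG",
    "Seed": "GCAUGCA", "Position": 2, "CTS": "UGCAUGC",
    "MFE": -12.0, "Prob": "Medium",
}]

buffer = io.StringIO()  # swap for open("results.csv", "w", newline="") to write a file
write_results(example, buffer)
csv_text = buffer.getvalue()
```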

How It Works

  1. Sequence Type Detection: Automatically identifies whether the input is DNA, RNA, or protein
  2. Protein Processing: For protein sequences, uses BLAST to retrieve the corresponding nucleotide sequence
  3. Pattern Matching: Uses the Boyer-Moore algorithm to find miRNA seed sequences
  4. Energy Calculation: Calculates duplex formation energy using ViennaRNA
  5. Scoring: Assigns probability based on MFE thresholds
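
Steps 1 and 3 can be illustrated with a short, self-contained sketch. It is not the package's implementation: the detection rules are a simple alphabet heuristic chosen for this example, and the search uses Boyer-Moore-Horspool, a simplified Boyer-Moore variant that keeps only the bad-character shift. The function names and the 0-based positions are assumptions.

```python
def detect_sequence_type(sequence):
    """Guess DNA, RNA, or Protein from the letters used (heuristic, assumed rules)."""
    letters = set(sequence.upper())
    if not letters:
        raise ValueError("empty sequence")
    if letters <= set("ACGT"):
        return "DNA"  # sequences using only A, C, G default to DNA
    if letters <= set("ACGU"):
        return "RNA"
    if letters <= set("ACDEFGHIKLMNPQRSTVWY"):
        return "Protein"
    raise ValueError("unrecognised characters in sequence")


def horspool_search(text, pattern):
    """Return the 0-based start index of every occurrence of `pattern` in `text`."""
    m, n = len(pattern), len(text)
    if m == 0 or m > n:
        return []
    # Bad-character table: for each character in pattern[:-1], the distance
    # from its last occurrence to the end of the pattern.
    shift = {ch: m - 1 - i for i, ch in enumerate(pattern[:-1])}
    hits = []
    pos = 0
    while pos <= n - m:
        if text[pos:pos + m] == pattern:
            hits.append(pos)
        # Shift by the table entry for the last character of the current window;
        # characters absent from the table allow a full-length jump.
        pos += shift.get(text[pos + m - 1], m)
    return hits


target = "AUGCAUGCAUGCAUGC"
print(detect_sequence_type(target))        # RNA
print(horspool_search(target, "GCAUGCA"))  # [2, 6]
```

The shift table lets the search skip ahead by up to the full pattern length after a mismatch, which is where the efficiency over a naive scan comes from. Overlapping matches are still reported.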

License

MIT License

Author

Sudipta Sardar (sudipta@pusan.ac.kr)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Citation

If you use this tool in your research, please cite appropriately.
