Skip to main content

A package for miRNA protein prediction with SeqFinder and validator modules

Project description

miRNAProtPred

A Python package for miRNA protein prediction with SeqFinder and validator modules.

Overview

miRNAProtPred is a bioinformatics tool designed to identify miRNA target sequences in DNA, RNA, or protein sequences. It uses the Boyer-Moore string matching algorithm combined with BLAST and ViennaRNA for comprehensive miRNA target prediction and validation.

Features

  • SeqFinder: Find miRNA target sequences in DNA, RNA, or protein sequences

    • Automatic sequence type detection (DNA/RNA/Protein)
    • BLAST integration for protein sequence analysis
    • Boyer-Moore pattern matching for efficient sequence searching
    • ViennaRNA integration for minimum free energy (MFE) calculation
    • Probability scoring (High/Medium/Low) based on MFE values
    • Results export to CSV format
  • Validator: Validate miRNA-mRNA interactions (experimental module)

Installation

Prerequisites

  • Python >= 3.7
  • Required Python packages:
    • pandas >= 1.3.0
    • openpyxl >= 3.0.0
    • biopython
    • ViennaRNA
    • pyfiglet

Install from source

git clone https://github.com/somenath-combio/mirnaprotpred.git
cd mirnaprotpred
pip install -e .

Usage

SeqFinder

Find miRNA target sequences in your input sequence:

SeqFinder <sequence>

Examples:

# DNA sequence
SeqFinder "ATGCATGCATGCATGC"

# RNA sequence
SeqFinder "AUGCAUGCAUGCAUGC"

# Protein sequence
SeqFinder "MKKLAVSLLLFLSSLA"

The tool will:

  1. Automatically detect the sequence type
  2. Search for miRNA seed sequences from the database
  3. Calculate minimum free energy (MFE) using ViennaRNA
  4. Assign probability scores (High: MFE ≤ -15, Medium: -15 < MFE ≤ -10, Low: MFE > -10)
  5. Display results sorted by MFE
  6. Optionally save results to CSV

Validator

Validate miRNA-mRNA interactions:

validator <miRNA_sequence> <mRNA_sequence>

Example:

validator "AUGCAUGC" "GCAUGCAU"

Data Requirements

The SeqFinder module requires a data file located at data/data.xlsx containing:

  • miRNA descriptions
  • Human miRNA IDs
  • Accession numbers
  • Sequences
  • Seed sequences (seed1, seed2, seed3 columns)

Output

SeqFinder generates results with the following columns:

  • Description: miRNA description
  • Human miRNA ID: Identifier for the miRNA
  • Accession: Accession number
  • Sequence: Full miRNA sequence
  • Seed: Matched seed sequence
  • Position: Position of the match in the input sequence
  • CTS: Complementary target site sequence
  • MFE: Minimum free energy
  • Prob: Probability score (High/Medium/Low)

How It Works

  1. Sequence Type Detection: Automatically identifies whether the input is DNA, RNA, or protein
  2. Protein Processing: For protein sequences, uses BLAST to retrieve the corresponding nucleotide sequence
  3. Pattern Matching: Uses the Boyer-Moore algorithm to find miRNA seed sequences
  4. Energy Calculation: Calculates duplex formation energy using ViennaRNA
  5. Scoring: Assigns probability based on MFE thresholds

License

MIT License

Author

Sudipta Sardar (sudipta@pusan.ac.kr)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Citation

If you use this tool in your research, please cite appropriately.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mirnaprotpred-0.1.3.tar.gz (413.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mirnaprotpred-0.1.3-py3-none-any.whl (410.4 kB view details)

Uploaded Python 3

File details

Details for the file mirnaprotpred-0.1.3.tar.gz.

File metadata

  • Download URL: mirnaprotpred-0.1.3.tar.gz
  • Upload date:
  • Size: 413.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for mirnaprotpred-0.1.3.tar.gz
Algorithm Hash digest
SHA256 145643617d5c9939a1b45ff2dd7d760af17a70f6c89d89a4ccd0abb050bf819d
MD5 976720883e413b20f5e59acf033d50cd
BLAKE2b-256 a34901c62c99bacdc202229fa375bc90fba25e36c0b4c5c7e38da127706ae06c

See more details on using hashes here.

File details

Details for the file mirnaprotpred-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: mirnaprotpred-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 410.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for mirnaprotpred-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 3fe58d9031e0c6f2566f8e4a6b536c85bbc46d4034790589660e11d4f8111a58
MD5 fdba35c0bc738fbee88583c2fa64d27c
BLAKE2b-256 8625740704bd4b4e42c106ff7be03b61c7b7f8800c7268131c0a8bc181bb1aea

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page