Skip to main content

A package for miRNA protein prediction with SeqFinder and validator modules

Project description

miRNAProtPred

A Python package for miRNA protein prediction with SeqFinder and validator modules.

Overview

miRNAProtPred is a bioinformatics tool designed to identify miRNA target sequences in DNA, RNA, or protein sequences. It uses the Boyer-Moore string matching algorithm combined with BLAST and ViennaRNA for comprehensive miRNA target prediction and validation.

Features

  • SeqFinder: Find miRNA target sequences in DNA, RNA, or protein sequences

    • Automatic sequence type detection (DNA/RNA/Protein)
    • BLAST integration for protein sequence analysis
    • Boyer-Moore pattern matching for efficient sequence searching
    • ViennaRNA integration for minimum free energy (MFE) calculation
    • Probability scoring (High/Medium/Low) based on MFE values
    • Results export to CSV format
  • Validator: Validate miRNA-mRNA interactions (experimental module)

Installation

Prerequisites

  • Python >= 3.7
  • Required Python packages:
    • pandas >= 1.3.0
    • openpyxl >= 3.0.0
    • biopython
    • ViennaRNA
    • pyfiglet

Install from source

git clone https://github.com/somenath-combio/mirnaprotpred.git
cd mirnaprotpred
pip install -e .

Usage

SeqFinder

Find miRNA target sequences in your input sequence:

SeqFinder <sequence>

Examples:

# DNA sequence
SeqFinder "ATGCATGCATGCATGC"

# RNA sequence
SeqFinder "AUGCAUGCAUGCAUGC"

# Protein sequence
SeqFinder "MKKLAVSLLLFLSSLA"

The tool will:

  1. Automatically detect the sequence type
  2. Search for miRNA seed sequences from the database
  3. Calculate minimum free energy (MFE) using ViennaRNA
  4. Assign probability scores (High: MFE ≤ -15, Medium: -15 < MFE ≤ -10, Low: MFE > -10)
  5. Display results sorted by MFE
  6. Optionally save results to CSV

Validator

Validate miRNA-mRNA interactions:

validator <miRNA_sequence> <mRNA_sequence>

Example:

validator "AUGCAUGC" "GCAUGCAU"

Data Requirements

The SeqFinder module requires a data file located at data/data.xlsx containing:

  • miRNA descriptions
  • Human miRNA IDs
  • Accession numbers
  • Sequences
  • Seed sequences (seed1, seed2, seed3 columns)

Output

SeqFinder generates results with the following columns:

  • Description: miRNA description
  • Human miRNA ID: Identifier for the miRNA
  • Accession: Accession number
  • Sequence: Full miRNA sequence
  • Seed: Matched seed sequence
  • Position: Position of the match in the input sequence
  • CTS: Complementary target site sequence
  • MFE: Minimum free energy
  • Prob: Probability score (High/Medium/Low)

How It Works

  1. Sequence Type Detection: Automatically identifies whether the input is DNA, RNA, or protein
  2. Protein Processing: For protein sequences, uses BLAST to retrieve the corresponding nucleotide sequence
  3. Pattern Matching: Uses the Boyer-Moore algorithm to find miRNA seed sequences
  4. Energy Calculation: Calculates duplex formation energy using ViennaRNA
  5. Scoring: Assigns probability based on MFE thresholds

License

MIT License

Author

Sudipta Sardar (sudipta@pusan.ac.kr)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Citation

If you use this tool in your research, please cite appropriately.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mirnaprotpred-0.1.6.tar.gz (413.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mirnaprotpred-0.1.6-py3-none-any.whl (410.5 kB view details)

Uploaded Python 3

File details

Details for the file mirnaprotpred-0.1.6.tar.gz.

File metadata

  • Download URL: mirnaprotpred-0.1.6.tar.gz
  • Upload date:
  • Size: 413.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for mirnaprotpred-0.1.6.tar.gz
Algorithm Hash digest
SHA256 8697c35dff563b1cfe85dc4c02c615b4c09f8f95d262c32790744c45d7de3831
MD5 441180200b955eb47fddfb3fb7d54aae
BLAKE2b-256 0117fa2a17328d0f4d1477180bdd597148731b44929dab8f81483633c888a0f6

See more details on using hashes here.

File details

Details for the file mirnaprotpred-0.1.6-py3-none-any.whl.

File metadata

  • Download URL: mirnaprotpred-0.1.6-py3-none-any.whl
  • Upload date:
  • Size: 410.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for mirnaprotpred-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 401c32f1d40a6967ffa6884502847922c17bc975b21c65a06169a6b1cf2a4721
MD5 c52222b6873cb40e1af360ffbb5594a7
BLAKE2b-256 6db0389658a1fe56d3839d9fa88f01f69976a211d3b5a921603e7e1e55087979

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page