Skip to main content

A package for miRNA protein prediction with SeqFinder and validator modules

Project description

miRNAProtPred

A Python package for miRNA protein prediction with SeqFinder and validator modules.

Overview

miRNAProtPred is a bioinformatics tool designed to identify miRNA target sequences in DNA, RNA, or protein sequences. It uses the Boyer-Moore string matching algorithm combined with BLAST and ViennaRNA for comprehensive miRNA target prediction and validation.

Features

  • SeqFinder: Find miRNA target sequences in DNA, RNA, or protein sequences

    • Automatic sequence type detection (DNA/RNA/Protein)
    • BLAST integration for protein sequence analysis
    • Boyer-Moore pattern matching for efficient sequence searching
    • ViennaRNA integration for minimum free energy (MFE) calculation
    • Probability scoring (High/Medium/Low) based on MFE values
    • Results export to CSV format
  • Validator: Validate miRNA-mRNA interactions (experimental module)

Installation

Prerequisites

  • Python >= 3.7
  • Required Python packages:
    • pandas >= 1.3.0
    • openpyxl >= 3.0.0
    • biopython
    • ViennaRNA
    • pyfiglet

Install from source

git clone https://github.com/somenath-combio/mirnaprotpred.git
cd mirnaprotpred
pip install -e .

Usage

SeqFinder

Find miRNA target sequences in your input sequence:

SeqFinder <sequence>

Examples:

# DNA sequence
SeqFinder "ATGCATGCATGCATGC"

# RNA sequence
SeqFinder "AUGCAUGCAUGCAUGC"

# Protein sequence
SeqFinder "MKKLAVSLLLFLSSLA"

The tool will:

  1. Automatically detect the sequence type
  2. Search for miRNA seed sequences from the database
  3. Calculate minimum free energy (MFE) using ViennaRNA
  4. Assign probability scores (High: MFE ≤ -15, Medium: -15 < MFE ≤ -10, Low: MFE > -10)
  5. Display results sorted by MFE
  6. Optionally save results to CSV

Validator

Validate miRNA-mRNA interactions:

validator <miRNA_sequence> <mRNA_sequence>

Example:

validator "AUGCAUGC" "GCAUGCAU"

Data Requirements

The SeqFinder module requires a data file located at data/data.xlsx containing:

  • miRNA descriptions
  • Human miRNA IDs
  • Accession numbers
  • Sequences
  • Seed sequences (seed1, seed2, seed3 columns)

Output

SeqFinder generates results with the following columns:

  • Description: miRNA description
  • Human miRNA ID: Identifier for the miRNA
  • Accession: Accession number
  • Sequence: Full miRNA sequence
  • Seed: Matched seed sequence
  • Position: Position of the match in the input sequence
  • CTS: Complementary target site sequence
  • MFE: Minimum free energy
  • Prob: Probability score (High/Medium/Low)

How It Works

  1. Sequence Type Detection: Automatically identifies whether the input is DNA, RNA, or protein
  2. Protein Processing: For protein sequences, uses BLAST to retrieve the corresponding nucleotide sequence
  3. Pattern Matching: Uses the Boyer-Moore algorithm to find miRNA seed sequences
  4. Energy Calculation: Calculates duplex formation energy using ViennaRNA
  5. Scoring: Assigns probability based on MFE thresholds

License

MIT License

Author

Sudipta Sardar (sudipta@pusan.ac.kr)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Citation

If you use this tool in your research, please cite appropriately.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mirnaprotpred-0.1.0.tar.gz (210.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mirnaprotpred-0.1.0-py3-none-any.whl (8.5 kB view details)

Uploaded Python 3

File details

Details for the file mirnaprotpred-0.1.0.tar.gz.

File metadata

  • Download URL: mirnaprotpred-0.1.0.tar.gz
  • Upload date:
  • Size: 210.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for mirnaprotpred-0.1.0.tar.gz
Algorithm Hash digest
SHA256 ecf79273ae0ab738e8dc2d10c6d861278ea2d12baf36c03ee10b4513d9623cdd
MD5 59ce379cd2a24bfcd8419ab267e58b48
BLAKE2b-256 b311370c59e59854b3b2d88213efe8054da323488d18f738972f2e0786c99ebb

See more details on using hashes here.

File details

Details for the file mirnaprotpred-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: mirnaprotpred-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 8.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for mirnaprotpred-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7bb994d8f83d504226e89c294227f10baefd299b536453446dd5842e40f3d59d
MD5 cd462dfc371c6462846291ca71b92a95
BLAKE2b-256 3e932270d510f9db4ad65b52b977a35882c8c8c7fa0e6d87374382c3e83d4dcd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page