Skip to main content

Comprehensive Python Module for Protein Data Management: Designed for streamlined integration and processing of protein information from both UniProt and PDB. Equipped with features for concurrent data fetching, robust error handling, and database synchronization.

Project description

PyPI - Version Documentation Status Linting Status

Metamorphic & Multifunctional Protein Search

🔬 Overview

metamorphic_multifunction_search is a systematic protocol for the large-scale detection of structural metamorphisms and protein multifunctionality, built on top of the Protein Information System (PIS).

The project combines structural alignments, functional GO annotations, and protein language models to uncover hidden relationships between structure and function across model and non-model organisms.


🧠 What Does This Protocol Do?

1. Structural Metamorphism Detection

  • Aligns 3D protein structures with high sequence identity.
  • Detects divergent conformations (i.e. metamorphisms) using metrics like RMSD or FC-score.
  • Uses large-scale filtering (e.g., CD-HIT) and pairwise structural comparison.

2. Functional Multifunctionality Analysis

  • Extracts Gene Ontology (GO) annotations per protein.
  • Computes semantic distances between GO terms within each namespace (MF, BP, CC).
  • Identifies the most divergent pair of terms per protein to quantify multifunctionality.

⚙️ Requirements

  • Python 3.11.6
  • RabbitMQ
  • PostgreSQL with pgvector extension
  • Docker (optional but recommended for deployment)

🚀 Quick Start

  1. Start PostgreSQL with pgvector:
docker run -d --name pgvectorsql \
    -e POSTGRES_USER=user \
    -e POSTGRES_PASSWORD=password \
    -e POSTGRES_DB=BioData \
    -p 5432:5432 \
    pgvector/pgvector:pg16
  1. Start RabbitMQ:
docker run -d --name rabbitmq \
    -p 15672:15672 \
    -p 5672:5672 \
    rabbitmq:management
  1. Run the main protocol:
python main.py

This command executes the full pipeline: data extraction, structural filtering, alignment, functional analysis, and metric computation.


⚒️ Customization

You can tailor the pipeline by editing the config.yaml file or modifying main.py to:

  • Switch embedding models
  • Apply taxonomy-based filters
  • Add new annotation types or similarity metrics

📚 Related Projects


Let me know if you'd like me to overwrite your current README.md with this version.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

metamorphic_multifunction_search-4.2.0.tar.gz (20.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file metamorphic_multifunction_search-4.2.0.tar.gz.

File metadata

File hashes

Hashes for metamorphic_multifunction_search-4.2.0.tar.gz
Algorithm Hash digest
SHA256 40def4633f30198754d13be9d0fc2bcce994474313e6370cbb4d914695582dc1
MD5 3012ab77262927493d78d9de1525f8f7
BLAKE2b-256 25641a6653240b961ae90150a3b7b7c82edf2b5a22daf38c97f11434cc7db8ea

See more details on using hashes here.

File details

Details for the file metamorphic_multifunction_search-4.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for metamorphic_multifunction_search-4.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 aad57eada86b8ea740e6123bd82965eb289536c9f25d13c7ed814c55b2db20f1
MD5 ffad99134532618cfc6439659eeaed9c
BLAKE2b-256 d30e837f5f61c5ea0a4c566398793668531e8c81bf0151a6f15f06a137e30b97

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page