Skip to main content

Weak Heuristic Inference for Supervisory Protein intERaction mapping for PDB and AP-MS datasets

Project description

whisper

PyPI Docs Python

whisper-ppi is a Python package for scoring protein–protein interactions from proximity labeling and affinity purification mass spectrometry datasets.
It uses interpretable features, programmatic weak supervision, and decoy-based false discovery rate (FDR) estimation to identify high-confidence interactors.


Install from PyPI:

pip install whisper-ppi

Install from GitHub:

git clone https://github.com/camlab-bioml/whisper
cd whisper
pip install .

Input Format

  • A CSV file with:
    • One column named Protein
    • Other columns representing bait replicate intensities, named as BAIT_1, BAIT_2, etc.
  • Control samples must be identifiable via substrings in their column names (e.g., "EGFP" or "Empty").

Usage

#protein-level
from whisper.protein_features import feature_engineering_protein
from whisper.protein_train import train_and_score_protein
import pandas as pd


# Load intensity table
intensity_df = pd.read_csv("input_intensity_dataset.tsv", sep="\t")

controls = ['EGFP', 'Empty', 'NminiTurbo']

# Run feature engineering
features_df = feature_engineering_protein(intensity_df, controls)

# You can save the features to use in the next step with different settings without generating them again.
features_df = pd.read_csv("features.csv")


# Run scoring and FDR estimation
scored_df = train_and_score_protein(features_df, initial_positives=15, initial_negatives=200)


#peptide-level
from whisper.peptide_features import feature_engineering_peptide
from whisper.peptide_train import train_and_score_peptide
import pandas as pd


# Load intensity table
intensity_df = pd.read_csv("input_intensity_dataset.tsv", sep="\t")

controls = ['EGFP', 'Empty', 'NminiTurbo']

# Run feature engineering
features_df = feature_engineering_peptide(intensity_df, controls)

# features_df = pd.read_csv("features.csv")


# Run scoring and FDR estimation
scored_df = train_and_score_peptide(features_df, initial_positives=15, initial_negatives=200)


#fragment-level
from whisper.fragment_features import feature_engineering_fragment
from whisper.fragment_train import train_and_score_fragment
import pandas as pd


# Load intensity table
intensity_df = pd.read_csv("input_intensity_dataset.tsv", sep="\t")

controls = ['EGFP', 'Empty', 'NminiTurbo']

# Run feature engineering
features_df = feature_engineering_fragment(intensity_df, controls)

# features_df = pd.read_csv("features.csv")


# Run scoring and FDR estimation
scored_df = train_and_score_fragment(features_df, initial_positives=15, initial_negatives=200)

Output

The final output includes:

  • predicted_probability: Probability of each bait–prey interaction being real
  • FDR: Estimated false discovery rate
  • global_cv_flag: Flag for likely background preys based on variability across all samples

Tutorial

Read the full documentation

Citation

This software is authored by: Vesal Kasmaeifar, Kieran R Campbell

Lunenfeld-Tanenbaum Research Institute & University of Toronto

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

whisper_ppi-0.1.1.tar.gz (11.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

whisper_ppi-0.1.1-py3-none-any.whl (16.5 kB view details)

Uploaded Python 3

File details

Details for the file whisper_ppi-0.1.1.tar.gz.

File metadata

  • Download URL: whisper_ppi-0.1.1.tar.gz
  • Upload date:
  • Size: 11.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.4

File hashes

Hashes for whisper_ppi-0.1.1.tar.gz
Algorithm Hash digest
SHA256 120ea5a121df0a87f5926267b6e3b3ebdb8c2a331640a482ec63dec34fb0b151
MD5 21b254ec2c14874a8c70cc0107804717
BLAKE2b-256 8b2b5361a21f3a1edf714a6a84225f2edb785f8782c2bdf3af405dd14067080b

See more details on using hashes here.

File details

Details for the file whisper_ppi-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: whisper_ppi-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 16.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.4

File hashes

Hashes for whisper_ppi-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7e4216f08e463b4eddaaa8a7f7ee832c02a0e29d3f41824f02f7bf6ef08d3f5f
MD5 a5fdef2f0c3535e9b8f15eac9ff8072e
BLAKE2b-256 c504055918bb9624178d423d2ec5cfa55c08ec6ab1d18d1a2daf101f4c7301bc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page