Skip to main content

PRISM: Probabilistic Inference of Subject-level Mixture for contextualized differential expression

Project description

PRISM

Phenotype-Resolved Inference in Single-Cell Mixed Models via Latent Disease States and Contextualized Differential Expression

▶ Full model explainer (1080p, ~1 min)

Overview

PRISM extends the NEBULA negative-binomial log-normal mixed model for single-cell differential expression by introducing:

  1. Latent per-cell disease states $d_{ij} \sim \text{Bernoulli}(\rho_i)$ that separate truly affected cells from unaffected ones within disease subjects.
  2. Contextualized covariate effects $x^\top \Gamma_g z$ that model how context modulates all covariate effects (a main effect of context on expression).
  3. Context-dependent disease effects $\Delta_g(z) = \alpha_g + \theta_g^\top z$ that let the DE magnitude vary with continuous cell-level covariates ($\alpha_g$ = constant disease effect, $\theta_g$ = context modulation).
  4. An EM algorithm that alternates between inferring $q_{ij} = P(d_{ij}=1 \mid Y, \Theta)$ (E-step) and optimizing the NEBULA-LN approximate likelihood weighted by $q_{ij}$ (M-step).

Installation

Option A — conda (recommended)

git clone https://github.com/AndreaRubbi/ContextualizedDifferentialExpression.git && cd ContextualizedDifferentialExpression
conda env create -f environment.yml
conda activate prism
pip install -e .

Option B — pip + venv

python -m venv .venv
source .venv/bin/activate
pip install -e ".[dev]"

# If CUDA 12.4 driver, pin a compatible torch build:
pip install torch==2.5.1+cu124 --index-url https://download.pytorch.org/whl/cu124

# Required for HVG selection (seurat_v3 flavor):
pip install scikit-misc

Verify

python -c "
import torch, prism
print(f'PRISM v{prism.__version__}')
print(f'PyTorch {torch.__version__}, CUDA available: {torch.cuda.is_available()}')
"

Quick start

from prism.data.simulation import generate_prism_data
from prism.inference.em_loop import PRISMConfig, PRISMTrainer

# 1. Generate synthetic data: 50 subjects × 200 cells × 60 genes
data, ground_truth = generate_prism_data(
    n_subjects=50, n_genes=60, n_cells_per_subject=200,
    rho=0.6, seed=42, device="cpu",
)

# 2. Configure and fit
config = PRISMConfig(
    n_genes=data.n_genes,
    n_covars=data.n_covars,
    n_context=data.n_context,
    max_em_iter=30,
    m_step_optimizer="trust_region",
    run_wald=True,
    wandb_enabled=False,
    device="cpu",
)
trainer = PRISMTrainer(config)
results = trainer.fit(data)

# 3. Inspect results
print(f"EM iterations: {results.em_iterations}")
print(f"Converged: {results.converged}")
sig = results.significant_genes(fdr=0.05)
print(f"Significant genes (FDR<0.05): {sig.sum().item()}")

Project layout

prism/
├── prism/
│   ├── model/           # PrismModel, encoders, NB/HL likelihoods
│   ├── inference/       # EM (e_step, m_step, em_loop), Wald & score tests
│   ├── data/            # PrismData, simulation, ROSMAP preprocessing
│   ├── baselines/       # NEBULA, context-only, latent-only, stratified
│   ├── evaluation/      # Metrics and plotting
│   └── utils/           # Numerical helpers, logging
├── tests/               # Unit and integration tests
└── tutorials/           # Notebook tutorials (API, synthetic, ROSMAP)

Tutorials

Reproducing the paper

All scripts, pipelines and notebooks that reproduce the paper's results live in the top-level reproducibility/ folder (Snakemake ablations, NEBULA-style benchmark, external baselines, ROSMAP and COVID real-data analyses). See ../reproducibility/README.md.

Citation

@article{prism2026,
  title={PRISM: Phenotype-Resolved Inference in Single-Cell Mixed Models
         via Latent Disease States and Contextualized Differential Expression},
  author={Anonymous Authors},
  journal={Under review},
  year={2026}
}

Regenerating the animations

The animated visuals are built with Manim Community.

pip install manim
cd media

# Standalone logo (1080p MP4)
manim -qh prism_explainer.py PRISMLogo

# Full explainer (1080p MP4)
manim -qh prism_explainer.py PRISMExplainer

# Full explainer (720p GIF — large file)
manim -qm --format=gif prism_explainer.py PRISMExplainer

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

prism_de-0.1.1.tar.gz (84.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

prism_de-0.1.1-py3-none-any.whl (84.0 kB view details)

Uploaded Python 3

File details

Details for the file prism_de-0.1.1.tar.gz.

File metadata

  • Download URL: prism_de-0.1.1.tar.gz
  • Upload date:
  • Size: 84.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for prism_de-0.1.1.tar.gz
Algorithm Hash digest
SHA256 8647ddc08b5f2bf8e5fed70c5129cf0571320ef2a8ec4b3e057afd2e05e5a74b
MD5 8636c0c8dd95c6061976308b3ca2600c
BLAKE2b-256 daffde5e3d34c68ec674b5373a4edc9d908c2fd6d334a12b013de0f7adac48e4

See more details on using hashes here.

Provenance

The following attestation bundles were made for prism_de-0.1.1.tar.gz:

Publisher: pypi.yml on AndreaRubbi/PRISM

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file prism_de-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: prism_de-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 84.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for prism_de-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e2979c074eff50598f4fc027cecbc7b667af493d767b592a4531575ef5039232
MD5 b51945b05f04eb9638737d039fb6a948
BLAKE2b-256 dfce98a1575d31af40a04803398df4a1f13f4b648487e4e7ff55e532c16149e6

See more details on using hashes here.

Provenance

The following attestation bundles were made for prism_de-0.1.1-py3-none-any.whl:

Publisher: pypi.yml on AndreaRubbi/PRISM

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page