Skip to main content

multiHIVE, a hierarchical multimodal deep generative model for integrating multiomics data

Project description

multiHIVE

Overview

multiHIVE is a hierarchical multimodal deep generative model designed to infer cellular embeddings by integrating multi-omics data with different modalities from the same cell. It uses:

  • Hierarchically stacked latent variables to capture shared biological signals
  • Modality-specific latent variables to model private (modality-unique) variation

This enables multiHIVE to perform:

  • Joint integration of multi-modal data
  • Denoising
  • Protein imputation
  • Integration of multi-modal with uni-modal datasets

Additionally, multiHIVE's denoised gene expression cabn be factorized of into interpretable gene expression programs, facilitating the identification of biological processes at multiple levels of cellular hierarchy.

multiHIVE Architecture

Basic Installation

we recommend users to directly clone our stable main branch and set multiHIVE as the working directory and install following dependencies in a new conda environment python>=3.11 or conda create -n multiHIVE python==3.11

git clone https://github.com/Zafar-Lab/multiHIVE.git
pip install scvi-tools==1.3.0
pip install scanpy==1.11.0
pip install scikit-misc==0.3.1

Or install directly via pip

pip install multiHIVE

Tutorials

Explore the following tutorials to get started:

1. Main Script:

# features should genes followed by regions [genes, regions]
multiHIVE.setup_anndata(adata, batch_key="batch", protein_expression_obsm_key = "protein_expression")

vae = multiHIVE(adata,  
            n_genes=(adata.var["modality"] == "Gene Expression").sum(), # number of genes 
            n_regions=(adata.var["modality"] == "Peaks").sum(), # number of regions
            n_proteins=46, # number of proteins 
            latent_distribution="normal", kl_dot_product=True, deep_network=True)
vae.train()
vae.get_latent_representation()

2. Model Parameters:

Parameter Description
latent_distribution Distribution for latent variables (e.g., "normal")
kl_dot_product Enables regularization using dot-product of modality-specific latents
deep_network Uses deeper neural networks; recommended for datasets > 100,000 cells

3. Results:

  • vae.get_latent_representation() gives zs1, zs2, zr and zp or/and za
  • zs1 is the joint latent variable.
  • zs2 is the hierarchical joint latent variable.
  • zr is the gene modality specific latent variable.
  • zp is the protein modality specific latent variable.
  • za is the chromatin accessibility specific latent variable

Documentation

For more advanced settings, preprocessing tips, and API references, refer to the multiHIVE Documentation

Citation

multiHIVE: Hierarchical Multimodal Deep Generative Model for Single-cell Multiomics Integration
Anirudh Nanduri*, Musale Krushna Pavan*, Kushagra Pandey, Hamim Zafar
bioRxiv 2025.01.28.635222; doi: https://doi.org/10.1101/2025.01.28.635222
*Equal contribution

Contact

For questions, issues, or contributions, please open an issue on the GitHub repository

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

multihive-0.1.1.tar.gz (1.9 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

multihive-0.1.1-py3-none-any.whl (31.6 kB view details)

Uploaded Python 3

File details

Details for the file multihive-0.1.1.tar.gz.

File metadata

  • Download URL: multihive-0.1.1.tar.gz
  • Upload date:
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for multihive-0.1.1.tar.gz
Algorithm Hash digest
SHA256 a763100023d9358f8985d8aa1a873123a1ef1eaab31cef2d64406874dd30934d
MD5 a01378645fa5651608b468e39a0b4de2
BLAKE2b-256 e11e6e47d14e2ed179af04aecb615dfeb8ec71419c5e82b55850141d8d78f84c

See more details on using hashes here.

File details

Details for the file multihive-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: multihive-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 31.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for multihive-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 67219ffc738557c1d4ae035d0a8ea4913dd1a392ee21a0a2e950387a91ea1ac1
MD5 f33559ae8edc4451e0d144a316c5f602
BLAKE2b-256 086dacf4db13df5e5334b4f48a7cc6f409b0fc948388dd9bf498877a84ee5d45

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page