Skip to main content

multiHIVE, a hierarchical multimodal deep generative model for integrating multiomics data

Project description

multiHIVE

Overview

multiHIVE is a hierarchical multimodal deep generative model designed to infer cellular embeddings by integrating multi-omics data with different modalities from the same cell. It uses:

  • Hierarchically stacked latent variables to capture shared biological signals
  • Modality-specific latent variables to model private (modality-unique) variation

This enables multiHIVE to perform:

  • Joint integration of multi-modal data
  • Denoising
  • Protein imputation
  • Integration of multi-modal and uni-modal datasets

Additionally, multiHIVE enables factorization of denoised gene expression into interpretable gene expression programs, facilitating the identification of biological processes at multiple levels of cellular hierarchy.

multiHIVE Architecture

Basic Installation

we recommend users to directly clone our stable main branch and set multiHIVE as the working directory and install following dependencies in a new conda environment python>=3.11

git clone https://github.com/Zafar-Lab/multiHIVE.git
pip install scvi-tools==1.3.0
pip install scanpy==1.11.0
pip install scikit-misc

Or install directly via pip

pip install multiHIVE

Tutorials

Explore the following tutorials to get started:

1. Main Script:

# features should genes followed by regions [genes, regions]
multiHIVE.setup_anndata(adata, batch_key="batch", protein_expression_obsm_key = "protein_expression")

vae = multiHIVE(adata,  
            n_genes=(adata.var["modality"] == "Gene Expression").sum(), # number of genes 
            n_regions=(adata.var["modality"] == "Peaks").sum(), # number of regions
            n_proteins=46, # number of proteins 
            latent_distribution="normal", kl_dot_product=True, deep_network=True)
vae.train()
vae.get_latent_representation()

2. Model Parameters:

Parameter Description
latent_distribution Distribution for latent variables (e.g., "normal")
kl_dot_product Enables regularization using dot-product of modality-specific latents
deep_network Uses deeper neural networks; recommended for datasets > 100,000 cells

3. Results:

  • vae.get_latent_representation() gives zs1, zs2, zr and zp or/and za
  • zs1 is the joint latent variable.
  • zs2 is the hierarchical joint latent variable.
  • zr is the gene modality specific latent variable.
  • zp is the protein modality specific latent variable.
  • za is the chromatin accessibility specific latent variable

Documentation

For more advanced settings, preprocessing tips, and API references, refer to the multiHIVE Documentation (link coming soon)

Citation

multiHIVE: Hierarchical Multimodal Deep Generative Model for Single-cell Multiomics Integration
Anirudh Nanduri*, Musale Krushna Pavan*, Kushagra Pandey, Hamim Zafar
bioRxiv 2025.01.28.635222; doi: https://doi.org/10.1101/2025.01.28.635222
*Equal contribution

Contact

For questions, issues, or contributions, please open an issue on the GitHub repository

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

multihive-0.1.0.tar.gz (1.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

multihive-0.1.0-py3-none-any.whl (30.5 kB view details)

Uploaded Python 3

File details

Details for the file multihive-0.1.0.tar.gz.

File metadata

  • Download URL: multihive-0.1.0.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.28.1

File hashes

Hashes for multihive-0.1.0.tar.gz
Algorithm Hash digest
SHA256 bdef1df294f6e8e2552d91fc0959805282e4d46bb090aae99511c718bf21a3bb
MD5 dba534414ad9623dfae1f769bb1996bc
BLAKE2b-256 5f0673d2dda7a5d4883b9c01d8d7096b13b1466b4577412abafc7b9a63c90fd9

See more details on using hashes here.

File details

Details for the file multihive-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: multihive-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 30.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.28.1

File hashes

Hashes for multihive-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5834355bfa4a31fd7f12e4a3e6205161c421edcd9470bfd6233b82dcc76d1295
MD5 168f930ab39f5f3bf23dca514d97eb1f
BLAKE2b-256 608cda955199a8068ef1cac6e23a0a67c8485852c754d7f10b1dd2ebb8175847

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page