multiHIVE, a hierarchical multimodal deep generative model for integrating multiomics data
Project description
multiHIVE
Overview
multiHIVE is a hierarchical multimodal deep generative model designed to infer cellular embeddings by integrating multi-omics data with different modalities from the same cell. It uses:
- Hierarchically stacked latent variables to capture shared biological signals
- Modality-specific latent variables to model private (modality-unique) variation
This enables multiHIVE to perform:
- Joint integration of multi-modal data
- Denoising
- Protein imputation
- Integration of multi-modal with uni-modal datasets
Additionally, multiHIVE's denoised gene expression cabn be factorized of into interpretable gene expression programs, facilitating the identification of biological processes at multiple levels of cellular hierarchy.
Basic Installation
we recommend users to directly clone our stable main branch and set multiHIVE as the working directory and install following dependencies in a new conda environment python>=3.11 or conda create -n multiHIVE python==3.11
git clone https://github.com/Zafar-Lab/multiHIVE.git
pip install scvi-tools==1.3.0
pip install scanpy==1.11.0
pip install scikit-misc==0.3.1
Or install directly via pip
pip install multiHIVE
Tutorials
Explore the following tutorials to get started:
- CITE-seq integration: Tutorials/CITE_seq_Integration.ipynb
- Protein imputation: Tutorials/Protein_Imputatoin.ipynb
- TEA-seq integration: Tutorials/TEA-seq_integration.ipynb
1. Main Script:
# features should genes followed by regions [genes, regions]
multiHIVE.setup_anndata(adata, batch_key="batch", protein_expression_obsm_key = "protein_expression")
vae = multiHIVE(adata,
n_genes=(adata.var["modality"] == "Gene Expression").sum(), # number of genes
n_regions=(adata.var["modality"] == "Peaks").sum(), # number of regions
n_proteins=46, # number of proteins
latent_distribution="normal", kl_dot_product=True, deep_network=True)
vae.train()
vae.get_latent_representation()
2. Model Parameters:
| Parameter | Description |
|---|---|
latent_distribution |
Distribution for latent variables (e.g., "normal") |
kl_dot_product |
Enables regularization using dot-product of modality-specific latents |
deep_network |
Uses deeper neural networks; recommended for datasets > 100,000 cells |
3. Results:
- vae.get_latent_representation() gives zs1, zs2, zr and zp or/and za
- zs1 is the joint latent variable.
- zs2 is the hierarchical joint latent variable.
- zr is the gene modality specific latent variable.
- zp is the protein modality specific latent variable.
- za is the chromatin accessibility specific latent variable
Documentation
For more advanced settings, preprocessing tips, and API references, refer to the multiHIVE Documentation
Citation
multiHIVE: Hierarchical Multimodal Deep Generative Model for Single-cell Multiomics Integration
Anirudh Nanduri*, Musale Krushna Pavan*, Kushagra Pandey, Hamim Zafar
bioRxiv 2025.01.28.635222; doi: https://doi.org/10.1101/2025.01.28.635222
*Equal contribution
Contact
For questions, issues, or contributions, please open an issue on the GitHub repository
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file multihive-0.1.1.tar.gz.
File metadata
- Download URL: multihive-0.1.1.tar.gz
- Upload date:
- Size: 1.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a763100023d9358f8985d8aa1a873123a1ef1eaab31cef2d64406874dd30934d
|
|
| MD5 |
a01378645fa5651608b468e39a0b4de2
|
|
| BLAKE2b-256 |
e11e6e47d14e2ed179af04aecb615dfeb8ec71419c5e82b55850141d8d78f84c
|
File details
Details for the file multihive-0.1.1-py3-none-any.whl.
File metadata
- Download URL: multihive-0.1.1-py3-none-any.whl
- Upload date:
- Size: 31.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
67219ffc738557c1d4ae035d0a8ea4913dd1a392ee21a0a2e950387a91ea1ac1
|
|
| MD5 |
f33559ae8edc4451e0d144a316c5f602
|
|
| BLAKE2b-256 |
086dacf4db13df5e5334b4f48a7cc6f409b0fc948388dd9bf498877a84ee5d45
|