Skip to main content

Multi-resolution Variational Inference

Project description

mrvi

Tests Documentation

Multi-resolution Variational Inference.

🚧 :warning: [DEPRECATED] For the latest version of MrVI, please use the implementation in the scvi-tools package (from scvi.external import MRVI). This package will no longer be maintained. :warning: 🚧

Getting started

Please refer to the documentation. In particular, the

Installation

You need to have Python 3.8 or newer installed on your system. If you don't have Python installed, we recommend installing Miniconda.

There are several alternative options to install mrvi:

  1. Install the latest pre-release of mrvi from PyPI <https://pypi.org/project/mrvi/>_:
pip install --pre mrvi
  1. Install the latest development version:
pip install git+https://github.com/YosefLab/mrvi.git@main

User guide

While a more comprehensive user guide is in the works, you can find here a brief overview of the main features of mrvi.

Data preparation: MrVI relies on scvi-tools routines for model initialization and training. In particular, mrvi assumes data to be stored in an AnnData object. A first step is to load the data and register it, as follows:

from mrvi import MrVI

MrVI.setup_anndata(adata, sample_key="my_sample_key", batch_key="my_batch_key")

where here 'my_sample_key' and 'my_batch_key' are expected to be keys of adata.obs that contain the sample and batch assignments, respectively.

Model training: The next step is to initialize and train the model, which can be done via:

model = MrVI(adata)
model.train()

Once the model is trained, we recommend visualizing the validation ELBO to assess convergence, which is stored in model.history["elbo_validation"]. If the ELBO has not converged, you should consider training the model for more epochs.

Latent space visualization: MrVI contains two latent spaces, u, that captures global cell-type variations, and z, that additionally captures sample-specific variations. These two latent representations can be accessed via model.get_latent_representation(), (with give_z=True to access z). In particular, these latent variables can be seemlessly used for data visualization or clustering using scanpy. For instance, visualizing the u latent space can be done via:

import scanpy as sc
from scvi.model.utils import mde

u = model.get_latent_representation()
u_mde = mde(u)
adata.obsm["u_mde"] = u_mde
sc.pl.embedding(adata, basis="u_mde")

Computing sample-sample dissimilarities: MrVI can be used to predict sample-sample dissimilarities, using the following snippet:

# Predict sample-sample dissimilarities per cell type
dists = model.get_local_sample_distances(
    adata, keep_cell=False, groupby="initial_clustering", batch_size=32
)

# OR predict sample-sample dissimilarities for EACH cell
# WARNING: this can be slow and memory-intensive for large datasets
dists = model.get_local_sample_distances(adata, keep_cell=True, batch_size=32)

These dissimilarities can then be visualized via seaborn.clustermap or similar tools.

DE analysis: MrVI can be used to identify differentially expressed genes (DEGs) between two groups of samples at the single-cell level. Here, "samples" refere to the sample_key provided in MrVI.setup_anndata. Identifying such genes can be done as follows,

sample_cov_keys = ["Status"]  # Here, Status is the sample covariate of interest
de_res = model.differential_expression(
    sample_cov_keys=sample_cov_keys,
)

DA analysis: MrVI can also be used to assess differences in cell-type compositions across sample groups, using the following snippet:

da_res = model.differential_abundance(sample_cov_keys=sample_cov_keys)
A_log_probs = da_res.Status_log_probs.loc[{"Status": "A"}]
B_log_probs = da_res.Status_log_probs.loc[{"Status": "B"}]
A_B_log_prob_ratio = A_log_probs - B_log_probs

Release notes

See the changelog.

Contact

For questions and help requests, you can reach out in the scverse discourse. If you found a bug, please use the issue tracker.

Citation

Deep generative modeling of sample-level heterogeneity in single-cell genomics

Pierre Boyeau, Justin Hong, Adam Gayoso, Martin Kim, Jose L. McFaline-Figueroa, Michael I. Jordan, Elham Azizi, Can Ergen, & Nir Yosef

bioRxiv 2024 May 10. doi: 10.1101/2022.10.04.510898.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mrvi-1.0.1.tar.gz (40.1 kB view details)

Uploaded Source

Built Distribution

mrvi-1.0.1-py3-none-any.whl (27.0 kB view details)

Uploaded Python 3

File details

Details for the file mrvi-1.0.1.tar.gz.

File metadata

  • Download URL: mrvi-1.0.1.tar.gz
  • Upload date:
  • Size: 40.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.27.0

File hashes

Hashes for mrvi-1.0.1.tar.gz
Algorithm Hash digest
SHA256 9192388c0b4602e4c8d9454c9f6dbc46fdb367bb19bce2594ab21728a9b19bd9
MD5 64b001d3c6c4c992dbfee895ba7d2160
BLAKE2b-256 73be5fd2a03fd7806c6b52592eb59b273632398db8b55f89f9e9ffc3b469e6b3

See more details on using hashes here.

File details

Details for the file mrvi-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: mrvi-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 27.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.27.0

File hashes

Hashes for mrvi-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6e60fe82c111733bf46a40c1525cdb6e35276d4784dd493e044f0b7f30e01427
MD5 f3d918ac2d1bd61993d32e8c83f658cd
BLAKE2b-256 3306333e59a6f2a49a175ef2d5d14add22644aedc58bcc14e86a89cef7551a68

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page