multiscale_phate
Project description
Multiscale PHATE
Multiscale PHATE is a python package for multiresolution analysis of high dimensional data. For an in-depth explanation of the algorithm and applications, please read our manuscript on BioRxiv.
The biomedical community is producing increasingly high dimensional datasets integrated from hundreds of patient samples that current computational techniques are unable to explore. Current tools for dimensionality reduction, such as tSNE, UMAP, and PCA, and clustering, such as Louvain and Leiden, only show a single salient level of granularity in biomedical data. When applied to cellular datasets currently being produced, these techniques are able to visualize and cluster major cell types such as B cells, T cells and myeloid cells. Differences between patient disease states, however, may not be found at the granularity of cell type alone. In fact, appreciation of a finer resolution the manifold would reveal subsets that may be predictive of outcome. This phenomenon is found across biomedical data science, as the cellular state space is known to form a collection of sub-manifolds that disease status can differentially affect.
The goal of Multiscale PHATE is to learn and visualize abstract cellular features and groupings of the data at all levels of granularity in an efficient manner to identify meaningful resolutions. Our approach learns a tree of data granularities which can be cut at coarse levels for high level summarizations of data as well as at fine levels for detailed representations on subsets. Our algorithm is based on a dynamic process we have developed called diffusion condensation, that computes a manifold-intrinsic diffusion space on the original data before slowly condensing data points towards local centers of gravity to form natural, data-driven groupings across multiple granularities. While this may sound computationally inefficient, we show that we are able to perform these calculations as well as visualize and cluster the data significantly faster than “single-scale” visualization techniques like tSNE, UMAP or PHATE, allowing the analysis of millions of cells within minutes. When combined with other computational algorithms for high dimensional data analysis, such as MELD, DREMI and TrajcetoryNet, Multiscale PHATE is able to provide deep and detailed insights in biological processes.
Installation
Multiscale PHATE is available on pip
. Install by running the following in a terminal:
pip install --user git+https://github.com/KrishnaswamyLab/Multiscale_PHATE
Quick Start
import multiscale_phate
mp_op = multiscale_phate.Multiscale_PHATE()
mp_embedding, mp_clusters, mp_sizes, tree = mp_op.fit_transform(X)
# Plot optimal visualization
scprep.plot.scatter2d(mp_embedding, s = mp_sizes, c = mp_clusters,
fontsize=16, ticks=False,label_prefix="Multiscale PHATE", figsize=(16,12))
Guided Tutorial
For more details on using Multiscale PHATE, see our guided tutorial using 10X's public PBMC4k dataset.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file multiscale_phate-0.0.tar.gz
.
File metadata
- Download URL: multiscale_phate-0.0.tar.gz
- Upload date:
- Size: 27.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.52.0 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 09a32aa483768ec235679146adeb6c5ca3d4de43b69001a4349847a503813cc0 |
|
MD5 | c70bdf822c0534a8baeca4ed1416d4ee |
|
BLAKE2b-256 | 82309f6a47268209197fad84c125802942d77b608bad0c8df4864ddeec484e5f |
File details
Details for the file multiscale_phate-0.0-py3-none-any.whl
.
File metadata
- Download URL: multiscale_phate-0.0-py3-none-any.whl
- Upload date:
- Size: 28.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.52.0 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3cb46ef9df4061b1dc6d3908ef7149ca209fd3f2cf475ebfb93cfd084e3faed2 |
|
MD5 | 4b8079e58b8f83e8ba53cdf7aff4f678 |
|
BLAKE2b-256 | f15ecf74348a31d5da281477a730b99e9e8b4622c1786a7108f28c9c1fec3f1d |