
Feature-preserving Manifold Approximation and Projection

FeatureMAP: Feature-preserving Manifold Approximation and Projection

Visualizing single-cell data is essential for understanding cellular heterogeneity and dynamics. FeatureMAP enhances this process by introducing gene projection and transition/core states, providing deeper insights into cellular states. While traditional methods like UMAP and t-SNE effectively capture clustering, they often overlook critical gene-level information. FeatureMAP addresses this limitation by integrating concepts from UMAP and PCA, preserving both clustering structures and gene feature variations within a low-dimensional space.

Description

FeatureMAP presents a novel approach by enhancing manifold learning with pairwise tangent space embedding, ensuring the retention of crucial cellular data features. It introduces two visualization plots: expression embedding (GEX) and variation embedding (GVA).

Here, we demonstrate its effectiveness using a synthetic dataset from BEELINE based on a bifurcation model. Compared to UMAP, FeatureMAP-GEX better preserves cell density, while FeatureMAP-GVA clearly delineates developmental paths.

Transition and Core States

Besides the two-dimensional visualization, FeatureMAP presents three key concepts:

  1. Gene Projection: Estimating and projecting gene feature loadings, where arrows indicate the direction and magnitude of gene expression changes.

  2. Transition and Core States: Transition and core states are computationally defined based on cell density, curvature, and betweenness centrality. Transition states are characterized by the lowest cell densities, maximal curvature, and highest betweenness centrality, whereas core states exhibit the highest cell densities, minimal curvature, and lowest betweenness centrality.

  3. Differential Gene Variation (DGV) Analysis: DGV analysis compares transition and core states to identify genes with significant variability. By quantifying gene variation between dynamic transition states and stable core states, it highlights regulatory genes likely driving cell-state transitions and differentiation.
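
The definitions above can be illustrated with a toy ranking. The sketch below is not FeatureMAP's implementation: it assumes per-cell density, curvature, and betweenness centrality values are already available, combines their ranks into a hypothetical score, labels the extremes as transition or core cells, and then compares one gene's variance between the two groups as a crude stand-in for DGV. All function names, the 25% cutoff, and the variance-ratio statistic are assumptions made for illustration.

```python
from statistics import pvariance


def ranks(values):
    """Rank values from 0 (smallest) to n-1 (largest); ties keep input order."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order):
        result[index] = rank
    return result


def transition_scores(density, curvature, betweenness):
    """Higher score = more transition-like (low density, high curvature, high betweenness)."""
    n = len(density)
    d, c, b = ranks(density), ranks(curvature), ranks(betweenness)
    # Invert the density rank so that sparse regions score high.
    return [(n - 1 - d[i]) + c[i] + b[i] for i in range(n)]


def label_states(scores, fraction=0.25):
    """Label the top fraction 'transition', the bottom fraction 'core', the rest 'other'."""
    n = len(scores)
    k = max(1, int(n * fraction))
    order = sorted(range(n), key=lambda i: scores[i])
    labels = ["other"] * n
    for i in order[:k]:
        labels[i] = "core"
    for i in order[-k:]:
        labels[i] = "transition"
    return labels


def variance_ratio(gene_values, labels):
    """Variance of one gene in transition cells divided by its variance in core cells."""
    trans = [v for v, s in zip(gene_values, labels) if s == "transition"]
    core = [v for v, s in zip(gene_values, labels) if s == "core"]
    core_var = pvariance(core)
    return float("inf") if core_var == 0 else pvariance(trans) / core_var


# Toy data for 8 cells: cells 3-5 sit in a sparse, curved, high-traffic region.
density = [9.0, 8.5, 8.0, 2.0, 1.0, 1.5, 7.5, 7.0]
curvature = [0.10, 0.15, 0.20, 0.80, 1.00, 0.90, 0.25, 0.30]
betweenness = [0.00, 0.05, 0.10, 0.70, 0.90, 0.80, 0.15, 0.20]
gene = [5.0, 5.2, 5.1, 3.0, 1.0, 4.0, 5.0, 4.9]

labels = label_states(transition_scores(density, curvature, betweenness))
print(labels)
print(variance_ratio(gene, labels) > 1)  # gene varies more in transition cells
```

A rank sum is used here only because it puts the three quantities on a common scale without tuning; the package may weight or threshold them differently.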

FeatureMAP is a feature-preserving method that enhances the visualization and interpretation of single-cell data. In analyses of both synthetic and real scRNA-seq data (see the tutorial), it captures intricate clustering structures and identifies key regulatory genes.

Getting Started

Dependencies

  • Python 3.8 or higher
  • Required Python libraries: numpy, scipy, matplotlib, umap-learn, scikit-learn
  • Operating System: Any (Windows, macOS, Linux)

Installation

Install directly using pip:

pip install featuremap
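
A quick way to confirm that the environment meets the dependencies listed above is to look up each import name without importing it. The sketch below is only a convenience check and not part of FeatureMAP; the helper names `missing_modules` and `python_is_supported` are hypothetical, and it assumes the usual import names `umap` for umap-learn and `sklearn` for scikit-learn.

```python
import importlib.util
import sys

# Import names for the libraries listed under Dependencies, plus featuremap.
# Note: umap-learn is imported as "umap" and scikit-learn as "sklearn".
REQUIRED_MODULES = ["numpy", "scipy", "matplotlib", "umap", "sklearn", "featuremap"]


def missing_modules(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def python_is_supported(minimum=(3, 8)):
    """Check the running interpreter against the stated minimum version."""
    return sys.version_info >= minimum


if __name__ == "__main__":
    print("Python 3.8+:", python_is_supported())
    print("Missing modules:", missing_modules(REQUIRED_MODULES) or "none")
```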

How to use FeatureMAP

Data Visualization

To apply FeatureMAP in Python to a data matrix (data), where rows represent cells and columns represent genes, use the following code:

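import featuremap

# Variation embedding (GVA): shows the trajectory
v_emb = featuremap.FeatureMAP(output_variation=True).fit_transform(data)
# Expression embedding (GEX): shows the clustering
x_emb = featuremap.FeatureMAP(output_variation=False).fit_transform(data)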


Parameters:

output_variation: bool (False by default). If False, the expression embedding is returned; if True, the variation embedding is returned.

Outputs

x_emb: expression embedding to show the clustering

v_emb: variation embedding to show the trajectory
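
To make the row and column convention explicit, the sketch below wraps the two calls shown above in a small helper. The names `check_cells_by_genes` and `embed_both` are hypothetical and not part of the package; the only FeatureMAP API used is the `FeatureMAP(output_variation=...)` constructor and `fit_transform` from the example above. The imports of numpy and featuremap are deferred so that the shape check can be used on its own.

```python
def check_cells_by_genes(matrix):
    """Validate a rectangular cells-by-genes matrix and return (n_cells, n_genes)."""
    rows = [list(row) for row in matrix]
    if not rows or not rows[0]:
        raise ValueError("data must contain at least one cell and one gene")
    n_genes = len(rows[0])
    if any(len(row) != n_genes for row in rows):
        raise ValueError("every cell (row) must have the same number of genes")
    return len(rows), n_genes


def embed_both(matrix):
    """Return (x_emb, v_emb): the expression and variation embeddings."""
    # Deferred imports: only needed when the embeddings are actually computed.
    import numpy as np
    import featuremap

    check_cells_by_genes(matrix)
    data = np.asarray(matrix, dtype=float)
    x_emb = featuremap.FeatureMAP(output_variation=False).fit_transform(data)
    v_emb = featuremap.FeatureMAP(output_variation=True).fit_transform(data)
    return x_emb, v_emb


# Toy input: 3 cells (rows) by 2 genes (columns).
toy = [[1.0, 0.0], [0.5, 2.0], [0.0, 3.0]]
print(check_cells_by_genes(toy))  # (3, 2)
```

With the package installed, `x_emb, v_emb = embed_both(data)` would mirror the two-line example above. The toy matrix only demonstrates the shape check; a real dataset needs many more cells for a meaningful embedding.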

Documentation

More tutorials are at https://featuremap.readthedocs.io/en/latest/index.html.

Citation

The FeatureMAP algorithm is based on the following paper:

Yang, Yang, et al. "Interpretable Dimensionality Reduction by Feature Preserving Manifold Approximation and Projection." arXiv preprint arXiv:2211.09321 (2022).

License

The FeatureMAP package is released under the BSD-3-Clause license.
