Skip to main content

FeatureMAP

Project description

FeatureMAP Illustration

FeatureMAP: Feature-preserving Manifold Approximation and Projection

Visualizing single-cell data is essential for understanding cellular heterogeneity and dynamics. FeatureMAP enhances this process by introducing gene projection and transition/core states, providing deeper insights into cellular states. While traditional methods like UMAP and t-SNE effectively capture clustering, they often overlook critical gene-level information. FeatureMAP addresses this limitation by integrating concepts from UMAP and PCA, preserving both clustering structures and gene feature variations within a low-dimensional space.

Description

FeatureMAP presents a novel approach by enhancing manifold learning with pairwise tangent space embedding, ensuring the retention of crucial cellular data features. It introduces two visualization plots: expression embedding (GEX) and variation embedding (GVA).

Here, we demonstrate its effectiveness using a synthetic dataset from (BEELINE) based on a bifurcation model. Compared to UMAP, FeatureMAP-GEX better preserves cell density, while FeatureMAP-GVA clearly delineates developmental paths.

Transition and Core States

Besides the two-dimensional visualization, FeatureMAP presents three key concepts:

  1. Gene Projection: Estimating and projecting gene feature loadings, where arrows indicate the direction and magnitude of gene expression changes. Gene Projection

  2. Transition and Core States: Transition and core states are computationally defined based on cell density, curvature, and betweenness centrality. Transition states are characterized by the lowest cell densities, maximal curvature, and highest betweenness centrality, whereas core states exhibit the highest cell densities, minimal curvature, and lowest betweenness centrality.

    Transition and Core States
  3. Differential Gene Variation (DGV) Analysis: The third concept introduces differential gene variation (DGV) analysis, which compares transition and core states to identify genes with significant variability. By quantifying gene variation between dynamic transition states and stable core states, DGV highlights regulatory genes likely driving cell-state transitions and differentiation.

    DGV

FeatureMAP, a feature-preserving method, enhances the visualization and interpretation of single-cell data. Through analyses of both synthetic and real scRNA-seq data (TUTORIAL), FeatureMAP effectively captures intricate clustering structures and identifies key regulatory genes, offering significant advantages for single-cell data analysis.

Getting Started

Dependencies

  • Python 3.8 or higher
  • Required Python libraries: numpy, scipy, matplotlib, umap-learn, scikit-learn
  • Operating System: Any (Windows, macOS, Linux)

Installation

Install directly using pip:

pip install featuremap

How to use FeatureMAP

Data Visualization

To apply FeatureMAP in Python with a data matrix (data), where rows represent cells and columns represent genes, use the following command:

import featuremap
v_emb = featuremap.FeatureMAP(output_variation=True).fit_transform(data_pca)
x_emb = featuremap.FeatureMAP(output_variation=False).fit_transform(data_pca)


Parameters:

output_variation: bool (False by default). Decide to generate expression embedding or variation embedding.

Outputs

x_emb: expession embedding to show the clustering

v_emb: variation embedding to show the trajectory

Documentation

More tutorials are at https://featuremap.readthedocs.io/en/latest/index.html.

Citation

Our FeatureMAP alogrithm is based on the paper

Yang, Yang, et al. "Interpretable Dimensionality Reduction by Feature Preserving Manifold Approximation and Projection." arXiv preprint arXiv:2211.09321 (2022).

License

The FeatureMAP package is under BSD-3-Clause license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

featuremap-0.0.1.tar.gz (57.5 kB view details)

Uploaded Source

Built Distribution

featuremap-0.0.1-py3-none-any.whl (56.7 kB view details)

Uploaded Python 3

File details

Details for the file featuremap-0.0.1.tar.gz.

File metadata

  • Download URL: featuremap-0.0.1.tar.gz
  • Upload date:
  • Size: 57.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.15

File hashes

Hashes for featuremap-0.0.1.tar.gz
Algorithm Hash digest
SHA256 39fe9866869fd97ad2e26721c33b02d2ed4c8389d875714fefe222b18dfccad7
MD5 fbf14e7a041d18f29bba21753dde2468
BLAKE2b-256 2a5ec73b2f99a5fe6519346b48181ebcc49cfeab5d06134dc0f2f5f214b34bec

See more details on using hashes here.

File details

Details for the file featuremap-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: featuremap-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 56.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.15

File hashes

Hashes for featuremap-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 fa03c6b75f914481b8c5d0768f0b35e271c32d440cbe7292b418ea1138dd2679
MD5 35e0e1c6cb0c9ac4bcfb36da34491a14
BLAKE2b-256 68f960ff4187c4e5004ec58a26f3f2aa3aef66cc7d7aea7a05b7385d5b10b912

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page