Skip to main content

Aligned Cross-modal Integration and Characterization of Single-Cell Multiomic Data with Deep Contrastive Learning

Project description

scMDCF

scMDCF badge PyPI badge License

scMDCF is a python package containing tools for clustering single cell multi-omics data based on cross-modality contrastive learning to learn the common latent representation and assign clustering.

Overview

Single-cell multi-omics (scMulti-omics) technologies have revolutionized our understanding of cellular functions and interactions by enabling the simultaneous measurement of diverse cellular modalities. However, the inherent complexity, high-dimensionality, and heterogeneity of these datasets pose substantial challenges for integration and analysis across different modalities. To address these challenges, we develop a single-cell multi-omics deep learning model (scMDCF) based on contrastive learning, tailored for the efficient characterization and integration of scMulti-omics data. scMDCF features a cross-modality contrastive learning module that harmonizes data representations across different omics types, ensuring consistency while accommodating conditional entropy to preserve data heterogeneity. Furthermore, a cross-modality feature fusion module is designed to extract common low-dimensional latent representations of scMulti-omics data, effectively balancing the characteristics of these diverse omics data. Extensive empirical studies demonstrate that scMDCF outperforms existing state-of-the-art scMulti-omics models across various types of scMulti-omics data. In particular, scMDCF exhibits progressive capability in extracting cell-type specific peak-gene associations and cis-regulatory elements from SNARE-seq data, as well as in elucidating immune regulation from CITE-seq data. Furthermore, we demonstrate that in the post-BNT162b2 mRNA SARS‐CoV‐2 vaccination dataset, scMDCF successfully annotates specific vaccine-induced B cell subpopulations through integrative and multimodal analysis, uncovering dynamic interactions and regulatory mechanisms within the immune system after vaccination. The framework plot of scMDCF

System Requirements

Hardware requirements

scMDCF package requires only a standard computer with enough RAM to support the in-memory operations.

Software requirements

OS requirements

This package is supported for Linux. The package has been tested on the following systems:

  • Linux: Ubuntu 18.04

Python Dependencies

scMDCF mainly depends on the Python scientific stack. numpy pytorch scanpy pandas scikit-learn For specific setting, please see requirements.

Installation Guide

Install from PyPi

conda create -n scMDCF_env python=3.9.16
conda activate scMDCF_env
pip install scMDCF==1.1.3

Usage

scMDCF is a deep embedding learning method for single-cell multi-omics data clustering, which can be used to:

  • CITE-seq dataset clustering. The example can be seen in the main_CITE.py
  • SNARE-seq (paired RNA-seq and ATAC-seq) dataset clustering. The example can be seen in the main_SNARE.py

Data Availability

The datasets we used can be download in dataset

License

This project is covered under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scMDCF-1.1.3.tar.gz (9.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scMDCF-1.1.3-py2.py3-none-any.whl (8.4 kB view details)

Uploaded Python 2Python 3

File details

Details for the file scMDCF-1.1.3.tar.gz.

File metadata

  • Download URL: scMDCF-1.1.3.tar.gz
  • Upload date:
  • Size: 9.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.9.16

File hashes

Hashes for scMDCF-1.1.3.tar.gz
Algorithm Hash digest
SHA256 9f6d10cc692ca46978065d559723fe6485a812bf4678e198bf7491ba9d050c3c
MD5 2b5f87cd27ae6d07df1dbc53fa6f017d
BLAKE2b-256 1ba3da9bf0b7673308024d39b65f8e37e104d1851b73bf62e2ad092f9fa168c9

See more details on using hashes here.

File details

Details for the file scMDCF-1.1.3-py2.py3-none-any.whl.

File metadata

  • Download URL: scMDCF-1.1.3-py2.py3-none-any.whl
  • Upload date:
  • Size: 8.4 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.9.16

File hashes

Hashes for scMDCF-1.1.3-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 56ba81d77b06331f42ccc929779ca2279b2bc565b1a8cf7e2f4b1cc7cb4eb3bc
MD5 d32a498b1bd64e468d63909542241a83
BLAKE2b-256 ec7c9951d879d7546cee7191b9d1cd1b886d39f0df255c4be0d249e27828aa1c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page