PHASE: PHenotype prediction with Attention mechanisms for Single-cell Exploring

PHASE utilizes an attention-based neural network framework to predict clinical phenotypes from scRNA-seq data while providing interpretability of key features linked to phenotypic outcomes at both the gene and cell levels. PHASE consists of several components:

- A data-preprocessing procedure
- A gene feature embedding module
- A self-attention (SA) module for cell embedding learning
- An attention-based deep multiple instance learning (AMIL) module for aggregating all single-cell information within a sample
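
To make the aggregation idea concrete, here is a minimal sketch of attention-based MIL pooling in its commonly used form: each cell embedding gets a score `w^T tanh(V h)`, the scores are normalised with a softmax over the cells of a sample, and the sample embedding is the attention-weighted sum of the cell embeddings. This is not PHASE's own code. The function name `attention_pool`, the weights `V` and `w`, and the toy dimensions are assumptions for illustration, and plain Python is used instead of PyTorch so that the example runs without extra dependencies.

```python
import math
import random


def attention_pool(cells, V, w):
    """Aggregate cell embeddings into one sample embedding.

    cells: list of N cell embeddings, each a list of D floats
    V:     hidden projection, L rows of D floats (hypothetical learned weights)
    w:     scoring vector of L floats (hypothetical learned weights)
    """
    # Unnormalised score per cell: w^T tanh(V h)
    scores = []
    for h in cells:
        hidden = [math.tanh(sum(v * x for v, x in zip(row, h))) for row in V]
        scores.append(sum(wk * zk for wk, zk in zip(w, hidden)))

    # Softmax over the cells of the sample (max subtracted for stability)
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    attention = [e / total for e in exps]

    # Sample embedding: attention-weighted sum of the cell embeddings
    dim = len(cells[0])
    sample = [sum(a * h[d] for a, h in zip(attention, cells)) for d in range(dim)]
    return sample, attention


# Toy data: 5 cells with 4-dimensional embeddings, random stand-in weights
rng = random.Random(0)
cells = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(5)]
V = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(3)]
w = [rng.gauss(0, 1) for _ in range(3)]

sample_embedding, attention = attention_pool(cells, V, w)
```

The returned `attention` list holds one weight per cell and sums to 1, which is what makes per-cell attention scores usable for interpretation.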

The manuscript is available as a preprint on bioRxiv:

Qinhua Wu, Junxiang Ding, Ruikun He, Lijian Hui, Junwei Liu, Yixue Li. Exploring phenotype-related single-cells through attention-enhanced representation learning. bioRxiv (2024). https://doi.org/10.1101/2024.10.31.619327

Installation

Installing the PHASE package

PHASE is written in Python and can be installed using pip:

pip install phase-sc

Requirements

PHASE should run in any environment where Python is available and uses PyTorch for computation. Training can run on CPUs alone or with GPU acceleration, so a powerful GPU is not required.

The PHASE pipeline

1. Predict clinical phenotypes from scRNA-seq data
- 1.1 Data preprocessing: Encode the data into a format that can be read by PHASE.
- 1.2 Gene feature embedding: Extract and represent gene features.
- 1.3 Self-attention (SA): Learn cell embeddings.
- 1.4 Attention-based deep multiple instance learning (AMIL): Aggregate all single-cell information within a sample.
2. Provide interpretability of key phenotype-related features
- 2.1 Attribution analysis: Use Integrated Gradients (IG) to link genes to phenotypes via attribution scores.
- 2.2 Attention analysis: Use AMIL attention scores to relate individual cells to the phenotype.
- 2.3 Conjoint analysis: Correlate top genes' expression levels with cells' attention scores to reveal gene-cell contributions to the phenotype.
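
As an illustration of the attribution step, the sketch below computes Integrated Gradients for a toy model. IG attributes a prediction to each input feature by averaging the model's gradient along the straight path from a baseline to the input, then scaling by the difference between input and baseline. The logistic `model`, its `weights` and `bias`, the all-zero baseline, and the three "gene" values are assumptions made up for this example. PHASE applies IG to its own trained network, which is not reproduced here.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def model(x, weights, bias):
    # Toy stand-in for a trained predictor: logistic regression on gene values
    return sigmoid(sum(w * v for w, v in zip(weights, x)) + bias)


def model_grad(x, weights, bias):
    # Analytic gradient of the logistic model with respect to each input
    p = model(x, weights, bias)
    return [w * p * (1.0 - p) for w in weights]


def integrated_gradients(x, baseline, weights, bias, steps=200):
    """Approximate IG with a midpoint Riemann sum along the straight path."""
    n = len(x)
    avg_grad = [0.0] * n
    for k in range(steps):
        alpha = (k + 0.5) / steps
        point = [b + alpha * (xi - b) for xi, b in zip(x, baseline)]
        grad = model_grad(point, weights, bias)
        for i in range(n):
            avg_grad[i] += grad[i] / steps
    # Scale the path-averaged gradient by (input - baseline)
    return [(xi - b) * g for xi, b, g in zip(x, baseline, avg_grad)]


weights = [1.5, -2.0, 0.0]     # hypothetical model weights, one per gene
bias = 0.1
expression = [2.0, 1.0, 3.0]   # hypothetical expression values for three genes
baseline = [0.0, 0.0, 0.0]     # all-zero baseline (an assumption)

attributions = integrated_gradients(expression, baseline, weights, bias)
```

A useful sanity check is the completeness property: the attributions sum, up to the approximation error of the Riemann sum, to the difference between the model output at the input and at the baseline. The third gene has zero weight in the toy model and therefore receives zero attribution.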

Usage

Command Line Arguments

The following table lists the command line arguments available for training the model:

Abbreviation	Parameter	Description
-t	--type	Type of task: classification or regression.
-p	--path	Path to the dataset.
-r	--result	Path to the directory where results will be saved.
-e	--epoch	Number of training epochs (default: 100).
-l	--learningrate	Learning rate for the optimizer (default: 0.00001).
-d	--devices	List of GPU device IDs to use for training (default: first GPU).

Each argument is required unless a default value is specified.
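
The table can be mirrored with a standard `argparse` parser, which makes the required arguments and the defaults explicit. This is a reconstruction from the table only, not the parser shipped with the package. The function name `build_parser`, the restriction of `--type` to two choices, the use of device index `0` for "first GPU", and the paths in the example call are assumptions.

```python
import argparse


def build_parser():
    # Hypothetical parser that mirrors the argument table above
    parser = argparse.ArgumentParser(description="Train a PHASE model (sketch).")
    parser.add_argument("-t", "--type", required=True,
                        choices=["classification", "regression"],
                        help="Type of task.")
    parser.add_argument("-p", "--path", required=True,
                        help="Path to the dataset.")
    parser.add_argument("-r", "--result", required=True,
                        help="Directory where results will be saved.")
    parser.add_argument("-e", "--epoch", type=int, default=100,
                        help="Number of training epochs.")
    parser.add_argument("-l", "--learningrate", type=float, default=0.00001,
                        help="Learning rate for the optimizer.")
    # Assumption: "first GPU" is represented as device index 0
    parser.add_argument("-d", "--devices", type=int, nargs="+", default=[0],
                        help="GPU device IDs to use for training.")
    return parser


# Example: only the required arguments are given, so the defaults apply
args = build_parser().parse_args(
    ["-t", "classification", "-p", "data/", "-r", "results/"]
)

# Example: overriding the defaults, with two GPU device IDs
args_multi = build_parser().parse_args(
    ["-t", "regression", "-p", "data/", "-r", "results/",
     "-e", "50", "-d", "0", "1"]
)
```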
