Pan-cancer detection of T-cell clonal expansion from single-cell RNA sequencing (CUDA-enabled)
Project description
Detect T-cell clonal expansion from single-cell RNA sequencing data without paired TCR sequencing
Preprint • Documentation • Installation • Quick Start • Usage Guide • Citation
A framework for predicting T-cell clonal expansion from single-cell RNA sequencing data.
Manuscript in preparation - detailed methodology and benchmarks coming soon.
View full documentation for comprehensive guides and API reference.
Features
- Multiple Model Architectures:
- Autoencoder-based: Encoder-decoder with reconstruction and classification heads
- MLP: Multi-layer perceptron
- LightGBM: Gradient boosted decision trees
- Linear Models: Logistic regression and support vector machines
- Scalable Processing: Handles millions of cells with memory-efficient data streaming from disk during training
- Automated Hyperparameter Optimization: Built-in Optuna integration for model tuning
Installation
For detailed installation instructions, please refer to our Installation Guide.
Published Version Install
CUDA version (NVIDIA GPU):
pip install --upgrade scxpand-cuda --extra-index-url https://download.pytorch.org/whl/cu128
CPU/Apple Silicon/Other GPUs:
pip install --upgrade scxpand
Development Setup (Install from Source)
See the Installation Guide
Quick Start
import scxpand
# Make sure that "your_data.h5ad" includes only T cells for the results to be meaningful
# Ensure that "your_data.var_names" are provided as Ensembl IDs (as the pre-trained models were trained using this gene representation)
# Please refer to our documentation for more information
# List available pre-trained models
scxpand.list_pretrained_models()
# Run inference with automatic model download
results = scxpand.run_inference(
model_name="pan_cancer_autoencoder", # default model
data_path="your_data.h5ad"
)
# Access predictions
predictions = results.predictions
if results.has_metrics:
print(f"AUROC: {results.get_auroc():.3f}")
See our Tutorial Notebook for a complete example with data preprocessing, T-cell filtering, gene ID conversion, and model application using a real breast cancer dataset.
Documentation
Setup & Getting Started:
- Installation Guide - Setup for local development of scXpand
- User Guide - Quick start and comprehensive workflow guide
- Data Format - Input data requirements and specifications
Using Pre-trained Models:
- Model Inference - Run predictions on new data with pre-trained models
Training Your Own Models:
- Model Training - Train models with CLI and programmatic API
- Hyperparameter Optimization - Automated model tuning with Optuna
Understanding Results:
- Model Architectures - Detailed architecture descriptions and configurations
- Evaluation Metrics - Performance assessment and interpretation
- Output Format - Understanding model outputs and results
📖 Full Documentation - Complete guides, API reference, and interactive tutorials
License
This project is licensed under the MIT License – see the LICENSE file for details.
Citation
If you use scXpand in your research, please cite:
Shorer, O., Amit, R., and Yizhak, K. (2025). scXpand: Pan-cancer detection of T-cell clonal expansion from single-cell RNA sequencing without paired single-cell TCR sequencing. Preprint at bioRxiv, https://doi.org/10.1101/2025.09.14.676069.
BibTeX
@article{shorer2025scxpand,
title={scXpand: Pan-cancer detection of T-cell clonal expansion from single-cell RNA sequencing without paired single-cell TCR sequencing},
author={Shorer, Ofir and Amit, Ron and Yizhak, Keren},
year={2025},
journal={bioRxiv},
doi={https://doi.org/10.1101/2025.09.14.676069}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file scxpand_cuda-0.4.3.tar.gz.
File metadata
- Download URL: scxpand_cuda-0.4.3.tar.gz
- Upload date:
- Size: 7.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.22
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9586200d0ab4e9dc856fa6f19e1283a6c536c0178e43deafda1b1afee091b1d9
|
|
| MD5 |
8b3be9299f8e22d2008c7a8f83b2075b
|
|
| BLAKE2b-256 |
9d9291310e9fbe40e5f0c59d11f9697cb5ee8bdcea17e315f501e9d01e7c139e
|
File details
Details for the file scxpand_cuda-0.4.3-py3-none-any.whl.
File metadata
- Download URL: scxpand_cuda-0.4.3-py3-none-any.whl
- Upload date:
- Size: 141.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.22
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6950d83218f39449193c765d58fe627a977ede8e2a7b9733b4fc1cb4e368935b
|
|
| MD5 |
68d84bdafaa0ac436f320ce4a69715d9
|
|
| BLAKE2b-256 |
452bd4ab72a44994933945025b586a7a9fd7d65b8acf14f3af7fae83bf90cdb3
|