Pan-cancer detection of T-cell clonal expansion from single-cell RNA sequencing
Project description
scXpand
Pan-cancer detection of T-cell clonal expansion from single-cell RNA sequencing without paired single-cell TCR sequencing
Documentation • Installation • Quick Start • Usage Examples • Data Format • Output Format • Model Architectures • Citation
A framework for predicting T-cell clonal expansion from single-cell RNA sequencing data.
Manuscript in preparation - detailed methodology and benchmarks coming soon.
View full documentation for comprehensive guides and API reference.
Features
- Multiple Model Architectures: Autoencoder, MLP, LightGBM, Logistic Regression, and SVM for comprehensive analysis
- Scalable Processing: Handles millions of cells with memory-efficient data streaming from disk during training
- Automated Hyperparameter Optimization: Built-in Optuna integration for model tuning
Installation
pip install scxpand
Quick Start
import scxpand
# List available pre-trained models
scxpand.list_pretrained_models()
# Run inference with automatic model download
results = scxpand.run_inference_with_pretrained(
model_name="autoencoder_pan_cancer",
data_path="your_data.h5ad"
)
Or via command line:
# Pre-trained model inference (curated models)
scxpand predict --data_path your_data.h5ad --model_name autoencoder_pan_cancer
# Direct DOI inference (any Zenodo model - seamless sharing!)
scxpand predict --data_path your_data.h5ad --model_doi 10.5281/zenodo.1234567
# Local model inference
scxpand predict --data_path your_data.h5ad --model_path results/my_model
Development
For development installation and model training, see the documentation.
Model Architectures
scXpand provides multiple model architectures to suit different use cases and data characteristics:
Autoencoder-based Classifiers
Architecture featuring an encoder with auxiliary decoder for reconstruction and classifier head for expansion prediction. This approach leverages representation learning to capture complex patterns in single-cell data.
Multi-Layer Perceptron (MLP)
Standard feed-forward neural networks for direct expansion prediction.
LightGBM
Gradient boosting for classification tasks with excellent performance on tabular data.
Linear Models
Classical machine learning approaches including logistic regression and support vector machines.
License
This project is licensed under the MIT License – see the LICENSE file for details.
Citation
If you use scXpand in your research, please cite:
@article{scxpand2024,
title={scXpand: Pan-cancer detection of T-cell clonal expansion from single-cell RNA sequencing without paired single-cell TCR sequencing},
author={[Your Name]},
journal={[Journal Name]},
year={2024},
doi={[DOI]}
}
This project was created in favor of the scientific community worldwide, with a special dedication to the cancer research community. We hope you’ll find this repository helpful, and we warmly welcome any requests or suggestions - please don’t hesitate to reach out!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file scxpand-0.1.0.dev1.tar.gz.
File metadata
- Download URL: scxpand-0.1.0.dev1.tar.gz
- Upload date:
- Size: 112.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f91d16c6a5b710ee9628e449852691b14f39bcba9268fa08d9560851db60a038
|
|
| MD5 |
9ffaaf8917915d60b6bf56314ba2172f
|
|
| BLAKE2b-256 |
8404d65a145ef208e3185bc766fd56c7fb6020303b22a4ddfd495f2f3cf4c498
|
File details
Details for the file scxpand-0.1.0.dev1-py3-none-any.whl.
File metadata
- Download URL: scxpand-0.1.0.dev1-py3-none-any.whl
- Upload date:
- Size: 128.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
951921c5222cb9e05637c992556e178c32c160a61d05ccb8e40407f1148893f6
|
|
| MD5 |
0ecbb9b78ca404fefe3604ec8b5f080c
|
|
| BLAKE2b-256 |
7aea06aac4031a21a4972f3f649d1660eb1564ceaf16a7d975b80ed436d7cadd
|