Skip to main content

Pan-cancer detection of T-cell clonal expansion from single-cell RNA sequencing

Project description

scXpand Logo

scXpand: Pan-cancer Detection of T-cell Clonal Expansion

Detect T-cell clonal expansion from single-cell RNA sequencing data without paired TCR sequencing

DocumentationInstallationQuick StartExamplesCitation

scXpand Datasets Overview

A framework for predicting T-cell clonal expansion from single-cell RNA sequencing data.

Manuscript in preparation - detailed methodology and benchmarks coming soon.

View full documentation for comprehensive guides and API reference.

Features

  • Multiple Model Architectures:
    • Autoencoder-based: Encoder-decoder with reconstruction and classification heads
    • MLP: Multi-layer perceptron
    • LightGBM: Gradient boosted decision trees
    • Linear Models: Logistic regression and support vector machines
  • Scalable Processing: Handles millions of cells with memory-efficient data streaming from disk during training
  • Automated Hyperparameter Optimization: Built-in Optuna integration for model tuning

Installation

scXpand is available in two variants:

Standard Version (CPU/MPS):

pip install scxpand

CUDA Version (NVIDIA GPU):

pip install scxpand-cuda

Note: The standard version supports CPU and MPS (Apple Silicon), while the CUDA version provides NVIDIA GPU acceleration.

See the full installation guide for detailed setup instructions.

Quick Start

import scxpand
# Make sure that "your_data.h5ad" includes only T cells for the results to be meaningful
# Ensure that "your_data.var_names" are provided as Ensembl IDs (as the pre-trained models were trained using this gene representation)
# Please refer to our documentation for more information

# List available pre-trained models
scxpand.list_pretrained_models()

# Run inference with automatic model download
results = scxpand.run_inference(
    model_name="pan_cancer_autoencoder",  # default model
    data_path="your_data.h5ad"
)

Documentation

See our Tutorial Notebook for a complete example with data preprocessing, T-cell filtering, gene ID conversion, and model application using a real breast cancer dataset.

Getting Started:

Model Training & Optimization:

Analysis & Evaluation:

📖 Full Documentation - Complete guides, API reference, and interactive tutorials

License

This project is licensed under the MIT License – see the LICENSE file for details.

Citation

If you use scXpand in your research, please cite:

@article{scxpand2025,
  title={scXpand: Pan-cancer detection of T-cell clonal expansion from single-cell RNA sequencing without paired single-cell TCR sequencing},
  author={[Your Name]},
  journal={[Journal Name]},
  year={2025},
  doi={[DOI]}
}

This project was created in favor of the scientific community worldwide, with a special dedication to the cancer research community. We hope you’ll find this repository helpful, and we warmly welcome any requests or suggestions - please don’t hesitate to reach out!

Visitors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scxpand-0.1.18.tar.gz (126.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scxpand-0.1.18-py3-none-any.whl (137.4 kB view details)

Uploaded Python 3

File details

Details for the file scxpand-0.1.18.tar.gz.

File metadata

  • Download URL: scxpand-0.1.18.tar.gz
  • Upload date:
  • Size: 126.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.17

File hashes

Hashes for scxpand-0.1.18.tar.gz
Algorithm Hash digest
SHA256 c29b80bde1533260fa75b36a2422eb93b3715cc49fd1ba3fdaf5d6b103e94089
MD5 e2954adb2523763088c324f60bb83402
BLAKE2b-256 d9f0f061c1740c897fca6e622d5c1832a26bfa0d85cab015adba92b71962c5e7

See more details on using hashes here.

File details

Details for the file scxpand-0.1.18-py3-none-any.whl.

File metadata

  • Download URL: scxpand-0.1.18-py3-none-any.whl
  • Upload date:
  • Size: 137.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.17

File hashes

Hashes for scxpand-0.1.18-py3-none-any.whl
Algorithm Hash digest
SHA256 0e2c9757bac6a31fc33d576df44556f31dee969714538c47a2523e678cefad04
MD5 59a30e115033e1d9b5189f52bd490a7c
BLAKE2b-256 19e78caf446a4a0308ad97c3c1670cb1c53af71195a9a778c3f331e0eb93f463

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page