
Pan-cancer detection of T-cell clonal expansion from single-cell RNA sequencing


scXpand



Pan-cancer detection of T-cell clonal expansion from single-cell RNA sequencing without paired single-cell TCR sequencing



A framework for predicting T-cell clonal expansion from single-cell RNA sequencing data.

Manuscript in preparation - detailed methodology and benchmarks coming soon.

View full documentation for comprehensive guides and API reference.

Features

  • Multiple Model Architectures: Autoencoder, MLP, LightGBM, logistic regression, and SVM
  • Scalable Processing: Handles millions of cells with memory-efficient data streaming from disk during training
  • Automated Hyperparameter Optimization: Built-in Optuna integration for model tuning
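
To illustrate what streaming from disk means in practice, here is a minimal sketch that reads an on-disk matrix one batch of rows at a time with NumPy's `memmap`. It is not scXpand's implementation: the raw binary file layout, the `iter_batches` helper, and the toy matrix are assumptions made only for this example.

```python
import os
import tempfile

import numpy as np


def iter_batches(path, n_cells, n_genes, batch_size, dtype=np.float32):
    """Yield (start_index, batch) pairs, reading one slice of rows at a time."""
    # np.memmap maps the file lazily; only the rows that are sliced get read.
    data = np.memmap(path, dtype=dtype, mode="r", shape=(n_cells, n_genes))
    for start in range(0, n_cells, batch_size):
        # np.array(...) copies the slice so the batch is an ordinary in-memory array.
        yield start, np.array(data[start:start + batch_size])


# Hypothetical toy matrix: 10 cells x 4 genes, written to a temporary file.
n_cells, n_genes = 10, 4
expression = np.arange(n_cells * n_genes, dtype=np.float32).reshape(n_cells, n_genes)
tmp_dir = tempfile.mkdtemp()
matrix_path = os.path.join(tmp_dir, "expression.dat")
expression.tofile(matrix_path)

batch_shapes = [
    batch.shape
    for _, batch in iter_batches(matrix_path, n_cells, n_genes, batch_size=4)
]
print(batch_shapes)  # [(4, 4), (4, 4), (2, 4)]
```

Only one batch is resident in memory at any time, so peak memory depends on `batch_size` rather than on the total number of cells.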

Installation

pip install scxpand

Quick Start

import scxpand

# List available pre-trained models
scxpand.list_pretrained_models()

# Run inference with automatic model download
results = scxpand.run_inference_with_pretrained(
    model_name="autoencoder_pan_cancer",
    data_path="your_data.h5ad"
)

Or via command line:

# Pre-trained model inference (curated models)
scxpand predict --data_path your_data.h5ad --model_name autoencoder_pan_cancer

# Inference with any model hosted on Zenodo, referenced by its DOI
scxpand predict --data_path your_data.h5ad --model_doi 10.5281/zenodo.1234567

# Local model inference
scxpand predict --data_path your_data.h5ad --model_path results/my_model
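
The three invocations above differ only in how the model is specified. When running predictions over several files, it can help to build the command programmatically. The sketch below uses only the flags shown above; the `build_predict_command` helper, the file names, and the rule that exactly one model source is passed per call are assumptions for illustration, not documented behavior of the CLI.

```python
import shlex


def build_predict_command(data_path, model_name=None, model_doi=None, model_path=None):
    """Return the argument list for one `scxpand predict` call."""
    sources = {
        "--model_name": model_name,
        "--model_doi": model_doi,
        "--model_path": model_path,
    }
    chosen = [(flag, value) for flag, value in sources.items() if value is not None]
    # Assumption: exactly one model source per call, mirroring the examples above.
    if len(chosen) != 1:
        raise ValueError("pass exactly one of model_name, model_doi, model_path")
    flag, value = chosen[0]
    return ["scxpand", "predict", "--data_path", data_path, flag, value]


# Hypothetical input files; replace with real .h5ad paths.
commands = [
    build_predict_command(path, model_name="autoencoder_pan_cancer")
    for path in ["sample_a.h5ad", "sample_b.h5ad"]
]
for command in commands:
    print(shlex.join(command))
```

Each list can be passed to `subprocess.run(command, check=True)` once scXpand is installed; the sketch only prints the commands so it runs without the package.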

Development

For development installation and model training, see the documentation.

Model Architectures

scXpand provides multiple model architectures to suit different use cases and data characteristics:

Autoencoder-based Classifiers

An encoder paired with an auxiliary decoder for reconstruction and a classifier head for expansion prediction. Representation learning lets the model capture complex patterns in single-cell data.
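
As a rough illustration of this layout, the NumPy sketch below runs a single forward pass through an encoder, a decoder, and a classifier head that share one latent representation. It is not scXpand's model: the layer sizes, the single linear layer per component, the toy data, and the weighted sum of the two losses (`recon_weight`) are all assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration.
n_cells, n_genes, n_latent = 8, 20, 4

# Toy inputs: non-negative "expression" values and binary expansion labels.
x = rng.random((n_cells, n_genes))
y = rng.integers(0, 2, size=n_cells)

# One linear layer per component keeps the sketch short.
w_enc = rng.normal(scale=0.1, size=(n_genes, n_latent))
w_dec = rng.normal(scale=0.1, size=(n_latent, n_genes))
w_clf = rng.normal(scale=0.1, size=(n_latent, 1))


def forward(batch):
    """Return (latent, reconstruction, expansion probability) for a batch."""
    z = np.maximum(batch @ w_enc, 0.0)    # encoder with ReLU activation
    x_hat = z @ w_dec                     # auxiliary decoder reconstructs the input
    logits = (z @ w_clf).ravel()          # classifier head reads the same latent
    p = 1.0 / (1.0 + np.exp(-logits))     # sigmoid turns logits into probabilities
    return z, x_hat, p


z, x_hat, p = forward(x)

# Assumed joint objective: classification loss plus weighted reconstruction error.
eps = 1e-7
recon_loss = np.mean((x - x_hat) ** 2)
clf_loss = -np.mean(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))
recon_weight = 0.5  # hypothetical weighting between the two terms
total_loss = clf_loss + recon_weight * recon_loss

print(z.shape, x_hat.shape, p.shape)  # (8, 4) (8, 20) (8,)
```

The point of the shared latent `z` is that the reconstruction term shapes the same representation the classifier reads from.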

Multi-Layer Perceptron (MLP)

Standard feed-forward neural networks for direct expansion prediction.

LightGBM

Gradient-boosted decision trees for classification, well suited to tabular data.

Linear Models

Classical machine learning approaches including logistic regression and support vector machines.

License

This project is licensed under the MIT License – see the LICENSE file for details.

Citation

If you use scXpand in your research, please cite:

@article{scxpand2024,
  title={scXpand: Pan-cancer detection of T-cell clonal expansion from single-cell RNA sequencing without paired single-cell TCR sequencing},
  author={[Your Name]},
  journal={[Journal Name]},
  year={2024},
  doi={[DOI]}
}

This project was created for the benefit of the scientific community worldwide, with a special dedication to the cancer research community. We hope you’ll find this repository helpful, and we warmly welcome any requests or suggestions. Please don’t hesitate to reach out!

Source Distribution

scxpand-0.1.0.dev1.tar.gz (112.8 kB view details)

Uploaded Source

Built Distribution

scxpand-0.1.0.dev1-py3-none-any.whl (128.8 kB view details)

Uploaded Python 3

File details

Details for the file scxpand-0.1.0.dev1.tar.gz.

File metadata

  • Download URL: scxpand-0.1.0.dev1.tar.gz
  • Upload date:
  • Size: 112.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.15

File hashes

Hashes for scxpand-0.1.0.dev1.tar.gz
Algorithm    Hash digest
SHA256       f91d16c6a5b710ee9628e449852691b14f39bcba9268fa08d9560851db60a038
MD5          9ffaaf8917915d60b6bf56314ba2172f
BLAKE2b-256  8404d65a145ef208e3185bc766fd56c7fb6020303b22a4ddfd495f2f3cf4c498

File details

Details for the file scxpand-0.1.0.dev1-py3-none-any.whl.

File hashes

Hashes for scxpand-0.1.0.dev1-py3-none-any.whl
Algorithm    Hash digest
SHA256       951921c5222cb9e05637c992556e178c32c160a61d05ccb8e40407f1148893f6
MD5          0ecbb9b78ca404fefe3604ec8b5f080c
BLAKE2b-256  7aea06aac4031a21a4972f3f649d1660eb1564ceaf16a7d975b80ed436d7cadd
