Skip to main content

Oncoordinate is an interpretable deep learning framework for single-cell and spatial transcriptomic analysis of malignancy that learns malignant and malignancy-associated cell states across epithelial, stromal, and immune lineages while remaining tightly integrated with the scverse ecosystem.

Project description

Oncoordinate

Contents

  1. Introduction
  2. Discovering single-cell states
  3. Projecting to spatial transcriptomic cohorts
  4. References
  5. Acknowledgements

Note: we provide a demo notebook (vignettes/tutorial.html) and two spatial datasets (demo-data/V10U24-037_A, demo-data/V10U24-037_B). We will be uploading the train set, the test set, and the reference_sc.h5ad on zenodo. We have also provided an environment.yml file so that you can run the command: conda env create -f environment.yml and get a working environment for Oncoordinate.

1. Introduction

Oncoordinate is an interpretable deep learning framework for single-cell and spatial transcriptomic analysis of malignancy that learns malignant and malignancy-associated cell states across epithelial, stromal, and immune lineages while remaining tightly integrated with the scverse ecosystem. Built on a sequential attention–based and trained on an integrated lung atlas (~3M cells spanning normal, chronic disease, and multiple lung carcinoma subtypes), Oncoordinate predicts a four-stage neoplastic continuum (normal, dysplastic, pre-malignant, malignant) with calibrated probabilities and sparse, step-wise feature selection that can be traced back to genes and pathways. Beyond single-cell analysis, it includes a de novo label transfer pipeline based on scVI and scANVI that projects these learned states into spatial transcriptomic datasets (e.g., 10x Visium) via pseudospots and joint latent embeddings, enabling the localization of aggressive niches, tumor–CAF neighborhoods, and other malignant ecosystems within intact tissue. Oncoordinate also works with niche detection methods such as SOAPy and works seamlessly with AnnData and Scanpy-based workflows, providing a GPU-accelerated, atlas-scale, and plug-and-play framework for malignancy modeling across molecular and spatial dimensions.

To install Oncoordinate, please run:

pip install oncoordinate

Fig1

2. Discovering single-cell states

Oncoordinate discovers malignant and malignancy-associated cell states by modeling neoplastic progression as a continuous, lineage-aware process rather than a binary tumor versus non-tumor classification. Using atlas-scale single-cell RNA-seq data, the model integrates gene-level features and pathway scores to learn a four-stage neoplastic continuum spanning normal, dysplastic, pre-malignant, and malignant states. Its sequential attention mechanism performs sparse, step-wise feature selection, allowing the model to focus on distinct transcriptional programs at different stages of malignancy while preserving interpretability. This design enables Oncoordinate to recover coherent oncogenic trajectories within individual lineages, such as epithelial, fibroblast, immune, and endothelial compartments, and to quantify heterogeneity both within and across patients. The resulting per-cell malignancy probabilities and lineage-resolved scores provide an interpretable representation of tumor ecosystem remodeling that can be directly used for downstream analyses, including trajectory visualization, pathway interrogation, and patient-level stratification.

Fig2

3. Projecting to spatial transcriptomic cohorts

To translate single-cell–derived malignancy states into intact tissue architecture, Oncoordinate implements a de novo label transfer framework tailored for spatial transcriptomics data. Single-cell profiles sharing the same lineage and malignancy state are aggregated into pseudospots to approximate the multicellular composition of spatial capture spots, and these pseudospots are jointly embedded with spatial transcriptomic data using scVI to learn a shared, batch-corrected latent space. scANVI is then applied in a semi-supervised manner to infer probabilistic malignancy and lineage labels for each spatial spot. This approach enables robust projection of malignant programs into spatial coordinates, revealing colocalized malignant neighborhoods such as tumor–fibroblast (CAF) niches, regions of stromal remodeling, and immune-associated malignant ecosystems. By combining probabilistic spatial labeling with downstream niche detection methods such as SOAPy, Oncoordinate allows users to identify and characterize aggressive, spatially confined microenvironments that are not apparent from dissociated single-cell data alone.

Fig3

4. References

  • Venkat V. et al. Disruptive changes in tissue microenvironment prime oncogenic processes at different stages of carcinogenesis in lung. bioRxiv (2024).
  • Venkat VV, De S. Oncoordinate-derived single-cell states translate to malignant clusters in spatial transcriptomic cohorts. bioRxiv (2026).

5. Acknowledgements

We thank colleagues at the Rutgers Cancer Institute and members of the De Laboratory for their guidance and support.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oncoordinate-0.1.7.tar.gz (1.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

oncoordinate-0.1.7-py3-none-any.whl (1.6 MB view details)

Uploaded Python 3

File details

Details for the file oncoordinate-0.1.7.tar.gz.

File metadata

  • Download URL: oncoordinate-0.1.7.tar.gz
  • Upload date:
  • Size: 1.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.5

File hashes

Hashes for oncoordinate-0.1.7.tar.gz
Algorithm Hash digest
SHA256 811a1dfe59f45682470306c5fa57bd21fbc4b21682fb0522f447b3c293bb13b6
MD5 228e093982ecdb15de13fa8db6a7e97d
BLAKE2b-256 d881b42a1708c321a30d6fc73b1ea9c7e57a84c8a5fc2ffe084336a46e900761

See more details on using hashes here.

File details

Details for the file oncoordinate-0.1.7-py3-none-any.whl.

File metadata

File hashes

Hashes for oncoordinate-0.1.7-py3-none-any.whl
Algorithm Hash digest
SHA256 1f8ecfd72a5a63a0783a0a401330e62d82bc57ef2069932cece42bbadbec1b2e
MD5 da71ffb44454ade0082036c5448a4ab2
BLAKE2b-256 0c0b1082cd5d3b0325615b8c309184acaebb5abfe48f29ad28ebb24f1c86a98c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page