Cell-GPS spatial topology analysis for spatial omics data.
Project description
Cell-GPS
Cell-GPS is the Python package and reference implementation for Cophenetic Spatial Topology Embedding (COSTE), a spatial topology analysis framework for spatial omics data. New Python code should import cellgps; the historical sfplot import namespace is retained for backward compatibility.
This repository is maintained as both the installable Python package and the code companion for the Cell-GPS/COSTE bioRxiv preprint.
Preprint and manuscript code
Cell-GPS/COSTE is described in the associated bioRxiv preprint:
Long M, Hu T, Sountoulidis A, Samakovlis C, Nilsson M. Cophenetic Spatial Topology Embedding reveals multiscale tissue architecture in spatial omics. bioRxiv. 2026. doi: 10.64898/2026.05.26.727847
Versioned bioRxiv page: https://www.biorxiv.org/content/10.64898/2026.05.26.727847v1
If you use the Python package, the Windows executable, the R companion package, or the manuscript figure/table code, please cite this preprint.
The code used to generate the preprint figures and supplementary tables is organized in Cell-GPS manuscript code/:
main_figures/: notebooks for main figure analyses.supplementary_figures/: notebooks for supplementary figure analyses.supplementary_tables/: notebooks for supplementary table analyses.
The notebooks are intentionally output-free and preserve the original manuscript data paths where those paths were required for reproduction. A detailed mapping from manuscript results to source code is available in docs/cellgps_science_manuscript_code_inventory.md.
Package Names
- Python distribution:
Cell-GPS - Python import package:
cellgps - R package/repository:
cellgpsr - Windows executable:
cellgps.exe
The Python package is hosted at https://github.com/hutaobo/cellgps. The R package is hosted separately at https://github.com/hutaobo/cellgpsr. The Windows single-file executable is distributed through Zenodo: https://zenodo.org/records/17859173.
What Cell-GPS does
- Computes searcher-findee distance matrices from spatial coordinates and cell labels.
- Builds cophenetic distance matrices and StructureMap heatmaps from
AnnDataobjects or plainpandastables. - Loads 10x Xenium outputs and prepares them for downstream spatial analysis.
- Provides Xenium loaders backed by
pyXenium.io, including standard Xenium folders and table bundles withcells.parquet, official*_cell_groups.csv, andcell_feature_matrix.h5. - Supports transcript-by-cell analysis for locating transcripts relative to cell types.
- Includes memory-optimized workflows for large datasets.
- Provides plotting utilities such as clustered heatmaps, circular dendrograms, and related summary figures.
Repository layout
src/cellgps/: recommended Python import namespace.src/sfplot/: legacy compatibility namespace and current implementation modules.tests/: package tests and smoke checks.docs/: Sphinx documentation.Cell-GPS manuscript code/: curated preprint figure and table notebooks.sfplot-manuscript/: legacy manuscript-specific notebooks, figures, and derived outputs.benchmarking/: benchmarking-related material.segmentation_methods/: supporting segmentation workflows.
Installation
Install from PyPI:
pip install Cell-GPS
Install from the maintained conda channel:
conda install -c https://raw.githubusercontent.com/hutaobo/conda-channel/main cell-gps
Install directly from GitHub:
pip install git+https://github.com/hutaobo/Cell-GPS.git
For local development or reviewer inspection:
git clone https://github.com/hutaobo/Cell-GPS.git
cd cellgps
pip install -e .
The package requires Python 3.9 or later.
Quick start from a coordinate table
The minimal input is a table with spatial coordinates and a cell-type column.
import pandas as pd
from cellgps import compute_cophenetic_distances_from_df, plot_cophenetic_heatmap
df = pd.DataFrame(
{
"x": [0, 1, 5, 6],
"y": [0, 1, 5, 6],
"celltype": ["A", "A", "B", "B"],
}
)
row_coph, col_coph = compute_cophenetic_distances_from_df(
df=df,
x_col="x",
y_col="y",
celltype_col="celltype",
)
plot_cophenetic_heatmap(
row_coph,
matrix_name="row_coph",
output_dir="output",
output_filename="StructureMap_example.pdf",
sample="Example",
)
Quick start from Xenium output
from cellgps import load_xenium_data, load_xenium_table_bundle, compute_cophenetic_distances_from_adata
# Standard Xenium folder through pyXenium.io
adata = load_xenium_data("/path/to/xenium/run", normalize=False)
# Explicit table-bundle route used for the Atera Xenium benchmark
adata = load_xenium_table_bundle("/path/to/xenium/run", normalize=False)
row_coph, col_coph = compute_cophenetic_distances_from_adata(
adata,
cluster_col="Cluster",
output_dir="output",
)
Useful public entry points
load_xenium_data: load and preprocess Xenium data.load_xenium_table_bundle: load Xenium data fromcells.parquet+*_cell_groups.csv+cell_feature_matrix.h5throughpyXenium.io.compute_cophenetic_distances_from_df: compute structure matrices from a coordinate table.compute_weighted_searcher_findee_distance_matrix_from_df: weighted searcher-findee kernel for entity, pathway, or LR analysis.compute_weighted_cophenetic_distances_from_df: weighted StructureMap wrapper over the weighted kernel.compute_cophenetic_distances_from_adata: compute structure matrices fromAnnData.compute_entity_to_cell_topology: generalizet_and_cfrom transcripts to arbitrary weighted entities.compute_entity_structuremap: build StructureMap-style topology among arbitrary weighted entities.plot_cophenetic_heatmap: generate StructureMap and related clustered heatmaps.transcript_by_cell_analysis: analyze transcript-to-cell spatial structure at scale.ligand_receptor_topology_analysis: score sender→receiver ligand-receptor candidates using topology, structure compatibility, and local contact.ligand_receptor_target_consistency: add a NicheNet-style downstream target-consistency layer.compute_pathway_activity_matrix: compute rank-based or weighted pathway activities per cell.pathway_topology_analysis: analyze pathway-to-cell and pathway-to-pathway spatial topology.compute_cophenetic_distances_from_df_memory_opt: memory-aware alternative for large tables.plot_circular_dendrogram_pycirclize: circular dendrogram visualization.
Notes for reviewers
- The recommended Python import namespace is
cellgps; the legacysfplotnamespace remains available for older scripts. - The curated figure and table notebooks for the bioRxiv preprint are kept in
Cell-GPS manuscript code/. sfplot-manuscript/is retained as a legacy manuscript workspace with additional notebooks, generated figure assets, and intermediate outputs.- Raw experimental datasets are not bundled in this repository because of size and distribution constraints. The code expects standard spatial omics outputs such as Xenium folders or tabular coordinate inputs.
- When a
cellgps_tbc_formal_wta/results-style directory is already available, the LR and pathway topology extensions are designed to reuse itst_and_c_result_*.csvandStructureMap_table_*.csvoutputs as the preferred gene-level topology anchors before falling back to recomputation. - Xenium loading depends on
pyXenium>=0.4.3. Visium helpers remain optional through the separateCell-GPS[visium]extra. - A short repository walkthrough is available in REVIEWER_GUIDE.md.
Documentation
Sphinx documentation sources are available in docs/.
Citation
If you use Cell-GPS or reuse the manuscript analysis code, please cite:
Long M, Hu T, Sountoulidis A, Samakovlis C, Nilsson M. Cophenetic Spatial Topology Embedding reveals multiscale tissue architecture in spatial omics. bioRxiv. 2026. doi: 10.64898/2026.05.26.727847
@article{long2026cellgps,
title = {Cophenetic Spatial Topology Embedding reveals multiscale tissue architecture in spatial omics},
author = {Long, Mengping and Hu, Taobo and Sountoulidis, Alexandros and Samakovlis, Christos and Nilsson, Mats},
journal = {bioRxiv},
year = {2026},
doi = {10.64898/2026.05.26.727847},
url = {https://www.biorxiv.org/content/10.64898/2026.05.26.727847v1}
}
License
This project is released under the MIT License. See LICENSE.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cell_gps-0.0.6.tar.gz.
File metadata
- Download URL: cell_gps-0.0.6.tar.gz
- Upload date:
- Size: 75.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7e273f817c35977e535872962f71cfd6c8b346ddf9d96f8a069daf1c1b52fc4c
|
|
| MD5 |
86e09e580b4dbb0334ed3491157e6ea4
|
|
| BLAKE2b-256 |
023b339fb7917b71fbc32d887800c375011a9e134b4d72097c83f36d63dfbeac
|
Provenance
The following attestation bundles were made for cell_gps-0.0.6.tar.gz:
Publisher:
python-publish.yml on hutaobo/Cell-GPS
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
cell_gps-0.0.6.tar.gz -
Subject digest:
7e273f817c35977e535872962f71cfd6c8b346ddf9d96f8a069daf1c1b52fc4c - Sigstore transparency entry: 1675508869
- Sigstore integration time:
-
Permalink:
hutaobo/Cell-GPS@57fa32d36bf8ec9599f63fa3f3d58be368ac05a9 -
Branch / Tag:
refs/tags/v0.0.6 - Owner: https://github.com/hutaobo
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@57fa32d36bf8ec9599f63fa3f3d58be368ac05a9 -
Trigger Event:
release
-
Statement type:
File details
Details for the file cell_gps-0.0.6-py3-none-any.whl.
File metadata
- Download URL: cell_gps-0.0.6-py3-none-any.whl
- Upload date:
- Size: 76.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c3f3477612acd551224b75a15f6139c5f756c9fa176a30189822cc874fb36d8f
|
|
| MD5 |
0bf137b18b74487e53116c5f744dea0e
|
|
| BLAKE2b-256 |
ec992adbd20c10f2d8bc19c7b3a2211c7996f14240ffcd62e17f162d4d872106
|
Provenance
The following attestation bundles were made for cell_gps-0.0.6-py3-none-any.whl:
Publisher:
python-publish.yml on hutaobo/Cell-GPS
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
cell_gps-0.0.6-py3-none-any.whl -
Subject digest:
c3f3477612acd551224b75a15f6139c5f756c9fa176a30189822cc874fb36d8f - Sigstore transparency entry: 1675508911
- Sigstore integration time:
-
Permalink:
hutaobo/Cell-GPS@57fa32d36bf8ec9599f63fa3f3d58be368ac05a9 -
Branch / Tag:
refs/tags/v0.0.6 - Owner: https://github.com/hutaobo
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@57fa32d36bf8ec9599f63fa3f3d58be368ac05a9 -
Trigger Event:
release
-
Statement type: