Skip to main content

A package that efficiently computes p-values for a given set of genes based on input matrices representing cell coordinates and gene expression data

Project description

\n# scBSP - A Fast Tool for Single-Cell Spatially Variable Genes Identifications on Large-Scale Spatially Resolved Transcriptomics Data

DOI

This package utilizes a granularity-based dimension-agnostic tool, single-cell big-small patch (scBSP), implementing sparse matrix operation and KD-tree/balltree method for distance calculation, for the identification of spatially variable genes on large-scale data.

Installation

Dependencies

To ensure scBSP functions optimally, the following dependencies are required:

  • Python (>= 3.8)
  • NumPy (>= 1.24.4)
  • Pandas (>= 1.3.5)
  • SciPy (>= 1.10.1)
  • scikit-learn (>=1.3.2)

Installation Commands

For Standard Installation (Using Ball Tree):

pip install "scbsp"

For Installation with GPU:

pip install "scbsp[gpu]"

Usage

To use scBSP, you need to provide two primary inputs:

  1. Cell Coordinates Matrix (input_sp_mat):

    • Format: Numpy array.
    • Dimensions: N x D, where N is the number of cells and D is the dimension of coordinates.
  2. Gene Expression Matrix (input_exp_mat_raw):

    • Format: Numpy array, Pandas DataFrame, or CSR matrix.
    • Dimensions: N x P, where N is the number of cells and P is the number of genes.

Additional parameters to specify include:

  • d1: A floating-point number. Default value is 1.0.
  • d2: A floating-point number. Default value is 3.0.
  • leaf_size: Optional integer defining the maximum point threshold for the Ball Tree algorithm to revert to brute-force search (default = 80).
  • use_gpu: Optional boolean defining whether to use the GPU (default = False).

Example

Below is a straightforward example showcasing how to compute p-values with scBSP:

import scbsp

# Load your data into these variables
input_sp_mat = ...  # Cell Coordinates Matrix
input_exp_mat_raw = ...  # Gene Expression Matrix

# Set the optional parameters
d1 = 1.0
d2 = 3.0

# Compute p-values
p_values = scbsp.granp(input_sp_mat, input_exp_mat_raw, d1, d2)

Output

The function returns a Pandas DataFrame, featuring two columns: gene_names and p_values. Each row within this DataFrame represents a unique gene from the input gene expression matrix. The gene_names column specifies the identifier for each gene, while the p_values column quantifies the statistical significance of the expression differences observed across various cell coordinates. This structured format enhances the ease of conducting sophisticated biological analyses, allowing for straightforward identification and investigation of genes with significant expression variability.

Reference

  • Li, Jinpu, Yiqing Wang, Mauminah Azam Raina, Chunhui Xu, Li Su, Qi Guo, Qin Ma, Juexin Wang, and Dong Xu. "scBSP: A fast and accurate tool for identifying spatially variable genes from spatial transcriptomic data." bioRxiv (2024).

  • Wang, Juexin, Jinpu Li, Skyler T. Kramer, Li Su, Yuzhou Chang, Chunhui Xu, Michael T. Eadon, Krzysztof Kiryluk, Qin Ma, and Dong Xu. "Dimension-agnostic and granularity-based spatially variable gene identification using BSP." Nature Communications 14, no. 1 (2023): 7367.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scbsp-0.3.0.tar.gz (19.4 kB view details)

Uploaded Source

Built Distribution

scbsp-0.3.0-py3-none-any.whl (18.3 kB view details)

Uploaded Python 3

File details

Details for the file scbsp-0.3.0.tar.gz.

File metadata

  • Download URL: scbsp-0.3.0.tar.gz
  • Upload date:
  • Size: 19.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.19

File hashes

Hashes for scbsp-0.3.0.tar.gz
Algorithm Hash digest
SHA256 32544a3f082b627a82be3bc0873ee4f5d4800b0f71d5147b405bede3d1aee9cf
MD5 c54b253926551ee95c30dcbaba3e2b92
BLAKE2b-256 e13a0b8983055395b78d6eeb844d06f2f5ca6cc103eedc693e6ea69dbed197bf

See more details on using hashes here.

File details

Details for the file scbsp-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: scbsp-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 18.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.19

File hashes

Hashes for scbsp-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 508d1a1c2baac6183505e281ef29e6d29c81eb476c98586842e2d85810f8f627
MD5 7cad4365c82b05ba379c8faa4153701e
BLAKE2b-256 93f20c04412f8ee25000ff219c3a55a5bfc46412ace977f3c192d7fc2c9242a9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page