Skip to main content

scanpy compatible python suite for fast tree inference and advanced pseudotime downstream analysis

Project description

PyPI DOI Documentation Status Build and Test codecov Line count GitHub license Code style: black

Description

This package provides a scalable Python suite for fast tree inference and advanced pseudotime downstream analysis, with a focus on fate biasing. This package is compatible with anndata object format used in scanpy or scvelo pipelines. A complete documentation of this package is available here.

The related work is now available in Bioinformatics:

Louis Faure, Ruslan Soldatov, Peter V. Kharchenko, Igor Adameyko
scFates: a scalable python package for advanced pseudotime and bifurcation analysis from single cell data
Bioinformatics, btac746; doi: https://doi.org/10.1093/bioinformatics/btac746

Tree inference algorithms

The user have the choice between two algorithm for tree inference:

ElPiGraph

For scFates, the python implementation of the ElPiGraph algorithm is used, which include GPU accelerated principal tree inference. A self-contained description of the algorithm is available here or in the related paper

A R implementation of this algorithm is also available, coded by Luca Albergante

A native MATLAB implementation of the algorithm (coded by Andrei Zinovyev and Evgeny Mirkes) is also available

Simple PPT

A simple PPT inspired approach, translated from the crestree R package, code has been also adapted to run on GPU for accelerated tree inference.

Other Citations

Code for PPT inference and most of downstream pseudotime analysis was initially written in a R package by Ruslan Soldatov for the following paper:

Soldatov, R., Kaucka, M., Kastriti, M. E., Petersen, J., Chontorotzea, T., Englmaier, L., … Adameyko, I. (2019).
Spatiotemporal structure of cell fate decisions in murine neural crest.
Science, 364(6444).

if you are using ElPiGraph, please cite:

Albergante, L., Mirkes, E. M., Chen, H., Martin, A., Faure, L., Barillot, E., … Zinovyev, A. (2020).
Robust And Scalable Learning Of Complex Dataset Topologies Via Elpigraph.
Entropy, 22(3), 296.

Code for preprocessing has been translated from R package pagoda2, if you use any of these functions (scf.pp.batch_correct & scf.pp.find_overdispersed), please cite:

Nikolas Barkas, Viktor Petukhov, Peter Kharchenko and Evan
Biederstedt (2021). pagoda2: Single Cell Analysis and Differential
Expression. R package version 1.0.2.

Palantir python tool provides a great dimensionality reduction method, which usually lead to consitent trees with scFates, if use scf.pp.diffusion, please cite:

Manu Setty and Vaidotas Kiseliovas and Jacob Levine and Adam Gayoso and Linas Mazutis and Dana Pe'er (2019)
Characterization of cell fate probabilities in single-cell data with Palantir.
Nature Biotechnology

Installation

scFates is available on pypi, you can install it using:

pip install -U scFates

or the latest development version can be installed from GitHub:

pip install git+https://github.com/LouisFaure/scFates

Development

For development, it is recommended to use pixi to manage the environment and dependencies.

To install the environment:

pixi install

To run tests:

pixi run test

With all dependencies

-pp.find_overdispersed, tl.test_association, tl.fit, tl.test_fork, tl.activation, tl.test_association_covariate, tl.test_covariate: Require R package mgcv interfaced via python package rpy2:

conda create -n scFates -c conda-forge -c r python=3.11 r-mgcv rpy2=3.4.2 -y
conda activate scFates
pip install scFates

to avoid any possible crashes due to rpy2 not finding the R install on conda, run the following import command:

import os, sys
os.environ['R_HOME'] = sys.exec_prefix+"/lib/R/"
import scFates

-tl.cellrank_to_tree: Requires cellrank to be installed in order to function::

pip install cellrank

On Apple Silicon

Installing mgcv using conda/mamba on Apple Silicon lead to the package not being able to find some dynamic libraries (BLAS). In that case it is recommended to install it separately:

mamba create -n scFates -c conda-forge -c bioconda -c defaults python numpy=1.24.4 "libblas=*=*accelerate" rpy2 -y
mamba activate scFates
Rscript -e 'install.packages("mgcv",repos = "http://cran.us.r-project.org")'

GPU dependencies (optional)

If you have a nvidia GPU, scFates can leverage CUDA computations for speedups for the following functions:

pp.filter_cells, pp.batch_correct, pp.diffusion, tl.tree, tl.cluster

The latest version of rapids framework is required. Create the following conda environment:

conda create --solver=libmamba -n scFates-gpu -c rapidsai -c conda-forge -c nvidia  \
    cuml=23.12 cugraph=23.12 python=3.10 cuda-version=11.2
conda activate scFates-gpu
pip install git+https://github.com/j-bac/elpigraph-python.git
pip install scFates

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scfates-1.2.1.tar.gz (402.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scfates-1.2.1-py3-none-any.whl (426.0 kB view details)

Uploaded Python 3

File details

Details for the file scfates-1.2.1.tar.gz.

File metadata

  • Download URL: scfates-1.2.1.tar.gz
  • Upload date:
  • Size: 402.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for scfates-1.2.1.tar.gz
Algorithm Hash digest
SHA256 2a9953b4bd2ed0ec17caf0b349d4788f641b00626c70ab0f379e50e7939d889f
MD5 d7aa86248460b219b42a0e03dc6c59f0
BLAKE2b-256 e9307c6e5898ff3e9793e2176c68a1369b9e9e47340f05135683dec69fdf888e

See more details on using hashes here.

File details

Details for the file scfates-1.2.1-py3-none-any.whl.

File metadata

  • Download URL: scfates-1.2.1-py3-none-any.whl
  • Upload date:
  • Size: 426.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for scfates-1.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 746a46bccdc6a796d05525e64072c215feb782942f2f3b4105aafef64dad305a
MD5 05f8d7ac489333635f33cb5848a522ab
BLAKE2b-256 1ba8e8b004bda3bfe103ddcd88b80abf8cd34df559955a678ef9386919d9f87a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page