scanpy compatible python suite for fast tree inference and advanced pseudotime downstream analysis
Project description
Description
This package provides a scalable Python suite for fast tree inference and advanced pseudotime downstream analysis, with a focus on fate biasing. This package is compatible with anndata object format used in scanpy or scvelo pipelines. A complete documentation of this package is available here.
The related work is now available in Bioinformatics:
Louis Faure, Ruslan Soldatov, Peter V. Kharchenko, Igor Adameyko
scFates: a scalable python package for advanced pseudotime and bifurcation analysis from single cell data
Bioinformatics, btac746; doi: https://doi.org/10.1093/bioinformatics/btac746
Tree inference algorithms
The user have the choice between two algorithm for tree inference:
ElPiGraph
For scFates, the python implementation of the ElPiGraph algorithm is used, which include GPU accelerated principal tree inference. A self-contained description of the algorithm is available here or in the related paper
A R implementation of this algorithm is also available, coded by Luca Albergante
A native MATLAB implementation of the algorithm (coded by Andrei Zinovyev and Evgeny Mirkes) is also available
Simple PPT
A simple PPT inspired approach, translated from the crestree R package, code has been also adapted to run on GPU for accelerated tree inference.
Other Citations
Code for PPT inference and most of downstream pseudotime analysis was initially written in a R package by Ruslan Soldatov for the following paper:
Soldatov, R., Kaucka, M., Kastriti, M. E., Petersen, J., Chontorotzea, T., Englmaier, L., … Adameyko, I. (2019).
Spatiotemporal structure of cell fate decisions in murine neural crest.
Science, 364(6444).
if you are using ElPiGraph, please cite:
Albergante, L., Mirkes, E. M., Chen, H., Martin, A., Faure, L., Barillot, E., … Zinovyev, A. (2020).
Robust And Scalable Learning Of Complex Dataset Topologies Via Elpigraph.
Entropy, 22(3), 296.
Code for preprocessing has been translated from R package pagoda2, if you use any of these functions (scf.pp.batch_correct
& scf.pp.find_overdispersed
), please cite:
Nikolas Barkas, Viktor Petukhov, Peter Kharchenko and Evan
Biederstedt (2021). pagoda2: Single Cell Analysis and Differential
Expression. R package version 1.0.2.
Palantir python tool provides a great dimensionality reduction method, which usually lead to consitent trees with scFates, if use scf.pp.diffusion
, please cite:
Manu Setty and Vaidotas Kiseliovas and Jacob Levine and Adam Gayoso and Linas Mazutis and Dana Pe'er (2019)
Characterization of cell fate probabilities in single-cell data with Palantir.
Nature Biotechnology
Installation
scFates is available on pypi, you can install it using:
pip install -U scFates
or the latest development version can be installed from GitHub:
pip install git+https://github.com/LouisFaure/scFates
With all dependencies
-pp.find_overdispersed
, tl.test_association
, tl.fit
, tl.test_fork
, tl.activation
, tl.test_association_covariate
, tl.test_covariate
: Require R package mgcv interfaced via python package rpy2:
conda create -n scFates -c conda-forge -c r python=3.11 r-mgcv rpy2=3.4.2 -y
conda activate scFates
pip install scFates
to avoid any possible crashes due to rpy2 not finding the R install on conda, run the following import command:
import os, sys
os.environ['R_HOME'] = sys.exec_prefix+"/lib/R/"
import scFates
-tl.cellrank_to_tree
: Requires cellrank to be installed in order to function::
pip install cellrank
On Apple Silicon
Installing mgcv using conda/mamba on Apple Silicon lead to the package not being able to find some dynamic libraries (BLAS). In that case it is recommended to install it separately:
mamba create -n scFates -c conda-forge -c bioconda -c defaults python numpy=1.24.4 "libblas=*=*accelerate" rpy2 -y
mamba activate scFates
Rscript -e 'install.packages("mgcv",repos = "http://cran.us.r-project.org")'
GPU dependencies (optional)
If you have a nvidia GPU, scFates can leverage CUDA computations for speedups for the following functions:
pp.filter_cells
, pp.batch_correct
, pp.diffusion
, tl.tree
, tl.cluster
The latest version of rapids framework is required. Create the following conda environment:
conda create --solver=libmamba -n scFates-gpu -c rapidsai -c conda-forge -c nvidia \
cuml=23.12 cugraph=23.12 python=3.10 cuda-version=11.2
conda activate scFates-gpu
pip install git+https://github.com/j-bac/elpigraph-python.git
pip install scFates
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file scFates-1.0.9.tar.gz
.
File metadata
- Download URL: scFates-1.0.9.tar.gz
- Upload date:
- Size: 99.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 17a7d7c6b98c3253506362af6b4c0878bb44d64ba9f11c0d372e450712357bcb |
|
MD5 | 80cb697db377647e12be21eb20cbd4de |
|
BLAKE2b-256 | 6f59e7c27b941d21458c57c47400b9a8549dfca2aef3f266796c5e14e65bf7e7 |
File details
Details for the file scFates-1.0.9-py3-none-any.whl
.
File metadata
- Download URL: scFates-1.0.9-py3-none-any.whl
- Upload date:
- Size: 421.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 718f9c117f5e0fc6a624432fa2ec12e24c22deeef4f7f8a1dcc56ef1daf09458 |
|
MD5 | 1c1529c6b1a5a17c68b088b75dd9fa03 |
|
BLAKE2b-256 | b52d8aa6b01184da0561a5ad628976b618ae38ed702b73312ba963275b714956 |