Basic Informatics and Gene Statistics from Unnormalized Reads, a feature selection tool for scRNAseq
Project description
BigSur
BigSur is a package for principled, robust scRNAseq normalization. Currently we can perform feature selection, see BigSurR for correlations.
What is BigSur?
Basic Informatics and Gene Statistics from Unnormalized Reads (BigSur) is a principled pipeline allowing for feature selection, correlation and clustering in scRNAseq.
- The feature selection derivations are detailed in the BioRxiv preprint Dollinger et al. 2024.
- The correlation are detailed in Silkwood et al. 2023.
Installation
The easiest way to install bigsur is via pip: conda create -n bigsur_env python pip conda activate bigsur_env pip install bigsur
Alternatively, you can clone the GitHub repo. We've included a environment file for conda environment installation; the only package we require that isn't installed with scanpy is mpmath and numexpr. For example:
In terminal:
cd bigsur_dir #directory to clone to
git clone https://github.com/landerlabcode/BigSur.git
conda create -f environment.yml -n bigsur
A note about the virtual environment
This environment contains all packages that are required to reproduce any result of the paper. If you want a lightweight conda enviroment (or alternatively, if the environment file is causing issues), you can create a sufficient conda environment as follows:
In terminal:
conda create -n bigsur -c conda-forge scanpy mpmath numexpr ipykernel python-igraph leidenalg
Usage
Usage for feature selection is detailed in the example notebook.
TL;DR:
import sys
sys.path.append(bigsur_dir) # directory where git repo was cloned
from BigSur.feature_selection import mcfano_feature_selection as mcfano
Replace sc.pp.highly_variable_genes(adata) in your pipeline with mcfano(adata, layer='counts'), where the UMI counts are in adata.layers['counts'].
And that's it! You can read more about how to use BigSur for feature selection, and in particular how to optimize cutoffs for a given dataset, in the example notebook.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bigsur-0.0.6.tar.gz.
File metadata
- Download URL: bigsur-0.0.6.tar.gz
- Upload date:
- Size: 12.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.11.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
34bf1155ec6e99a6e5725ea18d06186455e982180f7e05c0255b8b1d196dbace
|
|
| MD5 |
57ae8058c8b42fec7a29aa1a98050f67
|
|
| BLAKE2b-256 |
a82839b3119021046a5ca3b96c6b134692d637db2cb542bd663f73db2e2ae847
|
File details
Details for the file bigsur-0.0.6-py3-none-any.whl.
File metadata
- Download URL: bigsur-0.0.6-py3-none-any.whl
- Upload date:
- Size: 11.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.11.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b0245243675d3bc57778be8437e6caa71dda9f92ea712b13e4f1a9bc1af2f1bc
|
|
| MD5 |
a929f5d9bd568853447f34b68d7e223a
|
|
| BLAKE2b-256 |
3ec9107fc04488c177be5bb235d982315531e894671134e2fce65022a556eb28
|