Array alignment and 3D differential expression for 3D sc omics
Project description
sc3D
sc3D is a Python library to handle 3D spatial transcriptomic datasets.
To access the 3D viewer for sc3D datasets, you can go to the following link: GuignardLab/napari-sc3D-viewer
You can find it on the Guignard Lab GitHub page: GuignardLab/sc3D. In there you will be able to find jupyter notebooks giving examples about how to use the datasets.
This code was developed in the context of the following study:
Spatial transcriptomic maps of whole mouse embryos. Abhishek Sampath Kumar, Luyi Tian, Adriano Bolondi et al.
The sc3D code is based on the anndata and Scanpy libraries and allows to read, register arrays and compute 3D differential expression.
The dataset necessary to run the tests and look at the results can be downloaded there for the unregistered dataset (and test the provided algorithms) and there for the already registered atlas to browse with our visualiser. You can find the visualiser there.
Description of the GitHub repository
-
data: a folder containing examples for the tissue color and tissue name files
-
src: a folder containing the source code
-
txt: a folder containing the text describing the method (LaTeX, pdf and docx formats are available)
-
README.md: this file
-
notebooks/Test-embryo.ipynb: Basic read and write examples (many different ways of writing)
-
notebooks/Spatial-differential-expression.ipynb: a jupyter notebook with some examples on how to perform the spatial differential expression
-
notebooks/Embryo-registration.ipynb: a jupyter notebook with an example on how to do the array registration
-
setup.py: Setup file to install the library
Installation
We strongly advise to use virtual environments to install this package. For example using conda or miniconda:
conda create -n sc-3D
conda activate sc-3D
If necessary, install pip
:
conda install pip
Then, to install the latest stable version:
pip install sc-3D
or to install the latest version from the GitHub repository:
git clone https://github.com/GuignardLab/sc3D.git
cd sc3D
pip install .
Troubleshooting for latest M1 MacOs chips
If working with an M1 chip, it is possible that all the necessary libraries are not yet available from the usual channels.
To overcome this issue we recommand to manually install the latest, GitHub version of sc3D using miniforge instead of anaconda or miniconda.
Once miniforge is installed and working, you can run the following commands:
conda create -n sc-3D
conda activate sc-3D
to create your environment, then:
git clone https://github.com/GuignardLab/sc3D.git
cd sc3D
conda install pip scipy numpy matplotlib pandas seaborn anndata napari
pip install .
If the previous commands are still not working, it is possible that you need to install the pkg-config
package. You can find some information on how to do it there: install pkg-config
Basic usage
Once installed, the library can be called the following way:
from sc3D import Embryo
To import some data:
Note: at the time being, the following conventions are expected:
- the x-y coordinates are stored in
data.obsm['X_spatial']
- the array number should be stored in
data.obs['orig.ident']
in the format".*_[0-9]*"
where the digits after the underscore (_
) are the id of the array - the tissue type has to be stored in
data.obs['predicted.id']
- the gene names have to be stored as indices or in
data.var['feature_name']
Since version 0.1.2
, one can specify the name of the columns where the different necessary informations are stored using the following parameters:
tissue_id
to inform the tissue id columnarray_id
to inform the array/puck/slice id columnpos_id
to inform the position column (anx, y
position is expected within this given column)gene_name_id
to inform the gene name columnpos_reg_id
when to inform the registered position column (anx, y, z
position is expected within this given column)
# To read the data
embryo = Embryo('path/to/data.h5ad')
# To remove potential spatial outliers
embryo.removing_spatial_outliers(th=outlier_threshold)
# To register the arrays and compute the
# spline interpolations
embryo.reconstruct_intermediate(embryo, th_d=th_d,
genes=genes_of_interest)
# To save the dataset as a registered dataset (to then look at it in the 3D visualizer)
embryo.save_anndata('path/to/out/registered.h5ad')
# To compute the 3D differential expression for selected tissues
tissues_to_process = [5, 10, 12, 18, 21, 24, 30, 31, 33, 34, 39]
th_vol = .025
_ = embryo.get_3D_differential_expression(tissues_to_process, th_vol)
The dataset used for the project this code is from can be downloaded there (under the name mouse_embryo_E8.5_merged_data
)
Many other functions are available that can be found used in the two provided jupyter notebooks.
Running the notebooks
Two example notebooks are provided. To run them one wants to first install the jupyter notebook:
conda install jupyter
or
pip install jupyter
The notebooks can the be started from a terminal in the folder containing the .ipynb
files with the following command:
jupyter notebook
The notebooks should be self content.
Note that the test dataset is not included in this repository put can be downloaded from there.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file sc-3D-1.1.0.tar.gz
.
File metadata
- Download URL: sc-3D-1.1.0.tar.gz
- Upload date:
- Size: 33.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d83de0dde6ce34faa8606a27939a98363b2b440a505b75d0197f7f2ab5c39e0b |
|
MD5 | df1d0f839734cca3cc73ff2c5ea6e979 |
|
BLAKE2b-256 | 83f9756fc854f03a1c19ee4ec1dcc0f855c5e1c54e8e3841c3bf91019feb79c9 |
File details
Details for the file sc_3D-1.1.0-py3-none-any.whl
.
File metadata
- Download URL: sc_3D-1.1.0-py3-none-any.whl
- Upload date:
- Size: 31.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 512cdb88367cda36fa20573631deea8669af507a93aef8597e45fca7d6581a48 |
|
MD5 | bb94d5cb8cbc274eff38803db1f54a80 |
|
BLAKE2b-256 | 2450edf01a7f866f286d2790c9f797c421295f6725950124bb66fc294fe7c898 |