Skip to main content

A toolkit package of data integration

Project description

python >3.8.8 Downloads Downloads Downloads Documentation Status

A toolkit for data integration


Behold, a magnificent toolkit emerges, designed to seamlessly integrate the vast realm of data in the realm of single-cell genomics. Within its hallowed digital confines, this toolkit bestows upon researchers the power to harmoniously merge and analyze the intricate tapestry of cellular information. Crafted with meticulous care, this toolkit embodies the epitome of elegance and efficiency. It serves as a conduit, enabling the synthesis of diverse datasets from single-cell genomics experiments, unlocking new realms of knowledge and understanding. The power of data integration within this toolkit extends beyond mere aggregation. It empowers researchers to unravel the intricate web of cellular interactions, uncovering hidden patterns, identifying novel cell types, and discerning the complex dynamics that govern cellular behavior. Through its refined algorithms and advanced statistical techniques, this toolkit illuminates the path towards deeper insights and discoveries. It refines and enhances the quality of data, mitigating confounding factors and removing noise, thus unveiling the true essence of the cellular landscape. As the sun sets on the horizon of single-cell genomics, this toolkit emerges as a guiding beacon, illuminating the path towards a more comprehensive understanding of cellular complexity. Embrace its power and unlock the secrets hidden within the realm of single-cell genomics.

Dependence

torch-1.10.0 pandas-1.2.4 scikit-learn-0.24 scipy-0.12.x scanpy-1.9.1

Install

The stereoAlign python package is available on Pypi and can be installed through

pip install stereoAlign

Import stereoAlign in python

import stereoAlign

We created the python package called stereoAlign that uses scanpy to streamline the integration of single-cell datasets and evaluate the results. The package contains several modules for preprocessing an anndata object, running integration methods and evaluating the resulting using a number of metrics. Functions for the data integration methods are in stereoAlign.alignment or for short stereoAlign.alg and metrics are under stereoAlign.metrics.

Tutorials

Quick Start https://stereoalign-tutorial.readthedocs.io/en/latest/index.html#

Integration Tools

This repository contains the code for the stereoAlign package for data integration tools.
This toolkit that is compared include:

Integration Method Invoke Output Recommendation index Describe Language
Harmony stereoAlign.alg.harmony_alignment Embedding *** Leverage iterative clustering with maximum diversity for batch correction and integration. Python / R
Scanorama stereoAlign.alg.scanorama_alignment Embedding **** Leverage computer vision algorithms for panorama stitching matches for batch correction and integration. Python
scGEN stereoAlign.alg.scgen_alignment Features **** Use variational autoencoders to reduce the dimension of gene expression matrix, and apply extra label to batch correction and integrations. Python
scvi stereoAlign.alg.scvi_alignment Embedding **** Use variational autoencoders to batch correct and integrate. Python
MNN stereoAlign.alg.mnn_alignment Features ** Leverage mutual nearest neighbor for batch correction and integration. Python
BBKNN stereoAlign.alg.bbknn_alignment Graph *** BBKNN taking each cell and identifying a (smaller) k nearest neighbours in each batch separately, and merged into a final neighbour list for the cell. Python
Combat stereoAlign.alg.combat_alignment Features * Leverage empirical Bayesian model for batch correction and integration. Python
DESC stereoAlign.alg.desc_alignment Embedding ** DESC eventually eliminates batch effects by recurrent self-learning, as long as technological changes between batches are fewer than real biological variances. Python
PRECAST Embedding **** PRECAST is a probabilistic model-based approach that integrates SRT datasets from multiple tissue slides. R
spatiAlign stereoAlign.alg.spatialign_alignment Embedding / Features ***** spatiAlign is an unsupervised across domain adapatation methods for SRT datasets batch correction and integration. Python
SCALEX stereoAlign.alg.scalex_alignment Embedding ** SCALEX leverage domain specific batch normalization for batch correction and integration. Python

Metric

Metric Method Invoke
ARI stereoAlign.metrics.cal_ari
Graph connectivity stereoAlign.metrics.cal_graph_connectivity
KBET stereoAlign.metrics.cal_kbet
LISI stereoAlign.metrics.cal_lisi
Silhouette stereoAlign.metrics.cal_silhouette

Disclaimer

This is not an official product.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

stereoAlign-0.0.3-py3-none-any.whl (30.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page