Skip to main content

A unifying data integration framework.

Project description

Transmorph logo

PyPI version GitHub license Documentation Status Downloads Downloads

transmorph is a python framework dedicated to data integration, with a focus on single-cell applications. Dataset integration describes the problem of embedding two or more datasets together, across different batches or feature spaces, so that similar samples end up close from one another. In transmorph we aim to provide a comprehensive framework to design, apply, report and benchmark data integration models using a system of interactive building blocks supported by statistical and plotting tools. We included pre-built models as well as benchmarking databanks in order to easily set up integration tasks. This package can be used in compatibility with scanpy and anndata packages, and works in jupyter notebooks.

Transmorph workflow

Transmorph is also computationally efficient, and can scale to large datasets with competitive integration quality.

Documentation

https://transmorph.readthedocs.io/en/latest/

Installation

transmorph can be installed either from source of from the python repository PyPi. PyPi version is commonly more stable, but may not contain latest features, while you can find the development version on GitHub. Using a python environment is highly recommended (for instance pipenv) in order to easily handle dependencies and versions. transmorph has only be tested for python >=3.9, on Linux and Windows systems.

See the instructions: https://transmorph.readthedocs.io/en/latest/sections/installation.html

Quick starting with a pre-built model

All transmorph models take a list or a dictionary containing AnnData objects as input for data integration. Let us start by loading some benchmarking data, gathered from [Chen 2020] (3.4GB size).

from transmorph.datasets import load_chen_10x
chen_10x = load_chen_10x()

One can then either create a custom integration model, or load a pre-built transmorph model. We will choose the EmbedMNN model with default parameters for this example, which embeds all datasets into a common abstract 2D space.

from transmorph.models import EmbedMNN
model = EmbedMNN()
model.transform(chen_10x)

Integration embedding coordinates can be gathered in each AnnData object, in AnnData.obsm['X_transmorph'].

chen_10x['P01'].obsm['X_transmorph']

One can for instance use a plotting function from transmorph to display integration results.

from transmorph.utils.plotting import scatter_plot

scatter_plot(chen_10x, use_rep="transmorph")
scatter_plot(chen_10x, use_rep="transmorph", color_by="class")

[Chen 2020] Chen, Y. P., Yin, J. H., Li, W. F., Li, H. J., Chen, D. P., Zhang, C. J., ... & Ma, J. (2020). Single-cell transcriptomics reveals regulators underlying immune cell diversity and immune subtypes associated with prognosis in nasopharyngeal carcinoma. Cell research, 30(11), 1024-1042.

Citing transmorph

If you find transmorph useful for your research, please consider citing our pre-print which can be found on bioRxiv.

@article{fouche2022transmorph,
  title={transmorph: a unifying computational framework for single-cell data integration},
  author={Fouch{\'e}, Aziz, Chadoutaud, Lo{\¨i}c, Delattre, Olivier and Zinovyev, Andrei},
  journal={bioRxiv},
  year={2022}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

transmorph-0.2.5.tar.gz (687.7 kB view details)

Uploaded Source

Built Distribution

transmorph-0.2.5-py3-none-any.whl (172.1 kB view details)

Uploaded Python 3

File details

Details for the file transmorph-0.2.5.tar.gz.

File metadata

  • Download URL: transmorph-0.2.5.tar.gz
  • Upload date:
  • Size: 687.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.8

File hashes

Hashes for transmorph-0.2.5.tar.gz
Algorithm Hash digest
SHA256 0d651a9ac5b7a90f629c1f6661a9466a0851b6b61a95e5e55fb9b42cc26b0d6b
MD5 0d17a2906e0e83f8c905f13028e69da7
BLAKE2b-256 952adbfbb39a2b196ecd28273f8308184a85b6e9af9d923b03e050052b8e4bc2

See more details on using hashes here.

File details

Details for the file transmorph-0.2.5-py3-none-any.whl.

File metadata

  • Download URL: transmorph-0.2.5-py3-none-any.whl
  • Upload date:
  • Size: 172.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.8

File hashes

Hashes for transmorph-0.2.5-py3-none-any.whl
Algorithm Hash digest
SHA256 0f5ec128deb6c2f7e62bd3dba939ee2016e4507dd473880f11d098753f7f97c8
MD5 c8331cd24232c82dde8d4bf7b8a1b468
BLAKE2b-256 8ae476b8f927b5a1211268e1fba6b397c2696a2cac2da0cb4b4a682a99be5d42

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page