Skip to main content

A package for matching and integrating multi-modal single cell data with shared and distinct features

Project description

MARIO: single-cell proteomic data matching and integration pipeline

Description

This github repo includes mario-py and mario-R, which is a Python package for matching and integrating multi-modal single cell data with partially overlapping features. The method is specifically tailored toward proteomic datasets, and for detailed description on the algorithm, including the core methodology, mathmetical ingredients, application on various biological samples, and extensive benchmarking, please refer to the paper.

This work has been lead by Shuxiao Chen from Zongming Lab @Upenn and Bokai Zhu from Nolan lab @Stanford.

Getting Started

Dependencies

For easy usage, we suggest builing a conda virtualenv with python = 3.8.

conda create -n mario python=3.8

Installing

To install MARIO, we can easily install it with pip function (package name pyMARIO):

python -m pip install pyMARIO

How to use

Quick example:

To use in MARIO in python :

from mario.match import pipelined_mario
final_matching_lst, embedding_lst = pipelined_mario(data_lst=[df1, df2])

Where df1 and df2 are two dataframes for match and integration, with row as cells, columns as features. Remember for shared features, the column names should be identical. Input list can be multiple dataframes, as MARIO accomodates for multiple dataset match and integration.

The result contains the a matching list (matching), and a embedding list (integration). For detailed usage please refer to the Full tutorial section.

Similarly, to use in MARIO in R (with package reticulate) :

library(reticulate)
myenvs=conda_list() # get conda virtualenv list
envname=myenvs$name[12] # specify which virtualenv to use, should use the one for MARIO
use_condaenv(envname, required = TRUE)
mario.match <- import("mario.match") # import main mario-py module

pipelined_res = mario.match$pipelined_mario(data_lst=list(df1, df2))

Where the result also contains the matching list and embedding list.

Full tutorial:

For step by step tutorials on how to use MARIO, with fine-tuned parameters for optimal results and full functionality, please refer to the documents we provided here:

Python - Jupyter notebook: Match and Integration of Human Bonemarrow datasets

Python - Jupyter notebook: Match and Integration of multiple Xspecies datasets

R - Rmarkdown: Match and Integration of Human Bonemarrow datasets

License and Citation

MARIO is under the Academic Software License Agreement, please use accordingly.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyMARIO-0.0.2.tar.gz (23.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyMARIO-0.0.2-py3-none-any.whl (23.6 kB view details)

Uploaded Python 3

File details

Details for the file pyMARIO-0.0.2.tar.gz.

File metadata

  • Download URL: pyMARIO-0.0.2.tar.gz
  • Upload date:
  • Size: 23.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.0 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.8

File hashes

Hashes for pyMARIO-0.0.2.tar.gz
Algorithm Hash digest
SHA256 b7bf9845288527722c6e8850ed8741ddd9ffdd1f8f84c6a4df770d63acda8769
MD5 143c67f4857dd0ef0006a4985af6fe00
BLAKE2b-256 5a101942bf6d79299fc044ace37ea707ee71530e13d464d2dbcdceb456084aeb

See more details on using hashes here.

File details

Details for the file pyMARIO-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: pyMARIO-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 23.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.0 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.8

File hashes

Hashes for pyMARIO-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 724c3bccdec6355d001691f02a31ed8ebe77d19f13ffe2df59ae305e17206fd0
MD5 0b11280501ed50c78cbf9b7385c1e3f8
BLAKE2b-256 a4f7859774c1c8a10382a48c4c6e24b02c7e6743e50c757c36a27238b7338164

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page