A package for matching and integrating multi-modal single cell data with shared and distinct features
MARIO: single-cell proteomic data matching and integration pipeline
Description
This GitHub repo includes mario-py and mario-R. MARIO is a Python package for matching and integrating multi-modal single-cell data with partially overlapping features. The method is specifically tailored toward proteomic datasets. For a detailed description of the algorithm, including the core methodology, mathematical ingredients, applications to various biological samples, and extensive benchmarking, please refer to the paper.
This work has been led by Shuxiao Chen from Zongming Lab @UPenn and Bokai Zhu from Nolan Lab @Stanford.
Getting Started
Dependencies
For easy usage, we suggest building a conda virtual environment with Python 3.8:
conda create -n mario python=3.8
Installing
To install MARIO, use pip (the package name is pyMARIO):
python -m pip install pyMARIO
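After installation, a quick check can confirm that the active interpreter is the suggested Python 3.8 and that the package can be found. The sketch below is only an illustration: the helper name `check_environment` is hypothetical, and the module name `mario` is taken from the import statement used in the quick example.

```python
import importlib.util
import sys


def check_environment(module_name="mario", expected=(3, 8)):
    """Report whether the interpreter version and the module look as expected.

    `check_environment` is a hypothetical helper, not part of MARIO.
    """
    # Compare only the major and minor version numbers.
    python_ok = sys.version_info[:2] == tuple(expected)
    # find_spec returns None when a top-level module cannot be located.
    module_found = importlib.util.find_spec(module_name) is not None
    return {"python_ok": python_ok, "module_found": module_found}


if __name__ == "__main__":
    status = check_environment()
    print("Python 3.8 active:", status["python_ok"])
    print("mario importable:", status["module_found"])
```

If `module_found` is false, the most likely cause is that pip installed the package into a different environment than the one currently activated.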
How to use
Quick example:
To use MARIO in Python:
from mario.match import pipelined_mario
final_matching_lst, embedding_lst = pipelined_mario(data_lst=[df1, df2])
Here df1 and df2 are the two dataframes to match and integrate, with rows as cells and columns as features. For shared features, the column names must be identical across dataframes. The input list can contain more than two dataframes, as MARIO accommodates matching and integration of multiple datasets.
The result contains a matching list (matching) and an embedding list (integration). For detailed usage, please refer to the Full tutorial section.
Similarly, to use MARIO in R (with the reticulate package):
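Because shared features are recognized by identical column names, it can help to inspect the column names before calling `pipelined_mario`. The following is a minimal sketch, not part of MARIO: `split_features` is a hypothetical helper, the marker names are made up for illustration, and it treats a feature as shared only when every dataset contains it, which is an assumption of this example rather than a statement about how MARIO handles multiple datasets.

```python
from typing import Iterable, List, Tuple


def split_features(*column_lists: Iterable[str]) -> Tuple[List[str], List[List[str]]]:
    """Split column names into shared features and per-dataset distinct features.

    Hypothetical helper for illustration; names are compared exactly,
    so "CD3" and "cd3" count as different features.
    """
    columns = [list(cols) for cols in column_lists]
    if len(columns) < 2:
        raise ValueError("need at least two lists of column names")
    # Features present in every dataset.
    shared_set = set(columns[0]).intersection(*columns[1:])
    # Preserve the column order of the first dataset for the shared list.
    shared = [c for c in columns[0] if c in shared_set]
    distinct = [[c for c in cols if c not in shared_set] for cols in columns]
    return shared, distinct


if __name__ == "__main__":
    # Made-up marker panels standing in for df1.columns and df2.columns.
    cols_a = ["CD3", "CD4", "CD8", "CD19"]
    cols_b = ["CD3", "CD4", "CD56"]
    shared, distinct = split_features(cols_a, cols_b)
    print("shared:", shared)
    print("distinct:", distinct)
```

With pandas dataframes, the same helper could be called as `split_features(df1.columns, df2.columns)`. An empty shared list usually points to a naming mismatch (for example, differing case or whitespace) that should be fixed before matching.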
library(reticulate)
myenvs <- conda_list() # list the available conda environments
envname <- myenvs$name[12] # select the environment where MARIO is installed; the index 12 is only an example and will differ on your machine
use_condaenv(envname, required = TRUE)
mario.match <- import("mario.match") # import the main mario-py module
pipelined_res <- mario.match$pipelined_mario(data_lst = list(df1, df2))
Here too, the result contains the matching list and the embedding list.
Full tutorial:
For step-by-step tutorials on how to use MARIO, with fine-tuned parameters for optimal results and full functionality, please refer to the following documents:
Python - Jupyter notebook: Match and Integration of Human Bonemarrow datasets
Python - Jupyter notebook: Match and Integration of multiple Xspecies datasets
R - Rmarkdown: Match and Integration of Human Bonemarrow datasets
License and Citation
MARIO is released under the Academic Software License Agreement; please use it accordingly.
File details
Details for the file pyMARIO-0.0.2.tar.gz.
File metadata
- Download URL: pyMARIO-0.0.2.tar.gz
- Upload date:
- Size: 23.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.0 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.8
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | b7bf9845288527722c6e8850ed8741ddd9ffdd1f8f84c6a4df770d63acda8769 |
| MD5 | 143c67f4857dd0ef0006a4985af6fe00 |
| BLAKE2b-256 | 5a101942bf6d79299fc044ace37ea707ee71530e13d464d2dbcdceb456084aeb |
File details
Details for the file pyMARIO-0.0.2-py3-none-any.whl.
File metadata
- Download URL: pyMARIO-0.0.2-py3-none-any.whl
- Upload date:
- Size: 23.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.0 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.8
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 724c3bccdec6355d001691f02a31ed8ebe77d19f13ffe2df59ae305e17206fd0 |
| MD5 | 0b11280501ed50c78cbf9b7385c1e3f8 |
| BLAKE2b-256 | a4f7859774c1c8a10382a48c4c6e24b02c7e6743e50c757c36a27238b7338164 |
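A manually downloaded file can be compared against the SHA256 digests listed under File hashes using only the Python standard library. This is a minimal sketch: the helper names `sha256_of_file` and `matches_listed_digest` are hypothetical, and the file is assumed to sit in the current directory under its original name.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the File hashes tables above.
EXPECTED_SHA256 = {
    "pyMARIO-0.0.2.tar.gz": "b7bf9845288527722c6e8850ed8741ddd9ffdd1f8f84c6a4df770d63acda8769",
    "pyMARIO-0.0.2-py3-none-any.whl": "724c3bccdec6355d001691f02a31ed8ebe77d19f13ffe2df59ae305e17206fd0",
}


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_digest(path):
    """Check a downloaded file against the digest listed for its file name."""
    path = Path(path)
    expected = EXPECTED_SHA256.get(path.name)
    if expected is None:
        raise KeyError(f"no listed digest for {path.name}")
    return sha256_of_file(path) == expected


if __name__ == "__main__":
    target = Path("pyMARIO-0.0.2.tar.gz")  # assumed download location
    if target.exists():
        print("digest matches:", matches_listed_digest(target))
    else:
        print("file not found:", target)
```

A mismatch means the file differs from the one described above and should be downloaded again rather than installed.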