Skip to main content

A multiplet removal tool for processing cell hashing data

Project description

GMM-Demux

A Gaussian Mixture Model based software for processing cell hashing data.

Blow shows an example classification result. Orange dots are multi-sample multiplets.

GMM-Demux example

Description

GMM-Demux removes Multi-Sample-Multiplets (MSMs) in a cell hashing dataset and estimates the fraction of Same-Sample-Multiplets (SSMs) and singlets in the remaining dataset.

Features

  • Remove cell-hashing-identifiable multiplets from the dataset.
  • Estimate the fraction of cell-hashing-unidentifiable multiplets in the remaining dataset (the RSSM value).

Authors

Hongyi Xin, Qi Yan, Yale Jiang, Jiadi Luo, Carla Erb, Richard Duerr, Kong Chen* and Wei Chen*

Maintainer

Hongyi Xin xhongyi@pitt.edu

Requirement

GMM-Demux requires python3 (>3.5) installed in the system.

Install

GMM-Demux can be directly installed from PIPy. Or it can be built and installed locally.

  • Install GMM-Demux from PIPy.
pip3 install --user GMM_Demux
  • Install GMM-Demux locally using setuptools and pip3.
cd <GMM-Demux dir>
python3 setup.py sdist bdist_wheel
pip3 install --user . 

Usage

GMM-demux <cell_hashing_path> <HTO_names> <estimated_cell_num>

MSM-free droplets are stored in folder GMM_Demux_mtx by default.

Example Usage

An example cell hashing data is provided in example_input. <HTO_names> can be obtained from the features.tsv file.

python3 GMM-demux.py example_input/outs/filtered_feature_bc_matrix HTO_1,HTO_2,HTO_3,HTO_4 35685

<HTO_names> are obtained from the features.tsv file. An example is shown below.

HTO names example

Optional Arguments

  • -h: show help information.
  • -f FULL, --full FULL Generate the full classification report. Require a path argument.
  • -s SIMPLIFIED, --simplified SIMPLIFIED Generate the simplified classification report. Require a path argument.
  • -o OUTPUT, --output OUTPUT Specify the folder to store the result. Require a path argument.
  • -r REPORT, --report REPORT Specify the file to store summary report. Require a file argument.

Output Values

  • CellRanger MSM-free drops, in MTX format. Compatible with CellRanger 3.0.
  • Dataset summary. An example summary is shown below. Summary example

Output Explanation

  • MSM denotes the percentage of identified and removed multiplets among all droplets.
  • SSM denotes the percentage of unidentifiable multiplets among all droplets.
  • RSSM denotes the percentage of multiplets among the output droplets (after removing identifiable multiplets). RSSM measures the quality of the cell hashing dataset.

Online Cell Hashing Experiment Planner

A GMM-Demux based online cell hashing experiment planner is publically accessible at here.

Online explanner example

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

GMM_Demux-0.0.3.tar.gz (7.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

GMM_Demux-0.0.3-py3-none-any.whl (10.6 kB view details)

Uploaded Python 3

File details

Details for the file GMM_Demux-0.0.3.tar.gz.

File metadata

  • Download URL: GMM_Demux-0.0.3.tar.gz
  • Upload date:
  • Size: 7.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.20.1 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.26.0 CPython/3.6.7

File hashes

Hashes for GMM_Demux-0.0.3.tar.gz
Algorithm Hash digest
SHA256 1eaf043cada2bd0f744a678247f79a0c005a1b22ca3377ca3afae46c02930a9c
MD5 ce23f3b6dcea01eefd6552af6b078ffa
BLAKE2b-256 8c7be7b8b37a979ba10271a3337153f13276d8e2a6c7a62a24ec0a11cb916d69

See more details on using hashes here.

File details

Details for the file GMM_Demux-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: GMM_Demux-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 10.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.20.1 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.26.0 CPython/3.6.7

File hashes

Hashes for GMM_Demux-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 ae68cc5da113ac1d02af83d316a0872f78704c38139d3bc9817858e05a907a68
MD5 23d3583ccb89f5f85b1613b617e2110d
BLAKE2b-256 9a3c4583f11fa9660b32a4b8e3c3cab539d2c2fc053e21fb14937ad8dcfa20c0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page