Skip to main content

A method to demultiplex hashtagged single-cell data.

Project description

HTO DND - Demultiplex Hashtag Data

PyPI version Build Status

hto is a Python package designed for efficient and accurate demultiplexing of hash-tagged oligonucleotides (HTOs) in single-cell data. It normalises based on observed background signal and denoises the data to remove batch effects and noise:

  • Normalization: Normalize HTO data using background signal, inspired by the DSB method (see citation below).
  • Denoising: Remove batch effects and noise from the data by regressing out cell by cell variation.
  • Demultiplexing: Cluster and classify cells into singlets, doublets, or negatives using clustering methods like k-means or Gaussian Mixture Models (GMM).

The package supports command-line interface (CLI) usage and Python imports.

HTO DND

Installation

Using pip:

pip install hto

From source:

git clone https://github.com/sail-mskcc/hto_dnd.git
cd hto_dnd
pip install .

Usage

Python API

The python API is built around AnnData. it is highly recommended two work with three AnnData objects:

  • adata_hto: Filtered AnnData object with HTO data, containing only actual cells.
  • adata_hto_raw: Raw AnnData object with HTO data, containing actual cells and background signal.
  • adata_gex: Raw AnnData object with gene expression data. This is optional and can be used to construct a more informative background signal.
import hto

# get mockdata
mockdata = hto.data.generate_hto(n_cells=1000, n_htos=3, seed=10)
adata_hto = mockdata["filtered"]
adata_hto_raw = mockdata["raw"]
adata_gex = mockdata["gex"]

# denoise, normalize, and demultiplex
adata_demux = hto.demultiplex(
  adata_hto,
  adata_hto_raw,
  adata_gex=adata_gex,
  inplace=False,
)

# see results
adata_demux.obs[["hash_id", "doublet_info"]].head()

Command-Line Interface (CLI)

The CLI provides an API for the hto demultiplex scripts. Make sure to define --adata-out to save the output.

hto demultiplex \
  --adata-hto /path/to/adata_hto.h5ad \
  --adata-hto-raw /path/to/adata_hto_raw.h5ad \
  --adata-gex /path/to/adata_gex.h5ad \
  --adata-out /path/to/output.h5ad

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hto-1.1.4.tar.gz (38.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hto-1.1.4-py3-none-any.whl (50.8 kB view details)

Uploaded Python 3

File details

Details for the file hto-1.1.4.tar.gz.

File metadata

  • Download URL: hto-1.1.4.tar.gz
  • Upload date:
  • Size: 38.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.12.9 Linux/4.18.0-425.19.2.el8_7.x86_64

File hashes

Hashes for hto-1.1.4.tar.gz
Algorithm Hash digest
SHA256 7e53edcc44a0e3be8221bac5a06f88329bd1c56536d9e7b9c9ae6833b0f7d7e8
MD5 57548427c9b7cb4225155282466b0aa8
BLAKE2b-256 6d32cdf6335a251dee33ce115b6e6985a1ad218f90f96d2596fd2c1b62816cf2

See more details on using hashes here.

File details

Details for the file hto-1.1.4-py3-none-any.whl.

File metadata

  • Download URL: hto-1.1.4-py3-none-any.whl
  • Upload date:
  • Size: 50.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.12.9 Linux/4.18.0-425.19.2.el8_7.x86_64

File hashes

Hashes for hto-1.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 53c5503712bf72450ee2b0c0d4f736d47dfe0e9139cf13ac7939a0d0021751fe
MD5 f844b5c4a6193aabd3b445396757d53f
BLAKE2b-256 568da34455940004bec1b9a992c17bc96df8db708e32096d3f683ad438d36f78

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page