Skip to main content

A method to demultiplex hashtagged single-cell data.

Project description

HTO DND - Demultiplex Hashtag Data

PyPI version Build Status

hto is a Python package designed for efficient and accurate demultiplexing of hash-tagged oligonucleotides (HTOs) in single-cell data. It normalises based on observed background signal and denoises the data to remove batch effects and noise:

  • Normalization: Normalize HTO data using background signal, inspired by the DSB method (see citation below).
  • Denoising: Remove batch effects and noise from the data by regressing out cell by cell variation.
  • Demultiplexing: Cluster and classify cells into singlets, doublets, or negatives using clustering methods like k-means or Gaussian Mixture Models (GMM).

The package supports command-line interface (CLI) usage and Python imports.

HTO DND

Installation

Using pip:

pip install hto

From source:

git clone https://github.com/sail-mskcc/hto_dnd.git
cd hto_dnd
pip install .

Usage

Python API

The python API is built around AnnData. it is highly recommended two work with three AnnData objects:

  • adata_hto: Filtered AnnData object with HTO data, containing only actual cells.
  • adata_hto_raw: Raw AnnData object with HTO data, containing actual cells and background signal.
  • adata_gex: Raw AnnData object with gene expression data. This is optional and can be used to construct a more informative background signal.
import hto

# get mockdata
mockdata = hto.data.generate_hto(n_cells=1000, n_htos=3, seed=10)
adata_hto = mockdata["filtered"]
adata_hto_raw = mockdata["raw"]
adata_gex = mockdata["gex"]

# denoise, normalize, and demultiplex
adata_demux = hto.demultiplex(
  adata_hto,
  adata_hto_raw,
  adata_gex=adata_gex,
  inplace=False,
)

# see results
adata_demux.obs[["hash_id", "doublet_info"]].head()

Command-Line Interface (CLI)

The CLI provides an API for the hto demultiplex scripts. Make sure to define --adata-out to save the output.

hto demultiplex \
  --adata-hto /path/to/adata_hto.h5ad \
  --adata-hto-raw /path/to/adata_hto_raw.h5ad \
  --adata-gex /path/to/adata_gex.h5ad \
  --adata-out /path/to/output.h5ad

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hto-1.1.5.tar.gz (40.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hto-1.1.5-py3-none-any.whl (52.5 kB view details)

Uploaded Python 3

File details

Details for the file hto-1.1.5.tar.gz.

File metadata

  • Download URL: hto-1.1.5.tar.gz
  • Upload date:
  • Size: 40.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.12.9 Linux/4.18.0-425.19.2.el8_7.x86_64

File hashes

Hashes for hto-1.1.5.tar.gz
Algorithm Hash digest
SHA256 7280e26013aae1fd5aba02c378821d66f17279214646b07974e1eaadaeea3f10
MD5 0aa1fd7bf90055b60685dc0f8c480f0f
BLAKE2b-256 fcd25f6883c632fac284bb91eddd0a64cba616ae9fecd0c14fea56e7726fce04

See more details on using hashes here.

File details

Details for the file hto-1.1.5-py3-none-any.whl.

File metadata

  • Download URL: hto-1.1.5-py3-none-any.whl
  • Upload date:
  • Size: 52.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.12.9 Linux/4.18.0-425.19.2.el8_7.x86_64

File hashes

Hashes for hto-1.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 f1d1453da45600f48f037521697d85686c59293a422cb9137c3d18e6d70c4fd8
MD5 f63ad5533d16db6f620ea27c6fb5f128
BLAKE2b-256 0fa969a59eddd6e4b6a5ea6359dc84ab3e01c5ae5f06a005cdddfdb883a3f255

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page