Skip to main content

A marker-based cell type annotation method that combines the self-training strategy with pseudo-labeling and the nonlinear association capturing capability of Transformer.

Project description

sICTA: Interpretable Cell Type Annotation based on self-training

The full description of sICTA and its application on published single cell RNA-seq datasets are available.

Download archive with preprocessed data at: https://drive.google.com/drive/folders/1jbqSxacL_IDIZ4uPjq220C9Kv024m9eL.

The repository includes detailed installation instructions and requirements, scripts and demos.

1 The workflow of sICTA.

(a) Combining cell expression and marker gene specificity to generate pseudo-labels. (b) The downstream Transformer classifiers are first pre-trained based on cell type probability distributions (pseudo-labels), followed by iterative refinement of the classifiers through a self-training framework until convergence. The sICTA takes the a priori knowledge from the biological domain and uses masked learnable embeddings to transform the input data ($G$ genes) into $k$ input tokens representing each gene set (GS) and a class token (CLS).

2 Requirements

  • Linux/UNIX/Windows system
  • Python == 3.8.6
  • torch == 1.12.1
  • scanpy == 1.9.1

3 Usage

Data format

sICTA requires cell-gene matrix and cell type marker information to be entered in .h5ad object format.

Training

python main.py

We provide default data for users to understand and debug sICTA code.

Reference

If you use sICTA in your work, please cite

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sicta-0.0.2.tar.gz (15.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sICTA-0.0.2-py3-none-any.whl (16.9 kB view details)

Uploaded Python 3

File details

Details for the file sicta-0.0.2.tar.gz.

File metadata

  • Download URL: sicta-0.0.2.tar.gz
  • Upload date:
  • Size: 15.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10

File hashes

Hashes for sicta-0.0.2.tar.gz
Algorithm Hash digest
SHA256 5bb0e790806e8358b2c13d4a23c87d72646558e4e53bc5c133d7576b39c3b71e
MD5 933fe10fd0813c8a2ede6c58c63aaef3
BLAKE2b-256 27729f2d942948218cbf978103a792600c8d8e8fd6230007858d3123b2daaab9

See more details on using hashes here.

File details

Details for the file sICTA-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: sICTA-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 16.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10

File hashes

Hashes for sICTA-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 342f3064d1bfb98bed314b1b7d0dce22b7eaeded75c5638bb582107db059b5e1
MD5 cb71e4b082633501dba0b735dc7f039f
BLAKE2b-256 8d5283bb70f9bcf2e80432164a3ec86490ace5c148280a720c814227144b03d9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page