Skip to main content

A marker-based cell type annotation method that combines the self-training strategy with pseudo-labeling and the nonlinear association capturing capability of Transformer.

Project description

sICTA: Interpretable Cell Type Annotation based on self-training

The full description of sICTA and its application on published single cell RNA-seq datasets are available.

Download archive with preprocessed data at: https://drive.google.com/drive/folders/1jbqSxacL_IDIZ4uPjq220C9Kv024m9eL.

The repository includes detailed installation instructions and requirements, scripts and demos.

1 The workflow of sICTA.

(a) Combining cell expression and marker gene specificity to generate pseudo-labels. (b) The downstream Transformer classifiers are first pre-trained based on cell type probability distributions (pseudo-labels), followed by iterative refinement of the classifiers through a self-training framework until convergence. The sICTA takes the a priori knowledge from the biological domain and uses masked learnable embeddings to transform the input data ($G$ genes) into $k$ input tokens representing each gene set (GS) and a class token (CLS).

2 Requirements

  • Linux/UNIX/Windows system
  • Python == 3.8.6
  • torch == 1.12.1
  • scanpy == 1.9.1

Topic_gene_embedding

3 Usage

Data format

sICTA requires cell-by-cell-gene matrix and cell type information to be entered in csv object format. We provide default data for users to understand and debug sICTA code.

Installation and implementation

Installation via github:

Download sICTA via github clone, you can run it directly by main.py file.

python main.py

Installation via PyPI:

After installing and importing sICTA via PyPI, a notebook tutorial can be found at tutorial.ipynb.

python -m venv sICTA-env
source sICTA-env/bin/activate 
pip install sICTA

Reference

If you use sICTA in your work, please cite

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sicta-1.0.0.tar.gz (6.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sICTA-1.0.0-py3-none-any.whl (750.0 kB view details)

Uploaded Python 3

File details

Details for the file sicta-1.0.0.tar.gz.

File metadata

  • Download URL: sicta-1.0.0.tar.gz
  • Upload date:
  • Size: 6.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10

File hashes

Hashes for sicta-1.0.0.tar.gz
Algorithm Hash digest
SHA256 8bb7f5e2fb8ab2fdc72bc1465709133fc858634b7fbfda55fc58dd2e19c9c763
MD5 436cc5670fa09ffdbe2f72455d5f2998
BLAKE2b-256 7ba6d0b114896c44395504a6202f8d4aa79e02e3ddfc02ef687ba46c0f83c5c1

See more details on using hashes here.

File details

Details for the file sICTA-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: sICTA-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 750.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10

File hashes

Hashes for sICTA-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0670f572069da8449b3b4b0697536779bac5f876a676ef5b7a773f0e2a1732ce
MD5 5bc98b491218ff2c80d1ddd6e4380b5f
BLAKE2b-256 1235a67c5d78bf65c6301be5a31d93e6f2d9e325e697bb6c697f6c03256576a9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page