A marker-based cell type annotation method that combines the self-training strategy with pseudo-labeling and the nonlinear association capturing capability of Transformer.
Project description
sICTA: Interpretable Cell Type Annotation based on self-training
The full description of sICTA and its application on published single cell RNA-seq datasets are available.
Download archive with preprocessed data at: https://drive.google.com/drive/folders/1jbqSxacL_IDIZ4uPjq220C9Kv024m9eL.
The repository includes detailed installation instructions and requirements, scripts and demos.
1 The workflow of sICTA.
(a) Combining cell expression and marker gene specificity to generate pseudo-labels. (b) The downstream Transformer classifiers are first pre-trained based on cell type probability distributions (pseudo-labels), followed by iterative refinement of the classifiers through a self-training framework until convergence. The sICTA takes the a priori knowledge from the biological domain and uses masked learnable embeddings to transform the input data ($G$ genes) into $k$ input tokens representing each gene set (GS) and a class token (CLS).
2 Requirements
- Linux/UNIX/Windows system
- Python == 3.8.6
- torch == 1.12.1
- scanpy == 1.9.1
3 Usage
Data format
sICTA requires cell-gene matrix and cell type marker information to be entered in .h5ad object format.
Training
python main.py
We provide default data for users to understand and debug sICTA code.
Reference
If you use sICTA in your work, please cite
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sicta-0.0.1.tar.gz.
File metadata
- Download URL: sicta-0.0.1.tar.gz
- Upload date:
- Size: 15.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
23db9a8981c07c0071d21f13f827ca1d9bb3e3bd730efdcbcbb5dc048404b3a3
|
|
| MD5 |
77412d4cea8b7cfedae9ba67ad8ce78e
|
|
| BLAKE2b-256 |
b0969170aaa8ae4724d2ed680fe9c08306aef6dc3c78d2781b66036f37dec151
|
File details
Details for the file sICTA-0.0.1-py3-none-any.whl.
File metadata
- Download URL: sICTA-0.0.1-py3-none-any.whl
- Upload date:
- Size: 16.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
79e2181c115befaae4edb7815d90aed22b236ad602f88b3548c47606cd80d675
|
|
| MD5 |
bcd5a12ddfcaf5f9b281231b89ff4b71
|
|
| BLAKE2b-256 |
e0abf81555fcad6a1639a28d6d9419532b173143c121e0927e654bcc188963a1
|