Package of CellPLM: A pretrain-ed cell language model beyond single cells. Paper link: https://www.biorxiv.org/content/10.1101/2023.10.03.560734
Reason this release was yanked:
Incorrect setup
Project description
CellPLM
This is the official codebase for CellPLM: Pre-training of Cell Language Model Beyond Single Cells.
CellPLM is the first single-Cell Pre-trained Language Model that encodes cell-cell relations and it consistently outperforms existing pre-trained and non-pre-trained models in diverse downstream tasks, with 100x higher inference speed compared to existing pre-trained models.
Installation
We plan to release our project on PyPI soon. For now, please follow the instruction below to set up the environment for CellPLM:
Quick Installaton
git clone git@github.com:OmicsML/cellplm.git && cd cellplm
pip install -r requirements.txt
Complete Installation (recommended for HPC users)
conda create -n cellplm python=3.9 -y && conda activate cellplm
conda install cudatoolkit=11.7 -c pytorch -c nvidia
pip install torch==1.13.0+cu117 --extra-index-url https://download.pytorch.org/whl/cu117
pip install einops ipdb pydance torchmetrics wandb hdf5plugin dgl mygene
We recommend using python 3.9
and cuda 11.7
but they are adjustable.
Tutorials
We offer several notebooks for various downstream tasks as introductory tutorials.
We are also working on developing more streamlined protocols for supported tasks and a comprehensive documentation. We aim to release these by the end of the year.
Pretrained CellPLM Model Checkpoints
The checkpoint can be acquired from our dropbox. We might update our checkpoints from time to time.
[10/10/2023] The latest version is 20230926_85M
.
Citation
@article{wen2023cellplm,
title={CellPLM: Pre-training of Cell Language Model Beyond Single Cells},
author={Wen, Hongzhi and Tang, Wenzhuo and Dai, Xinnan and Ding, Jiayuan and Jin, Wei and Xie, Yuying and Tang, Jiliang},
journal={bioRxiv},
pages={2023--10},
year={2023},
publisher={Cold Spring Harbor Laboratory}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file cellplm-0.1.0.tar.gz
.
File metadata
- Download URL: cellplm-0.1.0.tar.gz
- Upload date:
- Size: 3.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bd9489bab227797a151f21ba7bd0d1a86e9795991f1650a477ee0c67619e415e |
|
MD5 | 353c25697f57149a575122983f0e333c |
|
BLAKE2b-256 | a2046742bfdc7b29b04f6ed1aba780c7b821559ef7fdc474ec4f6a2ebf6f5f2a |
File details
Details for the file cellplm-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: cellplm-0.1.0-py3-none-any.whl
- Upload date:
- Size: 3.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 06506bf988776a4672d751c8208563ce59a67b64fd8682d84c3ff972583eae15 |
|
MD5 | f6fbea67c761a2edcdcafaa67463803a |
|
BLAKE2b-256 | 3ca2b423c43f215325dc371fc0ef22cedc06c9ac7fec4eea51c2aa6b5da69e28 |