Skip to main content

Package of CellPLM: A pretrain-ed cell language model beyond single cells. Paper link: https://www.biorxiv.org/content/10.1101/2023.10.03.560734

Reason this release was yanked:

Incorrect setup

Project description

CellPLM

This is the official codebase for CellPLM: Pre-training of Cell Language Model Beyond Single Cells.

Preprint License

CellPLM is the first single-Cell Pre-trained Language Model that encodes cell-cell relations and it consistently outperforms existing pre-trained and non-pre-trained models in diverse downstream tasks, with 100x higher inference speed compared to existing pre-trained models.

Installation

We plan to release our project on PyPI soon. For now, please follow the instruction below to set up the environment for CellPLM:

Quick Installaton

git clone git@github.com:OmicsML/cellplm.git && cd cellplm
pip install -r requirements.txt

Complete Installation (recommended for HPC users)

conda create -n cellplm python=3.9 -y && conda activate cellplm
conda install cudatoolkit=11.7 -c pytorch -c nvidia
pip install torch==1.13.0+cu117 --extra-index-url https://download.pytorch.org/whl/cu117
pip install einops ipdb pydance torchmetrics wandb hdf5plugin dgl mygene

We recommend using python 3.9 and cuda 11.7 but they are adjustable.

Tutorials

We offer several notebooks for various downstream tasks as introductory tutorials.

We are also working on developing more streamlined protocols for supported tasks and a comprehensive documentation. We aim to release these by the end of the year.

Pretrained CellPLM Model Checkpoints

The checkpoint can be acquired from our dropbox. We might update our checkpoints from time to time.

[10/10/2023] The latest version is 20230926_85M.

Citation

@article{wen2023cellplm,
  title={CellPLM: Pre-training of Cell Language Model Beyond Single Cells},
  author={Wen, Hongzhi and Tang, Wenzhuo and Dai, Xinnan and Ding, Jiayuan and Jin, Wei and Xie, Yuying and Tang, Jiliang},
  journal={bioRxiv},
  pages={2023--10},
  year={2023},
  publisher={Cold Spring Harbor Laboratory}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cellplm-0.1.0.tar.gz (3.5 kB view details)

Uploaded Source

Built Distribution

cellplm-0.1.0-py3-none-any.whl (3.5 kB view details)

Uploaded Python 3

File details

Details for the file cellplm-0.1.0.tar.gz.

File metadata

  • Download URL: cellplm-0.1.0.tar.gz
  • Upload date:
  • Size: 3.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.15

File hashes

Hashes for cellplm-0.1.0.tar.gz
Algorithm Hash digest
SHA256 bd9489bab227797a151f21ba7bd0d1a86e9795991f1650a477ee0c67619e415e
MD5 353c25697f57149a575122983f0e333c
BLAKE2b-256 a2046742bfdc7b29b04f6ed1aba780c7b821559ef7fdc474ec4f6a2ebf6f5f2a

See more details on using hashes here.

File details

Details for the file cellplm-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: cellplm-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 3.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.15

File hashes

Hashes for cellplm-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 06506bf988776a4672d751c8208563ce59a67b64fd8682d84c3ff972583eae15
MD5 f6fbea67c761a2edcdcafaa67463803a
BLAKE2b-256 3ca2b423c43f215325dc371fc0ef22cedc06c9ac7fec4eea51c2aa6b5da69e28

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page