Skip to main content

BENchmarking Transformer-Obtained Single-Cell embeddings

Project description

bento-sc

BENchmarking Transformer-Obtained Single-Cell representations.

PyPi Version GitHub license Documentation

Single-cell language modeling

This package contains routines and definitions for pre-training single-cell (transcriptomic) language models.

Package features:

  • Memory-efficient scRNA-seq dataloading from h5torch-compatible HDF5 files.
  • yaml-configurable language model training scripts.
  • Modular and extendable data preprocessing pipelines.
  • A diverse set of downstream tasks to evaluate scLM performance.
  • Full reproducibility instructions of our study results via bento-sc-reproducibility.

Install

bento-sc is distributed on PyPI.

pip install bento-sc

Note: The package has been tested with torch==2.2.2 and pytorch-lightning==2.2.5. If you encounter errors with bento-sc using more recent version of these two packages, consider downgrading.

You may need to install PyTorch before running this command in order to ensure the right CUDA kernels for your system are installed.

Package usage and structure

Please refer to our documentation page.

Academic reproducibility

All config files and scripts that were used to pre-train models and fine-tune them towards downstream tasks are included in a separate GitHub repository: bento-sc-reproducibility.

In addition, all scripts to reproduce the "baselines" in our study are located in the bento-sc-reproducibility repository.

Citation

:eyes: :eyes: :eyes:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bento_sc-0.0.8.tar.gz (10.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bento_sc-0.0.8-py3-none-any.whl (10.8 MB view details)

Uploaded Python 3

File details

Details for the file bento_sc-0.0.8.tar.gz.

File metadata

  • Download URL: bento_sc-0.0.8.tar.gz
  • Upload date:
  • Size: 10.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for bento_sc-0.0.8.tar.gz
Algorithm Hash digest
SHA256 cdbca503297b38dd7959bbd6905d450d42dc8519c8d78725634e522da0fb52ed
MD5 f89b66cca893044faecef0a34daa30b8
BLAKE2b-256 e56a29b149cb48d404651f0371ccf8b9f5e2fda4dbd16747bf9421bd2d4b5c93

See more details on using hashes here.

File details

Details for the file bento_sc-0.0.8-py3-none-any.whl.

File metadata

  • Download URL: bento_sc-0.0.8-py3-none-any.whl
  • Upload date:
  • Size: 10.8 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for bento_sc-0.0.8-py3-none-any.whl
Algorithm Hash digest
SHA256 b3591a244df3fa0cabe3cd4c50a8dfaa5b01c4a6cb000dc3a675198ddd354716
MD5 afa5e200fd13e5e362bf908b4b607ebc
BLAKE2b-256 bab5f3f7e3c48a2eebb24c26e6abfec0d5171608557207d33678cf7f433eb303

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page