Skip to main content

BENchmarking Transformer-Obtained Single-Cell embeddings

Project description

bento-sc

BENchmarking Transformer-Obtained Single-Cell representations.

PyPi Version GitHub license Documentation

Single-cell language modeling

This package contains routines and definitions for pre-training single-cell (transcriptomic) language models.

Package features:

  • Memory-efficient scRNA-seq dataloading from h5torch-compatible HDF5 files.
  • yaml-configurable language model training scripts.
  • Modular and extendable data preprocessing pipelines.
  • A diverse set of downstream tasks to evaluate scLM performance.
  • Full reproducibility instructions of our study results via bento-sc-reproducibility.

Install

bento-sc is distributed on PyPI.

pip install bento-sc

Note: The package has been tested with torch==2.2.2 and pytorch-lightning==2.2.5. If you encounter errors with bento-sc using more recent version of these two packages, consider downgrading.

You may need to install PyTorch before running this command in order to ensure the right CUDA kernels for your system are installed.

Package usage and structure

Please refer to our documentation page.

Academic reproducibility

All config files and scripts that were used to pre-train models and fine-tune them towards downstream tasks are included in a separate GitHub repository: bento-sc-reproducibility.

In addition, all scripts to reproduce the "baselines" in our study are located in the bento-sc-reproducibility repository.

Citation

:eyes: :eyes: :eyes:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bento_sc-0.0.6.tar.gz (10.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bento_sc-0.0.6-py3-none-any.whl (10.8 MB view details)

Uploaded Python 3

File details

Details for the file bento_sc-0.0.6.tar.gz.

File metadata

  • Download URL: bento_sc-0.0.6.tar.gz
  • Upload date:
  • Size: 10.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for bento_sc-0.0.6.tar.gz
Algorithm Hash digest
SHA256 1c5125a883f3b4d5c0b00c25855315f96a8c1f0c83357bd79516ecef5241446a
MD5 1cac068b03d70e0e7180f4ad38637457
BLAKE2b-256 dddb24fd04762571637832b9894acf493242ed0aaf1f75015da67e92523afba0

See more details on using hashes here.

File details

Details for the file bento_sc-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: bento_sc-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 10.8 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for bento_sc-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 915469bcd4e83fe8d8f75254d10553e85159a8a1f429c8050dd0b7b8ad581f77
MD5 feb3c097d6da91a84bacb55af21a28bd
BLAKE2b-256 3e2d8ba9a527a6e02eb671872233ab31ce38027f48a621ca85eddc1a90cfb651

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page