Skip to main content

CREsted: Cis-Regulatory Element Sequence Training, Explanation, and Design

Project description

CREsted: Cis Regulatory Element Sequence Training, Explanation, and Design

Tests Documentation

Introduction

CREsted (Cis-Regulatory Element Sequence Training, Explanation, and Design) is an easy-to-use deep learning package for training enhancer models on single-cell ATAC sequencing (scATAC-seq) data. CREsted provides comprehensive analyses and tutorials to study enhancer codes and the ability to design synthetic enhancer sequences at a cell type-specific, nucleotide-level resolution. Integrated into the scverse framework, CREsted is compatible with outcomes from established scATAC-seq processing tools. It employs novel scATAC-seq preprocessing techniques, such as peak height normalization across cell types, offers flexibility and variety in deep learning modeling architectures and tasks, and contains thorough analysis of the cell type-specific enhancer codes captured during modeling that can also be used for the design of synthetic sequences.

Getting started

Please refer to the documentation. In particular, the

Installation

You need to have Python 3.9 or newer installed on your system and a deep learning backend to be able to use CREsted.

CREsted is build on top of keras 3 and can therefore be used with your deep learning backend of choice (Tensorflow or Pytorch).

  1. Install either Tensorflow or Pytorch for GPU. Refer to the installation instructions on those pages to ensure you have the correct version of CUDA and cuDNN installed. If you don't have a preference and don't know which backend to choose, refer to choosing your backend. If you have all the latest drivers installed, this installation boils down to doing:
pip install tensorflow[and-cuda]
# or
pip install torch
  1. Install the latest release of crested from PyPI
pip install crested
  1. If you plan on doing motif analysis using the tf-modisco (lite) and tangermeme's tomtom functionality inside CREsted, you will need to run the following additional install:
pip install crested[tfmodisco]

This requires a cmake installation on your system. If you don't have it, you can install it with:

pip install cmake

Release notes

See the changelog.

Contact

For questions and help requests, please use the issue tracker.

Citation

Kempynck, N., Mahieu, L., Ekşi, E. C., Konstantakos, V., Blaauw, C., De Winter, S., Hulselmans, G., Taskiran, I., & Aerts, S. (2024). CREsted: Cis Regulatory Element Sequence Training, Explanation, and Design (1.0.0). Zenodo. https://doi.org/10.5281/zenodo.13320756

Choosing your backend

CREsted is build on top of keras 3.0 and can therefore be used with your deep learning backend of choice (Tensorflow or Pytorch). If you don't have a preference, you can take the following into account:

  1. From our (and Keras' official) benchmarking, Tensorflow is generally faster than pytorch for training (up to 2x) since Tensorflow operates in graph mode whereas Pytorch uses eager mode. If you plan on training many models, Tensorflow might be the better choice.
  2. Pytorch is easier to debug and get going. Tensorflow will easily throw a bunch of warnings or fail to detect CUDA if you don't have the exact right versions of CUDA and cuDNN installed. Pytorch seems more lenient in this regard. If you only plan on running predictions or training a few models, Pytorch might be the easier choice.
  3. Current Keras 3.0 is still in active development and some features (mainly multi GPU training and weights and biases logging) are currently only supported with the Tensorflow backend. If you plan on using these features, you should choose Tensorflow. We will implement these features in a backend agnostic way as soon as Keras 3.0 has done so (it is on their roadmap).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

crested-1.1.0.tar.gz (12.9 MB view details)

Uploaded Source

Built Distribution

crested-1.1.0-py3-none-any.whl (117.3 kB view details)

Uploaded Python 3

File details

Details for the file crested-1.1.0.tar.gz.

File metadata

  • Download URL: crested-1.1.0.tar.gz
  • Upload date:
  • Size: 12.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for crested-1.1.0.tar.gz
Algorithm Hash digest
SHA256 17ad82d8a6c91dd3886ecae63199fd9322dd96a4b9eb053044ee9888c6ccd6b7
MD5 f2ce3cca361e8124bc58e25980d2a4e5
BLAKE2b-256 4b9799ae489ea25f1174034b3eace50a7b6c4f8c75a8623ec8d918553ae8146e

See more details on using hashes here.

File details

Details for the file crested-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: crested-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 117.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for crested-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 663cabc6f068e89278c461c2a08039a8537984c3a388530da098c462fe33ddf8
MD5 87592518ce659bc9c4a3a2f724b0d0c8
BLAKE2b-256 8e3eabdb6aa4674140e4447c926a40fe6a996b5d5576b23ca511357a0b54473a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page