gReLU

gReLU is a Python library to train, interpret, and apply deep learning models to DNA sequences. Code documentation is available here.

Installation

To install from source:

git clone https://github.com/Genentech/gReLU.git
cd gReLU
pip install .

To install using pip:

pip install gReLU

Typical installation time including all dependencies is under 10 minutes.

To train or use transformer models containing flash attention layers, flash-attn needs to be installed first:

conda install -c conda-forge cudatoolkit-dev -y
pip install torch ninja
pip install flash-attn --no-build-isolation
pip install gReLU
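
After running the commands above, a quick check that the packages can be located may save a failed training run later. The sketch below assumes the import names `grelu` (as in `grelu.io.genome.read_gtf`) and `flash_attn` (the module that the flash-attn package is expected to provide); the helper `is_importable` is hypothetical and not part of gReLU. It only confirms that each module can be found, not that flash-attn was compiled correctly for your GPU.

```python
from importlib.util import find_spec


def is_importable(module_name: str) -> bool:
    """Return True if a top-level module can be located, without importing it."""
    return find_spec(module_name) is not None


# Assumed import names: "grelu" for gReLU and "flash_attn" for flash-attn.
for name in ("torch", "flash_attn", "grelu"):
    status = "found" if is_importable(name) else "MISSING"
    print(f"{name}: {status}")
```

If `flash_attn` is reported as missing, gReLU itself may still work for models that do not contain flash attention layers.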

Contributing

See our contribution guide.

Additional requirements

If you want to use genome annotation features through the function grelu.io.genome.read_gtf, you will need to install the following UCSC utilities: genePredToBed, genePredToGtf, bedToGenePred, gtfToGenePred, gff3ToGenePred.

If you want to create bigWig files through the function grelu.data.preprocess.make_insertion_bigwig, you will need to install the UCSC utility bedGraphToBigWig.

UCSC utilities can be installed from http://hgdownload.cse.ucsc.edu/admin/exe/, for example using the following commands:

rsync -aP rsync://hgdownload.soe.ucsc.edu/genome/admin/exe/linux.x86_64/bedGraphToBigWig /usr/bin/
rsync -aP rsync://hgdownload.soe.ucsc.edu/genome/admin/exe/linux.x86_64/genePredToBed /usr/bin/
rsync -aP rsync://hgdownload.soe.ucsc.edu/genome/admin/exe/linux.x86_64/genePredToGtf /usr/bin/
rsync -aP rsync://hgdownload.soe.ucsc.edu/genome/admin/exe/linux.x86_64/bedToGenePred /usr/bin/
rsync -aP rsync://hgdownload.soe.ucsc.edu/genome/admin/exe/linux.x86_64/gtfToGenePred /usr/bin/
rsync -aP rsync://hgdownload.soe.ucsc.edu/genome/admin/exe/linux.x86_64/gff3ToGenePred /usr/bin/

or via bioconda:

conda install -y \
bioconda::ucsc-bedgraphtobigwig \
bioconda::ucsc-genepredtobed    \
bioconda::ucsc-genepredtogtf    \
bioconda::ucsc-bedtogenepred    \
bioconda::ucsc-gtftogenepred    \
bioconda::ucsc-gff3togenepred

If you want to create ATAC-seq coverage bigWig files using grelu.data.preprocess.make_insertion_bigwig, you will need to install bedtools. See https://bedtools.readthedocs.io/en/latest/content/installation.html for instructions.
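
Since these external tools are invoked as command-line programs, it can help to confirm they are on your PATH before calling the gReLU functions that depend on them. The following is a minimal sketch, assuming the executables keep the names listed above; `missing_tools` is a hypothetical helper, not a gReLU function.

```python
import shutil

# Executable names as listed in this section.
UCSC_TOOLS = [
    "bedGraphToBigWig",
    "genePredToBed",
    "genePredToGtf",
    "bedToGenePred",
    "gtfToGenePred",
    "gff3ToGenePred",
]
OTHER_TOOLS = ["bedtools"]


def missing_tools(tools):
    """Return the names in `tools` that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


missing = missing_tools(UCSC_TOOLS + OTHER_TOOLS)
if missing:
    print("Not found on PATH:", ", ".join(missing))
else:
    print("All external tools found.")
```

You only need the tools for the features you plan to use, so a missing entry is not necessarily a problem.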

Citation

Please cite our preprint: https://www.biorxiv.org/content/10.1101/2024.09.18.613778v1

Download files

Download the file for your platform.

Source Distribution

grelu-1.0.10.tar.gz (24.5 MB)

Built Distribution

grelu-1.0.10-py3-none-any.whl (1.4 MB)

File details

Details for the file grelu-1.0.10.tar.gz.

File metadata

  • Download URL: grelu-1.0.10.tar.gz
  • Upload date:
  • Size: 24.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.23

File hashes

Hashes for grelu-1.0.10.tar.gz
Algorithm Hash digest
SHA256 61828b0b58fe847fe5a109a365b80b78ae7aa9e65c1362175689038421a22c84
MD5 c479c964d012d3e16406a2fcb192d56c
BLAKE2b-256 6c1e66506f2a8cff640a4634dedea240a6dd9154dd0a599c93d6b91f0456a598
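
A downloaded file can be checked against the SHA256 digest listed above. This is a minimal sketch using only the Python standard library; the local path `grelu-1.0.10.tar.gz` in the current directory is an assumption, and `sha256_of` is a hypothetical helper.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for grelu-1.0.10.tar.gz.
EXPECTED_SHA256 = "61828b0b58fe847fe5a109a365b80b78ae7aa9e65c1362175689038421a22c84"


def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


archive = Path("grelu-1.0.10.tar.gz")  # assumed download location
if archive.exists():
    matches = sha256_of(archive) == EXPECTED_SHA256
    print("SHA256 match" if matches else "SHA256 MISMATCH")
else:
    print(f"{archive} not found; download it first.")
```

The same function works for any other file by swapping in that file's path and listed digest.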

File details

Details for the file grelu-1.0.10-py3-none-any.whl.

File metadata

  • Download URL: grelu-1.0.10-py3-none-any.whl
  • Upload date:
  • Size: 1.4 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.23

File hashes

Hashes for grelu-1.0.10-py3-none-any.whl
Algorithm Hash digest
SHA256 549a27d23901d84307c379af5613fb18808be705c7ca5295c5bcffed5674113d
MD5 5bb9709704892661b7be5aff77518c42
BLAKE2b-256 394022dcacd1bc0a1078afb64c7959f9d0351b86162c1f2cfb3dfdc1afb1ef6e
