Skip to main content

Deep Learning for Proteomics

Project description

DLOmix

Docs Build PyPI

DLOmix is a Python framework for Deep Learning in Proteomics. Initially built on top of TensorFlow/Keras, support for PyTorch can however be integrated once the main API is established.

Usage

Experiment a simple retention time prediction use-case using Google Colab    Colab

A version that includes experiment tracking with Weights and Biases is available here    Colab

Resources Repository

More learning resources can be found in the dlomix-resources repository.

Installation

Run the following to install:

$ pip install dlomix

If you would like to use Weights & Biases for experiment tracking and use the available reports for Retention Time under /notebooks, please install the optional wandb python dependency with dlomix by running:

$ pip install dlomix[wandb]

General Overview

  • data: structures for modeling the input data, processing functions, and feature extractions based on Hugging Face datasets Dataset and DatasetDict
  • eval: classes for evaluating models and reporting results
  • layers: custom layers used for building models, based on tf.keras.layers.Layer
  • losses: custom losses to be used for training with model.fit()
  • models: common model architectures for the relevant use-cases based on tf.keras.Model to allow for using the Keras training API
  • pipelines: an exemplary high-level pipeline implementation
  • reports: classes for generating reports related to the different tasks
  • constants.py: constants and configuration values

Use-cases

  • Retention Time Prediction:

    • a regression problem where the retention time of a peptide sequence is to be predicted.
  • Fragment Ion Intensity Prediction:

    • a multi-output regression problem where the intensity values for fragment ions are predicted given a peptide sequence along with some additional features.
  • Peptide Detectability:

    • a multi-class classification problem where the detectability of a peptide is predicted given the peptide sequence.

To-Do

Functionality:

  • integrate prosit
  • integrate hugging face datasets
  • extend data representation to include modifications
  • add PTM features
  • add residual plots to reporting, possibly other regression analysis tools
  • output reporting results as PDF
  • refactor reporting module to use W&B Report API (Retention Time)
  • add additional detectability task
  • extend pipeline for different types of models and backbones
  • extend pipeline to allow for fine-tuning with custom datasets

Package structure:

  • integrate deeplc.py into models.py, preferably introduce a package structure (e.g. models.retention_time)
  • add references for implemented models in the ReadMe
  • introduce formatting and precommit hooks
  • plan documentation (sphinx and readthedocs)
  • refactor following best practices for cleaner install

Developing DLOmix

To install dlomix, along with the tools needed to develop and run tests, run the following command in your virtualenv:

$ pip install -e .[dev]

References:

[Prosit]

[1] Gessulat, S., Schmidt, T., Zolg, D. P., Samaras, P., Schnatbaum, K., Zerweck, J., ... & Wilhelm, M. (2019). Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nature methods, 16(6), 509-518.

[DeepLC]

[2] DeepLC can predict retention times for peptides that carry as-yet unseen modifications Robbin Bouwmeester, Ralf Gabriels, Niels Hulstaert, Lennart Martens, Sven Degroeve bioRxiv 2020.03.28.013003; doi: 10.1101/2020.03.28.013003

[3] Bouwmeester, R., Gabriels, R., Hulstaert, N. et al. DeepLC can predict retention times for peptides that carry as-yet unseen modifications. Nat Methods 18, 1363–1369 (2021). https://doi.org/10.1038/s41592-021-01301-5

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dlomix-0.1.4.tar.gz (69.0 kB view details)

Uploaded Source

Built Distribution

dlomix-0.1.4-py3-none-any.whl (78.2 kB view details)

Uploaded Python 3

File details

Details for the file dlomix-0.1.4.tar.gz.

File metadata

  • Download URL: dlomix-0.1.4.tar.gz
  • Upload date:
  • Size: 69.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for dlomix-0.1.4.tar.gz
Algorithm Hash digest
SHA256 a397b3c6d07789adac542b830bc78b2ed2eedab11de8ec7c944586fa7930bb0b
MD5 6039c00a6438c4f8d314c3d170fdd885
BLAKE2b-256 609b728adeb304957c377501792740656f0414706ab2b5e25afbfe00c0dfe640

See more details on using hashes here.

File details

Details for the file dlomix-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: dlomix-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 78.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for dlomix-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 87f5723f8d8209e4f7922b64f08e3da739c3169dcdf44bc32d01abe73ea69a5c
MD5 b22f46ab3f465bb5e47fbc71979d3022
BLAKE2b-256 7cc1b741841e1efc8068cbe8b1ff1585e4c7857e26cca3493b91bc1d8019752b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page