Skip to main content

Deep Learning for Proteomics

Project description

DLOmix

Docs Build PyPI

DLOmix is a Python framework for Deep Learning in Proteomics. Initially built on top of TensorFlow/Keras, support for PyTorch can however be integrated once the main API is established.

Usage

Experiment a simple retention time prediction use-case using Google Colab    Colab

A version that includes experiment tracking with Weights and Biases is available here    Colab

Resources Repository

More learning resources can be found in the dlomix-resources repository.

Installation

Run the following to install:

$ pip install dlomix

General Overview

  • data: structures for modeling the input data, currently based on tf.Dataset
  • eval: classes for evaluating models and reporting results
  • layers: custom layers used for building models, based on tf.keras.layers.Layer
  • losses: custom losses to be used for training with model.fit()
  • models: common model architectures for the relevant use-cases based on tf.keras.Model to allow for using the Keras training API
  • pipelines: an exemplary high-level pipeline implementation
  • reports: classes for generating reports related to the different tasks
  • constants.py: constants and configuration values
  • utils.py: utility functions

Use-cases

  • Retention Time Prediction:
    • a regression problem where the retention time of a peptide sequence is to be predicted.

To-Do

Functionality:

  • integrate prosit
  • extend pipeline for different types of models and backbones
  • extend pipeline to allow for fine-tuning with custom datasets
  • add residual plots to reporting, possibly other regression analysis tools
  • output reporting results as PDF
  • extend data representation to include modifications
  • refactor reporting module to use W&B Report API

Package structure:

  • integrate deeplc.py into models.py, preferably introduce a package structure (e.g. models.retention_time)
  • add references for implemented models in the ReadMe
  • introduce a style guide and checking (e.g. PEP)
  • plan documentation (sphinx and readthedocs)

Developing DLOmix

To install dlomix, along with the tools needed to develop and run tests, run the following command in your virtualenv:

$ pip install -e .[dev]

References:

[Prosit]

[1] Gessulat, S., Schmidt, T., Zolg, D. P., Samaras, P., Schnatbaum, K., Zerweck, J., ... & Wilhelm, M. (2019). Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nature methods, 16(6), 509-518.

[DeepLC]

[2] DeepLC can predict retention times for peptides that carry as-yet unseen modifications Robbin Bouwmeester, Ralf Gabriels, Niels Hulstaert, Lennart Martens, Sven Degroeve bioRxiv 2020.03.28.013003; doi: 10.1101/2020.03.28.013003

[3] Bouwmeester, R., Gabriels, R., Hulstaert, N. et al. DeepLC can predict retention times for peptides that carry as-yet unseen modifications. Nat Methods 18, 1363–1369 (2021). https://doi.org/10.1038/s41592-021-01301-5

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dlomix-0.0.6.tar.gz (31.7 kB view details)

Uploaded Source

Built Distribution

dlomix-0.0.6-py3-none-any.whl (45.1 kB view details)

Uploaded Python 3

File details

Details for the file dlomix-0.0.6.tar.gz.

File metadata

  • Download URL: dlomix-0.0.6.tar.gz
  • Upload date:
  • Size: 31.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.18

File hashes

Hashes for dlomix-0.0.6.tar.gz
Algorithm Hash digest
SHA256 ab8e45861e79c37eaf1b3fc99904e5d97cc24f2e168ab87cea1398c880118696
MD5 493e9880d7cbf2d7db3188ce91271690
BLAKE2b-256 6298d7b4c573f3ed52ab2075fbe21cbefeb45f63a04213d88aacac7eccf19c67

See more details on using hashes here.

File details

Details for the file dlomix-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: dlomix-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 45.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.18

File hashes

Hashes for dlomix-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 5e1839dc2462d1d6dcbfa580a0b89e65cff85cbd026d646cff2df91b6cb708bf
MD5 e8228338ec98254473bbab4df17987d7
BLAKE2b-256 ced195c268466040473a6a12b51c9b1287fd02aabed8501e1821f98536a2328e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page