Skip to main content

Deep Learning for Proteomics

Project description

DLOmix

DLOmix is a python framework for Deep Learning in Proteomics. Initially built ontop of TensorFlow/Keras, support for PyTorch can however be integrated once the main API is established.

Usage

Experiment a simple retention time prediction use-case using Google Colab    Colab

Installation

Run the following to install:

$ pip install dlomix

Package Structure:

  • data.py: structures for modelling the input data, currently based on tf.Dataset
  • models.py: common model architectures for the relevant use-cases based on tf.keras.Model to allow for using the Keras training API
  • pipeline.py: an exemplary high-level pipeline implementation
  • eval.py: classes for evaluating models and reporting results
  • eval_utils.py: custom evaluation metrics implemented in TensorFlow/Keras
  • constants.py: constants and configuration values needs for the pipeline class.

Use-cases:

  • Retention Time Prediction (indexed retention time):
    • a regression problem where the the retention time of a peptide sequence is to be predicted.

To-Do:

Functionality:

  • integrate prosit
  • extend pipeline for different types of models and backbones
  • extend pipeline to allow for fine-tuning with custom datasets
  • add residual plots to reporting, possibly other regression analysis tools
  • output reporting results as PDF
  • extend data representation to include modifications

Package structure:

  • integrate deeplc.py into models.py, preferably introduce a package structure (e.g. models.retention_time)
  • add references for implemented models in the ReadMe
  • introduce a style guide and checking (e.g. PEP)
  • plan documentation (sphinx and readthedocs)

Developing DLOmix

To install dlomix, along with the the tools needed to develop and run tests, run the following command in your virtualenv:

$ pip install -e .[dev]

References:

[Prosit] Gessulat, S., Schmidt, T., Zolg, D. P., Samaras, P., Schnatbaum, K., Zerweck, J., ... & Wilhelm, M. (2019). Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nature methods, 16(6), 509-518.

[DeepLC] DeepLC can predict retention times for peptides that carry as-yet unseen modifications Robbin Bouwmeester, Ralf Gabriels, Niels Hulstaert, Lennart Martens, Sven Degroeve bioRxiv 2020.03.28.013003; doi: 10.1101/2020.03.28.013003

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dlomix-0.0.1.tar.gz (1.5 MB view details)

Uploaded Source

Built Distribution

dlomix-0.0.1-py3-none-any.whl (22.5 kB view details)

Uploaded Python 3

File details

Details for the file dlomix-0.0.1.tar.gz.

File metadata

  • Download URL: dlomix-0.0.1.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 pkginfo/1.7.1 requests/2.24.0 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.62.1 CPython/3.7.7

File hashes

Hashes for dlomix-0.0.1.tar.gz
Algorithm Hash digest
SHA256 4969f52337bb3f048931f9912648ad5ee696c80a6c25736658f68d9e8b6b46c2
MD5 cf8fdd50f134aa5fadc080619b5546a8
BLAKE2b-256 1199691037fbfb7c3d7dbfbb5be62e8483d4bf73b0e951c2fb084f07c2b3732b

See more details on using hashes here.

File details

Details for the file dlomix-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: dlomix-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 22.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 pkginfo/1.7.1 requests/2.24.0 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.62.1 CPython/3.7.7

File hashes

Hashes for dlomix-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 156707c72ed4f9dc0044255348b182c848d0f384560373cb5c2a45f0c4bff58e
MD5 26c3386e7d22f1ef323c0fc03162a8a4
BLAKE2b-256 52d83f163d37935d2c5b8041482204ec234007b55a62edded1e4b6a2f80822b8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page