Skip to main content

Deep Learning for Proteomics

Project description

DLOmix

Docs Build

DLOmix is a python framework for Deep Learning in Proteomics. Initially built ontop of TensorFlow/Keras, support for PyTorch can however be integrated once the main API is established.

Usage

Experiment a simple retention time prediction use-case using Google Colab    Colab

Resources Repository:

More learning resources can be found in the dlomix-resources repository.

Installation

Run the following to install:

$ pip install dlomix

General Overview:

  • data.py: structures for modelling the input data, currently based on tf.Dataset
  • models.py: common model architectures for the relevant use-cases based on tf.keras.Model to allow for using the Keras training API
  • pipeline.py: an exemplary high-level pipeline implementation
  • eval.py: classes for evaluating models and reporting results
  • eval_utils.py: custom evaluation metrics implemented in TensorFlow/Keras
  • constants.py: constants and configuration values needs for the pipeline class.

Use-cases:

  • Retention Time Prediction:
    • a regression problem where the the retention time of a peptide sequence is to be predicted.

To-Do:

Functionality:

  • integrate prosit
  • extend pipeline for different types of models and backbones
  • extend pipeline to allow for fine-tuning with custom datasets
  • add residual plots to reporting, possibly other regression analysis tools
  • output reporting results as PDF
  • extend data representation to include modifications

Package structure:

  • integrate deeplc.py into models.py, preferably introduce a package structure (e.g. models.retention_time)
  • add references for implemented models in the ReadMe
  • introduce a style guide and checking (e.g. PEP)
  • plan documentation (sphinx and readthedocs)

Developing DLOmix

To install dlomix, along with the the tools needed to develop and run tests, run the following command in your virtualenv:

$ pip install -e .[dev]

References:

[Prosit]

[1] Gessulat, S., Schmidt, T., Zolg, D. P., Samaras, P., Schnatbaum, K., Zerweck, J., ... & Wilhelm, M. (2019). Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nature methods, 16(6), 509-518.

[DeepLC]

[2] DeepLC can predict retention times for peptides that carry as-yet unseen modifications Robbin Bouwmeester, Ralf Gabriels, Niels Hulstaert, Lennart Martens, Sven Degroeve bioRxiv 2020.03.28.013003; doi: 10.1101/2020.03.28.013003

[3] Bouwmeester, R., Gabriels, R., Hulstaert, N. et al. DeepLC can predict retention times for peptides that carry as-yet unseen modifications. Nat Methods 18, 1363–1369 (2021). https://doi.org/10.1038/s41592-021-01301-5

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dlomix-0.0.2.dev1.tar.gz (20.5 kB view details)

Uploaded Source

Built Distribution

dlomix-0.0.2.dev1-py3-none-any.whl (28.4 kB view details)

Uploaded Python 3

File details

Details for the file dlomix-0.0.2.dev1.tar.gz.

File metadata

  • Download URL: dlomix-0.0.2.dev1.tar.gz
  • Upload date:
  • Size: 20.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for dlomix-0.0.2.dev1.tar.gz
Algorithm Hash digest
SHA256 b7a618b5eda8a976ff65da5c226ae7bb56ea9b87f1b3d38345c4d31e0a7b22e0
MD5 9becd788eb4e77a4b3d0f814bd0b9883
BLAKE2b-256 4329745b863a7c841e4fff1bc9e838ad776d1bf83dba1a3a5dfb576997192151

See more details on using hashes here.

File details

Details for the file dlomix-0.0.2.dev1-py3-none-any.whl.

File metadata

  • Download URL: dlomix-0.0.2.dev1-py3-none-any.whl
  • Upload date:
  • Size: 28.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for dlomix-0.0.2.dev1-py3-none-any.whl
Algorithm Hash digest
SHA256 657ec5860332947f384d3ebddd3adae86c5a2a0605ed32bbac73d3bc793254d4
MD5 398cf315222606d911f36bf4a6a9c944
BLAKE2b-256 39394a7fdde8b951ef9321550dd366b406dbfa58f0858ac50754482ba50a26ed

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page