Deep Learning for Proteomics
DLOmix
DLOmix is a Python framework for deep learning in proteomics. It is currently built on top of TensorFlow/Keras; PyTorch support may be integrated once the main API is established.
Usage
Try out a simple retention time prediction use-case in Google Colab.
A version of the same example that includes experiment tracking with Weights and Biases is also available.
Resources Repository
More learning resources can be found in the dlomix-resources repository.
Installation
Run the following to install:
$ pip install dlomix
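To confirm that the package landed in the active environment, the installed version can be queried from Python. This is a minimal sketch using only the standard library; the helper name installed_version is made up for illustration and is not part of DLOmix.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of a package, or None if it is absent.

    Hypothetical helper for illustration; not part of the DLOmix API.
    """
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Prints the version string if dlomix is installed, otherwise None.
print(installed_version("dlomix"))
```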
General Overview
- data: structures for modeling the input data, currently based on tf.data.Dataset
- eval: classes for evaluating models and reporting results
- layers: custom layers used for building models, based on tf.keras.layers.Layer
- losses: custom losses to be used for training with model.fit()
- models: common model architectures for the relevant use-cases, based on tf.keras.Model to allow for using the Keras training API
- pipelines: an exemplary high-level pipeline implementation
- reports: classes for generating reports related to the different tasks
- constants.py: constants and configuration values
- utils.py: utility functions
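To make the evaluation and reporting entries more concrete for a regression task, the sketch below computes two common regression metrics in plain Python. It is an illustration only: the function names and the sample values are assumptions, and it does not reproduce the classes that DLOmix ships in eval or reports.

```python
def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between measured and predicted values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - (residual sum of squares / total sum of squares)."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Made-up retention times (arbitrary units), purely for illustration.
measured = [10.0, 20.0, 30.0, 40.0]
predicted = [12.0, 18.0, 33.0, 39.0]

mae = mean_absolute_error(measured, predicted)
r2 = r_squared(measured, predicted)
print(f"MAE: {mae:.3f}, R^2: {r2:.3f}")
```

The per-sample residuals (measured minus predicted) used inside r_squared are the same quantities a residual plot would show.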
Use-cases
- Retention Time Prediction:
  - a regression problem where the retention time of a peptide sequence is to be predicted.
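Before a peptide sequence can be fed to a regression model, it has to be turned into numbers. The sketch below shows one common way to do that: map each amino acid to an integer index and pad to a fixed length. The alphabet ordering, the padding index, the default maximum length, and the function name are all assumptions made for illustration; DLOmix's own data classes may encode sequences differently.

```python
# Hypothetical encoding scheme for illustration; not taken from the DLOmix code base.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
PAD_INDEX = 0  # index 0 is reserved for padding
AA_TO_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS, start=1)}


def encode_sequence(sequence, max_length=30):
    """Map a peptide string to a fixed-length list of integer indices.

    Raises ValueError if the sequence is longer than max_length or
    contains a character outside the assumed alphabet.
    """
    if len(sequence) > max_length:
        raise ValueError(f"sequence longer than max_length={max_length}")
    try:
        indices = [AA_TO_INDEX[aa] for aa in sequence]
    except KeyError as err:
        raise ValueError(f"unknown residue: {err.args[0]!r}") from None
    # Right-pad with PAD_INDEX so every encoded sequence has the same length.
    return indices + [PAD_INDEX] * (max_length - len(indices))


encoded = encode_sequence("PEPTIDE", max_length=10)
print(encoded)
```

Each encoded sequence would then be paired with its measured retention time as the regression target. This simple alphabet has no representation for modified residues.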
To-Do
Functionality:
- integrate prosit
- extend pipeline for different types of models and backbones
- extend pipeline to allow for fine-tuning with custom datasets
- add residual plots to reporting, possibly other regression analysis tools
- output reporting results as PDF
- extend data representation to include modifications
- refactor reporting module to use W&B Report API
Package structure:
- integrate deeplc.py into models.py, preferably introduce a package structure (e.g. models.retention_time)
- add references for implemented models in the ReadMe
- introduce a style guide and checking (e.g. PEP)
- plan documentation (sphinx and readthedocs)
Developing DLOmix
To install dlomix, along with the tools needed to develop and run tests, run the following command in your virtualenv:
$ pip install -e .[dev]
References:
[Prosit]
[1] Gessulat, S., Schmidt, T., Zolg, D. P., Samaras, P., Schnatbaum, K., Zerweck, J., ... & Wilhelm, M. (2019). Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nature methods, 16(6), 509-518.
[DeepLC]
[2] Bouwmeester, R., Gabriels, R., Hulstaert, N., Martens, L., & Degroeve, S. (2020). DeepLC can predict retention times for peptides that carry as-yet unseen modifications. bioRxiv 2020.03.28.013003. doi: 10.1101/2020.03.28.013003
[3] Bouwmeester, R., Gabriels, R., Hulstaert, N. et al. DeepLC can predict retention times for peptides that carry as-yet unseen modifications. Nat Methods 18, 1363–1369 (2021). https://doi.org/10.1038/s41592-021-01301-5