Deep Learning for Proteomics
Project description
DLOmix
DLOmix is a python framework for Deep Learning in Proteomics. Initially built ontop of TensorFlow/Keras, support for PyTorch can however be integrated once the main API is established.
Usage
Experiment a simple retention time prediction use-case using Google Colab
Resources Repository:
More learning resources can be found in the dlomix-resources repository.
Installation
Run the following to install:
$ pip install dlomix
General Overview:
data.py
: structures for modelling the input data, currently based ontf.Dataset
models.py
: common model architectures for the relevant use-cases based ontf.keras.Model
to allow for using the Keras training APIpipeline.py
: an exemplary high-level pipeline implementationeval.py
: classes for evaluating models and reporting resultseval_utils.py
: custom evaluation metrics implemented in TensorFlow/Kerasconstants.py
: constants and configuration values needs for thepipeline
class.
Use-cases:
- Retention Time Prediction:
- a regression problem where the the retention time of a peptide sequence is to be predicted.
To-Do:
Functionality:
- integrate prosit
- extend pipeline for different types of models and backbones
- extend pipeline to allow for fine-tuning with custom datasets
- add residual plots to reporting, possibly other regression analysis tools
- output reporting results as PDF
- extend data representation to include modifications
Package structure:
- integrate
deeplc.py
intomodels.py
, preferably introduce a package structure (e.g.models.retention_time
) - add references for implemented models in the ReadMe
- introduce a style guide and checking (e.g. PEP)
- plan documentation (sphinx and readthedocs)
Developing DLOmix
To install dlomix, along with the the tools needed to develop and run tests, run the following command in your virtualenv:
$ pip install -e .[dev]
References:
[Prosit]
[1] Gessulat, S., Schmidt, T., Zolg, D. P., Samaras, P., Schnatbaum, K., Zerweck, J., ... & Wilhelm, M. (2019). Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nature methods, 16(6), 509-518.
[DeepLC]
[2] DeepLC can predict retention times for peptides that carry as-yet unseen modifications Robbin Bouwmeester, Ralf Gabriels, Niels Hulstaert, Lennart Martens, Sven Degroeve bioRxiv 2020.03.28.013003; doi: 10.1101/2020.03.28.013003
[3] Bouwmeester, R., Gabriels, R., Hulstaert, N. et al. DeepLC can predict retention times for peptides that carry as-yet unseen modifications. Nat Methods 18, 1363–1369 (2021). https://doi.org/10.1038/s41592-021-01301-5
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dlomix-0.0.2.dev1.tar.gz
.
File metadata
- Download URL: dlomix-0.0.2.dev1.tar.gz
- Upload date:
- Size: 20.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b7a618b5eda8a976ff65da5c226ae7bb56ea9b87f1b3d38345c4d31e0a7b22e0 |
|
MD5 | 9becd788eb4e77a4b3d0f814bd0b9883 |
|
BLAKE2b-256 | 4329745b863a7c841e4fff1bc9e838ad776d1bf83dba1a3a5dfb576997192151 |
File details
Details for the file dlomix-0.0.2.dev1-py3-none-any.whl
.
File metadata
- Download URL: dlomix-0.0.2.dev1-py3-none-any.whl
- Upload date:
- Size: 28.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 657ec5860332947f384d3ebddd3adae86c5a2a0605ed32bbac73d3bc793254d4 |
|
MD5 | 398cf315222606d911f36bf4a6a9c944 |
|
BLAKE2b-256 | 39394a7fdde8b951ef9321550dd366b406dbfa58f0858ac50754482ba50a26ed |