Skip to main content

Deep Learning for Proteomics

Project description

DLOmix

Docs Build PyPI

DLOmix is a Python framework for Deep Learning in Proteomics. DLOmix provides multi-backend support for both TensorFlow/Keras and PyTorch, allowing researchers to choose their preferred deep learning framework while maintaining identical APIs and functionality. The dataset module is built upon HuggingFace datasets and can provide both TensorFlow and PyTorch tensors.

**Note:Multi-backend support was introduced in dlomix==0.2. Earlier versions supported TensorFlow/Keras only. **

The PyTorch implementation was largely introduced during a hackathon as part of the EuBIC Developer Meeting 2025. We appreciate the efforts and contributions of the team who joined the hackathon and the efforts of the EuBIC team and organizers.

**

Backend Selection

DLOmix automatically detects and uses the appropriate backend based on your environment setup. You can control which backend to use through the DLOMIX_BACKEND environment variable:

TensorFlow Backend (Default)

# Set TensorFlow as backend (default)
export DLOMIX_BACKEND=tensorflow
# or
export DLOMIX_BACKEND=tf

# Install DLOmix with TensorFlow
pip install dlomix[tensorflow]

# Or install tensorflow separately (existing installation), then only install dlomix
pip install dlomix

PyTorch Backend

# Set PyTorch as backend
export DLOMIX_BACKEND=pytorch
# or
export DLOMIX_BACKEND=torch
# or
export DLOMIX_BACKEND=pt

# Install DLOmix with PyTorch support
pip install dlomix[pytorch]

# Or install pytorch separately (existing installation), then only install dlomix
pip install dlomix

Note: The backend must be set before importing DLOmix. If no backend is specified, DLOmix defaults to TensorFlow with a user warning.

Usage

Experiment a simple retention time prediction use-case using Google Colab    Colab

A version that includes experiment tracking with Weights and Biases is available here    Colab

Resources Repository

More learning resources can be found in the dlomix-resources repository.

Installation

Quick Start

# Basic installation
# TensorFlow and PyTorch not installed, please install separately
pip install dlomix

# Install DLOmix and additionally install specific backend
pip install dlomix[tensorflow]  # TensorFlow backend
pip install dlomix[pytorch]     # PyTorch backend

General Package Overview

DLOmix provides a unified API across both TensorFlow and PyTorch backends:

  • data: structures for modeling input data, processing functions, and feature extractions based on Hugging Face datasets Dataset and DatasetDict (backend-agnostic)
  • eval: classes for evaluating models and reporting results (backend-specific implementations)
  • layers: custom layers for building models
    • TensorFlow: based on tf.keras.layers.Layer
    • PyTorch: based on torch.nn.Module
  • losses: custom loss functions
    • TensorFlow: compatible with model.fit()
    • PyTorch: compatible with standard PyTorch training loops
  • models: common model architectures for relevant use-cases
    • TensorFlow: based on tf.keras.Model
    • PyTorch: based on torch.nn.Module
  • pipelines: high-level pipeline implementations (backend-agnostic)
  • reports: classes for generating reports (backend-agnostic)
  • constants.py: constants and configuration values

Available Models by Backend

Model TensorFlow/Keras PyTorch
PrositRetentionTimePredictor [1]
PrositIntensityPredictor [1]
ChargeStatePredictor
DetectabilityModel [4]
DeepLCRetentionTimePredictor [2,3]
Ionmob [5]
PIMMS-CF [6] ⚠ (experimental)

Use-cases

  • Retention Time Prediction:

    • a regression problem where the retention time of a peptide sequence is to be predicted.
  • Fragment Ion Intensity Prediction:

    • a multi-output regression problem where the intensity values for fragment ions are predicted given a peptide sequence along with some additional features.
  • Peptide Detectability (Pfly) [4]:

    • a multi-class classification problem where the detectability of a peptide is predicted given the peptide sequence.

Developing DLOmix

To install dlomix, along with the tools needed to develop and run tests, run the following command in your virtualenv:

$ pip install -e .[dev]

References

[Prosit]

[1] Gessulat, S., Schmidt, T., Zolg, D. P., Samaras, P., Schnatbaum, K., Zerweck, J., ... & Wilhelm, M. (2019). Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nature methods, 16(6), 509-518.

[DeepLC]

[2] DeepLC can predict retention times for peptides that carry as-yet unseen modifications Robbin Bouwmeester, Ralf Gabriels, Niels Hulstaert, Lennart Martens, Sven Degroeve bioRxiv 2020.03.28.013003; doi: 10.1101/2020.03.28.013003

[3] Bouwmeester, R., Gabriels, R., Hulstaert, N. et al. DeepLC can predict retention times for peptides that carry as-yet unseen modifications. Nat Methods 18, 1363–1369 (2021). https://doi.org/10.1038/s41592-021-01301-5

[Detectability - Pfly]

[4] Abdul-Khalek, N., Picciani, M., Wimmer, R., Overgaard, M. T., Wilhelm, M., & Gregersen Echers, S. (2024). To fly, or not to fly, that is the question: A deep learning model for peptide detectability prediction in mass spectrometry. bioRxiv, 2024-10.

[IonMob]

[5] Teschner, D., Gomez-Zepeda, D., Declercq, A., Łącki, M. K., Avci, S., Bob, K., ... & Hildebrandt, A. (2023). Ionmob: a Python package for prediction of peptide collisional cross-section values. Bioinformatics, 39(9), btad486.

[PIMMS]

[6] Webel, H., Niu, L., Nielsen, A.B. et al. Imputation of label-free quantitative mass spectrometry-based proteomics data using self-supervised deep learning. Nat Commun 15, 5405 (2024). https://doi.org/10.1038/s41467-024-48711-5

Credit

PyTorch Implementation Hackathon during EuBIC Developer Meeting 2025

  • Ayla Schröder
  • Henry Webel
  • David Teschner
  • Stan Reinders

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dlomix-0.2.4.tar.gz (95.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dlomix-0.2.4-py3-none-any.whl (110.4 kB view details)

Uploaded Python 3

File details

Details for the file dlomix-0.2.4.tar.gz.

File metadata

  • Download URL: dlomix-0.2.4.tar.gz
  • Upload date:
  • Size: 95.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dlomix-0.2.4.tar.gz
Algorithm Hash digest
SHA256 37d2491a11b3be6eb7cdceb2acd67aa013f6cc6748ab7a1cd7d49e43af0a72e3
MD5 6636b6cade81c3e4b996c0f16908db2d
BLAKE2b-256 c39f013abe16aed299b2ed2b35058f3db5c236fcd069804c39bb649c445ceb15

See more details on using hashes here.

File details

Details for the file dlomix-0.2.4-py3-none-any.whl.

File metadata

  • Download URL: dlomix-0.2.4-py3-none-any.whl
  • Upload date:
  • Size: 110.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dlomix-0.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 813c6e30e73c1da5f195854c1af9a0d22bf157b5884630dae57070e6d72922b6
MD5 9bde21730f01e436c3ebe32874445a8e
BLAKE2b-256 68afdd6e0d6262449d0caecfa47dcde2017d0c385bc89ad1d457b0d238ec05ef

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page