Skip to main content

Transparent and reproducible proteomics analysis pipeline

Project description

ProteoFlux

DOI

ProteoFlux is an open-source Python framework for transparent, reproducible downstream analysis of quantitative proteomics data.

It operates on quantitative outputs generated by upstream search and quantification tools and provides a unified workflow for:

  • Protein-centric proteomics
  • Peptide-centric analyses
  • Phosphoproteomics (with optional protein-level covariate adjustment)

ProteoFlux emphasizes:

  • Explicit data semantics
  • Transparent preprocessing
  • Deterministic statistical modeling
  • Fully reproducible outputs

Overview

ProteoFlux overview

ProteoFlux takes a single YAML configuration file and produces:

  • Harmonized and preprocessed quantification data
  • Differential expression results using a limma-based empirical Bayes framework
  • Principal component analysis (PCA), multidimensional scaling (MDS), and hierarchical clustering
  • A structured multi-page PDF report
  • Portable .h5ad files compatible with ProteoViewer
  • Summary tables in Excel

Installation

Python ≥ 3.9 required.

Install from source (recommended)

git clone https://github.com/YOUR-ORG/proteoflux.git
cd proteoflux
pip install -e .

This installs ProteoFlux in editable mode.


Quickstart (CLI)

List available configuration templates:

proteoflux templates

Create a template:

proteoflux init spectronaut-proteomics --path config.yaml

Edit the file and then run the full pipeline:

proteoflux run --config config.yaml

Output Files

File Description
*.h5ad Full AnnData object with raw, normalized, imputed data, statistics, embeddings, and metadata
*.xlsx Differential expression summary table
*.pdf Multi-page QC and analysis report

The .h5ad files are compatible with ProteoViewer, an interactive visualization tool for ProteoFlux outputs.


Using ProteoFlux as a Python Library

ProteoFlux can also be used programmatically:

import yaml
from proteoflux.main import run_pipeline

config = yaml.safe_load(open("config.yaml")) #or use a dict
run_pipeline(config)

This allows:

  • Integration into larger workflows
  • Batch processing
  • Use inside Jupyter notebooks
  • Custom downstream analysis of AnnData objects

All behavior is controlled by the same YAML schema used by the CLI.


Configuration

All parameters are defined in a single YAML configuration file. We recommend starting with a template:

proteoflux templates
proteoflux init TEMPLATE_NAME

Full parameter reference:

See the configuration guide docs/CONFIGURATION.md.


Examples

Runnable, reduced example datasets and matching configs are provided under examples/:

  • examples/searle_small/: small DIA proteomics example (Spectronaut export subset)
  • examples/phospho_small/: phosphoproteomics example with an injected flow-through covariate run (Spectronaut export subsets)

Each example folder contains a README.md with a minimal command to run the pipeline (typically proteoflux run --config <config>.yaml) and the required input files (data + annotation, if applicable).


Paper

The JOSS manuscript source is provided as paper.md.


License

ProteoViewer is released under the MIT License. See the LICENSE file for details.


Citation

If you use ProteoFlux in your work, please cite:

ProteoFlux (latest version).
Zenodo. https://doi.org/10.5281/zenodo.18640998

A peer-reviewed publication is currently under submission.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

proteoflux-1.8.8.tar.gz (105.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

proteoflux-1.8.8-py3-none-any.whl (123.2 kB view details)

Uploaded Python 3

File details

Details for the file proteoflux-1.8.8.tar.gz.

File metadata

  • Download URL: proteoflux-1.8.8.tar.gz
  • Upload date:
  • Size: 105.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for proteoflux-1.8.8.tar.gz
Algorithm Hash digest
SHA256 d37552634a39a4f48d021821f85a5ec12dafd917dbbd0c375221b2d5fd5a0b88
MD5 e0e9b7fd8505772e58956e59abb05ecf
BLAKE2b-256 35a352661c3b1d982eb187a0a73a77810969797992da1e34e93f48a6633d86f4

See more details on using hashes here.

Provenance

The following attestation bundles were made for proteoflux-1.8.8.tar.gz:

Publisher: release.yml on Afanc/proteoflux

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file proteoflux-1.8.8-py3-none-any.whl.

File metadata

  • Download URL: proteoflux-1.8.8-py3-none-any.whl
  • Upload date:
  • Size: 123.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for proteoflux-1.8.8-py3-none-any.whl
Algorithm Hash digest
SHA256 8af9bdb2d5c1dcff7e88d7c38b0d63fe70f587878a2d4d4662cc5733d09e4ef3
MD5 c5fb2002d86b61401f744403c8c0b52c
BLAKE2b-256 59df8f09972dd3ef4a870fa45ccd7680b0bdd0f9260545ed3cbeb3392b200f78

See more details on using hashes here.

Provenance

The following attestation bundles were made for proteoflux-1.8.8-py3-none-any.whl:

Publisher: release.yml on Afanc/proteoflux

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page