Quantitative proteomics data toolkit — convert, transform, query, and validate QPX Parquet datasets

Maintainers

ypriverol

QPX

A Python package for working with mass spectrometry data in the QPX format.

[Figure: QPX architecture]

Features

Convert data from DIA-NN, MaxQuant, FragPipe, QuantMS (mzTab), mzIdentML, and SDRF to QPX Parquet format
Transform QPX data: gene mapping, protein quantification (DirectLFQ, MaxLFQ, iBAQ, TopN, …), accession normalization, metadata updates
Query datasets with SQL, filter rows, or preview with head
Inspect dataset summaries, Arrow schemas, and Parquet metadata
Validate datasets against the canonical QPX schema
Ontology management for PSI-MS and PRIDE CV terms

Performance

[Figure: QPX benchmark]

Installation

Install from PyPI

pip install qpx

# With optional extras
pip install "qpx[quantify]"    # protein quantification (mokume + DirectLFQ)
pip install "qpx[all]"         # all optional dependencies
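
To confirm that the install landed in the active environment, a minimal sketch using only the Python standard library is shown here. It assumes nothing about QPX's own API: it only looks up the distribution name `qpx` used in the pip commands above, and the helper name `installed_version` is made up for this example.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package: str):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# "qpx" is the distribution name used in the pip commands above.
print(installed_version("qpx") or "qpx is not installed in this environment")
```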

Install from GitHub (latest dev)

pip install git+https://github.com/bigbio/qpx.git

Install from Source

# Clone the repository
git clone https://github.com/bigbio/qpx.git
cd qpx

# Install the package locally
pip install .

Install and build with uv

uv is a fast Python package installer and resolver. The project supports PEP 621 and can be installed, built, and published with uv.

Prerequisites: Install uv (e.g. curl -LsSf https://astral.sh/uv/install.sh | sh or pip install uv).

# Install from GitHub
uv pip install "qpx @ git+https://github.com/bigbio/qpx.git"

# With optional extras (transforms, plotting)
uv pip install "qpx[transforms,plotting] @ git+https://github.com/bigbio/qpx.git"

From a local clone:

git clone https://github.com/bigbio/qpx.git
cd qpx

# Create a venv, install the project and its dependencies (recommended)
uv sync

# Or install in editable mode with optional dev dependencies
uv sync --extra dev

# Run the CLI without installing globally
uv run qpxc --help

Build distributable packages (sdist and wheel in dist/):

uv build

Publish to PyPI (after configuring credentials or trusted publishing):

uv build
uv publish

The pyproject.toml uses PEP 621 metadata with Hatchling as the build backend.

Development Installation

For development with all dependencies:

# Using uv (recommended for fast installs)
uv sync --extra dev

# Or using pip
pip install -e ".[dev]"

System Dependencies

QPX depends on pyOpenMS, which requires certain system libraries. If you encounter errors related to missing shared libraries (e.g., libglib-2.0.so.0), install the required system dependencies:

Ubuntu/Debian:

sudo apt-get update
sudo apt-get install -y libglib2.0-0

macOS:

brew install glib
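
Before installing anything, it may help to check whether the loader can already locate GLib. The following is a rough sketch using the standard library's `ctypes.util.find_library`; the helper name `glib_available` is hypothetical, and the lookup can miss libraries installed in non-standard locations (for example inside a Conda environment or a Homebrew prefix), so a negative result is only a hint.

```python
from ctypes.util import find_library


def glib_available() -> bool:
    """Return True if the system loader can locate the GLib 2.0 shared library."""
    # find_library takes the name without the "lib" prefix or file suffix.
    return find_library("glib-2.0") is not None


if glib_available():
    print("GLib 2.0 was found")
else:
    print("GLib 2.0 was not found; see the install commands above")
```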

Using Conda/Mamba (Recommended for pyOpenMS):

Using mamba (faster dependency resolution):

mamba env create -f environment.yml
conda activate qpx
pip install git+https://github.com/bigbio/qpx.git

Or with conda:

conda env create -f environment.yml
conda activate qpx
pip install git+https://github.com/bigbio/qpx.git

Usage

The package provides a command-line interface (qpxc) with the following command groups:

qpxc [OPTIONS] COMMAND [ARGS]...

Commands:
  convert    Convert external tool outputs to QPX format.
  transform  Transform QPX data into derived representations.
  query      Query and inspect QPX datasets.
  info       Show information about a QPX dataset.
  validate   Validate a QPX dataset or structure against the canonical schema.
  ontology   Manage CV ontology data (PSI-MS, PRIDE CV).

Convert

qpxc convert [diann | maxquant | quantms | fragpipe | mzidentml | sdrf] [OPTIONS]

Transform

qpxc transform [gene-map | quantify | normalize-accessions | update-metadata] [OPTIONS]

Query

# Run SQL against a dataset
qpxc query sql --dataset-path ./PXD014414 --sql "SELECT anchor_protein, COUNT(*) FROM feature GROUP BY 1"

# Filter rows
qpxc query filter --dataset-path ./PXD014414 --structure feature --condition "charge >= 3"

# Preview first N rows
qpxc query head --dataset-path ./PXD014414 --structure feature -n 20
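
When the query commands are driven from a script, passing arguments as a list avoids shell-quoting problems with SQL strings. The sketch below only assembles the subcommands and flags shown above (`--dataset-path`, `--sql`, `--structure`, `-n`); the function names are hypothetical helpers, not part of the qpx package, and `run_qpxc` assumes `qpxc` is on the PATH.

```python
import shlex
import shutil
import subprocess


def query_sql_command(dataset_path: str, sql: str) -> list[str]:
    """Build the argument list for `qpxc query sql`."""
    return ["qpxc", "query", "sql", "--dataset-path", dataset_path, "--sql", sql]


def query_head_command(dataset_path: str, structure: str, n: int) -> list[str]:
    """Build the argument list for `qpxc query head`."""
    return ["qpxc", "query", "head", "--dataset-path", dataset_path,
            "--structure", structure, "-n", str(n)]


def run_qpxc(command: list[str]) -> subprocess.CompletedProcess:
    """Run a qpxc command without a shell and capture its text output."""
    if shutil.which(command[0]) is None:
        raise FileNotFoundError("qpxc was not found on PATH")
    return subprocess.run(command, capture_output=True, text=True, check=False)


cmd = query_sql_command(
    "./PXD014414", "SELECT anchor_protein, COUNT(*) FROM feature GROUP BY 1"
)
# shlex.join shows how the list would look as a single shell command line.
print(shlex.join(cmd))
```

Calling `run_qpxc(cmd)` would then execute the query and return the captured output in `stdout`.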

Info & Validate

# Dataset summary
qpxc info --dataset-path ./PXD014414

# Validate against canonical schema
qpxc validate --dataset-path ./PXD014414
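
To validate several datasets in one go, the command above can be wrapped in a small loop. This is a sketch under one loudly stated assumption: that `qpxc validate` exits with a non-zero status when validation fails, which is common for command-line tools but is not confirmed here. The function name `validate_datasets` and the second dataset path in the usage note are hypothetical.

```python
import subprocess
from typing import Callable, Iterable


def validate_datasets(paths: Iterable[str], run: Callable = subprocess.run) -> dict[str, bool]:
    """Run `qpxc validate` on each path and map path -> True if it passed.

    ASSUMPTION: a zero exit code means the dataset validated successfully.
    The `run` parameter exists so the subprocess call can be swapped in tests.
    """
    results = {}
    for path in paths:
        completed = run(
            ["qpxc", "validate", "--dataset-path", path],
            capture_output=True,
            text=True,
        )
        results[path] = completed.returncode == 0
    return results
```

A call such as `validate_datasets(["./PXD014414", "./another_dataset"])` would return a dictionary that can be scanned for failures.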

Configuration

Most commands support a --verbose flag that enables more detailed logging to stdout. The CLI uses standard logging configuration and does not require environment variables.

Development

Project Structure

qpx/
├── cli/                    # Click CLI (entry point: qpx.cli.main:main)
│   ├── main.py             # Top-level CLI group
│   └── convert.py          # convert subcommands (maxquant, diann, quantms, fragpipe, mzidentml, sdrf)
├── converters/             # Tool-specific converters
│   ├── quantms/            # QuantMS (mzTab) converter
│   ├── diann/              # DIA-NN converter
│   ├── maxquant/           # MaxQuant converter
│   ├── fragpipe/           # FragPipe converter
│   ├── mzidentml/          # mzIdentML converter
│   └── sdrf.py             # Shared SDRF converter
├── core/                   # Core logic & formats
│   ├── data/               # Schema definitions (YAML + Python)
│   │   └── schemas/        # YAML schema files for all structures
│   ├── engine.py           # DuckDB engine wrapper
│   ├── scores.py           # Score normalization & ontology
│   └── ontology/           # OBO ontology registry
├── writers/                # Parquet writers (one per structure)
├── views/                  # Analytical views (protein, peptide, QC)
└── dataset.py              # Main Dataset class entry point

Contributing

1. Fork the repository
2. Create a feature branch
3. Make your changes
4. Run tests
5. Submit a pull request

License

This project is licensed under the Apache-2.0 License - see the LICENSE file for details.

Core contributors and collaborators

The project is maintained by the following contributors and groups:

Yasset Perez-Riverol (PRIDE Team, European Bioinformatics Institute - EMBL-EBI, U.K.)
Ping Zheng (Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing, China)

IMPORTANT: If you contribute to this project, please make sure to add your name to the list of contributors.

Code of Conduct

As part of our efforts toward delivering open and inclusive science, we follow the Contributor Covenant Code of Conduct for Open Source Projects.

How to cite

Copyright notice

Copyright 2025 BigBio

Licensed under the Apache License, Version 2.0.
See the LICENSE file for details.

Release history

1.0.2: May 6, 2026
1.0.1: Apr 3, 2026
1.0.0: Apr 1, 2026 (this version)

Download files

Source Distributions

No source distribution files are available for this release.

Built Distribution

qpx-1.0.0-py3-none-any.whl (498.9 kB)

Uploaded Apr 1, 2026 Python 3

File details

Details for the file qpx-1.0.0-py3-none-any.whl.

File metadata

Download URL: qpx-1.0.0-py3-none-any.whl
Upload date: Apr 1, 2026
Size: 498.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for qpx-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`777413f7d4e1ff564cabd36691676bf88291051a72b78dbaff6f07a7905833f2`
MD5	`6fc20e926af30a2aa6d21c59fdd07f44`
BLAKE2b-256	`62c1c69202c996e090f83c80ae21468baf673359b200d0819bec1505b510d0f6`
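
A downloaded wheel can be checked against the SHA256 digest listed above. The following is a minimal sketch using the standard library's `hashlib`; the helper names are illustrative, and the file path in the usage note assumes the wheel was saved under its published name in the current directory.

```python
import hashlib

# SHA256 digest published above for qpx-1.0.0-py3-none-any.whl.
EXPECTED_SHA256 = "777413f7d4e1ff564cabd36691676bf88291051a72b78dbaff6f07a7905833f2"


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str) -> bool:
    """Return True if the file's SHA256 equals the published digest."""
    return sha256_of(path) == EXPECTED_SHA256
```

For example, `matches_published_hash("qpx-1.0.0-py3-none-any.whl")` should return True for an intact download of that file.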


Provenance

The following attestation bundles were made for qpx-1.0.0-py3-none-any.whl:

Publisher: python-publish.yml on bigbio/qpx

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: qpx-1.0.0-py3-none-any.whl
- Subject digest: 777413f7d4e1ff564cabd36691676bf88291051a72b78dbaff6f07a7905833f2
- Sigstore transparency entry: 1206122506
- Sigstore integration time: Apr 1, 2026
Source repository:
- Permalink: bigbio/qpx@23a539f1293ce393c23dca61ebc85df14fc0424c
- Branch / Tag: refs/tags/v1.0.0
- Owner: https://github.com/bigbio
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@23a539f1293ce393c23dca61ebc85df14fc0424c
- Trigger Event: release
