Skip to main content

Validate and convert SDRF files to configuration files for pipelines.

Project description

sdrf-pipelines

PyPI version PyPI - Downloads CI Codacy Badge

The official SDRF-Proteomics validator and converter. Validate your sample metadata files and convert them to workflow configurations for OpenMS, MaxQuant, MSstats, and more.

Quick Start with sdrf as a command/line tool

Install python and pipx or just uv

uv:

# Install
uv tool install sdrf-pipelines[all]

# Validate your SDRF file
parse_sdrf validate-sdrf --sdrf_file your_file.sdrf.tsv

pipx:

# Install
pipx install sdrf-pipelines[all]

# Validate your SDRF file
parse_sdrf validate-sdrf --sdrf_file your_file.sdrf.tsv

That's it! Your SDRF file will be validated against the default mass spectrometry template.

Installation

Installation Command Features
Basic pip install sdrf-pipelines Structural validation + all converters
Full pip install sdrf-pipelines[ontology] + Ontology term validation (EFO, CL, MS, etc.)

The basic installation validates column structure, formatting, and uniqueness. Add [ontology] to also validate that terms exist in their respective ontologies.

Usage

For detailed command documentation, see COMMANDS.md or use the built-in help:

parse_sdrf --help
parse_sdrf validate-sdrf --help

Validation Examples

# Basic validation
parse_sdrf validate-sdrf --sdrf_file sample.sdrf.tsv

# Validate with a specific template
parse_sdrf validate-sdrf --sdrf_file sample.sdrf.tsv --template human

# Skip ontology validation for quick structural checks
parse_sdrf validate-sdrf --sdrf_file sample.sdrf.tsv --skip-ontology

Converter Examples

# Convert to OpenMS format
parse_sdrf convert-openms -s sdrf.tsv

# Convert to MaxQuant format
parse_sdrf convert-maxquant -s sdrf.tsv -f database.fasta -r /path/to/raw/files

# Convert to MSstats annotation
parse_sdrf convert-msstats -s sdrf.tsv -o annotation.csv

# Convert to MHCquant format (samplesheet + search presets)
parse_sdrf convert-mhcquant -s sdrf.tsv -os samplesheet.tsv -op presets.tsv

Development

We use modern Python tooling for development:

Package Manager: uv

We use uv as our package manager for fast, reliable dependency management.

# Install uv (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Create virtual environment and install dependencies
uv sync

# Install with dev dependencies
uv sync --group dev

Code Quality: ruff + pre-commit

We use ruff for linting and formatting, and pre-commit for automated checks.

# Install pre-commit hooks
uv run pre-commit install

# Run all checks manually
uv run pre-commit run --all-files

Running Tests

uv run pytest

For more details, see CONTRIBUTING.md.

Citation

If you use this software, please cite:

Dai C, Füllgrabe A, Pfeuffer J, et al. A proteomics sample metadata representation for multiomics integration and big data analysis. Nat Commun 12, 5854 (2021). https://doi.org/10.1038/s41467-021-26111-3

For full citation details and BibTeX format, see CITATION.cff.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sdrf_pipelines-0.1.3.tar.gz (36.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sdrf_pipelines-0.1.3-py3-none-any.whl (594.1 kB view details)

Uploaded Python 3

File details

Details for the file sdrf_pipelines-0.1.3.tar.gz.

File metadata

  • Download URL: sdrf_pipelines-0.1.3.tar.gz
  • Upload date:
  • Size: 36.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.8 {"installer":{"name":"uv","version":"0.11.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for sdrf_pipelines-0.1.3.tar.gz
Algorithm Hash digest
SHA256 27229226774b46ac3b55ce14c878fd30605d1dd43dbe7481bcef8b4c68265473
MD5 f955973c97ef954f471afb54fc54f49b
BLAKE2b-256 4b5369fcfd1911a07a464ebeb291ed9ffd47198172f0843e4ebf8af852ae3c12

See more details on using hashes here.

File details

Details for the file sdrf_pipelines-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: sdrf_pipelines-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 594.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.8 {"installer":{"name":"uv","version":"0.11.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for sdrf_pipelines-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 71fae131739e5a01acba3424907657883e12f99547450066b3a2f618a1d6579f
MD5 96ba786104ec5c032c07a4300afd1084
BLAKE2b-256 8984d6b9154f640612935a748f7fa924ca0080af4ac2a182538164a5895bef6a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page