Skip to main content

Validate and convert SDRF files to configuration files for pipelines.

Project description

sdrf-pipelines

PyPI version PyPI - Downloads CI Codacy Badge

The official SDRF-Proteomics validator and converter. Validate your sample metadata files and convert them to workflow configurations for OpenMS, MaxQuant, MSstats, and more.

Quick Start with sdrf as a command/line tool

Install python and pipx or just uv

uv:

# Install
uv tool install sdrf-pipelines[all]

# Validate your SDRF file
parse_sdrf validate-sdrf --sdrf_file your_file.sdrf.tsv

pipx:

# Install
pipx install sdrf-pipelines[all]

# Validate your SDRF file
parse_sdrf validate-sdrf --sdrf_file your_file.sdrf.tsv

That's it! Your SDRF file will be validated against the default mass spectrometry template.

Installation

Installation Command Features
Basic pip install sdrf-pipelines Structural validation + all converters
Full pip install sdrf-pipelines[ontology] + Ontology term validation (EFO, CL, MS, etc.)

The basic installation validates column structure, formatting, and uniqueness. Add [ontology] to also validate that terms exist in their respective ontologies.

Usage

For detailed command documentation, see COMMANDS.md or use the built-in help:

parse_sdrf --help
parse_sdrf validate-sdrf --help

Validation Examples

# Basic validation
parse_sdrf validate-sdrf --sdrf_file sample.sdrf.tsv

# Validate with a specific template
parse_sdrf validate-sdrf --sdrf_file sample.sdrf.tsv --template human

# Skip ontology validation for quick structural checks
parse_sdrf validate-sdrf --sdrf_file sample.sdrf.tsv --skip-ontology

Converter Examples

# Convert to OpenMS format
parse_sdrf convert-openms -s sdrf.tsv

# Convert to MaxQuant format
parse_sdrf convert-maxquant -s sdrf.tsv -f database.fasta -r /path/to/raw/files

# Convert to MSstats annotation
parse_sdrf convert-msstats -s sdrf.tsv -o annotation.csv

# Convert to MHCquant format (samplesheet + search presets)
parse_sdrf convert-mhcquant -s sdrf.tsv -os samplesheet.tsv -op presets.tsv

Development

We use modern Python tooling for development:

Package Manager: uv

We use uv as our package manager for fast, reliable dependency management.

# Install uv (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Create virtual environment and install dependencies
uv sync

# Install with dev dependencies
uv sync --group dev

Code Quality: ruff + pre-commit

We use ruff for linting and formatting, and pre-commit for automated checks.

# Install pre-commit hooks
uv run pre-commit install

# Run all checks manually
uv run pre-commit run --all-files

Running Tests

uv run pytest

For more details, see CONTRIBUTING.md.

Citation

If you use this software, please cite:

Dai C, Füllgrabe A, Pfeuffer J, et al. A proteomics sample metadata representation for multiomics integration and big data analysis. Nat Commun 12, 5854 (2021). https://doi.org/10.1038/s41467-021-26111-3

For full citation details and BibTeX format, see CITATION.cff.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sdrf_pipelines-0.1.4.tar.gz (36.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sdrf_pipelines-0.1.4-py3-none-any.whl (595.2 kB view details)

Uploaded Python 3

File details

Details for the file sdrf_pipelines-0.1.4.tar.gz.

File metadata

  • Download URL: sdrf_pipelines-0.1.4.tar.gz
  • Upload date:
  • Size: 36.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.11 {"installer":{"name":"uv","version":"0.11.11","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for sdrf_pipelines-0.1.4.tar.gz
Algorithm Hash digest
SHA256 dad60edc3b90c93c0d9563c215b648a5b08446b166985e34095845873e9a484d
MD5 9313c70625f9cbeded6bc85142bdf4d9
BLAKE2b-256 267d3f46d532ca1caa95c0e1c66e4033d97d358094abce275e9c86e249ffafa5

See more details on using hashes here.

File details

Details for the file sdrf_pipelines-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: sdrf_pipelines-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 595.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.11 {"installer":{"name":"uv","version":"0.11.11","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for sdrf_pipelines-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 e47874d856d177e586a677492df6acabe8b49fffa22e160c5b7c874674042b5c
MD5 8651702c923b40c1c3e4c8aa23271cd1
BLAKE2b-256 fd43202404688347ea156e45d2619cd3933d929ba7b3fca622555b8b7bb3b1e9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page