Skip to main content

Genome-level presence inference from metaproteomic peptide lists.

Project description

MetaUmbra

MetaUmbra

Genome-level presence inference from metaproteomic peptides

MetaUmbra performs genome-level presence inference from metaproteomic peptide lists. It combines unique peptide support with weighted shared peptide evidence to identify statistically supported microbial genomes and generate interpretable presence rankings.

Main features

  • Evaluate candidate genome support from metaproteomic peptide tables
  • Build genome-specific theoretical peptide references from protein FASTA files
  • Support user-defined genome collections, including isolate genomes, strain panels, and MAG catalogs
  • Use both unique and shared peptide evidence for genome presence inference
  • Report genome-level p-values, BH-adjusted q-values, and presence scores
  • Provide GUI, command-line, and Python workflow support
  • Support peptide tables from common metaproteomics workflows such as DIA-NN and MaxQuant

Workflow overview

MetaUmbra workflow

Installation

MetaUmbra requires Python 3.10 or newer. Installation is available via pip from PyPI.

# Install with all features (GUI, parquet support)
pip install metaumbra[all]

The default GUI extra uses PySide6. To run the GUI with PyQt5 instead, install metaumbra[gui-pyqt5].

or

# Install with core features only
pip install metaumbra

Usage

MetaUmbra can be used through either the graphical interface or the command line.

For a detailed walkthrough, including input formats, CLI examples, output interpretation, and troubleshooting, see the MetaUmbra Usage Guide.

Graphical interface

metaumbra-gui

The GUI supports FASTA digestion, peptide table loading, genome presence scoring, and result export.

Command line

MetaUmbra provides separate commands for the main workflow steps:

metaumbra digest --help
metaumbra score --help
metaumbra extract-parquet --help

A typical workflow is:

metaumbra digest ...
metaumbra score ...

Use metaumbra extract-parquet ... to convert DIA-NN parquet reports to peptide TSV files before scoring.

Input

MetaUmbra requires:

  • Protein FASTA files, with one FASTA file per genome
  • An observed peptide table containing peptide sequences

Optional inputs include peptide scores, peptide-level error values, decoy flags, and genome lineage annotations.

Output

The main output is a TSV table containing genome-level evidence and significance values.

Key output columns include:

Column Description
genome_id Candidate genome identifier
num_peptides_matched Number of observed peptides matched to the genome
num_peptides_unique Number of matched peptides unique to the genome
shared_fraction Fraction of matched peptides that are shared with other genomes
mean_degeneracy Mean number of genomes containing the matched peptides
pvalue Genome-level p-value
qvalue BH-adjusted genome-level q-value
presence_score Ranking score based on q-value

Citation

If you use MetaUmbra, please cite:

Wu Q, Ning Z, Zhang A, Cheng K, Figeys D. MetaUmbra: Statistically Controlled Genome-Level Presence Inference from Metaproteomic Peptides.[J]. bioRxiv, 2026.04.29.721689.

A formal citation will be added after publication.

Contact

For questions or issues, please use the GitHub issue tracker or contact the corresponding author listed in the associated manuscript.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

metaumbra-1.1.6.tar.gz (3.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

metaumbra-1.1.6-py3-none-any.whl (3.2 MB view details)

Uploaded Python 3

File details

Details for the file metaumbra-1.1.6.tar.gz.

File metadata

  • Download URL: metaumbra-1.1.6.tar.gz
  • Upload date:
  • Size: 3.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for metaumbra-1.1.6.tar.gz
Algorithm Hash digest
SHA256 ebb647ab93153cce4f49f9e22740222fd6789d4fbc3fd356be721417950e2f4a
MD5 b34824430afa79e270753d973af63d6d
BLAKE2b-256 c5f2ef1cf4eebb55173139679009a1e27ab4d4f37afb89d60603af64830e7e98

See more details on using hashes here.

File details

Details for the file metaumbra-1.1.6-py3-none-any.whl.

File metadata

  • Download URL: metaumbra-1.1.6-py3-none-any.whl
  • Upload date:
  • Size: 3.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for metaumbra-1.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 bd99f22c0830f6f335d24e408022a178458eb018d30c3a4e4cd8448988d1704c
MD5 ed3dd5fd87b141d2e9b8f09ac387c234
BLAKE2b-256 e3865c6aa4b1331b88bad4d6f251db63a7123073ad1f04b64677e26fe3300adc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page