Skip to main content

Genome-level presence inference from metaproteomic peptide lists.

Project description

MetaUmbra

MetaUmbra

GitHub Homepage

Genome-level presence inference from metaproteomic peptides

MetaUmbra converts identified metaproteomic peptides into statistically supported genome presence calls. It evaluates each candidate genome using both unique and shared peptide evidence and reports genome-level p-values, BH-adjusted q-values, and presence scores.

Main features

  • Evaluate candidate genome support from metaproteomic peptide tables
  • Build genome-specific theoretical peptide references from protein FASTA files
  • Support user-defined genome collections, including isolate genomes, strain panels, and MAG catalogs
  • Use both unique and shared peptide evidence for genome presence inference
  • Report genome-level p-values, BH-adjusted q-values, and presence scores
  • Provide GUI, command-line, and Python workflow support
  • Support peptide tables from common metaproteomics workflows such as DIA-NN and MaxQuant

Workflow overview

MetaUmbra workflow

Installation

MetaUmbra requires Python 3.10 or newer.

pip install ".[all]"

Usage

MetaUmbra can be used through either the graphical interface or the command line.

Graphical interface

metaumbra-gui

The GUI supports FASTA digestion, peptide table loading, genome presence scoring, and result export.

Command line

MetaUmbra provides separate commands for the main workflow steps:

metaumbra digest --help
metaumbra score --help
metaumbra extract-parquet --help

A typical workflow is:

metaumbra digest ...
metaumbra score ...

Use metaumbra extract-parquet ... to convert DIA-NN parquet reports to peptide TSV files before scoring.

Input

MetaUmbra requires:

  • Protein FASTA files, with one FASTA file per genome
  • An observed peptide table containing peptide sequences

Optional inputs include peptide scores, peptide-level error values, decoy flags, and genome lineage annotations.

Output

The main output is a TSV table containing genome-level evidence and significance values.

Key output columns include:

Column Description
genome_id Candidate genome identifier
num_peptides_matched Number of observed peptides matched to the genome
num_peptides_unique Number of matched peptides unique to the genome
weighted_evidence Total degeneracy-weighted peptide evidence
weighted_evidence_shared Weighted evidence from shared peptides
p_presence Genome-level p-value
q_presence BH-adjusted genome-level q-value
presence_score Ranking score based on q-value

Citation

If you use MetaUmbra, please cite:

Wu Q, Ning Z, Zhang A, Cheng K, Figeys D. MetaUmbra: Statistically Controlled Genome-Level Presence Inference from Metaproteomic Peptides.

A formal citation will be added after publication.

Contact

For questions or issues, please use the GitHub issue tracker or contact the corresponding author listed in the associated manuscript.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

metaumbra-1.1.1.tar.gz (3.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

metaumbra-1.1.1-py3-none-any.whl (3.2 MB view details)

Uploaded Python 3

File details

Details for the file metaumbra-1.1.1.tar.gz.

File metadata

  • Download URL: metaumbra-1.1.1.tar.gz
  • Upload date:
  • Size: 3.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for metaumbra-1.1.1.tar.gz
Algorithm Hash digest
SHA256 ec1d8325dd6dddf1c23fd49643f82cb186f941a456db92148a22ccec05c1b52c
MD5 d57086be5d91bab5e21a14514540cc18
BLAKE2b-256 9efb09f8a56712b53ef36e2a4cfd573a4653be4682c1bd921c00d083c8c4efb2

See more details on using hashes here.

File details

Details for the file metaumbra-1.1.1-py3-none-any.whl.

File metadata

  • Download URL: metaumbra-1.1.1-py3-none-any.whl
  • Upload date:
  • Size: 3.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for metaumbra-1.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 8d69501495a91ad799b47bda0ca9c0c783e288ffb3a1bf0e1b8eb5040e2859d2
MD5 6a19dead4567dcb34a595793232c415a
BLAKE2b-256 c8fa474c568cb5613ac30d96a36b0a85ab630c9426a77227aafb3afe5a3cd1a1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page