Skip to main content

Genome-level presence inference from metaproteomic peptide lists.

Project description

MetaUmbra

MetaUmbra

Genome-level presence inference from metaproteomic peptides

MetaUmbra performs genome-level presence inference from metaproteomic peptide lists. It combines unique peptide support with weighted shared peptide evidence to identify statistically supported microbial genomes and generate interpretable presence rankings.

Main features

  • Evaluate candidate genome support from metaproteomic peptide tables
  • Build genome-specific theoretical peptide references from protein FASTA files
  • Support user-defined genome collections, including isolate genomes, strain panels, and MAG catalogs
  • Use both unique and shared peptide evidence for genome presence inference
  • Report genome-level p-values, BH-adjusted q-values, and presence scores
  • Provide GUI, command-line, and Python workflow support
  • Support peptide tables from common metaproteomics workflows such as DIA-NN and MaxQuant

Workflow overview

MetaUmbra workflow

Installation

MetaUmbra requires Python 3.10 or newer.

# Install with all features (GUI, parquet support)
pip install metaumbra[all]

or

# Install with core features only
pip install metaumbra

Usage

MetaUmbra can be used through either the graphical interface or the command line.

For a detailed walkthrough, including input formats, CLI examples, output interpretation, and troubleshooting, see the MetaUmbra Usage Guide.

Graphical interface

metaumbra-gui

The GUI supports FASTA digestion, peptide table loading, genome presence scoring, and result export.

Command line

MetaUmbra provides separate commands for the main workflow steps:

metaumbra digest --help
metaumbra score --help
metaumbra extract-parquet --help

A typical workflow is:

metaumbra digest ...
metaumbra score ...

Use metaumbra extract-parquet ... to convert DIA-NN parquet reports to peptide TSV files before scoring.

Input

MetaUmbra requires:

  • Protein FASTA files, with one FASTA file per genome
  • An observed peptide table containing peptide sequences

Optional inputs include peptide scores, peptide-level error values, decoy flags, and genome lineage annotations.

Output

The main output is a TSV table containing genome-level evidence and significance values.

Key output columns include:

Column Description
genome_id Candidate genome identifier
num_peptides_matched Number of observed peptides matched to the genome
num_peptides_unique Number of matched peptides unique to the genome
shared_fraction Fraction of matched peptides that are shared with other genomes
mean_degeneracy Mean number of genomes containing the matched peptides
pvalue Genome-level p-value
qvalue BH-adjusted genome-level q-value
presence_score Ranking score based on q-value

Citation

If you use MetaUmbra, please cite:

Wu Q, Ning Z, Zhang A, Cheng K, Figeys D. MetaUmbra: Statistically Controlled Genome-Level Presence Inference from Metaproteomic Peptides.[J]. bioRxiv, 2026.

A formal citation will be added after publication.

Contact

For questions or issues, please use the GitHub issue tracker or contact the corresponding author listed in the associated manuscript.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

metaumbra-1.1.3.tar.gz (3.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

metaumbra-1.1.3-py3-none-any.whl (3.2 MB view details)

Uploaded Python 3

File details

Details for the file metaumbra-1.1.3.tar.gz.

File metadata

  • Download URL: metaumbra-1.1.3.tar.gz
  • Upload date:
  • Size: 3.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for metaumbra-1.1.3.tar.gz
Algorithm Hash digest
SHA256 bd53628b2dbd34eeb1711dd3b4b2fef987c9db752b6cda6f776c761b4e537b45
MD5 99a3cf78260a607985d1c8572a6f7826
BLAKE2b-256 1dabc0ea972e8fc18cdcbcfe912dfd27bf3604e344b82763c78d6ffcba2cce85

See more details on using hashes here.

File details

Details for the file metaumbra-1.1.3-py3-none-any.whl.

File metadata

  • Download URL: metaumbra-1.1.3-py3-none-any.whl
  • Upload date:
  • Size: 3.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for metaumbra-1.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 e3b499efb0c1496a28c95a330781ae21f09f68863751f5f114d2070000ec2ae2
MD5 b24fd4cb7c860e2d6154cd4008a1f782
BLAKE2b-256 b89d7eb3fd6559c04380fa2f3a077f237f32bd2b2b43822a9647f8f29c82a045

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page