
Metadata assistant agent for Lobster AI - sample metadata and harmonization operations


lobster-metadata

Sample metadata management and harmonization for multi-omics datasets.

Installation

pip install lobster-metadata

Agents

Agent               Description
metadata_assistant  Metadata operations specialist. Sample ID mapping, schema standardization, dataset validation, and disease annotation enrichment.

Services

Service                         Purpose
SampleMappingService            Map sample IDs between datasets using multiple strategies
MetadataStandardizationService  Standardize metadata fields to Pydantic schemas
MetadataFilteringService        Filter datasets by metadata criteria
DiseaseStandardizationService   Standardize disease names to controlled vocabularies
DiseaseOntologyService          Map diseases to ontology terms (MONDO, DO)
ClinicalMetadataService         Extract and validate clinical metadata fields
SampleGroupingService           Group samples by metadata attributes
MicrobiomeFilteringService      Microbiome-specific metadata filtering

Features

Sample ID Mapping

  • Exact match between matrix and metadata identifiers
  • Fuzzy matching with configurable similarity thresholds
  • Pattern-based matching using regular expressions
  • Metadata-based correlation for complex mapping scenarios
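
The first three strategies can be illustrated with the standard library alone. The following is a minimal sketch, not the SampleMappingService API: the function name map_sample_ids, its parameters, the 0.8 default threshold, and the example identifiers are all assumptions made for illustration. Metadata-based correlation is not covered here.

```python
import re
from difflib import SequenceMatcher


def map_sample_ids(matrix_ids, metadata_ids, threshold=0.8, pattern=None):
    """Map each matrix ID to (metadata ID, strategy). Hypothetical helper.

    Strategies are tried in order: exact match, regex key match (the first
    capture group of `pattern`), then fuzzy match at or above `threshold`.
    """
    metadata_set = set(metadata_ids)
    regex = re.compile(pattern) if pattern else None

    def key(sample_id):
        # Comparison key extracted by the pattern, or None if no match.
        match = regex.search(sample_id) if regex else None
        return match.group(1) if match else None

    # Index metadata IDs by their pattern key (first occurrence wins).
    keyed = {}
    for meta_id in metadata_ids:
        meta_key = key(meta_id)
        if meta_key is not None:
            keyed.setdefault(meta_key, meta_id)

    mapping = {}
    for sample_id in matrix_ids:
        if sample_id in metadata_set:
            mapping[sample_id] = (sample_id, "exact")
            continue
        sample_key = key(sample_id)
        if sample_key is not None and sample_key in keyed:
            mapping[sample_id] = (keyed[sample_key], "pattern")
            continue
        # Fuzzy fallback: keep the most similar metadata ID.
        best, best_score = None, 0.0
        for meta_id in metadata_ids:
            score = SequenceMatcher(None, sample_id.lower(), meta_id.lower()).ratio()
            if score > best_score:
                best, best_score = meta_id, score
        if best_score >= threshold:
            mapping[sample_id] = (best, "fuzzy")
        else:
            mapping[sample_id] = (None, "unmatched")
    return mapping


# Illustrative identifiers only.
example = map_sample_ids(
    ["S1", "GSM1001_counts", "tumor_01", "zzz"],
    ["S1", "GSM1001", "Tumor-01"],
    pattern=r"(GSM\d+)",
)
```

Returning the strategy alongside each match makes it easy to review fuzzy hits separately from exact ones before trusting the mapping.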

Metadata Standardization

  • Transcriptomics schema (cell type, tissue, organism, disease)
  • Proteomics schema (platform, quantification method, normalization)
  • Microbiome schema (16S vs shotgun, taxonomic level, diversity)
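
As a rough illustration of what standardizing to a schema involves, the sketch below maps free-form column names onto the four transcriptomics fields listed above. The package itself is described as using Pydantic schemas; this example uses a standard-library dataclass instead so that it runs without extra dependencies. The class name, the alias table, and the standardize function are hypothetical.

```python
from dataclasses import dataclass, fields
from typing import Optional

# Hypothetical alias table: lowercased raw column name -> schema field.
FIELD_ALIASES = {
    "cell_type": "cell_type",
    "celltype": "cell_type",
    "cell type": "cell_type",
    "tissue": "tissue",
    "organism": "organism",
    "species": "organism",
    "disease": "disease",
    "diagnosis": "disease",
}


@dataclass
class TranscriptomicsSample:
    """Illustrative stand-in for a transcriptomics metadata schema."""
    sample_id: str
    cell_type: Optional[str] = None
    tissue: Optional[str] = None
    organism: Optional[str] = None
    disease: Optional[str] = None


def standardize(sample_id, raw):
    """Build a TranscriptomicsSample from a raw {column: value} mapping.

    Unknown columns are ignored; the first non-empty value per field wins.
    """
    known = {f.name for f in fields(TranscriptomicsSample)} - {"sample_id"}
    cleaned = {}
    for column, value in raw.items():
        target = FIELD_ALIASES.get(column.strip().lower())
        if target in known and value not in (None, ""):
            cleaned.setdefault(target, str(value).strip())
    return TranscriptomicsSample(sample_id=sample_id, **cleaned)


sample = standardize("S1", {"Species": "Homo sapiens ", "CellType": "T cell", "batch": "A"})
```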

Dataset Validation

  • Sample count consistency between matrix and metadata
  • Condition coverage verification for experimental design
  • Control sample identification and validation
  • Biological and technical replicate detection
  • Platform consistency checks across samples
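
Several of these checks reduce to set and counting operations. The sketch below is one possible shape for such a report, under stated assumptions: the function name validate_dataset, the field names "condition" and "platform", and the control labels are hypothetical, and replicate detection is approximated by counting samples per condition without distinguishing biological from technical replicates.

```python
from collections import Counter


def validate_dataset(matrix_ids, metadata, condition_key="condition",
                     control_labels=("control", "ctrl", "healthy")):
    """Run basic consistency checks (hypothetical helper).

    `metadata` maps sample_id -> dict of metadata fields.
    """
    matrix_set, meta_set = set(matrix_ids), set(metadata)

    def condition_of(row):
        # Normalize the condition label; empty or missing becomes "".
        return str(row.get(condition_key) or "").strip().lower()

    conditions = Counter(condition_of(row) or "<missing>" for row in metadata.values())
    controls = sorted(
        sample_id for sample_id, row in metadata.items()
        if condition_of(row) in control_labels
    )
    platforms = {row.get("platform") for row in metadata.values()}

    return {
        "counts_match": len(matrix_ids) == len(metadata),
        "missing_metadata": sorted(matrix_set - meta_set),
        "missing_in_matrix": sorted(meta_set - matrix_set),
        "samples_per_condition": dict(conditions),
        "unreplicated_conditions": sorted(c for c, n in conditions.items() if n < 2),
        "control_samples": controls,
        "platform_consistent": len(platforms) <= 1,
    }


# Illustrative data only.
report = validate_dataset(
    ["A", "B", "C", "D"],
    {
        "A": {"condition": "Control", "platform": "GPL1"},
        "B": {"condition": "control", "platform": "GPL1"},
        "C": {"condition": "treated", "platform": "GPL1"},
    },
)
```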

Disease Enrichment

Missing disease annotations are filled in through a four-phase hierarchy:

  1. Column re-scan for disease-related field names
  2. LLM-based abstract extraction from publication context
  3. LLM-based methods section parsing
  4. Manual mapping fallback for known datasets
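
The hierarchy above can be read as a fallback chain in which each phase runs only if the earlier ones found nothing. The sketch below shows that control flow only. It is not the package's implementation: all names are hypothetical, the column-name pattern is a guess, and the two LLM phases are replaced by a stub that returns nothing, since no model is called here.

```python
import re

# Assumed pattern for disease-related column names.
DISEASE_COLUMN_PATTERN = re.compile(
    r"disease|diagnos|condition|phenotype|patholog", re.IGNORECASE
)


def rescan_columns(record, context):
    """Phase 1: look for a non-empty value in a disease-like column."""
    for column, value in record.items():
        if DISEASE_COLUMN_PATTERN.search(column) and value:
            return str(value)
    return None


def llm_stub(record, context):
    """Placeholder for the LLM-based phases; a real phase would query a model."""
    return None


def make_manual_lookup(table):
    """Phase 4: curated {dataset_id: disease} table supplied by the caller."""
    def lookup(record, context):
        return table.get(context.get("dataset_id"))
    return lookup


def enrich_disease(record, context, phases):
    """Return (disease, phase_name) from the first phase that yields a value."""
    for name, phase in phases:
        value = phase(record, context)
        if value:
            return value, name
    return None, None


# Illustrative configuration; the dataset ID and disease are made up.
phases = [
    ("column_rescan", rescan_columns),
    ("abstract_llm", llm_stub),
    ("methods_llm", llm_stub),
    ("manual_mapping", make_manual_lookup({"DATASET_X": "colorectal cancer"})),
]

from_column = enrich_disease({"patient_diagnosis": "Crohn's disease"}, {}, phases)
from_manual = enrich_disease({"tissue": "colon"}, {"dataset_id": "DATASET_X"}, phases)
unresolved = enrich_disease({"tissue": "colon"}, {}, phases)
```

Recording which phase produced the annotation keeps LLM-derived values distinguishable from values read directly from the metadata.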

Multi-Omics Integration

  • Cross-modality sample alignment
  • Shared sample identification across data types
  • Metadata merging with conflict resolution
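
One simple way to combine shared-sample identification with conflict-aware merging is sketched below. It assumes a priority-based resolution rule (the first modality in the priority list wins and disagreements are recorded), which is an illustrative choice and not necessarily how the package resolves conflicts. The function name merge_metadata and the example data are hypothetical.

```python
def merge_metadata(per_modality, priority=None):
    """Merge metadata for samples present in every modality.

    `per_modality` maps modality -> {sample_id: {field: value}}.
    Returns (merged, conflicts), where conflicts is a list of
    (sample_id, field, kept_value, discarded_value) tuples.
    """
    if not per_modality:
        return {}, []
    priority = list(priority or per_modality)

    # Shared samples: intersection of sample IDs across all modalities.
    shared = set.intersection(*(set(samples) for samples in per_modality.values()))

    merged, conflicts = {}, []
    for sample_id in sorted(shared):
        row = {}
        for modality in priority:
            for field, value in per_modality[modality][sample_id].items():
                if value is None:
                    continue
                if field not in row:
                    row[field] = value  # highest-priority value is kept
                elif row[field] != value:
                    conflicts.append((sample_id, field, row[field], value))
        merged[sample_id] = row
    return merged, conflicts


# Illustrative data only.
merged, conflicts = merge_metadata(
    {
        "rna": {"S1": {"tissue": "liver", "sex": "F"}, "S2": {"tissue": "lung"}},
        "protein": {"S1": {"tissue": "hepatic", "age": 54}, "S3": {"tissue": "skin"}},
    },
    priority=["rna", "protein"],
)
```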

Requirements

  • Python 3.12+
  • lobster-ai >= 1.0.0

Testing

To run the test suite for this package:

# Install in editable mode with dev dependencies
pip install -e ".[dev]"

# Run all tests
pytest tests/ -v

# Run specific test categories
pytest tests/services/metadata/ -v  # Service tests only
pytest tests/agents/ -v             # Agent tests only

Documentation

Full documentation: docs.omics-os.com/docs/agents/metadata

License

AGPL-3.0-or-later
