Skip to main content

Schema resources for the National Microbiome Data Collaborative (NMDC)

Project description

Long NMDC logo

National Microbiome Data Collaborative Schema

PyPI - License PyPI version

The mission of the NMDC is to build a FAIR microbiome data sharing network, through infrastructure, data standards, and community building, that addresses pressing challenges in environmental sciences. The NMDC platform is built on top of a unified data model (schema) that weaves together existing standards and ontologies to provide a systematic representation of all aspects of the microbiome data life cycle.

This repository mainly defines a LinkML schema for managing metadata from the National Microbiome Data Collaborative (NMDC).

Documentation

The documentation for the NMDC schema can be found at https://microbiomedata.github.io/nmdc-schema/. This documentation is aimed at consumers of NMDC data and metadata, it describes the different data elements used to describe studies, samples, sample processing, data generation, workflows, and downstream data objects.

The NMDC Introduction to metadata and ontologies primer provides some the context for this project.

The remainder of this page is primary for the internal maintainers and contributors to the NMDC schema

Repository Contents Overview

Some products that are maintained, and tasks orchestrated within this repository are:

  • Maintenance of LinkML YAML that specifies the NMDC Schema
  • Makefile targets for converting the schema from it's native LinkML YAML format to other artifact like JSON Schema
  • Build, deployment and distribution of the schema as a PyPI package
  • Automatic publishing of refreshed documentation upon change to the schema, accessible here

Maintaining the Schema

See DEVELOPMENT.md for instructions on setting up a development environment.

See MAINTAINERS.md for instructions on using that development environment to maintain the schema.

Makefiles

Makefiles are text files people can use to tell make (a computer program) how it can make things (or—in general—do things). In the world of Makefiles, those things are called targets.

This repo contains 2 Makefiles:

  • Makefile, based on the generic Makefile from the LinkML cookiecutter
  • project.Makefile, which contains targets that are specific to this project

Here's an example of using make in this repo:

# Deletes all files in `examples/output`.
make examples-clean

The examples-clean target is defined in the project.Makefile. In this repo, the Makefile includes the project.Makefile. As a result, make has access to the targets defined in both files.

Data downloads

The NMDC's metadata about biosamples, studies, bioinformatics workflows, etc. can be obtained from our nmdc-runtime API. Try entering "biosample_set" or "study_set" into the collection_name box at https://api.microbiomedata.org/docs#/metadata/list_from_collection_nmdcschema__collection_name__get

Or use the API programmatically! Note that some collections are large, so the responses are paged.

You can learn about the other available collections at https://microbiomedata.github.io/nmdc-schema/Database/

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nmdc_schema-11.20.0.tar.gz (771.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nmdc_schema-11.20.0-py3-none-any.whl (845.6 kB view details)

Uploaded Python 3

File details

Details for the file nmdc_schema-11.20.0.tar.gz.

File metadata

  • Download URL: nmdc_schema-11.20.0.tar.gz
  • Upload date:
  • Size: 771.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for nmdc_schema-11.20.0.tar.gz
Algorithm Hash digest
SHA256 6eafd0bc92b0435b93e9f0064438e55e31ff7ac9a0c9b47e569c8676b713eef5
MD5 d8155787e1fdbcd301b4114038267e94
BLAKE2b-256 a2fc3778c6e380d8e78f994297d9268a5d285dba343271571177634d9a1538ae

See more details on using hashes here.

Provenance

The following attestation bundles were made for nmdc_schema-11.20.0.tar.gz:

Publisher: pypi-publish.yaml on microbiomedata/nmdc-schema

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file nmdc_schema-11.20.0-py3-none-any.whl.

File metadata

  • Download URL: nmdc_schema-11.20.0-py3-none-any.whl
  • Upload date:
  • Size: 845.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for nmdc_schema-11.20.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5321201d5f085b12f9b6fddf5cda5019461c281811c7840193845235d825c6da
MD5 10591416fcda2830d1902366eae3cc53
BLAKE2b-256 c48b050dd7e311b5fb3be39ebce5e986f10ac6f758f0f1870d69513343cf7f0f

See more details on using hashes here.

Provenance

The following attestation bundles were made for nmdc_schema-11.20.0-py3-none-any.whl:

Publisher: pypi-publish.yaml on microbiomedata/nmdc-schema

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page