Skip to main content

MDMP core protocol for dataset contracts, grading, fingerprints, and AI lineage cards

Project description

MDMP

MDMP is an open protocol and tooling stack for dataset quality and AI training provenance.

Tagline: Know what your AI learned from.

30-second summary

MDMP gives every dataset a contract, a grade, and a fingerprint. It gives every model a lineage card that points to exactly which dataset fingerprints were used.

  • Contract: schema, ranges, consent metadata.
  • Validation: reproducible checks + deterministic grade.
  • Fingerprint: immutable dataset identity (sha256:...).
  • Lineage card: model-to-dataset traceability with stale detection.

Install

python -m venv .venv
source .venv/bin/activate
python -m pip install -U pip
pip install -e .

Quick Start

mdmp init --flavor health --output contracts/mdmp_contract.yaml
mdmp validate contracts/mdmp_contract.yaml data/demo_cgm.csv --output-json results/mdmp_report.json
mdmp report results/mdmp_report.json --output-html results/mdmp_dashboard.html

Core Commands

# grading + fingerprint
mdmp grade contracts/mdmp_contract.yaml data/demo_cgm.csv
mdmp fingerprint data/demo_cgm.csv

# stale lineage lifecycle
mdmp fingerprint-record data/demo_cgm.csv --output-json results/fingerprint.json --expires-days 365
mdmp fingerprint-check results/fingerprint.json data/demo_cgm.csv
mdmp lineage-card --model glucose_forecaster_v2 --dataset data/demo_cgm.csv --contract contracts/health_demo.yaml --output results/mdmp_model_card.yaml
mdmp lineage-card-refresh results/mdmp_model_card.yaml

# local registry scaffold
mdmp registry init --registry registry/mdmp_registry.json
mdmp registry push --registry registry/mdmp_registry.json --report results/mdmp_report.json --visibility public --model-id glucose_forecaster_v2
mdmp registry lookup sha256:YOUR_FINGERPRINT --registry registry/mdmp_registry.json
mdmp registry list --registry registry/mdmp_registry.json

# Hugging Face section export
mdmp hf-export --dataset-id python35/demo-cgm --report-json results/mdmp_report.json --output-md results/mdmp_hf_section.md

Documentation

  • Spec: MDMP_SPEC.md
  • Docs index: docs/index.md
  • CLI reference: docs/reference/cli.md
  • Launch checklist: docs/launch/launch-checklist.md
  • Contributing: CONTRIBUTING.md

Release Automation

  • GitHub release workflow: .github/workflows/release.yml
  • PyPI publish workflow: .github/workflows/publish-pypi.yml
  • Docs deploy workflow: .github/workflows/docs-site.yml

Design Boundaries

  • MDMP stores metadata + fingerprints, not raw datasets.
  • Local-first workflow; cloud registry is optional.
  • Complementary to DVC / MLflow / W&B.

Status

  • Spec version: v0.1-draft
  • Research/provenance utility.
  • Not a medical device and not clinical decision support.

Integrations

  • IINTS integration guide: docs/IINTS_INTEGRATION.md

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mdmp_protocol-0.1.0.tar.gz (21.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mdmp_protocol-0.1.0-py3-none-any.whl (22.8 kB view details)

Uploaded Python 3

File details

Details for the file mdmp_protocol-0.1.0.tar.gz.

File metadata

  • Download URL: mdmp_protocol-0.1.0.tar.gz
  • Upload date:
  • Size: 21.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mdmp_protocol-0.1.0.tar.gz
Algorithm Hash digest
SHA256 1265f375a88eb531078e4a345a267f57729a72373b6a2a1134555de309ebf7da
MD5 c19e2db87ef83cd9d309bff985647e77
BLAKE2b-256 88cdd9117cd77187bcc227c855b9205a09194621d7aa62a789dadc7ccab7f2f1

See more details on using hashes here.

Provenance

The following attestation bundles were made for mdmp_protocol-0.1.0.tar.gz:

Publisher: publish-pypi.yml on python35/MDMP

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file mdmp_protocol-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: mdmp_protocol-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 22.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mdmp_protocol-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a8895959ec7940100b2ae248e1f4969be211cc98b3afe6d13aeac68ae285e283
MD5 22577b558bb7f021acfe0b49ad5dd4d2
BLAKE2b-256 9c4b438587750ac10787217e0fef3b421d8b5d55c8dcd3d789c92cac5386d4ba

See more details on using hashes here.

Provenance

The following attestation bundles were made for mdmp_protocol-0.1.0-py3-none-any.whl:

Publisher: publish-pypi.yml on python35/MDMP

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page