MDMP core protocol for dataset contracts, grading, fingerprints, and AI lineage cards
Project description
MDMP
MDMP is an open protocol and tooling stack for dataset quality and AI training provenance.
Tagline: Know what your AI learned from.
30-second summary
MDMP gives every dataset a contract, a grade, and a fingerprint. It gives every model a lineage card that points to exactly which dataset fingerprints were used.
- Contract: schema, ranges, consent metadata.
- Validation: reproducible checks + deterministic grade.
- Fingerprint: immutable dataset identity (
sha256:...). - Lineage card: model-to-dataset traceability with stale detection.
Install
python -m venv .venv
source .venv/bin/activate
python -m pip install -U pip
pip install -e .
Quick Start
mdmp init --flavor health --output contracts/mdmp_contract.yaml
mdmp validate contracts/mdmp_contract.yaml data/demo_cgm.csv --output-json results/mdmp_report.json
mdmp report results/mdmp_report.json --output-html results/mdmp_dashboard.html
Core Commands
# grading + fingerprint
mdmp grade contracts/mdmp_contract.yaml data/demo_cgm.csv
mdmp fingerprint data/demo_cgm.csv
# stale lineage lifecycle
mdmp fingerprint-record data/demo_cgm.csv --output-json results/fingerprint.json --expires-days 365
mdmp fingerprint-check results/fingerprint.json data/demo_cgm.csv
mdmp lineage-card --model glucose_forecaster_v2 --dataset data/demo_cgm.csv --contract contracts/health_demo.yaml --output results/mdmp_model_card.yaml
mdmp lineage-card-refresh results/mdmp_model_card.yaml
# local registry scaffold
mdmp registry init --registry registry/mdmp_registry.json
mdmp registry push --registry registry/mdmp_registry.json --report results/mdmp_report.json --visibility public --model-id glucose_forecaster_v2
mdmp registry lookup sha256:YOUR_FINGERPRINT --registry registry/mdmp_registry.json
mdmp registry list --registry registry/mdmp_registry.json
# Hugging Face section export
mdmp hf-export --dataset-id python35/demo-cgm --report-json results/mdmp_report.json --output-md results/mdmp_hf_section.md
Documentation
- Spec:
MDMP_SPEC.md - Docs index:
docs/index.md - CLI reference:
docs/reference/cli.md - Launch checklist:
docs/launch/launch-checklist.md - Contributing:
CONTRIBUTING.md
Release Automation
- GitHub release workflow:
.github/workflows/release.yml - PyPI publish workflow:
.github/workflows/publish-pypi.yml - Docs deploy workflow:
.github/workflows/docs-site.yml
Design Boundaries
- MDMP stores metadata + fingerprints, not raw datasets.
- Local-first workflow; cloud registry is optional.
- Complementary to DVC / MLflow / W&B.
Status
- Spec version:
v0.1-draft - Research/provenance utility.
- Not a medical device and not clinical decision support.
Integrations
- IINTS integration guide:
docs/IINTS_INTEGRATION.md
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mdmp_protocol-0.1.2.tar.gz.
File metadata
- Download URL: mdmp_protocol-0.1.2.tar.gz
- Upload date:
- Size: 21.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
68bde2a87b17a6aea94444a322946575500121a9d4cc40af0bed607694fcf454
|
|
| MD5 |
fb3180a4e7b37962e8eef0bee6dc9ead
|
|
| BLAKE2b-256 |
d2ca08787cd7ab515922df322e9289506ea33561d2af5e307363a14438c8e4b3
|
Provenance
The following attestation bundles were made for mdmp_protocol-0.1.2.tar.gz:
Publisher:
publish-pypi.yml on python35/MDMP
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
mdmp_protocol-0.1.2.tar.gz -
Subject digest:
68bde2a87b17a6aea94444a322946575500121a9d4cc40af0bed607694fcf454 - Sigstore transparency entry: 1058342751
- Sigstore integration time:
-
Permalink:
python35/MDMP@9c75bf38fadf14f187581da5387401cec19da68d -
Branch / Tag:
refs/tags/v0.1.2 - Owner: https://github.com/python35
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-pypi.yml@9c75bf38fadf14f187581da5387401cec19da68d -
Trigger Event:
push
-
Statement type:
File details
Details for the file mdmp_protocol-0.1.2-py3-none-any.whl.
File metadata
- Download URL: mdmp_protocol-0.1.2-py3-none-any.whl
- Upload date:
- Size: 22.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6f4413324a4711e588624ed94b1cdc2fbb189ab9e27ee02450c4810cad4cfa35
|
|
| MD5 |
bfbc5a9205dddcab98d62413ed61e69d
|
|
| BLAKE2b-256 |
0390258e217b25925f837dfd4287d8cbdcd9e65eab166abc987edd636363f9ba
|
Provenance
The following attestation bundles were made for mdmp_protocol-0.1.2-py3-none-any.whl:
Publisher:
publish-pypi.yml on python35/MDMP
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
mdmp_protocol-0.1.2-py3-none-any.whl -
Subject digest:
6f4413324a4711e588624ed94b1cdc2fbb189ab9e27ee02450c4810cad4cfa35 - Sigstore transparency entry: 1058342759
- Sigstore integration time:
-
Permalink:
python35/MDMP@9c75bf38fadf14f187581da5387401cec19da68d -
Branch / Tag:
refs/tags/v0.1.2 - Owner: https://github.com/python35
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-pypi.yml@9c75bf38fadf14f187581da5387401cec19da68d -
Trigger Event:
push
-
Statement type: