pcq - contract runtime + agent CLI surface for CQ ML experiments

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

pcq

pcq is the contract for agent-run ML experiments. This repository hosts the contract specification under spec/ and the reference Python implementation under src/pcq/. Install the reference impl: uv add pcq (Apache-2.0).

The contract turns a project with cq.yaml into a reproducible experiment unit. The reference Python implementation loads config, resolves output paths, captures metrics, writes standard artifacts, finalizes run evidence, and exposes JSON/JSONL/MCP surfaces that coding agents, CI jobs, notebooks, and services can consume. See spec/IMPLEMENTATIONS.md for the registered implementation list (Python reference + CQ Go production worker today) and the procedure for adding yours.

pcq is not a training framework, model zoo, adapter matrix, or CQ-only client. Use PyTorch, Hugging Face Trainer, Lightning, sklearn, TabPFN, PyCaret, XGBoost, shell scripts, remote jobs, or project-local research code. The contract is the integration layer.

pcq does not operate the model.
pcq operates the experiment boundary.

Case studies (external evidence): mnist-dogfood | tabular-dogfood | mcp-dogfood | cq-worker-dogfood

Agent-readable site files: llms.txt, llms-full.txt, agent-manifest.json.

Identity

pcq = open-source experiment evidence/control library
cq  = managed execution + orchestration + dashboard + agent loop

CQ service is one managed consumer of the contract. pcq remains useful without CQ: locally, in CI, in notebooks, and inside third-party orchestrators.

Why pcq

Framework-neutral — keep the training stack that fits the problem.
Agent-readable — use JSON/JSONL instead of terminal scraping.
Agent-verifiable — validate source, config, environment, metrics, artifacts, and run records.
Agent-operable — run, observe, validate, describe, compare, lineage, and iterate through stable commands.
Service-ready — CQ can consume the same contract for managed execution and automatic experiment loops.

What's New (v4.4 – v4.6)

Three agent-fillable metadata fields were added to run_record.json across the last three minor releases, making each run's evidence richer with zero extra code in most cases.

Field	Since	Captures	Auto-filled?
`attribution`	v4.4	author / committer / operator — who ran the experiment	Yes (agent identity injected at runtime)
`worker_spec`	v4.5	cpu / gpu / memory / os — where it ran	Yes (`pcq.worker_spec()` auto-detects hardware)
`fingerprint`	v4.6	modality / task_kind / shape / PII-safe stats — what data	Semi-auto (`pcq.fingerprint(X, y)` detects most fields)

attribution — who

Records the human author, the AI committer, and the operator that launched the run. Coding agents (Claude Code, Codex) fill this automatically from their identity context.

pcq.attribution(
    author={"kind": "human", "id": "alice"},
    committer={"kind": "agent", "id": "claude-code"},
    operator="ci-runner-42",
)

Spec: spec/SPEC.md § Attribution

worker_spec — where

Records CPU model, core count, GPU kind/VRAM, total memory, and OS. Called with no arguments for a full auto-detection pass.

pcq.worker_spec()          # 자동 감지 — 인수 불필요

Spec: spec/SPEC.md § Worker Spec

fingerprint — what

Records dataset modality, task kind, sample count, size class, domain, and PII-safe summary statistics. Accepts a NumPy/pandas array or DataFrame and infers most fields.

pcq.fingerprint(X, y, modality="tabular")

Spec: spec/SPEC.md § Fingerprint

All three fields are optional — existing runs remain valid. When present they appear as first-class evidence in run_record.json and are surfaced through pcq describe-run --json.

Reproducibility Substrate

3개의 선택적 필드로 독립 재현이 가능한 substrate를 run_record.json에 제공한다. pcq는 검증하지 않는다 — 검증을 가능하게 만든다.

Field	Captures
`code`	`{content_sha256, scope}` — 어떤 코드가 해시되었는지
`seeds`	`{<name>: int \| str, ...}` — 재현에 필요한 난수 시드
`data_ref`	`{uri, content_sha256, size_bytes}` — 데이터셋 식별자 및 콘텐츠 hash

상세 스키마, PHI 게이트(R5), integrity 확장, R8 한계 문장: spec/SPEC.md § Reproducibility Pack

Note: code content sha proves WHAT code was recorded, not THAT it produced these outputs. See SPEC.md R8.

Note: pcq records claims, not judgments — intent is a recorded assertion (a fact about what was claimed), not a pcq verdict on success.

Installation

uv add pcq
# Optional — to expose pcq as MCP tools to agent runtimes:
uv add 'pcq[mcp]'

pyproject.toml:

[project]
dependencies = ["pcq"]              # core only
# or:
dependencies = ["pcq[mcp]"]         # core + Model Context Protocol server

Docker (MCP server only)

A minimal container image is also published; it packages pcq[mcp] from PyPI and runs pcq mcp serve on stdio.

docker build -t pcq .
docker run -i --rm pcq          # MCP client attaches to stdin/stdout

The image is intentionally scoped to the MCP server surface — for pcq run, pcq describe-run, pcq agent install and other CLI subcommands, install pcq directly with uv add pcq instead.

For a tag, branch, or private fork:

[tool.uv.sources]
pcq = { git = "https://github.com/playidea-lab/pcq.git", tag = "v4.1.0" }

The PyPI distribution, import name, CLI command, GitHub repository, runtime workspace, and JSON contract namespace are all pcq. Runtime contract names from CQ remain stable: cq.yaml, CQ_CONFIG_JSON, and cq://.

Minimal Contract

cq.yaml declares the run:

name: sklearn-baseline
cmd: uv run python train.py
configs:
  output_dir: output
  seed: 42
  strictness: 3
  monitor: eval_acc
  mode: max
metrics:
  - epoch
  - eval_acc
artifacts:
  - output/
inputs: {}

train.py can use any framework:

import pickle

import pcq
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

cfg = pcq.config()
out = pcq.output_dir()
pcq.seed_everything(cfg.get("seed", 42))

x, y = load_iris(return_X_y=True)
x_train, x_eval, y_train, y_eval = train_test_split(
    x,
    y,
    test_size=0.25,
    random_state=int(cfg.get("seed", 42)),
    stratify=y,
)

model = RandomForestClassifier(random_state=int(cfg.get("seed", 42)))
model.fit(x_train, y_train)
eval_acc = float(model.score(x_eval, y_eval))

with (out / "model.pkl").open("wb") as f:
    pickle.dump(model, f)

history = [{"epoch": 0, "eval_acc": eval_acc}]
pcq.log(**history[-1])
pcq.save_all(history=history, artifacts={"model": "model.pkl"})

No sklearn adapter is required. The same pattern works for HF Trainer, Lightning, XGBoost, TabPFN, PyCaret, shell commands, or custom code.

Agent Command Surface

Read and validate the project:

pcq resolve --json
pcq inspect . --json
pcq validate . --strictness 2 --json

Run the project:

pcq run --path . --json
pcq run --path . --jsonl
pcq run --path . --events output/events.jsonl --json

Validate and summarize outputs:

pcq validate-run output --strictness 3 --json
pcq describe-run output --json
pcq compare-runs old_output new_output --json
pcq lineage output --json

Iterate:

pcq apply-plan experiment.plan.json --json

Agent rule: prefer JSON/JSONL surfaces over scraping human output. pcq reports facts; the agent or service chooses policy.

Standard Artifacts

A completed run should produce:

config.json
metrics.json
manifest.json
run_summary.json
run_record.json
validation_report.json

run_record.json is the canonical completion object. It combines execution, source, environment, input identity, metric schema, artifact manifest, agent provenance, validation, and summary evidence.

Agent Runtime Assets

pcq can install its canonical agent instructions and skill into a project. Package installation itself never mutates project agent files.

pcq agent install --target codex --path .
pcq agent install --target claude --path .
pcq agent install --target both --path . --dry-run --json
pcq agent status --target both --path . --json

To also wire the project for MCP-aware agents (Claude Code, Codex), install pcq[mcp] and pass --mcp:

uv add 'pcq[mcp]'
pcq agent install --target claude --path . --mcp     # writes .mcp.json
pcq mcp serve                                         # stdio (default)

This exposes 14 mcp__pcq__* tools (resolve_project, validate_run, describe_run, compare_runs, ...) so agents call pcq directly without subprocess parsing. See MCP Integration.

Reusable assets:

v4 Direction

v4 clarifies the product boundary:

contract-first workflow, not a 3-tier training API
project-local training code, not built-in production catalogs
contract scripts, not framework adapters
run evidence validation, not recipe ownership
JSON/JSONL facts, not prose parsing

See pcq v4 Direction.

Development

uv run ruff check src/ tests/ scripts/
uv run python -m compileall src/pcq
uv run pytest tests/ -q
bash scripts/release-smoke.sh

License

Apache-2.0.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

leetang2

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

4.11.0

May 23, 2026

4.10.0

May 20, 2026

4.3.0

May 12, 2026

4.2.0

May 9, 2026

4.1.0

May 9, 2026

4.0.0

May 9, 2026

3.0.4

May 9, 2026

3.0.3

May 9, 2026

3.0.2

May 9, 2026

3.0.1

May 9, 2026

3.0.0

May 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pcq-4.11.0.tar.gz (631.6 kB view details)

Uploaded May 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pcq-4.11.0-py3-none-any.whl (171.2 kB view details)

Uploaded May 23, 2026 Python 3

File details

Details for the file pcq-4.11.0.tar.gz.

File metadata

Download URL: pcq-4.11.0.tar.gz
Upload date: May 23, 2026
Size: 631.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pcq-4.11.0.tar.gz
Algorithm	Hash digest
SHA256	`03e7867b264b48bfa37b117e036fc67d21fca3ab5514fd4e92ad8d55e2ff19df`
MD5	`67bc30f0fa7f409c123bfa26c599c099`
BLAKE2b-256	`ab52a8796d1054ba4498a1dc9a8865cbe3643e6ff89978016bab1647cca6814d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pcq-4.11.0.tar.gz:

Publisher: release.yml on playidea-lab/pcq

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pcq-4.11.0.tar.gz
- Subject digest: 03e7867b264b48bfa37b117e036fc67d21fca3ab5514fd4e92ad8d55e2ff19df
- Sigstore transparency entry: 1610205068
- Sigstore integration time: May 23, 2026
Source repository:
- Permalink: playidea-lab/pcq@66b361bd88e88e7aa686a26300aabfcdd2db85c5
- Branch / Tag: refs/tags/v4.11.0
- Owner: https://github.com/playidea-lab
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@66b361bd88e88e7aa686a26300aabfcdd2db85c5
- Trigger Event: push

File details

Details for the file pcq-4.11.0-py3-none-any.whl.

File metadata

Download URL: pcq-4.11.0-py3-none-any.whl
Upload date: May 23, 2026
Size: 171.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pcq-4.11.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`532a6882c6edb1f7f6769ad9a437e9f953d47db60f9f52b72c306b513eb0beb7`
MD5	`6047d50a430cb130fc48d76c09520c6e`
BLAKE2b-256	`889b9bffd45b0e3d99762ffdf79a51d690b54d52bf038ae137fdf55300fde5c2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pcq-4.11.0-py3-none-any.whl:

Publisher: release.yml on playidea-lab/pcq

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pcq-4.11.0-py3-none-any.whl
- Subject digest: 532a6882c6edb1f7f6769ad9a437e9f953d47db60f9f52b72c306b513eb0beb7
- Sigstore transparency entry: 1610205420
- Sigstore integration time: May 23, 2026
Source repository:
- Permalink: playidea-lab/pcq@66b361bd88e88e7aa686a26300aabfcdd2db85c5
- Branch / Tag: refs/tags/v4.11.0
- Owner: https://github.com/playidea-lab
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@66b361bd88e88e7aa686a26300aabfcdd2db85c5
- Trigger Event: push

pcq 4.11.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

pcq

Identity

Why pcq

What's New (v4.4 – v4.6)

attribution — who

worker_spec — where

fingerprint — what

Reproducibility Substrate

Installation

Docker (MCP server only)

Minimal Contract

Agent Command Surface

Standard Artifacts

Agent Runtime Assets

v4 Direction

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance