A lightweight Python library for capturing execution context.

These details have not been verified by PyPI

Project links

Project description

pubrun

Let your code monitor itself and write its own Methods section while you go to the pub.

pubrun is a stupidly simple, zero-dependency¹ Python library that eliminates the boilerplate of documenting methodology, tracking versions, recording inputs, and monitoring resources — making it dramatically easier to publish, share, and reproduce your models and research. If you're feeling formal, you can think of "publication-ready runner" as the meaning of the name.

Installation

Available on PyPI:

pip install pubrun

On Python 3.8–3.10, this also installs tomli (a backport of the standard-library tomllib). On Python 3.11+, there are zero runtime dependencies.

Quick Start

import pubrun  # That's it 90% of the time!

pubrun -h  # Lots of info here.

That's it. No frameworks, no heavy integrations, no syntax hijacking. When the script exits, pubrun silently generates a structured, lightweight footprint in your local ./runs/ directory.

[!NOTE] Console capture: By default, pubrun tees stdout and stderr to log files in the run directory. Your terminal output is unchanged, but a copy is saved alongside the manifest. If your script produces very high output volume, you can disable this with capture_mode = "off" in .pubrun.toml or via pubrun.start(console={"capture_mode": "off"}). See Configuration for details.

See CLI Reference and API Reference for full details.

Features

Automatic Execution Tracing — Captures environment variables, hardware specs, and dependency graphs without manual configuration.
Publication-Ready Output — Generates LaTeX/Markdown methodology blocks ready for academic papers.
Semantic Diffing — Compares execution footprints to identify subtle but critical differences between runs.
Secret Redaction — Automatically detects and redacts passwords, tokens, and API keys in environment variables and CLI arguments.
Codebase Drift Detection — Compares current code state against the execution snapshot to highlight changes.
Cross-Platform Reproducibility — Extracts initialization commands for seamless environment replication.
HPC Optimized — Supports global parent-child manifest hydration to minimize overhead on massive clusters.

The Problem

Modern scientific workflows rely on implicit state. When it's time to publish a paper or ship a model, researchers are forced to retroactively piece together their methodology — PyTorch versions, OS constraints, hardware parameters — from memory.

The Solution

pubrun removes this friction by automating execution tracking and metadata compilation.

With a single import pubrun, the library quietly traces your script execution, hashes your environment dependencies, detects codebase drift, and compiles publication-ready Computational Methodology LaTeX/Markdown blocks making your run immediately documentable and ready for publication.

Import Modes

By default, import pubrun starts tracking immediately. For more control, use namespaced import modes:

import pubrun.noauto as pubrun   # Load API, start later with pubrun.start()
import pubrun.nopatch as pubrun  # Auto-start; no subprocess/console monkeypatching; standard hooks active
import pubrun.noconsole as pubrun # Auto-start; intercepts subprocesses and signals, but skips wrapping console streams
import pubrun.minimal as pubrun  # API only; no auto-start; all monkeypatches and hooks disabled

Preset Modes Behavior Matrix

Import Mode	Auto-Start	Intercept Subprocesses (`SubprocessSpy`)	Wrap Console Streams (`ConsoleInterceptor`)	Intercept Signals & Exits (`SignalExitCapture`)	Description
`auto` (default)	✅	✅	✅	✅	Full telemetry tracking begins automatically on import.
`noauto`	❌	✅	✅	✅	Tracking must be started manually; all patches and hooks are active once started.
`nopatch`	✅	❌	❌	✅	Telemetry tracking begins automatically; no intrusive stdout/stderr wrapping or subprocess patching; standard exit/signal hooks remain active.
`noconsole`	✅	✅	❌	✅	Telemetry tracking begins automatically; intercepts subprocesses and signals, but skips wrapping stdout/stderr console streams.
`minimal`	❌	❌	❌	❌	API only; tracking must be started manually; all patches and hooks are disabled (zero-footprint mode).

Or configure project-wide in .pubrun.toml:

[imports]
mode = "noauto"

Or use the CLI wrapper for scripts you can't modify:

pubrun run --mode minimal -- python script.py

Legacy approaches still work: PUBRUN_AUTO_START=false and [core].auto_start = false.

See Configuration for the full [imports] section.

Explicit Tracking Example

import pubrun.noauto as pubrun

pubrun.start(output_dir="./custom_storage", profile="deep")
# ... your code ...
pubrun.stop()

Now extract your method paragraph for your paper:

pubrun methods --format latex

Sample Output

Computational experiments were executed on a machine running Linux (5.15.0-91-generic) equipped with an Intel(R) Core(TM) i7-12700H and 32.0 GB of RAM. The execution environment relied on Python 3.10.12 (CPython). Key dependencies tracked include torch (v2.0.1) and numpy (v1.24.3). To facilitate computational reproducibility, the exact state of the source code was anchored at Git commit a1b2c3d4. Environment and execution provenance were tracked using the pubrun library [1].

[!NOTE] Windows support: pubrun works on Windows, but some capture engines have reduced functionality. Process uid/gid fields are not available, and os.system interception uses shell-string parsing rather than structured argument lists. All other features work identically.

CLI Reference

The pubrun CLI (and its convenient shorthand alias pbr) provides thirteen commands and diagnostic flags, all designed to work equally well on a developer laptop or across a Slurm array of thousands of HPC jobs.

`pubrun bug-report`

Opens the GitHub issue tracker and prints environment diagnostics for copy-pasting.

pubrun bug-report

`pubrun cite`

Generates the bibliographic citation for crediting this library in your paper.

pubrun cite --style bibtex

`pubrun clean`

Interactively delete old run directories. Lists candidates with age and size, then prompts for confirmation.

pubrun clean                        # Interactive: list and confirm
pubrun clean --older-than 7d --yes  # Non-interactive: delete all completed runs older than 7 days
pubrun clean --status crashed --yes # Delete all crashed runs
pubrun clean --dry-run              # Preview what would be deleted

`pubrun combined`

Interleave stdout and stderr logs chronologically from one or more runs.

pubrun combined [RUN_ID ...] --output combined.log

`pubrun diff`

Generates a semantic side-by-side comparison between two execution traces, filtering volatile noise (timestamps, PIDs) by default.

pubrun diff ./runs/pubrun-A ./runs/pubrun-B --same --basic --wrap

`pubrun meta`

Generates a standalone environment snapshot for HPC parent-child hydration.

pubrun meta --out ./runs/meta.json --deep

`pubrun methods`

Translates raw JSON diagnostic payloads into publication-ready methodology paragraphs.

pubrun methods [RUN_DIR] --format markdown|latex

`pubrun report`

A diagnostic viewer that surfaces execution timing, hardware, dependencies, and codebase drift. Accepts multiple run directories for sequential evaluation.

pubrun report ./runs/pubrun-A ./runs/pubrun-B --deep

`pubrun rerun`

Extracts the exact shell command needed to reproduce a run.

pubrun rerun ./runs/pubrun-A

`pubrun resources`

Renders CPU and memory utilization graphs over the lifecycle of a run.

pubrun resources [RUN_DIR]

`pubrun run`

Spawn a command with a specific import mode. Useful for CI, Slurm, and scripts you can't modify.

pubrun run --mode minimal -- python script.py
pubrun run --mode nopatch -- python train.py

`pubrun status`

Lists all runs with their current status (completed, failed, interrupted, running, crashed, ghost), or inspects a specific run in detail. Detects active processes via cross-platform PID liveness checks.

pubrun status              # Compact table of all runs
pubrun status -v           # Verbose listing with PID, RSS, CPU, events
pubrun status a3f9         # Inspect a specific run by ID prefix
pubrun status --dir /path  # Scan a non-default output directory

`pubrun ui`

Launches the interactive terminal user interface (TUI) dashboard to browse, inspect, and manage runs. Note: Requires optional TUI dependencies (installable via pip install "pubrun[tui]" or pip install textual rich).

pubrun ui              # Open the interactive TUI manager (aliases: tui, gui)
pubrun ui --dir /path  # Scan a non-default output directory

Diagnostic Flags

Flag	Description
`--version`	Print the installed pubrun version and exit
`--create-config`	Bootstrap a fully commented `.pubrun.toml` file
`--show-config`	Print the default configuration to the terminal
`--info`	Display system capabilities and pubrun version
`--run-tests`	Execute the built-in self-test suite

See CLI Reference for full details and examples.

Monitoring Runs

pubrun tracks the lifecycle of every run from start to finish, enabling real-time and post-hoc inspection of execution state.

Lock Files and Liveness Detection

When a run starts, pubrun writes a .pubrun.lock file to the run directory containing the PID, start timestamp, hostname, and git commit. This file is removed when the run finalizes normally.

If a process is killed (SIGKILL, OOM, power loss), the lock file persists. pubrun status detects these orphaned runs by checking whether the recorded PID is still alive and whether its start time matches (to handle PID recycling). Runs are classified as:

Status	Meaning
completed	Manifest exists, outcome is "completed"
failed	Manifest exists, outcome is "failed"
interrupted	Run received SIGINT, SIGTERM, or SIGHUP (e.g., Ctrl+C)
broken pipe	Run completed but received SIGPIPE (downstream consumer closed)
running	Lock file present, process is alive
crashed	Lock file present, process is dead
ghost	Run entered ghost mode (filesystem write failure at init)

Signal and Exit Code Capture

pubrun installs non-intrusive signal handlers that record OS signals (SIGINT, SIGTERM, SIGHUP, etc.) received during execution. These handlers chain to any pre-existing handlers — if the importing script has its own SIGINT handler, it is called normally after pubrun records the signal.

The process exit code is also captured at finalization. All signal and exit data appears in the "signals" section of the manifest:

{
  "signals_received": [
    {"signal": 2, "signal_name": "SIGINT", "timestamp_utc": 1780250544.068}
  ],
  "exit_code": 0,
  "exit_exception": null
}

Signal capture is configurable via [capture.signals].enabled in .pubrun.toml.

Live Process Inspection

For running processes, pubrun status <run-id> shows live resource usage (RSS memory and CPU percent) queried cross-platform:

Linux: reads from /proc/<pid>/status and /proc/<pid>/stat
macOS: queries via ps
Windows: uses ctypes (kernel32/psapi) and wmic

No external dependencies are required.

Advanced HPC Ecosystems (Global Hydration)

If you run thousands of array jobs across a cluster, you don't want each child run wasting time and disk logging identical dependency graphs. pubrun supports parent-child manifest hydration.

Step 1: Snap the Parent Cluster

On the head node, snapshot the global environment:

pubrun meta --out ./runs/meta.json --deep

This generates a deep metadata map of hardware, environment variables, and the full Python package tree.

Step 2: Hydrate Children

In your Slurm script, reference the parent snapshot:

export PUBRUN_META_REF=meta.json
python minimal_script.py

Child scripts automatically skip heavy footprint tracking. When you run pubrun report or pubrun methods, the orchestrator detects the PUBRUN_META_REF, pulls in the parent meta.json context, and stitches the complete hardware and dependency picture back together. It also compares script timestamps against the parent snapshot and warns you if environmental drift has been detected.

Configuration

pubrun supports a hierarchical configuration system (highest to lowest precedence):

API overrides — pubrun.start(profile="deep")
Environment variables — PUBRUN_AUTO_START=false
Local project config — .pubrun.toml or .config/pubrun/config.toml
User home config — ~/.config/pubrun/config.toml
Built-in defaults — default.toml (shipped with the library)

Generate a Configuration File

pubrun --create-config

See Configuration Reference for all settings and examples.

Security & Redaction

pubrun automatically detects and redacts sensitive values (passwords, tokens, API keys, credentials) in both environment variables and CLI arguments before writing them to the manifest. Redaction is destructive by default — raw values are replaced with {"representation": "redacted"}, and no hashes are generated, to prevent brute-force attacks.

Both environment variable and argv redaction are independently configurable:

[redaction]
env_enabled = true    # Redact matching environment variable values
argv_enabled = true   # Redact matching CLI argument values

See Configuration Reference for the full redaction policy and regex pattern.

Roadmap

Future

Sphinx / MkDocs integration — Generate hosted API documentation from docstrings.
GitHub Actions CI — Automated test matrix on push/PR.
Plugin / extension model — Formal extension points for custom capture engines.
Artifact registration API — register_artifact() for tracking user-produced output files.
Custom metadata API — register_metadata() for injecting structured data into the manifest.
Timestamped console capture — standard mode prepends timestamps to log lines, enabling pubrun combined (below).
pubrun combined command — Interleaves stdout and stderr from one or more runs using log timestamps. Requires timestamped capture (item 6).

Citation

If you use pubrun in your research, please cite the JOSS paper as the preferred reference, or cite the specific software version archived on Zenodo.

Preferred Citation (JOSS Paper):

Fariello, G. (2026). pubrun: Low-friction execution provenance for Python research. Journal of Open Source Software, (Paper in submission).
Software Archive (Zenodo):

Fariello, G. (2026). pubrun: Lightweight native execution provenance and reproducibility tracking. Zenodo. (Archive pending public release).

Upon release and archiving on Zenodo, the following citation badges will be active:

Concept DOI Badge (resolves to all versions): [![DOI](https://zenodo.org/badge/doi/10.5281/zenodo.[CONCEPT_ID].svg)](https://doi.org/10.5281/zenodo.[CONCEPT_ID])
Version DOI Badge (resolves specifically to version 1.2.0): [![DOI](https://zenodo.org/badge/doi/10.5281/zenodo.[VERSION_ID].svg)](https://doi.org/10.5281/zenodo.[VERSION_ID])

Acknowledgements

pubrun was redesigned and rewritten from pre-existing custom libraries, code fragments, scripts, and ideas spanning almost two decades, with the assistance of Google Antigravity for its official release.

License

On Python 3.11+, pubrun uses only the standard library. On Python 3.8–3.10, the sole runtime dependency is tomli (a backport of the standard-library tomllib). ↩

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.3.0

Jun 23, 2026

1.2.0

Jun 22, 2026

1.1.2

Jun 22, 2026

1.1.1

Jun 21, 2026

1.1.0

Jun 21, 2026

1.0.3

Jun 21, 2026

1.0.2

Jun 20, 2026

1.0.1

Jun 20, 2026

1.0.0

Jun 20, 2026

0.3.0

Jun 5, 2026

0.2.0

Jun 1, 2026

0.1.1

May 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pubrun-1.3.0.tar.gz (283.2 kB view details)

Uploaded Jun 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pubrun-1.3.0-py3-none-any.whl (129.1 kB view details)

Uploaded Jun 23, 2026 Python 3

File details

Details for the file pubrun-1.3.0.tar.gz.

File metadata

Download URL: pubrun-1.3.0.tar.gz
Upload date: Jun 23, 2026
Size: 283.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for pubrun-1.3.0.tar.gz
Algorithm	Hash digest
SHA256	`e107cf1c08aeb33100cf647e307b71b6259250929a7370e5b3b98c28ad901033`
MD5	`fb0cbcdc4d8f772af322e1a56f04e595`
BLAKE2b-256	`9c87015876c84af7f77fac923641b11b039c9b546134573b6093bc12c40301ab`

See more details on using hashes here.

File details

Details for the file pubrun-1.3.0-py3-none-any.whl.

File metadata

Download URL: pubrun-1.3.0-py3-none-any.whl
Upload date: Jun 23, 2026
Size: 129.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for pubrun-1.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e7423918e774a8ba6debb25022802f69eb406327f24e3ea7dcd3e5dfde95773f`
MD5	`58b217a747f1837d6a0db1e3e49ed766`
BLAKE2b-256	`95e76025ac08b92f820fbf27251138bea39dcf052f90e58b825048f27882b8ed`

See more details on using hashes here.

pubrun 1.3.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

pubrun

Installation

Quick Start

Features

The Problem

The Solution

Import Modes

Preset Modes Behavior Matrix

Explicit Tracking Example

Sample Output

CLI Reference

pubrun bug-report

pubrun cite

pubrun clean

pubrun combined

pubrun diff

pubrun meta

pubrun methods

pubrun report

pubrun rerun

pubrun resources

pubrun run

pubrun status

pubrun ui

Diagnostic Flags

Monitoring Runs

Lock Files and Liveness Detection

Signal and Exit Code Capture

Live Process Inspection

Advanced HPC Ecosystems (Global Hydration)

Step 1: Snap the Parent Cluster

Step 2: Hydrate Children

Configuration

Generate a Configuration File

Security & Redaction

Roadmap

Future

Citation

Acknowledgements

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`pubrun bug-report`

`pubrun cite`

`pubrun clean`

`pubrun combined`

`pubrun diff`

`pubrun meta`

`pubrun methods`

`pubrun report`

`pubrun rerun`

`pubrun resources`

`pubrun run`

`pubrun status`

`pubrun ui`