pytest-snap

Minimal deterministic snapshot capture of a pytest run: per-test outcome + duration (ns) stored in a JSON file. Intended as a small foundation for optional future diff / perf / gating features.

Current 0.1.0 scope:

  • Pytest plugin auto‑loaded (entry point snap).
  • --snap flag enables capture.
  • --snap-out PATH chooses output file (default .snap/current.json).
  • CLI wrapper for repeated labeled runs (pytest-snap run, pytest-snap all).

Out of scope (planned, not implemented yet in the plugin runtime): budgets, flake scoring, inline performance gating, historical diffing inside the test session. The README will grow with those features post‑0.1.0.


Installation

pip install pytest-snapcheck

Quick Start

Run tests with snapshot capture:

pytest --snap

The snapshot is written to .snap/current.json (create the directory if it doesn't already exist). To change the destination:

pytest --snap --snap-out my_run.json

Use the helper CLI for labeled runs (writes .artifacts/snap_<label>.json):

pytest-snap run v1
pytest-snap run v2
# or
pytest-snapcheck run v1

Generate several labels in sequence:

pytest-snap all               # default labels v1 v2 v3

Using a custom tests folder

By default, the CLI runs your repo's ./tests directory if it exists. To target a different folder, file, or a single test (node id), pass --tests:

# A specific directory
pytest-snap run v1 --tests ./path/to/tests

# A subfolder of your test tree
pytest-snap run v1 --tests tests/integration

# A single file or a single test node
pytest-snap run v1 --tests tests/test_api.py
pytest-snap run v1 --tests tests/test_api.py::test_happy_path

# Add regular pytest filters (forwarded as-is)
pytest-snap run v1 --tests tests/integration -k "smoke" -m "not flaky"

Prefer using plain pytest? The plugin doesn't change discovery; just supply paths as usual and add the flags:

pytest --snap --snap-out .artifacts/snap.json ./path/to/tests
# If plugin autoload is disabled:
pytest -p pytest_snap.plugin --snap --snap-out .artifacts/snap.json ./path/to/tests

Artifacts and outputs

Where results are written by default and how to change it:

  • Pure pytest (plugin)

    • Default file: .snap/current.json
    • Override with --snap-out PATH.
    • Example:
       pytest --snap --snap-out .artifacts/snap_v1.json tests/
      
  • CLI (pytest-snap)

    • Default directory: .artifacts
    • Files created per run:
      • .artifacts/snap_<label>.json (always)
      • .artifacts/run_<label>.html (only with --html and pytest-html installed)
    • Change directory with --artifacts DIR.
    • Examples:
       # Default outputs
       pytest-snap run v1
      
       # Custom output directory
       pytest-snap run v1 --artifacts out/snapshots
      
       # Diff reads from the same directory
       pytest-snap diff v1 v2 --artifacts out/snapshots
      
  • Housekeeping helpers

     pytest-snap list                # list available snapshots
     pytest-snap show v1            # show summary for a snapshot
     pytest-snap clean              # remove the artifacts directory
     # (all accept --artifacts DIR)
    

Snapshot Schema (v0.1.0)

{
	"started_ns": 1234567890,
	"finished_ns": 1234569999,
	"env": {"pytest_version": "8.x"},
	"results": [
		{"nodeid": "tests/test_example.py::test_ok", "outcome": "passed", "dur_ns": 10423}
	]
}
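
To consume a snapshot programmatically, a few lines of Python suffice. A minimal sketch, assuming the default output path and the field names shown above:

```python
import json
from pathlib import Path

# Load a snapshot produced by `pytest --snap` (default output path).
snap = json.loads(Path(".snap/current.json").read_text())

wall_s = (snap["finished_ns"] - snap["started_ns"]) / 1e9
failed = [r["nodeid"] for r in snap["results"] if r["outcome"] == "failed"]

print(f"{len(snap['results'])} tests in {wall_s:.2f}s, {len(failed)} failed")
for nodeid in failed:
    print("FAILED", nodeid)
```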

Future Roadmap (High Level)

Planned incremental additions (subject to change):

  1. Baseline diff & change bucket summarization.
  2. Slower test detection & perf thresholds.
  3. Budget YAML support and gating.
  4. Historical flake scoring.
  5. Rich diff / timeline CLI views.

Early adopters should pin minor versions if depending on emerging fields.

Code-level Diff (--code)

In addition to outcome and timing changes, you can compare the test function source between two labeled versions.

Typical layout:

project/
	v1/tests/...
	v2/tests/...

Run a snapshot diff including code changes:

pytest-snap diff v1 v2 --code

What happens:

  • Auto-detects version directories <A> and <B> under the current working directory (or under --versions-base if provided).
  • Lists added / removed / modified test functions (def test_*).
  • Shows a unified diff (syntax-colored) for modified tests with simple performance hints (range() growth, added sleep time).
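
Conceptually this resembles running a unified diff over matching top-level test files. A rough sketch of the idea using Python's standard difflib (not the tool's actual implementation, and without its added/removed analysis or performance hints):

```python
import difflib
from pathlib import Path

def diff_test_files(dir_a: str, dir_b: str) -> None:
    """Print unified diffs for top-level test_*.py files in two version dirs."""
    for path_a in sorted(Path(dir_a).glob("test_*.py")):
        path_b = Path(dir_b) / path_a.name
        if not path_b.exists():
            print(f"removed: {path_a.name}")
            continue
        diff = difflib.unified_diff(
            path_a.read_text().splitlines(keepends=True),
            path_b.read_text().splitlines(keepends=True),
            fromfile=str(path_a),
            tofile=str(path_b),
        )
        print("".join(diff), end="")

diff_test_files("v1/tests", "v2/tests")
```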

Options:

  • --code: combine the snapshot diff with a code diff.
  • --code-only: suppress the snapshot outcome section; show only the code diff.
  • --versions-base DIR: look for version subdirectories under DIR instead of the current directory.

Examples:

# Just code changes (no outcome buckets)
pytest-snap diff v1 v2 --code-only --code

# Custom versions base path
pytest-snap diff release_old release_new --code --versions-base ./releases

# Code + performance analysis together
pytest-snap diff v1 v2 --code --perf

Limitations:

  • Only inspects top-level test_*.py files; helper modules not diffed.
  • Function-level granularity (class-based tests appear as functions with node ids).
  • Large diffs are truncated after 20 modified tests (increase by editing source if needed).

Performance Diff (--perf) in the CLI

The CLI snapshot diff (pytest-snap diff A B) ignores timing changes unless you opt in:

pytest-snap diff v1 v2 --perf

This adds a "Slower Tests" section listing tests whose elapsed time increased beyond BOTH thresholds:

  • ratio: new_duration / old_duration >= --perf-ratio (default 1.30 ⇒ at least 30% slower)
  • absolute: new_duration - old_duration >= --perf-abs (default 0.05s)
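
In code terms, the selection predicate looks roughly like this (a sketch with durations in seconds; both conditions must hold):

```python
def is_slower(old_s: float, new_s: float,
              perf_ratio: float = 1.30, perf_abs: float = 0.05) -> bool:
    """A test counts as slower only if BOTH thresholds are exceeded."""
    return (new_s / old_s >= perf_ratio) and (new_s - old_s >= perf_abs)

# 30% slower AND 60ms added: reported.
assert is_slower(0.20, 0.26)
# Huge ratio but only ~2ms added: filtered out as micro-noise.
assert not is_slower(0.000001, 0.002)
```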

Optional flags:

| Flag | Meaning |
|------|---------|
| --perf-ratio 1.5 | Require a 50%+ slowdown (instead of the default 30%) |
| --perf-abs 0.02 | Require at least 20 ms of added latency |
| --perf-show-faster | Also list significantly faster tests |

To see only timings + code changes (skip outcome buckets):

pytest-snap diff v1 v2 --perf --code --code-only

Performance Gating During Test Runs

Inside pytest runs (plugin), slower tests are tracked when you supply a baseline and choose a fail mode:

pytest --snap-baseline .artifacts/snap_base.json \
	--snap-fail-on slower \
	--snap-slower-threshold-ratio 1.25 \
	--snap-slower-threshold-abs 0.10

Behavior:

  • A test is considered slower if it exceeds both the ratio and absolute thresholds.
  • --snap-fail-on slower turns any slower test into a non‑zero exit (CI gating).
  • Adjust thresholds to tune sensitivity (raise ratio or abs to reduce noise).

Shortcut mental model: the ratio filters relative regressions; the absolute threshold filters micro-noise. Both must pass, so a 2 ms blip on a 1 µs test won't alert even if the ratio is large.

If you only care about functional changes, omit perf flags; if you want early perf regression visibility, add them.


Timeline / Historical Progression (timeline subcommand)

Use the timeline view to see how snapshots evolved over time and when failures first appeared.

Create snapshots (labels arbitrary):

pytest-snap run v1
pytest-snap run v2
pytest-snap run v3

Show chronological summary:

pytest-snap timeline

Sample output:

TIMELINE (3 snapshots)
2025-09-04T19:20:21Z v1 commit=8e05100 total=28 fail=0 new_fail=0 fixes=0 regressions=0
2025-09-04T19:25:07Z v2 commit=8e05100 total=28 fail=1 new_fail=1 fixes=0 regressions=1
2025-09-04T19:30:44Z v3 commit=8e05100 total=28 fail=1 new_fail=0 fixes=1 regressions=0

Flags:

| Flag | Purpose |
|------|---------|
| --since <commit> | Start listing from the first snapshot whose git_commit matches (short hash) |
| --limit N | Show only the last N snapshots after filtering |
| --json | Emit a machine-readable JSON array |
| --artifacts DIR | Use an alternate artifacts directory |

Computed per row (vs previous snapshot):

  • new_fail: tests that newly failed.
  • fixes: previously failing tests that now pass.
  • regressions: passed → failed transitions.
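
One plausible way those transitions are derived from two consecutive snapshots (a sketch over {nodeid: outcome} maps; the tool's exact rules may differ):

```python
def row_stats(prev: dict[str, str], curr: dict[str, str]) -> dict[str, int]:
    """Compare {nodeid: outcome} maps from two consecutive snapshots."""
    new_fail = [n for n, o in curr.items()
                if o == "failed" and prev.get(n) != "failed"]
    fixes = [n for n, o in curr.items()
             if o == "passed" and prev.get(n) == "failed"]
    regressions = [n for n, o in curr.items()
                   if o == "failed" and prev.get(n) == "passed"]
    return {"new_fail": len(new_fail),
            "fixes": len(fixes),
            "regressions": len(regressions)}

prev = {"t::a": "passed", "t::b": "failed"}
curr = {"t::a": "failed", "t::b": "passed"}
print(row_stats(prev, curr))  # {'new_fail': 1, 'fixes': 1, 'regressions': 1}
```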

Metadata:

  • Each snapshot is enriched (best effort) with git_commit (short HEAD hash) after write.
  • If git metadata isn’t available (outside a repo), the commit shows as unknown or None.
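
The enrichment step amounts to asking git for the short HEAD hash; roughly (a sketch, not the exact implementation):

```python
import subprocess
from typing import Optional

def short_head() -> Optional[str]:
    """Return the short HEAD hash, or None outside a git repo."""
    try:
        return subprocess.check_output(
            ["git", "rev-parse", "--short", "HEAD"],
            text=True, stderr=subprocess.DEVNULL,
        ).strip()
    except (subprocess.CalledProcessError, FileNotFoundError):
        return None
```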

JSON example:

pytest-snap timeline --json | jq .

Produces entries like:

[
	{"label":"v1","git_commit":"8e05100","total":28,"failed":0,"passed":28,"xfailed":0,"xpassed":0,"new_fail":0,"fixes":0,"regressions":0},
	{"label":"v2","git_commit":"8e05100","total":28,"failed":1,"passed":27,"xfailed":0,"xpassed":0,"new_fail":1,"fixes":0,"regressions":1}
]

Use cases:

  • Quickly pinpoint when a regression first appeared before diving into full diff.
  • Send the timeline JSON straight to a small dashboard (Prometheus push, simple web chart) without re-reading all snapshot files.
  • In Continuous Integration (CI) pipelines, fail the run (and block the merge) if the timeline shows new failures or regressions; see the gate sketch below.
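
A minimal CI gate along those lines (a sketch; it assumes the entry fields shown in the JSON example above and at least one recorded snapshot):

```python
import json
import subprocess
import sys

# Run the timeline command and parse its JSON output.
out = subprocess.check_output(["pytest-snap", "timeline", "--json"], text=True)
entries = json.loads(out)

latest = entries[-1]  # most recent snapshot row
if latest["new_fail"] or latest["regressions"]:
    print(f"gate: {latest['label']} has new_fail={latest['new_fail']} "
          f"regressions={latest['regressions']}")
    sys.exit(1)
```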

Labels vs paths (what does v1 mean?)

  • pytest-snap run <label>

    • The label only names the output file: .artifacts/snap_<label>.json.
    • It does not select a folder named <label>; discovery defaults to ./tests unless you pass --tests.
    • Examples:
       pytest-snap run v1                      # runs ./tests, writes .artifacts/snap_v1.json
       pytest-snap run mylabel --tests tests/api
       pytest-snap run pr-123  --tests tests/test_api.py::test_happy_path
      
  • pytest-snap diff <A> <B>

    • Labels refer to snapshot files in the artifacts directory (default .artifacts).
    • When you add --code (or --code-only), directories named <A> and <B> are looked up under --versions-base (default .).
    • You can control the base with --versions-base PATH.

Flaky Detection

When history logging is enabled (default in pytest-snap run), previous outcomes are tracked. A weighted score measures pass ↔ fail flips. Highly flaky tests can be excluded from "new failures" to reduce noise.
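
The scoring formula isn't part of the documented schema; a purely hypothetical recency-weighted flip score, for intuition only, might look like:

```python
def flip_score(history: list[str], decay: float = 0.8) -> float:
    """Hypothetical flakiness score: recent pass<->fail flips weigh more.

    `history` is oldest-to-newest outcomes, e.g. ["passed", "failed", ...].
    This is an illustration, not pytest-snap's actual formula.
    """
    score, weight = 0.0, 1.0
    # Walk pairs newest-first so recent flips get the largest weights.
    for older, newer in zip(history[-2::-1], history[::-1]):
        if older != newer:
            score += weight
        weight *= decay
    return score

print(f"{flip_score(['passed', 'failed', 'passed', 'passed']):.2f}")  # 1.44
```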


Conceptual Model

  1. Enable capture (flag / CLI) → write snapshot.
  2. (Future) Compare snapshots → categorize changes.
  3. (Future) Apply gating policies.
  4. Refresh baseline as intent changes.

FAQ

Do I need the CLI? No; it's convenience sugar for labeled runs.

Why not a baseline diff yet? Keeping 0.1.0 deliberately small; diffing lands next.

Will the schema change? Potentially (still pre-1.0.0) but additions will prefer backward compatibility.


---

## Glossary
| Term | Definition |
|------|------------|
| Snapshot | JSON record of one full test run |
| Nodeid | Pytest's canonical test identifier |
| Duration | Test call-phase elapsed time (stored in ns) |

---

## Contributing

1. Fork / clone.  
2. (Optional) Create venv & install: `pip install -e .[dev]`.  
3. Add or adjust tests for your changes.  
4. Keep documentation clear and concise.  
5. Open a PR.

---

## License

MIT (see `LICENSE`).

---

Happy hacking.
