Cache function results and side effects (stdout, stderr, file writes) with automatic file I/O discovery via strace or audit hooks

These details have not been verified by PyPI

Project description

pycasher

Cache Python function results and their side effects — stdout, stderr, and filesystem writes — with automatic invalidation.

uv add pycasher

If you want the casher CLI outside a project environment, install it as a tool instead:

uv tool install pycasher

What makes it different

Most caching libraries cache return values. casher also captures and replays:

stdout/stderr printed during execution
Files written by the function (restored from cache on hit)
Files read by the function (used as cache keys — change a true upstream input file, cache auto-invalidates)

No manual file declarations needed. casher discovers file I/O automatically via strace (subprocess mode) or wrapped Python file handles plus tracked shutil.copyfile()-based copies in in-process mode. Files that are written before they are first read during one invocation are treated as generated outputs, not cache inputs.

On Linux, subprocess=True is the authoritative dependency mode: cache reuse is validated against the exact implicit input paths recorded during the last successful execution, including imported project source files discovered in the child process. subprocess=False remains useful, but only as best-effort Python-level tracking.

Usage

from casher import cached, expand_input_dir

@cached
def train(data_path: str, output_path: str, lr: float = 0.01) -> dict:
    df = read_csv(data_path)
    model = fit(df, lr=lr)
    save(model, output_path)
    return {"accuracy": model.score}

# First call — runs function, traces file I/O, caches everything
result = train("train.csv", "model.pkl")

# Second call — instant replay from cache (model.pkl restored too)
result = train("train.csv", "model.pkl")

# Change train.csv — casher detects it, re-runs automatically

For directory-shaped inputs, keep the argument semantics explicit instead of making every directory Path recursive by magic:

from pathlib import Path

from casher import cached, expand_input_dir


@cached(input_files=lambda data_dir: expand_input_dir(data_dir, "*.csv"))
def build_dataset(data_dir: Path) -> int:
    return len(list(data_dir.glob("*.csv")))

Path arguments that point to files are hashed by file content for the function-argument portion of the cache key. Auto-discovered input files remain path-sensitive and content-sensitive.

If you pass an existing filesystem path as a plain str, casher warns once per parameter and process. String arguments still hash as strings; Path arguments make path-aware hashing intent explicit.

Declared output paths are treated differently: Path arguments that are listed in output_files= or fall under output_roots= are hashed by path identity, not file content. That keeps cache keys stable when the function creates those outputs during the first run.

casher now also learns this automatically for common workflow signatures. If a Path argument is observed during execution to be a generated output, later lookups treat that argument as an output-path identity even without explicit output_files= / output_roots= declarations.

Workflow-style functions can declare output directories explicitly to keep reads under those roots out of input_files:

from pathlib import Path

from casher import cached


work_dir = Path("work")


@cached(output_roots=[work_dir], replay_outputs="if-missing")
def assemble_workset() -> Path:
    generated = work_dir / "reference" / "mworld.par"
    generated.parent.mkdir(parents=True, exist_ok=True)
    generated.write_text("patched content")
    generated.read_text()
    return generated

On cache hit, unchanged output files are not restored again. You can also set replay_outputs=False or replay_outputs="if-missing" to control file replay.

Cache any shell command without code changes:

casher -- python train.py --data train.csv

Key features

Automatic file tracking: authoritative strace-based subprocess tracking on Linux, plus best-effort Python-level tracking in in-process mode
Generated-output awareness: files written before their first read are excluded from input_files
Automatic output-arg stabilization: generated output Path arguments are learned from runtime I/O and matched by path identity on later lookups
Dependency invalidation: in subprocess mode, imported project .py files are recorded as ordinary implicit inputs and revalidated by path + content hash on lookup
Optional dependency narrowing: use dep_roots=[...] or dep_files=[...] to limit which imported source files from subprocess mode are considered relevant
File-hash memoization: unchanged files reuse cached content hashes from a small SQLite metadata store
LRU eviction: configurable via max_cache_bytes or CASHER_MAX_CACHE_BYTES env var (default 32 GB)
Faster hits for large artifacts: output replay skips files whose current hash already matches the cached output
DataFrame support: polars and pandas DataFrames serialized via Arrow IPC
Environment-aware: include env vars in cache key with env_vars=["MY_VAR"]
Miss diagnostics: diagnose_misses=True logs which recorded input changed or disappeared
Path-clarity warnings: existing filesystem paths passed as plain str args emit a one-time warning suggesting Path(...)
Earlier progress logs: lookup, execution start, execution finish, and cache-store phases are logged so long misses are visible live
Structured logging: loguru INFO for config changes, enablement, hit/miss, mode, eviction
Explicit directory expansion helper: expand_input_dir() for stable input_files lists

Configuration

Env var	Default	Description
`CASHER_CACHE_DIR`	unset	Cache storage directory. Caching stays disabled until this is set.
`CASHER_MAX_CACHE_BYTES`	`34359738368` (32 GB)	Max cache size before LRU eviction

Or set programmatically (takes priority over env vars):

from casher import configure, get_config

configure(cache_dir="/data/my_cache", max_cache_bytes=10 * 1024**3)
print(get_config())  # effective config

If no cache directory is configured via CASHER_CACHE_DIR, configure(cache_dir=...), @cached(cache_dir=...), or casher --cache-dir ..., casher runs transparently without caching and emits a one-time warning.

Platform support

Full caching on Linux only (requires strace for authoritative subprocess mode, fcntl for locking). On macOS and Windows the decorator is a transparent pass-through — functions execute normally, caching is skipped with a one-time warning.

Documentation

See documentation/ for detailed docs:

Introduction — architecture, limitations, in-process vs subprocess comparison
Quick Start — installation, decorator options, CLI usage
API Reference — full API surface

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.5.18

May 14, 2026

This version

0.5.17

May 14, 2026

0.5.16

May 13, 2026

0.5.15

May 13, 2026

0.5.14

May 13, 2026

0.5.13

May 13, 2026

0.5.12

May 11, 2026

0.5.11

May 11, 2026

0.5.10

May 11, 2026

0.5.9

May 11, 2026

0.5.8

May 10, 2026

0.5.7

May 10, 2026

0.5.6

May 10, 2026

0.5.5

May 9, 2026

0.5.4

May 9, 2026

0.5.3

May 9, 2026

0.5.2

May 9, 2026

0.5.1

May 8, 2026

0.5.0

May 8, 2026

0.4.1

May 8, 2026

0.4.0

May 8, 2026

0.3.0

May 8, 2026

0.2.2

May 8, 2026

0.2.1

May 8, 2026

0.2.0

May 8, 2026

0.1.0

May 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pycasher-0.5.17.tar.gz (98.6 kB view details)

Uploaded May 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pycasher-0.5.17-py3-none-any.whl (58.9 kB view details)

Uploaded May 14, 2026 Python 3

File details

Details for the file pycasher-0.5.17.tar.gz.

File metadata

Download URL: pycasher-0.5.17.tar.gz
Upload date: May 14, 2026
Size: 98.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for pycasher-0.5.17.tar.gz
Algorithm	Hash digest
SHA256	`8374dd08ad28699556be07f7de977e85ca1034d7863a979cdef849cdac059051`
MD5	`4ae11f8cfd0ea4d01eadb16ecf49642a`
BLAKE2b-256	`801ed966d3457840846af2388305949d931b2e4b26a146505a9d8e846e8642d3`

See more details on using hashes here.

File details

Details for the file pycasher-0.5.17-py3-none-any.whl.

File metadata

Download URL: pycasher-0.5.17-py3-none-any.whl
Upload date: May 14, 2026
Size: 58.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for pycasher-0.5.17-py3-none-any.whl
Algorithm	Hash digest
SHA256	`dcc42bc93ef8e6261b73c15ad930bcbb86af1837c4b740453a3ed0f5520822e2`
MD5	`a5e7221c17f6e225298b2ea1c86b82f1`
BLAKE2b-256	`8a94bc08c8e6490ee53bcf80c850bdbbb0c011e9fb5bddd5f22083dbde1b6cc4`

See more details on using hashes here.

pycasher 0.5.17

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

pycasher

What makes it different

Usage

Key features

Configuration

Platform support

Documentation

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes