Skip to main content

Operator-curated, URL-keyed artifact cache for a small lab (CUDA/ROCm/DOCA/firmware)

Project description

withcache

ci PyPI license built with Zig static musl

A tiny, operator-curated artifact cache for a small lab, for the big vendor downloads you re-pull constantly (CUDA, ROCm, DOCA, firmware, drivers), fronted by transparent curl/wget shims so existing scripts use it with no changes.

Think of it as "ccache for HTTP artifacts, without a proxy."

curl -fsSL https://the/origin/cuda.tar.gz -o cuda.tar.gz     # your script, unchanged
   └─ curlwithcache shim ─ WITHCACHE_SERVER set?
        ├─ cached  → served from the cache-host (fast, local)
        └─ miss/unset/unreachable → runs the real curl, exactly as written

Artifacts are cached by their origin URL as a key; the shim opts in by re-pointing the URL at the cache. No transparent proxy, no TLS interception, no client CA. The URL is a lookup key, not a connection target.

By default a miss is auto-fetched: the request falls through to origin (so the caller gets its file straight away), and the cache-host pulls the same artifact in the background, so the next request hits. Run with --curate to require a human instead, who reviews the miss list in a small web UI and presses Download (or pre-seeds via Add from URI). Either way the cache-host is the only box that needs internet egress (and any vendor credentials), and clients never write to it.

Why not just curl + a caching proxy?

For https:// (i.e. every vendor download) a forward proxy can't cache without SSL-bump / MITM: curl tunnels TLS end-to-end via CONNECT, so the proxy only sees ciphertext. The shim sidesteps that entirely by re-pointing the URL to the cache instead of intercepting the connection. And no proxy offers the optional operator-curated model (--curate: a miss queue a human approves).

Components

Path What it is
src/withcache/server.py The cache-host: blob store + miss table + background download manager + operator UI (Pico.css + HTMX)
src/withcache/_shim.py Shared shim core (find URL → probe → rewrite → exec)
src/withcache/curlwithcache.py / wgetwithcache.py The Python curl / wget shims
shim/shim.zig The native shim: one static binary, both tools via argv[0]
deploy/Containerfile, deploy/compose.yml Single Podman/Docker host deploy

The cache-host and the Python shims are stdlib-only (no third-party runtime deps); the native shim is a dependency-free static binary.

Install

The cache-host and Python shims (works on any box with Python):

pipx install withcache    # or: uv tool install withcache  /  pip install withcache
# provides: curlwithcache  wgetwithcache  withcache-server

The native shim (no Python needed, for minimal/distroless boxes; ~200 KB static musl binary). Grab it from the Releases page; one binary serves both tools by the name it's invoked as:

curl -L .../releases/.../withcache-shim-x86_64-linux-musl -o /usr/local/bin/curlwithcache
chmod +x /usr/local/bin/curlwithcache

The Python shim is also the tested oracle and install-time fallback for platforms without a prebuilt binary; a differential test asserts the binary and the Python plan() rewrite argv identically.

Deploy the cache-host

export WITHCACHE_ADMIN_PASSWORD=change-me    # protects the operator UI
podman compose -f deploy/compose.yml up -d   # or: docker compose -f ...
# operator UI:  http://withcache-server:3000/

Or without containers:

WITHCACHE_ADMIN_PASSWORD=change-me withcache-server --data-dir ./data --port 3000

Data (blobs + cache.db + session-secret) lives in the /data volume (or --data-dir). Artifacts are immutable per version, so there's no cache invalidation. --workers N sets the number of concurrent download workers, --curate switches from auto-fetch to operator-approved pulls, and --max-bytes (e.g. 50G) caps the cache: when full it refuses new fills (no auto-eviction), and you free space by deleting artifacts in the UI.

Use the shims (transparent curl / wget)

Every approach is the same two ingredients: (1) point at the cache with WITHCACHE_SERVER, and (2) make curl/wget resolve to the shim. They differ only in how widely the system curl/wget is shadowed. Pick the least invasive one that fits.

Safety: with WITHCACHE_SERVER unset the shim is a pure pass-through (it just execs the real tool, zero network/parsing), so even the system-wide setup is harmless wherever the cache isn't configured. Worst case is always "no caching, curl still works."

These all use command -v curlwithcache, so they work whether you installed the native binary or the Python launcher (both land under that name).

1. No shadowing: call the shims by name (least invasive)

Nothing is renamed; you opt in per command. Good for trying it out or a script you can edit.

export WITHCACHE_SERVER=http://withcache-server:3000
curlwithcache -fsSL https://the/origin/cuda.tar.gz -o cuda.tar.gz
wgetwithcache https://the/origin/rocm.tar.gz

2. This shell only: shadow curl/wget for the session

Put curl/wget symlinks in a dir and prepend it to PATH in the current shell. Reversible by just closing the shell.

mkdir -p ~/.withcache/bin
ln -sf "$(command -v curlwithcache)" ~/.withcache/bin/curl
ln -sf "$(command -v wgetwithcache)" ~/.withcache/bin/wget

export WITHCACHE_SERVER=http://withcache-server:3000
export PATH="$HOME/.withcache/bin:$PATH"
hash -r                       # forget any cached curl/wget location

command -v curl               # -> ~/.withcache/bin/curl  (verify it's the shim)
curl -fsSL https://the/origin/cuda.tar.gz -o cuda.tar.gz   # existing scripts, unchanged
wget https://the/origin/rocm.tar.gz                        # still saved as rocm.tar.gz

3. Your user: make it the default for your shells (persistent)

Create the symlinks once, then add the two exports to your shell rc. Affects all your future interactive shells; undo by deleting the block.

mkdir -p ~/.withcache/bin
ln -sf "$(command -v curlwithcache)" ~/.withcache/bin/curl
ln -sf "$(command -v wgetwithcache)" ~/.withcache/bin/wget

cat >> ~/.bashrc <<'EOF'

# withcache: transparent curl/wget caching
export WITHCACHE_SERVER=http://withcache-server:3000
export PATH="$HOME/.withcache/bin:$PATH"
EOF

4. One project only: scope it with direnv

Drop an .envrc in a project tree (requires direnv); caching applies only inside that directory.

# .envrc
export WITHCACHE_SERVER=http://withcache-server:3000
PATH_add ~/.withcache/bin        # assumes the symlinks from approach 2/3 exist

Then direnv allow.

5. The whole machine: every user, every shell (most invasive)

Install the shim as curl/wget in /usr/local/bin (ahead of /usr/bin on the default PATH) and set the server globally. This also catches build tools and package managers that shell out to curl/wget.

sudo ln -sf "$(command -v curlwithcache)" /usr/local/bin/curl
sudo ln -sf "$(command -v wgetwithcache)" /usr/local/bin/wget

# A login-shell env file (covers interactive logins; daemons started outside a
# login shell won't see it; set WITHCACHE_SERVER in their unit if you need it).
echo 'export WITHCACHE_SERVER=http://withcache-server:3000' \
  | sudo tee /etc/profile.d/withcache.sh >/dev/null

On minimal/distroless hosts use the native shim binary here: same symlink, no Python required.

Verify / turn it off

command -v curl                       # which curl is in effect (the shim, or the real one)
export REAL_CURL=/usr/bin/curl        # optional: pin the wrapped tool (also $REAL_WGET)

unset WITHCACHE_SERVER                 # instantly back to plain curl (pass-through)
rm ~/.withcache/bin/curl ~/.withcache/bin/wget   # remove shadowing entirely

How it works: the shim scans for the URL, asks the cache, and execs the real tool:

  1. Find the real curl/wget on $PATH (skipping itself; $REAL_CURL/$REAL_WGET override).
  2. With WITHCACHE_SERVER set, find the URL (the scheme:// arg, or --url).
  3. Probe the cache with that same tool (curl -I / wget --spider).
    • Hit → re-point only the URL at http://server/b/<base64(origin)>/<basename> and exec the real tool (so -o, -O, -L, --retry, … all still apply, and the file is named after the artifact).
    • Miss / unreachableexec the real tool with your arguments untouched (origin); the miss is recorded for the operator.
  4. With no WITHCACHE_SERVER, it does zero network/parsing, just execs the real tool.

Notes & limits (all degrade gracefully; worst case is "no caching, curl still works"):

  • Needs the wrapped tool present (it shims it). Adds ~Python-startup latency per call.
  • URLs hidden in a -K/-i config file or piped via stdin aren't seen → those calls pass through uncached.
  • Per-tool env override: CURLWITHCACHE_SERVER / WGETWITHCACHE_SERVER beat WITHCACHE_SERVER.

Operator UI

http://withcache-server:3000/ (Pico.css + HTMX, bundled offline) shows:

  • Misses: auto-fetched by default, or (under --curate) each with Download (queues a background pull) and Dismiss.
  • Downloads: live progress bars, queued/running/completed/cancelled/failed, Cancel, and Clear finished. Downloads run in a background worker pool, not in the request, so large pulls never block, modelled on bty's job managers.
  • Cached artifacts: URL, size, hits (times served) and misses (times requested before it was cached), SHA-256, fetched-at, each with Delete to free space.
  • Add from URI: pre-seed an artifact before anyone misses it.

Auth

Single-tenant session-cookie auth (modelled on bty's approach, env password instead of PAM). The read path (/blob, /b/…, /healthz) is open so shims never log in; the operator surface (/, /admin/*) is gated.

Env var Purpose
WITHCACHE_SERVER Cache-host URL the shims use
CURLWITHCACHE_SERVER / WGETWITHCACHE_SERVER Per-tool override of the above
WITHCACHE_ADMIN_PASSWORD Operator login password (unset ⇒ UI open, with a warning)
WITHCACHE_SESSION_SECRET Override the persisted cookie-signing key (optional)

Cache keys & signed URLs

The key is scheme://host/path with the query string dropped by default, so CDN/presigned URLs (whose tokens change every request) still match by path. Pass --keep-query to the server for query-sensitive keys. Package-manager repos (.deb/.rpm) are GPG-signed and verified by the client regardless of transport, so caching them this way is safe.

Consume from another tool (the client library)

A tool that already knows its download URLs (e.g. an installer or a provisioner) can prefer the cache without shelling out to a shim or re-implementing the /b/ scheme. withcache.client is stdlib-only, so importing it adds no dependencies:

from withcache import client

# "use the cache when it's warm, the origin otherwise"
url = client.serve_url("http://cache:3000", origin) or origin

is_cached() is a graceful HEAD (a miss, timeout, or unreachable cache all return False, so you fall back to the origin), and it doubles as a warm-up: the probe records the miss and, in auto-fetch mode, enqueues the fill, so the next call flips to the cache. The encoding is shared with the shims and server, so consumers stay in lockstep with the cache-host.

Tests

python -m unittest discover -s tests   # stdlib only, no test deps

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

withcache-0.5.2.tar.gz (72.7 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

withcache-0.5.2-py3-none-musllinux_1_2_x86_64.whl (253.9 kB view details)

Uploaded Python 3musllinux: musl 1.2+ x86-64

withcache-0.5.2-py3-none-musllinux_1_2_aarch64.whl (271.6 kB view details)

Uploaded Python 3musllinux: musl 1.2+ ARM64

withcache-0.5.2-py3-none-manylinux_2_17_x86_64.whl (253.9 kB view details)

Uploaded Python 3manylinux: glibc 2.17+ x86-64

withcache-0.5.2-py3-none-manylinux_2_17_aarch64.whl (271.6 kB view details)

Uploaded Python 3manylinux: glibc 2.17+ ARM64

withcache-0.5.2-py3-none-any.whl (60.1 kB view details)

Uploaded Python 3

File details

Details for the file withcache-0.5.2.tar.gz.

File metadata

  • Download URL: withcache-0.5.2.tar.gz
  • Upload date:
  • Size: 72.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for withcache-0.5.2.tar.gz
Algorithm Hash digest
SHA256 22afddc8f82b2b6df3319bb2a613d6b93f34af10841efee91062a543ffa91c37
MD5 c8c2494d52e6233762c44b56e361a0af
BLAKE2b-256 e54c511f20b0201bab9dfed81d0e14743d4c684f188eacb5b754ed630c1b7e0b

See more details on using hashes here.

Provenance

The following attestation bundles were made for withcache-0.5.2.tar.gz:

Publisher: ci-cd.yml on safl/withcache

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file withcache-0.5.2-py3-none-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for withcache-0.5.2-py3-none-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 44791e59999a06b3c700e470ea03c52d484b5bed1d8e6281fc59e02536a77e06
MD5 dcad8c86d546204f5112a2f9ee300cc2
BLAKE2b-256 cf3fb103495e1449f51fde7d19311d7c0c577fcbeb9f83a31b22ba3556f2cc9e

See more details on using hashes here.

Provenance

The following attestation bundles were made for withcache-0.5.2-py3-none-musllinux_1_2_x86_64.whl:

Publisher: ci-cd.yml on safl/withcache

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file withcache-0.5.2-py3-none-musllinux_1_2_aarch64.whl.

File metadata

File hashes

Hashes for withcache-0.5.2-py3-none-musllinux_1_2_aarch64.whl
Algorithm Hash digest
SHA256 e18d0567efe13f07f9059eaa87b90190ad2386676a8ad9f55c89b0b259f82d95
MD5 e9c3a496cf5b3ef594964c5a2f3a1ca7
BLAKE2b-256 db627be6d935a762f25afe47512fe64a410e8fd256a9a6cd1b5496331193396a

See more details on using hashes here.

Provenance

The following attestation bundles were made for withcache-0.5.2-py3-none-musllinux_1_2_aarch64.whl:

Publisher: ci-cd.yml on safl/withcache

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file withcache-0.5.2-py3-none-manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for withcache-0.5.2-py3-none-manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 b12fba4ac603e92772aa486c54665a495b08db82413dc225b9f82218e4d3f969
MD5 dd9c716ab53f1ef51aef62781357fa2c
BLAKE2b-256 6b1df5a9f48b1b576e10f4b1e5a40d5255b60f34557d48672805c26a15eba1fe

See more details on using hashes here.

Provenance

The following attestation bundles were made for withcache-0.5.2-py3-none-manylinux_2_17_x86_64.whl:

Publisher: ci-cd.yml on safl/withcache

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file withcache-0.5.2-py3-none-manylinux_2_17_aarch64.whl.

File metadata

File hashes

Hashes for withcache-0.5.2-py3-none-manylinux_2_17_aarch64.whl
Algorithm Hash digest
SHA256 d9c4cd3f710310344ed3e1f9431c26ea1666a6f47028109f66dd812cc461593a
MD5 d0dddde9360530f8ddc8f3a5e19701df
BLAKE2b-256 d8d1c40b9bc8cb5f0a9a522c8d83eec6751438d9c3a082e9d5d9cb68bb5b06dd

See more details on using hashes here.

Provenance

The following attestation bundles were made for withcache-0.5.2-py3-none-manylinux_2_17_aarch64.whl:

Publisher: ci-cd.yml on safl/withcache

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file withcache-0.5.2-py3-none-any.whl.

File metadata

  • Download URL: withcache-0.5.2-py3-none-any.whl
  • Upload date:
  • Size: 60.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for withcache-0.5.2-py3-none-any.whl
Algorithm Hash digest
SHA256 4baa7e810291f4d80d0f3ae2d2c88fdb2834f430d839dab66d3cd1d6c676feaf
MD5 9ee513a11ee3f533f2f52328741758c4
BLAKE2b-256 35e3e5f98795cb3def650abf27418092aa127ad7ce6880213dd8f69712724478

See more details on using hashes here.

Provenance

The following attestation bundles were made for withcache-0.5.2-py3-none-any.whl:

Publisher: ci-cd.yml on safl/withcache

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page