Skip to main content

Terminal UI for generic-ml-cache: the gmlcache command. A thin inbound driver over generic-ml-cache-core -- reads config, provides the data source, maps commands onto the core library.

Project description

gmlcache

Detached ML Execution Cache — the terminal client

License: Apache 2.0 Status: Alpha

gmlcache runs, records, and replays detached ML workloads — record a real client (or API) call once, replay it forever by its content key, offline and byte-for-byte.

Single-user, local — not a gateway. gmlcache runs on your machine, as you, across the subscriptions and APIs you already hold. It is not a multi-user router and not a way to share one subscription — see Positioning.

gmlcache: a miss records the real client call; the same command again is served instantly from cache, byte-identical

Install

pip install generic-ml-cache-cli

This installs the gmlcache command and pulls in the engine, generic-ml-cache-core.

Use

gmlcache run    --client claude --model sonnet --prompt "…"   # record on a miss, replay on a hit
gmlcache check  --client claude --model sonnet --prompt "…"   # is this exact call already cached?
gmlcache list                                                 # stored executions, grouped by client/model
gmlcache stats                                                # totals, hit counts, token usage & cost
gmlcache inspect <key>                                        # pretty-print one stored execution
gmlcache doctor | models | status | init                     # environment & configuration helpers

What it does

  • Records a real agentic CLI client (claude, codex, cursor-agent) or an API call — stdout, stderr, exit code, generated files, and token usage.
  • Replays an identical request instantly and offline, byte-for-byte — gmlcache adds nothing to the client's output, so it is a transparent drop-in.
  • Reports — list, group, inspect, and measure stored executions and their savings.

Built on a reusable engine

gmlcache is the terminal client — one inbound driver over the engine. The whole cache logic and every adapter live in generic-ml-cache-core, a stateless library. To embed the cache in your own application instead of driving it from a terminal, depend on the core and inject your own data source — you never reimplement the adapters.

Links

License

Apache-2.0 — see LICENSE and NOTICE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

generic_ml_cache_cli-0.7.0.tar.gz (41.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

generic_ml_cache_cli-0.7.0-py3-none-any.whl (28.4 kB view details)

Uploaded Python 3

File details

Details for the file generic_ml_cache_cli-0.7.0.tar.gz.

File metadata

  • Download URL: generic_ml_cache_cli-0.7.0.tar.gz
  • Upload date:
  • Size: 41.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for generic_ml_cache_cli-0.7.0.tar.gz
Algorithm Hash digest
SHA256 9139b23bbc7fc9b7056fc0e0d43b6e28a8b5f5c10bf1e510f5261ea904783694
MD5 6030b21e32db5f0bb78628f8a11df369
BLAKE2b-256 d450a49bed1a971f4250a13c97258028c35a941a85c35215a0902e295c6e853c

See more details on using hashes here.

Provenance

The following attestation bundles were made for generic_ml_cache_cli-0.7.0.tar.gz:

Publisher: release.yml on danielslobozian/generic-ml-cache

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file generic_ml_cache_cli-0.7.0-py3-none-any.whl.

File metadata

File hashes

Hashes for generic_ml_cache_cli-0.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ddb20ff3aca03e341be1a63964631e328585677252c868aeb82fa12c2c8e23ad
MD5 ed0274d2f822fc099b2a2c91eede3d5e
BLAKE2b-256 bcbb3959ff86e49d4541c9fb78f52ddd16431b95ced45490aedccac79d5f0465

See more details on using hashes here.

Provenance

The following attestation bundles were made for generic_ml_cache_cli-0.7.0-py3-none-any.whl:

Publisher: release.yml on danielslobozian/generic-ml-cache

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page