Skip to main content

Terminal UI for generic-ml-cache: the gmlcache command. A thin inbound driver over generic-ml-cache-core -- reads config, provides the data source, maps commands onto the core library.

Project description

gmlcache

Detached ML Execution Cache — the terminal client

License: Apache 2.0 Status: Alpha

gmlcache runs, records, and replays detached ML workloads — record a real client (or API) call once, replay it forever by its content key, offline and byte-for-byte.

Single-user, local — not a gateway. gmlcache runs on your machine, as you, across the subscriptions and APIs you already hold. It is not a multi-user router and not a way to share one subscription — see Positioning.

gmlcache: a miss records the real client call; the same command again is served instantly from cache, byte-identical

Install

pip install generic-ml-cache-cli

This installs the gmlcache command and pulls in the engine, generic-ml-cache-core.

Use

gmlcache run    --client claude --model sonnet --prompt "…"   # record on a miss, replay on a hit
gmlcache check  --client claude --model sonnet --prompt "…"   # is this exact call already cached?
gmlcache list                                                 # stored executions, grouped by client/model
gmlcache stats                                                # totals, hit counts, token usage & cost
gmlcache inspect <key>                                        # pretty-print one stored execution
gmlcache doctor | models | status | init                     # environment & configuration helpers

What it does

  • Records a real agentic CLI client (claude, codex, cursor-agent) or an API call — stdout, stderr, exit code, generated files, and token usage.
  • Replays an identical request instantly and offline, byte-for-byte — gmlcache adds nothing to the client's output, so it is a transparent drop-in.
  • Reports — list, group, inspect, and measure stored executions and their savings.

Built on a reusable engine

gmlcache is the terminal client — one inbound driver over the engine. The whole cache logic and every adapter live in generic-ml-cache-core, a stateless library. To embed the cache in your own application instead of driving it from a terminal, depend on the core and inject your own data source — you never reimplement the adapters.

Links

License

Apache-2.0 — see LICENSE and NOTICE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

generic_ml_cache_cli-0.6.0.tar.gz (40.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

generic_ml_cache_cli-0.6.0-py3-none-any.whl (27.7 kB view details)

Uploaded Python 3

File details

Details for the file generic_ml_cache_cli-0.6.0.tar.gz.

File metadata

  • Download URL: generic_ml_cache_cli-0.6.0.tar.gz
  • Upload date:
  • Size: 40.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for generic_ml_cache_cli-0.6.0.tar.gz
Algorithm Hash digest
SHA256 c8950c535092ef39b07dd61e214c62754b4e8f289b925886d82d956842512ebd
MD5 5f31d980e59c8d4e1978b744f1098a5a
BLAKE2b-256 8b9f5650659c1f14fbb14558993093edf8819bcf6577eb8fd1360ce197a979d3

See more details on using hashes here.

Provenance

The following attestation bundles were made for generic_ml_cache_cli-0.6.0.tar.gz:

Publisher: release.yml on danielslobozian/generic-ml-cache

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file generic_ml_cache_cli-0.6.0-py3-none-any.whl.

File metadata

File hashes

Hashes for generic_ml_cache_cli-0.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 629b84e94ee688094b26dddc36c856d6d012fc7138f5efc48d1565d6f5d21565
MD5 46755e78fb47fb96d86c54e35eaf71d4
BLAKE2b-256 1d7cfc91878689e154404b271b2f8cbb355603ad8819c7f97d92c2575e1eb7ef

See more details on using hashes here.

Provenance

The following attestation bundles were made for generic_ml_cache_cli-0.6.0-py3-none-any.whl:

Publisher: release.yml on danielslobozian/generic-ml-cache

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page