Cache your API calls with a single line of code. No mocks, no fixtures. Just faster, cleaner code.

These details have not been verified by PyPI

Project links

Project description

cachy

We often call APIs while prototyping and testing our code. A single API call (e.g. an Anthropic chat completion) can take 100’s of ms to run. This can really slow down development especially if our notebook contains many API calls 😞.

cachy caches API requests. It does this by saving the result of each call to a local cachy.jsonl file. Before calling an API (e.g. OpenAI) it will check if the request exists in cachy.jsonl. If it does it will return the cached result.

How does it work?

Under the hood popular SDK’s like OpenAI, Anthropic and LiteLLM use httpx.Client and httpx.AsyncClient.

cachy patches the send method of both clients and injects a simple caching mechanism:

create a cache key from the request
if the key exists in cachy.jsonl return the cached response
if not, call the API and save the response to cachy.jsonl

Usage

To use cachy

install the package: pip install pycachy
add the snippet below to the top of your notebook

from cachy import enable_cachy

enable_cachy()

By default cachy will cache requests made to OpenAI, Anthropic, Gemini and DeepSeek.

Note: Gemini caching only works via the LiteLLM SDK.

[!NOTE]

Custom APIs

If you’re using the OpenAI or LiteLLM SDK for other LLM providers like Grok, Mistral you can cache these requests as shown below.
from cachy import enable_cachy, doms
enable_cachy(doms=doms+('api.x.ai', 'api.mistral.com'))

Docs

Docs can be found hosted on this GitHub repository’s pages.

How to use

First import and enable cachy

from cachy import enable_cachy

enable_cachy()

Now run your api calls as normal.

from openai import OpenAI

cli = OpenAI()

r = cli.responses.create(model="gpt-4.1", input="Hey!")
r

Hey! How can I help you today? 😊

id: resp_68b9978ecec48196aa3e77b09ed41c6403f00c61bc19c097
created_at: 1756993423.0
error: None
incomplete_details: None
instructions: None
metadata: {}
model: gpt-4.1-2025-04-14
object: response
output: [ResponseOutputMessage(id=‘msg_68b9978f9f70819684b17b0f21072a9003f00c61bc19c097’, content=[ResponseOutputText(annotations=[], text=‘Hey! How can I help you today? 😊’, type=‘output_text’, logprobs=[])], role=‘assistant’, status=‘completed’, type=‘message’)]
parallel_tool_calls: True
temperature: 1.0
tool_choice: auto
tools: []
top_p: 1.0
background: False
conversation: None
max_output_tokens: None
max_tool_calls: None
previous_response_id: None
prompt: None
prompt_cache_key: None
reasoning: Reasoning(effort=None, generate_summary=None, summary=None)
safety_identifier: None
service_tier: default
status: completed
text: ResponseTextConfig(format=ResponseFormatText(type=‘text’), verbosity=‘medium’)
top_logprobs: 0
truncation: disabled
usage: ResponseUsage(input_tokens=9, input_tokens_details=InputTokensDetails(cached_tokens=0), output_tokens=11, output_tokens_details=OutputTokensDetails(reasoning_tokens=0), total_tokens=20)
user: None
store: True

If you run the same request again it will read it from the cache.

r = cli.responses.create(model="gpt-4.1", input="Hey!")
r

Hey! How can I help you today? 😊

id: resp_68b9978ecec48196aa3e77b09ed41c6403f00c61bc19c097
created_at: 1756993423.0
error: None
incomplete_details: None
instructions: None
metadata: {}
model: gpt-4.1-2025-04-14
object: response
output: [ResponseOutputMessage(id=‘msg_68b9978f9f70819684b17b0f21072a9003f00c61bc19c097’, content=[ResponseOutputText(annotations=[], text=‘Hey! How can I help you today? 😊’, type=‘output_text’, logprobs=[])], role=‘assistant’, status=‘completed’, type=‘message’)]
parallel_tool_calls: True
temperature: 1.0
tool_choice: auto
tools: []
top_p: 1.0
background: False
conversation: None
max_output_tokens: None
max_tool_calls: None
previous_response_id: None
prompt: None
prompt_cache_key: None
reasoning: Reasoning(effort=None, generate_summary=None, summary=None)
safety_identifier: None
service_tier: default
status: completed
text: ResponseTextConfig(format=ResponseFormatText(type=‘text’), verbosity=‘medium’)
top_logprobs: 0
truncation: disabled
usage: ResponseUsage(input_tokens=9, input_tokens_details=InputTokensDetails(cached_tokens=0), output_tokens=11, output_tokens_details=OutputTokensDetails(reasoning_tokens=0), total_tokens=20)
user: None
store: True

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.0.14

May 7, 2026

0.0.13

Apr 28, 2026

0.0.12

Apr 27, 2026

0.0.11

Apr 24, 2026

0.0.10

Apr 22, 2026

0.0.9

Apr 20, 2026

0.0.8

Apr 11, 2026

0.0.7

Feb 24, 2026

This version

0.0.6

Jan 29, 2026

0.0.5

Dec 2, 2025

0.0.4

Oct 18, 2025

0.0.3

Oct 17, 2025

0.0.2.post1

Oct 5, 2025

0.0.2

Sep 15, 2025

0.0.1

Sep 4, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pycachy-0.0.6.tar.gz (9.1 kB view details)

Uploaded Jan 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pycachy-0.0.6-py3-none-any.whl (9.5 kB view details)

Uploaded Jan 29, 2026 Python 3

File details

Details for the file pycachy-0.0.6.tar.gz.

File metadata

Download URL: pycachy-0.0.6.tar.gz
Upload date: Jan 29, 2026
Size: 9.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.8

File hashes

Hashes for pycachy-0.0.6.tar.gz
Algorithm	Hash digest
SHA256	`4e41bdcdc7bbaec144effbb1e3ab016be632755f58c60fb913c57a657962b73e`
MD5	`a98152d8f6d0f374cdd94104a1a9c31a`
BLAKE2b-256	`075cce286568164cc2fb0418ea2afe090d3cfad7ab3e661ed4d59d595743ca27`

See more details on using hashes here.

File details

Details for the file pycachy-0.0.6-py3-none-any.whl.

File metadata

Download URL: pycachy-0.0.6-py3-none-any.whl
Upload date: Jan 29, 2026
Size: 9.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.8

File hashes

Hashes for pycachy-0.0.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3d5d5a5c233e51a239f1855d492df84d93b84449d593745435b303ee98b8d393`
MD5	`b896014206ec233b21bf93ff261ddace`
BLAKE2b-256	`1c9e594a5d17a7d410be0b626fa19879ac07317371c572f387aacec0c1645ad3`

See more details on using hashes here.

pycachy 0.0.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

cachy

Usage

Custom APIs

Docs

How to use

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes