Python package for easy function caching, allowing users to customize cache behavior at call-time.
Project description
cachecache: Python function decorator for runtime-configurable caching.
A Python package that provides a simple way to cache function results while allowing to dynamically configure caching behavior at each function call.
By "caching behavior" reconfigurable at each function call, we mean:
- Whether to recompute results and overwrite the cache, which is useful for functions whose results rely on data loaded internally (therefore hidden from the function arguments, thus from the cache hash) and that can change on disk;
- Whether to cache the results, which is useful for functions who may need caching in context A (e.g. frontend recurrent use) but not context B (e.g. backend unique use);
- Where to save the cached results, which is useful for functions that return voluminous data, as this allows to distribute their cache across several locations given different arguments.
Features
- 1-liner caching with a decorator: @cache
- Caching behavior can be customized on the fly for specific function calls, by passing the following arguments to the cached functions:
- 🔄
again=True
: recompute and overwrite cached results on-demand - ⏸️
cache_results=False
: disable caching for specific function calls, for instance if the computed result would take too much room on disk. - 📁
cache_path='different/caching/path'
: use custom cache locations for specific function calls
- 🔄
- Built on joblib's Memory class.
Installation
You can install cachecache
using pip:
pip install cachecache
Usage
Here's a basic example of how to use cachecache
:
from cachecache import cache, Cacher
Cache using the default "~/.cachecache" directory and default maximum cache size:
@cache # behind the scenes, "cache" is simply defined as "cache = Cacher()"
def my_cached_function(*args, again=False, cache_results=True, cache_path=None):
# complex operations involving args...
results = ...
return results
result = my_cached_function(arg) # potentially slow
result = my_cached_function(arg) # always fast (results loaded from cache)
Cache using a custom directory and maximum cache size:
cacher = Cacher("my/custom/caching/path", 10e9) # size in bytes - 10GB
@cacher
def my_cached_function(...):
...
Recompute results and overwrite cache:
result = my_cached_function(arg, again=True)
This proves useful if the results depend on data that can change on disk (this information is not present in the arguments of the function, so the cacher does not know about it!).
Adjust caching directory at runtime
result = my_cached_function(arg, cache_path="somewhere/else")
This proves useful if you need to distribute the cached results of a function across several disks.
cachecache also provides a way to create a distributed_cacher
that will cache a function's results at a location specified by a custom argument (such as 'datapath'):
from cachecache import Cacher, distributed_cacher
global_cacher = Cacher('~/.global_cache')
# Arguments of distributed_cacher:
# - datapath_arg_name (str, optional): The name of the argument in the decorated function
# that specifies the datapath for the local cache. Defaults to 'datapath'.
# - local_cache_path (str, optional): The relative path to the local cache directory
# within the datapath. Defaults to '.local_cache' (and results cached at f'{datapath}/.local_cache').
# - global_cache (cachecache.Cacher instance, optional): The global cacher to use by default
# for cached functions without 'datapath_arg_name' (or when 'datapath_arg_name' is None).
# Defaults to a cache at '~/.cachecache' (default instance of Cacher()).
dist_cacher = distributed_cacher(datapath_arg_name='datapath',
local_cache_path='.local_cache',
global_cache=global_cacher)
# You can then decorate a function as follow:
@dist_cacher
def my_distributed_cached_function(datapath, ...):
"""
A function whose results will be cached at 'datapath/.local_cache'
unless specified otherwise with the cache_path argument.
Note: works with args and kwargs
"""
...
Behind the scenes, this works by swapping in the value of the specified argument (datapath_arg_name) instead of the 'cache_path' argument from Cacher (if 'cache_path' is also specified, it takes precedence over 'datapath').
Of course, you can use a single cacher for multiple functions:
@cacher
def foo1(x):
return x ** 2
@cacher
def foo2(x):
return x / 10
And both of these syntaxes are possible:
cacher = Cacher("my/custom/caching/path")
@cacher
def my_cached_function(...):
...
@Cacher("my/custom/caching/path")
def my_cached_function(...):
...
License
This project is licensed under the terms of the GNU General Public License v3.0. You may copy, distribute and modify the software as long as you track changes/dates in source files. Any modifications to or software including (via compiler) GPL-licensed code must also be made available under the GPL along with build & install instructions.
Support
If you have any questions, issues, or feature requests, please open an issue so that everybody can benefit from your experience! This package is actively maintained.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for cachecache-0.2.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | eac8a9a1b62e9bd7de585c67294054a0f562fabb174da5035c83c0f309eb6e33 |
|
MD5 | b0099e0de31477e62349c92e92092ccf |
|
BLAKE2b-256 | cd89c4c1206db4f51fd47f6d0e13a6701e8ff1fa9246c824f40d6fab3a1c811c |