cache-decorator

a simple decorator to cache the results of computationally heavy functions

These details have not been verified by PyPI

Project links

Homepage

Project description

A simple decorator to cache the results of computationally heavy functions. The package automatically serialize and deserialize depending on the format of the save path.

By default it supports only .json and .pkl but other extensions can be enabled by using the extra feature:

[compress_json] .json.gz .json.bz .json.lzma

[compress_pickle] .pkl.gz .pkl.bz .pkl.lzma .pkl.zip

[numpy] .npy .npz

[pandas] .csv .csv.gz .csv.bz2 .csv.zip .csv.xz

[excel] .xlsx

The extra feature [numba] enables the caching of numba objects.

How do I install this package?

As usual, just download it using pip:

pip install cache_decorator

To install all the extensions use:

pip install "cache_decorator[all]"

(the double quotes are optional in bash but required by zsh)

Optionally you can specify the single features you want:

pip install "cache_decorator[compress_json, compress_pickle, numpy, pandas, excel, numba]"

If the installation fails you can try to add --user at the end of the command as:

pip install "cache_decorator[compress_json, compress_pickle, numpy, pandas, excel, numba]" --user

Tests Coverage

Since some software handling coverages sometime get slightly different results, here’s three of them:

Examples of Usage

To cache a function or a method you just have to decorate it with the cache decorator.

from time import sleep
from cache_decorator import Cache

@Cache()
def x(a, b):
    sleep(3)
    return a + b

class A:
    @Cache()
    def x(self, a, b):
        sleep(3)
        return a + b

Cache path

The default cache directory is ./cache but this can be setted by passing the cache_dir parameter to the decorator or by setting the environment variable CACHE_DIR. In the case both are setted, the parameter folder has precedence over the environment one.

from time import sleep
from cache_decorator import Cache

@Cache(cache_dir="/tmp")
def x(a):
    sleep(3)
    return a

The path format can be modified by passing the cache_path parameter. This string will be formatted with infos about the function, its parameters and, if it’s a method, the self attributes.

De default path is:

from time import sleep
from cache_decorator import Cache

@Cache(cache_path="{cache_dir}/{file_name}_{function_name}/{_hash}.pkl")
def x(a):
    sleep(3)
    return a

But can be modified giving cache a more significative name, for example we can add the value of a into the file name.

from time import sleep
from cache_decorator import Cache

@Cache(cache_path="{cache_dir}/{file_name}_{function_name}/{a}_{_hash}.pkl")
def x(a):
    sleep(3)
    return a

Depending on the extension of the file, different serialization and deserialization dispatcher will be called.

from time import sleep
from cache_decorator import Cache

@Cache(cache_path="/tmp/{_hash}.pkl.gz")
def x(a):
    sleep(3)
    return a

@Cache(cache_path="/tmp/{_hash}.json")
def x(a):
    sleep(3)
    return {"1":1,"2":2}

@Cache(cache_path="/tmp/{_hash}.npy")
def x(a):
    sleep(3)
    return np.array([1, 2, 3])

@Cache(cache_path="/tmp/{_hash}.npz")
def x(a):
    sleep(3)
    return np.array([1, 2, 3]), np.array([1, 2, 4])

Ignoring arguments when computing the hash

By default the cache is differentiate by the parameters passed to the function. One can specify which parameters should be ignored.

from time import sleep
from cache_decorator import Cache

@Cache(args_to_ignore=["verbose"])
def x(a, verbose=False):
    sleep(3)
    if verbose:
        print("HEY")
    return a

Multiple arguments can be specified as a list of strings with the name of the arguments to ignore.

from time import sleep
from cache_decorator import Cache

@Cache(args_to_ignore=["verbose", "multiprocessing"])
def x(a, verbose=False, multiprocessing=False):
    sleep(3)
    if verbose:
        print("HEY")
    return a

Cache validity

Cache also might have a validity duration.

from time import sleep
from cache_decorator import Cache

@Cache(
    cache_path="/tmp/{_hash}.pkl.gz",
    validity_duration="24d"
    )
def x(a):
    sleep(3)
    return a

In this example the cache will be valid for the next 24 days. and on the 25th day the cache will be rebuilt. The duration can be written as a time in seconds or as a string with unit. The units can be “s” seconds, “m” minutes, “h” hours, “d” days, “w” weeks.

Logging

Each time a new function is decorated with this decorator, a new logger is created. You can modify the default logger with log_level and log_format.

from time import sleep
from cache_decorator import Cache

@Cache(log_level="debug")
def x(a):
    sleep(3)
    return a

If the default format is not like you like it you can change it with:

from time import sleep
from cache_decorator import Cache

@Cache(log_format="%(asctime)-15s[%(levelname)s]: %(message)s")
def x(a):
    sleep(3)
    return a

More informations about the formatting can be found here https://docs.python.org/3/library/logging.html .

Moreover, the name of the default logger is:

logging.getLogger("cache." + function.__name__)

So we can get the reference to the logger and fully customize it:

import logging
from cache_decorator import Cache

@Cache()
def test_function(x):
    return 2 * x

# Get the logger
logger = logging.getLogger("cache.f")
logger.setLevel(logging.DEBUG)

# Make it log to a file
handler = logging.FileHandler("cache.log")
logger.addHandler(handler)

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

2.2.0

Jan 26, 2024

2.1.15

Jul 31, 2023

2.1.14

May 25, 2023

2.1.13

Nov 9, 2022

2.1.11

Aug 7, 2022

2.1.10

Aug 6, 2022

2.1.9

Aug 5, 2022

2.1.8

Jun 24, 2022

2.1.7

Jun 21, 2022

2.1.6

May 31, 2022

2.1.5

May 31, 2022

2.1.4

May 31, 2022

2.1.3

May 27, 2022

2.1.2

May 27, 2022

2.1.1

May 27, 2022

2.1.0

May 24, 2022

2.0.16

May 12, 2022

2.0.15

May 6, 2022

2.0.14

Apr 29, 2022

2.0.13

Dec 14, 2021

2.0.12

Nov 13, 2021

2.0.11

Nov 13, 2021

2.0.10

Nov 9, 2021

2.0.9

Sep 6, 2021

2.0.8

Sep 5, 2021

2.0.7

Sep 4, 2021

2.0.6

Jun 23, 2021

2.0.5

Jun 22, 2021

2.0.4

Jun 22, 2021

2.0.3

Apr 26, 2021

2.0.2

Apr 12, 2021

2.0.1

Mar 28, 2021

2.0.0

Mar 24, 2021

1.6.0

Feb 22, 2021

1.5.1

Feb 17, 2021

1.5.0

Jan 17, 2021

1.4.1

Oct 30, 2020

This version

1.4.0

Oct 8, 2020

1.3.2

Aug 12, 2020

1.3.0

Aug 12, 2020

1.2.5

Apr 5, 2020

1.2.4

Apr 5, 2020

1.2.3

Apr 5, 2020

1.2.2

Mar 29, 2020

1.2.1

Mar 28, 2020

1.2.0

Mar 27, 2020

1.1.9

Mar 27, 2020

1.1.8

Mar 27, 2020

1.1.7

Mar 27, 2020

1.1.6

Mar 27, 2020

1.1.5

Mar 27, 2020

1.1.3

Mar 20, 2020

1.1.2

Feb 28, 2020

1.1.1

Feb 25, 2020

1.1.0

Feb 25, 2020

1.0.0

Feb 24, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cache_decorator-1.4.0.tar.gz (12.6 kB view details)

Uploaded Oct 8, 2020 Source

File details

Details for the file cache_decorator-1.4.0.tar.gz.

File metadata

Download URL: cache_decorator-1.4.0.tar.gz
Upload date: Oct 8, 2020
Size: 12.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.1 CPython/3.8.5

File hashes

Hashes for cache_decorator-1.4.0.tar.gz
Algorithm	Hash digest
SHA256	`3126ebabf19c69cf54b2249d2486127ad898ff15781f2954d8efeb04780ef737`
MD5	`bcfa800e49a421b207ae793ba09cf595`
BLAKE2b-256	`b7aec6ba45e9cd5f0822c4209d577ecbaa64b8efe2b0cb8df037f4592b5ec970`