Skip to main content

Python utility for simple caching in AWS Lambda functions

Project description

Lambda Cache

Simple Caching for AWS Lambda

PackageStatus PyPI version PythonSupport License: MIT Documentation Status Documentation Status

Test Coverage Status Code style: black

Introduction

lambda-cache helps you cache data in your Lambda function across invocations. The package utilizes the internal memory of the lambda function's execution context to store data across multiple invocations. Doing this:

  • Reduces load on back-end systems
  • Reduces the execution time of the lambda
  • Guarantees functions will reference latest data after cache expiry
  • Reduces calls to APIs throttling limits or high charges

lambda-cache prioritizes simplicity over performance and flexibility.

The package is purpose-built for AWS Lambda functions, and currently supports SSM Parameters, Secrets from Secrets Manager and S3 Objects.

Installation

Include the package in your functions code zip-file using the following:

$ pip install lambda-cache -t /path/of/function

Refer to docs for how to include into your lambda function.

Usage

Below we describe further features of the package. For more info refer to the user guide.

SSM - Parameter Store

Cache single parameter

To cache a parameter from ssm, decorate your handler function like so:

from lambda_cache import ssm

@ssm.cache(parameter='/production/app/var')
def handler(event, context):
    var = getattr(context,'var')
    response = do_something(var)
    return response

All invocations of this function over in the next minute will reference the parameter from the function's internal cache, rather than making a network call to ssm. After one minute has lapsed, the the next invocation will invoke get_parameter to refresh the cache. The parameter value will be injected into the context object of your lambda handler.

Change cache expiry

The default max_age_in_seconds settings is 60 seconds (1 minute), it defines the maximum age of a parameter that is acceptable to the handler function. Cache entries older than this, will be refreshed. To set a longer cache duration (e.g 5 minutes), change the setting like so:

from lambda_cache import ssm

@ssm.cache(parameter='/production/app/var', max_age_in_seconds=300)
def handler(event, context):
    var = getattr(context,'var')
    response = do_something(var)
    return response

Note: The caching logic runs only at invocation, regardless of how long the function runs. A 15 minute lambda function will not refresh the parameter, unless explicitly refreshed using get_entry method. The library is primary interested in caching 'across' invocation rather than 'within' an invocation

Change cache entry settings

The default name of the parameter is the string after the last slash('/') character of its name. This means /production/app/var and test/app/var both resolve to just var. To over-ride this default, use entry_name setting like so:

from lambda_cache import ssm

@ssm.cache(parameter='/production/app/var', entry_name='new_var')
def handler(event, context):
    var = getattr(context,'new_var')
    response = do_something(var)
    return response

Cache multiple parameters

To cache multiple entries at once, pass a list of parameters to the parameter argument. This method groups all the parameter value under one python dictionary, stored in the Lambda Context under the entry_name.

Note: When using this method, entry_name is a required parameter, if not present NoEntryNameError exception is thrown.

from lambda_cache import ssm

@ssm.cache(parameter=['/app/var1', '/app/var2'], entry_name='parameters')
def handler(event, context):
    var1 = getattr(context,'parameters').get('var1')
    var2 = getattr(context,'parameters').get('var2')
    response = do_something(var)
    return response

Under the hood, we use the get_parameters API call for boto3, which translate to a single network call for multiple parameters. You can group all parameters types in a single call, including String, StringList and SecureString. StringList will return as a list, while all other types will return as plain-text strings. The library does not support returning SecureString parameters in encrypted form, and will only return plain-text strings regardless of String type.

Note: for this method to work, ensure you have both ssm:GetParameter and ssm:GetParameters (with the 's' at the end) in your function's permission policy

Decorator stacking

If you wish to cache multiple parameters with different expiry times, stack the decorators. In this example, var1 will be refreshed every 30 seconds, var2 will be refreshed after 60.

@ssm.cache(parameter='/production/app/var1', max_age_in_seconds=30)
@ssm.cache(parameter='/production/app/var2', max_age_in_seconds=60)
def handler(event, context):
    var1 = getattr(context,'var1')
    var2 = getattr(context,'var2')
    response = do_something(var)
    return response

Note: Decorator stacking performs one API call per decorator, which might result is slower performance

Cache invalidation

If you require a fresh value at some point of the code, you can force a refresh using the ssm.get_entry function, and setting the max_age_in_seconds argument to 0.

from lambda_cache import ssm

@ssm.cache(parameter='/prod/var')
def handler(event, context):

    if event.get('refresh'):
        # refresh parameter
        var = ssm.get_entry(parameter='/prod/var', max_age_in_seconds=0)
    else:
        var = getattr(context,'var')
    
    response = do_something(var)
    return response

You may also use ssm.get_entry to get a parameter entry from anywhere in your functions code.

To only get parameter once in the lifetime of the function, set max_age_in_seconds to some arbitary large number ~36000 (10 hours).

Return Values

Caching supports String, SecureString and StringList parameters with no change required (ensure you have kms:Decrypt permission for SecureString). For simplicity, StringList parameters are automatically converted into list (delimited by comma), while String and SecureString both return the single string value of the parameter.

Secrets Manager

Cache single secret

Secret support is similar, but uses the secret.cache decorator.

from lambda_cache import secret

@secret.cache(name='/prod/db/conn_string')
def handler(event, context):
    conn_string = getattr(context,'conn_string')
    return context

Change Cache expiry

The default max_age_in_seconds settings is 60 seconds (1 minute), it defines how long a parameter should be kept in cache before it is refreshed from ssm. To configure longer or shorter times, modify this argument like so:

from lambda_cache import secret

@secret.cache(name='/prod/db/conn_string', max_age_in_seconds=300)
def handler(event, context):
    var = getattr(context,'conn_string')
    response = do_something(var)
    return response

Note: The caching logic runs only at invocation, regardless of how long the function runs. A 15 minute lambda function will not refresh the parameter, unless explicitly refreshed using get_cache_ssm. The library is primary interested in caching 'across' invocation rather than 'within' an invocation

Change Cache entry settings

The name of the secret is simply shortened to the string after the last slash('/') character of the secret's name. This means /prod/db/conn_string and /test/db/conn_string resolve to just conn_string. To over-ride this default, use entry_name:

from lambda_cache import secret

@secret.cache(name='/prod/db/conn_string', entry_name='new_var')
def handler(event, context):
    var = getattr(context,'new_var')
    response = do_something(var)
    return response

Decorator stacking

If you wish to cache multiple secrets, you can use decorator stacking.

@secret.cache(name='/prod/db/conn_string', max_age_in_seconds=30)
@secret.cache(name='/prod/app/elk_username_password', max_age_in_seconds=60)
def handler(event, context):
    var1 = getattr(context,'conn_string')
    var2 = getattr(context,'elk_username_password')
    response = do_something(var)
    return response

Note: Decorator stacking performs one API call per decorator, which might result is slower performance.

Cache Invalidation

To invalidate a secret, use the get_entry, setting the max_age_in_seconds=0.

from lambda_cache import secret

@secret.cache(name='/prod/db/conn_string')
def handler(event, context):

    if event.get('refresh'):
        var = secret.get_entry(name='/prod/db/conn_string', max_age_in_seconds=0)
    else:
        var = getattr(context,'conn_string')
    response = do_something(var)
    return response

Return Values

Secrets Manager supports both string and binary secrets. For simplicity we will cache the secret in the format it is stored. It is up to the calling application to process the return as Binary or Strings.

S3

S3 support is considered experimental for now, but withing the python community we see a lot of folks pull down files from S3 for use in AI/ML models.

Files downloaded from s3 are automatically stored in the /tmp directory of the lambda function. This is the only writable directory within lambda, and has a 512MB of storage space.

Cache a single file

To download a file from S3 use the the same decorator pattern:

from lambda_cache import s3

@s3.cache(s3Uri='s3://bucket_name/path/to/object.json')
def s3_download_entry_name(event, context):

    # Object from S3 automatically saved to /tmp directory
    with open("/tmp/object.json") as file_data:
        status = json.loads(file_data.read())['status']

    return status

Change Cache expiry

The default max_age_in_seconds settings is 60 seconds (1 minute), it defines how long a file should be kept in /tmp before it is refreshed from S3. To configure longer or shorter times, modify this argument like so:

from lambda_cache import s3

@s3.cache(s3Uri='s3://bucket_name/path/to/object.json', max_age_in_seconds=300)
def s3_download_entry_name(event, context):
    with open("/tmp/object.json") as file_data:
        status = json.loads(file_data.read())['status']

    return status

Note: The caching logic runs only at invocation, regardless of how long the function runs. A 15 minute lambda function will not refresh the object, unless explicitly refreshed using s3.get_entry. The library is primary interested in caching 'across' invocation rather than 'within' an invocation

Check file before download

By default, lambda_cache will download the file once at cache has expired, however, to save on network bandwidth (and possibly time), we can set the check_before_download parameter to True. This will check the age of the object in S3 and download only if the object has changed since the last download.

from lambda_cache import s3

@s3.cache(s3Uri='s3://bucket_name/path/to/object.json', max_age_in_seconds=300, check_before_download=True)
def s3_download_entry_name(event, context):
    with open("/tmp/object.json") as file_data:
        status = json.loads(file_data.read())['status']

    return status

Note: we use the GetHead object call to verify the objects last_modified_date. This simplifies the IAM policy of the function, as it still only requires the s3:GetObject permission. However, this is still a GET requests, and will be charged as such, for smaller objects it might be cheaper to just download the object

Credit

Project inspired by:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lambda-cache-0.8.1.tar.gz (14.3 kB view details)

Uploaded Source

Built Distribution

lambda_cache-0.8.1-py3-none-any.whl (11.8 kB view details)

Uploaded Python 3

File details

Details for the file lambda-cache-0.8.1.tar.gz.

File metadata

  • Download URL: lambda-cache-0.8.1.tar.gz
  • Upload date:
  • Size: 14.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.6.10 Linux/5.0.0-1035-azure

File hashes

Hashes for lambda-cache-0.8.1.tar.gz
Algorithm Hash digest
SHA256 3ccaaccef02ae8e0c09918a8b1a5ec73707a79ee70b2029e2263c1f1a11786d9
MD5 f77f9779a218ec7377610e0ab5ed30b7
BLAKE2b-256 2cb5416098fd145bc09b1325ddb5e3d702ede7cedcf67041c4acd2f550b52409

See more details on using hashes here.

File details

Details for the file lambda_cache-0.8.1-py3-none-any.whl.

File metadata

  • Download URL: lambda_cache-0.8.1-py3-none-any.whl
  • Upload date:
  • Size: 11.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.6.10 Linux/5.0.0-1035-azure

File hashes

Hashes for lambda_cache-0.8.1-py3-none-any.whl
Algorithm Hash digest
SHA256 27a92c8a7f5829fd65d1661c582c52f55d6d4c95460e4395b87091dd08a4311f
MD5 01c63de5829b5f5fdcbd5a5f69325de6
BLAKE2b-256 9f3cb64e2a147584485d33070cbfb9696156d80ae0592858ad8676de1c08d33d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page