Skip to main content

A distributed spinlock for Python

Project description

Distributed Spinlock

PyPi Supported Python versions linters coverage

Introduction

This is a simple, yet flexible implementation of a distributed spinlock mutex. A mutex is a way to lock a resource under contention, for more information about mutexes you can read here. In particular, a Spinlock is a type of mutex that blocks the waiting thread and continuously polls the target mutex to see when it can be acquired. This can be useful when you have many stateless distributed services that are in contention for the same resource. For example, this can be reflected when you want to avoid duplicate computations while a query is already being executed.

Note: the documentation in readthedocs is always generated against the latest published version, thus there might be differences if you are using the checkout from main avenue to install.

Preliminaries

The implementation is based on redis and practically is the only required dependency to successfully install the library. For more configuration options and usage directions, please read on.

Installation

To use the library, please install it as follows,

# assuming you are in a virtual environment
pip install py-dspilock

Any requirements will be installed automatically, but the minimum required version of redis client is set to be 5.0.0.

Usage

Out of the box, the library provides a distributed spinlock implementation that can be used with any object that implements Hashable. Practically, this means that the object instance can be hashed. For example,

# a string is hashable,
s = "example"
# its hash value can be extracted as such,
s_hash = hash(s)

Thus, a practical example that you can use out of the box is the following,

"""A basic example of how to use the HashDSpinlock."""

from dspinlock import HashDSpinlock

if __name__ == "__main__":
    with HashDSpinlock("example"):
        print("executed task_id: 1")

If you do not supply any arguments, then the default parameters are used. Meaning that the following hold,

  • redis connection gets created with the predefined parameters (host: localhost, db: 1, port: 6379)
  • fail_if_key_exists: is set to False
  • cached_if_computed: is set to False.

Parameterising redis connection

Ideally, the redis connection should not be created for every call we make to the spinlock mutex. Thus, ideally, we should be caching that and passing it as an argument as such,

"""An example of how to use the HashDSpinlock with an existing `redis` connection."""

from dspinlock import HashDSpinlock
from dspinlock.utils import create_redis_conn

# using default parameters
sess = create_redis_conn()

if __name__ == "__main__":
    with HashDSpinlock("example", sess=sess):
        print("executed task_id: 1")

If no arguments are supplied to create_redis_conn method, then a connection with the default parameters is created. To parameterise it, we can use the following,

"""An example of how to use the HashDSpinlock with an existing parameterised `redis` connection."""

from dspinlock import HashDSpinlock
from dspinlock.utils import create_redis_conn, RedisParameters

params = RedisParameters(
    redis_host="localhost",
    rdb=0,
    port=6379,
    ssl=False,
    socket_connect_timeout=2,
    decode_responses=True
)

# using default parameters
sess = create_redis_conn(params)

if __name__ == "__main__":
    with HashDSpinlock("example", sess=sess):
        print("executed task_id: 1")

For more information, please check the source code at the link.

Extending the base class to fit your needs

While a default Spinlock is supplied, it is very common that you might want to subclass it in order to customise its functionality. There are two avenues to do that, you can subclass HashDSpinlock directly, or you can subclass its base abstract class DSpinlockBase. Most of the functionality is contained in the base abstract class, hence you need to only implement the methods shown in HashDSpinlock and tweak any of the base class values such as,

  • max_spinlock_tries: The spinlock max retries, by default 10 tries.
  • spinlock_sleep_thresh: The spinlock sleep threshold, by default 0.5 seconds.
  • expire_at_timedelta: Sets timedelta from creation that the mutex expires, by default 1 hour.
  • max_block_time: The max block time allowed for a query mutex to be held, if not released it's forcefully unblocked.

Extending HashDSpinLock to tweak above values

"""An example of how to subclass `HashDSpinlock` while tweaking its base attributes."""
from dspinlock import HashDSpinlock

class CustomSpinlock(HashDSpinlock):
    """Custom class that overrides the basic variables"""
    spinlock_sleep_thresh: float = 0.1
    """Tweak the threshold for spinlock sleep."""

if __name__ == "__main__":
    with CustomSpinlock("example"):
        print("executed task_id: 1")

Advanced customisation

The library provides the primitives so that you can create your own subclasses based on either HashDSpinlock or the baseclass itself. To so do, you can override these functions,

  • _get_uid: returns the unique identifier for the instance, by default it is the hash of the object,
  • _get_tag: the tag of the instance which is used to "tag" the actual mutex value,
  • _unpack_value: defines the way we unpack the value retrieved from redis,
  • get_key: returns the key that should be uniquely identifying each entry.

You do not have to override all of them, but for more details you can see how the HashDSpinlock itself is implemented.

Enable logging

The library uses a custom logger in order to provide diagnostic information. To enable the logger, to do two things. Firstly you need to set the environment variable - using any value - with the key: SL_LOG_ENABLED. After, you have to set your basic logger to accept at least DEBUG level messages, as all diagnostic ones are set to that level.

The code snippet that accomplishes that is shown below,

import logging

from dspinlock import HashDSpinlock
from dspinlock.consts import SL_LOG_LEVEL

# ensure your basic logger exists
logging.basicConfig(level=SL_LOG_LEVEL)

if __name__ == "__main__":
    # now you see some debug messages in the default format of the library.
    with HashDSpinlock("example"):
        print("executed task_id: 1")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

py-dspinlock-0.2.3.tar.gz (20.1 kB view details)

Uploaded Source

Built Distribution

py_dspinlock-0.2.3-py3-none-any.whl (19.9 kB view details)

Uploaded Python 3

File details

Details for the file py-dspinlock-0.2.3.tar.gz.

File metadata

  • Download URL: py-dspinlock-0.2.3.tar.gz
  • Upload date:
  • Size: 20.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.6

File hashes

Hashes for py-dspinlock-0.2.3.tar.gz
Algorithm Hash digest
SHA256 97767b70ae0983da371f141ed14b39e34f41064348ece261fa225e696299c0d8
MD5 9da6a00259aa64477a20af944abe8883
BLAKE2b-256 7b907faa33b1510da4aabd5ec431e154911f852684951d87119d1f51c93edf8b

See more details on using hashes here.

File details

Details for the file py_dspinlock-0.2.3-py3-none-any.whl.

File metadata

  • Download URL: py_dspinlock-0.2.3-py3-none-any.whl
  • Upload date:
  • Size: 19.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.6

File hashes

Hashes for py_dspinlock-0.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 5f25d723f59e5ebc5101524b972d9a8e3528f664c7d923c2e3cea865e83c8143
MD5 625f94f17a325ad6f9ce51a03fde25ad
BLAKE2b-256 cbd3f33deecb7cd36dc31325f05f2a967e5af16621e7b9613e0b204f3be1b551

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page