cloud-mappings

MutableMapping implementations for common cloud storage providers

For now Azure Blob Storage, Azure Table Storage, Google Cloud Storage, and AWS S3 are implemented. Contributions of new providers are welcome.

Installation

With pip:

pip install cloud-mappings

By default, cloud-mappings doesn't install any of the storage providers' dependencies. To install them alongside cloud-mappings, run the following with any combination of the extras:

pip install cloud-mappings[azureblob,azuretable,gcpstorage,awss3]

Instantiation

AzureBlobMapping:

from cloudmappings import AzureBlobMapping

cm = AzureBlobMapping.with_pickle(
    container_name="CONTAINER_NAME",
    account_url="AZURE_BLOB_STORAGE_URL",
    credential=AZURE_CREDENTIAL_OBJECT,
    connection_string="AZURE_BLOB_STORAGE_CONNECTION_STRING",
)

AzureTableMapping:

from cloudmappings import AzureTableMapping

cm = AzureTableMapping.with_pickle(
    table_name="TABLE_NAME",
    endpoint="AZURE_TABLE_ENDPOINT",
    credential=AZURE_CREDENTIAL_OBJECT,
    connection_string="AZURE_TABLE_CONNECTION_STRING",
)

Note that Azure Table Storage has a 1MB size limit per entity.

GoogleCloudStorageMapping:

from cloudmappings import GoogleCloudStorageMapping

cm = GoogleCloudStorageMapping.with_pickle(
    bucket_name="BUCKET_NAME",
    project="GCP_PROJECT",
    credentials=GCP_CREDENTIALS_OBJECT,
)

AWSS3Mapping:

from cloudmappings import AWSS3Mapping

cm = AWSS3Mapping.with_pickle(
    bucket_name="AWS_BUCKET_NAME",
    silence_warning=False,
)

Note that AWS S3 does not support server-side atomic requests, so it is not recommended for concurrent use. A warning is printed out by default but may be silenced by passing silence_warning=True.

Usage

Use it just like a standard dict()!

cm["key"] = 1000
cm["key"] # returns 1000
del cm["key"]
"key" in cm # returns False
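
Because a cloud-mapping implements the MutableMapping interface, the dict-style methods that Python derives from the core operations (get, update, pop, items and so on) should be available too. The sketch below illustrates this with a hypothetical in-memory stand-in, InMemoryMapping, which is not part of cloud-mappings; it only shows what collections.abc.MutableMapping provides once the five core methods are defined.

```python
from collections.abc import MutableMapping


class InMemoryMapping(MutableMapping):
    """Hypothetical stand-in for a cloud-mapping; keeps values in a local dict."""

    def __init__(self):
        self._data = {}

    def __getitem__(self, key):
        return self._data[key]

    def __setitem__(self, key, value):
        self._data[key] = value

    def __delitem__(self, key):
        del self._data[key]

    def __iter__(self):
        return iter(self._data)

    def __len__(self):
        return len(self._data)


cm = InMemoryMapping()
cm["key"] = 1000
cm.update({"other": 2000})       # derived from __setitem__
default = cm.get("missing", 0)   # derived from __getitem__
popped = cm.pop("key")           # derived from __getitem__ and __delitem__
remaining = sorted(cm.items())   # derived from __iter__ and __getitem__
```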

Etags

Each cloud-mapping keeps an internal dict of etags which it uses to ensure it is only reading, overwriting, or deleting data it expects to. If the value in storage is not what the cloud-mapping expects, a cloudmappings.errors.KeySyncError() will be raised.
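
To make the etag check concrete, here is a minimal sketch of the idea using hypothetical stand-ins. FakeCloud, EtagCheckedMapping, and the local KeySyncError class are illustrative names, not the library's implementation, and the check here is not atomic, whereas a real provider would rely on server-side conditional requests.

```python
import hashlib


class KeySyncError(Exception):
    """Local stand-in for cloudmappings.errors.KeySyncError."""


class FakeCloud:
    """Hypothetical storage that derives an etag from each stored value."""

    def __init__(self):
        self.blobs = {}

    def etag(self, key):
        value = self.blobs.get(key)
        return None if value is None else hashlib.sha256(value).hexdigest()


class EtagCheckedMapping:
    """Refuses to overwrite a key whose remote etag differs from the expected one."""

    def __init__(self, cloud):
        self.cloud = cloud
        self.etags = {}  # key -> etag this mapping last saw

    def __setitem__(self, key, value):
        if self.cloud.etag(key) != self.etags.get(key):
            raise KeySyncError(key)
        self.cloud.blobs[key] = value
        self.etags[key] = self.cloud.etag(key)


cloud = FakeCloud()
first, second = EtagCheckedMapping(cloud), EtagCheckedMapping(cloud)
first["key"] = b"from first"        # nothing stored yet, as first expects
try:
    second["key"] = b"from second"  # second still expects the key to be absent
    conflict = False
except KeySyncError:
    conflict = True
```

In this sketch the second mapping would have to refresh its expected etags before its write could go through.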

If you would like to enable read (get) operations without ensuring etags, you can set read_blindly=True. This can be set in the constructor, or dynamically turned on and off with set_read_blindly(True) and set_read_blindly(False) respectively. Blindly reading a value that doesn't exist in the cloud will return None.

If you know what you are doing and you want an operation other than get to go through despite etags, you will need to sync your cloud-mapping with the cloud by calling either .sync_with_cloud() to sync all keys or .sync_with_cloud(key_prefix) to sync a specific key or subset of keys. By default .sync_with_cloud() is called on instantiation of a cloud-mapping if the underlying provider storage already exists. You may skip this initial sync by passing an additional sync_initially=False parameter when you instantiate your cloud-mapping.
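
As a rough illustration of what syncing by prefix means, the sketch below refreshes a local etag dict from a remote listing. The function sync_etags and the plain dicts are hypothetical; in particular, forgetting keys that have disappeared remotely is an assumption about reasonable behaviour, not a statement of what .sync_with_cloud() does internally.

```python
def sync_etags(local_etags, remote_etags, key_prefix=None):
    """Refresh local_etags from remote_etags for keys matching key_prefix.

    With key_prefix=None every key is synced.
    """

    def matches(key):
        return key_prefix is None or key.startswith(key_prefix)

    # Forget matching keys that no longer exist remotely (assumed behaviour).
    for key in [k for k in local_etags if matches(k)]:
        if key not in remote_etags:
            del local_etags[key]

    # Adopt the remote etag for every matching key.
    for key, etag in remote_etags.items():
        if matches(key):
            local_etags[key] = etag
    return local_etags


local = {"a/1": "v1", "a/2": "v1", "b/1": "v1"}
remote = {"a/1": "v2", "a/3": "v1", "b/1": "v2"}
sync_etags(local, remote, key_prefix="a/")
```

Only keys under "a/" are touched; the stale etag for "b/1" stays as it was until that prefix, or the whole mapping, is synced.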

The etags property on a cloud-mapping can be manually inspected and adjusted for advanced use cases, but it is not recommended if your use case can be accomplished with the above methods.

Serialisation

If you don't call .with_pickle() and instead pass your provider's configuration directly to the CloudMapping class, you will get a "raw" cloud-mapping which accepts only bytes-like objects as values. Along with the .with_pickle() serialisation utility, .with_json() and .with_json_zlib() also exist.

You may build your own serialisation either by using zict, or by calling .with_buffers([dumps_1, dumps_2, ..., dumps_N], [loads_1, loads_2, ..., loads_N]), where dumps and loads are the ordered functions to serialise and parse your data respectively.
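
The following sketch shows how two ordered lists of functions can compose into a round-trip serialiser. The chain helper is hypothetical and only mimics the idea; it assumes each list is applied first to last, which is an interpretation of "ordered" above rather than a confirmed detail of .with_buffers().

```python
import json
import zlib
from functools import reduce


def chain(functions):
    """Return a function that applies each function in order to its input."""
    return lambda value: reduce(lambda acc, fn: fn(acc), functions, value)


# Ordered serialisers: object -> JSON text -> UTF-8 bytes -> compressed bytes.
ordered_dumps = [json.dumps, str.encode, zlib.compress]
# Ordered parsers undo those steps: compressed bytes -> bytes -> text -> object.
ordered_loads = [zlib.decompress, bytes.decode, json.loads]

dumps = chain(ordered_dumps)
loads = chain(ordered_loads)

payload = dumps({"key": 1000})  # bytes, suitable for a "raw" mapping
restored = loads(payload)
```

Under that assumption, lists like these would be the two arguments passed to .with_buffers().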

Development

Code style: black

This project uses .devcontainer to describe the development environment. You may use the environment described in this directory (it integrates automatically with VS Code's 'Remote - Containers' extension), or you may create your own environment with the same dependencies.

Dependencies

Install development dependencies with:

pip install .[azureblob,azuretable,gcpstorage,awss3,tests]

Tests

Set environment variables for each provider:

  • Azure Blob: AZURE_TENANT_ID, AZURE_CLIENT_ID, AZURE_CLIENT_SECRET, AZURE_BLOB_STORAGE_ACCOUNT_URL
  • Azure Table: AZURE_TABLE_STORAGE_CONNECTION_STRING
  • GCP Storage: GOOGLE_APPLICATION_CREDENTIALS (path to credentials file), GOOGLE_CLOUD_STORAGE_PROJECT
  • AWS S3: AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY
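
Before running the tests, it may help to check which of these variables are missing. The helper below is a hypothetical convenience, not part of the project; the variable names are copied from the list above.

```python
import os

REQUIRED_ENV_VARS = {
    "Azure Blob": [
        "AZURE_TENANT_ID",
        "AZURE_CLIENT_ID",
        "AZURE_CLIENT_SECRET",
        "AZURE_BLOB_STORAGE_ACCOUNT_URL",
    ],
    "Azure Table": ["AZURE_TABLE_STORAGE_CONNECTION_STRING"],
    "GCP Storage": ["GOOGLE_APPLICATION_CREDENTIALS", "GOOGLE_CLOUD_STORAGE_PROJECT"],
    "AWS S3": ["AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY"],
}


def missing_env_vars(environ=None):
    """Return {provider: [unset variable names]} for providers with gaps."""
    environ = os.environ if environ is None else environ
    missing = {}
    for provider, names in REQUIRED_ENV_VARS.items():
        unset = [name for name in names if not environ.get(name)]
        if unset:
            missing[provider] = unset
    return missing


if __name__ == "__main__":
    for provider, names in missing_env_vars().items():
        print(f"{provider}: missing {', '.join(names)}")
```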

Run tests with:

pytest --test_container_id <container-to-use-for-tests>

* Note that if the specified container already exists, one test is expected to fail.
