Extended pickling support for Python objects

cloudpickle

cloudpickle makes it possible to serialize Python constructs not supported by the default pickle module from the Python standard library.

cloudpickle is especially useful for cluster computing where Python code is shipped over the network to execute on remote hosts, possibly close to the data.

Among other things, cloudpickle supports pickling for lambda functions along with functions and classes defined interactively in the __main__ module (for instance in a script, a shell or a Jupyter notebook).
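
As a point of comparison, the following minimal sketch uses only the standard library to show the limitation that motivates cloudpickle: the default pickle module refuses to serialize a lambda. The variable names are illustrative, and the exact exception type and message may differ between Python versions, which is why two exception types are caught.

```python
import pickle

# A lambda has no importable name that pickle could refer to.
squared = lambda x: x ** 2

try:
    pickle.dumps(squared)
    stdlib_pickle_failed = False
except (pickle.PicklingError, AttributeError) as exc:
    # The standard pickler tries to look the function up by name and fails.
    stdlib_pickle_failed = True
    print(f"pickle.dumps failed: {type(exc).__name__}")
```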

cloudpickle can only be used to send objects between processes running the exact same version of Python.

Using cloudpickle for long-term object storage is not supported and strongly discouraged.

Security notice: one should only load pickle data from trusted sources as otherwise pickle.load can lead to arbitrary code execution resulting in a critical security vulnerability.
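
To make the risk concrete, here is a harmless sketch based on the standard `__reduce__` hook of the pickle protocol. The class name `Payload` is hypothetical. The point is that loading calls whatever callable the pickle stream names, so an untrusted stream could name a dangerous one instead of `pow`.

```python
import pickle


class Payload:
    """Hypothetical class whose pickle asks the loader to call a function."""

    def __reduce__(self):
        # Instead of object state, the stream records "call pow(2, 10)".
        return (pow, (2, 10))


data = pickle.dumps(Payload())

# Loading does not rebuild a Payload instance: it calls pow(2, 10).
result = pickle.loads(data)
print(result)
```

Since cloudpickle produces ordinary pickle streams that are read back with pickle.loads, the same caution applies to data produced by cloudpickle.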

Installation

The latest release of cloudpickle is available from PyPI:

pip install cloudpickle

Examples

Pickling a lambda expression:

>>> import cloudpickle
>>> squared = lambda x: x ** 2
>>> pickled_lambda = cloudpickle.dumps(squared)

>>> import pickle
>>> new_squared = pickle.loads(pickled_lambda)
>>> new_squared(2)
4

Pickling a function interactively defined in a Python shell session (in the __main__ module):

>>> CONSTANT = 42
>>> def my_function(data: int) -> int:
...     return data + CONSTANT
...
>>> pickled_function = cloudpickle.dumps(my_function)
>>> depickled_function = pickle.loads(pickled_function)
>>> depickled_function
<function __main__.my_function(data:int) -> int>
>>> depickled_function(43)
85
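
The same round trip is expected to work for a class defined interactively. The sketch below assumes cloudpickle is installed; the class `Point` and its method are hypothetical examples, not part of the library.

```python
import pickle

import cloudpickle


class Point:
    """Hypothetical class, defined in __main__ when run as a script."""

    def __init__(self, x, y):
        self.x = x
        self.y = y

    def manhattan(self):
        return abs(self.x) + abs(self.y)


# Serialize the class itself, then rebuild it with the standard loader.
pickled_class = cloudpickle.dumps(Point)
PointCopy = pickle.loads(pickled_class)

distance = PointCopy(3, -4).manhattan()
print(distance)
```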

Overriding pickle's serialization mechanism for importable constructs:

An important difference between cloudpickle and pickle is that cloudpickle can serialize a function or class by value, whereas pickle can only serialize it by reference. Serialization by reference treats functions and classes as attributes of modules, and pickles them through instructions that trigger the import of their module at load time. Serialization by reference is thus limited in that it assumes that the module containing the function or class is available and importable in the unpickling environment. This assumption breaks when pickling constructs defined in an interactive session. cloudpickle detects this case automatically and pickles such constructs by value.
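
Serialization by reference can be observed with the standard library alone. In this small sketch, `json.dumps` is just a convenient importable function chosen for illustration: the pickle stream holds the module and function names rather than the function's code, and loading returns the function found by importing that module.

```python
import json
import pickle

# json.dumps is importable, so pickle records only where to find it.
payload = pickle.dumps(json.dumps)

# The stream contains the module name and the function name, not bytecode.
names_in_payload = b"json" in payload and b"dumps" in payload
print(len(payload), names_in_payload)

# Loading imports the module and returns the very same function object.
same_object = pickle.loads(payload) is json.dumps
print(same_object)
```

If the `json` module were missing in the loading process, `pickle.loads` would fail at import time, which is the limitation described above.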

Another case where the importability assumption is expected to break is when developing a module in a distributed execution environment: the worker processes may not have access to that module, for example if they live on a different machine than the process in which the module is being developed. By itself, cloudpickle cannot detect such "locally importable" modules and switch to serialization by value; instead, it relies on its default mode, which is serialization by reference. However, since cloudpickle 2.0.0, one can explicitly specify modules for which serialization by value should be used, using the register_pickle_by_value(module)/unregister_pickle_by_value(module) API:

>>> import cloudpickle
>>> import my_module
>>> cloudpickle.register_pickle_by_value(my_module)
>>> cloudpickle.dumps(my_module.my_function)  # my_function is pickled by value
>>> cloudpickle.unregister_pickle_by_value(my_module)
>>> cloudpickle.dumps(my_module.my_function)  # my_function is pickled by reference

Using this API, there is no need to re-install the new version of the module on all the worker nodes nor to restart the workers: restarting the client Python process with the new source code is enough.

Note that this feature is still experimental, and may fail in the following situations:

If the body of a function/class pickled by value contains an import statement:

>>> def f():
...     from another_module import g
...     # calling f in the unpickling environment may fail if another_module
...     # is unavailable
...     return g() + 1

If a function pickled by reference uses a function pickled by value during its execution.

Running the tests

With tox, to run the tests for all the supported versions of Python and PyPy:
```
pip install tox
tox
```
or alternatively for a specific environment:
```
tox -e py37
```
With py.test, to run the tests only for your current version of Python:
```
pip install -r dev-requirements.txt
PYTHONPATH='.:tests' py.test
```

History

cloudpickle was initially developed by picloud.com and shipped as part of the client SDK.

A copy of cloudpickle.py was included as part of PySpark, the Python interface to Apache Spark. Davies Liu, Josh Rosen, Thom Neale and other Apache Spark developers improved it significantly, most notably to add support for PyPy and Python 3.

The aim of the cloudpickle project is to make that work available to a wider audience outside of the Spark ecosystem and to make it easier to improve it further, notably with the help of a dedicated non-regression test suite.
