Dispatch your trivially parallelizable jobs with sharedmem.
Now also supports Python 3.
sharedmem.empty creates numpy arrays shared by child processes.
sharedmem.MapReduce dispatches work to child processes, allowing work functions defined in nested scopes.
sharedmem.MapReduce.ordered and sharedmem.MapReduce.critical implement concepts equivalent to OpenMP ordered and OpenMP critical sections (a short sketch of the ordered section follows below).
Exceptions are handled properly, including unpicklable exceptions. Unexpected death of child processes (Slaves) is handled gracefully.
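For instance, an ordered section lets the slaves emit output in submission order while the rest of the work still runs in parallel. Below is a minimal sketch of that pattern; it mirrors the with pool.critical usage in the examples further down, and the squared-integer work is purely illustrative.

import sharedmem

with sharedmem.MapReduce() as pool:
    def work(i):
        result = i * i            # computed in parallel by the slaves
        with pool.ordered:
            # this block runs in the submission order of the items,
            # analogous to an OpenMP ordered section
            print(i, result)
        return result

    squares = pool.map(work, range(8))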
Functions and variables are inherited by the child processes through the fork syscall and the copy-on-write mechanism; the exception is sharedmem variables, which are writable from both the child processes and the main process. Picklability of objects is not a concern. The usual limitations of fork do apply.
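As a quick illustration of the difference (a minimal sketch, assuming the default configuration actually forks slave processes): writes by the slaves to an ordinary numpy array land in their private copy-on-write pages and are not seen by the main process, while writes to a sharedmem array are.

import numpy
import sharedmem

plain = numpy.zeros(4, dtype='f8')       # inherited via fork; each slave writes to its own copy
shared = sharedmem.empty(4, dtype='f8')  # backed by shared memory
shared[:] = 0

with sharedmem.MapReduce() as pool:
    def work(i):
        plain[i] = i + 1    # modifies the slave's private copy only
        shared[i] = i + 1   # visible to the main process
    pool.map(work, range(4))

print(plain)     # typically still all zeros in the main process
print(shared)    # now holds 1, 2, 3, 4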
sharedmem.MapReduce is easier to use than multiprocessing.Pool,
at the cost of not supporting Windows.
For documentation, please refer to http://rainwoodman.github.io/sharedmem .
Here we provide two simple examples to illustrate the usage:
"""
Integrate x over [0, 1.0) with the rectangle rule.
Compare results from
1. direct sum of 'xdx' (filled by subprocesses)
2. 'shmsum', accumulated by partial sums on each process
3. sum of partial sums from each process.
"""
import numpy
import sharedmem
from numpy.testing import assert_almost_equal

xdx = sharedmem.empty(1024 * 1024 * 128, dtype='f8')
shmsum = sharedmem.empty((), dtype='f8')
shmsum[...] = 0.0

with sharedmem.MapReduce() as pool:
    def work(i):
        s = slice(i, i + chunksize)
        start, end, step = s.indices(len(xdx))
        dx = 1.0 / len(xdx)
        myxdx = numpy.arange(start, end, step) \
                * 1.0 / len(xdx) * dx
        xdx[s] = myxdx
        a = xdx[s].sum(dtype='f8')
        with pool.critical:
            # serialize concurrent updates to the shared accumulator
            shmsum[...] += a
        return i, a

    def reduce(i, a):
        # print('chunk', i, 'done', 'local sum', a)
        return a

    chunksize = 1024 * 1024
    r = pool.map(work, range(0, len(xdx), chunksize), reduce=reduce)

    assert_almost_equal(numpy.sum(r, dtype='f8'), shmsum)
    assert_almost_equal(numpy.sum(xdx, dtype='f8'), shmsum)
"""
An example word counting program. The parallelism is per line.
In reality, the parallelism shall be at least on a file level to
benefit from sharedmem / multiprocessing.
"""
word_count = {
    'sharedmem': 0,
    'pool': 0,
}

with sharedmem.MapReduce() as pool:
    def work(line):
        # create a fresh local counter dictionary
        my_word_count = dict([(word, 0) for word in word_count])
        for word in line.replace('.', ' ').split():
            if word in word_count:
                my_word_count[word] += 1
        return my_word_count

    def reduce(her_word_count):
        for word in word_count:
            word_count[word] += her_word_count[word]

    pool.map(work, open(__file__, 'r').readlines(), reduce=reduce)

    parallel_result = dict(word_count)

    # establish the ground truth from the sequential counter
    sharedmem.set_debug(True)
    for word in word_count:
        word_count[word] = 0
    pool.map(work, open(__file__, 'r').readlines(), reduce=reduce)
    sharedmem.set_debug(False)

    for word in word_count:
        assert word_count[word] == parallel_result[word]
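As the docstring above notes, per-line parallelism is too fine-grained to pay off in practice; a more realistic layout hands whole files to the slaves. The following is only a sketch of that per-file variant, and the filenames list is hypothetical (here it just scans this script itself).

import sharedmem

word_count = {'sharedmem': 0, 'pool': 0}
filenames = [__file__]        # hypothetical list of text files to scan

with sharedmem.MapReduce() as pool:
    def work(filename):
        # count the target words in one whole file per job
        my_word_count = dict((word, 0) for word in word_count)
        with open(filename, 'r') as f:
            for line in f:
                for word in line.replace('.', ' ').split():
                    if word in word_count:
                        my_word_count[word] += 1
        return my_word_count

    def reduce(her_word_count):
        for word in word_count:
            word_count[word] += her_word_count[word]

    pool.map(work, filenames, reduce=reduce)

print(word_count)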