
Allows mapping over large lists of objects with a calculate() function on a supercomputer cluster


# ClusterPool

Provides a map function that works with the SLURM supercomputer scheduler and can be extended to other schedulers. It also works on “normal” computers, mainly for testing purposes (see examples/testfile_primes.py).

A ClusterPool object gives you a map function. You tell it what kind of architecture it runs on (currently only “SLURM” and “normal”), and it creates a dispatcher object that keeps track of the objects that need calculating, saves them to disk, and collects them once they have been calculated.
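The save-and-collect step described above can be sketched in plain Python. This is not ClusterPool's actual code or API: `PrimeCheck`, `save_items`, `collect_items`, and the `item_NNNNN.pkl` file naming are all hypothetical names chosen for illustration. The sketch only shows the general idea of a work item that carries its own state and is pickled to a directory.

```python
import pickle
from pathlib import Path


class PrimeCheck:
    """Hypothetical work item: all inputs live on the instance."""

    def __init__(self, n):
        self.n = n
        self.is_prime = None  # filled in later by calculate()

    def calculate(self):
        # Takes no arguments; reads its input from self and stores the result on self.
        n = self.n
        self.is_prime = n >= 2 and all(n % d for d in range(2, int(n ** 0.5) + 1))
        return self.is_prime


def save_items(items, directory):
    """Pickle each item to its own file (assumed layout, one file per object)."""
    directory = Path(directory)
    paths = []
    for i, item in enumerate(items):
        path = directory / f"item_{i:05d}.pkl"
        with open(path, "wb") as fh:
            pickle.dump(item, fh)
        paths.append(path)
    return paths


def collect_items(paths):
    """Load the pickled items back in the order they were saved."""
    collected = []
    for path in paths:
        with open(path, "rb") as fh:
            collected.append(pickle.load(fh))
    return collected
```

Because the objects travel through pickle, their classes need to be importable wherever they are loaded again.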

These objects are calculated by “Trawler” objects, which are started by the dispatcher and given a directory full of pickled Python objects. Each Trawler picks an object at random and raises a semaphore on the filesystem to tell the other Trawlers that it is working on that object. It then loads the object and runs object.calculate(). (Note that the ClusterPool package can currently ONLY call methods named calculate that take no arguments.)

This version

0.10


Source Distribution

ClusterPool-0.10.tar.gz (7.2 kB)

Uploaded source
