Skip to main content

Easily map Python functions onto a cluster using a DRMAA-compatible grid engine like Sun Grid Engine (SGE).

Project description

A module to allow you to easily create jobs on the cluster directly from Python. You can directly map Python functions onto the cluster without needing to write any wrapper code yourself.

This is the ETS fork of an older project called Python Grid. It’s a lot simpler than the original version, because we use a Redis database for storing the inputs/outputs for each job instead of the ZeroMQ-based method they were using. The main benefit of this approach is you never run into issues with exceeding the message length when you’re parallelizing a huge job.

For some examples of how to use it, check out map_reduce.py (for a simple example of how you can map a function onto the cluster) and manual.py (for an example of how you can create list of jobs yourself) in the examples folder.

For complete documentation go here.

NOTE: You cannot use Grid Map on a machine that is not allowed to submit jobs (e.g., slave nodes).

Requirements

License

  • GPLv3

Changelog

  • 0.9.9
    • Changed way job results are retrieved to be a bit more efficient in cases of errors.
    • All job metadata is now retrieved before job output is, which should hopefully alleviate issues where we can’t get the metadata because its been flushed too quickly by the grid engine.
  • 0.9.8
    • Fixed a bug where only the first error was still showing because of an extra exception caused by job_output being undefined.
    • Fixed unhandled Exception with error code 24 (since somehow that is not an InvalidJobException, but just an Exception in drmaa-python).
  • 0.9.7
    • No longer dies with InvalidJobException when failing to retrieve job metadata from DRMAA service.
    • Now print all exceptions encountered for jobs submitted instead of just exiting after first one.
    • Die via exception instead of sys.exit when there were problems with some of the submitted jobs.
  • 0.9.6
    • Fixed bug where jobs were being aborted before they ran.
  • 0.9.5
    • Fixed bug where GRID_MAP_USE_MEM_FREE would only be interpretted as true if spelled ‘True’.
    • Added documentation describing how to override constants.
  • 0.9.4
    • Added support for overriding the default queue and other constants via environment variables. For example, to change the default queue, just set the environment variable GRID_MAP_DEFAULT_QUEUE.
    • Substantially more information is given about crashing jobs when we fail to unpickle the results from the Redis database.
  • 0.9.3
    • Fixed serious bug where gridmap could not be imported in some instances.
    • Refactored things a bit so there is no longer one large module with all of the code in it. (Doesn’t change package interface)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
gridmap-0.9.9.tar.gz (72.9 kB) Copy SHA256 hash SHA256 Source None Jul 30, 2013

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page