Jug · PyPI

A Task Based Parallelization Framework

These details have not been verified by PyPI

Project links

Homepage

Project description

Jug allows you to write code that is broken up into tasks and run different tasks on different processors.

https://anaconda.org/conda-forge/jug/badges/installer/conda.svg

https://img.shields.io/badge/CITATION-doi.org%2F10.5334%2Fjors.161-green.svg

It uses the filesystem to communicate between processes and works correctly over NFS, so you can coordinate processes on different machines.

Jug is a pure Python implementation and should work on any platform.

Python 2.6/2.7 and Python 3.3+ are supported.

Website: http://luispedro.org/software/jug

Documentation: https://jug.readthedocs.org/

Video: On vimeo or showmedo

Mailing List: http://groups.google.com/group/jug-users

Install

You can install Jug with pip:

pip install Jug

or use, if you are using conda, you can install jug from conda-forge using the following commands:

conda config --add channels conda-forge
conda install jug

Citation

If you use Jug to generate results for a scientific publication, please cite

Coelho, L.P., (2017). Jug: Software for Parallel Reproducible Computation in Python. Journal of Open Research Software. 5(1), p.30.

http://doi.org/10.5334/jors.161

Short Example

Here is a one minute example. Save the following to a file called primes.py (if you have installed jug, you can obtain a slightly longer version of this example by running jug demo on the command line):

from jug import TaskGenerator
from time import sleep

@TaskGenerator
def is_prime(n):
    sleep(1.)
    for j in range(2,n-1):
        if (n % j) == 0:
            return False
    return True

primes100 = [is_prime(n) for n in range(2,101)]

This is a brute-force way to find all the prime numbers up to 100. Of course, this is only for didactical purposes, normally you would use a better method. Similarly, the sleep function is so that it does not run too fast. Still, it illustrates the basic functionality of Jug for embarassingly parallel problems.

Type jug status primes.py to get:

Task name                  Waiting       Ready    Finished     Running
----------------------------------------------------------------------
primes.is_prime                  0          99           0           0
......................................................................
Total:                           0          99           0           0

This tells you that you have 99 tasks called primes.is_prime ready to run. So run jug execute primes.py &. You can even run multiple instances in the background (if you have multiple cores, for example). After starting 4 instances and waiting a few seconds, you can check the status again (with jug status primes.py):

Task name                  Waiting       Ready    Finished     Running
----------------------------------------------------------------------
primes.is_prime                  0          63          32           4
......................................................................
Total:                           0          63          32           4

Now you have 32 tasks finished, 4 running, and 63 still ready. Eventually, they will all finish and you can inspect the results with jug shell primes.py. This will give you an ipython shell. The primes100 variable is available, but it is an ugly list of jug.Task objects. To get the actual value, you call the value function:

In [1]: primes100 = value(primes100)

In [2]: primes100[:10]
Out[2]: [True, True, False, True, False, True, False, False, False, True]

Testimonials

“I’ve been using jug with great success to distribute the running of a reasonably large set of parameter combinations” - Andreas Longva

What’s New

version 1.6.7 (Fri Apr 13)

Fix issue with deeply recursive dependency structures and barrier()
Allow mapreduce.map() results to be used as dependencies

version 1.6.6 (Sat Apr 7)

Fix bug in shell’s invalidate() function
Fix wrong dependency handling with mapreduce.map()

version 1.6.5 (Mon Mar 12 2018)

Add get_tasks() to ‘jug shell’ and document ‘from jug.task import alltasks’ (patch by Renato Alves)

version 1.6.4 (Thu Nov 2 2017)

Fix exit_after_n_tasks. It would previously execute one task too many

version 1.6.3 (Wed Nov 1 2017)

Add citation request

version 1.6.2 (Thu Oct 26 2017)

Add return_value argument to jug_execute
Add exit_env_vars

version 1.6.1 (Thu Aug 29 2017) - Fix bug with invalidate() in the shell

version 1.6.0 (Thu Aug 24 2017) - Add ‘graph’ subcommand - Generates a graph of tasks - ‘jug execute –keep-going’ now ends with non-zero exit code in case of failures - Fix bug with cleanup in dict_store not providing the number of removed records - Add ‘jug cleanup –keep-locks’ to remove obsolete results without affecting locks

version 1.5.0 (Sun Jul 16 2017) - Add ‘demo’ subcommand - Add is_jug_running() function - Fix bug in finding config files - Improved –debug mode: check for unsupported recursive task creation - Add invalidate() to shell environment - Use ~/.config/jug/jugrc as configuration file - Add experimental support for extensible commands, use ~/.config/jug/jug_user_commands.py - jugrc: execute_wait_cycle_time_secs is now execute_wait_cycle_time - Expose sync_move in jug.utils

version 1.4.0 (Tue Jan 3 2017) - Fix bug with writing very large objects to disk - Smarter handling of –aggressive-unload (do not unload what will be immediately necessary) - Work around corner case in jug shell command - Add test-jug subcommand - Add return_tuple decorator

version 1.3.0 (Tue Nov 1 2016) - Update shell subcommand to IPython 5 - Use ~/.config/jugrc as configuration file - Cleanup usage string - Use bottle instead of web.py for webstatus subcommand - Add jug_execute function - Add timing functionality

version 1.2.2 (Sat Jun 25 2016) - Fix bugs in shell subcommand and a few corner cases in encoding/decoding results

version 1.2.1 (Mon Feb 15 2016) - Changed execution loop to ensure that all tasks are checked (issue #33 on github) - Fixed bug that made ‘check’ or ‘sleep-until’ slower than necessary - Fixed jug on Windows (which does not support fsync on directories) - Made Tasklets use slightly less memory

version 1.2 (Thu Aug 20 2015) - Use HIGHEST_PROTOCOL when pickle()ing - Add compress_numpy option to file_store - Add register_hook_once function - Optimize case when most (or all) tasks are already run - Add –short option to ‘jug status’ and ‘jug execute’ - Fix bug with dictionary order in kwargs (fix by Andreas Sorge) - Fix ipython colors (fix by Andreas Sorge) - Sort tasks in ‘jug status’

version 1.1 (Tue Mar 3 2015) - Python 3 compatibility fixes - fsync(directory) in file backend - Jug hooks (still mostly undocumented, but already enabling internal code simplification)

version 1.0 (Tue May 20 2014) - Adapt status output to terminal width (by Alex Ford) - Add a newline at the end of lockfiles for file backend - Add –cache-file option to specify file for status --cache

version 0.9.7 (Tue Feb 18 2014)

Fix use of numpy subclasses
Fix redis URL parsing
Fix shell for newer versions of IPython
Correctly fall back on non-sqlite status
Allow user to call set_jugdir() inside jugfile

version 0.9.6 (Tue Aug 6 2013)

Faster decoding
Add jug-execute script
Add describe() function
Add write_task_out() function

version 0.9.5 (May 27 2013)

Added debug mode
Even better map.reduce.map using blocked access
Python 3 support
Documentation improvements

For older version see ChangeLog file.

Join the chat at https://gitter.im/luispedro/jug

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

2.3.1

Nov 5, 2023

2.3.0

Jun 24, 2023

2.2.3

May 26, 2023

2.2.2

Jul 18, 2022

2.2.1

May 19, 2022

2.2.0

May 2, 2022

2.1.1

Mar 18, 2021

2.1.0

Mar 18, 2021

2.0.3

Sep 20, 2020

2.0.2

Jun 11, 2020

2.0.1

Jun 11, 2020

2.0.0

Feb 21, 2020

2.0.0rc0 pre-release

Jan 31, 2020

1.6.9

Aug 7, 2019

1.6.8

Jul 10, 2019

This version

1.6.7

Apr 13, 2018

1.6.6

Apr 7, 2018

1.6.5

Mar 12, 2018

1.6.4

Nov 2, 2017

1.6.3

Nov 1, 2017

1.6.2

Oct 26, 2017

1.6.1

Aug 29, 2017

1.6.0

Aug 24, 2017

1.5.0

Jul 16, 2017

1.4.0

Jan 3, 2017

1.3.0

Nov 1, 2016

1.2.2

Jun 25, 2016

1.2.1

Feb 15, 2016

1.2

Aug 20, 2015

1.1

Mar 3, 2015

1.0.1

Jun 15, 2014

1.0

May 20, 2014

1.0rc0 pre-release

Apr 21, 2014

0.9.7

Feb 18, 2014

0.9.6

Aug 6, 2013

0.9.5

May 27, 2013

0.9.4

Apr 15, 2013

0.9.3

Dec 2, 2012

0.9.2

Nov 4, 2012

0.9.1

Jun 11, 2012

0.9

Dec 6, 2011

0.8.1

Jul 5, 2011

0.8

Mar 28, 2011

0.8-b0 pre-release

Mar 10, 2011

0.7.4

Jan 17, 2011

0.7.3

Jan 4, 2011

0.7.2

Nov 4, 2010

0.7.1

Nov 2, 2010

0.7

Oct 21, 2010

0.6.99

Sep 29, 2010

0.6.9

Sep 22, 2010

0.6.2

Sep 14, 2010

0.6.1

Sep 13, 2010

0.6

Jun 2, 2010

0.5.5

Apr 26, 2010

0.5.4

Apr 26, 2010

0.5.3

Apr 26, 2010

0.5.2

Mar 30, 2010

0.5.1

Feb 8, 2010

0.5.0

Jan 25, 2010

0.5.0-rc-1 pre-release

Jan 11, 2010

0.5.0-rc-0 pre-release

Dec 14, 2009

0.5.0-beta-1 pre-release

Nov 20, 2009

0.5.0-beta-0 pre-release

Nov 11, 2009

0.4.1

May 31, 2009

0.4

May 27, 2009

0.4-rc0 pre-release

May 25, 2009

0.2

Jan 19, 2009

0.9.7-git

Apr 21, 2014

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Jug-1.6.7.tar.gz (63.2 kB view hashes)

Uploaded Apr 13, 2018 Source

Hashes for Jug-1.6.7.tar.gz

Hashes for Jug-1.6.7.tar.gz
Algorithm	Hash digest
SHA256	`a7faba838f3437163ae8459bff96e2c6ca1298312bdb9104c702685178d17269`
MD5	`95d8a3b37a1922f7a1001517cfb9d9a6`
BLAKE2b-256	`399e66b684380f13f7c0f80a1a4c9d3195bb64bab1e6d6ee7493122249adaa92`