Skip to main content

This package enables Python to offload sets of function calls to pools of remote worker processes.

Project description

Goliath

Authors

Logan Pulley, Zachary Oldham, Ilie Vartic, Deepan Venkatesh, Manikandan Swaminathan

Abstract

This package enables Python to offload sets of function calls to pools of remote worker processes.

Details

When handling large sets of data, the thread-pool model can often do wonders for parallelizing and thus speeding up a program. However, Python's native support for concurrency is more like polling than threading; it doesn't properly take advantage of multiple CPU cores. This can be frustrating when working in Python with a task that would be easily threadable in other languages.

Goliath enables Python to distribute a set of function calls over a set of servers. This essentially simulates the thread-pool model as a pool of servers, each maintaining a pool of Python worker processes. Additionally, these servers can be reached over the Internet, enabling a many-to-many relationship between clients requesting work and servers providing workers; each client can have work distributed across multiple servers, and each server can handle work from multiple clients.

Goliath abstracts this entire model and aggregates the results from the servers, finally returning the list of results to the coder.

Requirements

  • Python 3.7+

Installation

Install with pip:

pip install goliath

Usage

Sending work (Commander)

# foo.py

from goliath.commander import Commander

# The function to execute
def foo(bar, baz):
    return str(bar) + str(baz)

# Function to generate list of arguments to try
def foo_args(bar_range, baz_range):
    for bar in bar_range:
        for baz in baz_range:
            yield { 'bar': bar, 'baz': baz }

if __name__ == '__main__':
    # Create a commander (doesn't connect yet)
    cmdr = Commander([
        # Lieutenants can be hostnames, domains, IPs
        ('lieutenant-hostname', 8080),
        ('lieutenant.com', 3333),
        ('127.0.0.1', 2222)
    ])
    # Connect to lieutenants, run all the functions, and get results
    results = cmdr.run(foo, list(foo_args(range(10), range(10))), ['foo.py'])

Performing work (Lieutenant & Worker)

To run a lieutenant on this machine on port 3333 with 8 worker processes:

python3 -m goliath.lieutenant localhost 3333 8

Licensing

Goliath is open-source software, licensed under GNU's Lesser GPL.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for goliath, version 0.2.3
Filename, size File type Python version Upload date Hashes
Filename, size goliath-0.2.3-py3-none-any.whl (15.0 kB) File type Wheel Python version py3 Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page