multinode

Multinode's Python client

These details have not been verified by PyPI

Project description

What is Multinode?

Multinode lets you rapidly deploy cloud applications that perform asynchronous tasks.

Consider using Multinode if your application runs tasks that:

are triggered on demand by the user of the application;
take on the order of minutes or hours to complete;
require expensive hardware that should be provisioned only when required.

For example, Multinode can be used within:

a document/image/video processing app
a data analytics app
a scientific computing app

The main benefits of Multinode are:

Minimal boilerplate: Cloud API calls, cloud permissions, task lifecycle management and task data storage are abstracted away.
Responsive scaling: Compute resources are spun up as soon as a task is created, and torn down as soon as the task is complete.

Quick start

Deploy the Multinode control plane into your AWS account. (Instructions and Terraform code provided in the aws-infra folder.)

Install the Multinode Python package and authenticate with the Multinode control plane.

pip install multinode
multinode login

Define the task as a Python function.

# File: tasks/main.py

from multinode import Multinode

mn = Multinode()

@mn.function(cpu=4.0, memory="16 GiB")
def run_expensive_task(x):
    out = ...  # placeholder: replace with the details of the task
    return out

Deploy the tasks/ folder to the Multinode control plane.

multinode deploy tasks/ --project-name=my_project

Implement the rest of the application, invoking the function when needed.

# File: application/main.py
# NB can be a different codebase from tasks/

from multinode import get_deployed_function

run_expensive_task = get_deployed_function(
    project_name="my_project",
    function_name="run_expensive_task"
)

# ... other code ...

# Start a task invocation.
# The computation runs on *remote* hardware, which is *provisioned on demand*.
invocation_id = run_expensive_task.start(x=10000)

# ... other code ...

# Get the status of the task invocation, and the result (if available)
invocation = run_expensive_task.get(invocation_id)
print(invocation.status)  # e.g. PENDING, RUNNING, SUCCEEDED
print(invocation.result)  # e.g. 12345 (if available), or None (if still running)
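
Because start() returns immediately, an application that needs the result has to check back later. The sketch below shows one way to poll until a task leaves its in-flight states. The helper name wait_for_invocation, the polling interval, the timeout, and the rule that anything other than PENDING or RUNNING is terminal are all assumptions made for illustration; only get(), status, result and the three status names come from the quick start above. The helper takes the get function as an argument, so the demo runs against a stand-in object rather than a real deployment.

```python
import time
from types import SimpleNamespace

# Assumption: any status other than these two means the task has finished.
IN_FLIGHT = {"PENDING", "RUNNING"}


def status_name(status):
    """Return a plain string whether the status is a str or an Enum member."""
    return getattr(status, "name", str(status))


def wait_for_invocation(get_invocation, invocation_id, poll_seconds=5.0,
                        timeout_seconds=3600.0, sleep=time.sleep,
                        clock=time.monotonic):
    """Poll get_invocation(invocation_id) until the task is no longer in flight.

    get_invocation would be run_expensive_task.get in the quick start.
    Raises TimeoutError if the task is still in flight after timeout_seconds.
    """
    deadline = clock() + timeout_seconds
    while True:
        invocation = get_invocation(invocation_id)
        if status_name(invocation.status) not in IN_FLIGHT:
            return invocation
        if clock() >= deadline:
            raise TimeoutError(f"invocation {invocation_id} is still in flight")
        sleep(poll_seconds)


def make_fake_get(statuses, result):
    """Hypothetical stand-in for a deployed function's get(), for local demos."""
    remaining = iter(statuses)

    def fake_get(invocation_id):
        status = next(remaining)
        value = result if status == "SUCCEEDED" else None
        return SimpleNamespace(status=status, result=value)

    return fake_get


# Demo against the stand-in; sleeping is disabled so it finishes instantly.
fake_get = make_fake_get(["PENDING", "RUNNING", "SUCCEEDED"], result=12345)
finished = wait_for_invocation(fake_get, "demo-id", sleep=lambda seconds: None)
print(finished.status, finished.result)
```

In a real application you would pass run_expensive_task.get as the first argument and keep the default sleep. A fixed polling interval is the simplest choice; a growing interval may suit tasks that run for hours.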

Further functionality

In addition to the above basic functionality, Multinode allows you to:

Expose progress updates from an in-flight task.
Cancel a task programmatically.
Implement retries in case of code errors or hardware failures.
Configure timeouts and concurrency limits.
Spawn subtasks from a parent task.
Inspect task logs.
Add custom Python dependencies and environment variables.
Manage the lifecycle of the deployed application.

For further details, see the reference guide or the worked example.

Approaches to scaling: When to use Multinode?

Multinode's approach: Direct resource provisioning. Multinode makes direct API calls to the cloud provider, to provision a new worker for each new task.

Alternative approach: Autoscaling a warm worker pool. Popular alternative frameworks for asynchronous tasks include Celery and Kafka consumers. Applications written in these frameworks usually run on a warm pool of workers. Each worker stays alive between task executions. The number of workers is autoscaled according to some metric (e.g. the number of pending tasks).

Advantages of Multinode's approach:

Scales up immediately when new tasks are created; scales down immediately when a task finishes.
No risk of interrupting a task execution when scaling down.

Advantages of the alternative warm-pool-based approach:

More suitable for processing a higher volume of shorter-lived tasks.
Can maintain spare capacity to mitigate cold starts.
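
The trade-off between the two approaches can be made concrete with a toy cost model. Everything in the sketch below is an assumption chosen for illustration: the 60-second cold start, the workloads, and the simplification that the warm pool has a fixed size and never queues tasks (a real pool would be autoscaled). It counts billed worker-seconds only and is not a measurement of Multinode or of any other framework.

```python
def direct_provisioning_cost(task_durations, cold_start_seconds):
    """Worker-seconds billed when every task gets its own fresh worker."""
    return sum(duration + cold_start_seconds for duration in task_durations)


def warm_pool_cost(pool_size, horizon_seconds):
    """Worker-seconds billed when a fixed pool stays alive for the whole period."""
    return pool_size * horizon_seconds


def cold_start_overhead(task_duration, cold_start_seconds):
    """Fraction of a directly provisioned worker's lifetime spent starting up."""
    return cold_start_seconds / (task_duration + cold_start_seconds)


DAY = 24 * 60 * 60
COLD_START = 60  # hypothetical provisioning delay, in seconds

few_long_tasks = [30 * 60] * 3   # three 30-minute tasks in a day
many_short_tasks = [2] * 20_000  # twenty thousand 2-second tasks in a day

for label, tasks in [("few long tasks", few_long_tasks),
                     ("many short tasks", many_short_tasks)]:
    direct = direct_provisioning_cost(tasks, COLD_START)
    pool = warm_pool_cost(pool_size=1, horizon_seconds=DAY)
    overhead = cold_start_overhead(tasks[0], COLD_START)
    print(f"{label}: direct={direct} pool={pool} "
          f"cold-start overhead={overhead:.1%}")
```

Under these assumptions, per-task provisioning bills far fewer worker-seconds for the few long tasks, while the single warm worker is cheaper for the many short tasks, because the cold start dominates each short task's lifetime.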

Architecture

Currently, Multinode runs on AWS, using ECS/Fargate for the asynchronous tasks.

A (slightly simplified) architecture diagram is shown below.

[Figure: architecture diagram]

With minimal API changes, the framework can be extended to other AWS compute engines (e.g. EC2 with GPUs), to other cloud providers, and to Kubernetes.

We may implement these extensions if there is demand. We also welcome contributions from the open source community in this regard.

Currently, you need to deploy Multinode in your own AWS account. (Terraform is provided in the aws-infra folder.) We may offer Multinode as a managed service in the future.

Programming language support

Python is the only supported language at the moment.

If you need to invoke a deployed Python function from an application written in another language, such as JavaScript, you will need to use the REST API. (Or you can contribute a JavaScript client!)

Let us know if you want to define your functions in other languages.

Project details

Release history

1.0.1: Oct 31, 2023 (this version)
1.0.0: Oct 30, 2023
0.0.14: Oct 25, 2023
0.0.12: Oct 25, 2023
0.0.11: Oct 24, 2023
0.0.10: Oct 23, 2023
0.0.9: Oct 23, 2023
0.0.8: Oct 23, 2023
0.0.7: Oct 21, 2023
0.0.6: Oct 21, 2023
0.0.5: Oct 21, 2023
0.0.4: Oct 21, 2023
0.0.3: Oct 15, 2023
0.0.2: Oct 15, 2023
0.0.1: Sep 20, 2023
0.0.0: Sep 17, 2023

Download files

Source Distributions

No source distribution files available for this release.

Built Distribution

multinode-1.0.1-py3-none-any.whl (143.7 kB)

Uploaded Oct 31, 2023 Python 3

File details

Details for the file multinode-1.0.1-py3-none-any.whl.

File metadata

Download URL: multinode-1.0.1-py3-none-any.whl
Upload date: Oct 31, 2023
Size: 143.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.6.1 CPython/3.8.18 Linux/6.2.0-1015-azure

File hashes

Hashes for multinode-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`793ce3cf1f94530e908cbd4b2df51cca85219aeeb084b0a3bb642598aeb972d8`
MD5	`ede37f1e114c9261b4d5b8eff92655d9`
BLAKE2b-256	`60d195adac8772f39f870f94081ebee94ca24071d642804baf0b33518877dfa8`
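
If you download the wheel manually, you can compare it against the SHA256 digest listed above before installing it. The following is a minimal sketch using only the Python standard library. The function names and the chunk size are illustrative choices, and the file path in the usage note is an assumption about where you saved the wheel.

```python
import hashlib
import hmac

# SHA256 digest published above for multinode-1.0.1-py3-none-any.whl.
EXPECTED_SHA256 = "793ce3cf1f94530e908cbd4b2df51cca85219aeeb084b0a3bb642598aeb972d8"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected hex digest."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.lower())
```

For example, matches_expected("multinode-1.0.1-py3-none-any.whl") should return True for an intact download saved in the current directory.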