Skip to main content

remote run management tool

Project description

remotemanager

Modular serialisation and management package for handling the running of functions on remote machines

Based off of the BigDFT RemoteRunner concept, remotemanager represents an improvement and expansion on the concepts based there.

Primary usage is via a Dataset, which connects to a remote machine via URL

You can think of the Dataset as a "container" of sorts for a calculation, to which "runs" are attached. These runs are then executed on the remote machine described by the provided URL

Installation

A quick install of the latest stable release can be done via pip install remotemanager

For development, you can clone this repo and install via cd remotemanager && pip install -e .[dev]

Tip: You can clone a specific branch with git clone -b devel.

If you want to build the docs locally a pandoc install is required.

You can install all required python packages with the [dev] or [docs] optionals.

HPC

Remotemanager exists to facilitate running on High Performance Compute machines (supercomputers). Script generation is ideally done via the BaseComputer module.

Existing Computers can be found at this repository. For creating a new machine class, see the documentation.

Documentation

See the documentation for further information, tutorials and api documentation.

Quickstart

This section will run through running a very basic function on a machine.

It roughly echoes the quickstart page found in the docs.

Function Definition

Start by defining your "calculation" as a python function.

def multiply(a, b):
    import time

    time.sleep(1)

    return a * b

Remote Connection

We need to be able to connect to a remote machine to run this function.

Assuming you can connect to a machine with a string like ssh user@host or just ssh machine, you can directly create a connection.

Use the URL module for this.

from remotemanager import URL

connection = URL("ssh@machine")

Commands

You can execute commands on this machine using the cmd method:

connection.cmd("pwd")
>> > "/home/user"

Running Functions

To execute your function on the specified machine, create a Dataset

from remotemanager import Dataset

ds = Dataset(function=multiply, url=connection, name="test")

Adding Runs

You can specify the inputs that your function should use by adding Runner instances to the Dataset

ds.append_run({"a": 10, "b": 7})

Running

Now run your Dataset with run().

You can wait for completion with wait()

ds.run()

ds.wait(1, 10)

Here, we are waiting for a maximum of 10s, and checking for results every 1s

Results

Result collection is done in two stages.

Once a run is complete, we must first fetch the results with fetch_results.

Then we can access the results at the results property

ds.fetch_results()

ds.results

>> > [70]

sanzu

The sanzu functionality allows you to tag a jupyter cell for remote running.

The cell will be converted into a function, set off to the specified remote machine, and executed there.

For detailed information, see the relevant section of the docs

To use this functionality, first enable the magic:

%load_ext
remotemanager

You should then create a URL instance for your machine:

from remotemanager import URL

connection = URL(...)

And now we can execute any cell on this machine by using the %%sanzu protocol:

%%sanzu
url = connection
%%sargs
a = 10
%%sargs
b = 7

a * b

>> > 70

This can be useful for doing file operations on the remote machine, however it is possible to access your results.

Accessing Sanzu Results

A sanzu run will inject a magic_dataset object into the jupyter runtime.

This is the Dataset that was used to execute the most recent cell, so you can access the information from there.

For our last run, we can see our results here:

print(magic_dataset.results)

>> > [70]

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

remotemanager-0.14.3.tar.gz (109.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

remotemanager-0.14.3-py3-none-any.whl (128.0 kB view details)

Uploaded Python 3

File details

Details for the file remotemanager-0.14.3.tar.gz.

File metadata

  • Download URL: remotemanager-0.14.3.tar.gz
  • Upload date:
  • Size: 109.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.8

File hashes

Hashes for remotemanager-0.14.3.tar.gz
Algorithm Hash digest
SHA256 46d68054dae910e97456f50b420d3c68181bcfa56d38e1e24262b99f743fd04d
MD5 d2d152bb56a7457ced62bb5d47c5f03a
BLAKE2b-256 9afbce04cd1ec8c85c4b2290d76f5df951aa815ecad767ffd1fc6344cf073dd1

See more details on using hashes here.

File details

Details for the file remotemanager-0.14.3-py3-none-any.whl.

File metadata

  • Download URL: remotemanager-0.14.3-py3-none-any.whl
  • Upload date:
  • Size: 128.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.8

File hashes

Hashes for remotemanager-0.14.3-py3-none-any.whl
Algorithm Hash digest
SHA256 0c81244eb10268b2475eedc81301484c7aa95f7f06d5c690485163335981ac08
MD5 984f9be105593c2583670e1b42c202a6
BLAKE2b-256 05fc93d1960e9906a3c9210b13d6aeff831622a99c945530c79b9947626a9069

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page