Skip to main content

remote run management tool

Project description

remotemanager

Modular serialisation and management package for handling the running of functions on remote machines

Based off of the BigDFT RemoteRunner concept, remotemanager represents an improvement and expansion on the concepts based there.

Primary usage is via a Dataset, which connects to a remote machine via URL

You can think of the Dataset as a "container" of sorts for a calculation, to which "runs" are attached. These runs are then executed on the remote machine described by the provided URL

Installation

A quick install of the latest stable release can be done via pip install remotemanager

For development, you can clone this repo and install via cd remotemanager && pip install -e .[dev]

Tip: You can clone a specific branch with git clone -b devel.

If you want to build the docs locally a pandoc install is required.

You can install all required python packages with the [dev] or [docs] optionals.

HPC

Remotemanager exists to facilitate running on High Performance Compute machines (supercomputers). Script generation is ideally done via the BaseComputer module.

Existing Computers can be found at this repository. For creating a new machine class, see the documentation.

Documentation

See the documentation for further information, tutorials and api documentation.

Quickstart

This section will run through running a very basic function on a machine.

It roughly echoes the quickstart page found in the docs.

Function Definition

Start by defining your "calculation" as a python function.

def multiply(a, b):
    import time

    time.sleep(1)

    return a * b

Remote Connection

We need to be able to connect to a remote machine to run this function.

Assuming you can connect to a machine with a string like ssh user@host or just ssh machine, you can directly create a connection.

Use the URL module for this.

from remotemanager import URL

connection = URL("ssh@machine")

Commands

You can execute commands on this machine using the cmd method:

connection.cmd("pwd")
>> > "/home/user"

Running Functions

To execute your function on the specified machine, create a Dataset

from remotemanager import Dataset

ds = Dataset(function=multiply, url=connection, name="test")

Adding Runs

You can specify the inputs that your function should use by adding Runner instances to the Dataset

ds.append_run({"a": 10, "b": 7})

Running

Now run your Dataset with run().

You can wait for completion with wait()

ds.run()

ds.wait(1, 10)

Here, we are waiting for a maximum of 10s, and checking for results every 1s

Results

Result collection is done in two stages.

Once a run is complete, we must first fetch the results with fetch_results.

Then we can access the results at the results property

ds.fetch_results()

ds.results

>> > [70]

sanzu

The sanzu functionality allows you to tag a jupyter cell for remote running.

The cell will be converted into a function, set off to the specified remote machine, and executed there.

For detailed information, see the relevant section of the docs

To use this functionality, first enable the magic:

%load_ext
remotemanager

You should then create a URL instance for your machine:

from remotemanager import URL

connection = URL(...)

And now we can execute any cell on this machine by using the %%sanzu protocol:

%%sanzu
url = connection
%%sargs
a = 10
%%sargs
b = 7

a * b

>> > 70

This can be useful for doing file operations on the remote machine, however it is possible to access your results.

Accessing Sanzu Results

A sanzu run will inject a magic_dataset object into the jupyter runtime.

This is the Dataset that was used to execute the most recent cell, so you can access the information from there.

For our last run, we can see our results here:

print(magic_dataset.results)

>> > [70]

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

remotemanager-0.14.2.tar.gz (109.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

remotemanager-0.14.2-py3-none-any.whl (128.1 kB view details)

Uploaded Python 3

File details

Details for the file remotemanager-0.14.2.tar.gz.

File metadata

  • Download URL: remotemanager-0.14.2.tar.gz
  • Upload date:
  • Size: 109.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.8

File hashes

Hashes for remotemanager-0.14.2.tar.gz
Algorithm Hash digest
SHA256 b09fcac5c92b4d222ef4f0568b1d2028e78586749e25da0dbc21df831c01a8a4
MD5 729b951311c60b4503def36a4bbf4546
BLAKE2b-256 b3c17240a7e29ade605a51cc9dc4d966bc2841b4ba3c7fd088596fb115b813eb

See more details on using hashes here.

File details

Details for the file remotemanager-0.14.2-py3-none-any.whl.

File metadata

  • Download URL: remotemanager-0.14.2-py3-none-any.whl
  • Upload date:
  • Size: 128.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.8

File hashes

Hashes for remotemanager-0.14.2-py3-none-any.whl
Algorithm Hash digest
SHA256 ce4aeac109182d514c3de5a2514bab4dd3679cad12b7ea725665792c95a7076f
MD5 18f429b795a1ee43f5cea9b043751b8f
BLAKE2b-256 684942333ead54345b67aaa7ecf396110df919b8c93e160f3dd5389b73404410

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page