[WIP] Manage a large amount of computed resources, such as files, imports, etc.

These details have not been verified by PyPI

Project links

Project description

hadrosaur — computed resource management

logo

Do you want to compute thousands of resources (files, metadata, database imports, etc) in parallel, but have a hard time tracking completion status, errors, logs, and other runtime data? That's what this is for.

Work in progress

Quick usage tutorial

Install

pip install hadrosaur

Define a resource collection

Import the lib and initialize a project using a base directory. Files, metadata, and logs will all get stored under this directory.

from hadrosaur import Project

proj = Project('./base_directory')

Define a collection using a decorator around a function. The collection should have a unique name and must take these params:

ident — an identifier (unique across the collection) for each computed resource
args — a dictionary of optional arguments
subdir — the path of a directory in which you can store files for this resource

@proj.resource('collection_name')
def compute_resource(ident, args, subdir):
  # Run some things...
  # Maybe save stuff into subdir... 
  time.sleep(1)
  # Return metadata for the resource, such as run results, filepaths, etc.
  return {'ts': time.time()}

Fetch a resource

Use the proj.fetch(collection_name, ident, args) method to compute and cache resources in a collection.

If the resource has not yet been computed, the function will be run
If the resource was already computed in the past, then the saved results will get returned instantly
If an error is thrown in the function, logs will be saved and the status will be updated
If the function is backgrounded, then fetching the resource will show a "pending" status

>> proj.fetch('collection_name', 'uniq_ident123', optional_args)
{
  'result': {'some': 'metadata'},
  'status': {'completed': True, 'pending': False, 'error': False},
  '_paths': {
    'base': 'base_directory/collection_name/uniq_ident123',
    'error': 'base_directory/collection_name/uniq_ident123/error.log',
    'stdout': 'base_directory/collection_name/uniq_ident123/stdout.log',
    'stderr': 'base_directory/collection_name/uniq_ident123/stderr.log',
    'status': 'base_directory/collection_name/uniq_ident123/status.json',
    'result': 'base_directory/collection_name/uniq_ident123/result.json',
    'storage': 'base_directory/collection_name/uniq_ident123/storage/'
  }
}

Descriptions of each of the returned fields:

'result': any JSON-serializable data returned by the resource's function
'status': whether the resource has been computed already ("completed"), is currently being computed ("pending"), or threw a Python error while running the function ("error")
'paths': All the various filesystem paths associated with your resource
- 'base': The base directory that holds all data for the resource
- 'error': A Python stacktrace of any error that occured while running the resource's function
- 'stdout': A line-by-line log file of stdout produced by the resource's function (any print() calls)
- 'stderr': A line-by-line log of stderr messages printed by the resource's function (any sys.stderr.write calls)
- 'status': a JSON object of status keys for the resource ("completed", "pending", "error")
- 'result': Any JSON serializable data returned by the resource's function
- 'storage': Additional storage for any files written by the resource's function

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.4.1

Mar 12, 2020

0.4.0

Feb 19, 2020

0.3.2

Feb 7, 2020

0.3.1

Feb 7, 2020

0.3.0

Feb 6, 2020

0.2.0

Feb 6, 2020

0.1.0

Feb 1, 2020

This version

0.0.3

Jan 31, 2020

0.0.2

Jan 31, 2020

0.0.1

Jan 31, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hadrosaur-0.0.3.tar.gz (4.1 kB view hashes)

Uploaded Jan 31, 2020 Source

Built Distribution

hadrosaur-0.0.3-py3-none-any.whl (4.0 kB view hashes)

Uploaded Jan 31, 2020 Python 3

Hashes for hadrosaur-0.0.3.tar.gz

Hashes for hadrosaur-0.0.3.tar.gz
Algorithm	Hash digest
SHA256	`3f863bad9c3761956ec36027b95052b874b0662eedf326c4804e86f01c67b165`
MD5	`88afe8d0efceda843021a46a96db237b`
BLAKE2b-256	`8f456391c224a2cfe2a5bcee20b9c463961f5bb133eface97eea5af3ad109444`

Hashes for hadrosaur-0.0.3-py3-none-any.whl

Hashes for hadrosaur-0.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`02ac8aa91969519e99a94b319ad3bc10de37e079c17f912b5b4d2550a4d41e56`
MD5	`237f26ca9bd07119e5ab9447292414ed`
BLAKE2b-256	`038bbc2752a1c1056f473711ff28c58c0867bf25c52c77f4685649a5ab56f664`