Skip to main content

Instance tree generation for organization or higher throughput submission

Project description

Flux Hierarchy

Create trees of Flux instances

🚧 under development and experimental 🚧

PyPI version

https://github.com/converged-computing/flux-hierarchy/blob/main/img/flux-hierarchy-small.png?raw=true

This tool enables generation and orchestration of Flux hierarchies, or trees of instances. Such a setup can enable programmatic organization and submission of commands, or high throughput. Use cases we want to address:

  • Creation (and organization) of a Flux Hierarchy
  • Discovery of an existing Flux Hierarchy (e.g, for MCP)

Usage

Let's first create a hierarchy. This will be a Flux job. You'll need to be in a Flux instance where a handle is discoverable. E.g., in the DevContainer:

flux start

Then create a simple, flat hierarchy with all the resources allocated to one broker.

flux-hierarchy start ./examples/hierarchy-one.yaml

You can test throughput (this also starts the hierarchy):

flux-hierarchy throughput ./examples/hierarchy-one.yaml

For either of the above, the hierarchy will continue running (and you need to cancel the job).

flux cancel $(flux job last)

You can also view the shape of the hierarchy without running anything:

flux-hierarchy view ./examples/hierarchy-one.yaml
$ flux-hierarchy view ./examples/corona/hierarchy-2.yaml
=>
🌿 Leaf Broker Workers...{}
level1 [Nodes: 2]
    ├── level2 [Nodes: 1, Cores: 48]
    └── level2 [Nodes: 1, Cores: 48]

Development

To build and release:

python3 -m build
# or
python3 setup.py sdist bdist_wheel

twine upload dist/flux-hierarchy-<version>*

WIP / TODO / Would be nice

  • I can't remember command to get <host>:<rank> mapping (I came up with something)
  • Use kvs for uris, saving results, etc. instead of the local dir.
  • Have local throughput wait for results not rely on filesystem results (use job wait)
  • Some means to deploy submit to node as a service on the node (that knows about URIs)
  • Save result to kvs or similar (not filesystem)
  • Should be able to read in directory of active sockets to generate tree
  • Allow different job shapes / specs.
  • Expose simulation duration time
  • Expose other resource params
  • A check for non-local mode that we aren't going to ddos ssh.

License

HPCIC DevTools is distributed under the terms of the MIT license. All new contributions must be made under this license.

See LICENSE, COPYRIGHT, and NOTICE for details.

SPDX-License-Identifier: (MIT)

LLNL-CODE- 842614

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flux_hierarchy-0.0.13.tar.gz (23.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

flux_hierarchy-0.0.13-py3-none-any.whl (25.0 kB view details)

Uploaded Python 3

File details

Details for the file flux_hierarchy-0.0.13.tar.gz.

File metadata

  • Download URL: flux_hierarchy-0.0.13.tar.gz
  • Upload date:
  • Size: 23.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.2

File hashes

Hashes for flux_hierarchy-0.0.13.tar.gz
Algorithm Hash digest
SHA256 6d9e9cf5d47cb2fe48a5f2e08a2a69c8f182e26b3aa4f57f860ae50ccf788539
MD5 97a37b7774b60a0e1c879c4fe1bcea3a
BLAKE2b-256 53d4d516b11c7e7dd59f46a16f49a3551ec40a2dcc76f017dc77b8b220ebf322

See more details on using hashes here.

File details

Details for the file flux_hierarchy-0.0.13-py3-none-any.whl.

File metadata

File hashes

Hashes for flux_hierarchy-0.0.13-py3-none-any.whl
Algorithm Hash digest
SHA256 12baff4e4a36a50b7cec58e51a6cada80a0e5c44e3b38dcd2c41bd42333c88f6
MD5 230239f2f627d83b6c2f8a37de67f2d4
BLAKE2b-256 4fb33bd50f0ef7f44e3a3a9d47a45cb94e44156c4a20c6178d85252be3e0d384

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page