Dragon is a composable distributed run-time for managing dynamic processes, memory, and data at scale through high-performance communication objects.

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- POSIX :: Linux
Programming Language
- Python :: 3

Project description

Dragon

Dragon is a distributed environment for developing high-performance tools, libraries, and applications at scale. This distribution package provides the necessary components to run the Python multiprocessing library using the Dragon implementation which provides greater scaling and performance improvements over the legacy multiprocessing that is currently distributed with Python.

For examples and the actual source code for Dragon, please visit its github repository: https://github.com/DragonHPC/dragon

Installing Dragon

Dragon currently requires a minimum python version of 3.10 with support for 3.11 and 3.12. Otherwise, just do a pip install:

pip install dragonhpc

After doing the pip install of the package, you have completed the prerequisites for running Dragon multiprocessing programs.

Dragon is built with manylinux2014 support and should function on most Linux distros.

Configuring Dragon's high performance network backend for HSTA

Dragon includes two separate network backend services for communication across compute nodes. The first is referred as the "TCP transport agent". This backend uses common TCP to perform any communication over the compute network. However, this backend is relatively low performing and can be a perforamnce bottleneck.

Dragon also includes the "High Speed Transport Agent (HSTA)", which supports UCX for Infiniband networks and OpenFabrics Interface (OFI) for HPE Slingshot. However, Dragon can only use these networks if its envionment is properly configured.

To configure HSTA, use dragon-config to provide an "ofi-runtime-lib" or "ucx-runtime-lib". The input should be a library path that contains a libfabric.so for OFI or a libucp.so for UCX. These are libraries are dynamically opened by HSTA at runtime. Without them, dragon will fallback to using the lower performing TCP transport agent

Example configuration commands appear below:

# For a UCX backend, provide a library path that contains a libucp.so:
dragon-config add --ucx-runtime-lib=/opt/nvidia/hpc_sdk/Linux_x86_64/23.11/comm_libs/12.3/hpcx/hpcx-2.16/ucx/prof/lib

# For an OFI backend, provide a library path that contains a libfabric.so:
dragon-config add --ofi-runtime-lib=/opt/cray/libfabric/1.22.0/lib64

As mentioned, if dragon-config is not run as above to tell Dragon where to appropriate libraries exist, Dragon will fall back to using the TCP transport agent. You'll know this because a message similar to the following will print to stdout:

Dragon was unable to find a high-speed network backend configuration.
Please refer to `dragon-config --help`, DragonHPC documentation, and README.md
to determine the best way to configure the high-speed network backend to your
compute environment (e.g., ofi or ucx). In the meantime, we will use the
lower performing TCP transport agent for backend network communication.

If you get tired of seeing this message and plan to only use TCP communication over ethernet, you can use the following dragon-config command to silence it:

dragon-config add --tcp-runtime=True

For help without referring to this README.md, you can always use dragon-config --help

Running a Program using Dragon and python multiprocessing

There are two steps that users must take to use Dragon multiprocessing.

You must import the dragon module in your source code and set dragon as the start method, much as you would set the start method for spawn or fork.
```
 import dragon
 import multiprocessing as mp
 ...
 if __name__ == "__main__":
     # set the start method prior to using any multiprocessing methods
     mp.set_start_method('dragon')
     ...
```
This must be done for once for each application. Dragon is an API level replacement for multiprocessing. So, to learn more about Dragon and what it can do, read up on multiprocessing.
You must start your program using the dragon command. This not only starts your program, but it also starts the Dragon run-time services that provide the necesssary infrastructure for running multiprocessing at scale.
```
 dragon myprog.py
```
If you want to run across multiple nodes, simply obtain an allocation through Slurm (or PBS) and then run dragon.
```
 salloc --nodes=2 --exclusive
 dragon myprog.py
```

If you find that there are directions that would be helpful and are missing from our documentation, please make note of them and provide us with feedback. This is an early stab at documentation. We'd like to hear from you. Have fun with Dragon!

Sanity check Dragon installation

Grab the following from the DragonHPC github by cloning the repository or a quick wget: p2p_lat.py

wget https://raw.githubusercontent.com/DragonHPC/dragon/refs/heads/main/examples/multiprocessing/p2p_lat.py .

If testing on a single compute node/instance, you can just do:

dragon p2p_lat.py --dragon
using Dragon
Msglen [B]   Lat [usec]
2  28.75431440770626
4  39.88605458289385
8  37.25141752511263
16  43.31085830926895
+++ head proc exited, code 0

If you're trying to test the same across two nodes connected via a high speed network, try to get an allocation via the workload manager first and then run the test, eg:

salloc --nodes=2 --exclusive
dragon p2p_lat.py --dragon
using Dragon
Msglen [B]   Lat [usec]
2  73.80113238468765
4  73.75898555619642
8  73.52533907396719
16  72.79851596103981

Environment Variables

DRAGON_DEBUG - Set to any non-empty string to enable more verbose logging

DRAGON_DEFAULT_SEG_SZ - Set to the number of bytes for the default Managed Memory Pool. The default size is 4294967296 (4 GB). This may need to be increased for applications running with a lot of Queues or Pipes, for example.

Requirements

Python 3.10
GCC 9 or later
Slurm or PBS+PALS (for multi-node Dragon)

Known Issues

For any issues you encounter, it is recommended that you run with a higher level of debug output. It is often possible to find the root cause of the problem in the output from the runtime. We also ask for any issues you wish to report that this output be included in the report. To learn more about how to enable higher levels of debug logging refer to dragon --help.

Dragon Managed Memory, a low level component of the Dragon runtime, uses shared memory. It is possible that things go wrong while the runtime is coming down and files are left in /dev/shm. Dragon does attempt to clean these up in the chance of a bad exit, but it may not succeed. In that case, running dragon-cleanup on your own will clean up any zombie processes or un-freed memory.

It is possible for a user application or workflow to exhaust memory resources in Dragon Managed Memory without the runtime detecting it. Many allocation paths in the runtime use "blocking" allocations that include a timeout, but not all paths do this if the multiprocessing API in question doesn't have timeout semantics on an operation. When this happens, you may observe what appears to be a hang. If this happens, try increasing the value of the DRAGON_DEFAULT_SEG_SZ environment variable to larger sizes (default is 4 GB, try increasing to 16 or 32 GB). Note this variable takes the number of bytes.

Python multiprocessing applications that switch between start methods may fail with this due to how Queue is being patched in. The issue will be addressed in a later update.

If there is a firewall blocking port 7575 between compute nodes, dragon will hang. You will need to specify a different port that is not blocked through the --port option to dragon. Additionally, if you specify --network-prefix and Dragon fails to find a match the runtime will hang during startup. Proper error handling of this case will come in a later release.

In the event your experiment goes awry, we provide a helper script, dragon-cleanup, to clean up any zombie processes and memory.

Project details

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- POSIX :: Linux
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

0.14.0

May 18, 2026

0.13.2

Mar 30, 2026

0.13.1

Feb 18, 2026

This version

0.13

Nov 6, 2025

0.12.3

Oct 14, 2025

0.12.2

Sep 17, 2025

0.12.1

Aug 8, 2025

0.12.0

Jul 30, 2025

0.12rc4 pre-release

Jul 1, 2025

0.12rc3 pre-release

Jun 29, 2025

0.12rc2 pre-release

May 21, 2025

0.12rc1 pre-release

May 6, 2025

0.12rc0 pre-release

Apr 11, 2025

0.11.1

Mar 20, 2025

0.11

Mar 10, 2025

0.11rc5 pre-release

Mar 3, 2025

0.11rc4 pre-release

Feb 25, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

dragonhpc-0.13-cp312-cp312-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (9.5 MB view details)

Uploaded Nov 6, 2025 CPython 3.12manylinux: glibc 2.17+ x86-64

dragonhpc-0.13-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (9.5 MB view details)

Uploaded Nov 6, 2025 CPython 3.11manylinux: glibc 2.17+ x86-64

dragonhpc-0.13-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (9.1 MB view details)

Uploaded Nov 6, 2025 CPython 3.10manylinux: glibc 2.17+ x86-64

File details

Details for the file dragonhpc-0.13-cp312-cp312-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

Download URL: dragonhpc-0.13-cp312-cp312-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Upload date: Nov 6, 2025
Size: 9.5 MB
Tags: CPython 3.12, manylinux: glibc 2.17+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.12

File hashes

Hashes for dragonhpc-0.13-cp312-cp312-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm	Hash digest
SHA256	`23721bd58b3bebcaed4ab84ef1cb0a0ba003b7ea9c3fca22ab0a1f913bfc1956`
MD5	`eb6b512f8f189f92b91d94e627a04b96`
BLAKE2b-256	`5c8780b132c98c741386c3390a07ab6e3c580f8a861988c20d64f6ea1fa30886`

See more details on using hashes here.

File details

Details for the file dragonhpc-0.13-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

Download URL: dragonhpc-0.13-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Upload date: Nov 6, 2025
Size: 9.5 MB
Tags: CPython 3.11, manylinux: glibc 2.17+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.12

File hashes

Hashes for dragonhpc-0.13-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm	Hash digest
SHA256	`25f9b5817ce41d522221d529a6c5055de07cbce8e6cf7ad277b48fff3a5461c9`
MD5	`03604f052248048a897ec54d6518e55e`
BLAKE2b-256	`cfdc89310709d6ee66de33aa8064c3b42038a2000df4c338318b636cb533cdc5`

See more details on using hashes here.

File details

Details for the file dragonhpc-0.13-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

Download URL: dragonhpc-0.13-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Upload date: Nov 6, 2025
Size: 9.1 MB
Tags: CPython 3.10, manylinux: glibc 2.17+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.12

File hashes

Hashes for dragonhpc-0.13-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm	Hash digest
SHA256	`bf69b8cfcae3b467234250735c4aed4b1dbdce4448cb584b1522ca0d1dce1b4c`
MD5	`10568cd09f7195a98c70b20edc36d9eb`
BLAKE2b-256	`3cde8dd049dec7a96f19eead1379afa4347af3b99f79bc852fb43ab79063c59f`

See more details on using hashes here.

dragonhpc 0.13

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Dragon

Installing Dragon

Configuring Dragon's high performance network backend for HSTA

Running a Program using Dragon and python multiprocessing

Sanity check Dragon installation

Environment Variables

Requirements

Known Issues

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distributions

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes