PyTorch plugin for Flyte
Union PyTorch Plugin
Union can execute PyTorch distributed training jobs natively on a Kubernetes cluster, with the lifecycle of worker pods, rendezvous coordination, spin-up, and teardown managed for you. It leverages the open-source TorchElastic (torch.distributed.elastic) launcher and the Kubeflow PyTorch Operator, enabling fault-tolerant and elastic training across multiple nodes.
This is like running a transient PyTorch cluster: worker groups are created for the specific job and torn down automatically after completion. Elastic training allows nodes to scale in and out, and failed workers can be restarted without bringing down the entire job.
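To make the worker model more concrete, here is a minimal sketch of how a training process might discover its place in the job. It does not use this plugin's API; it only reads the environment variables that the torch.distributed.elastic launcher sets for each worker (RANK, LOCAL_RANK, WORLD_SIZE, TORCHELASTIC_RESTART_COUNT). The `WorkerContext` class and `read_worker_context` function are hypothetical names introduced for illustration.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class WorkerContext:
    """Hypothetical container for the values the elastic launcher exports."""
    rank: int           # global rank of this worker across all nodes
    local_rank: int     # rank of this worker on its own node
    world_size: int     # total number of workers currently in the job
    restart_count: int  # how many times the worker group has been restarted


def read_worker_context(env=None) -> WorkerContext:
    """Read launcher-provided variables, falling back to single-process defaults."""
    env = os.environ if env is None else env
    return WorkerContext(
        rank=int(env.get("RANK", "0")),
        local_rank=int(env.get("LOCAL_RANK", "0")),
        world_size=int(env.get("WORLD_SIZE", "1")),
        restart_count=int(env.get("TORCHELASTIC_RESTART_COUNT", "0")),
    )


# Example with an explicit mapping standing in for a launched worker's environment.
ctx = read_worker_context({"RANK": "3", "LOCAL_RANK": "1", "WORLD_SIZE": "8"})
print(f"worker {ctx.rank} of {ctx.world_size}, restart {ctx.restart_count}")
```

Because the world size can change when nodes scale in or out, and the restart count increases when the worker group is restarted, a training script would typically read these values at startup rather than hard-coding them, and resume from its latest checkpoint when the restart count is non-zero.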
To install the plugin, run the following command:
pip install --pre flyteplugins-pytorch
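After installing, one way to confirm that the package is visible to the current interpreter is to query its distribution metadata. The sketch below uses only the standard library; `installed_version` is a hypothetical helper name, and the only value taken from this page is the distribution name `flyteplugins-pytorch`.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name: str):
    """Return the installed version string for dist_name, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Prints a version string if the plugin is installed in this environment, else None.
print(installed_version("flyteplugins-pytorch"))
```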
Built Distribution
File details
Details for the file flyteplugins_pytorch-2.1.7-py3-none-any.whl.
File metadata
- Download URL: flyteplugins_pytorch-2.1.7-py3-none-any.whl
- Upload date:
- Size: 8.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 11dd39580541a4a83f26457e7b7c2c1ff1c0226fd22839495f08f7eaabe80c0f |
| MD5 | 7b6dcf50a037a5a883b06ca411fc0c42 |
| BLAKE2b-256 | 58ace9050b7c9d8287538d668cd333c0cdbb804c420eb03416f2e6a32e3f0a46 |
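The published digests can be used to check a downloaded wheel before installing it. The following is a minimal sketch using the standard library's `hashlib`; the helper names `sha256_of` and `matches_expected` are hypothetical, and the local path in the commented usage line assumes the wheel was saved to the current directory under its original file name.

```python
import hashlib

# SHA256 digest listed for flyteplugins_pytorch-2.1.7-py3-none-any.whl.
EXPECTED_SHA256 = "11dd39580541a4a83f26457e7b7c2c1ff1c0226fd22839495f08f7eaabe80c0f"


def sha256_of(path, chunk_size=1 << 20) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256) -> bool:
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected.lower()


# Usage (assumes the wheel is in the current directory):
# matches_expected("flyteplugins_pytorch-2.1.7-py3-none-any.whl")
```

Alternatively, pip can enforce the same digest at install time through its hash-checking mode, where the requirement line carries a `--hash=sha256:...` option.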