A tool for launching and running commands on a cluster of EC2 instances
Project description
ec2-cluster
Simple library and CLI to manage and work with clusters of EC2 instances. Multi-purpose, but created to make distributed deep learning infrastructure easier.
Goals
- Provide the minimal set of features to run distributed deep learning training jobs on EC2 instances.
- Provide libraries, not a framework or platform.
- Make cluster environments reproducible to allow for parallelization of experiments
- Make cluster launches fast
- Focus on iterative, not disruptive, improvements on the common methodology of manually launching EC2 instances, ssh-ing to them, configuring environments by hand and running training scripts*
Quick Start
Libraries
EC2Node and EC2NodeCluster are classes for working with EC2 instances.
RemoteShell and ClusterShell
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ec2_cluster-0.0.0a1.tar.gz
(11.9 kB
view hashes)
Built Distribution
Close
Hashes for ec2_cluster-0.0.0a1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3705bfdcb1b2bfb9a1bc2d0b9ffe8f2966ff8e847c646862d174cbbbd3903a71 |
|
MD5 | 20cf7501a1f0e4088f56c9ce636acb70 |
|
BLAKE2b-256 | da6bb8e7694fd584a6d33fc075752cda0fc07dba488d82b0ccc1583ea43293a6 |