Tensorflow Cluster on Ray
Project description
How to Use?
tf_cluster = TensorflowCluster.build(resources=
{
"ps": {"cores": "2", "memory": "2", "gpu": "2", "instances": "1"},
"worker": {"cores": "2", "memory": "2", "gpu": "2", "instances": "1"},
"chief": {"cores": "2", "memory": "2", "gpu": "2", "instances": "1"}
},
event_log="/tmp/opal/4"
)
tf_cluster.start(model_process=process, args=None)
This training code will be attached to the existed perm-Ray cluster. If you want to debug, you can use ray.init() to init Ray cluster in local.
When you specify the event_log in tf builder, sidecar tensorboard will be started on one worker.
How to build
[Requirement] python -m pip install twine
python setup.py bdist_wheel –universal
python -m pip install xxxxxx.whl
Tips
To solve the problem of Python module importing on Ray perm-cluster, this project must use Ray 1.5+ version, refer to this RFC(https://github.com/ray-project/ray/issues/14019)
This project is only be tested by Tensorflow estimator training
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
File details
Details for the file raytf-0.0.1rc2-py2.py3-none-any.whl
.
File metadata
- Download URL: raytf-0.0.1rc2-py2.py3-none-any.whl
- Upload date:
- Size: 6.6 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.23.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.7.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0842c95517e9376adfd38c167ced6bc7719a2980f341fd43f6c69bfbcc1244c6 |
|
MD5 | bf38590ca769c126188312e127a619b9 |
|
BLAKE2b-256 | d534f90d1309661613af711788293310df99e4a7337769637cde234e3d686d03 |