Distributed torch training using horovod and slurm
Project description
dmlcloud
Flexibel, easy-to-use, opinionated
dmlcloud is a library for distributed training of deep learning models with torch. Its main aim is to do all these tiny little tedious things that everybody just copy pastes over and over again, while still giving you full control over the training loop and maximum flexibility.
Unlike other similar frameworks, such as lightning, dmcloud tries to add as little additional complexity and abstraction as possible. Instead, it is tailored towards a careful selected set of libraries and workflows and sticks with them.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
dmlcloud-0.3.1-py3-none-any.whl
(20.5 kB
view hashes)