No project description provided
Project description
MindSpore with Ascend on ModelArts Launcher
A simple and clean launcher helps you train deep model using MindSpore with Ascend on ModelArts(ROMA or HuaweiCould), no bells and whistles.
[!NOTE] This project is a shameless wrapper of scripts from HuaweiCloud, all credit goes to them.
Installation
pip install ma2l
Or, you can get pre-released version from test.pypi.org
pip install -i https://test.pypi.org/simple/ ma2l
Usage
Just submit the following command to your training job on cloud:
ma2l YOUR_TRAINING_COMMAND
For example, YOUR_TRAINING_COMMAND
might be like:
python your_train_script.py \
--arg1=value1 \
--arg2=value2 \
...
See the difference between the commands on the local machine and on the cloud:
- python your_train_script.py \
+ ma2l python your_train_script.py \
--arg1=value1 \
--arg2=value2 \
...
[!IMPORTANT] Don't forget to pass the argument that turns on distributed training to your training script, if it requires one.
Features
- No need to change a single line of the training script.
- There's no need to set any distribution-related environment variables, and we'll take care of everything for you.
- Supports a variety of hardware settings:
- single-node, single-npu
- single-node, multi-npus
- multi-node, multi-npus
- Modularity. Fully decoupled from your training code/repository.
Philosophies
So, what happens under the hood? After you have created a training job on ModelArts, the launcher does the following:
- Generate HCCL configuration files on each node, typically named
rank_table.json
, which is necessary for distributed training. - Automatically start n processes on each node for
YOUR_TRAINING_COMMAND
, based on the settings when you created the job. - Set the environment variables for each process on each node, such as
RANK_TABLE_FILE
,DEVICE_ID
,RANK_ID
, etc.
Simple and Easy, right? You don't need to change any code to adapt training scripts from your local machine to the cloud, and you don't need to struggle with environment variable settings on the local machine. Keep it in mind.
FAQs
What does ma2l
mean?
ma2l
is the abbreviation for MindSpore with Ascend on ModelArts Launcher.
Credits
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file ma2l-0.1.2.tar.gz
.
File metadata
- Download URL: ma2l-0.1.2.tar.gz
- Upload date:
- Size: 10.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 947fac6348b0fe77524ae6137d5eb8771f2798f2c7a5252e57a207f4dd8e8801 |
|
MD5 | ee86513c008a9557feb409e2f9b5366d |
|
BLAKE2b-256 | 2368813b2247c9b7c5f5212742d11b4c968a5815c82861606c821d902817eebc |
File details
Details for the file ma2l-0.1.2-py3-none-any.whl
.
File metadata
- Download URL: ma2l-0.1.2-py3-none-any.whl
- Upload date:
- Size: 10.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b1d798e5ef306f182e45af4af58e5264ed10acbb8cdc8bec175ebadadb8f53c8 |
|
MD5 | c7133f4487cde2c3c663e4d405d5dfdc |
|
BLAKE2b-256 | 2095305a3008a9dde7855f4002a131ac8bbca8fd15da1f97126e2b85ca1d3b4b |