9 projects
nemo-gym
NeMo Gym is a library for building reinforcement learning environments
megatron-core
Megatron Core - a library for efficient and scalable training of transformer-based models
megatron-fsdp
Megatron-FSDP is an NVIDIA-developed PyTorch extension that provides a high-performance implementation of Fully Sharded Data Parallelism (FSDP)
megatron-bridge
Megatron Bridge: Training Recipes for Megatron-based LLMs and VLMs
nemo-emerging-optimizers
A research project for emerging optimizers other than AdamW
nv-emerging-optimizers
A research project for emerging optimizers other than AdamW
emerging-optimizers
No description provided
nvFSDP
nvFSDP - NVIDIA fork of Fully Sharded Data Parallelism that is cross-compatible with Megatron / TransformerEngine and native PyTorch.
nemo-rl
NeMo RL: A Scalable and Efficient Post-Training Library for Models Ranging from 1 GPU to 1000s, and from Tiny to >100B Parameters