Last released Jun 20, 2024
ReaL: Efficient RLHF Training of Large Language Models with Parameter Reallocation
Supported by