Last released Sep 5, 2024
ReaL: Efficient RLHF Training of Large Language Models with Parameter Reallocation
Supported by