Last released Jun 6, 2024
Train transformer language models with reinforcement learning.
Last released May 17, 2024
Parameter-Efficient Fine-Tuning (PEFT)