Last released Jun 5, 2026
A TRL-like training library for end-to-end skill optimization of frozen LLM agents (based on Microsoft SkillOpt).
Supported by