PyPI recent updates for multireward-grpo

PyPI recent updates for multireward-grpo https://pypi.org/project/multireward-grpo/ Recent updates to the Python Package Index for multireward-grpo en 0.1.0 https://pypi.org/project/multireward-grpo/0.1.0/ Decoupled & conditioned multi-reward GRPO advantage estimators, a generalized trainer, and the Theorem-3 verification harness from the paper 'When and Why Decoupling and Conditioning Beat Reweighting in Multi-Reward GRPO'. eagle0504@gmail.com Tue, 23 Jun 2026 17:44:32 GMT