No project description provided
Project description
A unified platform for building, evaluating, and applying reward models.
Documentation | Examples | 中文
News
- 2025-10-20 - Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling - We released a new paper on learning generalizable reward criteria for robust modeling.
- 2025-10-17 - Taming the Judge: Deconflicting AI Feedback for Stable Reinforcement Learning - We introduced techniques to align judge feedback and improve RL stability.
- 2025-07-09 - Released RM-Gallery v0.1.0 on PyPI
Installation
RM-Gallery requires Python 3.10 or higher (< 3.13).
pip install rm-gallery
Or install from source:
git clone https://github.com/modelscope/RM-Gallery.git
cd RM-Gallery
pip install .
Quick Start
from rm_gallery.core.reward.registry import RewardRegistry
from rm_gallery.core.data.schema import DataSample
# Choose from 35+ pre-built reward models
rm = RewardRegistry.get("safety_listwise_reward")
# Evaluate your data
sample = DataSample(...)
result = rm.evaluate(sample)
See the quickstart guide for a complete example, or try our interactive notebooks.
Features
Pre-built Reward Models
Access 35+ reward models for different domains:
rm = RewardRegistry.get("math_correctness_reward")
rm = RewardRegistry.get("code_quality_reward")
rm = RewardRegistry.get("helpfulness_listwise_reward")
Custom Reward Models
Build your own reward models with simple APIs:
from rm_gallery.core.reward import BasePointWiseReward
class CustomReward(BasePointWiseReward):
def _evaluate(self, sample, **kwargs):
# Your evaluation logic
return RewardResult(...)
Learn more about building custom RMs
Benchmarking
Evaluate models on standard benchmarks:
- RewardBench2 - Latest reward model benchmark
- RM-Bench - Comprehensive evaluation suite
- Conflict Detector - Detect evaluation inconsistencies
- JudgeBench - Judge capability assessment
Applications
- Best-of-N Selection - Choose optimal responses from candidates
- Data Refinement - Improve dataset quality with reward signals
- RLHF Integration - Use rewards in reinforcement learning pipelines
- High-Performance Serving - Deploy models with fault-tolerant infrastructure
Documentation
Contributing
We welcome contributions! Please install pre-commit hooks before submitting pull requests:
pip install -e .
pre-commit install
See our contribution guide for details.
Citation
If you use RM-Gallery in your research, please cite:
@software{
title = {RM-Gallery: A One-Stop Reward Model Platform},
author = {The RM-Gallery Team},
url = {https://github.com/modelscope/RM-Gallery},
month = {07},
year = {2025}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file rm_gallery-0.1.7.tar.gz.
File metadata
- Download URL: rm_gallery-0.1.7.tar.gz
- Upload date:
- Size: 180.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
105fe3cac94e058497f6eb8bbecfbce08bcaeefafbca8839ec4217ab587c60fe
|
|
| MD5 |
16e66aa184b7077620d598af032262a5
|
|
| BLAKE2b-256 |
e71b15db06b30535e65199f3a522315f8190209f45e797af2364a663de27147b
|
File details
Details for the file rm_gallery-0.1.7-py3-none-any.whl.
File metadata
- Download URL: rm_gallery-0.1.7-py3-none-any.whl
- Upload date:
- Size: 250.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6098749f58d035106ccc3b8a07c9267f777d7b94fa377a8b4a4da6c6949e1ecb
|
|
| MD5 |
aba4f1f117d14c573e738fd2e8726f78
|
|
| BLAKE2b-256 |
fbe9917b8b435229449c5ebcaa40c61194e0f5850e704d40ed60740682cb7b87
|