
A high-fidelity reward model for instruction-based image editing.


News | Quick Start | Benchmark Usage | Citation

EditScore is a series of state-of-the-art open-source reward models (7B–72B) designed to evaluate and enhance instruction-guided image editing.

✨ Highlights

  • State-of-the-Art Performance: Effectively matches the performance of leading proprietary VLMs. With a self-ensembling strategy, our largest model surpasses even GPT-5 on our comprehensive benchmark, EditReward-Bench.
  • A Reliable Evaluation Standard: We introduce EditReward-Bench, the first public benchmark specifically designed for evaluating reward models in image editing, featuring 13 subtasks, 11 state-of-the-art editing models (including proprietary models) and expert human annotations.
  • Simple and Easy-to-Use: Get an accurate quality score for your image edits with just a few lines of code.
  • Versatile Applications: Ready to use as a best-in-class reranker to improve editing outputs, or as a high-fidelity reward signal for stable and effective Reinforcement Learning (RL) fine-tuning.

🔥 News

  • 2025-10-27: Released OmniGen2-EditScore7B-v1.1, which achieves a 7.01 GEdit score within 700 steps by incorporating the reweighting strategy from TempFlow. We have also improved the JSON-fixing method using this great library, so EditScore is now more stable across various conditions. Update with pip install -U editscore.

  • 2025-10-22: Introducing Our Reinforcement Learning Training Framework! We're excited to release our complete RL pipeline, the result of a massive effort to simplify fine-tuning for image editing models. Key features include:

    • Ready-to-Use RL Dataset: Includes the complete dataset used in the EditScore project, along with clear usage guidelines and preparation scripts.
    • An Easy-to-Use Reward Model: Seamlessly integrate EditScore as a reward signal.
    • A Scalable Reward Server: Built with native multi-node support for high-throughput training.
    • Flexible Training Code: Supports distributed training, variable image resolutions and mixed tasks (t2i, edit, in-context generation) out-of-the-box. Dive into our comprehensive guide on RL Fine-Tuning to get started.
  • 2025-10-16: Training datasets EditScore-Reward-Data and EditScore-RL-Data are available.

  • 2025-10-15: EditScore is now available on PyPI — install it easily with pip install editscore.

  • 2025-10-15: Best-of-N inference scripts for OmniGen2, Flux-dev-Kontext, and Qwen-Image-Edit are now available! See this for details.

  • 2025-09-30: We release OmniGen2-EditScore7B, unlocking online RL for image editing via high-fidelity EditScore. LoRA weights are available on Hugging Face and ModelScope.

  • 2025-09-30: We are excited to release EditScore and EditReward-Bench! Model weights and the benchmark dataset are now publicly available. You can access them on Hugging Face: Models Collection and Benchmark Dataset, and on ModelScope: Models Collection and Benchmark Dataset.

📖 Introduction

While Reinforcement Learning (RL) holds immense potential for instruction-guided image editing, its progress has been severely hindered by the absence of a high-fidelity, efficient reward signal.

To overcome this barrier, we provide a systematic, two-part solution:

  • A Rigorous Evaluation Standard: We first introduce EditReward-Bench, a new public benchmark for the direct and reliable evaluation of reward models. It features 13 diverse subtasks and expert human annotations, establishing a gold standard for measuring reward signal quality.

  • A Powerful & Versatile Tool: Guided by our benchmark, we developed the EditScore model series. Through meticulous data curation and an effective self-ensembling strategy, EditScore sets a new state of the art for open-source reward models, even surpassing the accuracy of leading proprietary VLMs.


Benchmark results on EditReward-Bench.

We demonstrate the practical utility of EditScore through two key applications:

  • As a State-of-the-Art Reranker: Use EditScore to perform Best-of-N selection and instantly improve the output quality of diverse editing models.
  • As a High-Fidelity Reward for RL: Use EditScore as a robust reward signal to fine-tune models via RL, enabling stable training and unlocking significant performance gains where general-purpose VLMs fail.
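Best-of-N reranking reduces to scoring every candidate and keeping the argmax. The sketch below factors the selection logic out of the model call so it can be tried without loading weights; `score_fn` stands in for a call such as `scorer.evaluate(...)["final_score"]` from the Quick Start, and the candidate edits are assumed to come from your editing model.

```python
from typing import Callable, List, Tuple


def best_of_n(candidates: List, score_fn: Callable) -> Tuple[int, float]:
    """Score each candidate edit and return (index, score) of the best one.

    `score_fn` is any callable mapping a candidate to a scalar score, e.g.
    lambda img: scorer.evaluate([input_image, img], instruction)["final_score"]
    with the EditScore scorer from the Quick Start section.
    """
    scores = [score_fn(c) for c in candidates]
    best_idx = max(range(len(scores)), key=scores.__getitem__)
    return best_idx, scores[best_idx]


# Toy usage with a dummy scoring function in place of the reward model:
candidates = ["edit_a", "edit_b", "edit_c"]
dummy_scores = {"edit_a": 12.0, "edit_b": 19.5, "edit_c": 7.25}
idx, score = best_of_n(candidates, lambda c: dummy_scores[c])
print(idx, score)  # 1 19.5
```

Because the scoring calls are independent, they can be batched or parallelized across candidates when N is large.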

This repository releases both the EditScore models and the EditReward-Bench dataset to facilitate future research in reward modeling, policy optimization, and AI-driven model improvement.


EditScore as a superior reward signal for image editing.

📌 TODO

We are actively working on improving EditScore and expanding its capabilities. Here's what's next:

  • Release training data for reward model and online RL.
  • Release RL training code applying EditScore to OmniGen2.
  • Provide Best-of-N inference scripts for OmniGen2, Flux-dev-Kontext, and Qwen-Image-Edit.

🚀 Quick Start

🛠️ Environment Setup

We offer two ways to install EditScore. Choose the one that best fits your needs:

  • Method 1: Install from PyPI (Recommended for Users): choose this if you want to use EditScore as a library in your own project.
  • Method 2: Install from Source (For Developers): choose this if you plan to contribute to the code, modify it, or run the examples in this repository.

Prerequisites: Installing PyTorch

Both installation methods require installing PyTorch first, as its build depends on your system's CUDA setup.

# (Optional) Create a clean Python environment
conda create -n editscore python=3.12
conda activate editscore

# Choose the command that matches your CUDA version.
# This example is for CUDA 12.6.
pip install torch==2.7.1 torchvision --extra-index-url https://download.pytorch.org/whl/cu126
🌏 For users in Mainland China

# Install PyTorch from a domestic mirror
pip install torch==2.7.1 torchvision --index-url https://mirror.sjtu.edu.cn/pytorch-wheels/cu126

Method 1: Install from PyPI (Recommended for Users)

pip install -U editscore

Method 2: Install from Source (For Developers)

This method gives you a local, editable version of the project.

  1. Clone the repository:
git clone https://github.com/VectorSpaceLab/EditScore.git
cd EditScore
  2. Install EditScore in editable mode:
pip install -e .

✅ (Recommended) Install Optional High-Performance Dependencies

For the best performance, especially during inference, we highly recommend installing vllm.

pip install -U vllm

🧪 Usage Example

Using EditScore is straightforward. The model will be automatically downloaded from the Hugging Face Hub on its first run.

from PIL import Image
from editscore import EditScore

# Load the EditScore model. It will be downloaded automatically.
# Replace with the specific model version you want to use.
model_path = "Qwen/Qwen2.5-VL-7B-Instruct"
lora_path = "EditScore/EditScore-7B"

scorer = EditScore(
    backbone="qwen25vl", # set to "qwen25vl_vllm" for faster inference
    model_name_or_path=model_path,
    enable_lora=True,
    lora_path=lora_path,
    score_range=25,
    num_pass=1, # Increase for better performance via self-ensembling
)

input_image = Image.open("example_images/input.png")
output_image = Image.open("example_images/output.png")
instruction = "Adjust the background to a glass wall."

result = scorer.evaluate([input_image, output_image], instruction)
print(f"Edit Score: {result['final_score']}")
# `result` is a dictionary containing the final score and other details.

📊 Benchmark Your Image-Editing Reward Model

Install benchmark dependencies

To run the benchmark example code, install the dependencies:

pip install -r requirements.txt

We provide an evaluation script to benchmark reward models on EditReward-Bench. To evaluate your own custom reward model, simply create a scorer class with a similar interface and update the script.
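A custom reward model only needs to match EditScore's call shape: an `evaluate(images, instruction)` method returning a dict with a `final_score` key. The skeleton below is illustrative; the class name and constructor argument are assumptions, and the placeholder score must be replaced with a real forward pass.

```python
from typing import List


class MyScorer:
    """Skeleton of a custom reward model with an EditScore-like interface."""

    def __init__(self, model_name_or_path: str):
        # Load your model here; a stored path keeps the sketch runnable.
        self.model_name_or_path = model_name_or_path

    def evaluate(self, images: List, instruction: str) -> dict:
        """Score an (input image, edited image) pair for the instruction.

        Replace the placeholder below with your model's prediction.
        """
        score = 0.0  # placeholder: plug in a real forward pass
        return {"final_score": score}


scorer = MyScorer("my-org/my-reward-model")
result = scorer.evaluate(["input.png", "output.png"], "Adjust the background.")
print(result["final_score"])  # 0.0
```

Once your class returns that dict, pointing the evaluation script at it should require no other changes.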

# This script will evaluate the default EditScore model on the benchmark
bash evaluate.sh

# Or speed up inference with VLLM
bash evaluate_vllm.sh

Apply EditScore to Image Editing

We offer two example use cases for your exploration:

  • Best-of-N selection: Use EditScore to automatically pick the most preferred image among multiple candidates.
  • Reinforcement fine-tuning: Use EditScore as a reward model to guide RL-based optimization.
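For the RL use case, the scalar from the reward model is typically converted into a per-prompt advantage before the policy update. The sketch below shows one common recipe, group mean/std normalization (GRPO-style); it is an illustrative baseline, not necessarily the exact normalization used in this repository's training code.

```python
import math
from typing import List


def group_normalize(rewards: List[float], eps: float = 1e-6) -> List[float]:
    """Normalize rewards within a group of N samples for the same prompt:
    advantage_i = (r_i - mean) / (std + eps). A zero-variance group yields
    all-zero advantages, leaving the policy gradient untouched.
    """
    mean = sum(rewards) / len(rewards)
    var = sum((r - mean) ** 2 for r in rewards) / len(rewards)
    std = math.sqrt(var)
    return [(r - mean) / (std + eps) for r in rewards]


# Reward-model scores for 4 candidate edits of one prompt:
advantages = group_normalize([18.0, 22.0, 20.0, 20.0])
print([round(a, 2) for a in advantages])  # [-1.41, 1.41, 0.0, 0.0]
```

Normalizing within each group makes training robust to the absolute scale of the reward model's scores, which matters when `score_range` or `num_pass` changes the raw score distribution.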

For detailed instructions and examples, please refer to the documentation.

❤️ Citing Us

If you find this repository or our work useful, please consider giving a star ⭐ and citation 🦖, which would be greatly appreciated:

@article{luo2025editscore,
  title={EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling},
  author={Xin Luo and Jiahao Wang and Chenyuan Wu and Shitao Xiao and Xiyan Jiang and Defu Lian and Jiajun Zhang and Dong Liu and Zheng Liu},
  journal={arXiv preprint arXiv:2509.23909},
  year={2025}
}
