Last released Jun 22, 2026
A reproducible caption-evaluation toolkit for VLMs with per-metric uv environments.
Last released Mar 18, 2024
[CVPR24] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
Last released Mar 1, 2023
Evaluation code for machine-generated image captions in Japanese.