MM-JudgeBias: A Benchmark for Evaluating Compositional Biases in MLLM-as-a-Judge
Sua Lee
1*
,
Sanghee Park
2,3
,
Jinbae Im
2,3*†
1
Seoul National University,
2
NAVER Cloud AI
3
KAIST AI
* Equal Contribution † Corresponding Author
Paper
arXiv
Code
Dataset
The webpage will be released soon!