A Diagnostic Framework for Auditing Reference-Free Vision-Language Metrics
A Diagnostic Framework for Auditing Reference-Free Vision-Language Metrics
Abstract. This paper introduces a diagnostic framework for auditing reference-free metrics used in vision-language evaluation. It focuses on understanding when these metrics are reliable and where they fail across different multimodal evaluation settings.