A Diagnostic Framework for Auditing Reference-Free Vision-Language Metrics

A Diagnostic Framework for Auditing Reference-Free Vision-Language Metrics

Arnav Charles, Srikant Panda, Amit Agarwal, Hitesh Laxmichand Patel, Priyaranjan Pattnayak, Bhavya Kumar, Tushar Kumar. IJCNLP (2025).

Abstract. This paper introduces a diagnostic framework for auditing reference-free metrics used in vision-language evaluation. It focuses on understanding when these metrics are reliable and where they fail across different multimodal evaluation settings.

Updated: