Co-occurring keywords
Papers
Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search
ACL 2025
DHP Benchmark: Are LLMs Good NLG Evaluators?
NAACL 2025
Evaluating and Enhancing Large Language Models for Novelty Assessment in Scholarly Publications
NAACL 2025
Are Small Language Models Ready to Compete with Large Language Models for Practical Applications?
NAACL 2025