Large Language Models
6405 directly classified papers
Papers per year
Papers
Towards Long Context Hallucination Detection
NAACL 2025
DHP Benchmark: Are LLMs Good NLG Evaluators?
NAACL 2025