Co-occurring keywords
Papers
Evaluating the Factuality of Large Language Models Using Multiple Plug-and-Play Fact Sources
AAAI 2026
When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation
EACL 2026
VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts
EMNLP 2025
A Tale of Evaluating Factual Consistency: Case Study on Long Document Summarization Evaluation
ACL 2025