Papers
Auto-PRE: An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation
AAAI 2026
Luna: A Lightweight Evaluation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
COLING 2025
Semantic Inversion, Identical Replies: Revisiting Negation Blindness in Large Language Models
EMNLP 2025
LLMs Do Not See Age: Assessing Demographic Bias in Automated Systematic Review Synthesis
IJCNLP 2025