Co-occurring keywords
Papers
SEEval: Advancing LLM Text Evaluation Efficiency and Accuracy through Self-Explanation Prompting
NAACL 2025
DHP Benchmark: Are LLMs Good NLG Evaluators?
NAACL 2025
EduCSW: Building a Mandarin-English Code-Switched Generation Pipeline for Computer Science Learning
ACL 2025
ProofTeller: Exposing recency bias in LLM reasoning and its side effects on communication
IJCNLP 2025