Papers
True Detective: A Deep Abductive Reasoning Benchmark Undoable for GPT-3 and Challenging for GPT-4
ACL 2023
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU
EMNLP 2023
Toward Stronger Textual Attack Detectors
EMNLP 2023