Co-occurring keywords
Papers
How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark
EMNLP 2025
MR. Judge: Multimodal Reasoner as a Judge
EMNLP 2025
ClimateViz: A Benchmark for Statistical Reasoning and Fact Verification on Scientific Charts
EMNLP 2025