Co-occurring keywords
Papers
TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
ACL 2024
Narrative-of-Thought: Improving Temporal Reasoning of Large Language Models via Recounted Narratives
EMNLP 2024
Exploring Question Guidance and Answer Calibration for Visually Grounded Video Question Answering
EMNLP 2024
TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models’ Theory-of-Mind
ACL 2024
CaT-Bench: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans
EMNLP 2024