Co-occurring keywords
Papers
FineReason: Evaluating and Improving LLMs’ Deliberate Reasoning through Reflective Puzzle Solving
ACL 2025
Abdelhak at SemEval-2024 Task 9: Decoding Brainteasers, The Efficacy of Dedicated Models Versus ChatGPT
SEMEVAL 2024
LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles
COLING 2024
Re-assembling the past: The RePAIR dataset and benchmark for real world 2D and 3D puzzle solving
NIPS 2024
Automated Crossword Solving
ACL 2022