Jiayu Liu
14 papers · 2022–2026 · 7 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (11)
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (7)
Renaissance Researcher (7)
Prolific Year (5)
Keyword Collector (66)
The Questioner
Century Club (11)
Conferences
ACL (4)
ICML (3)
AAAI (2)
NIPS (2)
EMNLP (1)
IJCAI (1)
JMLR (1)
Research topics
Keywords
large language model (6)
multi-agent system (2)
question answering (2)
natural language inference (2)
domain adaptation (2)
distribution shift (2)
out-of-distribution generalization (1)
mathematical reasoning (1)
text classification (1)
dialogue generation (1)
information retrieval (1)
geometry problem solving (1)
magnetic resonance imaging (1)
formal verification (1)
symbolic reasoning (1)
regularized risk minimization (1)
question decomposition (1)
risk minimization (1)
covariate shift (1)
claim verification (1)
Papers
DIXITWORLD: Evaluating Multimodal Abductive Reasoning in Vision-Language Models with Multi-Agent Dixit Gameplay
ACL 2026
CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents
ACL 2026
Mathematical Proof as a Litmus Test: Revealing Failure Modes of Advanced Large Reasoning Models
ACL 2026
CogMath: Assessing LLMs’ Authentic Mathematical Ability from a Human Cognitive Perspective
ICML 2025
Automated Creation of Reusable and Diverse Toolsets for Enhancing LLM Reasoning
AAAI 2025
Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models’ Uncertainty?
ACL 2025
What Makes In-context Learning Effective for Mathematical Reasoning
ICML 2025
Decompose, Analyze and Rethink: Solving Intricate Problems with Human-like Reasoning Cycle
NIPS 2024
SocraticLM: Exploring Socratic Personalized Teaching with Large Language Models
NIPS 2024
Learning to Solve Geometry Problems via Simulating Human Dual-Reasoning Process
IJCAI 2024
Monotonic Risk Relationships under Distribution Shifts for Regularized Risk Minimization
JMLR 2024
GProofT: A Multi-dimension Multi-round Fact Checking Framework Based on Claim Fact Extraction
EMNLP 2024
Learning by Applying: A General Framework for Mathematical Reasoning via Enhancing Explicit Knowledge Learning
AAAI 2023
Test-Time Training Can Close the Natural Distribution Shift Performance Gap in Deep Learning Based Compressed Sensing
ICML 2022