Shijie Xia
4 papers · 2024–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
🌍 Conference Polyglot (3) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (13)
👥
Mega-Team
(28)
Conferences
AAAI (1)
ACL (1)
EMNLP (1)
NIPS (1)
Top co-authors
Keywords
large language model
(3)
text classification
(1)
mathematical reasoning
(1)
preference learning
(1)
autonomous agent
(1)
tool use
(1)
evaluation benchmark
(1)
safety evaluation
(1)
reasoning evaluation
(1)
multimodal model
(1)
scientific discovery
(1)
content generation
(1)
step-by-step reasoning
(1)
cognitive reasoning
(1)
user simulation
(1)
logical error
(1)
benchmark evaluation
(1)
reasoning quality
(1)
Papers
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts
ACL 2026
Evaluating Mathematical Reasoning Beyond Accuracy
AAAI 2025
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
NIPS 2024
SAFETY-J: Evaluating Safety with Critique
EMNLP 2024